Update experiment plan to TODO format with running jobs

- Added currently running SLURM job IDs (3367710-3367723)
- Converted to TODO list format with checkboxes
- Added reminders to install IsaacLab and HumanoidBench before Phase 2/3
- Phase 1 (MuJoCo) batch submitted and pending in queue
This commit is contained in:
ys1087@partner.kit.edu 2025-07-22 17:12:05 +02:00
parent e7e3ae48f1
commit e95c2c4e11

View File

@ -10,58 +10,74 @@
- **Key Achievement**: Fixed JAX/PyTorch dtype mismatch issue (removed JAX_ENABLE_X64)
- **Status**: ✅ Environment working, ready for full-scale experiments
## Experiments to Replicate
## 🚧 Currently Running Jobs
### Phase 1: MuJoCo Playground - SUBMITTED ✅
**SLURM Job IDs**: 3367710-3367723 (12 jobs total)
- ⏳ T1JoystickFlatTerrain (seeds 1,2,3) - Jobs: 3367710, 3367711, 3367712
- ⏳ T1JoystickRoughTerrain (seeds 1,2,3) - Jobs: 3367713, 3367716, 3367717
- ⏳ G1JoystickFlatTerrain (seeds 1,2,3) - Jobs: 3367718, 3367719, 3367720
- ⏳ G1JoystickRoughTerrain (seeds 1,2,3) - Jobs: 3367721, 3367722, 3367723
- **Status**: All jobs pending in queue
- **Monitor**: `python monitor_experiments.py logs/experiment_tracking_1753196960.yaml --watch`
## 📋 TODO List
### Phase 1: MuJoCo Playground
- [x] Set up MuJoCo Playground environment
- [x] Test 5000-step run successfully
- [x] Submit full batch (4 tasks × 3 seeds)
- [ ] Wait for jobs to complete (~1 hour each)
- [ ] Verify results match paper Figure 11
- [ ] Download wandb data for analysis
### Phase 2: IsaacLab
- [ ] **INSTALL ISAACLAB ENVIRONMENT FIRST**
- [ ] Test single IsaacLab task
- [ ] Submit batch: `python submit_experiment_batch.py --phase 2 --seeds 3`
- [ ] Monitor 6 tasks × 3 seeds (18 jobs total)
- [ ] Verify results match paper Figure 10
### Phase 3: HumanoidBench
- [ ] **INSTALL HUMANOIDBENCH ENVIRONMENT FIRST**
- [ ] Test single HumanoidBench task
- [ ] Submit batch: `python submit_experiment_batch.py --phase 3 --seeds 3`
- [ ] Monitor 5 tasks × 3 seeds (15 jobs total)
- [ ] Verify results match paper Figure 9
### Analysis & Completion
- [ ] Collect all results from wandb
- [ ] Generate comparison plots vs paper
- [ ] Document findings and performance
- [ ] Create final report
## 📊 Experiment Details
### Phase 1: MuJoCo Playground (Figure 11 from paper)
- `T1JoystickFlatTerrain` (3600s)
- `T1JoystickRoughTerrain` (3600s)
- `G1JoystickFlatTerrain` (3600s)
- `G1JoystickRoughTerrain` (3600s)
- `T1JoystickFlatTerrain`, `T1JoystickRoughTerrain`, `G1JoystickFlatTerrain`, `G1JoystickRoughTerrain`
- **Duration**: 3600s each
- **Hyperparameters**: total_timesteps=500000, num_envs=2048, batch_size=32768, buffer_size=102400, eval_interval=25000
**Hyperparameters (from paper):**
- total_timesteps: 500000
- num_envs: 2048
- batch_size: 32768
- buffer_size: 102400 (50K per env)
- eval_interval: 25000
### Phase 2: IsaacLab (Figure 10 from paper)
- `Isaac-Velocity-Flat-G1-v0` (3600s)
- `Isaac-Velocity-Rough-G1-v0` (3600s)
- `Isaac-Repose-Cube-Allegro-Direct-v0` (3600s)
- `Isaac-Repose-Cube-Shadow-Direct-v0` (3600s)
- `Isaac-Velocity-Flat-H1-v0` (3600s)
- `Isaac-Velocity-Rough-H1-v0` (3600s)
**Hyperparameters:**
- total_timesteps: 1000000
- num_envs: 1024
- batch_size: 32768
- buffer_size: 51200
- eval_interval: 50000
### Phase 2: IsaacLab (Figure 10 from paper)
- `Isaac-Velocity-Flat-G1-v0`, `Isaac-Velocity-Rough-G1-v0`, `Isaac-Repose-Cube-Allegro-Direct-v0`, `Isaac-Repose-Cube-Shadow-Direct-v0`, `Isaac-Velocity-Flat-H1-v0`, `Isaac-Velocity-Rough-H1-v0`
- **Duration**: 3600s each
- **Hyperparameters**: total_timesteps=1000000, num_envs=1024, batch_size=32768, buffer_size=51200, eval_interval=50000
### Phase 3: HumanoidBench (Figure 9 from paper - subset)
- `h1hand-walk` (10800s)
- `h1hand-run` (10800s)
- `h1hand-hurdle` (10800s)
- `h1hand-stair` (10800s)
- `h1hand-slide` (10800s)
- `h1hand-walk`, `h1hand-run`, `h1hand-hurdle`, `h1hand-stair`, `h1hand-slide`
- **Duration**: 10800s each
- **Hyperparameters**: total_timesteps=2000000, num_envs=256, batch_size=16384, buffer_size=12800, eval_interval=100000
**Hyperparameters:**
- total_timesteps: 2000000
- num_envs: 256
- batch_size: 16384
- buffer_size: 12800
- eval_interval: 100000
## 🔧 Commands
## Usage
Submit Phase 1:
Monitor jobs:
```bash
python submit_experiment_batch.py --phase 1 --seeds 3
squeue -u $USER
python monitor_experiments.py logs/experiment_tracking_1753196960.yaml --watch
```
Monitor progress:
Submit next phases:
```bash
python monitor_experiments.py --watch
python submit_experiment_batch.py --phase 2 --seeds 3
python submit_experiment_batch.py --phase 3 --seeds 3
```