Update experiment_plan.md

This commit is contained in:
Dominik Moritz Roth 2025-10-08 21:07:06 +02:00
parent 4eaec644ec
commit eff135a860

View File

@ -1,7 +1,7 @@
# FastTD3 HoReKa Experiment Plan # FastTD3 HoReKa Experiment Plan
*Added by Dominik - Paper Replication Study* *Added by Dominik - Paper Replication Study*
## Proof of Concept Results ## Proof of Concept Results
**Initial Success**: [HoReKa Dev Run](https://wandb.ai/rl-network-scaling/FastTD3_HoReKa_Dev?nw=nwuserdominik_roth) **Initial Success**: [HoReKa Dev Run](https://wandb.ai/rl-network-scaling/FastTD3_HoReKa_Dev?nw=nwuserdominik_roth)
- **Task**: T1JoystickFlatTerrain - **Task**: T1JoystickFlatTerrain
@ -10,7 +10,7 @@
- **Key Achievement**: Fixed JAX/PyTorch dtype mismatch issue (removed JAX_ENABLE_X64) - **Key Achievement**: Fixed JAX/PyTorch dtype mismatch issue (removed JAX_ENABLE_X64)
- **Status**: ✅ Environment working, ready for full-scale experiments - **Status**: ✅ Environment working, ready for full-scale experiments
## 🚧 Currently Running Jobs ## Currently Running Jobs
### Phase 1: MuJoCo Playground - RESUBMITTED TO H100 ✅ ### Phase 1: MuJoCo Playground - RESUBMITTED TO H100 ✅
**NEW SLURM Job IDs**: 3371681-3371692 (12 jobs total) - Using accelerated-h100 partition (94GB GPU RAM) **NEW SLURM Job IDs**: 3371681-3371692 (12 jobs total) - Using accelerated-h100 partition (94GB GPU RAM)
@ -22,7 +22,7 @@
- **Monitor**: `python monitor_experiments.py experiment_tracking_1753312228.yaml --watch` - **Monitor**: `python monitor_experiments.py experiment_tracking_1753312228.yaml --watch`
- **Note**: Previous jobs (3367710-3367723) crashed due to insufficient GPU RAM on standard partition - **Note**: Previous jobs (3367710-3367723) crashed due to insufficient GPU RAM on standard partition
## 📋 TODO List ## TODO List
### Phase 1: MuJoCo Playground ### Phase 1: MuJoCo Playground
- [x] Set up MuJoCo Playground environment - [x] Set up MuJoCo Playground environment
@ -52,7 +52,7 @@
- [ ] Document findings and performance - [ ] Document findings and performance
- [ ] Create final report - [ ] Create final report
## 📊 Experiment Details ## Experiment Details
### Phase 1: MuJoCo Playground (Figure 11 from paper) ### Phase 1: MuJoCo Playground (Figure 11 from paper)
- `T1JoystickFlatTerrain`, `T1JoystickRoughTerrain`, `G1JoystickFlatTerrain`, `G1JoystickRoughTerrain` - `T1JoystickFlatTerrain`, `T1JoystickRoughTerrain`, `G1JoystickFlatTerrain`, `G1JoystickRoughTerrain`