Update experiment_plan.md
parent 4eaec644ec
commit eff135a860
@@ -1,7 +1,7 @@
 # FastTD3 HoReKa Experiment Plan
 *Added by Dominik - Paper Replication Study*
 
-## ✅ Proof of Concept Results
+## Proof of Concept Results
 **Initial Success**: [HoReKa Dev Run](https://wandb.ai/rl-network-scaling/FastTD3_HoReKa_Dev?nw=nwuserdominik_roth)
 
 - **Task**: T1JoystickFlatTerrain
@@ -10,7 +10,7 @@
 - **Key Achievement**: Fixed JAX/PyTorch dtype mismatch issue (removed JAX_ENABLE_X64)
 - **Status**: ✅ Environment working, ready for full-scale experiments
 
-## 🚧 Currently Running Jobs
+## Currently Running Jobs
 
 ### Phase 1: MuJoCo Playground - RESUBMITTED TO H100 ✅
 **NEW SLURM Job IDs**: 3371681-3371692 (12 jobs total) - Using accelerated-h100 partition (94GB GPU RAM)
@@ -22,7 +22,7 @@
 - **Monitor**: `python monitor_experiments.py experiment_tracking_1753312228.yaml --watch`
 - **Note**: Previous jobs (3367710-3367723) crashed due to insufficient GPU RAM on standard partition
 
-## 📋 TODO List
+## TODO List
 
 ### Phase 1: MuJoCo Playground
 - [x] Set up MuJoCo Playground environment
@@ -52,7 +52,7 @@
 - [ ] Document findings and performance
 - [ ] Create final report
 
-## 📊 Experiment Details
+## Experiment Details
 
 ### Phase 1: MuJoCo Playground (Figure 11 from paper)
 - `T1JoystickFlatTerrain`, `T1JoystickRoughTerrain`, `G1JoystickFlatTerrain`, `G1JoystickRoughTerrain`
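Note on the job ID range quoted in the plan: the resubmitted range 3371681-3371692 is stated as 12 jobs total, which holds if the range is read as inclusive on both ends. A minimal sketch of that check follows; the helper name `expand_job_range` is hypothetical and not part of the repository, and the inclusive reading is an assumption.

```python
def expand_job_range(spec: str) -> list[int]:
    """Expand an inclusive 'first-last' job ID range into individual IDs.

    Hypothetical helper for illustration; assumes both ends are included.
    """
    first_text, last_text = spec.split("-")
    first, last = int(first_text), int(last_text)
    if last < first:
        raise ValueError(f"range end {last} is before range start {first}")
    return list(range(first, last + 1))


# Range taken from the plan text; the count should match "12 jobs total".
new_jobs = expand_job_range("3371681-3371692")
print(len(new_jobs))
```

The same helper could be pointed at the earlier range (3367710-3367723) when cross-checking which submissions crashed.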