dppo/slurm
ys1087@partner.kit.edu e8e7233d98 Fix WandB config issue and achieve working DPPO setup
- Disable WandB in dev script to avoid config object vs string error
- Successfully completed development test (Job 3445106)
- Confirmed: pre-training works, loss reduces, checkpoints save
- Update experiment tracking with successful results
2025-08-27 12:19:38 +02:00
run_dppo_dev.sh Fix WandB config issue and achieve working DPPO setup 2025-08-27 12:19:38 +02:00
run_dppo_gym.sh Add HoReKa cluster setup and SLURM scripts 2025-08-27 11:57:32 +02:00