Commit Graph

3 Commits

Author SHA1 Message Date
ys1087@partner.kit.edu
e7e3ae48f1 Add FastTD3 HoReKa experiment management system
- Fixed JAX/PyTorch dtype mismatch for successful training
- Added experiment plan with paper-accurate hyperparameters
- Created batch submission and monitoring scripts
- Cleaned up log files and updated gitignore
- Ready for systematic paper replication
2025-07-22 17:08:03 +02:00
ys1087@partner.kit.edu
15750f56b2 Fix JAX compatibility and CUDA module issues for HoReKa
- Update SLURM scripts to use correct CUDA modules (devel/cuda/12.4, intel compiler)
- Add JAX downgrade to 0.4.35 for CuDNN 9.5.1 compatibility
- Fix JAX_PLATFORMS environment variable (cuda vs gpu,cpu)
- Update README with cluster-specific JAX installation steps
- Tested successfully: Both PyTorch and JAX working on GPU with full training
2025-07-22 16:36:06 +02:00
ys1087@partner.kit.edu
336c96bb7b Add HoReKa cluster support with SLURM and wandb integration
- Add complete HoReKa installation guide without conda dependency
- Include SLURM job script with GPU configuration and account setup
- Add helper scripts for job submission and environment testing
- Integrate wandb logging with both online and offline modes
- Support MuJoCo Playground environments for humanoid control
- Update README with clear separation of added vs original content
2025-07-22 16:15:30 +02:00