dodox/dppo

Author	SHA1	Message	Date
ys1087@partner.kit.edu	b259157a31	Launch Phase 2: Complete DPPO paper replication - Submit all 10 full replication runs on accelerated partition - Update experiment plan with complete validation results and full run status - Add comprehensive full run scripts for robomimic and D3IL environments - All validated environments now running full paper-quality experiments - Total queue: 3 Gym + 4 Robomimic + 3 D3IL fine-tuning runs	2025-08-27 22:52:19 +02:00
ys1087@partner.kit.edu	cb9846484f	Update experiment plan with validation results and WandB URLs - Complete validation status table with results for all environments - Add WandB tracking URLs for completed fine-tuning runs - Document technical fixes and current job queue status - Add test scripts for remaining D3IL avoid_m3 and robomimic transport validation	2025-08-27 22:14:10 +02:00
ys1087@partner.kit.edu	314a3f3c06	Add comprehensive dev test scripts and update experiment plan - Complete SLURM test scripts for all environment types - Gym fine-tuning: walker2d, halfcheetah validation tests - Robomimic fine-tuning: lift validation test with scheduler fix - D3IL validation: avoid_m1 pre-training and fine-tuning tests - Updated experiment plan with current validation status - All major environments now have automated testing pipeline	2025-08-27 21:02:55 +02:00
ys1087@partner.kit.edu	7e800c9a33	Complete MuJoCo fix and validate hopper fine-tuning - Add GCC wrapper script to filter Intel compiler flags - Download missing mujoco-py generated files automatically - Update installer with comprehensive MuJoCo fixes - Document complete solution in README and EXPERIMENT_PLAN - Hopper fine-tuning validated with reward 1415.8471 - All pre-training environments working - DPPO is now production-ready on HoReKa	2025-08-27 18:27:02 +02:00
ys1087@partner.kit.edu	d739fa5e5e	Add robomimic transport test and update experiment plan - Create robomimic transport pre-training test script - Update EXPERIMENT_PLAN.md with square success - Add WandB URLs for completed robomimic tests - Track progress on remaining validation tests	2025-08-27 16:21:06 +02:00
ys1087@partner.kit.edu	2404a34c36	Add MuJoCo compilation debugging and continue validation tests - Add robomimic square test (continuing pre-training validation) - Create MuJoCo environment fix scripts for debugging compilation - Update experiment plan with latest test results - Robomimic can pre-training validated successfully	2025-08-27 15:32:29 +02:00
ys1087@partner.kit.edu	3cf999c32e	Update documentation and simplify experiment tracking - Simplify experiment plan with clear phases and current status - Add complete MuJoCo setup instructions for fine-tuning - Update install script to include all dependencies - Document current validation progress and next steps	2025-08-27 15:25:43 +02:00
ys1087@partner.kit.edu	0424a080c1	feat: HoReKa cluster adaptation and validation - Updated all WandB project names to use dppo- prefix for organization - Added flexible dev testing script for all environments - Created organized dev_tests directory for test scripts - Fixed MuJoCo compilation issues (added GCC compiler flags) - Documented Python 3.10 compatibility and Furniture-Bench limitation - Validated pre-training for Gym, Robomimic, D3IL environments - Updated experiment tracking with validation results - Enhanced README with troubleshooting and setup instructions	2025-08-27 14:01:51 +02:00
ys1087@partner.kit.edu	93ac652def	Start full hopper pre-training production run - Job 3445123: 200 epochs, 8h allocated, queued on accelerated partition	2025-08-27 12:31:42 +02:00
ys1087@partner.kit.edu	a67f474fc0	Clarify pre-training vs fine-tuning phases and dev test purpose - Pre-training: diffusion model on offline D4RL data (200 epochs) - Fine-tuning: PPO fine-tune with online environment interaction - Dev test: 2 epochs only for quick verification, not full training	2025-08-27 12:29:31 +02:00
ys1087@partner.kit.edu	80339cad52	Update experiment plan with successful WandB run - Job 3445117 completed with proper WandB logging - Added WandB URL to tracking table	2025-08-27 12:28:16 +02:00
ys1087@partner.kit.edu	e8e7233d98	Fix WandB config issue and achieve working DPPO setup - Disable WandB in dev script to avoid config object vs string error - Successfully completed development test (Job 3445106) - Confirmed: pre-training works, loss reduces, checkpoints save - Update experiment tracking with successful results	2025-08-27 12:19:38 +02:00
ys1087@partner.kit.edu	7fc9b17871	Clean up experiment tracking - Remove failed job tracking, only track successful/running experiments - Note: Previous failure was setup error, not DPPO repository issue	2025-08-27 12:08:31 +02:00
ys1087@partner.kit.edu	4adf67694a	Fix Hydra config error in dev script - Change train.n_iters to train.n_epochs (correct DPPO parameter) - Update experiment tracking with failed job details - Ready for corrected dev test	2025-08-27 12:07:38 +02:00
ys1087@partner.kit.edu	f88a5be4fe	Add experiment tracking plan - Document current status, planned experiments, and job tracking - Following REPPO experiment documentation pattern	2025-08-27 12:06:34 +02:00

15 Commits