reppo

History

ys1087@partner.kit.edu b240a19ceb Fix 6 critical bugs in REPPO repository preventing execution - Fix missing MUON optimizer by replacing with optax.adam - Fix Hydra configuration parameter paths (env.name instead of env_name) - Fix BraxGymnaxWrapper method signatures to accept params argument - Fix training loop division by zero with proper total_time_steps - Fix incorrect algorithm name in wandb (reppo instead of sac) - Fix JAX key batching error in BraxGymnaxWrapper reset method - Add comprehensive HoReKa SLURM integration with wandb logging - Update README with detailed bug documentation and fixes		2025-07-22 17:26:43 +02:00
..
env	Paper code basis	2025-07-15 19:05:07 -04:00
experiment_overrides	cleanup	2025-07-15 22:40:47 -07:00
platform	Paper code basis	2025-07-15 19:05:07 -04:00
trial_spec	Fixes build errors due to name conflicts	2025-07-21 18:31:20 -04:00
ppo.yaml	update config	2025-07-15 22:48:07 -07:00
reppo.yaml	Fix 6 critical bugs in REPPO repository preventing execution	2025-07-22 17:26:43 +02:00