dppo/agent/finetune
2025-02-04 11:51:47 -05:00
train_agent.py                      add separate eval model class that also initializes the pre-trained policy for early denoising steps  2025-02-04 11:51:47 -05:00
train_awr_diffusion_agent.py        v0.5 to main (#10)  2024-10-07 16:35:13 -04:00
train_calql_agent.py                v0.6 (#18)          2024-10-30 19:58:06 -04:00
train_dipo_diffusion_agent.py       v0.5 to main (#10)  2024-10-07 16:35:13 -04:00
train_dql_diffusion_agent.py        v0.5 to main (#10)  2024-10-07 16:35:13 -04:00
train_ibrl_agent.py                 v0.6 (#18)          2024-10-30 19:58:06 -04:00
train_idql_diffusion_agent.py       v0.5 to main (#10)  2024-10-07 16:35:13 -04:00
train_ppo_agent.py                  release             2024-09-03 21:03:27 -04:00
train_ppo_diffusion_agent.py        v0.6 (#18)          2024-10-30 19:58:06 -04:00
train_ppo_diffusion_img_agent.py    v0.6 (#18)          2024-10-30 19:58:06 -04:00
train_ppo_exact_diffusion_agent.py  v0.6 (#18)          2024-10-30 19:58:06 -04:00
train_ppo_gaussian_agent.py         v0.6 (#18)          2024-10-30 19:58:06 -04:00
train_ppo_gaussian_img_agent.py     v0.6 (#18)          2024-10-30 19:58:06 -04:00
train_qsm_diffusion_agent.py        v0.5 to main (#10)  2024-10-07 16:35:13 -04:00
train_rlpd_agent.py                 v0.5 to main (#10)  2024-10-07 16:35:13 -04:00
train_rwr_diffusion_agent.py        v0.5 to main (#10)  2024-10-07 16:35:13 -04:00
train_sac_agent.py                  v0.5 to main (#10)  2024-10-07 16:35:13 -04:00