dppo/agent
2024-11-07 10:55:16 -05:00
dataset   v0.5 to main (#10)                                     2024-10-07 16:35:13 -04:00
eval      fix itr initialization in eval agents                  2024-10-10 14:10:16 -04:00
finetune  v0.6 (#18)                                             2024-10-30 19:58:06 -04:00
pretrain  use default epoch_start_ema=20 and update_ema_freq=10  2024-11-07 10:55:16 -05:00