- Change train.n_iters to train.n_epochs (correct DPPO parameter) - Update experiment tracking with failed job details - Ready for corrected dev test |
||
|---|---|---|
| .. | ||
| run_dppo_dev.sh | ||
| run_dppo_gym.sh | ||
- Change train.n_iters to train.n_epochs (correct DPPO parameter) - Update experiment tracking with failed job details - Ready for corrected dev test |
||
|---|---|---|
| .. | ||
| run_dppo_dev.sh | ||
| run_dppo_gym.sh | ||