Commit Graph

4 Commits

Author SHA1 Message Date
Younggyo Seo
799624b202
Bug fix -- MTBench evaluation and missing code (#18)
This PR includes the following changes:
- Fix a bug in MTBench evaluation
- Add a missing `critic_cls` in `train.py` (resolves https://github.com/younggyoseo/FastTD3/issues/17)
- Update hyperparameters for MTBench
2025-06-25 09:21:04 -07:00
Younggyo Seo
cef44108d8
Support MTBench (#15)
This PR incorporates MTBench into the codebase as a demonstration of how to use FastTD3 in a multi-task setup.

- Add support for MTBench along with its wrapper
- Add support for a per-task reward normalizer, useful for multi-task RL and motivated by the BRC paper (https://arxiv.org/abs/2505.23150v1)
2025-06-20 21:52:43 -07:00
Younggyo Seo
c156ba93fb
black formatting and update tuned_reward for T1
2025-05-29 08:29:44 +00:00
Younggyo Seo
258bfe67dd
Initial Public Release
2025-05-29 01:49:23 +00:00