FastTD3

dodox/FastTD3

Fork 0

Commit Graph

Author	SHA1	Message	Date
Younggyo Seo	799624b202	Bug fix -- MTBench evaluation and missing code (#18 ) This PR includes these changes: - Fixing a bug in MTBench evaluation - Add a missing `critic_cls` in `train.py` (resolving an issue https://github.com/younggyoseo/FastTD3/issues/17) - Updating hyperparameters for MTBench	2025-06-25 09:21:04 -07:00
Younggyo Seo	cef44108d8	Support MTBench (#15 ) This PR incorporates MTBench into the current codebase, as a good demonstration that shows how to use FastTD3 for multi-task setup. - Add support for MTBench along with its wrapper - Add support for per-task reward normalizer useful for multi-task RL, motivated by BRC paper (https://arxiv.org/abs/2505.23150v1)	2025-06-20 21:52:43 -07:00

Author

SHA1

Message

Date

Younggyo Seo

799624b202

Bug fix -- MTBench evaluation and missing code (#18 )

This PR includes these changes:
- Fixing a bug in MTBench evaluation
- Add a missing `critic_cls` in `train.py` (resolving an issue https://github.com/younggyoseo/FastTD3/issues/17)
- Updating hyperparameters for MTBench

2025-06-25 09:21:04 -07:00

Younggyo Seo

cef44108d8

Support MTBench (#15 )

This PR incorporates MTBench into the current codebase, as a good demonstration that shows how to use FastTD3 for multi-task setup.

- Add support for MTBench along with its wrapper
- Add support for per-task reward normalizer useful for multi-task RL, motivated by BRC paper (https://arxiv.org/abs/2505.23150v1)

2025-06-20 21:52:43 -07:00

2 Commits