FastTD3/fast_td3
Younggyo Seo cef44108d8
Support MTBench (#15)
This PR integrates MTBench into the codebase to demonstrate how to use FastTD3 in a multi-task setup.

- Add support for MTBench along with its wrapper
- Add a per-task reward normalizer, useful for multi-task RL and motivated by the BRC paper (https://arxiv.org/abs/2505.23150v1)
2025-06-20 21:52:43 -07:00
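To illustrate the per-task reward normalizer mentioned in the commit message, here is a minimal sketch of the general idea: keep separate running reward statistics for each task so that tasks with very different reward scales contribute comparably to learning. The class name `PerTaskRewardNormalizer`, its methods, and the choice to scale by a running standard deviation of raw rewards are assumptions made for illustration; the implementation in this repository may differ (for example, it may normalize by return statistics instead).

```python
import math
from collections import defaultdict


class PerTaskRewardNormalizer:
    """Hypothetical sketch: one set of running reward statistics per task ID."""

    def __init__(self, epsilon: float = 1e-8):
        self.epsilon = epsilon            # avoids division by zero
        self.count = defaultdict(int)     # samples seen per task
        self.mean = defaultdict(float)    # running mean per task
        self.m2 = defaultdict(float)      # running sum of squared deviations per task

    def update(self, task_id: int, reward: float) -> None:
        # Welford's online update, applied independently to each task.
        self.count[task_id] += 1
        delta = reward - self.mean[task_id]
        self.mean[task_id] += delta / self.count[task_id]
        self.m2[task_id] += delta * (reward - self.mean[task_id])

    def std(self, task_id: int) -> float:
        n = self.count[task_id]
        if n < 2:
            return 1.0  # too few samples: leave rewards unscaled
        return math.sqrt(self.m2[task_id] / n)

    def normalize(self, task_id: int, reward: float) -> float:
        # Scale only (no mean subtraction), so the sign of the reward is preserved.
        return reward / (self.std(task_id) + self.epsilon)


# Two tasks whose rewards differ by a factor of 100.
normalizer = PerTaskRewardNormalizer()
for r in [1.0, 3.0]:
    normalizer.update(0, r)
for r in [100.0, 300.0]:
    normalizer.update(1, r)

scaled_task0 = normalizer.normalize(0, 3.0)
scaled_task1 = normalizer.normalize(1, 300.0)
```

After normalization, the two rewards land on roughly the same scale even though the raw values differ by two orders of magnitude. A single shared normalizer would instead let the high-reward task dominate the statistics.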
environments Support MTBench (#15) 2025-06-20 21:52:43 -07:00
__init__.py Initial Public Release 2025-05-29 01:49:23 +00:00
fast_td3_deploy.py Support FastTD3 + SimbaV2 (#13) 2025-06-15 12:49:59 -07:00
fast_td3_simbav2.py Support MTBench (#15) 2025-06-20 21:52:43 -07:00
fast_td3_utils.py Support MTBench (#15) 2025-06-20 21:52:43 -07:00
fast_td3.py Support MTBench (#15) 2025-06-20 21:52:43 -07:00
hyperparams.py Support MTBench (#15) 2025-06-20 21:52:43 -07:00
train.py Support MTBench (#15) 2025-06-20 21:52:43 -07:00
training_notebook.ipynb memory optimization for playground 2025-05-29 06:58:28 +00:00