This PR incorporates MTBench into the current codebase, as a good demonstration that shows how to use FastTD3 for multi-task setup. - Add support for MTBench along with its wrapper - Add support for per-task reward normalizer useful for multi-task RL, motivated by BRC paper (https://arxiv.org/abs/2505.23150v1) |
||
---|---|---|
.. | ||
humanoid_bench_env.py | ||
isaaclab_env.py | ||
mtbench_env.py | ||
mujoco_playground_env.py |