- Modified the code to be compatible with `torch.compile`.
- Modified the empirical normalizer to use in-place operators, avoiding costly `__setattr__` calls.
- Parallelized the soft Q-update.
- Disabled gradient norm clipping by default, since it is quite expensive.

Files:

- `environments`
- `__init__.py`
- `fast_td3_deploy.py`
- `fast_td3_simbav2.py`
- `fast_td3_utils.py`
- `fast_td3.py`
- `hyperparams.py`
- `train.py`
- `training_notebook.ipynb`
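
Two of the changes listed above, the in-place normalizer update and the parallel soft Q-update, can be illustrated with a minimal sketch. This is not the repository's code: `RunningNormalizer` and `soft_update` are hypothetical names, the statistics merge is the standard parallel-variance formula, and the reading of "parallel" as one fused multi-tensor call over all parameters is an assumption. Note also that the `torch._foreach_*` functions are underscore-prefixed and may change between PyTorch releases.

```python
import torch
import torch.nn as nn


class RunningNormalizer(nn.Module):
    """Hypothetical running mean/variance normalizer (not the repository's class)."""

    def __init__(self, shape, eps: float = 1e-8):
        super().__init__()
        self.eps = eps
        # Buffers are registered once; afterwards they are only mutated in place.
        self.register_buffer("mean", torch.zeros(shape))
        self.register_buffer("var", torch.ones(shape))
        self.register_buffer("count", torch.zeros(()))

    @torch.no_grad()
    def update(self, x: torch.Tensor) -> None:
        batch_count = x.shape[0]
        batch_mean = x.mean(dim=0)
        batch_var = x.var(dim=0, unbiased=False)
        new_count = self.count + batch_count
        delta = batch_mean - self.mean
        # Merge running and batch statistics (parallel-variance formula).
        m2 = (
            self.var * self.count
            + batch_var * batch_count
            + delta.pow(2) * self.count * batch_count / new_count
        )
        # add_/copy_ write into the existing storage. Both `self.mean = ...`
        # and `self.mean += ...` would rebind the attribute and therefore go
        # through nn.Module.__setattr__.
        self.mean.add_(delta * batch_count / new_count)
        self.var.copy_(m2 / new_count)
        self.count.copy_(new_count)

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        return (x - self.mean) / torch.sqrt(self.var + self.eps)


@torch.no_grad()
def soft_update(target: nn.Module, source: nn.Module, tau: float) -> None:
    """Polyak averaging: target <- (1 - tau) * target + tau * source."""
    target_params = list(target.parameters())
    source_params = list(source.parameters())
    # Two multi-tensor calls replace a Python loop over individual parameters.
    torch._foreach_mul_(target_params, 1.0 - tau)
    torch._foreach_add_(target_params, source_params, alpha=tau)
```

In a training loop, such a normalizer would be updated with each observation batch via `normalizer.update(obs)`, and `soft_update(q_target, q_net, tau)` would be called after each critic step, where `q_target`, `q_net`, and `tau` stand in for whatever the training script actually uses.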