FastTD3


Younggyo Seo	85cb1c65c7	Fix replay buffer issues when n_steps > 1 (#7)	2025-06-07 01:20:48 -04:00
- Fix the n-step reward being computed incorrectly for end-of-episode transitions when n_step > 1.
- Fix observations and next_observations being sampled across different episodes when n_step > 1 and the buffer is full.
- Fix the discount being computed incorrectly when n_step > 1.
environments	black formatting and update tuned_reward for T1	2025-05-29 08:29:44 +00:00
__init__.py	Initial Public Release	2025-05-29 01:49:23 +00:00
fast_td3_deploy.py	Initial Public Release	2025-05-29 01:49:23 +00:00
fast_td3_utils.py	Fix replay buffer issues when n_steps > 1 (#7)	2025-06-07 01:20:48 -04:00
fast_td3.py	Fix replay buffer issues when n_steps > 1 (#7)	2025-06-07 01:20:48 -04:00
hyperparams.py	Fix replay buffer issues when n_steps > 1 (#7)	2025-06-07 01:20:48 -04:00
train.py	Fix replay buffer issues when n_steps > 1 (#7)	2025-06-07 01:20:48 -04:00
training_notebook.ipynb	memory optimization for playground	2025-05-29 06:58:28 +00:00
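
The latest commit message mentions n-step rewards and discounts at episode boundaries. As a rough illustration of what that involves, here is a minimal sketch of an n-step return that stops accumulating at the end of an episode. It is not the code in fast_td3_utils.py: the function name `n_step_return` and its arguments are hypothetical, and it works on plain Python lists rather than batched tensors.

```python
from typing import Sequence, Tuple


def n_step_return(
    rewards: Sequence[float],
    dones: Sequence[bool],
    gamma: float,
    n_step: int,
) -> Tuple[float, float, int]:
    """Hypothetical helper: accumulate up to n_step discounted rewards.

    Returns (reward_sum, bootstrap_discount, steps_used).
    bootstrap_discount is the factor to multiply the target value by;
    it is 0.0 when the episode ended inside the window.
    """
    reward_sum = 0.0
    discount = 1.0
    steps_used = 0
    for reward, done in zip(rewards[:n_step], dones[:n_step]):
        reward_sum += discount * reward
        discount *= gamma
        steps_used += 1
        if done:
            # Episode ended: ignore later rewards (they belong to the
            # next episode) and do not bootstrap from a next state.
            return reward_sum, 0.0, steps_used
    # No termination in the window: bootstrap with gamma ** steps_used.
    return reward_sum, discount, steps_used


# Full 3-step window without termination.
full = n_step_return([1.0, 1.0, 1.0], [False, False, False], gamma=0.5, n_step=3)

# Episode ends at the second step, so the third reward is excluded.
truncated = n_step_return([1.0, 1.0, 1.0], [False, True, False], gamma=0.5, n_step=3)
```

In this sketch the discount returned for bootstrapping depends on how many steps were actually used, not on the configured `n_step`, and `steps_used` tells the caller which later observation to pair with the transition so that it stays within the same episode.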