Commit Graph

21 Commits

Author SHA1 Message Date
afec4e709c Fixed bug in RolloutBuffer when using parallel envs 2022-08-27 16:02:40 +02:00
a9e3f295b2 Fixed numerical issues with Wasserstein 2022-08-17 23:25:24 +02:00
d35c3d8520 Fixed all the bugs in TRPL 2022-08-15 16:55:17 +02:00
75d73049b4 Fixing bugs with w2 and sqrt_induced_gaussian 2022-08-06 21:25:49 +02:00
802094a50f Enabled w2 (can now get sqrt from dist) 2022-08-06 14:54:59 +02:00
54113fd40c Removed unused dependency 2022-08-06 14:37:06 +02:00
683644f77d Removed weird line... 2022-08-05 21:06:05 +02:00
199ce0c8cb ProbSquashing implemented (tanh) 2022-07-20 10:32:19 +02:00
74697e8773 Smol bug fixes 2022-07-15 18:46:17 +02:00
a86d19053d Smashing bugs (dimension mismatch between Normal and
Independent/MultivariateNormal)
2022-07-15 15:46:31 +02:00
ab557a8856 Making MultivariateNormal Policies work (and porting Normal to
Independent)
2022-07-15 15:03:51 +02:00
fae19509bc Implemented Policies with Contextual Covariance 2022-07-13 19:38:20 +02:00
41e4170b2f Fixes + spherical_chol 2022-07-11 17:28:08 +02:00
4c4b12ee0e Allow cloning UniversalGaussianDistribution (new_dist_like) 2022-07-09 14:46:11 +02:00
e09950b30c Work on Contextual Covariances 2022-07-09 12:26:39 +02:00
0911a04e98 Factored out Gaussian Collection for RolloutBuffer 2022-07-04 21:21:16 +02:00
91f64c10d7 Fixed bug with initialization of buffer 2022-07-01 20:02:09 +02:00
84d1cda96c Trying to get kl to work 2022-07-01 13:45:58 +02:00
2626ec82a6 Added code-src-note 2022-07-01 11:52:50 +02:00
ab1b269af9 Allow checking whether a dist is contextual 2022-07-01 11:52:14 +02:00
2e378d0a7d Rebranding to Metastable Baselines 2022-06-30 20:40:30 +02:00