Commit Graph

19 Commits

Author SHA1 Message Date
ab557a8856 Making MultivariateNormal Policies work (and porting Normal to
Independent)
2022-07-15 15:03:51 +02:00
b1ed9fc2b8 Renamed TRL_PG to PPO 2022-07-13 19:51:33 +02:00
3304fd49f6 Working on UniversalGaussianDistribution 2022-07-13 19:38:57 +02:00
fae19509bc Implemented Policies with Contextual Covariance 2022-07-13 19:38:20 +02:00
41e4170b2f Fixes + spherical_chol 2022-07-11 17:28:08 +02:00
e4440428f8 Working on SDC 2022-07-11 11:55:23 +02:00
4c4b12ee0e Allow cloning UniversalGaussianDistribution (new_dist_like) 2022-07-09 14:46:11 +02:00
c08ea1cb91 Making UniversalGaussianDistribution ready for tanh-squashing-support 2022-07-09 14:33:07 +02:00
249754ee89 Wrote a little helper-function to generate all allowed combinations of
cov-parameterizations
2022-07-09 14:03:56 +02:00
e09950b30c Work on Contextual Covariances 2022-07-09 12:26:39 +02:00
0911a04e98 Factored out Gaussian Collection for RolloutBuffer 2022-07-04 21:21:16 +02:00
1f3eac5398 Cleanup 2022-07-02 16:42:14 +02:00
91f64c10d7 Fixed bug with initialization of buffer 2022-07-01 20:02:09 +02:00
14100cccc8 Working on SDC-impl. 2022-07-01 15:14:41 +02:00
84d1cda96c Trying to get kl to work 2022-07-01 13:45:58 +02:00
2626ec82a6 Added code-src-note 2022-07-01 11:52:50 +02:00
ab1b269af9 Allow checking whether a dist is contextual 2022-07-01 11:52:14 +02:00
a8b9c63965 Making dez covariances contextual 2022-07-01 11:29:12 +02:00
2e378d0a7d Rebranding to Metastable Baselines 2022-06-30 20:40:30 +02:00