Commit Graph

25 Commits

SHA1 Message Date
199ce0c8cb ProbSquashing implemented (tanh) 2022-07-20 10:32:19 +02:00
b7de99b1fc EnforcePositiveType makes no sense for Strength.NONE 2022-07-19 10:06:40 +02:00
49f9acff3e Fixed: Wrong simplification for Hybrid[SCALAR=>FULL] 2022-07-17 00:47:47 +02:00
046fa78206 Fixed: _chol_from_sphe_chol was unable to handle batches 2022-07-16 17:34:25 +02:00
c141599662 Fixed Bug: _chol_from_sphe_chol depended on action_dim, not able to use for Hybrid methods 2022-07-16 15:47:09 +02:00
bc0e188a0d Removed debugging-code 2022-07-16 15:19:56 +02:00
4a24381f46 Fixed bug when using batches with SPHERICAL_CHOL 2022-07-16 15:17:48 +02:00
04529e8261 Removed debug-point 2022-07-16 14:58:29 +02:00
cb9ee4f302 Fixed bugs for Hybrid[Diag=>Full] 2022-07-16 14:57:34 +02:00
ad584d70fd Removed debugging prints 2022-07-16 13:07:08 +02:00
72754525cd Allow using newly implemented hybrid method 2022-07-16 13:06:07 +02:00
fa167b3e5f Hybrid Diag -> Full Implemented; Made spherical_chol more efficient 2022-07-16 13:05:35 +02:00
2e0f46b0f3 Fixing ser/deser bug (cloudpickle can't handle some enums) 2022-07-15 18:45:38 +02:00
a86d19053d Smashing bugs (dimension mismatch between Normal and Independent/MultivariateNormal) 2022-07-15 15:46:31 +02:00
ab557a8856 Making MultivariateNormal Policies work (and porting Normal to Independent) 2022-07-15 15:03:51 +02:00
3304fd49f6 Working on UniversalGaussianDistribution 2022-07-13 19:38:57 +02:00
41e4170b2f Fixes + spherical_chol 2022-07-11 17:28:08 +02:00
e4440428f8 Working on SDC 2022-07-11 11:55:23 +02:00
4c4b12ee0e Allow cloning UniversalGaussianDistribution (new_dist_like) 2022-07-09 14:46:11 +02:00
c08ea1cb91 Making UniversalGaussianDistribution ready for tanh-squashing-support 2022-07-09 14:33:07 +02:00
249754ee89 Wrote a little helper-function to generate all allowed combinations of cov-parameterizations 2022-07-09 14:03:56 +02:00
e09950b30c Work on Contextual Covariances 2022-07-09 12:26:39 +02:00
14100cccc8 Working on SDC-impl. 2022-07-01 15:14:41 +02:00
a8b9c63965 Making the covariances contextual 2022-07-01 11:29:12 +02:00
2e378d0a7d Rebranding to Metastable Baselines 2022-06-30 20:40:30 +02:00