Commit Graph

69 Commits

Author SHA1 Message Date
e4a8cfc349 Implemented action_loss 2022-09-03 11:16:29 +02:00
4080ad8135 Removed old TODOs 2022-08-28 12:07:19 +02:00
eb881559d6 Support clip_range None 2022-08-28 02:07:18 +02:00
1d3c2fe005 Allow completely disabling some PPO features (for TRPL) 2022-08-28 00:26:44 +02:00
afec4e709c Fixed bug in RolloutBuffer when using parallel envs 2022-08-27 16:02:40 +02:00
02e4ed1510 Added support for parallel envs 2022-08-27 15:19:00 +02:00
c6a58b15dd Fixing SDE bug 2022-08-22 14:19:40 +02:00
197de7997c Fixed bug with SDE 2022-08-22 13:36:17 +02:00
a9e3f295b2 Fixed numerical issues with Wasserstein 2022-08-17 23:25:24 +02:00
9fffe048af Fixed Spherical_Chol not accepting batches 2022-08-17 22:55:42 +02:00
86e6bfb65b Higher epsilon to deal with numerical instabilities 2022-08-17 19:31:54 +02:00
64a7d5ec59 Guarante minimum epsilon when ensuring non-zero (CholNet) 2022-08-16 20:02:33 +02:00
d35c3d8520 Fixed all the bugs in TRPL 2022-08-15 16:55:17 +02:00
28d0c609bc Fixed SDE: sampling had dimension mismatches 2022-08-14 20:09:10 +02:00
e1c59cffd0 Removed debug-print 2022-08-14 18:45:17 +02:00
bb1f9ecf2b Fixed UniversalGaussianDistribution lost SDE when cloning 2022-08-14 18:42:19 +02:00
0ee65e789b Fixing sde's bugs 2022-08-14 16:10:22 +02:00
0e4eedae5e Fixed gradient throught spherical-chol 2022-08-10 11:55:08 +02:00
520dc98eb5 Implemented SDE 2022-08-10 11:54:52 +02:00
12e422aec7 Why does KL double free? 2022-08-07 18:04:40 +02:00
75d73049b4 Fixing bugs with w2 and sqrt_induced_gaussian 2022-08-06 21:25:49 +02:00
802094a50f Enabled w2 (can now get sqrt from dist) 2022-08-06 14:54:59 +02:00
508ebf51f0 Implemented sqrt-induced-gaussian for W2-Projection 2022-08-06 14:46:42 +02:00
54113fd40c Removed unused dependency 2022-08-06 14:37:06 +02:00
2c1689fbbc Fixed smol bugs when instanciating based on string-names of enums 2022-08-06 14:36:35 +02:00
8b82347056 Automatic casting to enums 2022-08-05 21:06:31 +02:00
683644f77d Removed weird line... 2022-08-05 21:06:05 +02:00
a78e81b9e1 Allowing prob squashing 2022-07-21 09:42:25 +02:00
199ce0c8cb ProbSquashing implemented (tanh) 2022-07-20 10:32:19 +02:00
0162a36824 Renamed TRL_SAC => SAC 2022-07-19 10:08:14 +02:00
7b667e9650 SAC is back; with SDC; without Projections 2022-07-19 10:07:50 +02:00
5f32435751 Smashing bugs: dont confuse chol with chol_net 2022-07-19 10:07:20 +02:00
b7de99b1fc EnforcePositiveType makes no sense for Strength.NONE 2022-07-19 10:06:40 +02:00
49f9acff3e Fixed: Wrong simplification for Hybrid[SCALAR=>FULL] 2022-07-17 00:47:47 +02:00
046fa78206 Fixed: _chol_from_sphe_chol was unable to handle batches 2022-07-16 17:34:25 +02:00
c141599662 Fixed Bug: _chol_from_sphe_chol dependet on action_dim, not able to use
for Hybrid methods
2022-07-16 15:47:09 +02:00
3fa6de7e66 Broader sampling of stds for logging with batched full covs 2022-07-16 15:28:16 +02:00
bc0e188a0d Removed debugging-code 2022-07-16 15:19:56 +02:00
d2d84d3287 Fixed bug for logging std-estimates when using batched data 2022-07-16 15:18:24 +02:00
4a24381f46 Fixed bug when using batches with SPHERICAL_CHOL 2022-07-16 15:17:48 +02:00
04529e8261 Removed debug-point 2022-07-16 14:58:29 +02:00
4854346f2d Fixed bug with logging of std for full-cov 2022-07-16 14:58:00 +02:00
cb9ee4f302 Fixed bugs for Hybrid[Diag=>Full] 2022-07-16 14:57:34 +02:00
ad584d70fd Removed debugging prints 2022-07-16 13:07:08 +02:00
72754525cd Allow using newly implemented hybrid method 2022-07-16 13:06:07 +02:00
fa167b3e5f Hybrid Diag -> Full Implemented; Made spherical_chol more efficient 2022-07-16 13:05:35 +02:00
f184b88f19 Allow std logging for full and diagonal cov policies 2022-07-15 18:46:42 +02:00
74697e8773 Smol bug fixes 2022-07-15 18:46:17 +02:00
2e0f46b0f3 Fixing ser/deser bug (cloudpickle cant handle some enums) 2022-07-15 18:45:38 +02:00
a86d19053d Smashing bugs (dimension mismatch between Normal and
Independent/MultivariateNormal)
2022-07-15 15:46:31 +02:00