Commit Graph

170 Commits

Author SHA1 Message Date
f37d3215a6 Bug Fix for Full Cov 2022-09-23 23:06:19 +02:00
00dbc9bdd8 Error when calculating action_loss 2022-09-12 22:28:57 +02:00
2c14edd3b0 Fix: Dependency 2022-09-03 23:41:06 +02:00
d3fa3cc997 Fixed Givens Dependency 2022-09-03 23:16:54 +02:00
6e1a7cecd5 Implemented correct clipping (from original SAC) 2022-09-03 13:46:17 +02:00
ee4a0eed56 Fixed SAC+SDE+SDC bugs 2022-09-03 13:08:31 +02:00
4532135812 Finalized factoring out projections 2022-09-03 11:59:16 +02:00
0aeea4e2e5 Fixed Bug: Wrong dimensions for action_loss 2022-09-03 11:44:01 +02:00
4bb772a251 Factor Projections out into metastable-projections 2022-09-03 11:37:41 +02:00
0a037deccc Implemented cov parametrization via eigen-decomp 2022-09-03 11:16:41 +02:00
e4a8cfc349 Implemented action_loss 2022-09-03 11:16:29 +02:00
2f05474091 Fixed a bug with KL-proj 2022-08-28 20:48:02 +02:00
4080ad8135 Removed old TODOs 2022-08-28 12:07:19 +02:00
eb881559d6 Support clip_range None 2022-08-28 02:07:18 +02:00
1d3c2fe005 Allow completely disabling some PPO features (for TRPL) 2022-08-28 00:26:44 +02:00
afec4e709c Fixed bug in RolloutBuffer when using parallel envs 2022-08-27 16:02:40 +02:00
02e4ed1510 Added support for parallel envs 2022-08-27 15:19:00 +02:00
5c39be5ead Testing Observables 2022-08-22 15:05:42 +02:00
c6a58b15dd Fixing SDE bug 2022-08-22 14:19:40 +02:00
197de7997c Fixed bug with SDE 2022-08-22 13:36:17 +02:00
a9e3f295b2 Fixed numerical issues with Wasserstein 2022-08-17 23:25:24 +02:00
9fffe048af Fixed Spherical_Chol not accepting batches 2022-08-17 22:55:42 +02:00
86e6bfb65b Higher epsilon to deal with numerical instabilities 2022-08-17 19:31:54 +02:00
64a7d5ec59 Guarante minimum epsilon when ensuring non-zero (CholNet) 2022-08-16 20:02:33 +02:00
d35c3d8520 Fixed all the bugs in TRPL 2022-08-15 16:55:17 +02:00
28d0c609bc Fixed SDE: sampling had dimension mismatches 2022-08-14 20:09:10 +02:00
e1c59cffd0 Removed debug-print 2022-08-14 18:45:17 +02:00
639aae7f42 Testing SDE... 2022-08-14 18:42:45 +02:00
bb1f9ecf2b Fixed UniversalGaussianDistribution lost SDE when cloning 2022-08-14 18:42:19 +02:00
0ee65e789b Fixing sde's bugs 2022-08-14 16:10:22 +02:00
0e4eedae5e Fixed gradient throught spherical-chol 2022-08-10 11:55:08 +02:00
520dc98eb5 Implemented SDE 2022-08-10 11:54:52 +02:00
12e422aec7 Why does KL double free? 2022-08-07 18:04:40 +02:00
75d73049b4 Fixing bugs with w2 and sqrt_induced_gaussian 2022-08-06 21:25:49 +02:00
802094a50f Enabled w2 (can now get sqrt from dist) 2022-08-06 14:54:59 +02:00
508ebf51f0 Implemented sqrt-induced-gaussian for W2-Projection 2022-08-06 14:46:42 +02:00
fcd9953b37 Testing... 2022-08-06 14:37:30 +02:00
54113fd40c Removed unused dependency 2022-08-06 14:37:06 +02:00
2c1689fbbc Fixed smol bugs when instanciating based on string-names of enums 2022-08-06 14:36:35 +02:00
e074294b88 +1 2022-08-05 21:07:38 +02:00
8b82347056 Automatic casting to enums 2022-08-05 21:06:31 +02:00
683644f77d Removed weird line... 2022-08-05 21:06:05 +02:00
a78e81b9e1 Allowing prob squashing 2022-07-21 09:42:25 +02:00
199ce0c8cb ProbSquashing implemented (tanh) 2022-07-20 10:32:19 +02:00
05dad44b6e Support SAC for testing 2022-07-19 10:08:47 +02:00
6384d411a9 Support SAC in replays 2022-07-19 10:08:34 +02:00
0162a36824 Renamed TRL_SAC => SAC 2022-07-19 10:08:14 +02:00
7b667e9650 SAC is back; with SDC; without Projections 2022-07-19 10:07:50 +02:00
5f32435751 Smashing bugs: dont confuse chol with chol_net 2022-07-19 10:07:20 +02:00
b7de99b1fc EnforcePositiveType makes no sense for Strength.NONE 2022-07-19 10:06:40 +02:00