Commit Graph

36 Commits

SHA1 Message Date
69fdd7b103 Fix Bug: Let's let the pca-lib handle this exception 2023-05-21 17:19:39 +02:00
c473990fbd Fix smol Bug 2023-05-21 17:15:46 +02:00
6a9d7012c7 Bugs 2023-05-21 17:03:22 +02:00
8921421732 Fixed Typo 2023-05-21 16:14:34 +02:00
2efa6c18fb Experimental Support for PCA 2023-05-21 14:27:09 +02:00
76ea3a6326 Implemented prior conditioned annealing (untested) 2023-04-25 17:05:34 +02:00
71670782b6 . 2023-01-28 21:47:19 +01:00
ffbf2b3fe5 Allow reduced latent sde dim 2023-01-27 13:34:28 +01:00
6f1837bda5 calc episodic infos for some fancy envs 2023-01-26 17:27:34 +01:00
f3e03916c8 Upgrading to SB3 1.7 (probably broke some stuff...) 2022-12-13 19:14:28 +01:00
4ec5c65cf2 Tiny fix for other envs 2022-11-07 13:23:55 +01:00
479d73ac4b Hotfix for exploding gradients 2022-11-03 20:13:36 +01:00
00dbc9bdd8 Error when calculating action_loss 2022-09-12 22:28:57 +02:00
4532135812 Finalized factoring out projections 2022-09-03 11:59:16 +02:00
0aeea4e2e5 Fixed Bug: Wrong dimensions for action_loss 2022-09-03 11:44:01 +02:00
4bb772a251 Factor Projections out into metastable-projections 2022-09-03 11:37:41 +02:00
e4a8cfc349 Implemented action_loss 2022-09-03 11:16:29 +02:00
4080ad8135 Removed old TODOs 2022-08-28 12:07:19 +02:00
eb881559d6 Support clip_range None 2022-08-28 02:07:18 +02:00
1d3c2fe005 Allow completely disabling some PPO features (for TRPL) 2022-08-28 00:26:44 +02:00
02e4ed1510 Added support for parallel envs 2022-08-27 15:19:00 +02:00
d35c3d8520 Fixed all the bugs in TRPL 2022-08-15 16:55:17 +02:00
28d0c609bc Fixed SDE: sampling had dimension mismatches 2022-08-14 20:09:10 +02:00
0ee65e789b Fixing sde's bugs 2022-08-14 16:10:22 +02:00
520dc98eb5 Implemented SDE 2022-08-10 11:54:52 +02:00
12e422aec7 Why does KL double free? 2022-08-07 18:04:40 +02:00
75d73049b4 Fixing bugs with w2 and sqrt_induced_gaussian 2022-08-06 21:25:49 +02:00
802094a50f Enabled w2 (can now get sqrt from dist) 2022-08-06 14:54:59 +02:00
508ebf51f0 Implemented sqrt-induced-gaussian for W2-Projection 2022-08-06 14:46:42 +02:00
3fa6de7e66 Broader sampling of stds for logging with batched full covs 2022-07-16 15:28:16 +02:00
d2d84d3287 Fixed bug for logging std-estimates when using batched data 2022-07-16 15:18:24 +02:00
4854346f2d Fixed bug with logging of std for full-cov 2022-07-16 14:58:00 +02:00
f184b88f19 Allow std logging for full and diagonal cov policies 2022-07-15 18:46:42 +02:00
a86d19053d Smashing bugs (dimension mismatch between Normal and Independent/MultivariateNormal) 2022-07-15 15:46:31 +02:00
ab557a8856 Making MultivariateNormal Policies work (and porting Normal to Independent) 2022-07-15 15:03:51 +02:00
b1ed9fc2b8 Renamed TRL_PG to PPO 2022-07-13 19:51:33 +02:00