Commit Graph

19 Commits

Author SHA1 Message Date
8921421732 Fixed Typo 2023-05-21 16:14:34 +02:00
2efa6c18fb Experimental Support for PCA 2023-05-21 14:27:09 +02:00
76ea3a6326 Implemented prior conditioned annealing (untested) 2023-04-25 17:05:34 +02:00
71670782b6 . 2023-01-28 21:47:19 +01:00
ffbf2b3fe5 Allow reduced latent sde dim 2023-01-27 13:34:28 +01:00
4ec5c65cf2 Tiny fix for other envs 2022-11-07 13:23:55 +01:00
4bb772a251 Factor Projections out into metastable-projections 2022-09-03 11:37:41 +02:00
4080ad8135 Removed old TODOs 2022-08-28 12:07:19 +02:00
02e4ed1510 Added support for parallel envs 2022-08-27 15:19:00 +02:00
d35c3d8520 Fixed all the bugs in TRPL 2022-08-15 16:55:17 +02:00
28d0c609bc Fixed SDE: sampling had dimension mismatches 2022-08-14 20:09:10 +02:00
0ee65e789b Fixing sde's bugs 2022-08-14 16:10:22 +02:00
520dc98eb5 Implemented SDE 2022-08-10 11:54:52 +02:00
12e422aec7 Why does KL double free? 2022-08-07 18:04:40 +02:00
75d73049b4 Fixing bugs with w2 and sqrt_induced_gaussian 2022-08-06 21:25:49 +02:00
802094a50f Enabled w2 (can now get sqrt from dist) 2022-08-06 14:54:59 +02:00
508ebf51f0 Implemented sqrt-induced-gaussian for W2-Projection 2022-08-06 14:46:42 +02:00
ab557a8856 Making MultivariateNormal Policies work (and porting Normal to
Independent)
2022-07-15 15:03:51 +02:00
b1ed9fc2b8 Renamed TRL_PG to PPO 2022-07-13 19:51:33 +02:00