Commit Graph

29 Commits

Author SHA1 Message Date
4080ad8135 Removed old TODOs 2022-08-28 12:07:19 +02:00
5c39be5ead Testing Observables 2022-08-22 15:05:42 +02:00
d35c3d8520 Fixed all the bugs in TRPL 2022-08-15 16:55:17 +02:00
639aae7f42 Testing SDE... 2022-08-14 18:42:45 +02:00
fcd9953b37 Testing... 2022-08-06 14:37:30 +02:00
05dad44b6e Support SAC for testing 2022-07-19 10:08:47 +02:00
a86d19053d Smashing bugs (dimension mismatch between Normal and
Independent/MultivariateNormal)
2022-07-15 15:46:31 +02:00
ab557a8856 Making MultivariateNormal Policies work (and porting Normal to
Independent)
2022-07-15 15:03:51 +02:00
b1ed9fc2b8 Renamed TRL_PG to PPO 2022-07-13 19:51:33 +02:00
1706bea571 Testing SDC 2022-07-13 19:39:09 +02:00
92204c448f Testing kl-projection (it's working!) 2022-07-02 16:41:25 +02:00
ad7ed0071b Silence! 2022-07-01 20:02:29 +02:00
0dc9edf112 We no longer use venv (breaks cpp_projection...) 2022-07-01 19:52:22 +02:00
81ae3e3707 Finalized venv support and added installation-instructions 2022-07-01 12:19:57 +02:00
2e378d0a7d Rebranding to Metastable Baselines 2022-06-30 20:40:30 +02:00
28561b9bb2 Allow manual early stopping of training (Ctrl+C) 2022-06-29 12:46:57 +02:00
024a9a0265 StillTesting 2022-06-26 16:38:46 +02:00
60c954c8c1 LunarLanderContinuous-v2 is our new default test-env 2022-06-25 21:47:21 +02:00
1a49a412c0 expanding automatic testing 2022-06-25 14:50:19 +02:00
941b7347f1 Get FPS from env 2022-06-22 13:12:55 +02:00
0e17b4c07e Fixed model storage location bug 2022-06-22 13:00:40 +02:00
84b3710850 Testing SACs ability to solve EasierObstacles-v0 2022-06-21 15:15:38 +02:00
e71735bf79 Trying to converge on simple columbus envs 2022-06-20 23:12:42 +02:00
b9303416ac Split PPO_SDE into PPO_BASE_SDE and PPO_LATENT_SDE 2022-06-19 22:47:04 +02:00
477a3c48b1 Testing the RayObserver 2022-06-19 20:34:04 +02:00
c45e3627ea Trying to get ColumbusEnv to work... 2022-06-19 15:59:55 +02:00
fcc33d7bdc Work on Columbus integration 2022-06-19 15:50:54 +02:00
bb160d2837 changed locations / names for logs 2022-06-17 13:16:32 +02:00
d74477e6c2 Added test.py for testing the algos (and tensorboard integration) 2022-06-17 11:29:36 +02:00