|
4532135812
|
Finalized factoring out projections
|
2022-09-03 11:59:16 +02:00 |
|
|
4bb772a251
|
Factor Projections out into metastable-projections
|
2022-09-03 11:37:41 +02:00 |
|
|
2f05474091
|
Fixed a bug with KL-proj
|
2022-08-28 20:48:02 +02:00 |
|
|
4080ad8135
|
Removed old TODOs
|
2022-08-28 12:07:19 +02:00 |
|
|
5c39be5ead
|
Testing Observables
|
2022-08-22 15:05:42 +02:00 |
|
|
d35c3d8520
|
Fixed all the bugs in TRPL
|
2022-08-15 16:55:17 +02:00 |
|
|
639aae7f42
|
Testing SDE...
|
2022-08-14 18:42:45 +02:00 |
|
|
fcd9953b37
|
Testing...
|
2022-08-06 14:37:30 +02:00 |
|
|
05dad44b6e
|
Support SAC for testing
|
2022-07-19 10:08:47 +02:00 |
|
|
a86d19053d
|
Smashing bugs (dimension mismatch between Normal and
Independent/MultivariateNormal)
|
2022-07-15 15:46:31 +02:00 |
|
|
ab557a8856
|
Making MultivariateNormal Policies work (and porting Normal to
Independent)
|
2022-07-15 15:03:51 +02:00 |
|
|
b1ed9fc2b8
|
Renamed TRL_PG to PPO
|
2022-07-13 19:51:33 +02:00 |
|
|
1706bea571
|
Testing SDC
|
2022-07-13 19:39:09 +02:00 |
|
|
92204c448f
|
Testing kl-projection (it's working!)
|
2022-07-02 16:41:25 +02:00 |
|
|
ad7ed0071b
|
Silence!
|
2022-07-01 20:02:29 +02:00 |
|
|
0dc9edf112
|
We no longer use venv (breaks cpp_projection...)
|
2022-07-01 19:52:22 +02:00 |
|
|
81ae3e3707
|
Finalized venv support and added installation-instructions
|
2022-07-01 12:19:57 +02:00 |
|
|
2e378d0a7d
|
Rebranding to Metastable Baselines
|
2022-06-30 20:40:30 +02:00 |
|
|
28561b9bb2
|
Allow manual early stopping of training (Ctrl+C)
|
2022-06-29 12:46:57 +02:00 |
|
|
024a9a0265
|
StillTesting
|
2022-06-26 16:38:46 +02:00 |
|
|
60c954c8c1
|
LunarLanderContinuous-v2 is our new default test-env
|
2022-06-25 21:47:21 +02:00 |
|
|
1a49a412c0
|
expanding automatic testing
|
2022-06-25 14:50:19 +02:00 |
|
|
941b7347f1
|
Get FPS from env
|
2022-06-22 13:12:55 +02:00 |
|
|
0e17b4c07e
|
Fixed model storage location bug
|
2022-06-22 13:00:40 +02:00 |
|
|
84b3710850
|
Testing SACs ability to solve EasierObstacles-v0
|
2022-06-21 15:15:38 +02:00 |
|
|
e71735bf79
|
Trying to converge on simple columbus envs
|
2022-06-20 23:12:42 +02:00 |
|
|
b9303416ac
|
Split PPO_SDE into PPO_BASE_SDE and PPO_LATENT_SDE
|
2022-06-19 22:47:04 +02:00 |
|
|
477a3c48b1
|
Testing the RayObserver
|
2022-06-19 20:34:04 +02:00 |
|
|
c45e3627ea
|
Trying to get ColumbusEnv to work...
|
2022-06-19 15:59:55 +02:00 |
|
|
fcc33d7bdc
|
Work on Columbus integration
|
2022-06-19 15:50:54 +02:00 |
|
|
bb160d2837
|
changed locations / names for logs
|
2022-06-17 13:16:32 +02:00 |
|
|
d74477e6c2
|
Added test.py for testing the algos (and tensorboard integration)
|
2022-06-17 11:29:36 +02:00 |
|