Commit Graph

184 Commits

SHA1 Message Date
14100cccc8 Working on SDC-impl. 2022-07-01 15:14:41 +02:00
4f2e75b7ae Slimmed 3rd-party-licenses 2022-07-01 14:03:50 +02:00
90b3c68a56 Maybe better licensing 2022-07-01 13:59:07 +02:00
cc51547681 Fixed CppProjection in requirements 2022-07-01 13:47:04 +02:00
84d1cda96c Trying to get kl to work 2022-07-01 13:45:58 +02:00
9d7ce73a0b README: We also have to install cpp_projection 2022-07-01 12:30:29 +02:00
111b1b846d Now we even include a setup.py 2022-07-01 12:22:27 +02:00
81ae3e3707 Finalized venv support and added installation-instructions 2022-07-01 12:19:57 +02:00
f25d7a103b Now ready to pvenv 2022-07-01 12:03:20 +02:00
2626ec82a6 Added code-src-note 2022-07-01 11:52:50 +02:00
ab1b269af9 Allow checking whether a dist is contextual 2022-07-01 11:52:14 +02:00
a8b9c63965 Making dez covariances contextual 2022-07-01 11:29:12 +02:00
155a298e41 Switching to new icon 2022-06-30 21:02:22 +02:00
7e7ba65e51 Testing alternative icon 2022-06-30 21:01:25 +02:00
2e378d0a7d Rebranding to Metastable Baselines 2022-06-30 20:40:30 +02:00
30c9e93967 Fixed replay 2022-06-29 17:02:40 +02:00
28561b9bb2 Allow manual early stopping of training (Ctrl+C) 2022-06-29 12:46:57 +02:00
e8d423f91f Testing the new WassersteinProjectionLayer 2022-06-29 12:46:37 +02:00
4e77190d8e Fixed chol not expanding bug and function to shrink chol to diag 2022-06-29 12:44:13 +02:00
7c117cfca5 Added possibility to load models and run hem again (currently bugged) 2022-06-29 12:43:21 +02:00
416dde202d Factored out frob_sq and perf improvement for spd input 2022-06-27 13:44:08 +02:00
f4c87c9cdc Better handling of diagonal-covariance as vector and matrix 2022-06-26 18:14:12 +02:00
bc61a6db32 Refactored some stuff out 2022-06-26 16:39:37 +02:00
edf00553dd Renamed our RolloutBuffer and testing the FrobeniusProjectionLayer 2022-06-26 16:39:06 +02:00
024a9a0265 StillTesting 2022-06-26 16:38:46 +02:00
80741776d2 Removed old comments 2022-06-25 21:56:07 +02:00
b8488c531b Implemented TRLRolloutBuffer 2022-06-25 21:47:39 +02:00
60c954c8c1 LunarLanderContinuous-v2 is our new default test-env 2022-06-25 21:47:21 +02:00
cf5a2e82fc mean and std are now saved to the rollout 2022-06-25 18:29:55 +02:00
df21e1dc3f Fixed reference to gym.spaces 2022-06-25 17:49:46 +02:00
5f7cfd2e10 Note about Code-Src 2022-06-25 15:37:45 +02:00
b9f66dd95d Using BaseProjectionLayer as Default 2022-06-25 15:33:43 +02:00
cafc90409f _global_steps ctr added 2022-06-25 15:05:54 +02:00
866f863d70 overriding collect_rollouts 2022-06-25 14:57:27 +02:00
1a49a412c0 expanding automatic testing 2022-06-25 14:50:19 +02:00
25316ec0b8 Incremental progress at implementing trl_pg 2022-06-25 13:58:58 +02:00
941b7347f1 Get FPS from env 2022-06-22 13:12:55 +02:00
0e17b4c07e Fixed model storage location bug 2022-06-22 13:00:40 +02:00
41d4e94dbe Extended .gitignore 2022-06-22 13:00:28 +02:00
84b3710850 Testing SACs ability to solve EasierObstacles-v0 2022-06-21 15:15:38 +02:00
13d335f856 models added to .gitignore 2022-06-21 15:15:22 +02:00
e71735bf79 Trying to converge on simple columbus envs 2022-06-20 23:12:42 +02:00
b9303416ac Split PPO_SDE into PPO_BASE_SDE and PPO_LATENT_SDE 2022-06-19 22:47:04 +02:00
c0a5b6650d Added model*.py to .gitignore 2022-06-19 20:34:33 +02:00
477a3c48b1 Testing the RayObserver 2022-06-19 20:34:04 +02:00
605a81c81c No longer needing update_subtrees.sh 2022-06-19 16:30:46 +02:00
7b8f2e7109 I dont like subtrees 2022-06-19 16:30:32 +02:00
12e9e09add Updating Columbus - Merge commit '132176985464f6ba2e8f7c88794215644f7f9066' 2022-06-19 16:15:58 +02:00
1321769854 Squashed 'subtrees/columbus/' changes from 489a098..6ebd95e 2022-06-19 16:15:58 +02:00
    6ebd95e Trying to get remote imports to work
    git-subtree-dir: subtrees/columbus
    git-subtree-split: 6ebd95e74f0a22ba6fbd8d0386d6e625a736926a
26ba8c64c7 Update Columbus - Merge commit 'a49eae378dc65a73e101cd094703ebd80534de77' 2022-06-19 16:07:12 +02:00