|
f421dc2ab5
|
Removed hack again...
|
2023-01-26 18:18:38 +01:00 |
|
|
f86c6c736f
|
Hack is back
|
2023-01-26 18:15:06 +01:00 |
|
|
20646f07f3
|
Reverting last two commits (was ugly hack)
|
2023-01-26 17:43:57 +01:00 |
|
|
6f1837bda5
|
calc episodic infos for some fancy envs
|
2023-01-26 17:27:34 +01:00 |
|
|
f37c8caaa4
|
Force include reward in env infos (for vec env)
|
2023-01-26 12:00:18 +01:00 |
|
|
82a174122a
|
Fixed bugs with givens-rotator for eigen
|
2022-10-24 10:08:31 +02:00 |
|
|
4532135812
|
Finalized factoring out projections
|
2022-09-03 11:59:16 +02:00 |
|
|
0a037deccc
|
Implemented cov parametrization via eigen-decomp
|
2022-09-03 11:16:41 +02:00 |
|
|
afec4e709c
|
Fixed bug in RolloutBuffer when using parallel envs
|
2022-08-27 16:02:40 +02:00 |
|
|
a9e3f295b2
|
Fixed numerical issues with Wasserstein
|
2022-08-17 23:25:24 +02:00 |
|
|
d35c3d8520
|
Fixed all the bugs in TRPL
|
2022-08-15 16:55:17 +02:00 |
|
|
75d73049b4
|
Fixing bugs with w2 and sqrt_induced_gaussian
|
2022-08-06 21:25:49 +02:00 |
|
|
802094a50f
|
Enabled w2 (can now get sqrt from dist)
|
2022-08-06 14:54:59 +02:00 |
|
|
54113fd40c
|
Removed unused dependency
|
2022-08-06 14:37:06 +02:00 |
|
|
683644f77d
|
Removed weird line...
|
2022-08-05 21:06:05 +02:00 |
|
|
199ce0c8cb
|
ProbSquashing implemented (tanh)
|
2022-07-20 10:32:19 +02:00 |
|
|
74697e8773
|
Smol bug fixes
|
2022-07-15 18:46:17 +02:00 |
|
|
a86d19053d
|
Smashing bugs (dimension mismatch between Normal and
Independent/MultivariateNormal)
|
2022-07-15 15:46:31 +02:00 |
|
|
ab557a8856
|
Making MultivariateNormal Policies work (and porting Normal to
Independent)
|
2022-07-15 15:03:51 +02:00 |
|
|
fae19509bc
|
Implemented Policies with Contextual Covariance
|
2022-07-13 19:38:20 +02:00 |
|
|
41e4170b2f
|
Fixes + spherical_chol
|
2022-07-11 17:28:08 +02:00 |
|
|
4c4b12ee0e
|
Allow cloning UniversalGaussianDistribution (new_dist_like)
|
2022-07-09 14:46:11 +02:00 |
|
|
e09950b30c
|
Work on Contextual Covariances
|
2022-07-09 12:26:39 +02:00 |
|
|
0911a04e98
|
Factored out Gaussian Collection for RolloutBuffer
|
2022-07-04 21:21:16 +02:00 |
|
|
91f64c10d7
|
Fixed bug with initialization of buffer
|
2022-07-01 20:02:09 +02:00 |
|
|
84d1cda96c
|
Trying to get kl to work
|
2022-07-01 13:45:58 +02:00 |
|
|
2626ec82a6
|
Added code-src-note
|
2022-07-01 11:52:50 +02:00 |
|
|
ab1b269af9
|
Allow checking whether a dist is contextual
|
2022-07-01 11:52:14 +02:00 |
|
|
2e378d0a7d
|
Rebranding to Metastable Baselines
|
2022-06-30 20:40:30 +02:00 |
|