|
6384d411a9
|
Support SAC in replays
|
2022-07-19 10:08:34 +02:00 |
|
|
0162a36824
|
Renamed TRL_SAC => SAC
|
2022-07-19 10:08:14 +02:00 |
|
|
7b667e9650
|
SAC is back; with SDC; without Projections
|
2022-07-19 10:07:50 +02:00 |
|
|
5f32435751
|
Smashing bugs: don't confuse chol with chol_net
|
2022-07-19 10:07:20 +02:00 |
|
|
b7de99b1fc
|
EnforcePositiveType makes no sense for Strength.NONE
|
2022-07-19 10:06:40 +02:00 |
|
|
9133ecd61b
|
Show confidence-ellipsoid for supported envs
|
2022-07-17 00:48:17 +02:00 |
|
|
49f9acff3e
|
Fixed: Wrong simplification for Hybrid[SCALAR=>FULL]
|
2022-07-17 00:47:47 +02:00 |
|
|
046fa78206
|
Fixed: _chol_from_sphe_chol was unable to handle batches
|
2022-07-16 17:34:25 +02:00 |
|
|
c141599662
|
Fixed Bug: _chol_from_sphe_chol depended on action_dim, so it could not be used
for Hybrid methods
|
2022-07-16 15:47:09 +02:00 |
|
|
3fa6de7e66
|
Broader sampling of stds for logging with batched full covs
|
2022-07-16 15:28:16 +02:00 |
|
|
bc0e188a0d
|
Removed debugging-code
|
2022-07-16 15:19:56 +02:00 |
|
|
d2d84d3287
|
Fixed bug for logging std-estimates when using batched data
|
2022-07-16 15:18:24 +02:00 |
|
|
4a24381f46
|
Fixed bug when using batches with SPHERICAL_CHOL
|
2022-07-16 15:17:48 +02:00 |
|
|
04529e8261
|
Removed debug-point
|
2022-07-16 14:58:29 +02:00 |
|
|
4854346f2d
|
Fixed bug with logging of std for full-cov
|
2022-07-16 14:58:00 +02:00 |
|
|
cb9ee4f302
|
Fixed bugs for Hybrid[Diag=>Full]
|
2022-07-16 14:57:34 +02:00 |
|
|
ad584d70fd
|
Removed debugging prints
|
2022-07-16 13:07:08 +02:00 |
|
|
72754525cd
|
Allow using newly implemented hybrid method
|
2022-07-16 13:06:07 +02:00 |
|
|
fa167b3e5f
|
Hybrid Diag -> Full Implemented; Made spherical_chol more efficient
|
2022-07-16 13:05:35 +02:00 |
|
|
f184b88f19
|
Allow std logging for full and diagonal cov policies
|
2022-07-15 18:46:42 +02:00 |
|
|
74697e8773
|
Smol bug fixes
|
2022-07-15 18:46:17 +02:00 |
|
|
2e0f46b0f3
|
Fixing ser/deser bug (cloudpickle can't handle some enums)
|
2022-07-15 18:45:38 +02:00 |
|
|
a86d19053d
|
Smashing bugs (dimension mismatch between Normal and
Independent/MultivariateNormal)
|
2022-07-15 15:46:31 +02:00 |
|
|
ab557a8856
|
Making MultivariateNormal Policies work (and porting Normal to
Independent)
|
2022-07-15 15:03:51 +02:00 |
|
|
b1ed9fc2b8
|
Renamed TRL_PG to PPO
|
2022-07-13 19:51:33 +02:00 |
|
|
1706bea571
|
Testing SDC
|
2022-07-13 19:39:09 +02:00 |
|
|
3304fd49f6
|
Working on UniversalGaussianDistribution
|
2022-07-13 19:38:57 +02:00 |
|
|
fae19509bc
|
Implemented Policies with Contextual Covariance
|
2022-07-13 19:38:20 +02:00 |
|
|
41e4170b2f
|
Fixes + spherical_chol
|
2022-07-11 17:28:08 +02:00 |
|
|
e4440428f8
|
Working on SDC
|
2022-07-11 11:55:23 +02:00 |
|
|
4c4b12ee0e
|
Allow cloning UniversalGaussianDistribution (new_dist_like)
|
2022-07-09 14:46:11 +02:00 |
|
|
c08ea1cb91
|
Making UniversalGaussianDistribution ready for tanh-squashing-support
|
2022-07-09 14:33:07 +02:00 |
|
|
249754ee89
|
Wrote a little helper-function to generate all allowed combinations of
cov-parameterizations
|
2022-07-09 14:03:56 +02:00 |
|
|
e09950b30c
|
Work on Contextual Covariances
|
2022-07-09 12:26:39 +02:00 |
|
|
aacacebfc4
|
Fixed license
|
2022-07-06 13:26:33 +02:00 |
|
|
0911a04e98
|
Factored out Gaussian Collection for RolloutBuffer
|
2022-07-04 21:21:16 +02:00 |
|
|
1f3eac5398
|
Cleanup
|
2022-07-02 16:42:14 +02:00 |
|
|
92204c448f
|
Testing kl-projection (it's working!)
|
2022-07-02 16:41:25 +02:00 |
|
|
ad7ed0071b
|
Silence!
|
2022-07-01 20:02:29 +02:00 |
|
|
91f64c10d7
|
Fixed bug with initialization of buffer
|
2022-07-01 20:02:09 +02:00 |
|
|
0dc9edf112
|
We no longer use venv (breaks cpp_projection...)
|
2022-07-01 19:52:22 +02:00 |
|
|
14100cccc8
|
Working on SDC-impl.
|
2022-07-01 15:14:41 +02:00 |
|
|
4f2e75b7ae
|
Slimmed 3rd-party-licenses
|
2022-07-01 14:03:50 +02:00 |
|
|
90b3c68a56
|
Maybe better licensing
|
2022-07-01 13:59:07 +02:00 |
|
|
cc51547681
|
Fixed CppProjection in requirements
|
2022-07-01 13:47:04 +02:00 |
|
|
84d1cda96c
|
Trying to get kl to work
|
2022-07-01 13:45:58 +02:00 |
|
|
9d7ce73a0b
|
README: We also have to install cpp_projection
|
2022-07-01 12:30:29 +02:00 |
|
|
111b1b846d
|
Now we even include a setup.py
|
2022-07-01 12:22:27 +02:00 |
|
|
81ae3e3707
|
Finalized venv support and added installation-instructions
|
2022-07-01 12:19:57 +02:00 |
|
|
f25d7a103b
|
Now ready to pvenv
|
2022-07-01 12:03:20 +02:00 |
|