|
ee4a0eed56
|
Fixed SAC+SDE+SDC bugs
|
2022-09-03 13:08:31 +02:00 |
|
|
4532135812
|
Finalized factoring out projections
|
2022-09-03 11:59:16 +02:00 |
|
|
0aeea4e2e5
|
Fixed Bug: Wrong dimensions for action_loss
|
2022-09-03 11:44:01 +02:00 |
|
|
4bb772a251
|
Factor Projections out into metastable-projections
|
2022-09-03 11:37:41 +02:00 |
|
|
0a037deccc
|
Implemented cov parametrization via eigen-decomp
|
2022-09-03 11:16:41 +02:00 |
|
|
e4a8cfc349
|
Implemented action_loss
|
2022-09-03 11:16:29 +02:00 |
|
|
2f05474091
|
Fixed a bug with KL-proj
|
2022-08-28 20:48:02 +02:00 |
|
|
4080ad8135
|
Removed old TODOs
|
2022-08-28 12:07:19 +02:00 |
|
|
eb881559d6
|
Support clip_range None
|
2022-08-28 02:07:18 +02:00 |
|
|
1d3c2fe005
|
Allow completely disabling some PPO features (for TRPL)
|
2022-08-28 00:26:44 +02:00 |
|
|
afec4e709c
|
Fixed bug in RolloutBuffer when using parallel envs
|
2022-08-27 16:02:40 +02:00 |
|
|
02e4ed1510
|
Added support for parallel envs
|
2022-08-27 15:19:00 +02:00 |
|
|
5c39be5ead
|
Testing Observables
|
2022-08-22 15:05:42 +02:00 |
|
|
c6a58b15dd
|
Fixing SDE bug
|
2022-08-22 14:19:40 +02:00 |
|
|
197de7997c
|
Fixed bug with SDE
|
2022-08-22 13:36:17 +02:00 |
|
|
a9e3f295b2
|
Fixed numerical issues with Wasserstein
|
2022-08-17 23:25:24 +02:00 |
|
|
9fffe048af
|
Fixed Spherical_Chol not accepting batches
|
2022-08-17 22:55:42 +02:00 |
|
|
86e6bfb65b
|
Higher epsilon to deal with numerical instabilities
|
2022-08-17 19:31:54 +02:00 |
|
|
64a7d5ec59
|
Guarante minimum epsilon when ensuring non-zero (CholNet)
|
2022-08-16 20:02:33 +02:00 |
|
|
d35c3d8520
|
Fixed all the bugs in TRPL
|
2022-08-15 16:55:17 +02:00 |
|
|
28d0c609bc
|
Fixed SDE: sampling had dimension mismatches
|
2022-08-14 20:09:10 +02:00 |
|
|
e1c59cffd0
|
Removed debug-print
|
2022-08-14 18:45:17 +02:00 |
|
|
639aae7f42
|
Testing SDE...
|
2022-08-14 18:42:45 +02:00 |
|
|
bb1f9ecf2b
|
Fixed UniversalGaussianDistribution lost SDE when cloning
|
2022-08-14 18:42:19 +02:00 |
|
|
0ee65e789b
|
Fixing sde's bugs
|
2022-08-14 16:10:22 +02:00 |
|
|
0e4eedae5e
|
Fixed gradient throught spherical-chol
|
2022-08-10 11:55:08 +02:00 |
|
|
520dc98eb5
|
Implemented SDE
|
2022-08-10 11:54:52 +02:00 |
|
|
12e422aec7
|
Why does KL double free?
|
2022-08-07 18:04:40 +02:00 |
|
|
75d73049b4
|
Fixing bugs with w2 and sqrt_induced_gaussian
|
2022-08-06 21:25:49 +02:00 |
|
|
802094a50f
|
Enabled w2 (can now get sqrt from dist)
|
2022-08-06 14:54:59 +02:00 |
|
|
508ebf51f0
|
Implemented sqrt-induced-gaussian for W2-Projection
|
2022-08-06 14:46:42 +02:00 |
|
|
fcd9953b37
|
Testing...
|
2022-08-06 14:37:30 +02:00 |
|
|
54113fd40c
|
Removed unused dependency
|
2022-08-06 14:37:06 +02:00 |
|
|
2c1689fbbc
|
Fixed smol bugs when instanciating based on string-names of enums
|
2022-08-06 14:36:35 +02:00 |
|
|
e074294b88
|
+1
|
2022-08-05 21:07:38 +02:00 |
|
|
8b82347056
|
Automatic casting to enums
|
2022-08-05 21:06:31 +02:00 |
|
|
683644f77d
|
Removed weird line...
|
2022-08-05 21:06:05 +02:00 |
|
|
a78e81b9e1
|
Allowing prob squashing
|
2022-07-21 09:42:25 +02:00 |
|
|
199ce0c8cb
|
ProbSquashing implemented (tanh)
|
2022-07-20 10:32:19 +02:00 |
|
|
05dad44b6e
|
Support SAC for testing
|
2022-07-19 10:08:47 +02:00 |
|
|
6384d411a9
|
Support SAC in replays
|
2022-07-19 10:08:34 +02:00 |
|
|
0162a36824
|
Renamed TRL_SAC => SAC
|
2022-07-19 10:08:14 +02:00 |
|
|
7b667e9650
|
SAC is back; with SDC; without Projections
|
2022-07-19 10:07:50 +02:00 |
|
|
5f32435751
|
Smashing bugs: dont confuse chol with chol_net
|
2022-07-19 10:07:20 +02:00 |
|
|
b7de99b1fc
|
EnforcePositiveType makes no sense for Strength.NONE
|
2022-07-19 10:06:40 +02:00 |
|
|
9133ecd61b
|
Show confidence-ellipsoid for supported envs
|
2022-07-17 00:48:17 +02:00 |
|
|
49f9acff3e
|
Fixed: Wrong simplification for Hybrid[SCALAR=>FULL]
|
2022-07-17 00:47:47 +02:00 |
|
|
046fa78206
|
Fixed: _chol_from_sphe_chol was unable to handle batches
|
2022-07-16 17:34:25 +02:00 |
|
|
c141599662
|
Fixed Bug: _chol_from_sphe_chol dependet on action_dim, not able to use
for Hybrid methods
|
2022-07-16 15:47:09 +02:00 |
|
|
3fa6de7e66
|
Broader sampling of stds for logging with batched full covs
|
2022-07-16 15:28:16 +02:00 |
|