Maximilian Huettenrauch
|
a0692b1089
|
updates
|
2021-03-19 16:31:46 +01:00 |
|
Maximilian Huettenrauch
|
1d8b22245d
|
updates
|
2021-02-26 17:34:31 +01:00 |
|
Maximilian Huettenrauch
|
27d7da6774
|
biac weightscale change
|
2021-02-24 22:58:48 +01:00 |
|
Maximilian Huettenrauch
|
7e988758fe
|
updates
|
2021-02-24 15:37:54 +01:00 |
|
ottofabian
|
b7b5af03ba
|
added kl full cov projection and cov plots
|
2021-02-22 17:05:12 +01:00 |
|
Maximilian Huettenrauch
|
60e1673ee1
|
viapoint reacher reward bug fix
|
2021-02-19 16:17:55 +01:00 |
|
Maximilian Huettenrauch
|
1e3f036478
|
updates
|
2021-02-19 10:00:26 +01:00 |
|
Maximilian Huettenrauch
|
dd18a04df6
|
biac reward function update
|
2021-02-18 11:33:55 +01:00 |
|
Maximilian Huettenrauch
|
7ed22df778
|
update biac reward
|
2021-02-17 18:50:55 +01:00 |
|
Maximilian Huettenrauch
|
46fc642c36
|
updates
|
2021-02-17 17:48:05 +01:00 |
|
Maximilian Huettenrauch
|
420fe10506
|
biac normal cost
|
2021-02-16 18:47:08 +01:00 |
|
Maximilian Huettenrauch
|
7eef78d620
|
biac updates
|
2021-02-16 15:47:32 +01:00 |
|
Maximilian Huettenrauch
|
0916daf3b5
|
updates
|
2021-02-15 16:31:34 +01:00 |
|
Maximilian Huettenrauch
|
77d0cbd00a
|
updates
|
2021-02-15 09:03:19 +01:00 |
|
Maximilian Huettenrauch
|
95250af31c
|
added viapoint reacher
|
2021-02-12 17:12:40 +01:00 |
|
Maximilian Huettenrauch
|
708478c626
|
updates in biac
|
2021-02-11 16:19:57 +01:00 |
|
Maximilian Huettenrauch
|
13a292f0e0
|
updates
|
2021-02-11 12:32:32 +01:00 |
|
Maximilian Huettenrauch
|
c81378b9e7
|
support for contexts, policy classes, pd controller example, breaking changes etc
|
2021-02-11 10:49:57 +01:00 |
|
ottofabian
|
d026ebc427
|
added balancing to reacher
|
2021-02-09 17:07:52 +01:00 |
|
Maximilian Huettenrauch
|
07195fa2dc
|
lots of new stuff...
|
2021-02-05 17:10:03 +01:00 |
|
Maximilian Huettenrauch
|
cab2c249bb
|
started table tennis env
|
2021-01-21 09:42:04 +01:00 |
|
Maximilian Huettenrauch
|
2d9e7fb3eb
|
fixes in holereacher
|
2021-01-15 17:16:52 +01:00 |
|
Maximilian Huettenrauch
|
b7400c477d
|
updates
|
2021-01-14 17:10:03 +01:00 |
|
Maximilian Huettenrauch
|
104281fe16
|
changed from step to rollout method
|
2021-01-12 10:52:08 +01:00 |
|
Maximilian Huettenrauch
|
a8fcbd6fb0
|
dmp env wrappers initial
|
2021-01-11 16:08:42 +01:00 |
|
ottofabian
|
f171117d8f
|
fixed imports and first mpo version
|
2020-12-18 14:24:02 +01:00 |
|
ottofabian
|
b8f0c91a90
|
refractoring of projection layer, improved modularization of code
|
2020-12-11 09:46:35 +01:00 |
|
ottofabian
|
58131ef470
|
Added balancing reacher task and stochastic search task interface
|
2020-12-07 11:13:27 +01:00 |
|
ottofabian
|
741f1cb636
|
smaller changes
|
2020-11-03 11:26:06 +01:00 |
|
ottofabian
|
bac7a87b61
|
added Sparse Short version
|
2020-09-26 15:07:42 +02:00 |
|
ottofabian
|
d9f52194f7
|
reacher updates
|
2020-09-22 17:41:25 +02:00 |
|
ottofabian
|
8fc1210f1e
|
some new stuff
|
2020-09-19 17:47:20 +02:00 |
|
ottofabian
|
cbdd5d1854
|
removed action scaling
|
2020-09-08 12:43:14 +02:00 |
|
ottofabian
|
df4361564a
|
added max entropy RL and loading/testing of models
|
2020-09-04 13:35:05 +02:00 |
|
ottofabian
|
93e9c77356
|
removed EZPickle from SimpleReacher
|
2020-09-01 17:57:51 +02:00 |
|
ottofabian
|
a9d3e718bb
|
some changes to reward
|
2020-08-31 15:52:15 +02:00 |
|
ottofabian
|
3cc4d6e667
|
some changes to reward
|
2020-08-31 15:51:47 +02:00 |
|
ottofabian
|
7f4f52ab10
|
SimpleReacher state space changed
|
2020-08-31 11:26:32 +02:00 |
|
ottofabian
|
aec332ff0c
|
fixed some issues with SimpleReacher state space
|
2020-08-31 10:33:11 +02:00 |
|
ottofabian
|
2ff850328b
|
fixed some issues with SimpleReacher rendering
|
2020-08-31 10:18:59 +02:00 |
|
ottofabian
|
31156cec4d
|
added simple reacher task
|
2020-08-28 18:31:06 +02:00 |
|