Commit Graph

242 Commits

Author SHA1 Message Date
Fabian
89439f7532 updated incorrect angle normalization. 2021-03-22 15:25:22 +01:00
Maximilian Huettenrauch
a0692b1089 updates 2021-03-19 16:31:46 +01:00
Maximilian Huettenrauch
1d8b22245d updates 2021-02-26 17:34:31 +01:00
Maximilian Huettenrauch
27d7da6774 biac weightscale change 2021-02-24 22:58:48 +01:00
Maximilian Huettenrauch
7e988758fe updates 2021-02-24 15:37:54 +01:00
ottofabian
b7b5af03ba added kl full cov projection and cov plots 2021-02-22 17:05:12 +01:00
Maximilian Huettenrauch
60e1673ee1 viapoint reacher reward bug fix 2021-02-19 16:17:55 +01:00
Maximilian Huettenrauch
1e3f036478 updates 2021-02-19 10:00:26 +01:00
Maximilian Huettenrauch
dd18a04df6 biac reward function update 2021-02-18 11:33:55 +01:00
Maximilian Huettenrauch
7ed22df778 update biac reward 2021-02-17 18:50:55 +01:00
Maximilian Huettenrauch
46fc642c36 updates 2021-02-17 17:48:05 +01:00
Maximilian Huettenrauch
420fe10506 biac normal cost 2021-02-16 18:47:08 +01:00
Maximilian Huettenrauch
7eef78d620 biac updates 2021-02-16 15:47:32 +01:00
Maximilian Huettenrauch
0916daf3b5 updates 2021-02-15 16:31:34 +01:00
Maximilian Huettenrauch
77d0cbd00a updates 2021-02-15 09:03:19 +01:00
Maximilian Huettenrauch
95250af31c added viapoint reacher 2021-02-12 17:12:40 +01:00
Maximilian Huettenrauch
708478c626 updates in biac 2021-02-11 16:19:57 +01:00
Maximilian Huettenrauch
13a292f0e0 updates 2021-02-11 12:32:32 +01:00
Maximilian Huettenrauch
c81378b9e7 support for contexts, policy classes, pd controller example, breaking changes etc 2021-02-11 10:49:57 +01:00
ottofabian
d026ebc427 added balancing to reacher 2021-02-09 17:07:52 +01:00
Maximilian Huettenrauch
07195fa2dc lots of new stuff... 2021-02-05 17:10:03 +01:00
Maximilian Huettenrauch
cab2c249bb started table tennis env 2021-01-21 09:42:04 +01:00
Maximilian Huettenrauch
2d9e7fb3eb fixes in holereacher 2021-01-15 17:16:52 +01:00
Maximilian Huettenrauch
b7400c477d updates 2021-01-14 17:10:03 +01:00
Maximilian Huettenrauch
104281fe16 changed from step to rollout method 2021-01-12 10:52:08 +01:00
Maximilian Huettenrauch
a8fcbd6fb0 dmp env wrappers initial 2021-01-11 16:08:42 +01:00
ottofabian
f171117d8f fixed imports and first mpo version 2020-12-18 14:24:02 +01:00
ottofabian
b8f0c91a90 refractoring of projection layer, improved modularization of code 2020-12-11 09:46:35 +01:00
ottofabian
58131ef470 Added balancing reacher task and stochastic search task interface 2020-12-07 11:13:27 +01:00
ottofabian
741f1cb636 smaller changes 2020-11-03 11:26:06 +01:00
ottofabian
bac7a87b61 added Sparse Short version 2020-09-26 15:07:42 +02:00
ottofabian
d9f52194f7 reacher updates 2020-09-22 17:41:25 +02:00
ottofabian
8fc1210f1e some new stuff 2020-09-19 17:47:20 +02:00
ottofabian
cbdd5d1854 removed action scaling 2020-09-08 12:43:14 +02:00
ottofabian
df4361564a added max entropy RL and loading/testing of models 2020-09-04 13:35:05 +02:00
ottofabian
93e9c77356 removed EZPickle from SimpleReacher 2020-09-01 17:57:51 +02:00
ottofabian
a9d3e718bb some changes to reward 2020-08-31 15:52:15 +02:00
ottofabian
3cc4d6e667 some changes to reward 2020-08-31 15:51:47 +02:00
ottofabian
7f4f52ab10 SimpleReacher state space changed 2020-08-31 11:26:32 +02:00
ottofabian
aec332ff0c fixed some issues with SimpleReacher state space 2020-08-31 10:33:11 +02:00
ottofabian
2ff850328b fixed some issues with SimpleReacher rendering 2020-08-31 10:18:59 +02:00
ottofabian
31156cec4d added simple reacher task 2020-08-28 18:31:06 +02:00