Commit Graph

660 Commits

Author SHA1 Message Date
ottofabian
6a7c6991bb added balancing task 2021-03-26 15:32:50 +01:00
Maximilian Huettenrauch
f730cb92ba updates 2021-03-26 14:30:58 +01:00
ottofabian
7ceadeff0a refractoring of DMP environmets to fit gym interface better. 2021-03-26 14:05:16 +01:00
Maximilian Huettenrauch
6233c85904 update on holereacher reward 2021-03-22 15:28:50 +01:00
Fabian
89439f7532 updated incorrect angle normalization. 2021-03-22 15:25:22 +01:00
Maximilian Huettenrauch
a0692b1089 updates 2021-03-19 16:31:46 +01:00
Maximilian Huettenrauch
1d8b22245d updates 2021-02-26 17:34:31 +01:00
Maximilian Huettenrauch
27d7da6774 biac weightscale change 2021-02-24 22:58:48 +01:00
Maximilian Huettenrauch
7e988758fe updates 2021-02-24 15:37:54 +01:00
ottofabian
b7b5af03ba added kl full cov projection and cov plots 2021-02-22 17:05:12 +01:00
Maximilian Huettenrauch
60e1673ee1 viapoint reacher reward bug fix 2021-02-19 16:17:55 +01:00
Maximilian Huettenrauch
1e3f036478 updates 2021-02-19 10:00:26 +01:00
Maximilian Huettenrauch
dd18a04df6 biac reward function update 2021-02-18 11:33:55 +01:00
Maximilian Huettenrauch
7ed22df778 update biac reward 2021-02-17 18:50:55 +01:00
Maximilian Huettenrauch
46fc642c36 updates 2021-02-17 17:48:05 +01:00
Maximilian Huettenrauch
420fe10506 biac normal cost 2021-02-16 18:47:08 +01:00
Maximilian Huettenrauch
7eef78d620 biac updates 2021-02-16 15:47:32 +01:00
Maximilian Huettenrauch
0916daf3b5 updates 2021-02-15 16:31:34 +01:00
Maximilian Huettenrauch
77d0cbd00a updates 2021-02-15 09:03:19 +01:00
Maximilian Huettenrauch
95250af31c added viapoint reacher 2021-02-12 17:12:40 +01:00
Maximilian Huettenrauch
708478c626 updates in biac 2021-02-11 16:19:57 +01:00
Maximilian Huettenrauch
13a292f0e0 updates 2021-02-11 12:32:32 +01:00
Maximilian Huettenrauch
c81378b9e7 support for contexts, policy classes, pd controller example, breaking changes etc 2021-02-11 10:49:57 +01:00
ottofabian
d026ebc427 added balancing to reacher 2021-02-09 17:07:52 +01:00
Maximilian Huettenrauch
07195fa2dc lots of new stuff... 2021-02-05 17:10:03 +01:00
Maximilian Huettenrauch
cab2c249bb started table tennis env 2021-01-21 09:42:04 +01:00
ottofabian
c008614214 Merge remote-tracking branch 'origin/master' 2021-01-19 14:27:27 +01:00
Maximilian Huettenrauch
2d9e7fb3eb fixes in holereacher 2021-01-15 17:16:52 +01:00
Maximilian Huettenrauch
b7400c477d updates 2021-01-14 17:10:03 +01:00
Maximilian Huettenrauch
104281fe16 changed from step to rollout method 2021-01-12 10:52:08 +01:00
Maximilian Huettenrauch
a8fcbd6fb0 dmp env wrappers initial 2021-01-11 16:08:42 +01:00
ottofabian
f171117d8f fixed imports and first mpo version 2020-12-18 14:24:02 +01:00
Maximilian Huettenrauch
72b5e2bfc9 remove .idea stuff v2 2020-12-14 16:20:25 +01:00
Maximilian Huettenrauch
8e0a5ba29e removed pycharm .idea folder 2020-12-14 16:19:07 +01:00
ottofabian
b8f0c91a90 refractoring of projection layer, improved modularization of code 2020-12-11 09:46:35 +01:00
ottofabian
293012ba77 Update README.md 2020-12-07 11:28:06 +01:00
ottofabian
768ae14655 Updated README.md 2020-12-07 11:25:58 +01:00
ottofabian
b4096ad8a2 Merge branch 'master' of github.com:ALRhub/reacher_5_links 2020-12-07 11:15:23 +01:00
ottofabian
58131ef470 Added balancing reacher task and stochastic search task interface 2020-12-07 11:13:27 +01:00
ottofabian
6e15066fb3 Update README.md 2020-11-25 10:00:36 +01:00
ottofabian
741f1cb636 smaller changes 2020-11-03 11:26:06 +01:00
ottofabian
bac7a87b61 added Sparse Short version 2020-09-26 15:07:42 +02:00
ottofabian
d9f52194f7 reacher updates 2020-09-22 17:41:25 +02:00
ottofabian
8fc1210f1e some new stuff 2020-09-19 17:47:20 +02:00
ottofabian
cbdd5d1854 removed action scaling 2020-09-08 12:43:14 +02:00
ottofabian
df4361564a added max entropy RL and loading/testing of models 2020-09-04 13:35:05 +02:00
ottofabian
93e9c77356 removed EZPickle from SimpleReacher 2020-09-01 17:57:51 +02:00
ottofabian
a9d3e718bb some changes to reward 2020-08-31 15:52:15 +02:00
ottofabian
3cc4d6e667 some changes to reward 2020-08-31 15:51:47 +02:00
ottofabian
7f4f52ab10 SimpleReacher state space changed 2020-08-31 11:26:32 +02:00