2020-05-21 12:05:06 +02:00
|
|
|
# mujoco-maze
|
|
|
|
|
|
|
|
Some maze environments for reinforcement learning(RL) using [mujoco-py] and
|
|
|
|
[openai gym][gym].
|
|
|
|
|
2020-06-04 07:34:00 +02:00
|
|
|
Thankfully, this project is based on the code from [rllab] and [tensorflow/models][models].
|
|
|
|
|
2020-08-03 18:45:22 +02:00
|
|
|
## Environments
|
|
|
|
|
|
|
|
- PointUMaze/AntUmaze
|
|
|
|
|
|
|
|
![PointUMaze](./screenshots/PointUMaze.png)
|
|
|
|
- PointUMaze-v0/AntUMaze-v0 (Distance-based Reward)
|
|
|
|
- PointUmaze-v1/AntUMaze-v1 (Goal-based Reward i.e., 1.0 or -ε)
|
|
|
|
|
|
|
|
- Point4Rooms/Ant4Rooms
|
|
|
|
|
|
|
|
![Point4Rooms](./screenshots/Point4Rooms.png)
|
|
|
|
- Point4Rooms-v0/Ant4Rooms-v0 (Distance-based Reward)
|
|
|
|
- Point4Rooms-v1/Ant4Rooms-v1 (Goal-based Reward)
|
|
|
|
- Point4Rooms-v2/Ant4Rooms-v2 (Multiple Goals (0.5 pt or 1.0 pt))
|
|
|
|
|
|
|
|
- PointPush/AntPush
|
|
|
|
|
|
|
|
![PointPush](./screenshots/PointPush.png)
|
|
|
|
- PointPush-v0/AntPush-v0 (Distance-based Reward)
|
|
|
|
- PointPush-v1/AntPush-v1 (Goal-based Reward)
|
|
|
|
|
|
|
|
- PointFall/AntFall
|
|
|
|
|
|
|
|
![PointFall](./screenshots/PointFall.png)
|
|
|
|
- PointFall-v0/AntFall-v0 (Distance-based Reward)
|
|
|
|
- PointFall-v1/AntFall-v1 (Goal-based Reward)
|
2020-05-21 12:05:06 +02:00
|
|
|
|
2020-09-26 11:37:20 +02:00
|
|
|
## Caveats
|
|
|
|
This project has some other features (e.g., block maze and other
|
|
|
|
robots) but they are work in progress.
|
|
|
|
|
2020-05-21 12:05:06 +02:00
|
|
|
## License
|
|
|
|
This project is licensed under Apache License, Version 2.0
|
|
|
|
([LICENSE-APACHE](LICENSE) or http://www.apache.org/licenses/LICENSE-2.0).
|
|
|
|
|
|
|
|
[gym]: https://github.com/openai/gym
|
|
|
|
[models]: https://github.com/tensorflow/models/tree/master/research/efficient-hrl
|
|
|
|
[mujoco-py]: https://github.com/openai/mujoco-py
|
|
|
|
[rllab]: https://github.com/rll/rllab
|