Trivial OpenAI Gym environments for exploratory tasks.
Go to file
2022-12-06 19:11:12 +01:00
columbus Config-configurable width / height and minimal docs 2022-12-06 19:11:12 +01:00
configs Config-configurable width / height and minimal docs 2022-12-06 19:11:12 +01:00
.gitignore Getting ready for pvenv 2022-07-01 12:10:48 +02:00
Columbus.pdf Extended README, added Columbus.pdf 2022-11-25 11:28:19 +01:00
icon.svg New Icon v2 2022-07-02 14:45:05 +02:00
img_README.png Added image to README 2022-06-19 15:04:19 +02:00
LICENSE MIT License 2022-11-13 20:07:00 +01:00
README.md Implemented Units, Updated README, Example file for Units 2022-12-06 12:00:45 +01:00
requirements.txt Added requirements 2022-07-01 12:22:49 +02:00
setup.py Converted to python-package 2022-06-19 16:37:53 +02:00

Project Columbus

Project Columbus is a framework for trivial 2D OpenAI Gym environments that are supposed to test a agents ability to solve tasks that require different forms of exploration effectively and efficiently.

Installation

(If you want to install Columbus as a dependency for metastable-baselines, activate (source) the venv from metastable-baselines before running this command.)

pip install -e .

Overview

Columbus.pdf contains a overview of columbus.

Layout of the Repo

env.py

Screenshot
Contains the ColumbusEnv. There exist two ways to implement new envs:

  • Subclassing ColumbusEnv and expanding init and overriding setup.
  • Using the ColumbusConfigDefined with a desired configuration. This makes configuring ColumbusEnvs via ClusterWorks2-configs possible. (See configs/example.yaml for an example of how the parameters are supposed to look like (uses yaml format)
  • We now support using units (px, em, ct) in config files, examples can be found in configs/Example_Units.yaml
  • The environments used in my thesis can also be found in configs/
Some caveats / infos
  • If you want to render to a window (pygame-gui) call render with mode='human'
  • If you want visualize the covariance you have to supply the cholesky-decomp of the cov-matrix to render
  • If you want to render into a mp4, you have to call render with a mode!='human' and assemble/encode the returned frames yourself into a mp4/webm/...
  • Even while the agent plays, some keyboard-inputs are possible (to test the agents reaction to situations he would never enter by itself. Look at _handle_user_input in env.py for avaible keys)
  • The sampling-rate of the physics engine is bound to the frame-rate of the rendering engine (1:1). This means too low fps / too fast agents / too thin barriers will lead to the agent tunneling through barriers. You can fix this by setting a higher agent-drag (which decreases the maximum speed) or making barriers thicker. A feature allowing the physics engine to sample multiple smaller steps within a single rendering step could be added in the future.

entities.py

Contains all implemented entities (e.g. the Agent, Rewards and Enemies)

observables.py

Contains all 'oberservables'. These are attached to envs to define what kind of output is given to the agent. This way environments can be designed independently from the observation machanism that is used by the agent to play it.

Some caveats
  • CNNObservable seems to be broken currently. (Fixing it is also no priority for me)
  • RayObservable is using a naive ray-marching (basicaly just line-sweeping). For large amounts of rays this turn out to be the computational bottleneck of the environment. Switching to a more efficient algorithm (based on euclidean formulars and line intersects) would be possible in the future...

humanPlayer.py

Allows environments to be played by a human using mouse input. Now even works for ColumbusConfigDefined.