DeepMind Control (DMC)
These are the Environment Wrappers for selected DeepMind Control environments in order to use our Motion Primitive gym interface with them.
Step-Based Environments
When installing fancy_gym with the optional dmc extra (e.g. pip install fancy_gym[dmc]), all regular dmc tasks are avaible via Shimmy.
Name  | 
Description  | 
Action Dim  | 
Observation Dim  | 
|---|---|---|---|
  | 
Underactuated double pendulum (Acrobot) with torque applied to the second joint to swing up and balance.  | 
1  | 
6  | 
  | 
Similar to   | 
1  | 
6  | 
  | 
Planar ball-in-cup task where a receptacle must swing to catch a ball. Sparse reward for catching.  | 
2  | 
8  | 
  | 
Cart-pole task where the goal is to balance an unactuated pole by moving a cart, starting near upright.  | 
1  | 
5  | 
  | 
Similar to   | 
1  | 
5  | 
  | 
Cart-pole task with the pole starting downward, requiring swinging up and balancing.  | 
1  | 
5  | 
  | 
Similar to   | 
1  | 
5  | 
  | 
Extension of the Cart-pole domain with two serially connected poles, increasing the balancing challenge.  | 
1  | 
8  | 
  | 
Extension of the Cart-pole domain with three serially connected poles, further increasing the challenge.  | 
1  | 
11  | 
  | 
Planar bipedal cheetah robot tasked with running. The reward is proportional to forward velocity up to a maximum speed.  | 
6  | 
17  | 
  | 
Dog robot task focusing on achieving a standing posture.  | 
38  | 
223  | 
  | 
Dog robot tasked with walking, requiring coordination of movements.  | 
38  | 
223  | 
  | 
Dog robot performing a trotting gait.  | 
38  | 
223  | 
  | 
Dog robot tasked with running, combining speed with stability.  | 
38  | 
223  | 
  | 
Dog robot playing fetch, involving locomotion and object interaction.  | 
38  | 
232  | 
  | 
Finger robot required to rotate an unactuated body on a hinge.  | 
2  | 
9  | 
  | 
Finger robot task to align the tip of a free body with a target, easier version with a larger target.  | 
2  | 
12  | 
  | 
Similar to   | 
2  | 
12  | 
  | 
Fish robot task focused on righting itself in a fluid environment.  | 
5  | 
21  | 
  | 
Fish robot swimming to a target, incorporating fluid dynamics.  | 
5  | 
24  | 
  | 
One-legged hopper robot tasked with achieving a minimal torso height.  | 
4  | 
15  | 
  | 
One-legged hopper robot required to hop, combining height and forward velocity.  | 
4  | 
15  | 
  | 
Simplified humanoid robot maintaining a standing posture.  | 
21  | 
67  | 
  | 
Simplified humanoid robot walking at a specified speed.  | 
21  | 
67  | 
  | 
Simplified humanoid robot running, aiming for a high horizontal speed.  | 
21  | 
67  | 
  | 
Simplified humanoid robot running with a focus on pure state control.  | 
21  | 
55  | 
  | 
Advanced humanoid robot (CMU model) maintaining a standing posture.  | 
56  | 
137  | 
  | 
Advanced humanoid robot (CMU model) walking.  | 
56  | 
137  | 
  | 
Advanced humanoid robot (CMU model) running.  | 
56  | 
137  | 
  | 
Linear quadratic regulator (LQR) task with 2 masses and 1 actuator, focusing on position and control optimization.  | 
1  | 
4  | 
  | 
Linear quadratic regulator (LQR) task with 6 masses and 2 actuators, more complex control optimization challenge.  | 
2  | 
12  | 
  | 
Planar manipulator robot bringing a ball to a target location, with object initialization variations.  | 
5  | 
44  | 
  | 
Planar manipulator task of bringing a peg to a target peg.  | 
5  | 
44  | 
  | 
Planar manipulator task requiring inserting a ball into a basket.  | 
5  | 
44  | 
  | 
Planar manipulator challenge of inserting a peg into a slot.  | 
5  | 
44  | 
  | 
Classic inverted pendulum task with a torque-limited actuator, requiring multiple swings to swing up and balance.  | 
1  | 
3  | 
  | 
Planar point mass in an easy task, with actuators corresponding to global x and y axes  | 
2  | 
4  | 
  | 
Planar point mass in a hard task, with randomized control gains per episode, challenging memoryless agents.  | 
2  | 
4  | 
  | 
Four-legged robot (Quadruped) tasked with walking.  | 
12  | 
78  | 
  | 
Quadruped robot required to run, balancing speed and stability.  | 
12  | 
78  | 
  | 
Quadruped robot in an escape task, combining locomotion with environmental interaction.  | 
12  | 
101  | 
  | 
Quadruped robot playing fetch, involving complex movements and object interaction.  | 
12  | 
90  | 
  | 
Two-link planar reacher with a randomized target, easier version with a larger target sphere.  | 
2  | 
6  | 
  | 
Similar to   | 
2  | 
6  | 
  | 
Stacker task requiring stacking of 2 boxes, with rewards for correct placement and gripper positioning.  | 
5  | 
49  | 
  | 
More complex stacker task with 4 boxes, increasing the stacking challenge.  | 
5  | 
63  | 
  | 
Six-link planar swimmer in fluid dynamics, rewarded for positioning the nose inside a target.  | 
5  | 
25  | 
  | 
Fifteen-link planar swimmer, a more complex version of the swimmer task with extended dynamics.  | 
14  | 
61  | 
  | 
Planar walker robot tasked with maintaining an upright torso and minimal height.  | 
6  | 
24  | 
  | 
Planar walker robot walking task, focusing on forward velocity and stability.  | 
6  | 
24  | 
  | 
Planar walker robot running task, aiming for high speed and balance.  | 
6  | 
24  | 
MP Environments
Name  | 
Description  | 
Trajectory Horizon  | 
Action Dimension  | 
Context Dimension  | 
|---|---|---|---|---|
  | 
A ProMP wrapped version of the “catch” task for the “ball_in_cup” environment.  | 
1000  | 
10  | 
2  | 
  | 
A DMP wrapped version of the “catch” task for the “ball_in_cup” environment.  | 
1000  | 
10  | 
2  | 
  | 
A ProMP wrapped version of the “easy” task for the “reacher” environment.  | 
1000  | 
10  | 
4  | 
  | 
A DMP wrapped version of the “easy” task for the “reacher” environment.  | 
1000  | 
10  | 
4  | 
  | 
A ProMP wrapped version of the “hard” task for the “reacher” environment.  | 
1000  | 
10  | 
4  | 
  | 
A DMP wrapped version of the “hard” task for the “reacher” environment.  | 
1000  | 
10  | 
4  |