Better guide
This commit is contained in:
		
							parent
							
								
									57c3d36490
								
							
						
					
					
						commit
						ddd0eaf88a
					
				@ -1,9 +1,7 @@
 | 
				
			|||||||
Basic Usage
 | 
					Basic Usage
 | 
				
			||||||
-----------
 | 
					-----------
 | 
				
			||||||
 | 
					
 | 
				
			||||||
We will only show the basics here and prepared `multiple
 | 
					We will only show the basics here and prepared :ref:`multiple examples <example-general>` for a more detailed look.
 | 
				
			||||||
examples <https://github.com/ALRhub/fancy_gym/tree/master/fancy_gym/examples/>`__
 | 
					 | 
				
			||||||
for a more detailed look.
 | 
					 | 
				
			||||||
 | 
					
 | 
				
			||||||
Step-Based Environments
 | 
					Step-Based Environments
 | 
				
			||||||
~~~~~~~~~~~~~~~~~~~~~~~
 | 
					~~~~~~~~~~~~~~~~~~~~~~~
 | 
				
			||||||
@ -34,7 +32,7 @@ Regular step based environments added by Fancy Gym are added into the
 | 
				
			|||||||
       if terminated or truncated:
 | 
					       if terminated or truncated:
 | 
				
			||||||
           observation, info = env.reset()
 | 
					           observation, info = env.reset()
 | 
				
			||||||
 | 
					
 | 
				
			||||||
Black-box Environments
 | 
					Black-Box Environments
 | 
				
			||||||
~~~~~~~~~~~~~~~~~~~~~~
 | 
					~~~~~~~~~~~~~~~~~~~~~~
 | 
				
			||||||
 | 
					
 | 
				
			||||||
All environments provide by default the cumulative episode reward, this
 | 
					All environments provide by default the cumulative episode reward, this
 | 
				
			||||||
@ -67,13 +65,13 @@ a MP-variant of an environment is given by
 | 
				
			|||||||
``<original namespace>_<MP name>/``. Just keep in mind, calling
 | 
					``<original namespace>_<MP name>/``. Just keep in mind, calling
 | 
				
			||||||
``step()`` executes a full trajectory.
 | 
					``step()`` executes a full trajectory.
 | 
				
			||||||
 | 
					
 | 
				
			||||||
   | **Note:**
 | 
					.. note::
 | 
				
			||||||
   | Currently, we are also in the process of enabling replanning as
 | 
					    Currently, we are also in the process of enabling replanning as
 | 
				
			||||||
     well as learning of sub-trajectories. This allows to split the
 | 
					    well as learning of sub-trajectories. This allows to split the
 | 
				
			||||||
     episode into multiple trajectories and is a hybrid setting between
 | 
					    episode into multiple trajectories and is a hybrid setting between
 | 
				
			||||||
     step-based and black-box leaning. While this is already
 | 
					    step-based and black-box leaning. While this is already
 | 
				
			||||||
     implemented, it is still in beta and requires further testing. Feel
 | 
					    implemented, it is still in beta and requires further testing. Feel
 | 
				
			||||||
     free to try it and open an issue with any problems that occur.
 | 
					    free to try it and open an issue with any problems that occur.
 | 
				
			||||||
 | 
					
 | 
				
			||||||
.. code:: python
 | 
					.. code:: python
 | 
				
			||||||
 | 
					
 | 
				
			||||||
 | 
				
			|||||||
@ -3,7 +3,7 @@ What is Episodic RL?
 | 
				
			|||||||
 | 
					
 | 
				
			||||||
.. raw:: html
 | 
					.. raw:: html
 | 
				
			||||||
 | 
					
 | 
				
			||||||
   <p align="justify">
 | 
					    <div class="justify">
 | 
				
			||||||
 | 
					
 | 
				
			||||||
Movement primitive (MP) environments differ from traditional step-based
 | 
					Movement primitive (MP) environments differ from traditional step-based
 | 
				
			||||||
environments. They align more with concepts from stochastic search,
 | 
					environments. They align more with concepts from stochastic search,
 | 
				
			||||||
@ -14,13 +14,6 @@ produced by trajectory generators like Dynamic Movement Primitives
 | 
				
			|||||||
(DMP), Probabilistic Movement Primitives (ProMP) or Probabilistic
 | 
					(DMP), Probabilistic Movement Primitives (ProMP) or Probabilistic
 | 
				
			||||||
Dynamic Movement Primitives (ProDMP).
 | 
					Dynamic Movement Primitives (ProDMP).
 | 
				
			||||||
 | 
					
 | 
				
			||||||
.. raw:: html
 | 
					 | 
				
			||||||
 | 
					 | 
				
			||||||
   </p>
 | 
					 | 
				
			||||||
 | 
					 | 
				
			||||||
.. raw:: html
 | 
					 | 
				
			||||||
 | 
					 | 
				
			||||||
   <p align="justify">
 | 
					 | 
				
			||||||
 | 
					
 | 
				
			||||||
Once generated, these trajectories are converted into step-by-step
 | 
					Once generated, these trajectories are converted into step-by-step
 | 
				
			||||||
actions using a trajectory tracking controller. The specific controller
 | 
					actions using a trajectory tracking controller. The specific controller
 | 
				
			||||||
@ -29,13 +22,6 @@ position, velocity, and PD-Controllers tailored for position, velocity,
 | 
				
			|||||||
and torque control. Additionally, we have a specialized controller
 | 
					and torque control. Additionally, we have a specialized controller
 | 
				
			||||||
designed for the MetaWorld control suite.
 | 
					designed for the MetaWorld control suite.
 | 
				
			||||||
 | 
					
 | 
				
			||||||
.. raw:: html
 | 
					 | 
				
			||||||
 | 
					 | 
				
			||||||
   </p>
 | 
					 | 
				
			||||||
 | 
					 | 
				
			||||||
.. raw:: html
 | 
					 | 
				
			||||||
 | 
					 | 
				
			||||||
   <p align="justify">
 | 
					 | 
				
			||||||
 | 
					
 | 
				
			||||||
While the overarching objective of MP environments remains the learning
 | 
					While the overarching objective of MP environments remains the learning
 | 
				
			||||||
of an optimal policy, the actions here represent the parametrization of
 | 
					of an optimal policy, the actions here represent the parametrization of
 | 
				
			||||||
@ -47,4 +33,4 @@ every unique context.
 | 
				
			|||||||
 | 
					
 | 
				
			||||||
.. raw:: html
 | 
					.. raw:: html
 | 
				
			||||||
 | 
					
 | 
				
			||||||
   </p>
 | 
					    </div>
 | 
				
			||||||
 | 
				
			|||||||
@ -1,10 +1,11 @@
 | 
				
			|||||||
Installation
 | 
					Installation
 | 
				
			||||||
------------
 | 
					------------
 | 
				
			||||||
 | 
					
 | 
				
			||||||
We recommend installing ``fancy_gym`` into a virtual environment as
 | 
					.. note::
 | 
				
			||||||
provided by `venv <https://docs.python.org/3/library/venv.html>`__. 3rd
 | 
					   We recommend installing ``fancy_gym`` into a virtual environment as
 | 
				
			||||||
party alternatives to venv like `Poetry <https://python-poetry.org/>`__
 | 
					   provided by `venv <https://docs.python.org/3/library/venv.html>`__. 3rd
 | 
				
			||||||
or `Conda <https://docs.conda.io/en/latest/>`__ can also be used.
 | 
					   party alternatives to venv like `Poetry <https://python-poetry.org/>`__
 | 
				
			||||||
 | 
					   or `Conda <https://docs.conda.io/en/latest/>`__ can also be used.
 | 
				
			||||||
 | 
					
 | 
				
			||||||
Installation from PyPI (recommended)
 | 
					Installation from PyPI (recommended)
 | 
				
			||||||
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
 | 
					~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
 | 
				
			||||||
 | 
				
			|||||||
@ -92,7 +92,7 @@ If you created a new task wrapper, feel free to open a PR, so we can
 | 
				
			|||||||
integrate it for others to use as well. Without the integration the task
 | 
					integrate it for others to use as well. Without the integration the task
 | 
				
			||||||
can still be used. A rough outline can be shown here, for more details
 | 
					can still be used. A rough outline can be shown here, for more details
 | 
				
			||||||
we recommend having a look at the
 | 
					we recommend having a look at the
 | 
				
			||||||
`examples <https://github.com/ALRhub/fancy_gym/tree/master/fancy_gym/examples/>`__.
 | 
					:ref:`multiple examples <example-mp>`.
 | 
				
			||||||
 | 
					
 | 
				
			||||||
If the step-based is already registered with gym, you can simply do the
 | 
					If the step-based is already registered with gym, you can simply do the
 | 
				
			||||||
following:
 | 
					following:
 | 
				
			||||||
 | 
				
			|||||||
		Loading…
	
		Reference in New Issue
	
	Block a user