Pruning of README 2nd try
This commit is contained in:
		
							parent
							
								
									c5ed794325
								
							
						
					
					
						commit
						302dbf6dde
					
				
							
								
								
									
										35
									
								
								README.md
									
									
									
									
									
										Normal file
									
								
							
							
						
						
									
										35
									
								
								README.md
									
									
									
									
									
										Normal file
									
								
							| @ -0,0 +1,35 @@ | ||||
| # Metastable Baselines | ||||
| 
 | ||||
| <p align='center'> | ||||
|   <img src='./icon.svg'> | ||||
| </p> | ||||
| 
 | ||||
| During training of a RL-Agent we follow the gradient of the loss, which leads us to a minimum. In cases where the found minimum is merely a local minimum, this can be seen as a *false vacuum* in our loss space. Exploration mechanisms try to let our training procedure escape these *stable states*: Making them *metastable*.  | ||||
| 
 | ||||
| In order to archive this, this Repo contains some extensions for [Stable Baselines 3 by DLR-RM](https://github.com/DLR-RM/stable-baselines3)   | ||||
| These extensions include: | ||||
| 
 | ||||
| - An implementation of ["Differentiable Trust Region Layers for Deep Reinforcement Learning" by Fabian Otto et al.](https://arxiv.org/abs/2101.09207) | ||||
| - Support for Contextual Covariances | ||||
| - Multiple parameterization strategies for the Covariance | ||||
| 
 | ||||
| The resulting algorithms can than be tested for their ability of exploration in the enviroments provided by [Project Columbus](https://git.dominik-roth.eu/dodox/Columbus)   | ||||
| 
 | ||||
| This Repo was created as part of my bachelor-thesis at ALR (KIT). | ||||
| 
 | ||||
| ## Installation | ||||
| #### (optional) Columbus for test.py and replay.py | ||||
| Install [Project Columbus](https://git.dominik-roth.eu/dodox/Columbus) by following the instructions in the repo.   | ||||
| 
 | ||||
| #### Install dependency: Metastable Projections | ||||
| Follow instructions for the [Public Version](https://git.dominik-roth.eu/dodox/metastable-projections-public) ([GitHub Mirror](https://github.com/D-o-d-o-x/metastable-projections-public)) / [Private Version](https://git.dominik-roth.eu/dodox/metastable-projections) ([GitHub Mirror](https://github.com/D-o-d-o-x/metastable-projections)). | ||||
| The private version also requires ALR's ITPAL as a dependency. Only the private version supports KL Projections. | ||||
| 
 | ||||
| #### Install as a package | ||||
| Then install this repo as a package: | ||||
| ``` | ||||
| pip install -e . | ||||
| ``` | ||||
| 
 | ||||
| ## License | ||||
| Since this Repo is an extension to [Stable Baselines 3 by DLR-RM](https://github.com/DLR-RM/stable-baselines3), it contains some of it's code. SB3 is licensed under the [MIT-License](https://github.com/DLR-RM/stable-baselines3/blob/master/LICENSE). | ||||
		Loading…
	
		Reference in New Issue
	
	Block a user