| 
						
					 | 
					
						
						
						
						
							
						
						
							fa167b3e5f
							
						
					 | 
					
						
						
							
							Hybrid Diag -> Full Implemented; Made spherical_chol more efficient
						
						
						
						
						
					 | 
					
						2022-07-16 13:05:35 +02:00 | 
					
					
						
						
							
							
							
						
					 | 
				
			
				
					| 
						
					 | 
					
						
						
						
						
							
						
						
							f184b88f19
							
						
					 | 
					
						
						
							
							Allow std logging for full and diagonal cov policies
						
						
						
						
						
					 | 
					
						2022-07-15 18:46:42 +02:00 | 
					
					
						
						
							
							
							
						
					 | 
				
			
				
					| 
						
					 | 
					
						
						
						
						
							
						
						
							74697e8773
							
						
					 | 
					
						
						
							
							Smol bug fixes
						
						
						
						
						
					 | 
					
						2022-07-15 18:46:17 +02:00 | 
					
					
						
						
							
							
							
						
					 | 
				
			
				
					| 
						
					 | 
					
						
						
						
						
							
						
						
							2e0f46b0f3
							
						
					 | 
					
						
						
							
							Fixing ser/deser bug (cloudpickle cant handle some enums)
						
						
						
						
						
					 | 
					
						2022-07-15 18:45:38 +02:00 | 
					
					
						
						
							
							
							
						
					 | 
				
			
				
					| 
						
					 | 
					
						
						
						
						
							
						
						
							a86d19053d
							
						
					 | 
					
						
						
							
							Smashing bugs (dimension mismatch between Normal and
						
						
						
						
						
						
						
						Independent/MultivariateNormal) 
						
					 | 
					
						2022-07-15 15:46:31 +02:00 | 
					
					
						
						
							
							
							
						
					 | 
				
			
				
					| 
						
					 | 
					
						
						
						
						
							
						
						
							ab557a8856
							
						
					 | 
					
						
						
							
							Making MultivariateNormal Policies work (and porting Normal to
						
						
						
						
						
						
						
						Independent) 
						
					 | 
					
						2022-07-15 15:03:51 +02:00 | 
					
					
						
						
							
							
							
						
					 | 
				
			
				
					| 
						
					 | 
					
						
						
						
						
							
						
						
							b1ed9fc2b8
							
						
					 | 
					
						
						
							
							Renamed TRL_PG to PPO
						
						
						
						
						
					 | 
					
						2022-07-13 19:51:33 +02:00 | 
					
					
						
						
							
							
							
						
					 | 
				
			
				
					| 
						
					 | 
					
						
						
						
						
							
						
						
							3304fd49f6
							
						
					 | 
					
						
						
							
							Working on UniversalGaussianDistribution
						
						
						
						
						
					 | 
					
						2022-07-13 19:38:57 +02:00 | 
					
					
						
						
							
							
							
						
					 | 
				
			
				
					| 
						
					 | 
					
						
						
						
						
							
						
						
							fae19509bc
							
						
					 | 
					
						
						
							
							Implemented Policies with Contextual Covariance
						
						
						
						
						
					 | 
					
						2022-07-13 19:38:20 +02:00 | 
					
					
						
						
							
							
							
						
					 | 
				
			
				
					| 
						
					 | 
					
						
						
						
						
							
						
						
							41e4170b2f
							
						
					 | 
					
						
						
							
							Fixes + spherical_chol
						
						
						
						
						
					 | 
					
						2022-07-11 17:28:08 +02:00 | 
					
					
						
						
							
							
							
						
					 | 
				
			
				
					| 
						
					 | 
					
						
						
						
						
							
						
						
							e4440428f8
							
						
					 | 
					
						
						
							
							Working on SDC
						
						
						
						
						
					 | 
					
						2022-07-11 11:55:23 +02:00 | 
					
					
						
						
							
							
							
						
					 | 
				
			
				
					| 
						
					 | 
					
						
						
						
						
							
						
						
							4c4b12ee0e
							
						
					 | 
					
						
						
							
							Allow cloning UniversalGaussianDistribution (new_dist_like)
						
						
						
						
						
					 | 
					
						2022-07-09 14:46:11 +02:00 | 
					
					
						
						
							
							
							
						
					 | 
				
			
				
					| 
						
					 | 
					
						
						
						
						
							
						
						
							c08ea1cb91
							
						
					 | 
					
						
						
							
							Making UniversalGaussianDistribution ready for tanh-squashing-support
						
						
						
						
						
					 | 
					
						2022-07-09 14:33:07 +02:00 | 
					
					
						
						
							
							
							
						
					 | 
				
			
				
					| 
						
					 | 
					
						
						
						
						
							
						
						
							249754ee89
							
						
					 | 
					
						
						
							
							Wrote a little helper-function to generate all allowed combinations of
						
						
						
						
						
						
						
						cov-parameterizations 
						
					 | 
					
						2022-07-09 14:03:56 +02:00 | 
					
					
						
						
							
							
							
						
					 | 
				
			
				
					| 
						
					 | 
					
						
						
						
						
							
						
						
							e09950b30c
							
						
					 | 
					
						
						
							
							Work on Contextual Covariances
						
						
						
						
						
					 | 
					
						2022-07-09 12:26:39 +02:00 | 
					
					
						
						
							
							
							
						
					 | 
				
			
				
					| 
						
					 | 
					
						
						
						
						
							
						
						
							0911a04e98
							
						
					 | 
					
						
						
							
							Factored out Gaussian Collection for RolloutBuffer
						
						
						
						
						
					 | 
					
						2022-07-04 21:21:16 +02:00 | 
					
					
						
						
							
							
							
						
					 | 
				
			
				
					| 
						
					 | 
					
						
						
						
						
							
						
						
							1f3eac5398
							
						
					 | 
					
						
						
							
							Cleanup
						
						
						
						
						
					 | 
					
						2022-07-02 16:42:14 +02:00 | 
					
					
						
						
							
							
							
						
					 | 
				
			
				
					| 
						
					 | 
					
						
						
						
						
							
						
						
							91f64c10d7
							
						
					 | 
					
						
						
							
							Fixed bug with initialization of buffer
						
						
						
						
						
					 | 
					
						2022-07-01 20:02:09 +02:00 | 
					
					
						
						
							
							
							
						
					 | 
				
			
				
					| 
						
					 | 
					
						
						
						
						
							
						
						
							14100cccc8
							
						
					 | 
					
						
						
							
							Working on SDC-impl.
						
						
						
						
						
					 | 
					
						2022-07-01 15:14:41 +02:00 | 
					
					
						
						
							
							
							
						
					 | 
				
			
				
					| 
						
					 | 
					
						
						
						
						
							
						
						
							84d1cda96c
							
						
					 | 
					
						
						
							
							Trying to get kl to work
						
						
						
						
						
					 | 
					
						2022-07-01 13:45:58 +02:00 | 
					
					
						
						
							
							
							
						
					 | 
				
			
				
					| 
						
					 | 
					
						
						
						
						
							
						
						
							2626ec82a6
							
						
					 | 
					
						
						
							
							Added code-src-note
						
						
						
						
						
					 | 
					
						2022-07-01 11:52:50 +02:00 | 
					
					
						
						
							
							
							
						
					 | 
				
			
				
					| 
						
					 | 
					
						
						
						
						
							
						
						
							ab1b269af9
							
						
					 | 
					
						
						
							
							Allow checking whether a dist is contextual
						
						
						
						
						
					 | 
					
						2022-07-01 11:52:14 +02:00 | 
					
					
						
						
							
							
							
						
					 | 
				
			
				
					| 
						
					 | 
					
						
						
						
						
							
						
						
							a8b9c63965
							
						
					 | 
					
						
						
							
							Making dez covariances contextual
						
						
						
						
						
					 | 
					
						2022-07-01 11:29:12 +02:00 | 
					
					
						
						
							
							
							
						
					 | 
				
			
				
					| 
						
					 | 
					
						
						
						
						
							
						
						
							2e378d0a7d
							
						
					 | 
					
						
						
							
							Rebranding to Metastable Baselines
						
						
						
						
						
					 | 
					
						2022-06-30 20:40:30 +02:00 | 
					
					
						
						
							
							
							
						
					 |