Updated 2025-01-18 21:42:44 +01:00
JAX bindings and native implementations of differentiable trust region projections for Gaussian policies.
Updated 2025-01-07 18:24:49 +01:00
Fancy RL provides minimalistic and efficient implementations of PPO and TRPL for torchrl.
Updated 2024-11-07 11:41:18 +01:00
Python library to interface with and control Nucleares, a nuclear reactor simulation game. Includes gymnasium bindings for Reinforcement Learning and Model Learning.
Updated 2024-10-11 08:58:25 +02:00
Updated 2024-10-08 08:12:07 +02:00
Updated 2024-10-08 00:05:04 +02:00
Updated 2024-10-07 23:10:14 +02:00
Abusing the CloudFlare Infrastructure as a Proxy
Updated 2024-10-07 22:03:30 +02:00
Projections for Metastable Baselines
Updated 2024-10-07 21:05:26 +02:00
Updated 2024-10-07 19:40:58 +02:00
Updated 2024-10-07 19:39:07 +02:00
Updated 2024-10-07 19:26:10 +02:00
Updated 2024-10-07 19:20:08 +02:00
Updated 2024-10-07 19:09:44 +02:00
Updated 2024-10-07 18:44:53 +02:00
Updated 2024-10-07 18:22:40 +02:00
Projections for Metastable Baselines - Public Version
Updated 2024-10-07 17:49:57 +02:00
Updated 2024-10-07 17:49:27 +02:00
An async CLI-Interface for Python.
Updated 2024-10-07 17:48:54 +02:00
Updated 2024-10-07 17:46:30 +02:00