Update README.md

Looked a bit weird :)
cvoelcker 2025-07-15 18:21:07 -07:00 committed by GitHub
parent 9f1e794eb5
commit fc9dfa0660
GPG Key ID: B5690EEEBB952194

@@ -1,4 +1,6 @@
-# Relative Entropy Pathwise Policy Optimization -- On-policy value-based reinforcement learning without endless hyperparameter tuning
+# Relative Entropy Pathwise Policy Optimization
+## On-policy value-based reinforcement learning without endless hyperparameter tuning
 This repository contains the official implementation for REPPO - Relative Entropy Pathwise Policy Optimization [arXiv paper link](https://arxiv.org/abs/2507.11019).