Update README.md
Looked a bit weird :)
This commit is contained in:
parent
9f1e794eb5
commit
fc9dfa0660
@ -1,4 +1,6 @@
|
|||||||
# Relative Entropy Pathwise Policy Optimization -- On-policy value-based reinforcement learning without endless hyperparameter tuning
|
# Relative Entropy Pathwise Policy Optimization
|
||||||
|
|
||||||
|
## On-policy value-based reinforcement learning without endless hyperparameter tuning
|
||||||
|
|
||||||
This repository contains the official implementation for REPPO - Relative Entropy Pathwise Policy Optimization [arXiv paper link](https://arxiv.org/abs/2507.11019).
|
This repository contains the official implementation for REPPO - Relative Entropy Pathwise Policy Optimization [arXiv paper link](https://arxiv.org/abs/2507.11019).
|
||||||
|
|
||||||
|
Loading…
Reference in New Issue
Block a user