From fc9dfa0660dbb617293dc50d90bf43af585078a4 Mon Sep 17 00:00:00 2001
From: cvoelcker
Date: Tue, 15 Jul 2025 18:21:07 -0700
Subject: [PATCH] Update README.md

Looked a bit weird :)
---
 README.md | 4 +++-
 1 file changed, 3 insertions(+), 1 deletion(-)

diff --git a/README.md b/README.md
index 3815aac..04218d8 100644
--- a/README.md
+++ b/README.md
@@ -1,4 +1,6 @@
-# Relative Entropy Pathwise Policy Optimization -- On-policy value-based reinforcement learning without endless hyperparameter tuning
+# Relative Entropy Pathwise Policy Optimization
+
+## On-policy value-based reinforcement learning without endless hyperparameter tuning
 
 This repository contains the official implementation for REPPO - Relative Entropy Pathwise Policy Optimization [arXiv paper link](https://arxiv.org/abs/2507.11019).
 