README <br>
This commit is contained in:
parent
c59d8dd635
commit
44eb3335ff
@ -1,8 +1,9 @@
|
|||||||
<h1 align="center">
|
<h1 align="center">
|
||||||
|
<br>
|
||||||
<img src='./itpal_jax.svg' width="250px">
|
<img src='./itpal_jax.svg' width="250px">
|
||||||
<br>
|
<br><br>
|
||||||
<b>ITPAL JAX</b>
|
<b>ITPAL JAX</b>
|
||||||
<br>
|
<br><br>
|
||||||
</h1>
|
</h1>
|
||||||
|
|
||||||
JAX bindings and native implementations of differentiable trust region projections for Gaussian policies. The KL projection is handled by [ITPAL](https://github.com/ALRhub/ITPAL)'s C++ implementation, while Wasserstein and Frobenius projections are implemented in JAX. These projections provide exact solutions for trust region constraints, unlike approximate methods like PPO.
|
JAX bindings and native implementations of differentiable trust region projections for Gaussian policies. The KL projection is handled by [ITPAL](https://github.com/ALRhub/ITPAL)'s C++ implementation, while Wasserstein and Frobenius projections are implemented in JAX. These projections provide exact solutions for trust region constraints, unlike approximate methods like PPO.
|
||||||
|
Loading…
Reference in New Issue
Block a user