Commit Graph

67 Commits

Author SHA1 Message Date
e938018494 trl spec for .reset 2024-11-07 11:41:01 +01:00
4f8fc500b7 Simplify operations on spaces (is_discrete, shape) 2024-11-07 11:40:32 +01:00
5c44448e53 Use trl space definitions (not gym) 2024-11-07 11:39:45 +01:00
8a078fb59e Fix: Issue with env wrapping (ensure batch dim) 2024-11-07 11:39:09 +01:00
52b3f3b71e Updated README 2024-11-07 11:38:34 +01:00
df1ba6fe53 Updated README 2024-10-21 15:25:01 +02:00
8eb9b384c7 Pytest as optional dep 2024-10-21 15:24:45 +02:00
abc8dcbda1 Expand Tests 2024-10-21 15:24:36 +02:00
e927afcc30 Extend policy impl 2024-10-21 15:24:20 +02:00
ca1ee980ef Fix: Bug in loss calc for TRPLLoss 2024-10-21 15:23:57 +02:00
0c6e58634f Rework algo impls 2024-10-21 15:23:39 +02:00
651ef1522f Fixing issues with projections 2024-10-21 15:23:17 +02:00
71cb8593d9 Fix: Tried to reference gym space classes even if no gym avaible 2024-08-30 08:05:41 +02:00
906240e145 Fix README typo 2024-08-28 12:15:51 +02:00
af444d85e7 Updated README 2024-08-28 12:15:20 +02:00
e6d78083aa Fix trpl test using wrong hps 2024-08-28 11:58:58 +02:00
54bab221ef Disable vlearn for now... 2024-08-28 11:55:43 +02:00
1a02568f3c Rename frob_projection file 2024-08-28 11:55:30 +02:00
0464fbabe8 Disable vlearn test for now 2024-08-28 11:55:04 +02:00
f7d171399f Updated pyproject.toml 2024-08-28 11:46:31 +02:00
ebff2551cd projections_legacy no longer requierd 2024-08-28 11:34:26 +02:00
9c55b6a110 remove old vleasrn objective impl 2024-08-28 11:33:30 +02:00
4f58ce0ff2 refactor algo impls 2024-08-28 11:33:20 +02:00
dd98af9f77 Refactor objective losses 2024-08-28 11:32:51 +02:00
25988bab54 Added tmp (untested) vlearn objective impl 2024-08-28 11:32:22 +02:00
cb48badcff Adapt init 2024-08-28 11:32:07 +02:00
5fc4b30ea8 New projection impls 2024-08-28 11:31:42 +02:00
d29417187f Implemenetd new test suite 2024-08-28 11:30:37 +02:00
416c2036a5 vlearn loss draft 2024-07-17 14:53:11 +02:00
e106d8701f Vlearn draft 2024-07-17 14:52:44 +02:00
5f279beccf New proj implementations 2024-07-17 14:51:59 +02:00
4240f611ac Updated TODO 2024-06-02 16:37:15 +02:00
8d5d44e992 Refactor out some func into general Algo class 2024-06-02 16:36:59 +02:00
5f186af9fb Fix class name for TRPL 2024-06-02 16:25:18 +02:00
6cb320f432 TODO and WIP warning for README 2024-06-02 16:18:11 +02:00
7861821d0d Worked on TRPL module 2024-06-02 14:14:12 +02:00
65c6a950aa Use loggers correclty 2024-06-02 14:13:36 +02:00
4091df45f5 Implement rudimentary TerminalLogger 2024-06-02 14:12:41 +02:00
c7f5fcbf0f I hate debugging tensordict weirdness 2024-06-02 13:56:54 +02:00
a867a74138 Reorder... 2024-06-02 12:09:26 +02:00
d51bf948d4 Make sure we work without original gym installed 2024-06-02 12:09:13 +02:00
b4f89c9b7a Added TODO fro module docs 2024-06-02 12:00:19 +02:00
dd6c6b6165 Why was that h3? 2024-06-02 11:59:26 +02:00
78d79cf705 Initial code fro projections 2024-06-02 11:57:19 +02:00
add8e92b4a Add norms 2024-06-02 11:52:20 +02:00
c6a12aa27b Started TRPL impl 2024-06-02 11:49:09 +02:00
3931f5e31b Started implementation of TRPL loss objective module 2024-06-02 11:48:51 +02:00
b34224f189 refactor dir structure 2024-06-02 11:13:04 +02:00
8a013a4746 Expanded .gitignore 2024-06-02 11:11:35 +02:00
0a95ea652a Support for torchrl envs 2024-06-02 11:11:18 +02:00