Add HoReKa cluster documentation to README

- Document installation process using Python 3.10 venv - Add usage examples for SLURM job submission - Document available environments and resource allocations - Add WandB configuration instructions - List all repository changes made for HoReKa compatibility
2025-08-27 11:59:32 +02:00 · 2025-08-27 11:59:32 +02:00 · 30f59aaa9b
commit 30f59aaa9b
parent 05dddfa10c
1 changed files with 98 additions and 0 deletions
--- a/README.md
+++ b/README.md
@ -44,6 +44,104 @@ pip install -e .[all] # except for Kitchen
 source script/set_path.sh
 ```

+## HoReKa Cluster Setup
+
+### Installation on HoReKa
+
+The DPPO repository has been adapted to run on the HoReKa cluster. The original repository recommends conda, but we use vanilla Python with venv for consistency with cluster policies.
+
+1. **Clone the repository and navigate to it:**
+   ```bash
+   git clone git@dominik-roth.eu:dodox/dppo.git
+   cd dppo
+   ```
+
+2. **Create virtual environment with Python 3.10:**
+   ```bash
+   python3.10 -m venv .venv
+   source .venv/bin/activate
+   ```
+
+3. **Install the package and dependencies:**
+   ```bash
+   # Use the provided installation script
+   sbatch install_dppo.sh
+   
+   # Or install manually:
+   pip install --upgrade pip
+   pip install -e .
+   pip install -e .[gym]  # For Gym environments
+   ```
+
+### Running on HoReKa
+
+The repository includes pre-configured SLURM scripts for job submission:
+
+#### Quick Start
+```bash
+# Run a development test (30 minutes, 24GB RAM)
+./submit_job.sh dev
+
+# Run Gym pre-training
+./submit_job.sh gym hopper pretrain
+
+# Run Gym fine-tuning
+./submit_job.sh gym walker2d finetune
+```
+
+#### Manual Job Submission
+```bash
+# Submit development test
+sbatch slurm/run_dppo_dev.sh
+
+# Submit Gym experiments with parameters
+TASK=hopper MODE=pretrain sbatch slurm/run_dppo_gym.sh
+```
+
+#### Supported Tasks
+
+**Gym environments:**
+- `hopper`, `walker2d`, `halfcheetah`
+
+**Modes:**
+- `pretrain` - Pre-train diffusion policy
+- `finetune` - Fine-tune with PPO
+
+#### Resource Allocations
+- **Development**: 30 minutes, 24GB RAM, 8 CPUs, dev_accelerated partition
+- **Production**: 8 hours, 32GB RAM, 40 CPUs, accelerated partition
+
+#### Monitoring Jobs
+```bash
+# Check job status
+squeue -u $USER
+
+# View logs
+tail -f logs/dppo_<job_id>.out
+```
+
+### Configuration
+
+Before running experiments, set your WandB credentials:
+
+```bash
+export WANDB_API_KEY=<your_api_key>
+export WANDB_ENTITY=<your_username_or_team>
+```
+
+Or disable WandB by adding `wandb=null` to your python command.
+
+### Repository Changes
+
+This fork includes the following additions for HoReKa compatibility:
+- `install_dppo.sh` - Automated installation script for SLURM
+- `submit_job.sh` - Convenient job submission wrapper
+- `slurm/` directory with job scripts for different experiment types
+- Updated `.gitignore` to allow shell scripts (removed `*.sh` exclusion)
+- Git remotes configured: `upstream` (original repository) and `origin` (this fork)
+
+Note: The installation was successful without any code modifications. All dependencies installed correctly with Python 3.10.
+
 ## Usage - Pre-training

 **Note**: You may skip pre-training if you would like to use the default checkpoint (available for download) for fine-tuning.