Add experiment tracking plan

Document current status, planned experiments, and job tracking
Following REPPO experiment documentation pattern
This commit is contained in:
ys1087@partner.kit.edu 2025-08-27 12:06:34 +02:00
parent add21c7019
commit f88a5be4fe

97
EXPERIMENT_PLAN.md Normal file
View File

@ -0,0 +1,97 @@
# DPPO Experiment Plan
## Current Status
### Setup Complete ✅
- Installation successful on HoReKa with Python 3.10 venv
- SLURM scripts created for automated job submission
- All dependencies installed including PyTorch, d4rl, dm-control
### Initial Testing
🔄 **Job ID 3445081**: Development test (30min) - PENDING
- Command: `./submit_job.sh dev`
- Status: Waiting for resources on dev_accelerated partition
- Purpose: Verify DPPO can run on HoReKa with basic pre-training
## Experiments To Run
### 1. Reproduce Paper Results - Gym Tasks
**Pre-training Phase**:
- hopper-medium-v2
- walker2d-medium-v2
- halfcheetah-medium-v2
**Fine-tuning Phase**:
- hopper-v2
- walker2d-v2
- halfcheetah-v2
**Settings**: Paper hyperparameters, 3 seeds each
### 2. Additional Environments (Future)
**Robomimic Suite**:
- lift, can, square, transport
**D3IL Suite**:
- avoid_m1, avoid_m2, avoid_m3
**Furniture-Bench Suite**:
- one_leg, lamp, round_table (low/med difficulty)
## Running Experiments
### Quick Development Test
```bash
./submit_job.sh dev
```
### Gym Pre-training
```bash
./submit_job.sh gym hopper pretrain
./submit_job.sh gym walker2d pretrain
./submit_job.sh gym halfcheetah pretrain
```
### Gym Fine-tuning (after pre-training completes)
```bash
./submit_job.sh gym hopper finetune
./submit_job.sh gym walker2d finetune
./submit_job.sh gym halfcheetah finetune
```
### Manual SLURM Submission
```bash
# With environment variables
TASK=hopper MODE=pretrain sbatch slurm/run_dppo_gym.sh
```
## Job Tracking
| Job ID | Type | Task | Mode | Status | Duration | Results |
|--------|------|------|------|---------|----------|---------|
| 3445081 | dev test | hopper | pretrain | PENDING | 30min | - |
## Configuration Notes
### WandB Setup Required
```bash
export WANDB_API_KEY=<your_api_key>
export WANDB_ENTITY=<your_username>
```
### Resource Requirements
- **Dev jobs**: 30min, 24GB RAM, 8 CPUs, dev_accelerated
- **Production**: 8h, 32GB RAM, 40 CPUs, accelerated
## Issues Encountered
None so far - installation completed without code modifications.
## Next Steps
1. Wait for dev test to complete
2. Analyze dev test results
3. Begin systematic pre-training experiments
4. Document any issues or required fixes