- nucon/rl.py: delta_action_scale action space, bool handling (>=0.5),
direct sim read/write bypassing HTTP for ~2000fps env throughput;
remove uncertainty_abort from training (use penalty-only), larger
default batch sizes; fix _read_obs and step for in-process sim
- nucon/model.py: optimise _lookup with einsum squared-L2, vectorised
rbf kernel; forward_with_uncertainty uses pre-built normalised arrays
- nucon/sim.py: _update_reactor_state writes outputs via setattr directly
- scripts/train_sac.py: moved from root; full SAC+HER example with kNN-GP
sim, delta actions, uncertainty penalty, init_states
- scripts/collect_dataset.py: CLI tool to collect dynamics dataset from
live game session (--steps, --delta, --out, --merge)
- README.md: add Scripts section, reference both scripts in training loop
Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>