Environment Backends#
TorchWM ships environment adapters for pixel-based world-model training, model-based reinforcement learning, and benchmark collection. Each backend page explains the installation requirements, factory functions, observation and action conventions, configuration fields, and common troubleshooting steps for that backend.
Choosing a backend#
Backend |
Best for |
Primary APIs |
Typical observations |
Typical actions |
|---|---|---|---|---|
Dreamer-style continuous-control tasks with state and rendered image observations |
|
Dict with DMC state keys plus |
Continuous |
|
3D navigation and puzzle tasks from DeepMind Lab |
|
Dict with |
Normalized one-hot |
|
Classic control, Box2D, custom Gym environments, and generic rendered tasks |
|
Dict with |
Original continuous |
|
DeepMind BSuite |
Small diagnostic RL benchmark tasks such as |
|
Dict with synthetic |
One-hot vector for discrete actions |
JAX/Brax continuous-control tasks through a Gym-like image adapter |
|
Dict with synthesized |
Continuous |
|
Atari 2600 environments through Gymnasium/ALE |
|
ALE RGB/RAM observations |
Discrete Atari actions |
|
Procedurally generated benchmark games |
|
Dict with |
One-hot vector for discrete Procgen actions |
|
Gymnasium MuJoCo task ids and native MJCF/MJB models |
|
Image dict via |
Continuous |
|
All ids registered by the installed Gymnasium Robotics package, including moved legacy MuJoCo v2/v3 ids |
|
Image dict via |
Continuous |
|
External Unity executable simulations with continuous-control behaviors |
|
Dict with |
Continuous |
|
Multiprocess/vector rollout collection and native ALE vectorization |
|
Batched observations |
Batched actions |
|
Model-based RL, policy optimization, and evaluation inside learned dynamics |
|
Adapter-defined Gymnasium space |
Adapter-defined Gymnasium space |
|
Shared preprocessing, action conversion, time limits, reward observations, and image transforms |
|
Backend-dependent |
Backend-dependent |