Unified Interface for World Models in Reinforcement Learning
One API. Multiple Architectures. Clear Contracts.
Alpha (v0.1.1) — Under active development. API may change between minor versions.
WorldFlux provides a unified Python interface for world models used in reinforcement learning.
World models let RL agents imagine before acting by predicting future states, rewards, and outcomes without touching the real environment. Upstream literature reports strong sample-efficiency gains for world-model methods in many settings (Hafner et al., 2023; Hansen et al., 2024).
The problem: every research team reimplements the same core components from scratch. DreamerV3, TD-MPC2, JEPA — different codebases, different APIs, incompatible training loops. Want to swap an encoder while keeping DreamerV3's dynamics? Rewrite everything.
WorldFlux solves this with a unified interface:
```python
# One API for any world model architecture
model = create_world_model("dreamerv3:size12m")
state = model.encode(obs)
trajectory = model.rollout(state, actions)  # imagine 15 steps ahead
```

- Swap components independently with the 5-layer pluggable architecture
- Reference-family implementations with proof-mode parity workflows against upstream baselines; public proof claims require published evidence bundles
- Training infrastructure with replay buffers, checkpointing, and callbacks
- One API — `encode()`, `transition()`, `decode()`, `rollout()` — works across all model families
- Unified API: common interface across model families
- v3-first API: `create_world_model()` defaults to `api_version="v3"` (strict contracts enabled)
- Universal Payload Layer: `ActionPayload`/`ConditionPayload` for polymorphic conditioning
- Planner Contract: planners return `ActionPayload` with `extras["wf.planner.horizon"]`
- Simple Usage: one-liner model creation with `create_world_model()`
- Pluggable 5-layer core: optional `component_overrides` for encoder/dynamics/conditioner/decoder/rollout
- Training Infrastructure: complete training loop with callbacks, checkpointing, and logging
- Type Safe: full type annotations and mypy compatibility
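As a mental model for the four-method contract above, here is a toy stand-in whose "latent state" is a single float. Everything in this sketch (the class, its dynamics, and the reward rule) is hypothetical and illustrative; it is not WorldFlux internals:

```python
# Toy illustration of an encode/transition/rollout contract.
# All names and dynamics here are made up for illustration.
from dataclasses import dataclass


@dataclass
class Trajectory:
    states: list[float]
    rewards: list[float]


class ToyWorldModel:
    def encode(self, obs: float) -> float:
        # The "latent state" is just the observation itself.
        return obs

    def transition(self, state: float, action: float) -> float:
        # Deterministic toy dynamics: drift toward the action.
        return 0.9 * state + 0.1 * action

    def rollout(self, state: float, actions: list[float]) -> Trajectory:
        # Imagine forward without touching any real environment.
        states, rewards = [], []
        for a in actions:
            state = self.transition(state, a)
            states.append(state)
            rewards.append(-abs(state))  # toy reward: stay near zero
        return Trajectory(states, rewards)


model = ToyWorldModel()
traj = model.rollout(model.encode(1.0), [0.0] * 5)
print(len(traj.states))  # 5 imagined steps
```

The point is only the shape of the interface: a planner can score candidate action sequences by the imagined rewards a rollout returns, regardless of which model family produced them.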
Install uv first if you do not have it yet: uv installation guide.

```bash
uv tool install worldflux
worldflux init my-world-model
```

Optional: enable the InquirerPy-powered prompt UI.

```bash
uv tool install --with inquirerpy worldflux
```

`worldflux init` now performs cross-platform pre-init dependency assurance. It provisions a user-scoped bootstrap virtual environment and installs the selected environment dependencies before scaffolding:

- Linux/macOS default: `~/.worldflux/bootstrap/py<major><minor>`
- Windows default: `%LOCALAPPDATA%/WorldFlux/bootstrap/py<major><minor>`

Environment variables:

- `WORLDFLUX_BOOTSTRAP_HOME`: override the bootstrap root path
- `WORLDFLUX_INIT_ENSURE_DEPS=0`: disable auto-bootstrap (emergency bypass)
```bash
git clone https://github.com/worldflux/WorldFlux.git
cd worldflux
uv sync
source .venv/bin/activate
worldflux init my-world-model

# With training dependencies
uv sync --extra training

# With all optional dependencies
uv sync --extra all

# For development
uv sync --extra dev
```

```bash
uv pip install worldflux
worldflux init my-world-model
```

```bash
worldflux doctor
```

```bash
cd website
npm ci
npm run build

# Optional: local docs dev server
npm start
```

```bash
uv sync --extra dev
uv run python examples/quickstart_cpu_success.py --quick
```

This official smoke path uses a random replay buffer and a CI-sized model to validate installation and core contracts on CPU. It is not a benchmark or a real-environment reproduction path.
```python
from worldflux import create_world_model

model = create_world_model("dreamerv3:size12m")
```

```python
from worldflux import ActionPayload, ConditionPayload

state = model.encode(obs)
next_state = model.transition(
    state,
    ActionPayload(kind="continuous", tensor=action),
    conditions=ConditionPayload(goal=goal_tensor),
)
```

```python
from worldflux import create_world_model

model = create_world_model(
    "tdmpc2:ci",
    obs_shape=(4,),
    action_dim=2,
    component_overrides={
        # values can be registered component ids, classes, or instances
        "action_conditioner": "my_plugin.zero_action_conditioner",
    },
)
```

External packages can register plugins through entry-point groups: `worldflux.models` and `worldflux.components`.
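Entry-point registration could look like the following `pyproject.toml` fragment. The two group names come from this README; the package name, module paths, and object names (`my_plugin`, `MyWorldModel`, `ZeroActionConditioner`) are hypothetical placeholders:

```toml
# Hypothetical plugin package exposing a model and a component to WorldFlux.
[project.entry-points."worldflux.models"]
my_model = "my_plugin.models:MyWorldModel"

[project.entry-points."worldflux.components"]
zero_action_conditioner = "my_plugin.conditioners:ZeroActionConditioner"
```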
```python
import torch

obs = torch.randn(1, 3, 64, 64)
state = model.encode(obs)

actions = torch.randn(15, 1, 6)  # [horizon, batch, action_dim]
trajectory = model.rollout(state, actions)

print(f"Predicted rewards: {trajectory.rewards.shape}")
print(f"Continue probs: {trajectory.continues.shape}")
```

```python
from worldflux import create_world_model
from worldflux.training import train, ReplayBuffer

model = create_world_model("dreamerv3:size12m", obs_shape=(3, 64, 64), action_dim=6)
buffer = ReplayBuffer.load("trajectories.npz")

trained_model = train(model, buffer, total_steps=50_000)
trained_model.save_pretrained("./my_model")
```

| Family | Presets | Status |
|---|---|---|
| DreamerV3 | size12m, size25m, size50m, size100m, size200m | Reference-family |
| TD-MPC2 | 5m, 19m, 48m, 317m | Reference-family |
| JEPA | base | Experimental |
| V-JEPA2 | ci, tiny, base | Experimental |
| Token | base | Experimental |
| Diffusion | base | Experimental |
Reference-family models map to maintained upstream families and internal proof-mode parity workflows. Public proof claims require published evidence bundles; local fixtures and internal runs are not enough on their own. Experimental models implement the full API but do not carry the same parity workflow coverage and may return `None` for some predictions (e.g. rewards).
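Because experimental families may legitimately return `None` for some predictions, downstream code should guard before consuming them. A minimal sketch; the `Trajectory` class below is a simplified stand-in for illustration, not the exact WorldFlux type:

```python
# Guarding against optional predictions from experimental model families.
# `Trajectory` here is a simplified stand-in, not the WorldFlux type.
from dataclasses import dataclass
from typing import Optional


@dataclass
class Trajectory:
    states: list[float]
    rewards: Optional[list[float]] = None  # experimental models may omit this


def total_imagined_reward(traj: Trajectory, default: float = 0.0) -> float:
    """Sum predicted rewards, falling back when the model returned None."""
    if traj.rewards is None:
        return default
    return sum(traj.rewards)


print(total_imagined_reward(Trajectory(states=[0.1, 0.2], rewards=[1.0, 2.0])))  # 3.0
print(total_imagined_reward(Trajectory(states=[0.1, 0.2])))  # 0.0 fallback
```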
This table lists commonly used presets. For the full catalog (including CI, experimental, and skeleton families), run:

```bash
worldflux models list --verbose
```

All world models implement the `WorldModel` base class:
```python
state = model.encode(obs)
next_state = model.transition(state, action)
next_state = model.update(state, action, obs)

output = model.decode(state)
preds = output.preds  # e.g. {"obs", "reward", "continue"}

trajectory = model.rollout(initial_state, actions)
loss_out = model.loss(batch)  # LossOutput (loss_out.loss, loss_out.components)
```

```python
from worldflux.training import (
    Trainer,
    TrainingConfig,
    ReplayBuffer,
    train,
)
from worldflux.training.callbacks import (
    LoggingCallback,
    CheckpointCallback,
    EarlyStoppingCallback,
    ProgressCallback,
)
```

See the examples/ directory:
- `quickstart_cpu_success.py` - Official CPU-first smoke path using a random replay buffer
- `compare_unified_training.py` - Shared-contract smoke comparison for DreamerV3 and TD-MPC2
- `worldflux_quickstart.ipynb` - Interactive Colab notebook
- `train_dreamer.py` - Training example
- `train_tdmpc2.py` - Training example
- `visualize_imagination.py` - Imagination rollout visualization
```bash
uv run python examples/quickstart_cpu_success.py --quick
uv run python examples/compare_unified_training.py --quick
uv run python examples/train_dreamer.py --test
uv run python examples/train_dreamer.py --data trajectories.npz --steps 100000
```

- Full Documentation - Guides and API reference
- API Reference - Contract and symbol-level docs
- Reference - Operational and quality docs
Join our Discord to discuss world models, get help, and connect with other researchers and developers.
- Support channels and response paths: SUPPORT.md
- Community expectations and reporting: CODE_OF_CONDUCT.md
See SECURITY.md for security considerations, especially regarding loading model checkpoints from untrusted sources.
Apache License 2.0 - see LICENSE and NOTICE for details.
Contributions are welcome. Please read our Contributing Guide before submitting pull requests.
If you use this library in your research, please cite:
```bibtex
@software{worldflux,
  title = {WorldFlux: Unified Interface for World Models},
  year  = {2026},
  url   = {https://github.com/worldflux/WorldFlux}
}
```