Helix is a dual-system neural architecture that combines high-level vision-language understanding with precise visuomotor control for robotic applications.
Helix consists of two main systems:

- System 2 (S2): a high-level vision-language model based on PaliGemma2
  - Processes low-frequency inputs (7-9 Hz)
  - Integrates visual information, robot state, and text commands
  - Generates semantic latent representations
- System 1 (S1): a transformer-based visuomotor controller
  - Processes high-frequency inputs (200 Hz)
  - Generates continuous action sequences
  - Conditioned on latent vectors from System 2
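The two systems run at different rates, with S1 conditioned on the most recent S2 latent. The toy loop below illustrates that two-rate interface; the 512-d latent and 35-DoF action sizes follow this README, while the stand-in linear layers, dummy observation vector, and 25-step refresh interval are purely illustrative.

```python
import torch
import torch.nn as nn

# Trivial stand-ins for the two systems; the real S2 is a PaliGemma2-based VLM
# and the real S1 is the visuomotor transformer described under "System 1" below.
s2 = nn.Linear(2048, 512)       # (fused image + state + command features) -> 512-d latent
s1 = nn.Linear(2048 + 512, 35)  # (observation, latest latent) -> 35-DoF action

latent = torch.zeros(1, 512)
for step in range(200):                            # one second of control at 200 Hz
    obs = torch.randn(1, 2048)                     # placeholder fused observation
    if step % 25 == 0:                             # refresh the slow latent at ~8 Hz
        latent = s2(obs)
    action = s1(torch.cat([obs, latent], dim=-1))  # (1, 35) action for this tick
```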
To install:

```bash
git clone https://github.com/yourusername/helix.git
cd helix
pip install torch
```
To train the model:

```bash
python train.py
```
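The sketch below shows the rough shape of such a training loop, using values matching the hyperparameters listed below and toy tensors in place of the real model and dataset. The MSE regression objective and the placeholder policy are assumptions for illustration, not necessarily what train.py implements.

```python
import torch
import torch.nn as nn
from torch.utils.data import DataLoader, TensorDataset

# Toy data and a toy policy so the loop runs as-is; in the repo these come from the
# actual model and dataset classes, and the constants from config.py (see below).
BATCH_SIZE, LEARNING_RATE, NUM_EPOCHS = 16, 1e-4, 10
ACTION_DIM, SEQ_LEN = 35, 200

policy = nn.Linear(512, ACTION_DIM * SEQ_LEN)    # placeholder for the full S2+S1 stack
data = TensorDataset(torch.randn(64, 512),
                     torch.randn(64, ACTION_DIM * SEQ_LEN))
loader = DataLoader(data, batch_size=BATCH_SIZE, shuffle=True)
optimizer = torch.optim.Adam(policy.parameters(), lr=LEARNING_RATE)

for epoch in range(NUM_EPOCHS):
    for obs, target_actions in loader:
        loss = nn.functional.mse_loss(policy(obs), target_actions)  # assumed regression loss
        optimizer.zero_grad()
        loss.backward()
        optimizer.step()
    print(f"epoch {epoch}: loss {loss.item():.4f}")
```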
Key hyperparameters can be modified in config.py:
- Batch size: 16
- Learning rate: 1e-4
- Number of epochs: 10
- Action dimensions: 35 (DoF)
- Sequence length: 200
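For reference, those defaults might look like the following in config.py; the exact variable names are assumptions and should be checked against the file itself.

```python
# config.py (illustrative names; check the file for the actual identifiers)
BATCH_SIZE = 16
LEARNING_RATE = 1e-4
NUM_EPOCHS = 10
ACTION_DIM = 35     # degrees of freedom of the action space
SEQ_LENGTH = 200    # sequence length handled by the model
```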
The project uses the Open X-Embodiment dataset format. The current implementation includes a dummy dataset class that can be swapped out for a real Open X-Embodiment loader.
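A placeholder along these lines keeps the training code runnable until a real loader is wired in. The field names, image resolution, and token shapes are assumptions chosen to match the dimensions above, not a fixed schema.

```python
import torch
from torch.utils.data import Dataset

class DummyHelixDataset(Dataset):
    """Placeholder returning random tensors shaped like one training sample.
    Swap this for a real Open X-Embodiment-backed dataset with the same fields."""

    def __init__(self, num_samples=1000, action_dim=35, seq_len=200):
        self.num_samples = num_samples
        self.action_dim = action_dim
        self.seq_len = seq_len

    def __len__(self):
        return self.num_samples

    def __getitem__(self, idx):
        return {
            "image": torch.randn(3, 224, 224),                      # RGB camera frame
            "state": torch.randn(35),                               # proprioceptive robot state
            "command_tokens": torch.randint(0, 32000, (32,)),       # tokenized text command
            "actions": torch.randn(self.seq_len, self.action_dim),  # target action sequence
        }
```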
- System 2 (S2)
  - Vision-language model for high-level understanding
  - Processes images, state vectors, and text commands
  - Outputs latent vectors of dimension 512
- System 1 (S1)
  - Visuomotor transformer with 4 layers and 4 attention heads
  - Processes high-frequency visual and state inputs
  - Generates action sequences conditioned on S2 latents
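Below is a minimal sketch of an S1 module with that configuration (4 layers, 4 attention heads, conditioned on 512-d S2 latents). The token layout, projection sizes, and learned action queries are design assumptions for illustration, not the repository's exact implementation.

```python
import torch
import torch.nn as nn

class S1VisuomotorTransformer(nn.Module):
    """Sketch of S1: a 4-layer, 4-head transformer that fuses high-frequency
    visual/state features with the 512-d S2 latent and regresses an action sequence."""

    def __init__(self, vis_dim=512, state_dim=35, latent_dim=512,
                 d_model=256, action_dim=35, seq_len=200):
        super().__init__()
        self.vis_proj = nn.Linear(vis_dim, d_model)
        self.state_proj = nn.Linear(state_dim, d_model)
        self.latent_proj = nn.Linear(latent_dim, d_model)
        layer = nn.TransformerEncoderLayer(d_model=d_model, nhead=4, batch_first=True)
        self.encoder = nn.TransformerEncoder(layer, num_layers=4)
        self.action_head = nn.Linear(d_model, action_dim)
        self.seq_len = seq_len
        self.queries = nn.Parameter(torch.randn(seq_len, d_model))  # learned action queries

    def forward(self, vis_feats, state, s2_latent):
        b = vis_feats.shape[0]
        # Conditioning tokens: projected visual, state, and S2-latent features.
        cond = torch.stack([self.vis_proj(vis_feats),
                            self.state_proj(state),
                            self.latent_proj(s2_latent)], dim=1)     # (B, 3, d_model)
        tokens = torch.cat([self.queries.expand(b, -1, -1), cond], dim=1)
        out = self.encoder(tokens)[:, :self.seq_len]                 # keep the query positions
        return self.action_head(out)                                 # (B, seq_len, action_dim)

model = S1VisuomotorTransformer()
actions = model(torch.randn(2, 512), torch.randn(2, 35), torch.randn(2, 512))
print(actions.shape)  # torch.Size([2, 200, 35])
```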
This project is licensed under the MIT License - see the LICENSE file for details.
This implementation is inspired by Figure AI's Helix architecture.