This repository aims to implement SOTA efficient token/channel mixers. Contributions of any technology beyond the vanilla Transformer are welcome. If you are interested in this repository, please join our Discord.
Install Torch (>=2.6.0), fla, and xopes first, then install xmixers:

```shell
git clone https://github.com/Doraemonzzz/xmixers.git
cd xmixers
pip install -e .
```
| Paper | Code | Config |
|---|---|---|
| Elucidating the Design Space of Decay in Linear Attention | Link (core code: lines 247–262) | Core method link, Ablation link |
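As a rough illustration of the kind of mixer this repository targets, the sketch below shows a minimal causal linear attention with a scalar decay factor. This is a generic recurrence for pedagogy only, not the paper's exact formulation or the code referenced in the table; the function name and the use of a single scalar `lam` are assumptions.

```python
import numpy as np

def linear_attention_decay(q, k, v, lam):
    """Causal linear attention with a scalar decay `lam`.

    Recurrence over time steps t:
        S_t = lam * S_{t-1} + k_t v_t^T   (decayed state update)
        o_t = q_t @ S_t                   (readout)
    q, k: (T, d) arrays; v: (T, d_v) array; lam in [0, 1].
    """
    T, d = q.shape
    d_v = v.shape[1]
    S = np.zeros((d, d_v))       # running key-value state
    out = np.zeros((T, d_v))
    for t in range(T):
        S = lam * S + np.outer(k[t], v[t])
        out[t] = q[t] @ S
    return out

rng = np.random.default_rng(0)
T, d = 5, 4
q, k, v = (rng.standard_normal((T, d)) for _ in range(3))
o = linear_attention_decay(q, k, v, lam=0.9)
print(o.shape)
```

With `lam=1.0` this reduces to plain (un-decayed) linear attention; smaller `lam` exponentially down-weights older tokens. Papers in this space typically replace the scalar with learned, per-head or data-dependent decay.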
To reproduce the results of the paper, we trained with the flame framework: set up the environment following flame's requirements, then run flame's training script, replacing "config" with the corresponding config name.