Parallel forward with state by mathieu-charbonnel · Pull Request #830 · state-spaces/mamba

mathieu-charbonnel · 2025-12-29T15:30:10Z

This PR answers the issue #536.
My answer here #536 as well as the OP describe the motivation behind this change.

In this PR we propose a step_chunk function that applies the parallel scan approach to the inference step on a chunk of tokens. Just like the step function this function allows taking last_inputs and hidden_state as arguments. The usage of pscan brings the number of steps in sequence length from O[L] to O(log(L)).

step_chunk combines:

the last_input handling (similarly to step code) for convolution continuity
Update of the first state,
The ssm update can be written Ht = Δt * Bt * xt + exp(A × Δt) * Ht-1
If we denote X[t] = Δt * Bt * xt, A[t] = exp(A × Δt)
Then the first state should be H[0] = X[0] + A[0] * H[-1] where H[-1] is the last state
This is done inside the cuda kernel as Δt * Bt and exp(A × Δt) -which are done in the kernel- need to be computed first
Then parallel scan is applied exactly the way it is done in forward

In terms of implementation I modified the forward to handle new inference params, and initial state modification in cuda kernel.

Looking forward for some reviews, please note that I am not experienced in cuda kernel development and relied heavily on AI tools for updating it.
I added a few tests to verify correctness of step_chunk processing.

parallel forward with state first implementation

d7796f8

mathieu-charbonnel changed the title ~~Mathieu.charbonnel/parallel forward with state~~ Parallel forward with state Jan 6, 2026

mathieu-charbonnel added 2 commits January 27, 2026 15:03

Inject the state only to the first state

ed38d4a

Makes the test more demanding

0818d8b

mathieu-charbonnel force-pushed the mathieu.charbonnel/parallel_forward_with_state branch from 2d562b2 to 0818d8b Compare January 27, 2026 14:04

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Parallel forward with state#830

Parallel forward with state#830
mathieu-charbonnel wants to merge 3 commits intostate-spaces:mainfrom
DataDog:mathieu.charbonnel/parallel_forward_with_state

mathieu-charbonnel commented Dec 29, 2025 •

edited

Loading

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

mathieu-charbonnel commented Dec 29, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

mathieu-charbonnel commented Dec 29, 2025 •

edited

Loading