In place deinterleave by Shnatsel · Pull Request #95 · QuState/PhastFT

Shnatsel · 2026-03-24T21:11:51Z

Addresses #93

WIP. Works, but the size at which parallelism switches on or off still needs tuning. I don't think there's a way around measuring at runtime how each approach performs: there isn't a portable way to get cache information, and the switch from single-threaded to multi-threaded being faster is very sudden as we leave the cache and hit the memory wall.
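A minimal sketch of what such a runtime measurement could look like, assuming both variants are available as closures over a mutable `f32` buffer. `Strategy`, `best_of`, `calibrate`, and `find_crossover` are hypothetical names for illustration, not part of PhastFT:

```rust
use std::time::{Duration, Instant};

/// Which variant to use for a given input size (hypothetical enum).
#[derive(Debug, Clone, Copy, PartialEq, Eq)]
pub enum Strategy {
    SingleThreaded,
    MultiThreaded,
}

/// Runs `f` `reps` times and returns the fastest observed wall-clock time.
fn best_of<F: FnMut()>(reps: usize, mut f: F) -> Duration {
    (0..reps)
        .map(|_| {
            let start = Instant::now();
            f();
            start.elapsed()
        })
        .min()
        .unwrap_or(Duration::MAX)
}

/// Times both candidates on a scratch buffer of `len` elements
/// and reports whichever was faster on this machine.
pub fn calibrate(
    len: usize,
    single: impl Fn(&mut [f32]),
    multi: impl Fn(&mut [f32]),
) -> Strategy {
    let mut scratch = vec![0.0f32; len];
    let t_single = best_of(5, || single(&mut scratch[..]));
    let t_multi = best_of(5, || multi(&mut scratch[..]));
    if t_multi < t_single {
        Strategy::MultiThreaded
    } else {
        Strategy::SingleThreaded
    }
}

/// Returns the first size in `sizes` at which the multi-threaded
/// candidate wins, or `None` if it never does.
pub fn find_crossover(
    sizes: &[usize],
    single: impl Fn(&mut [f32]),
    multi: impl Fn(&mut [f32]),
) -> Option<usize> {
    sizes
        .iter()
        .copied()
        .find(|&len| calibrate(len, &single, &multi) == Strategy::MultiThreaded)
}
```

Taking the minimum of a few repetitions is one way to reduce scheduling noise; a real calibration would likely also need warm-up runs and a way to cache the result.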

There's also a very sudden transition from out-of-place being faster to in-place being faster as we leave the cache on Zen4. Apple M4 is not affected by that: its out-of-place performance stays almost unchanged far past the advertised cache size, which is a rather surprising result.
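For readers unfamiliar with the two approaches being compared, here is a rough sketch. The function names are hypothetical, and the in-place version uses a simple divide-and-conquer with rotations, swapped in for illustration; it is not the blocked implementation from this PR:

```rust
/// Out-of-place: reads interleaved [r0, i0, r1, i1, ...] from `input`
/// and writes the two halves into separate output buffers.
pub fn deinterleave_out_of_place(input: &[f32], re: &mut [f32], im: &mut [f32]) {
    assert_eq!(input.len(), re.len() * 2);
    assert_eq!(re.len(), im.len());
    for ((pair, r), i) in input.chunks_exact(2).zip(re.iter_mut()).zip(im.iter_mut()) {
        *r = pair[0];
        *i = pair[1];
    }
}

/// In-place: rearranges [r0, i0, r1, i1, ...] into [r0, r1, ..., i0, i1, ...]
/// without a second buffer, at the cost of O(n log n) element moves.
pub fn deinterleave_in_place(data: &mut [f32]) {
    assert!(data.len() % 2 == 0, "expected an even number of elements");
    let pairs = data.len() / 2;
    if pairs < 2 {
        return; // zero or one pair is already in split order
    }
    let left_pairs = pairs / 2;
    let (left, right) = data.split_at_mut(2 * left_pairs);
    deinterleave_in_place(left);
    deinterleave_in_place(right);
    // Layout is now [R_left | I_left | R_right | I_right].
    // Rotating the middle region swaps I_left and R_right,
    // giving [R_left | R_right | I_left | I_right].
    data[left_pairs..left_pairs + pairs].rotate_left(left_pairs);
}
```

The out-of-place version touches twice as much memory, while the in-place version does more work per element, which is one plausible reason the faster choice flips once the working set no longer fits in cache.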

codecov-commenter · 2026-03-24T21:15:30Z

Codecov Report

✅ All modified and coverable lines are covered by tests.
✅ Project coverage is 99.79%. Comparing base (6ca05ed) to head (d6bf80f).
⚠️ Report is 2 commits behind head on main.

Additional details and impacted files

@@           Coverage Diff           @@
##             main      #95   +/-   ##
=======================================
  Coverage   99.79%   99.79%           
=======================================
  Files           8        8           
  Lines        1441     1471   +30     
=======================================
+ Hits         1438     1468   +30     
  Misses          3        3

Shnatsel added 16 commits March 24, 2026 11:56

Add deinterleave_into() variants that don't allocate

b0732f3

Initial implementation of in-place deinterleaving

ef4c2b1

Add benchmarks for in-place interleaving/deinterleaving with 8kb block size

a420c69

use idiomatic iterators instead of scary math

34801b7

Rename test

c7d4fc1

Split out interleaving within blocks into a separate helper function

f797664

PoC: parallelize stage 2 of in-place interleaving

65fd88c

Extend the deinterleaving benchmarks a little further up, they're fast enough

5adab9b

Make block sizes more explicit

44054ac

PoC: even more parallel in-place interleaving

08b7024

Experiment: spawn right away without collecting to vec first

aaca5fc

Experiment: use thread-local scratch buffers

bfed9e4

Roll back to earlier version that collects to a Vec; the scope::spawn version shows no benefit

c0c7d87

inline block processing functions into respective in-place functions since benchmarks show parallelizing one step but not the other isn't beneficial

87c60a7

Delete the now-inlined functions

ba947bc

Add a parallel version of in-place deinterleaving, following the interleaving template

ee34caf

Shnatsel mentioned this pull request Mar 24, 2026

PoC: parallel BRAVO #97

Draft

Shnatsel added 4 commits March 25, 2026 10:16

Add a standalone benchmark for FFT in interleaved format

744fbec

Refactor out-of-place deinterleaving to use generics, to cut down on macro boilerplate

e3d3dcf

Jankily wire up in-place interleaving/deinterleaving to complex_nums FFT

e123cd8

Declare that benchmark_interleaved example requires complex-nums feature

d6bf80f

This was referenced Mar 27, 2026

Optimize FFT on Complex<T> #99

Open

Make out-of-place deinterleaving multi-threaded #100

Draft

PoC: compute twiddles on the fly #102

Draft

Shnatsel wants to merge 20 commits into main from in-place-deinterleave

2 participants
