Skip to content

Initial work on interleaving performance#94

Merged
Shnatsel merged 7 commits intomainfrom
interleaving-performance
Mar 24, 2026
Merged

Initial work on interleaving performance#94
Shnatsel merged 7 commits intomainfrom
interleaving-performance

Conversation

@Shnatsel
Copy link
Copy Markdown
Collaborator

@Shnatsel Shnatsel commented Mar 24, 2026

  • Add a benchmark for interleaving/deinterleaving
  • Simplify and slightly speed up deinterleaving code
  • Improve tests

…passing --cfg bench so they could be benchmarked
… operations are part of base SSE; they appear to be already enough to saturate cache bandwidth. The compiler seems to unroll the loop and allow for ILP, so no wonder. AVX2 also requires a more expensive cross-lane shuffle so there's not a whole lot of benefit to it even in theory, and we get the exact same performance on benchmarks.
@codecov-commenter
Copy link
Copy Markdown

Codecov Report

✅ All modified and coverable lines are covered by tests.
✅ Project coverage is 99.79%. Comparing base (c583ef5) to head (02fa234).

Additional details and impacted files
@@           Coverage Diff           @@
##             main      #94   +/-   ##
=======================================
  Coverage   99.79%   99.79%           
=======================================
  Files           8        8           
  Lines        1441     1441           
=======================================
  Hits         1438     1438           
  Misses          3        3           

☔ View full report in Codecov by Sentry.
📢 Have feedback on the report? Share it here.

🚀 New features to boost your workflow:
  • ❄️ Test Analytics: Detect flaky tests, report on failures, and find test suite problems.

@Shnatsel Shnatsel merged commit 6ca05ed into main Mar 24, 2026
10 checks passed
@Shnatsel Shnatsel deleted the interleaving-performance branch March 24, 2026 16:15
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants