lib: implement hashset by pzmarzly · Pull Request #470 · facebook/bpfilter

pzmarzly · 2026-03-12T14:59:59Z

Currently, bf_set uses bf_list under the hood. This is suboptimal performance-wise. Here I am changing bf_set to be backed by bf_hashset.

lib: helper: add FNV-1a hash function - imported from another PR, a simple hash function.
lib: add bf_hashset - main commit. Implements a simple hashset (tombstones on deletion, grows 2x when load factor exceeds 50%, never shrinks, linear probing on collision). See https://en.wikipedia.org/wiki/Linear_probing (with "special flag value" strategy in "Deletion" section).
lib: set: use bf_hashset for elems field - changes bf_set's inner structure

See #460 for previous attempt.

Fixes #418

src/libbpfilter/set.c

tests/unit/libbpfilter/data_structures/hashset.c

src/libbpfilter/include/bpfilter/data_structures/hashset.h

src/libbpfilter/set.c

src/libbpfilter/include/bpfilter/data_structures/hashset.h

github-actions · 2026-03-12T15:12:19Z

Claude review of PR #470 (`d19f0fc`)

Must fix

~~foreach_break test asserts on unmodified variable~~ — Fixed: count is now incremented before break, so count == 1 is correct.
~~No test for bf_hashset_take~~ — Fixed: take test now in tests/unit/libbpfilter/core/hashset.c:272.
~~void * pointer arithmetic in bf_vector_foreach will break pedantic C17 integration test~~ (dismissed) — Maintainer explicitly requested not using char * casts; both maintainer and author agreed this is a future topic.
~~Uninitialized old_slots in _bf_hashset_grow causes heap corruption on OOM~~ — Fixed: old_slots is now a plain void ** without cleanup attribute (bf_vector removed from PR). Assigned before use.
~~Unchecked bf_hashset_add return in _bf_hashset_grow~~ — Fixed: return value is now checked with if (r) { free(old_slots); return r; }. See new suggestion below about the error path's element-loss subtlety.
~~Missing <assert.h> in vector.h~~ — Fixed: vector.h was removed from this PR (bf_vector split into a separate PR per reviewer feedback).
~~Load factor threshold is 50%, not 70% as described~~ — Fixed: PR description now correctly states 50%, matching the code constants _BF_HASHSET_MAX_LOAD_NUM=5 / _BF_HASHSET_MAX_LOAD_DEN=10.
bf_hashset_reserve ignores needed when computing new_cap — src/libbpfilter/core/hashset.c:235 — new_cap is derived from set->cap (or _BF_HASHSET_INIT_CAP) instead of from the computed needed value. On an empty set, bf_hashset_reserve(&set, 100) resizes to 16 slots instead of ~256. All 5 independent reviewers flagged this.
_bf_round_next_power_of_2 missing >> 32 shift for 64-bit size_t — src/libbpfilter/core/hashset.c:32 — Only shifts up to >> 16, so values above 2^32 return non-power-of-two results. Becomes critical once the reserve bug is fixed. All 5 independent reviewers flagged this.

Suggestions

Nits

CLAUDE.md improvements

Style guide says "Use backticks to reference function, variable, and parameter names" in Doxygen, but @c is used in new code at core/hashset.h (7 occurrences). The existing codebase has ~16 @c uses and ~203 backtick uses — backticks are the dominant convention. Consider adding an explicit note in CLAUDE.md about preferring backticks over @c and @ref for inline code references.
The codebase is inconsistent about __attribute__((cleanup(...))) vs __attribute__((__cleanup__(...))) — the new core/hashset.h uses the non-underscored form (matching core/list.h), while the majority of other headers use the double-underscore form. Consider documenting the preferred form in CLAUDE.md.

Resolved from prior review

Workflow run

src/libbpfilter/set.c

src/libbpfilter/data_structures/vector.c

src/libbpfilter/include/bpfilter/core/hashset.h

qdeslandes

Alright, first pass of review and a few things to fix. I'll do a second, deeper pass, when those are solved. That being said, it's a very welcome addition! :D

src/libbpfilter/include/bpfilter/helper.h

src/libbpfilter/helper.c

src/bfcli/chain.c

src/libbpfilter/core/vector.c

src/libbpfilter/data_structures/vector.c

src/libbpfilter/include/bpfilter/data_structures/hashset.h

src/libbpfilter/include/bpfilter/core/hashset.h

tests/unit/libbpfilter/data_structures/hashset.c

src/libbpfilter/include/bpfilter/core/hashset.h

src/libbpfilter/core/hashset.c

src/libbpfilter/set.c

src/libbpfilter/include/bpfilter/core/hashset.h

src/libbpfilter/set.c

src/libbpfilter/core/hashset.c

src/libbpfilter/include/bpfilter/core/hashset.h

tests/unit/libbpfilter/core/hashset.c

src/libbpfilter/include/bpfilter/core/vector.h

src/libbpfilter/core/hashset.c

src/libbpfilter/set.c

src/libbpfilter/core/hashset.c

src/libbpfilter/set.c

pzmarzly · 2026-03-26T15:27:19Z

I let Claude search for optimizations overnight, and it proposed a different representation that has much better cache locality - bf_hashset_elem** elems instead of bf_hashset_elem* elems, i.e. each node is individually allocated, like in bf_list. It makes reads ~40% faster on large sets. Big change coming.

src/libbpfilter/core/hashset.c

src/libbpfilter/include/bpfilter/core/hashset.h

src/libbpfilter/core/hashset.c

yaakov-stein

Claude has a few valid nits/suggestions and I have one last comment on the tests. Overall LGTM once those points are taken care of!

src/libbpfilter/include/bpfilter/core/hashset.h

src/libbpfilter/core/hashset.c

tests/unit/libbpfilter/core/hashset.c

yaakov-stein · 2026-03-26T18:28:02Z

I let Claude search for optimizations overnight, and it proposed a different representation that has much better cache locality - bf_hashset_elem** elems instead of bf_hashset_elem* elems, i.e. each node is individually allocated, like in bf_list. It makes reads ~40% faster on large sets. Big change coming.

I'm not necessarily opposed to the change as it simplifies some parts of the code, but I'm confused by the claim that this has better cache locality - shouldn't the cache locality here be much worse? Whenever we need to check for equality we need to load a non-contiguous piece of memory. We also can't take advantage of the spatial locality anymore. Can you explain to me what cases you saw ~40% speedup on?

pzmarzly · 2026-03-30T16:07:34Z

I'm not necessarily opposed to the change as it simplifies some parts of the code, but I'm confused by the claim that this has better cache locality - shouldn't the cache locality here be much worse? Whenever we need to check for equality we need to load a non-contiguous piece of memory. We also can't take advantage of the spatial locality anymore. Can you explain to me what cases you saw ~40% speedup on?

This was surprising as well to me, so you're right, I should have explained.

The benchmark was (a) generate and insert 1 million random IPs, (b) use bf_hashset_foreach to read it back. bf_hashset_foreach is probably the most important function/macro to optimize, as it's used in many heavy operations (serialization, cgen, bfcli printing).

In both versions (I'll call them bf_hashset_elem[] and bf_hashset_elem*[]), foreach requires 2 pointer dereferences to get to the data: first you resolve elem = *elem.next, then read *elem.data.

In bf_hashset_elem*[], elems are malloced in the same order as they are created. This means that elements created later will usually have higher addresses, unless there was something freed in the meantime. This turns out great for reading.

Meanwhile in bf_hashset_elem[] version, elems are ordered in memory by their hash. Following elem = *elem.next often requires going backwards, to memory that wasn't prefetched into cache. Hence the 40% slower results.

With bf_hashset_elem*[], bf_hashset_contains goes from 1 dereference to 2. That is unfortunate. But I think faster foreach is worth it.

pzmarzly · 2026-03-30T16:12:41Z

Massive thanks for your continued reviews @yaakov-stein . I'm uploading the latest version with small changes. If they look good, I'll reopen this PR, copying the few comments that are still relevant. It was an experiment to me how much I can rely on the PR review bot, turns out it's pretty good, but GH UX starts degrading massively after 50+ comments.

src/libbpfilter/core/hashset.c

src/libbpfilter/include/bpfilter/set.h

src/libbpfilter/include/bpfilter/core/hashset.h

src/libbpfilter/core/hashset.c

yaakov-stein · 2026-03-30T19:11:47Z

The benchmark was (a) generate and insert 1 million random IPs, (b) use bf_hashset_foreach to read it back. bf_hashset_foreach is probably the most important function/macro to optimize, as it's used in many heavy operations (serialization, cgen, bfcli printing).

In both versions (I'll call them bf_hashset_elem[] and bf_hashset_elem*[]), foreach requires 2 pointer dereferences to get to the data: first you resolve elem = *elem.next, then read *elem.data.

In bf_hashset_elem*[], elems are malloced in the same order as they are created. This means that elements created later will usually have higher addresses, unless there was something freed in the meantime. This turns out great for reading.

Meanwhile in bf_hashset_elem[] version, elems are ordered in memory by their hash. Following elem = *elem.next often requires going backwards, to memory that wasn't prefetched into cache. Hence the 40% slower results.

With bf_hashset_elem*[], bf_hashset_contains goes from 1 dereference to 2. That is unfortunate. But I think faster foreach is worth it.

Hm, that makes sense. I think the right approach here would be to call out this behavior and mention the tradeoffs in a PR note for Quentin. To me, it seems surprising that we'd optimize a non-classic hashset operation (foreach) at the expense of the classic hashset operations (contains, add, delete). I'd personally expect that a hashset would be optimized for the standard hashset operations, but I understand where you're coming from. If you split this out to a new PR, maybe let's include a note on how and why this decision was made.

github-actions · 2026-03-31T10:30:44Z

src/libbpfilter/core/hashset.c

+    if (needed <= set->cap)
+        return 0;
+
+    new_cap = _bf_round_next_power_of_2(set->cap ? set->cap : _BF_HASHSET_INIT_CAP);


Claude: must-fix: bf_hashset_reserve computes needed from count but then derives new_cap solely from set->cap (or _BF_HASHSET_INIT_CAP), ignoring needed entirely. For example, bf_hashset_reserve(&set, 100) on an empty set computes needed = 200 but sets new_cap = _bf_round_next_power_of_2(16) = 16, resizing to only 16 slots. The unit test assert_true(bf_hashset_cap(&set) >= 200) would catch this at runtime.

new_cap should be derived from needed, e.g.:

new_cap = _bf_round_next_power_of_2(needed);

github-actions · 2026-03-31T10:30:45Z

src/libbpfilter/core/hashset.c

+    value |= value >> 2;
+    value |= value >> 4;
+    value |= value >> 8;
+    value |= value >> 16;


Claude: must-fix: _bf_round_next_power_of_2 only shifts up to >> 16, which covers 32-bit values but not 64-bit size_t. For any input where the highest set bit is at position 32 or above, the function returns a non-power-of-two result. This matters because _BF_HASHSET_MAX_CAP is SIZE_MAX / 16 + 1 (well above 2^32 on 64-bit), and once the bf_hashset_reserve bug above is fixed, needed values above 2^32 would produce corrupt table capacities.

Add value |= value >> 32; after this line. The same pre-existing issue exists in cgen/program.c:66.

github-actions · 2026-03-31T10:30:46Z

src/libbpfilter/core/hashset.c

+ * size_t. */
+#define _BF_HASHSET_MAX_CAP (SIZE_MAX / 16 + 1)
+
+static inline size_t _bf_round_next_power_of_2(size_t value)


Claude: suggestion: _bf_round_next_power_of_2 is defined identically in both hashset.c and cgen/program.c:66 (and both are missing the >> 32 shift). Consider extracting it into helper.h as a shared static inline function so the fix is applied in one place and the copies don't diverge.

pzmarzly requested a review from qdeslandes as a code owner March 12, 2026 15:00

pzmarzly marked this pull request as draft March 12, 2026 15:00

meta-cla bot added the cla signed label Mar 12, 2026

github-actions bot reviewed Mar 12, 2026

View reviewed changes

src/libbpfilter/set.c Outdated Show resolved Hide resolved

github-actions bot reviewed Mar 12, 2026

View reviewed changes

tests/unit/libbpfilter/data_structures/hashset.c Outdated Show resolved Hide resolved

github-actions bot reviewed Mar 12, 2026

View reviewed changes

src/libbpfilter/include/bpfilter/data_structures/hashset.h Outdated Show resolved Hide resolved

github-actions bot reviewed Mar 12, 2026

View reviewed changes

src/libbpfilter/set.c Show resolved Hide resolved

github-actions bot reviewed Mar 12, 2026

View reviewed changes

src/libbpfilter/include/bpfilter/data_structures/hashset.h Outdated Show resolved Hide resolved

pzmarzly force-pushed the push-mlqkpqspnupl branch from 314af18 to 02938a3 Compare March 12, 2026 23:56

github-actions bot reviewed Mar 13, 2026

View reviewed changes

src/libbpfilter/set.c Show resolved Hide resolved

github-actions bot reviewed Mar 13, 2026

View reviewed changes

src/libbpfilter/data_structures/vector.c Outdated Show resolved Hide resolved

github-actions bot reviewed Mar 13, 2026

View reviewed changes

src/libbpfilter/include/bpfilter/core/hashset.h Show resolved Hide resolved

qdeslandes requested changes Mar 13, 2026

View reviewed changes

qdeslandes mentioned this pull request Mar 13, 2026

lib: add bf_hashset #460

Closed

pzmarzly mentioned this pull request Mar 13, 2026

Replace bf_dynbuf with bf_vector #473

Open

pzmarzly force-pushed the push-mlqkpqspnupl branch from 02938a3 to 1aae907 Compare March 13, 2026 15:07

pzmarzly changed the title ~~lib: create data_structures directory, implement hashset data structure~~ lib: create directory for data structures, implement hashset data structure Mar 13, 2026

pzmarzly changed the title ~~lib: create directory for data structures, implement hashset data structure~~ lib: create directory for data structures, implement hashset Mar 13, 2026

github-actions bot reviewed Mar 13, 2026

View reviewed changes

src/libbpfilter/set.c Outdated Show resolved Hide resolved

github-actions bot reviewed Mar 13, 2026

View reviewed changes

src/libbpfilter/set.c Outdated Show resolved Hide resolved

github-actions bot reviewed Mar 13, 2026

View reviewed changes

src/libbpfilter/include/bpfilter/core/hashset.h Outdated Show resolved Hide resolved

github-actions bot reviewed Mar 13, 2026

View reviewed changes

src/libbpfilter/set.c Outdated Show resolved Hide resolved

github-actions bot reviewed Mar 13, 2026

View reviewed changes

src/libbpfilter/core/hashset.c Outdated Show resolved Hide resolved

pzmarzly force-pushed the push-mlqkpqspnupl branch from 1aae907 to 550c254 Compare March 13, 2026 15:46

github-actions bot reviewed Mar 13, 2026

View reviewed changes

src/libbpfilter/include/bpfilter/core/hashset.h Show resolved Hide resolved

github-actions bot reviewed Mar 13, 2026

View reviewed changes

src/libbpfilter/include/bpfilter/core/hashset.h Show resolved Hide resolved

pzmarzly force-pushed the push-mlqkpqspnupl branch from 550c254 to d6883e1 Compare March 16, 2026 22:13

github-actions bot reviewed Mar 16, 2026

View reviewed changes

tests/unit/libbpfilter/core/hashset.c Show resolved Hide resolved

github-actions bot reviewed Mar 16, 2026

View reviewed changes

src/libbpfilter/include/bpfilter/core/vector.h Outdated Show resolved Hide resolved

github-actions bot reviewed Mar 26, 2026

View reviewed changes

src/libbpfilter/core/hashset.c Outdated Show resolved Hide resolved

github-actions bot reviewed Mar 26, 2026

View reviewed changes

src/libbpfilter/set.c Show resolved Hide resolved

github-actions bot reviewed Mar 26, 2026

View reviewed changes

src/libbpfilter/core/hashset.c Outdated Show resolved Hide resolved

github-actions bot reviewed Mar 26, 2026

View reviewed changes

src/libbpfilter/core/hashset.c Outdated Show resolved Hide resolved

github-actions bot reviewed Mar 26, 2026

View reviewed changes

src/libbpfilter/set.c Outdated Show resolved Hide resolved

pzmarzly force-pushed the push-mlqkpqspnupl branch from ef58a62 to a34983a Compare March 26, 2026 15:38

github-actions bot reviewed Mar 26, 2026

View reviewed changes

src/libbpfilter/core/hashset.c Show resolved Hide resolved

github-actions bot reviewed Mar 26, 2026

View reviewed changes

src/libbpfilter/core/hashset.c Show resolved Hide resolved

github-actions bot reviewed Mar 26, 2026

View reviewed changes

src/libbpfilter/include/bpfilter/core/hashset.h Outdated Show resolved Hide resolved

github-actions bot reviewed Mar 26, 2026

View reviewed changes

src/libbpfilter/include/bpfilter/core/hashset.h Outdated Show resolved Hide resolved

github-actions bot reviewed Mar 26, 2026

View reviewed changes

src/libbpfilter/core/hashset.c Show resolved Hide resolved

github-actions bot reviewed Mar 26, 2026

View reviewed changes

src/libbpfilter/core/hashset.c Show resolved Hide resolved

yaakov-stein reviewed Mar 26, 2026

View reviewed changes

pzmarzly force-pushed the push-mlqkpqspnupl branch from a34983a to 7998ded Compare March 30, 2026 16:14

github-actions bot reviewed Mar 30, 2026

View reviewed changes

src/libbpfilter/core/hashset.c Show resolved Hide resolved

github-actions bot reviewed Mar 30, 2026

View reviewed changes

src/libbpfilter/core/hashset.c Show resolved Hide resolved

github-actions bot reviewed Mar 30, 2026

View reviewed changes

src/libbpfilter/include/bpfilter/set.h Show resolved Hide resolved

github-actions bot reviewed Mar 30, 2026

View reviewed changes

src/libbpfilter/include/bpfilter/core/hashset.h Show resolved Hide resolved

github-actions bot reviewed Mar 30, 2026

View reviewed changes

src/libbpfilter/core/hashset.c Show resolved Hide resolved

pzmarzly added 2 commits March 31, 2026 10:16

lib: core: add bf_hashset

3e64dcb

lib: set: use bf_hashset for elems field

d19f0fc

pzmarzly force-pushed the push-mlqkpqspnupl branch from 7998ded to d19f0fc Compare March 31, 2026 10:16

github-actions bot reviewed Mar 31, 2026

View reviewed changes

Conversation

pzmarzly commented Mar 12, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

github-actions bot commented Mar 12, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Claude review of PR #470 (d19f0fc)

Must fix

Suggestions

Nits

CLAUDE.md improvements

Resolved from prior review

Uh oh!

Uh oh!

Uh oh!

Uh oh!

qdeslandes left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

pzmarzly commented Mar 26, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

yaakov-stein left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

yaakov-stein commented Mar 26, 2026

Uh oh!

pzmarzly commented Mar 30, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

pzmarzly commented Mar 30, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

yaakov-stein commented Mar 30, 2026

Uh oh!

pzmarzly commented Mar 12, 2026 •

edited

Loading

github-actions bot commented Mar 12, 2026 •

edited

Loading

Claude review of PR #470 (`d19f0fc`)

pzmarzly commented Mar 26, 2026 •

edited

Loading

pzmarzly commented Mar 30, 2026 •

edited

Loading

pzmarzly commented Mar 30, 2026 •

edited

Loading