weakref: Don't leak cyclic weakrefs. by AJMansfield · Pull Request #21 · dpgeorge/micropython

AJMansfield · 2026-03-02T15:09:28Z

Summary

In micropython#18822 (review), I discussed how that PRs implementation of weakref leaks weakrefs whose callbacks are closures of their target, with a discussion of the underlying mathematics and one primitive way to resolve this issue.

This PR proposes a more refined version of that resolution, achieved by moving mp_weakref_map table from the root pointer section of mp_state_ctx.vm to a non-root-pointer part of mp_state_ctx.mem and adding it to the GC's root closure as a separate step, so that membership in that table is only considered for the gc_sweep_free_blocks step, and not for the gc_sweep_run_finalisers step.

Testing

This PR adds an automated test that verifies the lack of a leak (with very loose bounds). I've checked this test for sensitivity against Unix, and can confirm that it passes on Unix and RP2.

I've rigorously checked the underlying mathematical theory. While it's not impossible that I've made an error in my proofs I'm confident that this change is theoretically sound --- this change maintains the GC's "conservativeness" guarantee and still prevents python code from forming an invalid.

Trade-offs and Alternatives

Rerunning a root-collection pass from this code region does come with a minor increase to code size and runtime; however, since this is able to re-use functionality that already existed in the garbage collector this extra increase should be minimal.

The behaviour follows CPython. It adds a new WTB table to the GC to efficiently track which heap pointers have a weak reference attached. Enabled at the "everything" level. Signed-off-by: Damien George <damien@micropython.org>

Signed-off-by: Damien George <damien@micropython.org>

Needs a native exp file because native code doesn't print line numbers in the traceback. Signed-off-by: Damien George <damien@micropython.org>

Following a69425b. Signed-off-by: Damien George <damien@micropython.org>

Signed-off-by: Damien George <damien@micropython.org>

The amount of GC heap got smaller because weakref WTB takes some of the available RAM. Signed-off-by: Damien George <damien@micropython.org>

Signed-off-by: Damien George <damien@micropython.org>

It takes longer now that weakref is enabled in the coverage build. Signed-off-by: Damien George <damien@micropython.org>

Signed-off-by: Damien George <damien@micropython.org>

Signed-off-by: Anson Mansfield <amansfield@mantaro.com>

dpgeorge · 2026-03-03T03:46:37Z

Thanks for this, it's interesting.

But consider the following case: a weakref.finalize() holds on to (as args) an unrelated object that has a __del__ itself, eg a file. If that file is only reachable from the weakref.finalize() then it'll have its finaliser run but then it won't be reclaimed (because it'll be reached when scanning the weakref map). Each GC cycle it'll run its finaliser again but not be reclaimed (until the weakref.finalize() is resolved itself).

Eg:

l = []
def callback(arg):
    pass
file = <some opened file>
weakref.finalize(l, callback, file)
gc.collect()  # will close the file!
gc.collect()  # will close the file again!
l = None
gc.collect()  # file is closed again, callback runs, everything is reclaimed

Am I correct about what will happen there?

AJMansfield · 2026-03-03T03:54:37Z

The FTB_CLEAR(area, block) operation at the end of the finaliser call prevents any object's finaliser from being run a second time; once an object's finaliser runs it's then permanently a post-final object on any subsequent GC pass even if it somehow were reattached to the object graph in the mean time.

Though, you're right that there's still the possibility of this being unexpected if the finaliser's file argument ends up closed even once before the finaliser runs.

dpgeorge · 2026-03-03T04:14:49Z

The FTB_CLEAR(area, block) operation at the end of the finaliser call prevents any object's finaliser from being run a second time

Ah, yes, you're right. It'll only run once.

But still, it's left in a kind of half-finalised state. And application code would definitely not expect a file to be closed before it's really ready to be GC'd. Eg the callback in my example above would expect the file to be still open so it can write to it.

AJMansfield · 2026-03-03T04:22:49Z

Took me a moment of puzzling through how or even whether micropython#18840 avoids that, too --- the reference inserted into the target's __weakref__ ends up keeping the file open and ensures that the finalize will run in the same GC generation as file.__del__, but there's nothing that enforces any kind of sequencing there.

TBH it's probably worth testing the comparison to CPython here too --- this feels like a limitation that could easily be present there too.

dpgeorge · 2026-03-03T05:28:32Z

Took me a moment of puzzling through how or even whether micropython#18840 avoids that, too --- the reference inserted into the target's __weakref__ ends up keeping the file open and ensures that the finalize will run in the same GC generation as file.__del__, but there's nothing that enforces any kind of sequencing there.

Yes, that's a good point, it looks like micropython#18840 can finalise args/kwargs before running finalize's __del__ method. You probably need to explicitly retain a pointer to all alive finalize objects so they stay reachable until the thing they weakref is gone.

TBH it's probably worth testing the comparison to CPython here too --- this feels like a limitation that could easily be present there too.

Looking at CPython's implementation, it keeps strong references to all finalize objects (and their args/kwargs) in a big dict. So it won't prematurely finalise any of the args/kwargs.

dpgeorge and others added 11 commits February 24, 2026 23:40

py/modweakref: Implement weakref module with ref class.

f146bed

The behaviour follows CPython. It adds a new WTB table to the GC to efficiently track which heap pointers have a weak reference attached. Enabled at the "everything" level. Signed-off-by: Damien George <damien@micropython.org>

tests/basics: Add tests for weakref.ref and weakref.finalize.

fa80182

Signed-off-by: Damien George <damien@micropython.org>

tests/basics: Add test for weakref having exception in callback.

6fc21ef

Needs a native exp file because native code doesn't print line numbers in the traceback. Signed-off-by: Damien George <damien@micropython.org>

webassembly/Makefile: Add test//% target.

6973f73

Following a69425b. Signed-off-by: Damien George <damien@micropython.org>

webassembly/variants/pyscript: Enable weakref module and add tests.

5f4bdfd

Signed-off-by: Damien George <damien@micropython.org>

tests/ports/webassembly: Update heap_expand expected output.

b2fad56

The amount of GC heap got smaller because weakref WTB takes some of the available RAM. Signed-off-by: Damien George <damien@micropython.org>

docs/library/weakref: Add documentation for weakref module.

bcd4413

Signed-off-by: Damien George <damien@micropython.org>

tools/ci.sh: Increase qemu_arm test run timeout.

180db59

It takes longer now that weakref is enabled in the coverage build. Signed-off-by: Damien George <damien@micropython.org>

tests/basics/weakref_multiple_refs.py: Get working on webassembly.

5955972

Signed-off-by: Damien George <damien@micropython.org>

tests/basics/weakref: Test that cyclic closures are not leaked.

5b07252

Signed-off-by: Anson Mansfield <amansfield@mantaro.com>

py/gc: Move mp_weakref_map state from VM to MEM.

266ef6c

Signed-off-by: Anson Mansfield <amansfield@mantaro.com>

AJMansfield changed the title ~~Weakref leak~~ weakref: Don't leak cyclic weakrefs. Mar 2, 2026

AJMansfield mentioned this pull request Mar 2, 2026

py: add weakref module with ref and finalize classes micropython/micropython#18822

Open

dpgeorge force-pushed the py-add-weakref branch from 5955972 to 0d6ff50 Compare March 11, 2026 08:35

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

weakref: Don't leak cyclic weakrefs.#21

weakref: Don't leak cyclic weakrefs.#21
AJMansfield wants to merge 11 commits intodpgeorge:py-add-weakreffrom
AJMansfield:weakref-leak

AJMansfield commented Mar 2, 2026

Uh oh!

dpgeorge commented Mar 3, 2026

Uh oh!

AJMansfield commented Mar 3, 2026 •

edited

Loading

Uh oh!

dpgeorge commented Mar 3, 2026

Uh oh!

AJMansfield commented Mar 3, 2026 •

edited

Loading

Uh oh!

dpgeorge commented Mar 3, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

AJMansfield commented Mar 2, 2026

Summary

Testing

Trade-offs and Alternatives

Uh oh!

dpgeorge commented Mar 3, 2026

Uh oh!

AJMansfield commented Mar 3, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

dpgeorge commented Mar 3, 2026

Uh oh!

AJMansfield commented Mar 3, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

dpgeorge commented Mar 3, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

AJMansfield commented Mar 3, 2026 •

edited

Loading

AJMansfield commented Mar 3, 2026 •

edited

Loading