[6.17]NVIDIA: VR: SAUCE: firmware: smccc: lfa: fix work item re-initialization race by nirmoy · Pull Request #343 · NVIDIA/NV-Kernels

nirmoy · 2026-03-13T09:52:44Z

Move INIT_WORK() for fw_images_update_work from update_fw_images_tree() to lfa_init() so the work item is initialized once at module load rather than re-initialized on every firmware image tree update. Re-initializing a work item that may already be queued is unsafe and can corrupt the workqueue.

Add flush_workqueue() in lfa_notify_handler() before rescanning the image list to ensure any pending remove_invalid_fw_images work completes first, preventing use-after-free on the image list.

Fixes: 1dd9a8f ("NVIDIA: VR: SAUCE: firmware: smccc: add support for Live Firmware Activation (LFA)")

LP: https://bugs.launchpad.net/ubuntu/+source/linux-nvidia/+bug/2138342

…ion race Move INIT_WORK() for fw_images_update_work from update_fw_images_tree() to lfa_init() so the work item is initialized once at module load rather than re-initialized on every firmware image tree update. Re-initializing a work item that may already be queued is unsafe and can corrupt the workqueue. Add flush_workqueue() in lfa_notify_handler() before rescanning the image list to ensure any pending remove_invalid_fw_images work completes first, preventing use-after-free on the image list. Fixes: 1dd9a8f ("NVIDIA: VR: SAUCE: firmware: smccc: add support for Live Firmware Activation (LFA)") Signed-off-by: Nirmoy Das <nirmoyd@nvidia.com>

clsotog

Acked-by: Carol L Soto <csoto@nvidia.com>

nvmochs · 2026-03-13T16:05:42Z

@nirmoy I agree this fixes the issue, but the usage convention seems a bit awkward with the single global work struct.

Basically, if anyone calls update_fw_images_tree() they need to ensure the workqueue is flushed before calling again. Maybe it would be cleaner to dynamically allocate the work struct in update_fw_images_tree(), INIT/enqueue it, and then free in the handler? Then we don't need to "serialize" the flushes and calling update_fw_images_tree(). Of course, the downside to that approach is what to do if the kmalloc fails...

nvmochs · 2026-03-13T17:44:54Z

@nirmoy I agree this fixes the issue, but the usage convention seems a bit awkward with the single global work struct.

Basically, if anyone calls update_fw_images_tree() they need to ensure the workqueue is flushed before calling again. Maybe it would be cleaner to dynamically allocate the work struct in update_fw_images_tree(), INIT/enqueue it, and then free in the handler? Then we don't need to "serialize" the flushes and calling update_fw_images_tree(). Of course, the downside to that approach is what to do if the kmalloc fails...

Nirmoy and I met and reviewed his proposed changes and the workqueue API, specifically queue_work(). That service is tolerant of the work item already residing in the list, so I no longer have a concern about the usage convention. The key change that is being made via this PR is moving INIT_WORK to the init() path so that it is only invoked once.

nvmochs

No further issues or concerns from me.

Acked-by: Matthew R. Ochs <mochs@nvidia.com>

nvmochs · 2026-03-13T18:44:44Z

PR sent to Canonical.

KobaKoNvidia · 2026-03-20T05:06:14Z

this PR is a fix for real bug I encountered

Unable to handle kernel NULL pointer dereference at virtual address 0000000000000008
CPU: 0 UID: 0 PID: 4521 Comm: kworker/u1409:6
Workqueue: remove_invalid_fw_images (fw_images_update_wq)
pc : process_one_work+0xd4/0x430
lr : worker_thread+0x310/0x430
Code: 91020279 d2800401 53041ee0 b9003260 (f94006b8)

Acked

nirmoy requested review from clsotog, jamieNguyenNVIDIA and nvmochs March 13, 2026 09:53

nirmoy changed the title ~~NVIDIA: VR: SAUCE: firmware: smccc: lfa: fix work item re-initialization race~~ [6.17]NVIDIA: VR: SAUCE: firmware: smccc: lfa: fix work item re-initialization race Mar 13, 2026

clsotog approved these changes Mar 13, 2026

View reviewed changes

nvmochs approved these changes Mar 13, 2026

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[6.17]NVIDIA: VR: SAUCE: firmware: smccc: lfa: fix work item re-initialization race#343

[6.17]NVIDIA: VR: SAUCE: firmware: smccc: lfa: fix work item re-initialization race#343
nirmoy wants to merge 1 commit intoNVIDIA:24.04_linux-nvidia-6.17-nextfrom
nirmoy:lfa_fix

nirmoy commented Mar 13, 2026

Uh oh!

clsotog left a comment

Uh oh!

nvmochs commented Mar 13, 2026

Uh oh!

nvmochs commented Mar 13, 2026

Uh oh!

nvmochs left a comment

Uh oh!

nvmochs commented Mar 13, 2026

Uh oh!

KobaKoNvidia commented Mar 20, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

Conversation

nirmoy commented Mar 13, 2026

Uh oh!

clsotog left a comment

Choose a reason for hiding this comment

Uh oh!

nvmochs commented Mar 13, 2026

Uh oh!

nvmochs commented Mar 13, 2026

Uh oh!

nvmochs left a comment

Choose a reason for hiding this comment

Uh oh!

nvmochs commented Mar 13, 2026

Uh oh!

KobaKoNvidia commented Mar 20, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants