In the Linux kernel, the following vulnerability has been resolved:
perf: Fix hang while freeing sigtrap event
Perf can hang while freeing a sigtrap event if a related deferred signal hadn't managed to be sent before the file got closed:
perfeventoverflow() taskworkadd(perfpendingtask)
fput() taskworkadd(__fput())
taskworkrun() _fput() perfrelease() perfeventreleasekernel() _freeevent() perfpendingtasksync() taskworkcancel() -> FAILED rcuwaitwait_event()
Once taskworkrun() is running, the list of pending callbacks is removed from the taskstruct and from this point on taskworkcancel() can't remove any pending and not yet started work items, hence the taskworkcancel() failure and the hang on rcuwaitwait_event().
Task work could be changed to remove one work at a time, so a work running on the current task can always cancel a pending one, however the wait / wake design is still subject to inverted dependencies when remote targets are involved, as pictured by Oleg:
T1 T2
fd = perfeventopen(pid => T2->pid); fd = perfeventopen(pid => T1->pid); close(fd) close(fd) <IRQ> <IRQ> perfeventoverflow() perfeventoverflow() taskworkadd(perfpendingtask) taskworkadd(perfpendingtask) </IRQ> </IRQ> fput() fput() taskworkadd(_fput()) taskworkadd(_fput())
task_work_run() task_work_run()
____fput() ____fput()
perf_release() perf_release()
perf_event_release_kernel() perf_event_release_kernel()
_free_event() _free_event()
perf_pending_task_sync() perf_pending_task_sync()
rcuwait_wait_event() rcuwait_wait_event()
Therefore the only option left is to acquire the event reference count upon queueing the perf task work and release it from the task work, just like it was done before 3a5465418f5f ("perf: Fix event leak upon exec and file release") but without the leaks it fixed.
Some adjustments are necessary to make it work:
A child event might dereference its parent upon freeing. Care must be taken to release the parent last.
Some places assuming the event doesn't have any reference held and therefore can be freed right away must instead put the reference and let the reference counting to its job.