Re: [PATCH 2/2] eventpoll: Fix epoll_wait() report false negative

18 Jul 2025

      On Fri, Jul 18, 2025 at 09:38:27AM +0100, Soheil Hassas Yeganeh wrote:
...
On Fri, Jul 18, 2025 at 8:52 AM Nam Cao namcao@linutronix.de wrote:
...
ep_events_available() checks for available events by looking at ep->rdllist
and ep->ovflist. However, this is done without a lock, therefore the
returned value is not reliable. Because it is possible that both checks on
ep->rdllist and ep->ovflist are false while ep_start_scan() or
ep_done_scan() is being executed on other CPUs, despite events are
available.
This bug can be observed by:

Create an eventpoll with at least one ready level-triggered event

Create multiple threads who do epoll_wait() with zero timeout. The
threads do not consume the events, therefore all epoll_wait() should
return at least one event.

If one thread is executing ep_events_available() while another thread is
executing ep_start_scan() or ep_done_scan(), epoll_wait() may wrongly
return no event for the former thread.
That is the whole point of epoll_wait with a zero timeout. We would want to
opportunistically poll without much overhead, which will have more
false positives.
A caller that calls with a zero timeout should retry later, and will
at some point observe the event.
Is this a documented behavior that users expect? I do not see this in the
man page.
It sounds completely broken to me that an event has been sitting there for
some time, but epoll_wait() says there is nothing.
...
I'm not sure if we would want to add much more overheads, for higher precision.
Correctness before performance please. And I'm not sure what you mean by
"much more overheads". While this patch definitely adds some cycles in case
there is no event, epoll_wait() still returns "instantly".
Nam

2026

2025

2024

2023

2022

2021

2020

2019

2018

2017

Re: [PATCH 2/2] eventpoll: Fix epoll_wait() report false negative