Peter Zijlstra peterz@infradead.org writes:
On Mon, Feb 15, 2021 at 12:23:52PM -0300, André Almeida wrote:
Create a new set of futex syscalls known as futex2. This new interface is aimed to implement a more maintainable code, while removing obsolete features and expanding it with new functionalities.
Implements wait and wake semantics for futexes, along with the base infrastructure for future operations.
- futex_table = alloc_large_system_hash("futex2", sizeof(struct futex_bucket),
futex2_hashsize, 0,
futex2_hashsize < 256 ? HASH_SMALL : 0,
&futex_shift, NULL,
futex2_hashsize, futex2_hashsize);
So why are we implementing a whole second infrastrure and doubling the memory footprint of all this?
Sure, futex.c is a pain in the ass, but most of that is not because of the interface, most of it is having to deal with sharing state with userspace and that being fundamentally unreliable.
Once you want to add {,UN}LOCK{,_PI} and robust futex support, you're back to it being a giant rats nest of corner cases. Thinking a new interface can solve any of that is naive.
So while I'm in favour of adding a new interface, I'm not sure I see benefit of reimplementing the basics, sure it seems simpler now, but that's because you've not implemented all the 'fun' stuff.
Peter,
I think there was a question of how we'd introduce this new interface, since the community is wary of introducing new features in the original futex code. The approach we discussed in the last LPC was writing a new code from scratch that could even sit on the RT tree while it get stabilized.
The code base duplication, and - perhaps more importantly - the double memory footprint is bad, for sure. But considering this implementation modifies what is enqueued to separate the waiter structure from its (potentially multiple) keys, we'd be looking at large changes in the original futex code, which doesn't seem (as it should be) welcome by the community. At least this feedback was the reason we started working on futex2.
So, my question is, how else should we present this interface, if not in a new code base? If it is just a matter of integrating it into the original code, I'd go back and ask why this new interface was made a prerequisite for the futex wait multiple patches André originally wanted to merge? For sure, the multiplexing interface is not the sole reason that stopped that work from being accepted. The bigger goal was not increasing the mess and not causing new bugs in the existing overcomplex futex implementation.
Regarding NUMA support, I wouldn't expect André to implement all other features before getting something upstreamable in the list. His interest in futex is directed at solving the multiple wait problem, but consider he has already gone way beyond that by re-implementing the basics of a new interface. Collabora is willing to do the heavy lifting necessary (within reason) on this patchset to get something the community accepts, including NUMA support, provided we also get semantics to fix the problem we are trying to solve. But for that, we gonna need more directioning on what the community is willing to accept not only regarding the interface, but on internals too.