Hi Ted,
Às 12:01 de 04/03/21, Theodore Ts'o escreveu:
On Wed, Mar 03, 2021 at 09:42:06PM -0300, André Almeida wrote:
** Performance
- For comparing futex() and futex2() performance, I used the artificial benchmarks implemented at perf (wake, wake-parallel, hash and requeue). The setup was 200 runs for each test and using 8, 80, 800, 8000 for the number of threads, Note that for this test, I'm not using patch 14 ("kernel: Enable waitpid() for futex2") , for reasons explained at "The patchset" section.
How heavily contended where the benchmarks? One of the benefits of the original futex was that no system call was necessary in the happy path when the lock is uncontended.
futex2 has the same design in that aspect, no syscall is needed in the happy path. Did something in the cover letter gave the impression that is not the case? I would like to reword it to clarify this.
Especially on a non-NUMA system (which are the far more common case), since that's where relying on a single memory access was a huge win for the original futex. I would expect that futex2 will fare worse in this particular case, since it requires a system call entry for all operations --- the question is how large is the delta in this worst case (for futex2) and best case (for futex) scenario.
Cheers,
- Ted
Thanks, André