On 9/28/24 20:55, Jason A. Donenfeld wrote:
This prevents false sharing, which makes a large difference on machines with several NUMA nodes, such as on a dual socket Intel(R) Xeon(R) Gold 6338 CPU @ 2.00GHz, where the "bench-multi" test goes from 2.7s down to 1.9s. While this is just test code, it also forms the basis of how folks will wind up implementing this in libraries, so we should implement this simple cache alignment improvement here.
Suggested-by: Florian Weimer <fweimer@redhat.com>
Cc: Adhemerval Zanella <adhemerval.zanella@linaro.org>
Signed-off-by: Jason A. Donenfeld <Jason@zx2c4.com>
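For readers following along, here is a minimal sketch of the kind of cache alignment the description above refers to: each per-thread state is padded so that it starts on its own cache line, which keeps two threads from writing to the same line. This is not the code from the patch. The names (state_pool, state_pool_init, state_pool_slot, round_up) are hypothetical, and the 64-byte line size is an assumption; real code may prefer to query the line size at run time.

```c
#include <assert.h>
#include <stddef.h>
#include <stdint.h>
#include <stdlib.h>

/* Assumption: 64-byte cache lines. Not taken from the patch. */
#define CACHE_LINE 64

/* Round n up to the next multiple of align (align must be a power of two). */
static size_t round_up(size_t n, size_t align)
{
	assert(align != 0 && (align & (align - 1)) == 0);
	return (n + align - 1) & ~(align - 1);
}

/* Hypothetical pool of per-thread states, one slot per thread. */
struct state_pool {
	unsigned char *base;	/* start of the allocation, cache-line aligned */
	size_t stride;		/* size of one slot after padding */
	size_t count;		/* number of slots */
};

/* Allocate count slots, each padded to a whole number of cache lines. */
static int state_pool_init(struct state_pool *p, size_t state_size, size_t count)
{
	if (state_size == 0 || count == 0)
		return -1;
	p->stride = round_up(state_size, CACHE_LINE);
	p->count = count;
	/* C11 aligned_alloc wants a size that is a multiple of the
	 * alignment; stride * count always is. Overflow is not checked
	 * in this sketch. */
	p->base = aligned_alloc(CACHE_LINE, p->stride * count);
	return p->base ? 0 : -1;
}

/* Return slot i; because the stride is a multiple of the line size,
 * no two slots share a cache line. */
static void *state_pool_slot(const struct state_pool *p, size_t i)
{
	return p->base + i * p->stride;
}
```

With a 144-byte state, for example, each slot takes 192 bytes, so neighbouring threads never touch the same line, at the cost of 48 bytes of padding per slot.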
Thank you. Applied to linux-kselftest fixes for next rc.
thanks,
-- Shuah