On Tue, May 16, 2023 at 4:24 PM Dave Hansen dave.hansen@intel.com wrote:
On 5/15/23 06:05, jeffxu@chromium.org wrote:
From: Jeff Xu jeffxu@google.com
This patch enables PKEY_ENFORCE_API for the munmap syscall.
The basic problem here is how we know when the set of syscalls that are patched here is good enough and how we catch future functionality that might need to be captured as well.
This mechanism really needs to be able to defend against *any* changes to the address space. I assume that folks are using syscall filtering to prevent new syscalls from causing havoc, but is there anything that can be done for, say, things like madvise()? I bet it was harmless for a long time until MADV_DONTNEED showed up and made it able to effectively zero memory.
Not any change, just a limited set of syscall from user space. I think it is reasonable to hope that any kind of syscall ABI change that affects VMA will get reviewed thoroughly from now on.
Also, if we continue to add mseal() to the kernel, we will have to pay more attention to syscalls related to VMA.
Thanks -Jeff Xu