On Wed, Dec 28, 2022 at 07:01:36PM +0700, Ammar Faizi wrote:
On 12/28/22 1:49 AM, Willy Tarreau wrote:
I'll try to do it but do not want to make you wait too long in case it gets delayed. In the worst case we should only postpone the getauxval() patch and not the other ones.
I will split it into 2 patchset then.
OK thanks!
I've pushed for you an update which starts to do what I proposed. Errno and environ are now marked weak for all archs, and _auxv is set for i386, x86_64, arm64 and arm for now:
https://git.kernel.org/pub/scm/linux/kernel/git/wtarreau/nolibc.git/log/?h=2...
You can already use it to implement getauxval(), it will normally work for these archs.
BTW, do you think your arch-specific changes for sigaction() will be easily portable to other architectures ? I feel a bit wary of starting to have different features per architecture given the purpose of the lib, so the more uniform the coverage the better.
The 'rt_sigaction()' itself doesn't seem to be an arch specific, but the way it resumes the execution needs to call 'rt_sigreturn()' which is arch specific. I took a look at the kernel source code, most architectures read 'struct rt_sigframe' from the stack pointer.
https://github.com/torvalds/linux/blob/631aa744423173bf921191ba695bbc7c1aabd... https://github.com/torvalds/linux/blob/631aa744423173bf921191ba695bbc7c1aabd... https://github.com/torvalds/linux/blob/a6b450573b912316ad36262bfc70e7c3870c5... https://github.com/torvalds/linux/blob/a6b450573b912316ad36262bfc70e7c3870c5... https://github.com/torvalds/linux/blob/eb67d239f3aa1711afb0a42eab50459d9f3d6...
On the x86-64 arch, the implementation is just like this:
__arch_restore_rt: # # ((%rsp - sizeof(long)) must point to 'struct rt_sigframe') # # 'struct rt_sigframe' is automatically constructed by # the kernel when a signal is caught. # movl $0xf, %eax // __NR_rt_sigreturn == 0xf syscall
I think we could avoid the asm specific stuff is we get rid of the frame pointer. Please look below:
__attribute__((weak,unused,noreturn,optimize("omit-frame-pointer"),section(".text.nolibc_rt_sigreturn"))) void sys_rt_sigreturn() { my_syscall0(__NR_rt_sigreturn); __builtin_unreachable(); }
It gives me the correct code for x86_64 and i586. I don't know if other architectures will want to add a prologue. I tried with "naked" but it's ignored by the compiler since the function is not purely asm. Not very important but given that we already have everything to perform our calls it would make sense to stay on this. By the way, for the sake of consistency with other syscalls, I do think the function (or label if we can't do otherwise) should be called "sys_rt_sigreturn" as it just performs a syscall.
I believe aarch64 and RISCV don't behave differently, but different registers.
Not sure what PowerPC does here, it seems a bit different: https://github.com/torvalds/linux/blob/1612c382ffbdf1f673caec76502b1c00e6d35...
It looks similar to me, it's just that the kernel side differs but I think it's the same.
I haven't taken a look at other archs.
What do you think? Is it affordable for nolibc to implement all of these?
Yes I think so. I suspect that we might need to have a few arch-specific implementations, but we've already had this case a few times and we could easily use a pair of #define/#ifdef to skip the generic version.
Best regards, Willy