Re: [musl] Re: [PATCH v8 00/38] arm64/gcs: Provide support for GCS in userspace

20 Feb 2024


      On Tue, 2024-02-20 at 13:57 -0500, Rich Felker wrote:
...
On Tue, Feb 20, 2024 at 06:41:05PM +0000, Edgecombe, Rick P wrote:
...
Hmm, could the shadow stack underflow onto the real stack then? Not
sure how bad that is. INCSSP (incrementing the SSP register on x86)
loops are not rare so it seems like something that could happen.
Shadow stack underflow should fault on attempt to access
non-shadow-stack memory as shadow-stack, no?
Maybe I'm misunderstanding. I thought the proposal included allowing
shadow stack access to convert normal address ranges to shadow stack,
and normal writes to convert shadow stack to normal.
...
...
Won't this prevent catching stack overflows when they happen? An
overflow will just turn the shadow stack into normal stack and only
get
detected when the shadow stack unwinds?
I don't think that's as big a problem as it sounds like. It might
make
pinpointing the spot at which things went wrong take a little bit
more
work, but it should not admit any wrong-execution.
Right, it's a point about debugging. I'm just trying to analyze the
pros and cons and not calling it a showstopper.
...
...
Shadow stacks currently have automatic guard gaps to try to prevent
one
thread from overflowing onto another thread's shadow stack. This
would
somewhat opens that up, as the stack guard gaps are usually
maintained
by userspace for new threads. It would have to be thought through
if
these could still be enforced with checking at additional spots.
I would think the existing guard pages would already do that if a
thread's shadow stack is contiguous with its own data stack.
The difference is that the kernel provides the guard gaps, where this
would rely on userspace to do it. It is not a showstopper either.
I think my biggest question on this is how does it change the
capability for two threads to share a shadow stack. It might require
some special rules around the syscall that writes restore tokens. So
I'm not sure. It probably needs a POC.
...
From the musl side, I have always looked at the entirely of shadow
stack stuff with very heavy skepticism, and anything that breaks
existing interface contracts, introduced places where apps can get
auto-killed because a late resource allocation fails, or requires
applications to code around the existence of something that should be
an implementation detail, is a non-starter. To even consider shadow
stack support, it must truely be fully non-breaking.
The manual assembly stack switching and JIT code in the apps needs to
be updated. I don't think there is a way around it.
I agree though that the late allocation failures are not great. Mark is
working on clone3 support which should allow moving the shadow stack
allocation to happen in userspace with the normal stack. Even for riscv
though, doesn't it need to update a new register in stack switching?
BTW, x86 shadow stack has a mode where the shadow stack is writable
with a special instruction (WRSS). It enables the SSP to be set
arbitrarily by writing restore tokens. We discussed this as an option
to make the existing longjmp() and signal stuff work more transparently
for glibc.
...
...
...
_Without_ doing this, sigaltstack cannot be used to recover from
stack
overflows if the shadow stack limit is reached first, and
makecontext
cannot be supported without memory leaks and unreportable error
conditions.
FWIW, I think the makecontext() shadow stack leaking is a bad idea.
I
would prefer the existing makecontext() interface just didn't
support
shadow stack, rather than the leaking solution glibc does today.
AIUI the proposal by Stefan makes it non-leaking because it's just
using normal memory that reverts to normal usage on any
non-shadow-stack access.
Right, but does it break any existing apps anyway (because of small
ucontext stack sizes)?
BTW, when I talk about "not supporting" I don't mean the app should
crash. I mean it should instead run normally, just without shadow stack
enabled. Not sure if that was clear. Since shadow stack is not
essential for an application to function, it is only security hardening
on top.
Although determining if an application supports shadow stack has turned
out to be difficult in practice. Handling dlopen() is especially hard.

2025

2024

2023

2022

2021

2020

2019

2018

2017

Re: [musl] Re: [PATCH v8 00/38] arm64/gcs: Provide support for GCS in userspace