From mboxrd@z Thu Jan 1 00:00:00 1970 From: Mathieu Desnoyers Subject: Re: [RFC PATCH v4 1/5] glibc: Perform rseq(2) registration at nptl init and thread creation Date: Thu, 22 Nov 2018 10:33:19 -0500 (EST) Message-ID: <1306224240.10055.1542900799576.JavaMail.zimbra@efficios.com> References: <20181121183936.8176-1-mathieu.desnoyers@efficios.com> <20181122143603.GD23599@brightrain.aerifal.cx> <782067422.9852.1542899056778.JavaMail.zimbra@efficios.com> <87a7m1ywni.fsf@oldenburg.str.redhat.com> <20181122151710.GF23599@brightrain.aerifal.cx> <875zwpyw81.fsf@oldenburg.str.redhat.com> Mime-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: 7bit Return-path: In-Reply-To: <875zwpyw81.fsf@oldenburg.str.redhat.com> Sender: linux-kernel-owner@vger.kernel.org To: Florian Weimer Cc: Rich Felker , carlos , Joseph Myers , Szabolcs Nagy , libc-alpha , Thomas Gleixner , Ben Maurer , Peter Zijlstra , "Paul E. McKenney" , Boqun Feng , Will Deacon , Dave Watson , Paul Turner , linux-kernel , linux-api List-Id: linux-api@vger.kernel.org ----- On Nov 22, 2018, at 10:21 AM, Florian Weimer fweimer@redhat.com wrote: > * Rich Felker: > >> On Thu, Nov 22, 2018 at 04:11:45PM +0100, Florian Weimer wrote: >>> * Mathieu Desnoyers: >>> >>> > Thoughts ? >>> > >>> > /* Unregister rseq TLS from kernel. */ >>> > if (has_rseq && __rseq_unregister_current_thread ()) >>> > abort(); >>> > >>> > advise_stack_range (pd->stackblock, pd->stackblock_size, (uintptr_t) pd, >>> > pd->guardsize); >>> > >>> > /* If the thread is detached free the TCB. */ >>> > if (IS_DETACHED (pd)) >>> > /* Free the TCB. */ >>> > __free_tcb (pd); >>> >>> Considering that we proceed to free the TCB, I really hope that all >>> signals are blocked at this point. (I have not checked this, though.) >>> >>> Wouldn't this address your concern about access to the rseq area? >> >> I'm not familiar with glibc's logic here, but for other reasons, I >> don't think freeing it is safe until the kernel task exit futex (set >> via clone or set_tid_address) has fired. I would guess __free_tcb just >> sets up for it to be reclaimable when this happens rather than >> immediately freeing it for reuse. > > Right, but in case of user-supplied stacks, we actually free TLS memory > at this point, so signals need to be blocked because the TCB is > (partially) gone after that. Unfortuntately, disabling signals is not enough. With rseq registered, the kernel accesses the rseq TLS area when returning to user-space after _preemption_ of user-space, which can be triggered at any point by an interrupt or a fault, even if signals are blocked. So if there are cases where the TLS memory is freed while the thread is still running, we _need_ to explicitly unregister rseq beforehand. Thanks, Mathieu -- Mathieu Desnoyers EfficiOS Inc. http://www.efficios.com