From mboxrd@z Thu Jan 1 00:00:00 1970
From: ebiederm@xmission.com (Eric W. Biederman)
Subject: Re: Kernel stack read with PTRACE_EVENT_EXIT and io_uring threads
Date: Mon, 21 Jun 2021 11:50:56 -0500
Message-ID: <87czsfi2kv.fsf@disp2133>
References: <87sg1lwhvm.fsf@disp2133>
 <6e47eff8-d0a4-8390-1222-e975bfbf3a65@gmail.com>
 <924ec53c-2fd9-2e1c-bbb1-3fda49809be4@gmail.com>
 <87eed4v2dc.fsf@disp2133>
 <5929e116-fa61-b211-342a-c706dcb834ca@gmail.com>
 <87fsxjorgs.fsf@disp2133>
Mime-Version: 1.0
Return-path:
In-Reply-To: (Al Viro's message of "Mon, 21 Jun 2021 14:16:21 +0000")
List-ID:
Content-Type: text/plain; charset="us-ascii"
Content-Transfer-Encoding: 7bit
To: Al Viro
Cc: Linus Torvalds, Michael Schmitz, linux-arch, Jens Axboe,
 Oleg Nesterov, Linux Kernel Mailing List, Richard Henderson,
 Ivan Kokshaysky, Matt Turner, alpha, Geert Uytterhoeven, linux-m68k,
 Arnd Bergmann, Ley Foon Tan, Tejun Heo, Kees Cook

Al Viro writes:

> On Mon, Jun 21, 2021 at 01:54:56PM +0000, Al Viro wrote:
>> On Tue, Jun 15, 2021 at 02:58:12PM -0700, Linus Torvalds wrote:
>>
>> > And I think our horrible "kernel threads return to user space when
>> > done" is absolutely horrifically nasty. Maybe of the clever sort, but
>> > mostly of the historical horror sort.
>>
>> How would you prefer to handle that, then?  Separate magical path from
>> kernel_execve() to switch to userland?  We used to have something of
>> that sort, and that had been a real horror...
>>
>> As it is, it's "kernel thread is spawned at the point similar to
>> ret_from_fork(), runs the payload (which almost never returns) and
>> then proceeds out to userland, same way fork(2) would've done."
>> That way kernel_execve() doesn't have to do anything magical.
>>
>> Al, digging through the old notes and current call graph...
>
> FWIW, the major assumption back then had been that get_signal(),
> signal_delivered() and all associated machinery (including coredumps)
> runs *only* from SIGPENDING/NOTIFY_SIGNAL handling.
>
> And "has complete registers on stack" is only a part of that;
> there was other fun stuff in the area ;-/  Do we want coredumps for
> those, and if we do, will the de_thread stuff work there?

Do we want coredumps from processes that use io_uring?  Yes.

Exactly what we want from io_uring threads is less clear.  We can't
really give much that is meaningful beyond the thread ids of the
io_uring threads.

What problems are you seeing beyond the missing registers on the
stack for kernel threads?

I don't immediately see the connection between coredumps and de_thread.

The function de_thread arranges for fatal_signal_pending to be true,
and that should work just fine for io_uring threads.  The io_uring
threads process the fatal signal with get_signal and then proceed to
exit, eventually calling do_exit.

Eric