From: Andy Lutomirski <luto@kernel.org>
To: Thomas Gleixner <tglx@linutronix.de>, x86@kernel.org
Cc: Dave Hansen <dave.hansen@intel.com>,
stable@vger.kernel.org,
syzbot+2067e764dbcd10721e2e@syzkaller.appspotmail.com
Subject: Re: [RFC v2 1/2] x86/fpu: Fix state corruption in __fpu__restore_sig()
Date: Sun, 30 May 2021 16:41:52 -0700 [thread overview]
Message-ID: <27deedb9-1e80-fe62-e072-1cc4904f20d5@kernel.org> (raw)
In-Reply-To: <87a6ob6ft2.ffs@nanos.tec.linutronix.de>
On 5/30/21 3:02 PM, Thomas Gleixner wrote:
> Andy,
>
> On Sat, May 29 2021 at 22:12, Andy Lutomirski wrote:
>>
>> Cc: stable@vger.kernel.org
>> Fixes: b860eb8dce59 ("x86/fpu/xstate: Define new functions for clearing fpregs and xstates")
>> Reported-by: syzbot+2067e764dbcd10721e2e@syzkaller.appspotmail.com
>
> Debugged-by ...
>
>> Signed-off-by: Andy Lutomirski <luto@kernel.org>
>
> ...
>
>> /*
>> - * Clear the FPU state back to init state.
>> - *
>> - * Called by sys_execve(), by the signal handler code and by various
>> - * error paths.
>> + * Reset current's user FPU states to the init states. The caller promises
>> + * that current's supervisor states (in memory or CPU regs as appropriate)
>> + * as well as the XSAVE header in memory are intact.
^^^ The caller promises this
>> */
>> -static void fpu__clear(struct fpu *fpu, bool user_only)
>> +void fpu__clear_user_states(struct fpu *fpu)
>> {
>> WARN_ON_FPU(fpu != ¤t->thread.fpu);
>>
...
>> + /*
>> + * Reset user states in registers.
>> + */
>> + copy_init_fpstate_to_fpregs(xfeatures_mask_user());
>> +
>> + /*
>> + * Now all FPU registers have their desired values. Inform the
>> + * FPU state machine that current's FPU registers are in the
>> + * hardware registers.
>> + */
>> fpregs_mark_activate();
>> +
>> fpregs_unlock();
>
> This is as wrong as before. The corrupted task->fpu.state still
> survives.
See above. This function is not intended to fix the header.
>
> For f*cks sake, I gave you a reproducer and a working patch and I
> explained it in great length what's broken and what needs to be fixed.
This patch fixes your reproducer and my (to-be-sent) reproducer. I
tested it on a machine that physically has the XRSTORS instruction but I
disabled it using virt. Are you still seeing failures with this patch
applied? I can try to test on a different CPU.
>
> And of course you kept the bug which was in the offending commit,
> i.e. not wiping the task->fpu.state corruption which causes the next
> XRSTOR to fail:
>
> [ 34.095020] Bad FPU state detected at copy_kernel_to_fpregs+0x28/0x40, reinitializing FPU registers.
> [ 34.095052] WARNING: CPU: 0 PID: 1364 at arch/x86/mm/extable.c:65 ex_handler_fprestore+0x5f/0x70
> ...
> [ 34.153472] switch_fpu_return+0x40/0xb0
> [ 34.154196] exit_to_user_mode_prepare+0x8f/0x180
> [ 34.155060] syscall_exit_to_user_mode+0x23/0x50
> [ 34.155912] do_syscall_64+0x4d/0xb0
I don't think that the code path that calls fpu__clear_user_states() is
subject to header corruption. If it is, then both your patch and my
patch have issues -- trying to fish the supervisor state out of a
corrupt XSTATE buffer is asking for trouble.
>
> IOW, this is exactly the same shit as we had before. So what is decent
> about this? Define decent...
>
> Why the heck do you think I wasted a couple of days to:
>
> - Analyze the root cause
>
> - Destill a trivial C reproducer
>
> - Come up with a fully working and completely correct fix
>
> Just because, right?
>
> I'm fine with splitting up clear_all() and clear_user(), but what you
> provided is as much of a clusterfuck as the commit it pretends to fix.
>
> Your's seriously grumpy
>
> Thomas
>
next prev parent reply other threads:[~2021-05-30 23:41 UTC|newest]
Thread overview: 17+ messages / expand[flat|nested] mbox.gz Atom feed top
[not found] <cover.1622351443.git.luto@kernel.org>
2021-05-30 5:12 ` [RFC v2 1/2] x86/fpu: Fix state corruption in __fpu__restore_sig() Andy Lutomirski
2021-05-30 22:02 ` Thomas Gleixner
2021-05-30 23:41 ` Andy Lutomirski [this message]
2021-05-31 9:03 ` Thomas Gleixner
2021-05-31 10:01 ` Thomas Gleixner
2021-05-31 18:56 ` Thomas Gleixner
2021-05-31 19:30 ` Andy Lutomirski
2021-05-31 22:46 ` Thomas Gleixner
2021-06-01 4:58 ` Andy Lutomirski
2021-06-01 14:48 ` Dave Hansen
2021-06-01 18:06 ` [PATCH v3 3/5] x86/fpu: Clean up the fpu__clear() variants Dave Hansen
2021-06-01 18:14 ` Andy Lutomirski
2021-06-01 18:35 ` Dave Hansen
2021-06-01 22:44 ` Andy Lutomirski
2021-06-01 18:25 ` Thomas Gleixner
2021-06-01 23:17 ` [RFC v2 1/2] x86/fpu: Fix state corruption in __fpu__restore_sig() Thomas Gleixner
2021-05-31 19:48 ` Thomas Gleixner
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=27deedb9-1e80-fe62-e072-1cc4904f20d5@kernel.org \
--to=luto@kernel.org \
--cc=dave.hansen@intel.com \
--cc=stable@vger.kernel.org \
--cc=syzbot+2067e764dbcd10721e2e@syzkaller.appspotmail.com \
--cc=tglx@linutronix.de \
--cc=x86@kernel.org \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox