From: Andy Lutomirski <luto@kernel.org>
To: Thomas Gleixner <tglx@linutronix.de>, x86@kernel.org
Cc: Dave Hansen <dave.hansen@intel.com>,
stable@vger.kernel.org,
syzbot+2067e764dbcd10721e2e@syzkaller.appspotmail.com
Subject: Re: [RFC v2 1/2] x86/fpu: Fix state corruption in __fpu__restore_sig()
Date: Sun, 30 May 2021 16:41:52 -0700 [thread overview]
Message-ID: <27deedb9-1e80-fe62-e072-1cc4904f20d5@kernel.org> (raw)
In-Reply-To: <87a6ob6ft2.ffs@nanos.tec.linutronix.de>
On 5/30/21 3:02 PM, Thomas Gleixner wrote:
> Andy,
>
> On Sat, May 29 2021 at 22:12, Andy Lutomirski wrote:
>>
>> Cc: stable@vger.kernel.org
>> Fixes: b860eb8dce59 ("x86/fpu/xstate: Define new functions for clearing fpregs and xstates")
>> Reported-by: syzbot+2067e764dbcd10721e2e@syzkaller.appspotmail.com
>
> Debugged-by ...
>
>> Signed-off-by: Andy Lutomirski <luto@kernel.org>
>
> ...
>
>> /*
>> - * Clear the FPU state back to init state.
>> - *
>> - * Called by sys_execve(), by the signal handler code and by various
>> - * error paths.
>> + * Reset current's user FPU states to the init states. The caller promises
>> + * that current's supervisor states (in memory or CPU regs as appropriate)
>> + * as well as the XSAVE header in memory are intact.
^^^ The caller promises this
>> */
>> -static void fpu__clear(struct fpu *fpu, bool user_only)
>> +void fpu__clear_user_states(struct fpu *fpu)
>> {
>> WARN_ON_FPU(fpu != ¤t->thread.fpu);
>>
...
>> + /*
>> + * Reset user states in registers.
>> + */
>> + copy_init_fpstate_to_fpregs(xfeatures_mask_user());
>> +
>> + /*
>> + * Now all FPU registers have their desired values. Inform the
>> + * FPU state machine that current's FPU registers are in the
>> + * hardware registers.
>> + */
>> fpregs_mark_activate();
>> +
>> fpregs_unlock();
>
> This is as wrong as before. The corrupted task->fpu.state still
> survives.
See above. This function is not intended to fix the header.
>
> For f*cks sake, I gave you a reproducer and a working patch and I
> explained it in great length what's broken and what needs to be fixed.
This patch fixes your reproducer and my (to-be-sent) reproducer. I
tested it on a machine that physically has the XRSTORS instruction but I
disabled it using virt. Are you still seeing failures with this patch
applied? I can try to test on a different CPU.
>
> And of course you kept the bug which was in the offending commit,
> i.e. not wiping the task->fpu.state corruption which causes the next
> XRSTOR to fail:
>
> [ 34.095020] Bad FPU state detected at copy_kernel_to_fpregs+0x28/0x40, reinitializing FPU registers.
> [ 34.095052] WARNING: CPU: 0 PID: 1364 at arch/x86/mm/extable.c:65 ex_handler_fprestore+0x5f/0x70
> ...
> [ 34.153472] switch_fpu_return+0x40/0xb0
> [ 34.154196] exit_to_user_mode_prepare+0x8f/0x180
> [ 34.155060] syscall_exit_to_user_mode+0x23/0x50
> [ 34.155912] do_syscall_64+0x4d/0xb0
I don't think that the code path that calls fpu__clear_user_states() is
subject to header corruption. If it is, then both your patch and my
patch have issues -- trying to fish the supervisor state out of a
corrupt XSTATE buffer is asking for trouble.
>
> IOW, this is exactly the same shit as we had before. So what is decent
> about this? Define decent...
>
> Why the heck do you think I wasted a couple of days to:
>
> - Analyze the root cause
>
> - Destill a trivial C reproducer
>
> - Come up with a fully working and completely correct fix
>
> Just because, right?
>
> I'm fine with splitting up clear_all() and clear_user(), but what you
> provided is as much of a clusterfuck as the commit it pretends to fix.
>
> Your's seriously grumpy
>
> Thomas
>
next prev parent reply other threads:[~2021-05-30 23:41 UTC|newest]
Thread overview: 17+ messages / expand[flat|nested] mbox.gz Atom feed top
[not found] <cover.1622351443.git.luto@kernel.org>
2021-05-30 5:12 ` [RFC v2 1/2] x86/fpu: Fix state corruption in __fpu__restore_sig() Andy Lutomirski
2021-05-30 22:02 ` Thomas Gleixner
2021-05-30 23:41 ` Andy Lutomirski [this message]
2021-05-31 9:03 ` Thomas Gleixner
2021-05-31 10:01 ` Thomas Gleixner
2021-05-31 18:56 ` Thomas Gleixner
2021-05-31 19:30 ` Andy Lutomirski
2021-05-31 22:46 ` Thomas Gleixner
2021-06-01 4:58 ` Andy Lutomirski
2021-06-01 14:48 ` Dave Hansen
2021-06-01 18:06 ` [PATCH v3 3/5] x86/fpu: Clean up the fpu__clear() variants Dave Hansen
2021-06-01 18:14 ` Andy Lutomirski
2021-06-01 18:35 ` Dave Hansen
2021-06-01 22:44 ` Andy Lutomirski
2021-06-01 18:25 ` Thomas Gleixner
2021-06-01 23:17 ` [RFC v2 1/2] x86/fpu: Fix state corruption in __fpu__restore_sig() Thomas Gleixner
2021-05-31 19:48 ` Thomas Gleixner
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=27deedb9-1e80-fe62-e072-1cc4904f20d5@kernel.org \
--to=luto@kernel.org \
--cc=dave.hansen@intel.com \
--cc=stable@vger.kernel.org \
--cc=syzbot+2067e764dbcd10721e2e@syzkaller.appspotmail.com \
--cc=tglx@linutronix.de \
--cc=x86@kernel.org \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.