* [PATCH v2] umh: fix out of scope usage when the process is being killed
@ 2022-12-14 13:46 Schspa Shi
2022-12-14 19:59 ` Luis Chamberlain
0 siblings, 1 reply; 3+ messages in thread
From: Schspa Shi @ 2022-12-14 13:46 UTC (permalink / raw)
To: mcgrof
Cc: linux-kernel, Schspa Shi, syzbot+10d19d528d9755d9af22,
syzbot+70d5d5d83d03db2c813d, syzbot+83cb0411d0fcf0a30fc1,
syzbot+c92c6a251d49ceceb625
When the process is killed, wait_for_completion_state will return with
-ERESTARTSYS, and the completion variable in the stack will be unavailable,
even freed. If the user-mode thread is complete at the same time, there
will be a race to use a unavailable variable.
Please refer to the following scenarios.
T1 T2
------------------------------------------------------------------
call_usermodehelper_exec
call_usermodehelper_exec_async
<< do something >>
umh_complete(sub_info);
comp = xchg(&sub_info->complete, NULL);
/* we got the completion */
<< context switch >>
<< Being killed >>
retval = wait_for_completion_state(sub_info->complete, state);
if (!retval)
goto wait_done;
if (wait & UMH_KILLABLE) {
/* umh_complete() will see NULL and free sub_info */
if (xchg(&sub_info->complete, NULL))
goto unlock;
<< we can't got the completion, because T2 take it already >>
}
....
return retval;
}
/**
* the completion variable in stack is end of life cycle.
* and maybe freed due to process is recycled.
*/
-------- BUG here----------
if (comp)
complete(comp);
To fix it, we can add an additional wait_for_completion to ensure the
completion object is completely unused. And this is what
kthread_create_on_node does to handle this race.
Reported-by: syzbot+10d19d528d9755d9af22@syzkaller.appspotmail.com
Reported-by: syzbot+70d5d5d83d03db2c813d@syzkaller.appspotmail.com
Reported-by: syzbot+83cb0411d0fcf0a30fc1@syzkaller.appspotmail.com
Reported-by: syzbot+c92c6a251d49ceceb625@syzkaller.appspotmail.com
Signed-off-by: Schspa Shi <schspa@gmail.com>
---
v1->v2:
- Use a new way to fix the race as kthread_create_on_node do.
- Optimize comments and use more accurate words to describe the problem.
kernel/umh.c | 4 ++++
1 file changed, 4 insertions(+)
diff --git a/kernel/umh.c b/kernel/umh.c
index 850631518665..d8350a195c7f 100644
--- a/kernel/umh.c
+++ b/kernel/umh.c
@@ -452,6 +452,10 @@ int call_usermodehelper_exec(struct subprocess_info *sub_info, int wait)
/* umh_complete() will see NULL and free sub_info */
if (xchg(&sub_info->complete, NULL))
goto unlock;
+ /*
+ * umh_complete will call complete() shortly.
+ */
+ wait_for_completion(&done);
}
wait_done:
--
2.37.3
^ permalink raw reply related [flat|nested] 3+ messages in thread* Re: [PATCH v2] umh: fix out of scope usage when the process is being killed
2022-12-14 13:46 [PATCH v2] umh: fix out of scope usage when the process is being killed Schspa Shi
@ 2022-12-14 19:59 ` Luis Chamberlain
2022-12-15 5:11 ` Schspa Shi
0 siblings, 1 reply; 3+ messages in thread
From: Luis Chamberlain @ 2022-12-14 19:59 UTC (permalink / raw)
To: Schspa Shi
Cc: linux-kernel, syzbot+10d19d528d9755d9af22,
syzbot+70d5d5d83d03db2c813d, syzbot+83cb0411d0fcf0a30fc1,
syzbot+c92c6a251d49ceceb625
On Wed, Dec 14, 2022 at 09:46:56PM +0800, Schspa Shi wrote:
> When the process is killed, wait_for_completion_state will return with
> -ERESTARTSYS, and the completion variable in the stack will be unavailable,
> even freed. If the user-mode thread is complete at the same time, there
> will be a race to use a unavailable variable.
>
> Please refer to the following scenarios.
> T1 T2
> ------------------------------------------------------------------
> call_usermodehelper_exec
> call_usermodehelper_exec_async
> << do something >>
> umh_complete(sub_info);
> comp = xchg(&sub_info->complete, NULL);
> /* we got the completion */
> << context switch >>
>
> << Being killed >>
> retval = wait_for_completion_state(sub_info->complete, state);
> if (!retval)
> goto wait_done;
>
> if (wait & UMH_KILLABLE) {
> /* umh_complete() will see NULL and free sub_info */
> if (xchg(&sub_info->complete, NULL))
> goto unlock;
> << we can't got the completion, because T2 take it already >>
> }
> ....
> return retval;
> }
>
> /**
> * the completion variable in stack is end of life cycle.
> * and maybe freed due to process is recycled.
> */
> -------- BUG here----------
> if (comp)
> complete(comp);
>
> To fix it, we can add an additional wait_for_completion to ensure the
> completion object is completely unused. And this is what
> kthread_create_on_node does to handle this race.
>
> Reported-by: syzbot+10d19d528d9755d9af22@syzkaller.appspotmail.com
> Reported-by: syzbot+70d5d5d83d03db2c813d@syzkaller.appspotmail.com
> Reported-by: syzbot+83cb0411d0fcf0a30fc1@syzkaller.appspotmail.com
> Reported-by: syzbot+c92c6a251d49ceceb625@syzkaller.appspotmail.com
> Signed-off-by: Schspa Shi <schspa@gmail.com>
> ---
Please fix the commit log a bit more with the cotext I provided, *if*
on the other thread the community agrees with the approach to be
compartamentalized. I am still not sure why this would fix the
UAF after thinking about it some more, and the issue would mean
there likely could be a generic fix / issue to consider.
So for now I'd like more review of this race and the proposed fix
as I mentioned in the follow-up threaad in your v1 patch. Let's
follow up there and see how that discussion goes.
Luis
^ permalink raw reply [flat|nested] 3+ messages in thread* Re: [PATCH v2] umh: fix out of scope usage when the process is being killed
2022-12-14 19:59 ` Luis Chamberlain
@ 2022-12-15 5:11 ` Schspa Shi
0 siblings, 0 replies; 3+ messages in thread
From: Schspa Shi @ 2022-12-15 5:11 UTC (permalink / raw)
To: Luis Chamberlain
Cc: linux-kernel, syzbot+10d19d528d9755d9af22,
syzbot+70d5d5d83d03db2c813d, syzbot+83cb0411d0fcf0a30fc1,
syzbot+c92c6a251d49ceceb625
Luis Chamberlain <mcgrof@kernel.org> writes:
> On Wed, Dec 14, 2022 at 09:46:56PM +0800, Schspa Shi wrote:
>> When the process is killed, wait_for_completion_state will return with
>> -ERESTARTSYS, and the completion variable in the stack will be unavailable,
>> even freed. If the user-mode thread is complete at the same time, there
>> will be a race to use a unavailable variable.
>>
>> Please refer to the following scenarios.
>> T1 T2
>> ------------------------------------------------------------------
>> call_usermodehelper_exec
>> call_usermodehelper_exec_async
>> << do something >>
>> umh_complete(sub_info);
>> comp = xchg(&sub_info->complete, NULL);
>> /* we got the completion */
>> << context switch >>
>>
>> << Being killed >>
>> retval = wait_for_completion_state(sub_info->complete, state);
>> if (!retval)
>> goto wait_done;
>>
>> if (wait & UMH_KILLABLE) {
>> /* umh_complete() will see NULL and free sub_info */
>> if (xchg(&sub_info->complete, NULL))
>> goto unlock;
>> << we can't got the completion, because T2 take it already >>
>> }
>> ....
>> return retval;
>> }
>>
>> /**
>> * the completion variable in stack is end of life cycle.
>> * and maybe freed due to process is recycled.
>> */
>> -------- BUG here----------
>> if (comp)
>> complete(comp);
>>
>> To fix it, we can add an additional wait_for_completion to ensure the
>> completion object is completely unused. And this is what
>> kthread_create_on_node does to handle this race.
>>
>> Reported-by: syzbot+10d19d528d9755d9af22@syzkaller.appspotmail.com
>> Reported-by: syzbot+70d5d5d83d03db2c813d@syzkaller.appspotmail.com
>> Reported-by: syzbot+83cb0411d0fcf0a30fc1@syzkaller.appspotmail.com
>> Reported-by: syzbot+c92c6a251d49ceceb625@syzkaller.appspotmail.com
>> Signed-off-by: Schspa Shi <schspa@gmail.com>
>> ---
>
> Please fix the commit log a bit more with the cotext I provided, *if*
> on the other thread the community agrees with the approach to be
> compartamentalized. I am still not sure why this would fix the
> UAF after thinking about it some more, and the issue would mean
> there likely could be a generic fix / issue to consider.
>
I think a syntactic sugar for a complete api can be added here for a
generic usage.
> So for now I'd like more review of this race and the proposed fix
> as I mentioned in the follow-up threaad in your v1 patch. Let's
> follow up there and see how that discussion goes.
>
Ok, let's talk about this on the v1 patch's thread.
> Luis
--
BRs
Schspa Shi
^ permalink raw reply [flat|nested] 3+ messages in thread
end of thread, other threads:[~2022-12-15 6:06 UTC | newest]
Thread overview: 3+ messages (download: mbox.gz follow: Atom feed
-- links below jump to the message on this page --
2022-12-14 13:46 [PATCH v2] umh: fix out of scope usage when the process is being killed Schspa Shi
2022-12-14 19:59 ` Luis Chamberlain
2022-12-15 5:11 ` Schspa Shi
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox