* [PATCH] mm_access: simplify the security checks
@ 2026-05-30 13:56 Oleg Nesterov
2026-05-30 14:10 ` Oleg Nesterov
` (2 more replies)
0 siblings, 3 replies; 9+ messages in thread
From: Oleg Nesterov @ 2026-05-30 13:56 UTC (permalink / raw)
To: Andrew Morton
Cc: Christian Brauner, David Hildenbrand, Jann Horn, Kees Cook,
Lorenzo Stoakes, Michal Hocko, linux-kernel, linux-mm
1. Shift the fast-path "mm == current->mm" check from may_access_mm()
to mm_access(), and do it locklessly.
task->mm is not stable but we do not care. We can race with exec,
but in this case we pin/return current->mm. This doesn't differ
from the case where the target execs after we drop exec_update_lock.
All we need for correctness is READ_ONCE() to ensure the compiler
won't reload task->mm. This is not enough for KCSAN, but we already
have other lockless ->mm LOAD's. We should probably change exec_mmap/
exit_mm to use WRITE_ONCE().
2. With the change above may_access_mm() doesn't need the "mm" argument,
so we do not need to call get_task_mm() beforehand, we can call it
only if may_access_mm() suceeds.
2. With the change above, may_access_mm() doesn't need the "mm" argument,
so we do not need to call get_task_mm() beforehand. We can call it
only if may_access_mm() succeeds.
Signed-off-by: Oleg Nesterov <oleg@redhat.com>
---
kernel/fork.c | 30 ++++++++++++++++--------------
1 file changed, 16 insertions(+), 14 deletions(-)
diff --git a/kernel/fork.c b/kernel/fork.c
index b8b651abce8b..3239380ab93b 100644
--- a/kernel/fork.c
+++ b/kernel/fork.c
@@ -1381,10 +1381,8 @@ struct mm_struct *get_task_mm(struct task_struct *task)
 }
 EXPORT_SYMBOL_GPL(get_task_mm);

-static bool may_access_mm(struct mm_struct *mm, struct task_struct *task, unsigned int mode)
+static bool may_access_mm(struct task_struct *task, unsigned int mode)
 {
-	if (mm == current->mm)
-		return true;
 	if (ptrace_may_access(task, mode))
 		return true;
 	if ((mode & PTRACE_MODE_READ) && perfmon_capable())
@@ -1394,20 +1392,24 @@ static bool may_access_mm(struct mm_struct *mm, struct task_struct *task, unsign

 struct mm_struct *mm_access(struct task_struct *task, unsigned int mode)
 {
-	struct mm_struct *mm;
-	int err;
+	struct mm_struct *mm = READ_ONCE(task->mm);

-	err = down_read_killable(&task->signal->exec_update_lock);
-	if (err)
-		return ERR_PTR(err);
+	if (!mm || (task->flags & PF_KTHREAD))
+		return ERR_PTR(-ESRCH);

-	mm = get_task_mm(task);
-	if (!mm) {
-		mm = ERR_PTR(-ESRCH);
-	} else if (!may_access_mm(mm, task, mode)) {
-		mmput(mm);
-		mm = ERR_PTR(-EACCES);
+	if (mm == current->mm) {
+		mmget(mm);
+		return mm;
 	}
+
+	if (down_read_killable(&task->signal->exec_update_lock))
+		return ERR_PTR(-EINTR);
+
+	if (may_access_mm(task, mode))
+		mm = get_task_mm(task) ?: ERR_PTR(-ESRCH);
+	else
+		mm = ERR_PTR(-EACCES);
+
 	up_read(&task->signal->exec_update_lock);

 	return mm;
--
2.52.0
^ permalink raw reply related [flat|nested] 9+ messages in thread
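The patch above reports failures from mm_access() through ERR_PTR(), so a caller gets either a valid mm pointer or an encoded negative errno. A minimal userspace sketch of that encoding follows. The *_model names and err_ptr_demo() are hypothetical stand-ins written for illustration, not the kernel's helpers; the only detail taken from the kernel is that the top 4095 values of the address space are reserved for error codes.

```c
#include <assert.h>
#include <errno.h>
#include <stdbool.h>
#include <stdint.h>

/* Mirrors the kernel's MAX_ERRNO; the name is made up for this sketch. */
#define MAX_ERRNO_MODEL 4095

/* Encode a negative errno in a pointer value (models ERR_PTR()). */
static void *err_ptr_model(long error)
{
	return (void *)(intptr_t)error;
}

/* Recover the errno from an encoded pointer (models PTR_ERR()). */
static long ptr_err_model(const void *ptr)
{
	return (long)(intptr_t)ptr;
}

/* True if the pointer lies in the reserved error range (models IS_ERR()). */
static bool is_err_model(const void *ptr)
{
	return (uintptr_t)ptr >= (uintptr_t)-MAX_ERRNO_MODEL;
}

/* Returns 0 if the round trip behaves as described, nonzero otherwise. */
static int err_ptr_demo(void)
{
	int object = 0;
	void *ok = &object;
	void *bad = err_ptr_model(-ESRCH);

	if (is_err_model(ok))
		return 1;	/* an ordinary address is not an error */
	if (!is_err_model(bad))
		return 2;	/* an encoded errno is recognised */
	if (ptr_err_model(bad) != -ESRCH)
		return 3;	/* and decodes to the same value */
	return 0;
}
```

A caller would test the result with the IS_ERR() equivalent before dereferencing it, which is why every failure branch in the patch has to produce an encoded errno rather than NULL.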
* Re: [PATCH] mm_access: simplify the security checks
2026-05-30 13:56 [PATCH] mm_access: simplify the security checks Oleg Nesterov
@ 2026-05-30 14:10 ` Oleg Nesterov
2026-06-01 11:16 ` Lorenzo Stoakes
2026-05-30 14:12 ` [PATCH v2] " Oleg Nesterov
2026-05-30 15:00 ` [PATCH] " Oleg Nesterov
2 siblings, 1 reply; 9+ messages in thread
From: Oleg Nesterov @ 2026-05-30 14:10 UTC (permalink / raw)
To: Andrew Morton
Cc: Christian Brauner, David Hildenbrand, Jann Horn, Kees Cook,
Lorenzo Stoakes, Michal Hocko, linux-kernel, linux-mm
On 05/30, Oleg Nesterov wrote:
>
> 2. With the change above may_access_mm() doesn't need the "mm" argument,
> so we do not need to call get_task_mm() beforehand, we can call it
> only if may_access_mm() suceeds.
>
> 2. With the change above, may_access_mm() doesn't need the "mm" argument,
> so we do not need to call get_task_mm() beforehand. We can call it
> only if may_access_mm() succeeds.
OOPS. Somehow I wrote the same twice. Will fix the changelog and send V2
in reply to this patch...
Oleg.
^ permalink raw reply [flat|nested] 9+ messages in thread
* [PATCH v2] mm_access: simplify the security checks
2026-05-30 13:56 [PATCH] mm_access: simplify the security checks Oleg Nesterov
2026-05-30 14:10 ` Oleg Nesterov
@ 2026-05-30 14:12 ` Oleg Nesterov
2026-06-01 12:04 ` David Hildenbrand (Arm)
2026-06-01 12:12 ` Lorenzo Stoakes
2026-05-30 15:00 ` [PATCH] " Oleg Nesterov
2 siblings, 2 replies; 9+ messages in thread
From: Oleg Nesterov @ 2026-05-30 14:12 UTC (permalink / raw)
To: Andrew Morton
Cc: Christian Brauner, David Hildenbrand, Jann Horn, Kees Cook,
Lorenzo Stoakes, Michal Hocko, linux-kernel, linux-mm
1. Shift the fast-path "mm == current->mm" check from may_access_mm()
to mm_access(), and do it locklessly.
task->mm is not stable but we do not care. We can race with exec,
but in this case we pin/return current->mm. This doesn't differ
from the case where the target execs after we drop exec_update_lock.
All we need for correctness is READ_ONCE() to ensure the compiler
won't reload task->mm. This is not enough for KCSAN, but we already
have other lockless ->mm LOAD's. We should probably change exec_mmap/
exit_mm to use WRITE_ONCE().
2. With the change above, may_access_mm() doesn't need the "mm" argument,
so we do not need to call get_task_mm() beforehand. We can call it
only if may_access_mm() succeeds.
Signed-off-by: Oleg Nesterov <oleg@redhat.com>
---
kernel/fork.c | 30 ++++++++++++++++--------------
1 file changed, 16 insertions(+), 14 deletions(-)
diff --git a/kernel/fork.c b/kernel/fork.c
index b8b651abce8b..3239380ab93b 100644
--- a/kernel/fork.c
+++ b/kernel/fork.c
@@ -1381,10 +1381,8 @@ struct mm_struct *get_task_mm(struct task_struct *task)
 }
 EXPORT_SYMBOL_GPL(get_task_mm);

-static bool may_access_mm(struct mm_struct *mm, struct task_struct *task, unsigned int mode)
+static bool may_access_mm(struct task_struct *task, unsigned int mode)
 {
-	if (mm == current->mm)
-		return true;
 	if (ptrace_may_access(task, mode))
 		return true;
 	if ((mode & PTRACE_MODE_READ) && perfmon_capable())
@@ -1394,20 +1392,24 @@ static bool may_access_mm(struct mm_struct *mm, struct task_struct *task, unsign

 struct mm_struct *mm_access(struct task_struct *task, unsigned int mode)
 {
-	struct mm_struct *mm;
-	int err;
+	struct mm_struct *mm = READ_ONCE(task->mm);

-	err = down_read_killable(&task->signal->exec_update_lock);
-	if (err)
-		return ERR_PTR(err);
+	if (!mm || (task->flags & PF_KTHREAD))
+		return ERR_PTR(-ESRCH);

-	mm = get_task_mm(task);
-	if (!mm) {
-		mm = ERR_PTR(-ESRCH);
-	} else if (!may_access_mm(mm, task, mode)) {
-		mmput(mm);
-		mm = ERR_PTR(-EACCES);
+	if (mm == current->mm) {
+		mmget(mm);
+		return mm;
 	}
+
+	if (down_read_killable(&task->signal->exec_update_lock))
+		return ERR_PTR(-EINTR);
+
+	if (may_access_mm(task, mode))
+		mm = get_task_mm(task) ?: ERR_PTR(-ESRCH);
+	else
+		mm = ERR_PTR(-EACCES);
+
 	up_read(&task->signal->exec_update_lock);

 	return mm;
--
2.52.0
^ permalink raw reply related [flat|nested] 9+ messages in thread
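The control flow that the v2 patch gives mm_access() can be sketched in userspace roughly as follows. Everything here is an assumption made for illustration: struct mm_model, struct task_model, the may_access flag and the lock_acquisitions counter are hypothetical stand-ins for mm_struct, task_struct, the may_access_mm() verdict and exec_update_lock. The sketch only shows which paths take the lock and which bump the user count; it says nothing about whether the lockless path is safe in the kernel.

```c
#include <assert.h>
#include <errno.h>
#include <stdbool.h>
#include <stddef.h>

/* Hypothetical userspace stand-ins for the kernel objects. */
struct mm_model {
	int users;		/* models mm_users */
};

struct task_model {
	struct mm_model *mm;
	bool kthread;		/* models PF_KTHREAD */
	bool may_access;	/* models the may_access_mm() verdict */
	int lock_acquisitions;	/* counts read-side uses of the lock */
};

/* Returns 0 and sets *out on success, or a negative errno. */
static int mm_access_model(struct task_model *task,
			   const struct task_model *curr,
			   struct mm_model **out)
{
	/* One snapshot of task->mm, standing in for READ_ONCE(). */
	struct mm_model *mm = *(struct mm_model *volatile *)&task->mm;

	*out = NULL;
	if (!mm || task->kthread)
		return -ESRCH;

	if (mm == curr->mm) {		/* fast path: no lock, no check */
		mm->users++;		/* models mmget() */
		*out = mm;
		return 0;
	}

	task->lock_acquisitions++;	/* slow path: lock taken here */
	if (!task->may_access)
		return -EACCES;
	if (!task->mm)			/* the mm may be gone by now */
		return -ESRCH;
	task->mm->users++;		/* models get_task_mm() */
	*out = task->mm;
	return 0;
}

/* Returns 0 if the model behaves as described, nonzero otherwise. */
static int mm_access_demo(void)
{
	struct mm_model mine = { 1 }, theirs = { 1 };
	struct task_model self = { &mine, false, false, 0 };
	struct task_model other = { &theirs, false, false, 0 };
	struct mm_model *mm;

	if (mm_access_model(&self, &self, &mm) != 0 || mm != &mine)
		return 1;
	if (self.lock_acquisitions != 0 || mine.users != 2)
		return 2;	/* own mm: pinned without the lock */
	if (mm_access_model(&other, &self, &mm) != -EACCES)
		return 3;
	if (other.lock_acquisitions != 1 || theirs.users != 1)
		return 4;	/* denied: lock taken, nothing pinned */
	other.may_access = true;
	if (mm_access_model(&other, &self, &mm) != 0 || theirs.users != 2)
		return 5;	/* allowed: pinned under the lock */
	return 0;
}
```

In this model, as in the patch, a denied request never touches the user count, whereas the pre-patch code took a reference first and dropped it again with mmput() on failure.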
* Re: [PATCH] mm_access: simplify the security checks
2026-05-30 13:56 [PATCH] mm_access: simplify the security checks Oleg Nesterov
2026-05-30 14:10 ` Oleg Nesterov
2026-05-30 14:12 ` [PATCH v2] " Oleg Nesterov
@ 2026-05-30 15:00 ` Oleg Nesterov
2 siblings, 0 replies; 9+ messages in thread
From: Oleg Nesterov @ 2026-05-30 15:00 UTC (permalink / raw)
To: Andrew Morton
Cc: Christian Brauner, David Hildenbrand, Jann Horn, Kees Cook,
Lorenzo Stoakes, Michal Hocko, linux-kernel, linux-mm
On 05/30, Oleg Nesterov wrote:
>
> 1. Shift the fast-path "mm == current->mm" check from may_access_mm()
> to mm_access(), and do it locklessly.
Heh ;) sashiko has some concerns, see
https://sashiko.dev/#/patchset/ahrsfJE3NkKjShEX%40redhat.com
I still think this is fine, but I'll recheck tomorrow.
Oleg.
^ permalink raw reply [flat|nested] 9+ messages in thread
* Re: [PATCH] mm_access: simplify the security checks
2026-05-30 14:10 ` Oleg Nesterov
@ 2026-06-01 11:16 ` Lorenzo Stoakes
0 siblings, 0 replies; 9+ messages in thread
From: Lorenzo Stoakes @ 2026-06-01 11:16 UTC (permalink / raw)
To: Oleg Nesterov
Cc: Andrew Morton, Christian Brauner, David Hildenbrand, Jann Horn,
Kees Cook, Michal Hocko, linux-kernel, linux-mm
On Sat, May 30, 2026 at 04:10:07PM +0200, Oleg Nesterov wrote:
> On 05/30, Oleg Nesterov wrote:
> >
> > 2. With the change above may_access_mm() doesn't need the "mm" argument,
> > so we do not need to call get_task_mm() beforehand, we can call it
> > only if may_access_mm() suceeds.
> >
> > 2. With the change above, may_access_mm() doesn't need the "mm" argument,
> > so we do not need to call get_task_mm() beforehand. We can call it
> > only if may_access_mm() succeeds.
>
> OOPS. Somehow I wrote the same twice. Will fix the changelog and send V2
> in reply to this patch...
Sorry, pedantic side-note here, but in general in mm we prefer that you don't send a
new revision in reply to the prior one, as that makes it easier to miss! :)
>
> Oleg.
>
Cheers, Lorenzo
^ permalink raw reply [flat|nested] 9+ messages in thread
* Re: [PATCH v2] mm_access: simplify the security checks
2026-05-30 14:12 ` [PATCH v2] " Oleg Nesterov
@ 2026-06-01 12:04 ` David Hildenbrand (Arm)
2026-06-01 12:31 ` Oleg Nesterov
2026-06-01 12:12 ` Lorenzo Stoakes
1 sibling, 1 reply; 9+ messages in thread
From: David Hildenbrand (Arm) @ 2026-06-01 12:04 UTC (permalink / raw)
To: Oleg Nesterov, Andrew Morton
Cc: Christian Brauner, Jann Horn, Kees Cook, Lorenzo Stoakes,
Michal Hocko, linux-kernel, linux-mm
On 5/30/26 16:12, Oleg Nesterov wrote:
I am not sure I spot the "simplification" here?
Looks more like an optimization that makes the code slightly more complicated.
> 1. Shift the fast-path "mm == current->mm" check from may_access_mm()
> to mm_access(), and do it locklessly.
>
> task->mm is not stable but we do not care. We can race with exec,
> but in this case we pin/return current->mm. This doesn't differ
> from the case where the target execs after we drop exec_update_lock.
>
> All we need for correctness is READ_ONCE() to ensure the compiler
> won't reload task->mm. This is not enough for KCSAN, but we already
> have other lockless ->mm LOAD's. We should probably change exec_mmap/
> exit_mm to use WRITE_ONCE().
>
> 2. With the change above, may_access_mm() doesn't need the "mm" argument,
> so we do not need to call get_task_mm() beforehand. We can call it
> only if may_access_mm() succeeds.
>
> Signed-off-by: Oleg Nesterov <oleg@redhat.com>
> ---
> kernel/fork.c | 30 ++++++++++++++++--------------
> 1 file changed, 16 insertions(+), 14 deletions(-)
>
> diff --git a/kernel/fork.c b/kernel/fork.c
> index b8b651abce8b..3239380ab93b 100644
> --- a/kernel/fork.c
> +++ b/kernel/fork.c
> @@ -1381,10 +1381,8 @@ struct mm_struct *get_task_mm(struct task_struct *task)
>  }
>  EXPORT_SYMBOL_GPL(get_task_mm);
>
> -static bool may_access_mm(struct mm_struct *mm, struct task_struct *task, unsigned int mode)
> +static bool may_access_mm(struct task_struct *task, unsigned int mode)
We now have a function named "may_access_mm" that is not given an mm_struct
pointer ... it now looks more like an extended, mm-independent ptrace check.
So probably we should rename that function.
>  {
> -	if (mm == current->mm)
> -		return true;
>  	if (ptrace_may_access(task, mode))
>  		return true;
>  	if ((mode & PTRACE_MODE_READ) && perfmon_capable())
> @@ -1394,20 +1392,24 @@ static bool may_access_mm(struct mm_struct *mm, struct task_struct *task, unsign
>
>  struct mm_struct *mm_access(struct task_struct *task, unsigned int mode)
>  {
> -	struct mm_struct *mm;
> -	int err;
> +	struct mm_struct *mm = READ_ONCE(task->mm);
>
> -	err = down_read_killable(&task->signal->exec_update_lock);
> -	if (err)
> -		return ERR_PTR(err);
> +	if (!mm || (task->flags & PF_KTHREAD))
> +		return ERR_PTR(-ESRCH);
>
> -	mm = get_task_mm(task);
> -	if (!mm) {
> -		mm = ERR_PTR(-ESRCH);
> -	} else if (!may_access_mm(mm, task, mode)) {
> -		mmput(mm);
> -		mm = ERR_PTR(-EACCES);
> +	if (mm == current->mm) {
> +		mmget(mm);
> +		return mm;
>  	}
> +
> +	if (down_read_killable(&task->signal->exec_update_lock))
> +		return ERR_PTR(-EINTR);
> +
> +	if (may_access_mm(task, mode))
> +		mm = get_task_mm(task) ?: ERR_PTR(-ESRCH);
> +	else
> +		mm = ERR_PTR(-EACCES);
> +
>  	up_read(&task->signal->exec_update_lock);
>
>  	return mm;
--
Cheers,
David
^ permalink raw reply [flat|nested] 9+ messages in thread
* Re: [PATCH v2] mm_access: simplify the security checks
2026-05-30 14:12 ` [PATCH v2] " Oleg Nesterov
2026-06-01 12:04 ` David Hildenbrand (Arm)
@ 2026-06-01 12:12 ` Lorenzo Stoakes
2026-06-01 12:42 ` Oleg Nesterov
1 sibling, 1 reply; 9+ messages in thread
From: Lorenzo Stoakes @ 2026-06-01 12:12 UTC (permalink / raw)
To: Oleg Nesterov
Cc: Andrew Morton, Christian Brauner, David Hildenbrand, Jann Horn,
Kees Cook, Michal Hocko, linux-kernel, linux-mm, Liam R. Howlett
+cc Liam for mm lifecycle stuff :)
The subject here seems not quite right - you're adding complexity here in that
now there's a racy fast path.
On Sat, May 30, 2026 at 04:12:32PM +0200, Oleg Nesterov wrote:
> 1. Shift the fast-path "mm == current->mm" check from may_access_mm()
> to mm_access(), and do it locklessly.
>
> task->mm is not stable but we do not care. We can race with exec,
> but in this case we pin/return current->mm. This doesn't differ
> from the case where the target execs after we drop exec_update_lock.
Well, it does differ, in that previously we incremented the reference count
on the mm with the exec lock held, and now we don't?
Also how often are we invoking this where mm == current->mm?
I think the reasoning here is more so that current is guaranteed to hold a
reference on current->mm (de_thread() will have issued a fatal signal not
yet processed).
So the commit message should say that, I think.
One behavioural change here though is that down_read_killable() was used
previously, so such a situation would return -EINTR, but now would instead
succeed.
>
> All we need for correctness is READ_ONCE() to ensure the compiler
> won't reload task->mm. This is not enough for KCSAN, but we already
I'm not sure 'this is not enough for KCSAN' is really reassuring :)
> have other lockless ->mm LOAD's. We should probably change exec_mmap/
> exit_mm to use WRITE_ONCE().
>
> 2. With the change above, may_access_mm() doesn't need the "mm" argument,
> so we do not need to call get_task_mm() beforehand. We can call it
> only if may_access_mm() succeeds.
>
> Signed-off-by: Oleg Nesterov <oleg@redhat.com>
> ---
It's useful to put a revision history (ideally with links to prior revisions)
below the --- line to explain how vN differs from v(N-1).
> kernel/fork.c | 30 ++++++++++++++++--------------
> 1 file changed, 16 insertions(+), 14 deletions(-)
>
> diff --git a/kernel/fork.c b/kernel/fork.c
> index b8b651abce8b..3239380ab93b 100644
> --- a/kernel/fork.c
> +++ b/kernel/fork.c
> @@ -1381,10 +1381,8 @@ struct mm_struct *get_task_mm(struct task_struct *task)
>  }
>  EXPORT_SYMBOL_GPL(get_task_mm);
>
> -static bool may_access_mm(struct mm_struct *mm, struct task_struct *task, unsigned int mode)
> +static bool may_access_mm(struct task_struct *task, unsigned int mode)
>  {
> -	if (mm == current->mm)
> -		return true;
>  	if (ptrace_may_access(task, mode))
>  		return true;
>  	if ((mode & PTRACE_MODE_READ) && perfmon_capable())
> @@ -1394,20 +1392,24 @@ static bool may_access_mm(struct mm_struct *mm, struct task_struct *task, unsign
>
>  struct mm_struct *mm_access(struct task_struct *task, unsigned int mode)
>  {
> -	struct mm_struct *mm;
> -	int err;
> +	struct mm_struct *mm = READ_ONCE(task->mm);
>
> -	err = down_read_killable(&task->signal->exec_update_lock);
> -	if (err)
> -		return ERR_PTR(err);
> +	if (!mm || (task->flags & PF_KTHREAD))
> +		return ERR_PTR(-ESRCH);
This really needs a comment to explain your reasoning for why READ_ONCE()
suffices here.
>
> -	mm = get_task_mm(task);
> -	if (!mm) {
> -		mm = ERR_PTR(-ESRCH);
> -	} else if (!may_access_mm(mm, task, mode)) {
> -		mmput(mm);
> -		mm = ERR_PTR(-EACCES);
> +	if (mm == current->mm) {
> +		mmget(mm);
> +		return mm;
>  	}
> +
> +	if (down_read_killable(&task->signal->exec_update_lock))
> +		return ERR_PTR(-EINTR);
> +
> +	if (may_access_mm(task, mode))
> +		mm = get_task_mm(task) ?: ERR_PTR(-ESRCH);
> +	else
> +		mm = ERR_PTR(-EACCES);
> +
>  	up_read(&task->signal->exec_update_lock);
>
>  	return mm;
> --
> 2.52.0
>
>
(Side-note - we should really have these functions (and anything else
mm-related) in mm files.)
Overall I'm not really convinced about this patch - this isn't simplifying
things, it's introducing subtle assumptions and I don't really see the
benefit?
So I think it's a no unless you can provide a really solid justification.
And if it's a performance thing - how often are we actually calling
mm_access() for current->mm?
mm lifecycle is a very horrible part of mm and I think we should only make
changes when really necessary.
Thanks, Lorenzo
^ permalink raw reply [flat|nested] 9+ messages in thread
* Re: [PATCH v2] mm_access: simplify the security checks
2026-06-01 12:04 ` David Hildenbrand (Arm)
@ 2026-06-01 12:31 ` Oleg Nesterov
0 siblings, 0 replies; 9+ messages in thread
From: Oleg Nesterov @ 2026-06-01 12:31 UTC (permalink / raw)
To: David Hildenbrand (Arm)
Cc: Andrew Morton, Christian Brauner, Jann Horn, Kees Cook,
Lorenzo Stoakes, Michal Hocko, linux-kernel, linux-mm
On 06/01, David Hildenbrand (Arm) wrote:
>
> On 5/30/26 16:12, Oleg Nesterov wrote:
>
> I am not sure I spot the "simplification" here?
Ah, then let's forget this patch ;)
Cleanups are always subjective; if the maintainer doesn't agree, forget it.
> Looks more like an optimization that makes the code slightly more complicated.
I don't think that !task->mm or current->mm == task->mm is the common case.
The code just looks clearer to me if it takes exec_update_lock only when
necessary.
But see above, please forget.
Oleg.
^ permalink raw reply [flat|nested] 9+ messages in thread
* Re: [PATCH v2] mm_access: simplify the security checks
2026-06-01 12:12 ` Lorenzo Stoakes
@ 2026-06-01 12:42 ` Oleg Nesterov
0 siblings, 0 replies; 9+ messages in thread
From: Oleg Nesterov @ 2026-06-01 12:42 UTC (permalink / raw)
To: Lorenzo Stoakes
Cc: Andrew Morton, Christian Brauner, David Hildenbrand, Jann Horn,
Kees Cook, Michal Hocko, linux-kernel, linux-mm, Liam R. Howlett
On 06/01, Lorenzo Stoakes wrote:
>
> +cc Liam for mm lifecycle stuff :)
>
> The subject here seems not quite right - you're adding complexity here in that
> now there's a racy fast path.
OK. See my reply to David. If it doesn't look like a simplification,
let's forget this patch ;)
> One behavioural change here though is that down_read_killable() was used
> previously, so such a situation would return -EINTR, but now would instead
> succeed.
I don't really follow... SIGKILL from de_thread() or anything else can
come right after down_read_killable().
> > All we need for correctness is READ_ONCE() to ensure the compiler
> > won't reload task->mm. This is not enough for KCSAN, but we already
>
> I'm not sure 'this is not enough for KCSAN' is really reassuring :)
If I understand correctly, KCSAN will complain if (say) we race with the
exiting task, which does current->mm = NULL without WRITE_ONCE() in exit_mm().
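The point about marked accesses can be illustrated with a userspace analogy, under loud assumptions: C11 relaxed atomics are used here as a rough stand-in for READ_ONCE()/WRITE_ONCE(), which they are not identical to, and struct task_model, exit_mm_model() and snapshot_id() are hypothetical names. The sketch shows a reader taking one snapshot of the pointer and using only that snapshot afterwards, while the writer's store is marked as well so that a race detector would treat both sides as intentional.

```c
#include <assert.h>
#include <stdatomic.h>
#include <stddef.h>

/* Hypothetical stand-ins; not the kernel's types. */
struct mm_model {
	int id;
};

struct task_model {
	_Atomic(struct mm_model *) mm;
};

/* Writer side: a marked store, analogous to WRITE_ONCE(task->mm, NULL). */
static void exit_mm_model(struct task_model *t)
{
	atomic_store_explicit(&t->mm, (struct mm_model *)NULL,
			      memory_order_relaxed);
}

/*
 * Reader side: load the pointer exactly once, analogous to READ_ONCE(),
 * and base every later decision on that snapshot. The optional racer
 * callback simulates a concurrent writer running after the load.
 * The snapshot does not keep the object alive; here the object simply
 * outlives the call, which the kernel cannot assume.
 */
static int snapshot_id(struct task_model *t,
		       void (*racer)(struct task_model *))
{
	struct mm_model *mm = atomic_load_explicit(&t->mm,
						   memory_order_relaxed);

	if (!mm)
		return -1;
	if (racer)
		racer(t);
	return mm->id;	/* uses the snapshot, never reloads t->mm */
}

/* Returns 0 if the model behaves as described, nonzero otherwise. */
static int marked_access_demo(void)
{
	struct mm_model m = { 7 };
	struct task_model t;

	atomic_init(&t.mm, &m);
	if (snapshot_id(&t, exit_mm_model) != 7)
		return 1;	/* snapshot survives the simulated race */
	if (snapshot_id(&t, NULL) != -1)
		return 2;	/* a later load observes the cleared pointer */
	return 0;
}
```

If the reader tested t->mm and then loaded it a second time, the simulated writer could clear it in between, which is the reload that the changelog says READ_ONCE() is meant to rule out.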
> It's useful to put a revision history (ideally with links to prior revisions)
> below the --- line to explain how vN differs from v(N-1).
Yes... I didn't do it this time because V2 doesn't differ from V1; I just removed
the duplicated paragraph from the changelog.
> Overall I'm not really convinced about this patch - this isn't simplifying
> things, it's introducing subtle assumptions and I don't really see the
> benefit?
Thanks for the review! Let's forget this patch then.
Oleg.
^ permalink raw reply [flat|nested] 9+ messages in thread
Thread overview: 9+ messages
2026-05-30 13:56 [PATCH] mm_access: simplify the security checks Oleg Nesterov
2026-05-30 14:10 ` Oleg Nesterov
2026-06-01 11:16 ` Lorenzo Stoakes
2026-05-30 14:12 ` [PATCH v2] " Oleg Nesterov
2026-06-01 12:04 ` David Hildenbrand (Arm)
2026-06-01 12:31 ` Oleg Nesterov
2026-06-01 12:12 ` Lorenzo Stoakes
2026-06-01 12:42 ` Oleg Nesterov
2026-05-30 15:00 ` [PATCH] " Oleg Nesterov