* Regression on linux-next (next-20240712)
@ 2024-07-16 5:37 Borah, Chaitanya Kumar
2024-07-17 11:00 ` David Hildenbrand
0 siblings, 1 reply; 4+ messages in thread
From: Borah, Chaitanya Kumar @ 2024-07-16 5:37 UTC (permalink / raw)
To: peili.dev@gmail.com
Cc: Nikula, Jani, Saarinen, Jani, Kurmi, Suresh Kumar,
intel-gfx@lists.freedesktop.org, linux-mm@kvack.org,
linux-kernel@vger.kernel.org
Hello Pei,
Hope you are doing well. I am Chaitanya from the linux graphics team in Intel.
This mail is regarding a regression we are seeing in our CI runs[1] on linux-next repository.
In version next-20240712[2], we saw the following regression (currently being masked by another regression)
`````````````````````````````````````````````````````````````````````````````````
<4>[ 14.530533] ============================================
<4>[ 14.530533] WARNING: possible recursive locking detected
<4>[ 14.530534] 6.10.0-rc7-next-20240712-next-20240712-g3fe121b62282+ #1 Not tainted
<4>[ 14.530535] --------------------------------------------
<4>[ 14.530535] (direxec)/171 is trying to acquire lock:
<4>[ 14.530536] ffff8881010725d8 (&mm->mmap_lock){++++}-{3:3}, at: unmap_single_vma+0xea/0x170
<4>[ 14.530541]
but task is already holding lock:
<4>[ 14.530542] ffff8881010725d8 (&mm->mmap_lock){++++}-{3:3}, at: exit_mmap+0x6a/0x450
<4>[ 14.530545]
other info that might help us debug this:
<4>[ 14.530545] Possible unsafe locking scenario:
`````````````````````````````````````````````````````````````````````````````````
Details log can be found in [3].
After bisecting the tree, the following patch [4] seems to be the first "bad"
commit
`````````````````````````````````````````````````````````````````````````````````````````````````````````
commit a13252049629a8225f38a9be7d8d4fc4ff5350e8
Author: Pei Li mailto:peili.dev@gmail.com
Date: Wed Jul 10 22:13:17 2024 -0700
mm: fix mmap_assert_locked() in follow_pte()
`````````````````````````````````````````````````````````````````````````````````````````````````````````
We also verified that if we revert the patch the issue is not seen.
Could you please check why the patch causes this regression and provide a fix if necessary?
Thank you.
Regards
Chaitanya
[1] https://intel-gfx-ci.01.org/tree/linux-next/combined-alt.html?
[2] https://git.kernel.org/pub/scm/linux/kernel/git/next/linux-next.git/commit/?h=next-20240712
[3] https://gfx-ci.igk.intel.com/tree/linux-next/next-20240712/bat-arlh-2/boot0.txt
[4] https://git.kernel.org/pub/scm/linux/kernel/git/next/linux-next.git/commit/?h=next-20240712&id=a13252049629a8225f38a9be7d8d4fc4ff5350e8
^ permalink raw reply [flat|nested] 4+ messages in thread
* Re: Regression on linux-next (next-20240712)
2024-07-16 5:37 Regression on linux-next (next-20240712) Borah, Chaitanya Kumar
@ 2024-07-17 11:00 ` David Hildenbrand
2024-07-17 20:00 ` Andrew Morton
0 siblings, 1 reply; 4+ messages in thread
From: David Hildenbrand @ 2024-07-17 11:00 UTC (permalink / raw)
To: Borah, Chaitanya Kumar, peili.dev@gmail.com,
akpm@linux-foundation.org
Cc: Nikula, Jani, Saarinen, Jani, Kurmi, Suresh Kumar,
intel-gfx@lists.freedesktop.org, linux-mm@kvack.org,
linux-kernel@vger.kernel.org
On 16.07.24 07:37, Borah, Chaitanya Kumar wrote:
> Hello Pei,
>
> Hope you are doing well. I am Chaitanya from the linux graphics team in Intel.
>
> This mail is regarding a regression we are seeing in our CI runs[1] on linux-next repository.
>
> In version next-20240712[2], we saw the following regression (currently being masked by another regression)
>
> `````````````````````````````````````````````````````````````````````````````````
> <4>[ 14.530533] ============================================
> <4>[ 14.530533] WARNING: possible recursive locking detected
> <4>[ 14.530534] 6.10.0-rc7-next-20240712-next-20240712-g3fe121b62282+ #1 Not tainted
> <4>[ 14.530535] --------------------------------------------
> <4>[ 14.530535] (direxec)/171 is trying to acquire lock:
> <4>[ 14.530536] ffff8881010725d8 (&mm->mmap_lock){++++}-{3:3}, at: unmap_single_vma+0xea/0x170
> <4>[ 14.530541]
> but task is already holding lock:
> <4>[ 14.530542] ffff8881010725d8 (&mm->mmap_lock){++++}-{3:3}, at: exit_mmap+0x6a/0x450
> <4>[ 14.530545]
> other info that might help us debug this:
> <4>[ 14.530545] Possible unsafe locking scenario:
> `````````````````````````````````````````````````````````````````````````````````
> Details log can be found in [3].
>
> After bisecting the tree, the following patch [4] seems to be the first "bad"
> commit
>
> `````````````````````````````````````````````````````````````````````````````````````````````````````````
> commit a13252049629a8225f38a9be7d8d4fc4ff5350e8
> Author: Pei Li mailto:peili.dev@gmail.com
> Date: Wed Jul 10 22:13:17 2024 -0700
>
> mm: fix mmap_assert_locked() in follow_pte()
>
> `````````````````````````````````````````````````````````````````````````````````````````````````````````
>
> We also verified that if we revert the patch the issue is not seen.
>
> Could you please check why the patch causes this regression and provide a fix if necessary?
This is know.
There is a discussion along the original patch [1] on how to do it
differently. But likely we'll tackle it differently [2]. So this patch
should be dropped for -- which I think already happened because I cannot
spot that patch in mm-unstable anymore.
[1] https://lore.kernel.org/all/20240710-bug12-v1-1-0e5440f9b8d3@gmail.com/
[2] https://lkml.kernel.org/r/20240712144244.3090089-1-peterx@redhat.com
--
Cheers,
David / dhildenb
^ permalink raw reply [flat|nested] 4+ messages in thread
* Re: Regression on linux-next (next-20240712)
2024-07-17 11:00 ` David Hildenbrand
@ 2024-07-17 20:00 ` Andrew Morton
2024-07-18 5:09 ` Borah, Chaitanya Kumar
0 siblings, 1 reply; 4+ messages in thread
From: Andrew Morton @ 2024-07-17 20:00 UTC (permalink / raw)
To: David Hildenbrand
Cc: Borah, Chaitanya Kumar, peili.dev@gmail.com, Nikula, Jani,
Saarinen, Jani, Kurmi, Suresh Kumar,
intel-gfx@lists.freedesktop.org, linux-mm@kvack.org,
linux-kernel@vger.kernel.org
On Wed, 17 Jul 2024 13:00:58 +0200 David Hildenbrand <david@redhat.com> wrote:
> > Could you please check why the patch causes this regression and provide a fix if necessary?
>
> This is know.
>
> There is a discussion along the original patch [1] on how to do it
> differently. But likely we'll tackle it differently [2]. So this patch
> should be dropped for -- which I think already happened because I cannot
> spot that patch in mm-unstable anymore.
Yes, I dropped it on July 15.
^ permalink raw reply [flat|nested] 4+ messages in thread
* RE: Regression on linux-next (next-20240712)
2024-07-17 20:00 ` Andrew Morton
@ 2024-07-18 5:09 ` Borah, Chaitanya Kumar
0 siblings, 0 replies; 4+ messages in thread
From: Borah, Chaitanya Kumar @ 2024-07-18 5:09 UTC (permalink / raw)
To: Andrew Morton, David Hildenbrand
Cc: peili.dev@gmail.com, Nikula, Jani, Saarinen, Jani,
Kurmi, Suresh Kumar, intel-gfx@lists.freedesktop.org,
linux-mm@kvack.org, linux-kernel@vger.kernel.org
> -----Original Message-----
> From: Andrew Morton <akpm@linux-foundation.org>
> Sent: Thursday, July 18, 2024 1:31 AM
> To: David Hildenbrand <david@redhat.com>
> Cc: Borah, Chaitanya Kumar <chaitanya.kumar.borah@intel.com>;
> peili.dev@gmail.com; Nikula, Jani <jani.nikula@intel.com>; Saarinen, Jani
> <jani.saarinen@intel.com>; Kurmi, Suresh Kumar
> <suresh.kumar.kurmi@intel.com>; intel-gfx@lists.freedesktop.org; linux-
> mm@kvack.org; linux-kernel@vger.kernel.org
> Subject: Re: Regression on linux-next (next-20240712)
>
> On Wed, 17 Jul 2024 13:00:58 +0200 David Hildenbrand <david@redhat.com>
> wrote:
>
> > > Could you please check why the patch causes this regression and provide
> a fix if necessary?
> >
> > This is know.
> >
> > There is a discussion along the original patch [1] on how to do it
> > differently. But likely we'll tackle it differently [2]. So this patch
> > should be dropped for -- which I think already happened because I
> > cannot spot that patch in mm-unstable anymore.
>
> Yes, I dropped it on July 15.
Thank you. Our CI runs seems to have recovered from the regression.
Regards
Chaitanya
^ permalink raw reply [flat|nested] 4+ messages in thread
end of thread, other threads:[~2024-07-18 5:09 UTC | newest]
Thread overview: 4+ messages (download: mbox.gz follow: Atom feed
-- links below jump to the message on this page --
2024-07-16 5:37 Regression on linux-next (next-20240712) Borah, Chaitanya Kumar
2024-07-17 11:00 ` David Hildenbrand
2024-07-17 20:00 ` Andrew Morton
2024-07-18 5:09 ` Borah, Chaitanya Kumar
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).