Intel-GFX Archive on lore.kernel.org
 help / color / mirror / Atom feed
From: Krister Johansen <kjlx@templeofstupid.com>
To: "Borah, Chaitanya Kumar" <chaitanya.kumar.borah@intel.com>
Cc: Miklos Szeredi <mszeredi@redhat.com>,
	"intel-gfx@lists.freedesktop.org"
	<intel-gfx@lists.freedesktop.org>,
	"kjlx@templeofstupid.com" <kjlx@templeofstupid.com>,
	"Kurmi, Suresh Kumar" <suresh.kumar.kurmi@intel.com>
Subject: Re: [Intel-gfx] Regression on linux-next (next-20231107)
Date: Thu, 9 Nov 2023 12:40:22 -0800	[thread overview]
Message-ID: <20231109204022.GA2073@templeofstupid.com> (raw)
In-Reply-To: <SJ1PR11MB6129508509896AD7D0E03114B9AFA@SJ1PR11MB6129.namprd11.prod.outlook.com>

Hi Chaitanya,

On Thu, Nov 09, 2023 at 05:00:09PM +0000, Borah, Chaitanya Kumar wrote:
> Hello Krister,
>  
> Hope you are doing well. I am Chaitanya from the linux graphics team in Intel.
>  
> This mail is regarding a regression we are seeing in our CI runs[1] for some machines (dg2 and adl-p) on linux-next  repository.
> 
> Since the version next-20231107 [2], we are seeing the following error
> ```````````````````````````````````````````````````````````````````````````````
> <4>[   32.015910] stack segment: 0000 [#1] PREEMPT SMP NOPTI
> <4>[   32.021048] CPU: 15 PID: 766 Comm: fusermount Not tainted 6.6.0-next-20231107-next-20231107-g5cd631a52568+ #1
> <4>[   32.031135] Hardware name: Intel Corporation Raptor Lake Client Platform/RPL-S ADP-S DDR5 UDIMM CRB, BIOS RPLSFWI1.R00.4221.A00.2305271351 05/27/2023
> <4>[   32.044657] RIP: 0010:fuse_evict_inode+0x61/0x150 [fuse]
> `````````````````````````````````````````````````````````````````````````````````
> 
> Details log can be found in [3].
> 
> After bisecting the tree, the following patch [4] seems to be the first "bad" commit
> 
>  `````````````````````````````````````````````````````````````````````````````````````````````````````````
> 513dfacefd712bcbfab64e1a9c9c3e0d51c2dca5 is the first bad commit
> commit 513dfacefd712bcbfab64e1a9c9c3e0d51c2dca5
> Author: Krister Johansen kjlx@templeofstupid.com
> Date:   Fri Nov 3 10:39:47 2023 -0700
> 
>     fuse: share lookup state between submount and its parent
> 
>     Fuse submounts do not perform a lookup for the nodeid that they inherit
>     from their parent.  Instead, the code decrements the nlookup on the
>     submount's fuse_inode when it is instantiated, and no forget is
>     performed when a submount root is evicted.
> 
>     Trouble arises when the submount's parent is evicted despite the
>     submount itself being in use.  In this author's case, the submount was
>     in a container and deatched from the initial mount namespace via a
>     MNT_DEATCH operation.  When memory pressure triggered the shrinker, the
>     inode from the parent was evicted, which triggered enough forgets to
>     render the submount's nodeid invalid.
> 
>     Since submounts should still function, even if their parent goes away,
>     solve this problem by sharing refcounted state between the parent and
>     its submount.  When all of the references on this shared state reach
>     zero, it's safe to forget the final lookup of the fuse nodeid.
> 
>  `````````````````````````````````````````````````````````````````````````````````````````````````````````
>  
> We also verified that if we revert the patch the issue is not seen.
> 
> Could you please check why the patch causes this regression and provide a fix if necessary?

Apologies for the inconvenience.  I've reproduced the problem, tested a
fix, and am in the process of preparing patches to send to Miklos.  I'll
cc the people on this e-mail in that thread.

> [3] http://gfx-ci.igk.intel.com/tree/linux-next/next-20231109/bat-dg2-14/boot0.txt

This link didn't resolve in DNS when I tried to access it.  I needed to
use intel-gfx-ci.01.org as the hostname instead.

Thanks,

-K

  reply	other threads:[~2023-11-13 13:45 UTC|newest]

Thread overview: 26+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2023-10-05 15:58 [Intel-gfx] Regression in linux-next Borah, Chaitanya Kumar
2023-10-06 20:30 ` Wysocki, Rafael J
2023-10-09  5:10   ` Borah, Chaitanya Kumar
2023-10-09 19:23     ` Wysocki, Rafael J
2023-10-11  4:00       ` Borah, Chaitanya Kumar
2023-10-11 16:14         ` Wysocki, Rafael J
2023-10-11 16:49           ` Borah, Chaitanya Kumar
2023-10-13 14:05             ` Borah, Chaitanya Kumar
2023-10-20  5:52 ` [Intel-gfx] Regression on linux-next (next-20231016) Borah, Chaitanya Kumar
2023-10-20  6:38   ` Lorenzo Stoakes
2023-10-20  7:58     ` Borah, Chaitanya Kumar
2023-10-25  6:32   ` [Intel-gfx] Regression on linux-next (next-20231013) Borah, Chaitanya Kumar
2023-10-25  7:32     ` Christian Brauner
2023-10-25 13:44       ` Borah, Chaitanya Kumar
2023-10-26 10:14         ` Borah, Chaitanya Kumar
2023-10-26 12:16           ` Christian Brauner
2023-11-09 17:00     ` [Intel-gfx] Regression on linux-next (next-20231107) Borah, Chaitanya Kumar
2023-11-09 20:40       ` Krister Johansen [this message]
2023-11-10  3:38         ` Borah, Chaitanya Kumar
2023-11-13  6:21           ` Borah, Chaitanya Kumar
     [not found]             ` <20231114174121.GA2064@templeofstupid.com>
2023-11-15  4:33               ` Borah, Chaitanya Kumar
2023-12-04 17:17       ` [Intel-gfx] Regression on linux-next (next-20231130) Borah, Chaitanya Kumar
2023-12-04 18:11         ` Berg, Johannes
2023-12-05  6:14           ` Borah, Chaitanya Kumar
2024-01-31  5:34         ` Regression on drm-tip Borah, Chaitanya Kumar
     [not found]           ` <b77d8588-6809-416c-b598-7a33a672c1e7@opensource.cirrus.com>
2024-02-01  5:13             ` Borah, Chaitanya Kumar

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=20231109204022.GA2073@templeofstupid.com \
    --to=kjlx@templeofstupid.com \
    --cc=chaitanya.kumar.borah@intel.com \
    --cc=intel-gfx@lists.freedesktop.org \
    --cc=mszeredi@redhat.com \
    --cc=suresh.kumar.kurmi@intel.com \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox