From: Salvatore Bonaccorso <carnil@debian.org>
To: Chuck Lever <chuck.lever@oracle.com>
Cc: Scott Mayhew <smayhew@redhat.com>,
Jur van der Burg via Bugspray Bot <bugbot@kernel.org>,
anna@kernel.org, trondmy@kernel.org, jlayton@kernel.org,
linux-nfs@vger.kernel.org, cel@kernel.org,
1091439@bugs.debian.org, 1091439-submitter@bugs.debian.org,
1087900@bugs.debian.org, 1087900-submitter@bugs.debian.org
Subject: Re: kernel BUG at fs/nfsd/nfs4recover.c:534 Oops: invalid opcode: 0000
Date: Sat, 28 Dec 2024 07:09:02 +0100
Message-ID: <Z2-V_reIDIgJ1AH7@eldamar.lan>
In-Reply-To: <9e988cfa-5a27-4139-b922-b5c416ae0c72@oracle.com>

Hi,

On Fri, Dec 27, 2024 at 04:31:44PM -0500, Chuck Lever wrote:
> On 12/27/24 1:36 AM, Salvatore Bonaccorso wrote:
> > Hi,
> >
> > On Thu, Dec 26, 2024 at 08:17:45PM +0100, Salvatore Bonaccorso wrote:
> > > Hi Chuck, hi all,
> > >
> > > On Thu, Dec 26, 2024 at 11:33:01AM -0500, Chuck Lever wrote:
> > > > On 12/26/24 11:24 AM, Salvatore Bonaccorso wrote:
> > > > > Hi Jur,
> > > > >
> > > > > On Mon, Dec 09, 2024 at 04:50:05PM +0000, Jur van der Burg via Bugspray Bot wrote:
> > > > > > Jur van der Burg writes via Kernel.org Bugzilla:
> > > > > >
> > > > > > > I tried kernel 6.10.1 and that one is ok. In the meantime I
> > > > > > upgraded nfs-utils from 2.5.1 to 2.8.1 which seems to fix the issue.
> > > > > > Sorry for the noise, case closed.
> > > > > >
> > > > > > View: https://bugzilla.kernel.org/show_bug.cgi?id=219580#c2
> > > > > > You can reply to this message to join the discussion.
> > > > >
> > > > > Are you sure this is solved? I got hit by this today after trying to
> > > > > check the report from another Debian user:
> > > > >
> > > > > https://bugs.debian.org/1091439
> > > > > the earlier report was
> > > > > https://bugs.debian.org/1087900
> > > > >
> > > > > Surprisingly I managed to hit this, after:
> > > > >
> > > > > > Doing a fresh Debian installation with Debian unstable, rebooting
> > > > > > after installation. The running kernel is 6.12.6-1 (but I now believe it
> > > > > > might be hit in earlier versions as well):
> > > > >
> > > > > > Notably, in the kernel log I also see:
> > > > >
> > > > > [ 50.295209] RPC: Registered tcp NFSv4.1 backchannel transport module.
> > > > > [ 52.158301] NFSD: Using /var/lib/nfs/v4recovery as the NFSv4 state recovery directory
> > > > > [ 52.158333] NFSD: Using legacy client tracking operations.
> > > >
> > > > Hi Salvatore,
> > > >
> > > > If you no longer provision nfsdcltrack in user space, then you want to
> > > > set CONFIG_NFSD_LEGACY_CLIENT_TRACKING to 'N' in your kernel config.
> > >
> > > Right, while this might not be possible right now in the distribution,
> > > to confirm, setting CONFIG_NFSD_LEGACY_CLIENT_TRACKING would resolve
> > > the problem. In the distribution I think we would not yet be able to
> > > do a hard cut for the planned next stable release.
> > >
> > > Remember that in Debian we only got back on "track" with the
> > > nfs-utils code as of the current stable release.
> > >
> > > > Otherwise, Scott Mayhew is the area expert (cc'd).
> > >
> > > Thanks!
> > >
> > > I will try to narrow down the versions to see where the
> > > problem might have been introduced, but if you already have a clue and know
> > > what we might try (e.g. a commit revert on top, or a patch), I'm happy to
> > > test this as well (since I am now able to trigger it reliably).
> >
> > Okay, so this was maybe obvious to you already, but bisecting leads to
> > the first bad commit being:
> >
> > 74fd48739d04 ("nfsd: new Kconfig option for legacy client tracking")
> >
> > The problem is not present in v6.7 and it is triggerable with
> > 74fd48739d04 ("nfsd: new Kconfig option for legacy client tracking")
> >
> > Most importantly, as the switch to defaulting to y happened only in later
> > versions, this requires explicitly setting CONFIG_NFSD_LEGACY_CLIENT_TRACKING=y.
>
> Hi Salvatore -
>
> I see that Scott recently sent a fix for a similar crash to linux-nfs@ :
>
> https://lore.kernel.org/linux-nfs/032ff3ad487ce63656f95c6cdf3db8543fb0d061.camel@kernel.org/T/#t

Oh right, this describes exactly the problem.

Do you think the fix can still make it into 6.13 (and then be
cherry-picked to the affected 6.12.y stable series), or do we have to
wait for it to land in 6.14 first?

Regards,
Salvatore