From: Chuck Lever III <chuck.lever@oracle.com>
To: Christoph Hellwig <hch@infradead.org>
Cc: Linux NFS Mailing List <linux-nfs@vger.kernel.org>
Subject: Re: reservation errors during fstests on pNFS block
Date: Fri, 14 Jun 2024 17:46:21 +0000 [thread overview]
Message-ID: <C02C8230-4ACA-4F2D-AC28-B9583ADCADA5@oracle.com> (raw)
In-Reply-To: <Zmxx-H2KrT5QpJ-g@infradead.org>
> On Jun 14, 2024, at 12:38 PM, Christoph Hellwig <hch@infradead.org> wrote:
>
> On Fri, Jun 14, 2024 at 02:46:49PM +0000, Chuck Lever III wrote:
>> I've finally gotten kdevops and pNFS block to the point where
>> it can run fstests smoothly with an iSCSI target. I'm seeing
>> error messages on occasion in the system journal. This set is
>> from generic/069:
>
> Reservation means another node has an active reservation on that LU.
There are only two accessors of the LUN: the NFS server and
the NFS client running the test. That's why these errors are
a little surprising to me.
> Either you did another previous attempt that fail and let the
> reservation linger, or something else in the system claimed it.
This is the first fstests run after the systems were provisioned.
kdevops lets me provision from scratch before every run [1].
>> But note that generic/069 is recorded as passing without error.
>
> When pNFS layout access fails we fall back to normal access through the
> MDS, so this is expected.
Expected, OK. From a usability standpoint, error messages like
this would probably be alarming to administrators. I plan to
convert the printk's and dprintk's in the NFSD layout code into
trace points, but that doesn't help the messages emitted by the
block and SCSI drivers. Ideally this should be less noisy.
> Is generic/069 that first test that failed when doing a full xfstests
> run?
Yes, it's a full run. generic/069 is the first test where there
are remarkable system journal messages (ie, PR errors), though
there are a few subsequent tests that are also whinging.
> Do you see LAYOUT* ops in /proc/self/mountstats for the previous
> tests?
generic/013 is known to generate layout recalls, for example,
so there is layout activity during the test run.
I can go back and try reproducing with just generic/069 and
tcpdump as a first step. Is there a way I can tell that the
PR errors are not reporting a possible data corruption? I
guess the PASS report from generic/069 is one way. The pass/fail
log from xfstests for pNFS block looks just the non-pNFS runs,
so maybe this is must ado about nothing.
--
Chuck Lever
[1] - https://github.com/chucklever/kdevops/tree/pnfs-block-testing
next prev parent reply other threads:[~2024-06-14 17:46 UTC|newest]
Thread overview: 8+ messages / expand[flat|nested] mbox.gz Atom feed top
2024-06-14 14:46 reservation errors during fstests on pNFS block Chuck Lever III
2024-06-14 16:38 ` Christoph Hellwig
2024-06-14 17:46 ` Chuck Lever III [this message]
2024-06-14 18:26 ` Christoph Hellwig
2024-06-14 18:33 ` Chuck Lever III
2024-06-14 18:34 ` Christoph Hellwig
2024-06-15 18:09 ` Chuck Lever III
2024-06-17 5:38 ` Christoph Hellwig
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=C02C8230-4ACA-4F2D-AC28-B9583ADCADA5@oracle.com \
--to=chuck.lever@oracle.com \
--cc=hch@infradead.org \
--cc=linux-nfs@vger.kernel.org \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox