From: "Theodore Y. Ts'o" <tytso@mit.edu>
To: Ming Lei <ming.lei@redhat.com>
Cc: Jens Axboe <axboe@kernel.dk>,
"linux-block@vger.kernel.org" <linux-block@vger.kernel.org>
Subject: Re: [PATCH] blk-mq: fix corruption with direct issue
Date: Thu, 6 Dec 2018 21:46:42 -0500
Message-ID: <20181207024642.GA13460@thunk.org>
In-Reply-To: <20181205030300.GG17845@ming.t460p>

On Wed, Dec 05, 2018 at 11:03:01AM +0800, Ming Lei wrote:
>
> But at that time, there isn't io scheduler for MQ, so in theory the
> issue should be there since v4.11, especially 945ffb60c11d ("mq-deadline:
> add blk-mq adaptation of the deadline IO scheduler").

Hi Ming,

How serious were you about this being (theoretically) an issue since
4.11? Can you talk about how it might get triggered, and
how we can test for it? The reason why I ask is because we're trying
to track down a mysterious file system corruption problem on a 4.14.x
stable kernel. The symptoms are *very* eerily similar to kernel
bugzilla #201685.

The difficulty is that the problem is super-rare --- roughly once a week
out of a population of about 2500 systems. The workload is NFS
serving. Unfortunately, since 4.14.63 we can no
longer disable blk-mq for the virtio-scsi driver, thanks to the commit
b5b6e8c8d3b4 ("scsi: virtio_scsi: fix IO hang caused by automatic irq
vector affinity") getting backported into 4.14.63 as commit
70b522f163bbb32.

We're considering reverting this patch in our 4.14 LTS kernel, and
seeing whether it makes the problem go away. Is there anything else
you might suggest?

Thanks,

- Ted

P.S. Unlike the repros that users were seeing in #201685, we *did*
have an I/O scheduler enabled --- it was mq-deadline. But right now,
given your comments, and the corruptions that we're seeing, I'm not
feeling very warm and fuzzy about block-mq. :-( :-( :-(

Thread overview: 22+ messages
2018-12-04 22:47 [PATCH] blk-mq: fix corruption with direct issue Jens Axboe
2018-12-05 1:37 ` Ming Lei
2018-12-05 2:16 ` Jens Axboe
2018-12-05 2:23 ` Jens Axboe
2018-12-05 2:27 ` Ming Lei
2018-12-05 2:30 ` Jens Axboe
2018-12-05 2:58 ` Ming Lei
2018-12-05 3:03 ` Ming Lei
2018-12-05 3:05 ` Jens Axboe
2018-12-07 2:46 ` Theodore Y. Ts'o [this message]
2018-12-07 3:04 ` Jens Axboe
2018-12-07 3:44 ` Ming Lei
2018-12-07 9:30 ` Ming Lei
2018-12-05 3:04 ` Jens Axboe
2018-12-05 1:38 ` Guenter Roeck
2018-12-05 2:25 ` Jens Axboe
2018-12-05 17:55 ` Guenter Roeck
2018-12-05 17:59 ` Jens Axboe
2018-12-05 19:09 ` Guenter Roeck
2018-12-05 20:11 ` Jens Axboe
2018-12-05 14:41 ` Christoph Hellwig
2018-12-05 15:15 ` Jens Axboe