From: "Martin K. Petersen" <martin.petersen@oracle.com>
To: Damien Le Moal <dlemoal@kernel.org>
Cc: "Yu Kuai" <yukuai1@huaweicloud.com>,
"Csordás Hunor" <csordas.hunor@gmail.com>,
"Coly Li" <colyli@kernel.org>,
hch@lst.de, linux-block@vger.kernel.org,
"James E.J. Bottomley" <James.Bottomley@HansenPartnership.com>,
"Martin K. Petersen" <martin.petersen@oracle.com>,
linux-scsi@vger.kernel.org, "yukuai (C)" <yukuai3@huawei.com>
Subject: Re: Improper io_opt setting for md raid5
Date: Tue, 29 Jul 2025 00:08:22 -0400 [thread overview]
Message-ID: <yq15xfbmz2u.fsf@ca-mkp.ca.oracle.com> (raw)
In-Reply-To: <fa2f9406-4ee8-45f9-a784-b5042e9f4411@kernel.org> (Damien Le Moal's message of "Mon, 28 Jul 2025 16:44:14 +0900")
Damien,
> My bad, yes, that is the definition in sysfs. So io_min is the stride
> size, where:
Depends on the RAID type. For RAID0 and 1 there is no inherent penalty
wrt. writing less than the stride size. But for RAID 5/6 there clearly
is.
> stride size x number of data disks == stripe_size.
>
> Note that chunk_sectors limit is the *stripe* size, not per drive stride.
> Beware of the wording here to avoid confusion (this is all already super
> confusing !).
The choice of "chunk" to describe the LBA boundary queue limit is
unfortunate since MD uses chunk_sectors as the term for what you call
stride.
> Well, at least, that is how I interpret the io_min definition of
> minimum_io_size in Documentation/ABI/stable/sysfs-block. But the wording "For
> RAID arrays it is often the stripe chunk size." is super confusing. Not
> entirely sure if stride or stripe was meant here...
The stripe chunk or stripe unit is what you call stride. Stripe width is
the full stripe across all drives.
> As for read_ahead_kb, it should be bounded by io_opt (upper bound) but
> should be initialized to a smaller value aligned to io_min (if io_opt
> is unreasonably large).
In retrospect I am not really a fan of using io_opt for read_ahead_kb
since, to my knowledge, there is no guarantee that the readahead I/O
will be naturally aligned.
That said, I don't really know of devices where this matters much for
reads. With writes, this would be much more of an issue.
--
Martin K. Petersen
next prev parent reply other threads:[~2025-07-29 4:08 UTC|newest]
Thread overview: 20+ messages / expand[flat|nested] mbox.gz Atom feed top
[not found] <ywsfp3lqnijgig6yrlv2ztxram6ohf5z4yfeebswjkvp2dzisd@f5ikoyo3sfq5>
2025-07-27 10:50 ` Improper io_opt setting for md raid5 Csordás Hunor
2025-07-28 0:39 ` Damien Le Moal
2025-07-28 0:55 ` Yu Kuai
2025-07-28 2:41 ` Damien Le Moal
2025-07-28 3:08 ` Yu Kuai
2025-07-28 3:49 ` Damien Le Moal
2025-07-28 7:14 ` Yu Kuai
2025-07-28 7:44 ` Damien Le Moal
2025-07-28 9:02 ` Yu Kuai
2025-07-29 4:23 ` Martin K. Petersen
2025-07-29 6:25 ` Yu Kuai
2025-07-29 22:02 ` Tony Battersby
2025-07-29 6:13 ` Hannes Reinecke
2025-07-29 6:29 ` Yu Kuai
2025-07-29 22:24 ` Keith Busch
2025-07-28 10:56 ` Csordás Hunor
2025-07-29 4:08 ` Martin K. Petersen [this message]
2025-07-29 3:53 ` Martin K. Petersen
2025-07-29 3:49 ` Martin K. Petersen
2025-07-29 4:44 ` Martin K. Petersen
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=yq15xfbmz2u.fsf@ca-mkp.ca.oracle.com \
--to=martin.petersen@oracle.com \
--cc=James.Bottomley@HansenPartnership.com \
--cc=colyli@kernel.org \
--cc=csordas.hunor@gmail.com \
--cc=dlemoal@kernel.org \
--cc=hch@lst.de \
--cc=linux-block@vger.kernel.org \
--cc=linux-scsi@vger.kernel.org \
--cc=yukuai1@huaweicloud.com \
--cc=yukuai3@huawei.com \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox