From: Damien Le Moal <dlemoal@kernel.org>
To: Shin'ichiro Kawasaki <shinichiro.kawasaki@wdc.com>,
fio@vger.kernel.org, Jens Axboe <axboe@kernel.dk>,
Vincent Fu <vincentfu@gmail.com>
Subject: Re: [PATCH v2 3/8] zbd: introduce write_zone_remainder option
Date: Fri, 27 Feb 2026 13:59:39 +0900
Message-ID: <f3df9bb5-d6bc-4d60-97f3-e8d3805387ef@kernel.org>
In-Reply-To: <20260216075936.3318729-4-shinichiro.kawasaki@wdc.com>
On 2/16/26 16:59, Shin'ichiro Kawasaki wrote:
> When the specified block size is not aligned with the zone size or the
> write pointer positions at workload start, write workloads create
> unwritten remainder areas at the ends of zones. These remainder areas
> leave zones in the open condition. This disrupts the intended write
> target zone selection.
>
> Previous commits e1a1b59b0b9b ("zbd: finish zones with remainder smaller
> than minimum write block size") and e2e29bf6f830 ("zbd: finish zone when
> all random write target zones have small remainder") attempted to solve
> this problem by issuing a zone finish operation for zones with small
> remainders. However, this approach degraded performance for two
> reasons. First, the zone finish operation takes substantial execution
> time. Second, the zone finish operation must wait for in-flight writes
> from other jobs to complete, which is done by calling io_u_quiesce()
> before the zone finish operation.
>
> To avoid the performance degradation, introduce the new option named
> "write_zone_remainder". When the option is specified, issue writes to
> the remainder areas instead of issuing a zone finish operation. The
> write operation puts the zones into the full condition in the same
> manner as the zone finish operation, freeing up the zone resources of
> the device and enabling writes to other zones. Also, when the option is
> set, skip the io_u_quiesce() call that was required before the zone
> finish operation. The performance benefit of eliminating the waits on
> in-flight writes is particularly significant in asynchronous I/O
> workloads, where the write operations to the remainder areas are
> managed as part of queued I/Os.
>
> The drawback of this approach is that writing these remainders requires
> write sizes smaller than the minimum block size. As a result, when using
> the write_zone_remainder option, the random map feature must be disabled
> using the norandommap=1 option, which is automatically done when the
> option is specified.
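
For illustration only, the remainder described in the quoted commit message can be sketched with simple arithmetic. This is a minimal sketch and not fio code: the function name, its parameters, and the zone geometry are hypothetical and chosen only to show how an unaligned block size or write pointer leaves an unwritten area at the end of a zone.

```python
def zone_remainder(zone_start, zone_capacity, write_pointer, block_size):
    """Bytes left unwritten at the end of a zone once it has been filled
    with as many whole blocks as fit from the current write pointer.

    All arguments are byte values. The names are hypothetical and do not
    correspond to fio internals.
    """
    writable = zone_start + zone_capacity - write_pointer
    return writable % block_size


KiB = 1024

# Assumed geometry: an empty zone with 1 MiB capacity and 48 KiB writes.
# 21 whole blocks fit (1008 KiB), leaving a 16 KiB remainder.
print(zone_remainder(0, 1024 * KiB, 0, 48 * KiB) // KiB)

# Assumed geometry: 64 KiB writes divide the zone evenly, but the write
# pointer starts 4 KiB into the zone, so a 60 KiB remainder is left.
print(zone_remainder(0, 1024 * KiB, 4 * KiB, 64 * KiB) // KiB)
```

In both cases the remainder is smaller than the block size, which is the situation where, per the commit message, the new option issues a smaller write to fill the zone instead of a zone finish operation.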
Looks OK to me.
Reviewed-by: Damien Le Moal <dlemoal@kernel.org>
--
Damien Le Moal
Western Digital Research
Thread overview: 15+ messages
2026-02-16 7:59 [PATCH v2 0/8] zbd: fix problems of random write with unaligned block size Shin'ichiro Kawasaki
2026-02-16 7:59 ` [PATCH v2 1/8] zbd: fix zone selection of random writes Shin'ichiro Kawasaki
2026-02-27 4:48 ` Damien Le Moal
2026-02-16 7:59 ` [PATCH v2 2/8] zbd: fix write zone accounting Shin'ichiro Kawasaki
2026-02-27 4:53 ` Damien Le Moal
2026-02-27 12:08 ` Shinichiro Kawasaki
2026-02-16 7:59 ` [PATCH v2 3/8] zbd: introduce write_zone_remainder option Shin'ichiro Kawasaki
2026-02-27 4:59 ` Damien Le Moal [this message]
2026-02-16 7:59 ` [PATCH v2 4/8] doc: explain the option write_zone_remainder Shin'ichiro Kawasaki
2026-02-27 5:06 ` Damien Le Moal
2026-02-16 7:59 ` [PATCH v2 5/8] t/zbd: add -m option to enable write_zone_remainder option Shin'ichiro Kawasaki
2026-02-16 7:59 ` [PATCH v2 6/8] t/zbd: avoid test case 14 failure with " Shin'ichiro Kawasaki
2026-02-16 7:59 ` [PATCH v2 7/8] t/zbd: avoid test case 33 " Shin'ichiro Kawasaki
2026-02-16 7:59 ` [PATCH v2 8/8] t/zbd: avoid test case 71 " Shin'ichiro Kawasaki
2026-02-16 9:10 ` [PATCH v2 0/8] zbd: fix problems of random write with unaligned block size fiotestbot