From: "Darrick J. Wong" <djwong@kernel.org>
To: Zhang Yi <yi.zhang@huaweicloud.com>
Cc: linux-fsdevel@vger.kernel.org, linux-xfs@vger.kernel.org,
linux-ext4@vger.kernel.org, brauner@kernel.org,
hch@infradead.org, yi.zhang@huawei.com, yizhang089@gmail.com,
yangerkun@huawei.com, yukuai@fnnas.com
Subject: Re: [PATCH 4/4] iomap: fix out-of-bounds bitmap_set() with zero-length range
Date: Fri, 15 May 2026 10:30:47 -0700 [thread overview]
Message-ID: <20260515173047.GC9555@frogsfrogsfrogs> (raw)
In-Reply-To: <827c4380-4251-40af-bc14-207111736464@huaweicloud.com>
On Fri, May 15, 2026 at 09:50:08AM +0800, Zhang Yi wrote:
> On 5/15/2026 2:10 AM, Darrick J. Wong wrote:
> > On Thu, May 14, 2026 at 02:29:55PM +0800, Zhang Yi wrote:
> >> From: Zhang Yi <yi.zhang@huawei.com>
> >>
> >> ifs_set_range_dirty() and ifs_set_range_uptodate() compute last_blk
> >> as (off + len - 1) >> i_blkbits. When off is 0 and len is 0, the
> >> unsigned subtraction underflows to SIZE_MAX, producing a huge
> >> last_blk and nr_blks value that causes bitmap_set() to write far
> >> beyond the ifs->state allocation.
> >>
> >> Regarding ifs_set_range_uptodate(), it is temporarily safe because len
> >> cannot be passed in as 0. However, for ifs_set_range_dirty() this is
> >> reachable from __iomap_write_end(): when copy_folio_from_iter_atomic()
> >> returns 0 (e.g. user buffer fault) and the folio is already uptodate,
> >> the guard at the top of __iomap_write_end() does not trigger because
> >> !folio_test_uptodate() is false, and iomap_set_range_dirty() is called
> >> with copied == 0.
> >>
> >> Add a !len guard to both functions before the computation, so that a
> >> zero-length range is a no-op.
> >>
> >> Signed-off-by: Zhang Yi <yi.zhang@huawei.com>
> >> ---
> >> fs/iomap/buffered-io.c | 23 +++++++++++++++--------
> >> 1 file changed, 15 insertions(+), 8 deletions(-)
> >>
> >> diff --git a/fs/iomap/buffered-io.c b/fs/iomap/buffered-io.c
> >> index 27ab33edbdee..6fe5f7e998fd 100644
> >> --- a/fs/iomap/buffered-io.c
> >> +++ b/fs/iomap/buffered-io.c
> >> @@ -67,11 +67,14 @@ static bool ifs_set_range_uptodate(struct folio *folio,
> >> struct iomap_folio_state *ifs, size_t off, size_t len)
> >> {
> >> struct inode *inode = folio->mapping->host;
> >> - unsigned int first_blk = off >> inode->i_blkbits;
> >> - unsigned int last_blk = (off + len - 1) >> inode->i_blkbits;
> >> - unsigned int nr_blks = last_blk - first_blk + 1;
> >> + unsigned int first_blk, last_blk;
> >>
> >> - bitmap_set(ifs->state, first_blk, nr_blks);
> >> + if (!len)
> >> + return true;
> >> +
> >> + first_blk = off >> inode->i_blkbits;
> >> + last_blk = (off + len - 1) >> inode->i_blkbits;
> >> + bitmap_set(ifs->state, first_blk, last_blk - first_blk + 1);
> >> return ifs_is_fully_uptodate(folio, ifs);
> >> }
> >>
> >> @@ -203,13 +206,17 @@ static void ifs_set_range_dirty(struct folio *folio,
> >> {
> >> struct inode *inode = folio->mapping->host;
> >> unsigned int blks_per_folio = i_blocks_per_folio(inode, folio);
> >> - unsigned int first_blk = (off >> inode->i_blkbits);
> >> - unsigned int last_blk = (off + len - 1) >> inode->i_blkbits;
> >> - unsigned int nr_blks = last_blk - first_blk + 1;
> >> + unsigned int first_blk, last_blk;
> >> unsigned long flags;
> >>
> >> + if (!len)
> >> + return;
> >> +
> >> + first_blk = off >> inode->i_blkbits;
> >> + last_blk = (off + len - 1) >> inode->i_blkbits;
> >> spin_lock_irqsave(&ifs->state_lock, flags);
> >> - bitmap_set(ifs->state, first_blk + blks_per_folio, nr_blks);
> >> + bitmap_set(ifs->state, first_blk + blks_per_folio,
> >> + last_blk - first_blk + 1);
> >
> > I'm curious about the inconsistency in the computations between
> > ifs_clear_range_dirty and ifs_set_range_dirty now. In the function that
> > clears dirty bits, off/len are rounded inwards:
> >
> > unsigned int first_blk = round_up(off, i_blocksize(inode)) >>
> > inode->i_blkbits;
> > unsigned int last_blk = (off + len) >> inode->i_blkbits;
> > unsigned long flags;
> >
> > if (first_blk >= last_blk)
> > return;
> >
> > spin_lock_irqsave(&ifs->state_lock, flags);
> > bitmap_clear(ifs->state, first_blk + blks_per_folio,
> > last_blk - first_blk);
> >
> > but here we're still rounding outwards:
> >
> > if (!len)
> > return;
> >
> > first_blk = off >> inode->i_blkbits;
> > last_blk = (off + len - 1) >> inode->i_blkbits;
> > spin_lock_irqsave(&ifs->state_lock, flags);
> > bitmap_set(ifs->state, first_blk + blks_per_folio,
> > last_blk - first_blk + 1);
> >
> > That doesn't quite sound right to me without an explanation in the code,
> > which currently lacks one. I *think* the reason for the discrepancy is
> > that if we want to dirty part of an fsblock, we need to mark the whole
> > block dirty in the ifs so that all the blocks get written out; but when
> > we're clearing dirty bits, we want to leave an fsblock dirty if we only
> > wrote back part of that fsblock. Does that sound right?
> >
> > --D
> >
>
> Yes, that's right. The primary purpose of clearing the dirty bit is for
> invalidating a partial folio(e.g., when punching hole). Only when a full
> fsblock has been invalidated can its corresponding dirty bit be cleared.
> Write-back operations never seem to write back only part of a fsblock.
>
> I can add comments for them in my next iteration.
Sounds good, thank you.
(Does this patch also need a fixes tag like Joanne asked?)
--D
> Thanks,
> Yi.
>
>
>
next prev parent reply other threads:[~2026-05-15 17:30 UTC|newest]
Thread overview: 13+ messages / expand[flat|nested] mbox.gz Atom feed top
2026-05-14 6:29 [PATCH 0/4] iomap: trivial fixes for ext4 conversion Zhang Yi
2026-05-14 6:29 ` [PATCH 1/4] iomap: correct the range of a partial dirty clear Zhang Yi
2026-05-14 18:03 ` Darrick J. Wong
2026-05-15 1:20 ` Zhang Yi
2026-05-14 6:29 ` [PATCH 2/4] iomap: support invalidating partial folios Zhang Yi
2026-05-14 6:29 ` [PATCH 3/4] iomap: fix incorrect did_zero setting in iomap_zero_iter() Zhang Yi
2026-05-14 6:29 ` [PATCH 4/4] iomap: fix out-of-bounds bitmap_set() with zero-length range Zhang Yi
2026-05-14 15:08 ` Joanne Koong
2026-05-15 1:57 ` Zhang Yi
2026-05-14 18:10 ` Darrick J. Wong
2026-05-15 1:50 ` Zhang Yi
2026-05-15 17:30 ` Darrick J. Wong [this message]
2026-05-16 1:25 ` Zhang Yi
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=20260515173047.GC9555@frogsfrogsfrogs \
--to=djwong@kernel.org \
--cc=brauner@kernel.org \
--cc=hch@infradead.org \
--cc=linux-ext4@vger.kernel.org \
--cc=linux-fsdevel@vger.kernel.org \
--cc=linux-xfs@vger.kernel.org \
--cc=yangerkun@huawei.com \
--cc=yi.zhang@huawei.com \
--cc=yi.zhang@huaweicloud.com \
--cc=yizhang089@gmail.com \
--cc=yukuai@fnnas.com \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox