public inbox for linux-bcachefs@vger.kernel.org
 help / color / mirror / Atom feed
From: Brian Foster <bfoster@redhat.com>
To: Kent Overstreet <kent.overstreet@linux.dev>
Cc: linux-bcachefs@vger.kernel.org
Subject: Re: [PATCH] bcachefs: fix iov_iter count underflow on sub-block dio read
Date: Mon, 5 Feb 2024 14:38:13 -0500	[thread overview]
Message-ID: <ZcE5JY279PGUrP9F@bfoster> (raw)
In-Reply-To: <4fdyk3to57izolmljjiehnn5kt5dxckhh2oodo7vk6gssgqbsb@6gsjfytnv2xy>

On Mon, Feb 05, 2024 at 02:15:36PM -0500, Kent Overstreet wrote:
> On Mon, Feb 05, 2024 at 10:48:14AM -0500, Brian Foster wrote:
> > bch2_direct_IO_read() checks the request offset and size for sector
> > alignment and then falls through to a couple calculations to shrink
> > the size of the request based on the inode size. The problem is that
> > these checks round up to the fs block size, which runs the risk of
> > underflowing iter->count if the block size happens to be large
> > enough. This is triggered by fstest generic/361 with a 4k block
> > size, which subsequently leads to a crash.
> > 
> > After some discussion, the original purpose of the shorten logic in this
> > path is not totally clear. It appears to be intended as an optimization
> > of limited value, so simplify things and avoid the underflow problem by
> > removing it.
> 
> Thinking about this a bit more...
> 
> All that we're avoiding with the shorten is zeroing out the rest of the
> read, which will happen in bch2_read_extent(); we won't be issuing any
> IO, since that would require real data extents above i_size, which is of
> course not allowed.
> 
> But zeroing out the rest of the read is actual work that's being
> avoided, since if we return a short read (which is what we're doing
> here) it's undefined what happens to the rest of the buffer.
> 
> So... maybe we should keep it, for the crazy but inevitable application
> that uses a 1MB read buffer for everything, including O_DIRECT reads to
> 4k files?
> 

I had another change around that basically just protected against the
underflow. I.e., something along the lines of the following, to also
cover the additional use of shorten later in the function:

	if (shorten >= iter->count)
		shorten = 0;

... and IIRC that seemed to work as well. But I'll probably have to take
a closer look at that large buffer case to grok what's happening and the
original purpose of this logic.

Brian


  reply	other threads:[~2024-02-05 19:36 UTC|newest]

Thread overview: 6+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2024-02-05 15:48 [PATCH] bcachefs: fix iov_iter count underflow on sub-block dio read Brian Foster
2024-02-05 19:15 ` Kent Overstreet
2024-02-05 19:38   ` Brian Foster [this message]
2024-02-09  2:39     ` Su Yue
2024-02-14 18:13       ` Brian Foster
2024-02-15  6:59         ` Su Yue

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=ZcE5JY279PGUrP9F@bfoster \
    --to=bfoster@redhat.com \
    --cc=kent.overstreet@linux.dev \
    --cc=linux-bcachefs@vger.kernel.org \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox