From: John Groves <John@groves.net>
To: Dave Jiang <dave.jiang@intel.com>
Cc: John Groves <john@jagalactic.com>, Dan Williams <djbw@kernel.org>,
John Groves <jgroves@micron.com>,
Vishal Verma <vishal.l.verma@intel.com>,
Matthew Wilcox <willy@infradead.org>, Jan Kara <jack@suse.cz>,
Alexander Viro <viro@zeniv.linux.org.uk>,
Christian Brauner <brauner@kernel.org>,
Miklos Szeredi <miklos@szeredi.hu>,
Alison Schofield <alison.schofield@intel.com>,
Ira Weiny <iweiny@kernel.org>,
Jonathan Cameron <jic23@kernel.org>,
"nvdimm@lists.linux.dev" <nvdimm@lists.linux.dev>,
"linux-cxl@vger.kernel.org" <linux-cxl@vger.kernel.org>,
"linux-kernel@vger.kernel.org" <linux-kernel@vger.kernel.org>,
"linux-fsdevel@vger.kernel.org" <linux-fsdevel@vger.kernel.org>
Subject: Re: [PATCH V2 4/7] dax/fsdev: clamp direct_access return to current physical range
Date: Sat, 30 May 2026 08:06:21 -0500 [thread overview]
Message-ID: <ahrfpRtpeZKKZPyG@groves.net> (raw)
In-Reply-To: <16628b9f-a624-46f8-8a7f-3b9e7963963b@intel.com>
On 26/05/26 05:00PM, Dave Jiang wrote:
>
>
> On 5/22/26 12:19 PM, John Groves wrote:
> > From: John Groves <John@Groves.net>
> >
> > __fsdev_dax_direct_access() returned the number of available pages based
> > on cached_size (the total size across all ranges). For multi-range
> > devices with physical gaps between ranges, this over-reports the number
> > of physically contiguous pages available from the returned kaddr/pfn.
> > Callers trust this return value to mean contiguous pages, so accessing
> > beyond the current range boundary would hit unmapped or unrelated memory.
> >
> > Fix by finding the range that contains the translated physical address
> > and clamping the return to the remaining pages within that range.
> >
> > Also remove the now-unused cached_size field from struct dev_dax, since
> > it was only consumed by the old return calculation.
> >
> > Fixes: 099c81a1f0ab3 ("dax: Add dax_operations for use by fs-dax on fsdev dax")
> > Signed-off-by: John Groves <john@groves.net>
>
> I ran this through Claude and this is what it came back with and it looks reasonable to me:
>
> The claimed bug does not manifest in the current tree. This is a
> refactor + dead-field removal dressed as a bugfix. Either the justification is
> wrong or there's a missing companion change.
>
> The claim doesn't hold up
>
> Commit message:
>
> ▎ __fsdev_dax_direct_access() returned the number of available pages based on
> ▎ cached_size (the total size across all ranges). For multi-range devices with
> ▎ physical gaps between ranges, this over-reports the number of physically
> ▎ contiguous pages available from the returned kaddr/pfn.
>
> Walk through it. __fsdev_dax_direct_access does:
>
> phys = dax_pgoff_to_phys(dev_dax, pgoff, size);
> if (phys == -1)
> return -EFAULT;
> ...
> return PHYS_PFN(min(size, dev_dax->cached_size - offset));
>
> Now look at what dax_pgoff_to_phys (drivers/dax/bus.c:1433) enforces:
>
> phys = PFN_PHYS(pgoff - dax_range->pgoff) + range->start;
> if (phys + size - 1 <= range->end)
> return phys;
> break; /* falls through to return -1 */
>
> It returns valid phys only when the entire requested size fits inside a single
> range. If the request straddles a gap, it returns -1 and the caller bails
> with -EFAULT before ever reaching the clamp.
>
> So in the only path where the clamp runs, we already know size <= range->end -
> phys + 1, which means cached_size - offset >= size, which means min(size,
> cached_size - offset) == size. Pre-patch returns nr_pages. Post-patch returns
> nr_pages. Same answer.
>
> I worked it through with concrete numbers on a 2-range device with a physical
> gap (range[0]=4 pages at 0x1000_0000, range[1]=4 pages at 0x2000_0000):
> - pgoff=2, nr_pages=2: both return 2. ✓
> - pgoff=3, nr_pages=2 (straddles gap): both return -EFAULT via the early bail.
> ✓
> - pgoff=4, nr_pages=2: both return 2. ✓
>
> I cannot construct a case where pre-patch over-reports.
>
> What's actually changed
>
> 1. A no-op clamp loop replaces a constant calculation. Same return value in
> every reachable case.
> 2. The trailing return -EFAULT; is dead code. If dax_pgoff_to_phys returned
> valid phys, then by construction phys lies inside some range in
> dev_dax->ranges[], so the loop will match.
> 3. O(N) walk replaces O(1) arithmetic. Trivial for small nr_range, but it's a
> regression in the wrong direction for a hot path.
> 4. cached_size field is removed. Good cleanup — I grepped, no other consumers.
>
>
> DJ
Thanks Dave. Sashiko reviews can be great, but sometimes they make us (me)
dumber.
Dropping this patch from the series.
John
<snip>
next prev parent reply other threads:[~2026-05-30 13:06 UTC|newest]
Thread overview: 22+ messages / expand[flat|nested] mbox.gz Atom feed top
[not found] <20260522191804.79088-1-john@jagalactic.com>
2026-05-22 19:18 ` [PATCH V2 0/7] Fixes to the previously-merged drivers/dax/fsdev series John Groves
2026-05-22 19:18 ` [PATCH V2 1/7] dax: fix misleading comment about share/index union in dax_folio_reset_order() John Groves
2026-05-26 23:07 ` Dave Jiang
2026-05-29 23:41 ` John Groves
2026-05-22 19:18 ` [PATCH V2 2/7] dax/fsdev: fix multi-range offset, vmemmap_shift leak, and probe error cleanup John Groves
2026-05-26 23:22 ` Dave Jiang
2026-05-29 23:59 ` John Groves
2026-05-22 19:19 ` [PATCH V2 3/7] dax/fsdev: fix kaddr for multi-range and fail probe on invalid pgmap offset John Groves
2026-05-26 23:31 ` Dave Jiang
2026-05-30 0:04 ` John Groves
2026-05-22 19:19 ` [PATCH V2 4/7] dax/fsdev: clamp direct_access return to current physical range John Groves
2026-05-27 0:00 ` Dave Jiang
2026-05-30 13:06 ` John Groves [this message]
2026-05-22 19:19 ` [PATCH V2 5/7] dax: fix holder_ops race in fs_put_dax() John Groves
2026-05-27 0:16 ` Dave Jiang
2026-05-30 14:02 ` John Groves
2026-05-30 14:32 ` John Groves
2026-05-22 19:19 ` [PATCH V2 6/7] dax: replace exported dax_dev_get() with non-allocating dax_dev_find() John Groves
2026-05-27 0:28 ` Dave Jiang
2026-05-30 14:19 ` John Groves
2026-05-22 19:19 ` [PATCH V2 7/7] dax: fsdev.c minor formatting cleanup John Groves
2026-05-27 0:31 ` Dave Jiang
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=ahrfpRtpeZKKZPyG@groves.net \
--to=john@groves.net \
--cc=alison.schofield@intel.com \
--cc=brauner@kernel.org \
--cc=dave.jiang@intel.com \
--cc=djbw@kernel.org \
--cc=iweiny@kernel.org \
--cc=jack@suse.cz \
--cc=jgroves@micron.com \
--cc=jic23@kernel.org \
--cc=john@jagalactic.com \
--cc=linux-cxl@vger.kernel.org \
--cc=linux-fsdevel@vger.kernel.org \
--cc=linux-kernel@vger.kernel.org \
--cc=miklos@szeredi.hu \
--cc=nvdimm@lists.linux.dev \
--cc=viro@zeniv.linux.org.uk \
--cc=vishal.l.verma@intel.com \
--cc=willy@infradead.org \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox