* [PATCH v3] nvme: core: reject invalid LBA data size from Identify Namespace
@ 2026-05-15 18:58 Chao Shi
2026-05-20 19:33 ` Keith Busch
2026-06-02 13:10 ` John Garry
0 siblings, 2 replies; 7+ messages in thread
From: Chao Shi @ 2026-05-15 18:58 UTC (permalink / raw)
To: Keith Busch, Jens Axboe, linux-nvme, linux-kernel
Cc: Christoph Hellwig, Sagi Grimberg, Daniel Wagner, Hannes Reinecke,
Maurizio Lombardi, Chao Shi, Sungwoo Kim, Dave Tian, Weidong Zhu
nvme_update_ns_info_block() trusts id->lbaf[lbaf].ds from the
controller and assigns it directly to ns->head->lba_shift without
bounds checking. nvme_lba_to_sect() then does:
return lba << (head->lba_shift - SECTOR_SHIFT);
When called with lba = le64_to_cpu(id->nsze) to compute the device
capacity, an attacker-controlled controller can choose ds < 9 or a
combination of (ds, nsze) that makes the left shift overflow
sector_t. The former is a C undefined behaviour that UBSAN reports
as a BUG; the latter silently yields a bogus capacity that the
block layer then trusts for bounds checking.
Validate ds against SECTOR_SHIFT and use check_shl_overflow() to
compute capacity so that any (ds, nsze) combination that would
overflow sector_t is rejected. The namespace is skipped with
-ENODEV instead of crashing the kernel. This is reachable by a
malicious NVMe device, a buggy firmware, or an attacker-controlled
NVMe-oF target.
The check is performed before queue_limits_start_update() and
blk_mq_freeze_queue(), so the error path is a plain `goto out` with
no cleanup needed.
Stack trace (UBSAN, ds < 9 variant):
RIP: nvme_lba_to_sect drivers/nvme/host/nvme.h:699 [inline]
RIP: nvme_update_ns_info_block.cold+0x5/0x7
Call Trace:
nvme_update_ns_info+0x175/0xd90 drivers/nvme/host/core.c:2467
nvme_validate_ns drivers/nvme/host/core.c:4299 [inline]
nvme_scan_ns drivers/nvme/host/core.c:4350
nvme_scan_ns_async+0xa5/0xe0 drivers/nvme/host/core.c:4383
async_run_entry_fn
process_one_work
worker_thread
kthread
Found by Syzkaller.
Acked-by: Sungwoo Kim <iam@sung-woo.kim>
Acked-by: Dave Tian <daveti@purdue.edu>
Acked-by: Weidong Zhu <weizhu@fiu.edu>
Signed-off-by: Chao Shi <coshi036@gmail.com>
---
Changes since v2:
- Hoist the validation above queue_limits_start_update() and
blk_mq_freeze_queue(); error path is now a plain `goto out`.
- Merge the ds == 0 case into the general invalid check.
v1: https://lore.kernel.org/linux-nvme/20260418042835.420281-1-coshi036@gmail.com/
v2: https://lore.kernel.org/linux-nvme/20260420231116.748204-2-coshi036@gmail.com/
drivers/nvme/host/core.c | 12 +++++++++++-
1 file changed, 11 insertions(+), 1 deletion(-)
diff --git a/drivers/nvme/host/core.c b/drivers/nvme/host/core.c
index 1e33af94c24b..12ff562dd142 100644
--- a/drivers/nvme/host/core.c
+++ b/drivers/nvme/host/core.c
@@ -2404,12 +2404,22 @@ static int nvme_update_ns_info_block(struct nvme_ns *ns,
goto out;
}
+ if (id->lbaf[lbaf].ds < SECTOR_SHIFT ||
+ check_shl_overflow(le64_to_cpu(id->nsze),
+ id->lbaf[lbaf].ds - SECTOR_SHIFT,
+ &capacity)) {
+ dev_warn_once(ns->ctrl->device,
+ "invalid LBA data size %u, skipping namespace\n",
+ id->lbaf[lbaf].ds);
+ ret = -ENODEV;
+ goto out;
+ }
+
lim = queue_limits_start_update(ns->disk->queue);
memflags = blk_mq_freeze_queue(ns->disk->queue);
ns->head->lba_shift = id->lbaf[lbaf].ds;
ns->head->nuse = le64_to_cpu(id->nuse);
- capacity = nvme_lba_to_sect(ns->head, le64_to_cpu(id->nsze));
nvme_set_ctrl_limits(ns->ctrl, &lim, false);
nvme_configure_metadata(ns->ctrl, ns->head, id, nvm, info);
nvme_set_chunk_sectors(ns, id, &lim);
--
2.43.0
^ permalink raw reply related [flat|nested] 7+ messages in thread* Re: [PATCH v3] nvme: core: reject invalid LBA data size from Identify Namespace 2026-05-15 18:58 [PATCH v3] nvme: core: reject invalid LBA data size from Identify Namespace Chao Shi @ 2026-05-20 19:33 ` Keith Busch 2026-06-02 13:10 ` John Garry 1 sibling, 0 replies; 7+ messages in thread From: Keith Busch @ 2026-05-20 19:33 UTC (permalink / raw) To: Chao Shi Cc: Jens Axboe, linux-nvme, linux-kernel, Christoph Hellwig, Sagi Grimberg, Daniel Wagner, Hannes Reinecke, Maurizio Lombardi, Sungwoo Kim, Dave Tian, Weidong Zhu On Fri, May 15, 2026 at 02:58:53PM -0400, Chao Shi wrote: > nvme_update_ns_info_block() trusts id->lbaf[lbaf].ds from the > controller and assigns it directly to ns->head->lba_shift without > bounds checking. nvme_lba_to_sect() then does: > > return lba << (head->lba_shift - SECTOR_SHIFT); > > When called with lba = le64_to_cpu(id->nsze) to compute the device > capacity, an attacker-controlled controller can choose ds < 9 or a > combination of (ds, nsze) that makes the left shift overflow > sector_t. The former is a C undefined behaviour that UBSAN reports > as a BUG; the latter silently yields a bogus capacity that the > block layer then trusts for bounds checking. > > Validate ds against SECTOR_SHIFT and use check_shl_overflow() to > compute capacity so that any (ds, nsze) combination that would > overflow sector_t is rejected. The namespace is skipped with > -ENODEV instead of crashing the kernel. This is reachable by a > malicious NVMe device, a buggy firmware, or an attacker-controlled > NVMe-oF target. Thanks, applied to nvme-7.2. ^ permalink raw reply [flat|nested] 7+ messages in thread
* Re: [PATCH v3] nvme: core: reject invalid LBA data size from Identify Namespace 2026-05-15 18:58 [PATCH v3] nvme: core: reject invalid LBA data size from Identify Namespace Chao Shi 2026-05-20 19:33 ` Keith Busch @ 2026-06-02 13:10 ` John Garry 2026-06-02 15:15 ` Keith Busch 1 sibling, 1 reply; 7+ messages in thread From: John Garry @ 2026-06-02 13:10 UTC (permalink / raw) To: Chao Shi, Keith Busch, Jens Axboe, linux-nvme, linux-kernel Cc: Christoph Hellwig, Sagi Grimberg, Daniel Wagner, Hannes Reinecke, Maurizio Lombardi, Sungwoo Kim, Dave Tian, Weidong Zhu On 15/05/2026 19:58, Chao Shi wrote: > > + if (id->lbaf[lbaf].ds < SECTOR_SHIFT || > + check_shl_overflow(le64_to_cpu(id->nsze),> + id->lbaf[lbaf].ds - SECTOR_SHIFT, > + &capacity)) { > + dev_warn_once(ns->ctrl->device, > + "invalid LBA data size %u, skipping namespace\n", > + id->lbaf[lbaf].ds); > + ret = -ENODEV; > + goto out; > + } JFYI, this is giving a C=1 warning: drivers/nvme/host/core.c:2411:13: warning: unsigned value that used to be signed checked against zero? drivers/nvme/host/core.c:2411:13: signed value source I can't seem to quieten it myself, though. BTW, I would have thought that check_shl_overflow would catch id->lbaf[lbaf].ds < SECTOR_SHIFT (so that we don't need the extra check). ^ permalink raw reply [flat|nested] 7+ messages in thread
* Re: [PATCH v3] nvme: core: reject invalid LBA data size from Identify Namespace 2026-06-02 13:10 ` John Garry @ 2026-06-02 15:15 ` Keith Busch 2026-06-02 15:42 ` Keith Busch 0 siblings, 1 reply; 7+ messages in thread From: Keith Busch @ 2026-06-02 15:15 UTC (permalink / raw) To: John Garry Cc: Chao Shi, Jens Axboe, linux-nvme, linux-kernel, Christoph Hellwig, Sagi Grimberg, Daniel Wagner, Hannes Reinecke, Maurizio Lombardi, Sungwoo Kim, Dave Tian, Weidong Zhu On Tue, Jun 02, 2026 at 02:10:07PM +0100, John Garry wrote: > On 15/05/2026 19:58, Chao Shi wrote: > > + if (id->lbaf[lbaf].ds < SECTOR_SHIFT || > > + check_shl_overflow(le64_to_cpu(id->nsze),> + id->lbaf[lbaf].ds - > SECTOR_SHIFT, > > + &capacity)) { > > + dev_warn_once(ns->ctrl->device, > > + "invalid LBA data size %u, skipping namespace\n", > > + id->lbaf[lbaf].ds); > > + ret = -ENODEV; > > + goto out; > > + } > > JFYI, this is giving a C=1 warning: > > drivers/nvme/host/core.c:2411:13: warning: unsigned value that used to be signed checked against zero? > drivers/nvme/host/core.c:2411:13: signed value source > > I can't seem to quieten it myself, though. > > BTW, I would have thought that check_shl_overflow would catch > id->lbaf[lbaf].ds < SECTOR_SHIFT (so that we don't need the extra check). I see it too. check_shl_overflow has checks that suggest it was expecting a signed type, as all the < 0 checks don't make sense for unsigned. The warning seems harmless, but I'd too like to see it suppressed. I think it's odd that I'm not seeing a similar error for the similar usage in generic_check_addressable() from fs/libfs.c. They look the same to me with respect to the types passed in. ^ permalink raw reply [flat|nested] 7+ messages in thread
* Re: [PATCH v3] nvme: core: reject invalid LBA data size from Identify Namespace 2026-06-02 15:15 ` Keith Busch @ 2026-06-02 15:42 ` Keith Busch 2026-06-02 16:18 ` John Garry 0 siblings, 1 reply; 7+ messages in thread From: Keith Busch @ 2026-06-02 15:42 UTC (permalink / raw) To: John Garry Cc: Chao Shi, Jens Axboe, linux-nvme, linux-kernel, Christoph Hellwig, Sagi Grimberg, Daniel Wagner, Hannes Reinecke, Maurizio Lombardi, Sungwoo Kim, Dave Tian, Weidong Zhu On Tue, Jun 02, 2026 at 04:15:41PM +0100, Keith Busch wrote: > On Tue, Jun 02, 2026 at 02:10:07PM +0100, John Garry wrote: > > On 15/05/2026 19:58, Chao Shi wrote: > > > + if (id->lbaf[lbaf].ds < SECTOR_SHIFT || > > > + check_shl_overflow(le64_to_cpu(id->nsze),> + id->lbaf[lbaf].ds - > > SECTOR_SHIFT, > > > + &capacity)) { > > > + dev_warn_once(ns->ctrl->device, > > > + "invalid LBA data size %u, skipping namespace\n", > > > + id->lbaf[lbaf].ds); > > > + ret = -ENODEV; > > > + goto out; > > > + } > > > > JFYI, this is giving a C=1 warning: > > > > drivers/nvme/host/core.c:2411:13: warning: unsigned value that used to be signed checked against zero? > > drivers/nvme/host/core.c:2411:13: signed value source > > > > I can't seem to quieten it myself, though. > > > > BTW, I would have thought that check_shl_overflow would catch > > id->lbaf[lbaf].ds < SECTOR_SHIFT (so that we don't need the extra check). > > I see it too. check_shl_overflow has checks that suggest it was > expecting a signed type, as all the < 0 checks don't make sense for > unsigned. The warning seems harmless, but I'd too like to see it > suppressed. > > I think it's odd that I'm not seeing a similar error for the similar > usage in generic_check_addressable() from fs/libfs.c. They look the same > to me with respect to the types passed in. It appears that sparse is having trouble with the type provenance of a __bitwise __le64 type. No idea why. As a test, I replaced the le64_to_cpu() to a u64 type on stack initialized to a random ULL value and the warning goes away. I say we can ignore the sparse warning, or we can rewrite this to avoid the check_shl_overflow entirely. --- diff --git a/drivers/nvme/host/core.c b/drivers/nvme/host/core.c index cad9d97352615..6409a8218e3eb 100644 --- a/drivers/nvme/host/core.c +++ b/drivers/nvme/host/core.c @@ -2372,8 +2372,8 @@ static int nvme_update_ns_info_block(struct nvme_ns *ns, struct nvme_zone_info zi = {}; struct nvme_id_ns *id; unsigned int memflags; - sector_t capacity; - unsigned lbaf; + unsigned lbaf, shift = 0; + u64 capacity, nsze; int ret; ret = nvme_identify_ns(ns->ctrl, info->nsid, &id); @@ -2407,10 +2407,13 @@ static int nvme_update_ns_info_block(struct nvme_ns *ns, goto out; } - if (id->lbaf[lbaf].ds < SECTOR_SHIFT || - check_shl_overflow(le64_to_cpu(id->nsze), - id->lbaf[lbaf].ds - SECTOR_SHIFT, - &capacity)) { + nsze = le64_to_cpu(id->nsze); + if (id->lbaf[lbaf].ds >= SECTOR_SHIFT) + shift = id->lbaf[lbaf].ds - SECTOR_SHIFT; + + if (shift < SECTOR_SHIFT || shift >= 64 || nsze > U64_MAX >> shift) { dev_warn_once(ns->ctrl->device, "invalid LBA data size %u, skipping namespace\n", id->lbaf[lbaf].ds); -- ^ permalink raw reply related [flat|nested] 7+ messages in thread
* Re: [PATCH v3] nvme: core: reject invalid LBA data size from Identify Namespace 2026-06-02 15:42 ` Keith Busch @ 2026-06-02 16:18 ` John Garry 2026-06-03 10:08 ` John Garry 0 siblings, 1 reply; 7+ messages in thread From: John Garry @ 2026-06-02 16:18 UTC (permalink / raw) To: Keith Busch Cc: Chao Shi, Jens Axboe, linux-nvme, linux-kernel, Christoph Hellwig, Sagi Grimberg, Daniel Wagner, Hannes Reinecke, Maurizio Lombardi, Sungwoo Kim, Dave Tian, Weidong Zhu On 02/06/2026 16:42, Keith Busch wrote: > On Tue, Jun 02, 2026 at 04:15:41PM +0100, Keith Busch wrote: >> On Tue, Jun 02, 2026 at 02:10:07PM +0100, John Garry wrote: >>> On 15/05/2026 19:58, Chao Shi wrote: >>>> + if (id->lbaf[lbaf].ds < SECTOR_SHIFT || >>>> + check_shl_overflow(le64_to_cpu(id->nsze),> + id->lbaf[lbaf].ds - >>> SECTOR_SHIFT, >>>> + &capacity)) { >>>> + dev_warn_once(ns->ctrl->device, >>>> + "invalid LBA data size %u, skipping namespace\n", >>>> + id->lbaf[lbaf].ds); >>>> + ret = -ENODEV; >>>> + goto out; >>>> + } >>> >>> JFYI, this is giving a C=1 warning: >>> >>> drivers/nvme/host/core.c:2411:13: warning: unsigned value that used to be signed checked against zero? >>> drivers/nvme/host/core.c:2411:13: signed value source >>> >>> I can't seem to quieten it myself, though. >>> >>> BTW, I would have thought that check_shl_overflow would catch >>> id->lbaf[lbaf].ds < SECTOR_SHIFT (so that we don't need the extra check). >> >> I see it too. check_shl_overflow has checks that suggest it was >> expecting a signed type, as all the < 0 checks don't make sense for >> unsigned. The warning seems harmless, but I'd too like to see it >> suppressed. >> >> I think it's odd that I'm not seeing a similar error for the similar >> usage in generic_check_addressable() from fs/libfs.c. They look the same >> to me with respect to the types passed in. > > It appears that sparse is having trouble with the type provenance of a > __bitwise __le64 type. No idea why. As a test, I replaced the > le64_to_cpu() to a u64 type on stack initialized to a random ULL value > and the warning goes away. Yeah, ditto. > I say we can ignore the sparse warning, or we > can rewrite this to avoid the check_shl_overflow entirely. Since sparse is having problems with le64_to_cpu(), I suppose using your own version is ok. It would be nice to know the root cause of this issue, though... cheers > > --- > diff --git a/drivers/nvme/host/core.c b/drivers/nvme/host/core.c > index cad9d97352615..6409a8218e3eb 100644 > --- a/drivers/nvme/host/core.c > +++ b/drivers/nvme/host/core.c > @@ -2372,8 +2372,8 @@ static int nvme_update_ns_info_block(struct nvme_ns *ns, > struct nvme_zone_info zi = {}; > struct nvme_id_ns *id; > unsigned int memflags; > - sector_t capacity; > - unsigned lbaf; > + unsigned lbaf, shift = 0; > + u64 capacity, nsze; > int ret; > > ret = nvme_identify_ns(ns->ctrl, info->nsid, &id); > @@ -2407,10 +2407,13 @@ static int nvme_update_ns_info_block(struct nvme_ns *ns, > goto out; > } > > - if (id->lbaf[lbaf].ds < SECTOR_SHIFT || > - check_shl_overflow(le64_to_cpu(id->nsze), > - id->lbaf[lbaf].ds - SECTOR_SHIFT, > - &capacity)) { > + nsze = le64_to_cpu(id->nsze); > + if (id->lbaf[lbaf].ds >= SECTOR_SHIFT) > + shift = id->lbaf[lbaf].ds - SECTOR_SHIFT; > + > + if (shift < SECTOR_SHIFT || shift >= 64 || nsze > U64_MAX >> shift) { > dev_warn_once(ns->ctrl->device, > "invalid LBA data size %u, skipping namespace\n", > id->lbaf[lbaf].ds); > -- ^ permalink raw reply [flat|nested] 7+ messages in thread
* Re: [PATCH v3] nvme: core: reject invalid LBA data size from Identify Namespace 2026-06-02 16:18 ` John Garry @ 2026-06-03 10:08 ` John Garry 0 siblings, 0 replies; 7+ messages in thread From: John Garry @ 2026-06-03 10:08 UTC (permalink / raw) To: Keith Busch Cc: Chao Shi, Jens Axboe, linux-nvme, linux-kernel, Christoph Hellwig, Sagi Grimberg, Daniel Wagner, Hannes Reinecke, Maurizio Lombardi, Sungwoo Kim, Dave Tian, Weidong Zhu On 02/06/2026 17:18, John Garry wrote: > >> I say we can ignore the sparse warning, or we >> can rewrite this to avoid the check_shl_overflow entirely. > FWIW, adding a separate function keeps sparse happy for me: diff --git a/drivers/nvme/host/core.c b/drivers/nvme/host/core.c index ea837b94d3e5..3ec98038668e 100644 --- a/drivers/nvme/host/core.c +++ b/drivers/nvme/host/core.c @@ -2364,6 +2364,11 @@ static int nvme_query_fdp_info(struct nvme_ns *ns, struct nvme_ns_info *info) return ret; } +static bool nvme_valid_ds(u64 nsze, signed int shift, u64 *capacity) +{ + return check_shl_overflow(nsze, shift, capacity); +} + static int nvme_update_ns_info_block(struct nvme_ns *ns, struct nvme_ns_info *info) { @@ -2407,10 +2412,8 @@ static int nvme_update_ns_info_block(struct nvme_ns *ns, goto out; } - if (id->lbaf[lbaf].ds < SECTOR_SHIFT || - check_shl_overflow(le64_to_cpu(id->nsze), - id->lbaf[lbaf].ds - SECTOR_SHIFT, - &capacity)) { + if (nvme_valid_ds(le64_to_cpu(id->nsze), + id->lbaf[lbaf].ds - SECTOR_SHIFT, &capacity)) { dev_warn_once(ns->ctrl->device, "invalid LBA data size %u, skipping namespace\n", id->lbaf[lbaf].ds); ^ permalink raw reply related [flat|nested] 7+ messages in thread
end of thread, other threads:[~2026-06-03 10:08 UTC | newest] Thread overview: 7+ messages (download: mbox.gz follow: Atom feed -- links below jump to the message on this page -- 2026-05-15 18:58 [PATCH v3] nvme: core: reject invalid LBA data size from Identify Namespace Chao Shi 2026-05-20 19:33 ` Keith Busch 2026-06-02 13:10 ` John Garry 2026-06-02 15:15 ` Keith Busch 2026-06-02 15:42 ` Keith Busch 2026-06-02 16:18 ` John Garry 2026-06-03 10:08 ` John Garry
This is an external index of several public inboxes, see mirroring instructions on how to clone and mirror all data and code used by this external index.