Linux CXL
 help / color / mirror / Atom feed
* [PATCH] libnvdimm/labels: Fix divide error in nd_label_data_init()
@ 2025-03-19 11:32 Robert Richter
  2025-03-19 12:10 ` Gupta, Pankaj
  2025-03-19 19:33 ` Ira Weiny
  0 siblings, 2 replies; 4+ messages in thread
From: Robert Richter @ 2025-03-19 11:32 UTC (permalink / raw)
  To: Vishal Verma, Ira Weiny, Dan Williams, Dave Jiang
  Cc: Alison Schofield, Jonathan Cameron, linux-cxl, linux-kernel,
	Davidlohr Bueso, Gregory Price, Terry Bowman, Robert Richter,
	nvdimm

If a CXL memory device returns a broken zero LSA size in its memory
device information (Identify Memory Device (Opcode 4000h), CXL
spec. 3.1, 8.2.9.9.1.1), a divide error occurs in the libnvdimm
driver:

 Oops: divide error: 0000 [#1] PREEMPT SMP NOPTI
 RIP: 0010:nd_label_data_init+0x10e/0x800 [libnvdimm]

Code and flow:

1) CXL Command 4000h returns LSA size = 0,
2) config_size is assigned to zero LSA size (CXL pmem driver):

drivers/cxl/pmem.c:             .config_size = mds->lsa_size,

3) max_xfer is set to zero (nvdimm driver):

drivers/nvdimm/label.c: max_xfer = min_t(size_t, ndd->nsarea.max_xfer, config_size);
drivers/nvdimm/label.c: if (read_size < max_xfer) {
drivers/nvdimm/label.c-         /* trim waste */

4) DIV_ROUND_UP() causes division by zero:

drivers/nvdimm/label.c:         max_xfer -= ((max_xfer - 1) - (config_size - 1) % max_xfer) /
drivers/nvdimm/label.c:                     DIV_ROUND_UP(config_size, max_xfer);
drivers/nvdimm/label.c-         /* make certain we read indexes in exactly 1 read */
drivers/nvdimm/label.c:         if (max_xfer < read_size)
drivers/nvdimm/label.c:                 max_xfer = read_size;
drivers/nvdimm/label.c- }

Fix this by checking the config size parameter by extending an
existing check.

Signed-off-by: Robert Richter <rrichter@amd.com>
---
 drivers/nvdimm/label.c | 3 ++-
 1 file changed, 2 insertions(+), 1 deletion(-)

diff --git a/drivers/nvdimm/label.c b/drivers/nvdimm/label.c
index 082253a3a956..04f4a049599a 100644
--- a/drivers/nvdimm/label.c
+++ b/drivers/nvdimm/label.c
@@ -442,7 +442,8 @@ int nd_label_data_init(struct nvdimm_drvdata *ndd)
 	if (ndd->data)
 		return 0;
 
-	if (ndd->nsarea.status || ndd->nsarea.max_xfer == 0) {
+	if (ndd->nsarea.status || ndd->nsarea.max_xfer == 0 ||
+	    ndd->nsarea.config_size == 0) {
 		dev_dbg(ndd->dev, "failed to init config data area: (%u:%u)\n",
 			ndd->nsarea.max_xfer, ndd->nsarea.config_size);
 		return -ENXIO;
-- 
2.39.5


^ permalink raw reply related	[flat|nested] 4+ messages in thread

* Re: [PATCH] libnvdimm/labels: Fix divide error in nd_label_data_init()
  2025-03-19 11:32 [PATCH] libnvdimm/labels: Fix divide error in nd_label_data_init() Robert Richter
@ 2025-03-19 12:10 ` Gupta, Pankaj
  2025-03-19 19:33 ` Ira Weiny
  1 sibling, 0 replies; 4+ messages in thread
From: Gupta, Pankaj @ 2025-03-19 12:10 UTC (permalink / raw)
  To: Robert Richter, Vishal Verma, Ira Weiny, Dan Williams, Dave Jiang
  Cc: Alison Schofield, Jonathan Cameron, linux-cxl, linux-kernel,
	Davidlohr Bueso, Gregory Price, Terry Bowman, nvdimm

On 3/19/2025 12:32 PM, Robert Richter wrote:
> If a CXL memory device returns a broken zero LSA size in its memory
> device information (Identify Memory Device (Opcode 4000h), CXL
> spec. 3.1, 8.2.9.9.1.1), a divide error occurs in the libnvdimm
> driver:
> 
>   Oops: divide error: 0000 [#1] PREEMPT SMP NOPTI
>   RIP: 0010:nd_label_data_init+0x10e/0x800 [libnvdimm]
> 
> Code and flow:
> 
> 1) CXL Command 4000h returns LSA size = 0,
> 2) config_size is assigned to zero LSA size (CXL pmem driver):
> 
> drivers/cxl/pmem.c:             .config_size = mds->lsa_size,
> 
> 3) max_xfer is set to zero (nvdimm driver):
> 
> drivers/nvdimm/label.c: max_xfer = min_t(size_t, ndd->nsarea.max_xfer, config_size);
> drivers/nvdimm/label.c: if (read_size < max_xfer) {
> drivers/nvdimm/label.c-         /* trim waste */
> 
> 4) DIV_ROUND_UP() causes division by zero:
> 
> drivers/nvdimm/label.c:         max_xfer -= ((max_xfer - 1) - (config_size - 1) % max_xfer) /
> drivers/nvdimm/label.c:                     DIV_ROUND_UP(config_size, max_xfer);
> drivers/nvdimm/label.c-         /* make certain we read indexes in exactly 1 read */
> drivers/nvdimm/label.c:         if (max_xfer < read_size)
> drivers/nvdimm/label.c:                 max_xfer = read_size;
> drivers/nvdimm/label.c- }
> 
> Fix this by checking the config size parameter by extending an
> existing check.
> 
> Signed-off-by: Robert Richter <rrichter@amd.com>

LGTM

Reviewed-by: Pankaj Gupta <pankaj.gupta@amd.com>

> ---
>   drivers/nvdimm/label.c | 3 ++-
>   1 file changed, 2 insertions(+), 1 deletion(-)
> 
> diff --git a/drivers/nvdimm/label.c b/drivers/nvdimm/label.c
> index 082253a3a956..04f4a049599a 100644
> --- a/drivers/nvdimm/label.c
> +++ b/drivers/nvdimm/label.c
> @@ -442,7 +442,8 @@ int nd_label_data_init(struct nvdimm_drvdata *ndd)
>   	if (ndd->data)
>   		return 0;
>   
> -	if (ndd->nsarea.status || ndd->nsarea.max_xfer == 0) {
> +	if (ndd->nsarea.status || ndd->nsarea.max_xfer == 0 ||
> +	    ndd->nsarea.config_size == 0) {
>   		dev_dbg(ndd->dev, "failed to init config data area: (%u:%u)\n",
>   			ndd->nsarea.max_xfer, ndd->nsarea.config_size);
>   		return -ENXIO;


^ permalink raw reply	[flat|nested] 4+ messages in thread

* Re: [PATCH] libnvdimm/labels: Fix divide error in nd_label_data_init()
  2025-03-19 11:32 [PATCH] libnvdimm/labels: Fix divide error in nd_label_data_init() Robert Richter
  2025-03-19 12:10 ` Gupta, Pankaj
@ 2025-03-19 19:33 ` Ira Weiny
  2025-03-20 11:07   ` Robert Richter
  1 sibling, 1 reply; 4+ messages in thread
From: Ira Weiny @ 2025-03-19 19:33 UTC (permalink / raw)
  To: Robert Richter, Vishal Verma, Ira Weiny, Dan Williams, Dave Jiang
  Cc: Alison Schofield, Jonathan Cameron, linux-cxl, linux-kernel,
	Davidlohr Bueso, Gregory Price, Terry Bowman, Robert Richter,
	nvdimm

Robert Richter wrote:
> If a CXL memory device returns a broken zero LSA size in its memory
> device information (Identify Memory Device (Opcode 4000h), CXL
> spec. 3.1, 8.2.9.9.1.1), a divide error occurs in the libnvdimm
> driver:
> 
>  Oops: divide error: 0000 [#1] PREEMPT SMP NOPTI
>  RIP: 0010:nd_label_data_init+0x10e/0x800 [libnvdimm]
> 
> Code and flow:
> 
> 1) CXL Command 4000h returns LSA size = 0,
> 2) config_size is assigned to zero LSA size (CXL pmem driver):
> 
> drivers/cxl/pmem.c:             .config_size = mds->lsa_size,
> 
> 3) max_xfer is set to zero (nvdimm driver):
> 
> drivers/nvdimm/label.c: max_xfer = min_t(size_t, ndd->nsarea.max_xfer, config_size);
> drivers/nvdimm/label.c: if (read_size < max_xfer) {
> drivers/nvdimm/label.c-         /* trim waste */
> 
> 4) DIV_ROUND_UP() causes division by zero:
> 
> drivers/nvdimm/label.c:         max_xfer -= ((max_xfer - 1) - (config_size - 1) % max_xfer) /
> drivers/nvdimm/label.c:                     DIV_ROUND_UP(config_size, max_xfer);

I think this is the wrong DIV_ROUND_UP which is failing because read_size is
never less than max_xfer is it?

I believe the failing DIV_ROUND_UP is after if statement here:

 489         /* Make our initial read size a multiple of max_xfer size */
 490         read_size = min(DIV_ROUND_UP(read_size, max_xfer) * max_xfer,
 491                         config_size);

Apparently nvdimm_get_config_data() was intended to check for this implicitly
but it is too late.

Anyway all this side tracked me a bit.

I assume this is a broken device which is in the real world?  The fix looks
fine.  But could you re-spin with a clean up of the commit message and I'll
queue it up.

Reviewed-by: Ira Weiny <ira.weiny@intel.com>

[snip]

^ permalink raw reply	[flat|nested] 4+ messages in thread

* Re: [PATCH] libnvdimm/labels: Fix divide error in nd_label_data_init()
  2025-03-19 19:33 ` Ira Weiny
@ 2025-03-20 11:07   ` Robert Richter
  0 siblings, 0 replies; 4+ messages in thread
From: Robert Richter @ 2025-03-20 11:07 UTC (permalink / raw)
  To: Ira Weiny
  Cc: Vishal Verma, Dan Williams, Dave Jiang, Alison Schofield,
	Jonathan Cameron, linux-cxl, linux-kernel, Davidlohr Bueso,
	Gregory Price, Terry Bowman, nvdimm

On 19.03.25 14:33:54, Ira Weiny wrote:
> Robert Richter wrote:
> > If a CXL memory device returns a broken zero LSA size in its memory
> > device information (Identify Memory Device (Opcode 4000h), CXL
> > spec. 3.1, 8.2.9.9.1.1), a divide error occurs in the libnvdimm
> > driver:
> > 
> >  Oops: divide error: 0000 [#1] PREEMPT SMP NOPTI
> >  RIP: 0010:nd_label_data_init+0x10e/0x800 [libnvdimm]
> > 
> > Code and flow:
> > 
> > 1) CXL Command 4000h returns LSA size = 0,
> > 2) config_size is assigned to zero LSA size (CXL pmem driver):
> > 
> > drivers/cxl/pmem.c:             .config_size = mds->lsa_size,
> > 
> > 3) max_xfer is set to zero (nvdimm driver):
> > 
> > drivers/nvdimm/label.c: max_xfer = min_t(size_t, ndd->nsarea.max_xfer, config_size);
> > drivers/nvdimm/label.c: if (read_size < max_xfer) {
> > drivers/nvdimm/label.c-         /* trim waste */
> > 
> > 4) DIV_ROUND_UP() causes division by zero:
> > 
> > drivers/nvdimm/label.c:         max_xfer -= ((max_xfer - 1) - (config_size - 1) % max_xfer) /
> > drivers/nvdimm/label.c:                     DIV_ROUND_UP(config_size, max_xfer);
> 
> I think this is the wrong DIV_ROUND_UP which is failing because read_size is
> never less than max_xfer is it?
> 
> I believe the failing DIV_ROUND_UP is after if statement here:
> 
>  489         /* Make our initial read size a multiple of max_xfer size */
>  490         read_size = min(DIV_ROUND_UP(read_size, max_xfer) * max_xfer,
>  491                         config_size);

Yes, it is this one.

> 
> Apparently nvdimm_get_config_data() was intended to check for this implicitly
> but it is too late.
> 
> Anyway all this side tracked me a bit.
> 
> I assume this is a broken device which is in the real world?  The fix looks
> fine.  But could you re-spin with a clean up of the commit message and I'll
> queue it up.

Yes, it was caused by a faulty device.

Sure, will update description and resend.

> 
> Reviewed-by: Ira Weiny <ira.weiny@intel.com>

Thanks for review,

-Robert

^ permalink raw reply	[flat|nested] 4+ messages in thread

end of thread, other threads:[~2025-03-20 11:07 UTC | newest]

Thread overview: 4+ messages (download: mbox.gz follow: Atom feed
-- links below jump to the message on this page --
2025-03-19 11:32 [PATCH] libnvdimm/labels: Fix divide error in nd_label_data_init() Robert Richter
2025-03-19 12:10 ` Gupta, Pankaj
2025-03-19 19:33 ` Ira Weiny
2025-03-20 11:07   ` Robert Richter

This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox