* [PATCH 0/4] dax: Fix use after free and other cleanups
@ 2023-06-03 6:13 Dan Williams
2023-06-03 6:13 ` [PATCH 1/4] dax: Fix dax_mapping_release() use after free Dan Williams
` (3 more replies)
0 siblings, 4 replies; 18+ messages in thread
From: Dan Williams @ 2023-06-03 6:13 UTC (permalink / raw)
To: nvdimm; +Cc: Yongqiang Liu, Paul Cassella, Ira Weiny, linux-cxl
As mentioned in patch3, the reference counting of dax_region objects is
needlessly complicated, has lead to confusion [1], and has hidden a bug
[2]. While testing the cleanup for those issues, a
CONFIG_DEBUG_KOBJECT_RELEASE test run uncovered a use-after-free in
dax_mapping_release(). Clean all of that up.
Thanks to Yongqiang, Paul, and Ira for their analysis.
[1]: http://lore.kernel.org/r/20221203095858.612027-1-liuyongqiang13@huawei.com
[2]: http://lore.kernel.org/r/3cf0890b-4eb0-e70e-cd9c-2ecc3d496263@hpe.com
---
Dan Williams (4):
dax: Fix dax_mapping_release() use after free
dax: Use device_unregister() in unregister_dax_mapping()
dax: Introduce alloc_dev_dax_id()
dax: Cleanup extra dax_region references
drivers/dax/bus.c | 64 +++++++++++++++++++++++++++------------------
drivers/dax/bus.h | 1 -
drivers/dax/cxl.c | 8 +-----
drivers/dax/dax-private.h | 4 ++-
drivers/dax/hmem/hmem.c | 8 +-----
drivers/dax/pmem.c | 7 +----
6 files changed, 44 insertions(+), 48 deletions(-)
base-commit: ac2263b588dffd3a1efd7ed0b156ea6c5aea200d
^ permalink raw reply [flat|nested] 18+ messages in thread
* [PATCH 1/4] dax: Fix dax_mapping_release() use after free
2023-06-03 6:13 [PATCH 0/4] dax: Fix use after free and other cleanups Dan Williams
@ 2023-06-03 6:13 ` Dan Williams
2023-06-04 2:40 ` Ira Weiny
` (2 more replies)
2023-06-03 6:13 ` [PATCH 2/4] dax: Use device_unregister() in unregister_dax_mapping() Dan Williams
` (2 subsequent siblings)
3 siblings, 3 replies; 18+ messages in thread
From: Dan Williams @ 2023-06-03 6:13 UTC (permalink / raw)
To: nvdimm; +Cc: linux-cxl
A CONFIG_DEBUG_KOBJECT_RELEASE test of removing a device-dax region
provider (like modprobe -r dax_hmem) yields:
kobject: 'mapping0' (ffff93eb460e8800): kobject_release, parent 0000000000000000 (delayed 2000)
[..]
DEBUG_LOCKS_WARN_ON(1)
WARNING: CPU: 23 PID: 282 at kernel/locking/lockdep.c:232 __lock_acquire+0x9fc/0x2260
[..]
RIP: 0010:__lock_acquire+0x9fc/0x2260
[..]
Call Trace:
<TASK>
[..]
lock_acquire+0xd4/0x2c0
? ida_free+0x62/0x130
_raw_spin_lock_irqsave+0x47/0x70
? ida_free+0x62/0x130
ida_free+0x62/0x130
dax_mapping_release+0x1f/0x30
device_release+0x36/0x90
kobject_delayed_cleanup+0x46/0x150
Due to attempting ida_free() on an ida object that has already been
freed. Devices typically only hold a reference on their parent while
registered. If a child needs a parent object to complete its release it
needs to hold a reference that it drops from its release callback.
Arrange for a dax_mapping to pin its parent dev_dax instance until
dax_mapping_release().
Fixes: 0b07ce872a9e ("device-dax: introduce 'mapping' devices")
Signed-off-by: Dan Williams <dan.j.williams@intel.com>
---
drivers/dax/bus.c | 5 ++++-
1 file changed, 4 insertions(+), 1 deletion(-)
diff --git a/drivers/dax/bus.c b/drivers/dax/bus.c
index 227800053309..aee695f86b44 100644
--- a/drivers/dax/bus.c
+++ b/drivers/dax/bus.c
@@ -635,10 +635,12 @@ EXPORT_SYMBOL_GPL(alloc_dax_region);
static void dax_mapping_release(struct device *dev)
{
struct dax_mapping *mapping = to_dax_mapping(dev);
- struct dev_dax *dev_dax = to_dev_dax(dev->parent);
+ struct device *parent = dev->parent;
+ struct dev_dax *dev_dax = to_dev_dax(parent);
ida_free(&dev_dax->ida, mapping->id);
kfree(mapping);
+ put_device(parent);
}
static void unregister_dax_mapping(void *data)
@@ -778,6 +780,7 @@ static int devm_register_dax_mapping(struct dev_dax *dev_dax, int range_id)
dev = &mapping->dev;
device_initialize(dev);
dev->parent = &dev_dax->dev;
+ get_device(dev->parent);
dev->type = &dax_mapping_type;
dev_set_name(dev, "mapping%d", mapping->id);
rc = device_add(dev);
^ permalink raw reply related [flat|nested] 18+ messages in thread
* [PATCH 2/4] dax: Use device_unregister() in unregister_dax_mapping()
2023-06-03 6:13 [PATCH 0/4] dax: Fix use after free and other cleanups Dan Williams
2023-06-03 6:13 ` [PATCH 1/4] dax: Fix dax_mapping_release() use after free Dan Williams
@ 2023-06-03 6:13 ` Dan Williams
2023-06-04 2:41 ` Ira Weiny
` (2 more replies)
2023-06-03 6:14 ` [PATCH 3/4] dax: Introduce alloc_dev_dax_id() Dan Williams
2023-06-03 6:14 ` [PATCH 4/4] dax: Cleanup extra dax_region references Dan Williams
3 siblings, 3 replies; 18+ messages in thread
From: Dan Williams @ 2023-06-03 6:13 UTC (permalink / raw)
To: nvdimm; +Cc: linux-cxl
Replace an open-coded device_unregister() sequence with the helper.
Signed-off-by: Dan Williams <dan.j.williams@intel.com>
---
drivers/dax/bus.c | 3 +--
1 file changed, 1 insertion(+), 2 deletions(-)
diff --git a/drivers/dax/bus.c b/drivers/dax/bus.c
index aee695f86b44..c99ea08aafc3 100644
--- a/drivers/dax/bus.c
+++ b/drivers/dax/bus.c
@@ -657,8 +657,7 @@ static void unregister_dax_mapping(void *data)
dev_dax->ranges[mapping->range_id].mapping = NULL;
mapping->range_id = -1;
- device_del(dev);
- put_device(dev);
+ device_unregister(dev);
}
static struct dev_dax_range *get_dax_range(struct device *dev)
^ permalink raw reply related [flat|nested] 18+ messages in thread
* [PATCH 3/4] dax: Introduce alloc_dev_dax_id()
2023-06-03 6:13 [PATCH 0/4] dax: Fix use after free and other cleanups Dan Williams
2023-06-03 6:13 ` [PATCH 1/4] dax: Fix dax_mapping_release() use after free Dan Williams
2023-06-03 6:13 ` [PATCH 2/4] dax: Use device_unregister() in unregister_dax_mapping() Dan Williams
@ 2023-06-03 6:14 ` Dan Williams
2023-06-04 2:57 ` Ira Weiny
2023-06-03 6:14 ` [PATCH 4/4] dax: Cleanup extra dax_region references Dan Williams
3 siblings, 1 reply; 18+ messages in thread
From: Dan Williams @ 2023-06-03 6:14 UTC (permalink / raw)
To: nvdimm; +Cc: Yongqiang Liu, Paul Cassella, Ira Weiny, linux-cxl
The reference counting of dax_region objects is needlessly complicated,
has lead to confusion [1], and has hidden a bug [2]. Towards cleaning up
that mess introduce alloc_dev_dax_id() to minimize the holding of a
dax_region reference to only what dev_dax_release() needs, the
dax_region->ida.
Part of the reason for the mess was the design to dereference a
dax_region in all cases in free_dev_dax_id() even if the id was
statically assigned by the upper level dax_region driver. Remove the
need to call "is_static(dax_region)" by tracking whether the id is
dynamic directly in the dev_dax instance itself.
With that flag the dax_region pinning and release per dev_dax instance
can move to alloc_dev_dax_id() and free_dev_dax_id() respectively.
A follow-on cleanup address the unnecessary references in the dax_region
setup and drivers.
Fixes: 0f3da14a4f05 ("device-dax: introduce 'seed' devices")
Link: http://lore.kernel.org/r/20221203095858.612027-1-liuyongqiang13@huawei.com [1]
Link: http://lore.kernel.org/r/3cf0890b-4eb0-e70e-cd9c-2ecc3d496263@hpe.com [2]
Reported-by: Yongqiang Liu <liuyongqiang13@huawei.com>
Reported-by: Paul Cassella <cassella@hpe.com>
Reported-by: Ira Weiny <ira.weiny@intel.com>
Signed-off-by: Dan Williams <dan.j.williams@intel.com>
---
drivers/dax/bus.c | 56 +++++++++++++++++++++++++++------------------
drivers/dax/dax-private.h | 4 ++-
2 files changed, 37 insertions(+), 23 deletions(-)
diff --git a/drivers/dax/bus.c b/drivers/dax/bus.c
index c99ea08aafc3..a4cc3eca774f 100644
--- a/drivers/dax/bus.c
+++ b/drivers/dax/bus.c
@@ -446,18 +446,34 @@ static void unregister_dev_dax(void *dev)
put_device(dev);
}
+static void dax_region_free(struct kref *kref)
+{
+ struct dax_region *dax_region;
+
+ dax_region = container_of(kref, struct dax_region, kref);
+ kfree(dax_region);
+}
+
+void dax_region_put(struct dax_region *dax_region)
+{
+ kref_put(&dax_region->kref, dax_region_free);
+}
+EXPORT_SYMBOL_GPL(dax_region_put);
+
/* a return value >= 0 indicates this invocation invalidated the id */
static int __free_dev_dax_id(struct dev_dax *dev_dax)
{
- struct dax_region *dax_region = dev_dax->region;
struct device *dev = &dev_dax->dev;
+ struct dax_region *dax_region;
int rc = dev_dax->id;
device_lock_assert(dev);
- if (is_static(dax_region) || dev_dax->id < 0)
+ if (!dev_dax->dyn_id || dev_dax->id < 0)
return -1;
+ dax_region = dev_dax->region;
ida_free(&dax_region->ida, dev_dax->id);
+ dax_region_put(dax_region);
dev_dax->id = -1;
return rc;
}
@@ -473,6 +489,20 @@ static int free_dev_dax_id(struct dev_dax *dev_dax)
return rc;
}
+static int alloc_dev_dax_id(struct dev_dax *dev_dax)
+{
+ struct dax_region *dax_region = dev_dax->region;
+ int id;
+
+ id = ida_alloc(&dax_region->ida, GFP_KERNEL);
+ if (id < 0)
+ return id;
+ kref_get(&dax_region->kref);
+ dev_dax->dyn_id = true;
+ dev_dax->id = id;
+ return id;
+}
+
static ssize_t delete_store(struct device *dev, struct device_attribute *attr,
const char *buf, size_t len)
{
@@ -560,20 +590,6 @@ static const struct attribute_group *dax_region_attribute_groups[] = {
NULL,
};
-static void dax_region_free(struct kref *kref)
-{
- struct dax_region *dax_region;
-
- dax_region = container_of(kref, struct dax_region, kref);
- kfree(dax_region);
-}
-
-void dax_region_put(struct dax_region *dax_region)
-{
- kref_put(&dax_region->kref, dax_region_free);
-}
-EXPORT_SYMBOL_GPL(dax_region_put);
-
static void dax_region_unregister(void *region)
{
struct dax_region *dax_region = region;
@@ -1297,12 +1313,10 @@ static const struct attribute_group *dax_attribute_groups[] = {
static void dev_dax_release(struct device *dev)
{
struct dev_dax *dev_dax = to_dev_dax(dev);
- struct dax_region *dax_region = dev_dax->region;
struct dax_device *dax_dev = dev_dax->dax_dev;
put_dax(dax_dev);
free_dev_dax_id(dev_dax);
- dax_region_put(dax_region);
kfree(dev_dax->pgmap);
kfree(dev_dax);
}
@@ -1326,6 +1340,7 @@ struct dev_dax *devm_create_dev_dax(struct dev_dax_data *data)
if (!dev_dax)
return ERR_PTR(-ENOMEM);
+ dev_dax->region = dax_region;
if (is_static(dax_region)) {
if (dev_WARN_ONCE(parent, data->id < 0,
"dynamic id specified to static region\n")) {
@@ -1341,13 +1356,11 @@ struct dev_dax *devm_create_dev_dax(struct dev_dax_data *data)
goto err_id;
}
- rc = ida_alloc(&dax_region->ida, GFP_KERNEL);
+ rc = alloc_dev_dax_id(dev_dax);
if (rc < 0)
goto err_id;
- dev_dax->id = rc;
}
- dev_dax->region = dax_region;
dev = &dev_dax->dev;
device_initialize(dev);
dev_set_name(dev, "dax%d.%d", dax_region->id, dev_dax->id);
@@ -1388,7 +1401,6 @@ struct dev_dax *devm_create_dev_dax(struct dev_dax_data *data)
dev_dax->target_node = dax_region->target_node;
dev_dax->align = dax_region->align;
ida_init(&dev_dax->ida);
- kref_get(&dax_region->kref);
inode = dax_inode(dax_dev);
dev->devt = inode->i_rdev;
diff --git a/drivers/dax/dax-private.h b/drivers/dax/dax-private.h
index 1c974b7caae6..afcada6fd2ed 100644
--- a/drivers/dax/dax-private.h
+++ b/drivers/dax/dax-private.h
@@ -52,7 +52,8 @@ struct dax_mapping {
* @region - parent region
* @dax_dev - core dax functionality
* @target_node: effective numa node if dev_dax memory range is onlined
- * @id: ida allocated id
+ * @dyn_id: is this a dynamic or statically created instance
+ * @id: ida allocated id when the dax_region is not static
* @ida: mapping id allocator
* @dev - device core
* @pgmap - pgmap for memmap setup / lifetime (driver owned)
@@ -64,6 +65,7 @@ struct dev_dax {
struct dax_device *dax_dev;
unsigned int align;
int target_node;
+ bool dyn_id;
int id;
struct ida ida;
struct device dev;
^ permalink raw reply related [flat|nested] 18+ messages in thread
* [PATCH 4/4] dax: Cleanup extra dax_region references
2023-06-03 6:13 [PATCH 0/4] dax: Fix use after free and other cleanups Dan Williams
` (2 preceding siblings ...)
2023-06-03 6:14 ` [PATCH 3/4] dax: Introduce alloc_dev_dax_id() Dan Williams
@ 2023-06-03 6:14 ` Dan Williams
2023-06-04 2:58 ` Ira Weiny
` (2 more replies)
3 siblings, 3 replies; 18+ messages in thread
From: Dan Williams @ 2023-06-03 6:14 UTC (permalink / raw)
To: nvdimm; +Cc: Ira Weiny, linux-cxl
Now that free_dev_dax_id() internally manages the references it needs
the extra references taken by the dax_region drivers are not needed.
Reported-by: Ira Weiny <ira.weiny@intel.com>
Signed-off-by: Dan Williams <dan.j.williams@intel.com>
---
drivers/dax/bus.c | 4 +---
drivers/dax/bus.h | 1 -
drivers/dax/cxl.c | 8 +-------
drivers/dax/hmem/hmem.c | 8 +-------
drivers/dax/pmem.c | 7 +------
5 files changed, 4 insertions(+), 24 deletions(-)
diff --git a/drivers/dax/bus.c b/drivers/dax/bus.c
index a4cc3eca774f..0ee96e6fc426 100644
--- a/drivers/dax/bus.c
+++ b/drivers/dax/bus.c
@@ -454,11 +454,10 @@ static void dax_region_free(struct kref *kref)
kfree(dax_region);
}
-void dax_region_put(struct dax_region *dax_region)
+static void dax_region_put(struct dax_region *dax_region)
{
kref_put(&dax_region->kref, dax_region_free);
}
-EXPORT_SYMBOL_GPL(dax_region_put);
/* a return value >= 0 indicates this invocation invalidated the id */
static int __free_dev_dax_id(struct dev_dax *dev_dax)
@@ -641,7 +640,6 @@ struct dax_region *alloc_dax_region(struct device *parent, int region_id,
return NULL;
}
- kref_get(&dax_region->kref);
if (devm_add_action_or_reset(parent, dax_region_unregister, dax_region))
return NULL;
return dax_region;
diff --git a/drivers/dax/bus.h b/drivers/dax/bus.h
index 8cd79ab34292..bdbf719df5c5 100644
--- a/drivers/dax/bus.h
+++ b/drivers/dax/bus.h
@@ -9,7 +9,6 @@ struct dev_dax;
struct resource;
struct dax_device;
struct dax_region;
-void dax_region_put(struct dax_region *dax_region);
/* dax bus specific ioresource flags */
#define IORESOURCE_DAX_STATIC BIT(0)
diff --git a/drivers/dax/cxl.c b/drivers/dax/cxl.c
index ccdf8de85bd5..8bc9d04034d6 100644
--- a/drivers/dax/cxl.c
+++ b/drivers/dax/cxl.c
@@ -13,7 +13,6 @@ static int cxl_dax_region_probe(struct device *dev)
struct cxl_region *cxlr = cxlr_dax->cxlr;
struct dax_region *dax_region;
struct dev_dax_data data;
- struct dev_dax *dev_dax;
if (nid == NUMA_NO_NODE)
nid = memory_add_physaddr_to_nid(cxlr_dax->hpa_range.start);
@@ -28,13 +27,8 @@ static int cxl_dax_region_probe(struct device *dev)
.id = -1,
.size = range_len(&cxlr_dax->hpa_range),
};
- dev_dax = devm_create_dev_dax(&data);
- if (IS_ERR(dev_dax))
- return PTR_ERR(dev_dax);
- /* child dev_dax instances now own the lifetime of the dax_region */
- dax_region_put(dax_region);
- return 0;
+ return PTR_ERR_OR_ZERO(devm_create_dev_dax(&data));
}
static struct cxl_driver cxl_dax_region_driver = {
diff --git a/drivers/dax/hmem/hmem.c b/drivers/dax/hmem/hmem.c
index e5fe8b39fb94..5d2ddef0f8f5 100644
--- a/drivers/dax/hmem/hmem.c
+++ b/drivers/dax/hmem/hmem.c
@@ -16,7 +16,6 @@ static int dax_hmem_probe(struct platform_device *pdev)
struct dax_region *dax_region;
struct memregion_info *mri;
struct dev_dax_data data;
- struct dev_dax *dev_dax;
/*
* @region_idle == true indicates that an administrative agent
@@ -38,13 +37,8 @@ static int dax_hmem_probe(struct platform_device *pdev)
.id = -1,
.size = region_idle ? 0 : range_len(&mri->range),
};
- dev_dax = devm_create_dev_dax(&data);
- if (IS_ERR(dev_dax))
- return PTR_ERR(dev_dax);
- /* child dev_dax instances now own the lifetime of the dax_region */
- dax_region_put(dax_region);
- return 0;
+ return PTR_ERR_OR_ZERO(devm_create_dev_dax(&data));
}
static struct platform_driver dax_hmem_driver = {
diff --git a/drivers/dax/pmem.c b/drivers/dax/pmem.c
index f050ea78bb83..ae0cb113a5d3 100644
--- a/drivers/dax/pmem.c
+++ b/drivers/dax/pmem.c
@@ -13,7 +13,6 @@ static struct dev_dax *__dax_pmem_probe(struct device *dev)
int rc, id, region_id;
resource_size_t offset;
struct nd_pfn_sb *pfn_sb;
- struct dev_dax *dev_dax;
struct dev_dax_data data;
struct nd_namespace_io *nsio;
struct dax_region *dax_region;
@@ -65,12 +64,8 @@ static struct dev_dax *__dax_pmem_probe(struct device *dev)
.pgmap = &pgmap,
.size = range_len(&range),
};
- dev_dax = devm_create_dev_dax(&data);
- /* child dev_dax instances now own the lifetime of the dax_region */
- dax_region_put(dax_region);
-
- return dev_dax;
+ return devm_create_dev_dax(&data);
}
static int dax_pmem_probe(struct device *dev)
^ permalink raw reply related [flat|nested] 18+ messages in thread
* Re: [PATCH 1/4] dax: Fix dax_mapping_release() use after free
2023-06-03 6:13 ` [PATCH 1/4] dax: Fix dax_mapping_release() use after free Dan Williams
@ 2023-06-04 2:40 ` Ira Weiny
2023-06-05 20:45 ` Fan Ni
2023-06-15 17:33 ` Dave Jiang
2 siblings, 0 replies; 18+ messages in thread
From: Ira Weiny @ 2023-06-04 2:40 UTC (permalink / raw)
To: Dan Williams, nvdimm; +Cc: linux-cxl
Dan Williams wrote:
> A CONFIG_DEBUG_KOBJECT_RELEASE test of removing a device-dax region
> provider (like modprobe -r dax_hmem) yields:
>
> kobject: 'mapping0' (ffff93eb460e8800): kobject_release, parent 0000000000000000 (delayed 2000)
> [..]
> DEBUG_LOCKS_WARN_ON(1)
> WARNING: CPU: 23 PID: 282 at kernel/locking/lockdep.c:232 __lock_acquire+0x9fc/0x2260
> [..]
> RIP: 0010:__lock_acquire+0x9fc/0x2260
> [..]
> Call Trace:
> <TASK>
> [..]
> lock_acquire+0xd4/0x2c0
> ? ida_free+0x62/0x130
> _raw_spin_lock_irqsave+0x47/0x70
> ? ida_free+0x62/0x130
> ida_free+0x62/0x130
> dax_mapping_release+0x1f/0x30
> device_release+0x36/0x90
> kobject_delayed_cleanup+0x46/0x150
>
> Due to attempting ida_free() on an ida object that has already been
> freed. Devices typically only hold a reference on their parent while
> registered. If a child needs a parent object to complete its release it
> needs to hold a reference that it drops from its release callback.
> Arrange for a dax_mapping to pin its parent dev_dax instance until
> dax_mapping_release().
>
> Fixes: 0b07ce872a9e ("device-dax: introduce 'mapping' devices")
Reviewed-by: Ira Weiny <ira.weiny@intel.com>
^ permalink raw reply [flat|nested] 18+ messages in thread
* Re: [PATCH 2/4] dax: Use device_unregister() in unregister_dax_mapping()
2023-06-03 6:13 ` [PATCH 2/4] dax: Use device_unregister() in unregister_dax_mapping() Dan Williams
@ 2023-06-04 2:41 ` Ira Weiny
2023-06-05 20:46 ` Fan Ni
2023-06-15 17:34 ` Dave Jiang
2 siblings, 0 replies; 18+ messages in thread
From: Ira Weiny @ 2023-06-04 2:41 UTC (permalink / raw)
To: Dan Williams, nvdimm; +Cc: linux-cxl
Dan Williams wrote:
> Replace an open-coded device_unregister() sequence with the helper.
>
Reviewed-by: Ira Weiny <ira.weiny@intel.com>
> Signed-off-by: Dan Williams <dan.j.williams@intel.com>
> ---
^ permalink raw reply [flat|nested] 18+ messages in thread
* Re: [PATCH 3/4] dax: Introduce alloc_dev_dax_id()
2023-06-03 6:14 ` [PATCH 3/4] dax: Introduce alloc_dev_dax_id() Dan Williams
@ 2023-06-04 2:57 ` Ira Weiny
2023-06-16 1:22 ` Dan Williams
0 siblings, 1 reply; 18+ messages in thread
From: Ira Weiny @ 2023-06-04 2:57 UTC (permalink / raw)
To: Dan Williams, nvdimm; +Cc: Yongqiang Liu, Paul Cassella, Ira Weiny, linux-cxl
Dan Williams wrote:
> The reference counting of dax_region objects is needlessly complicated,
> has lead to confusion [1], and has hidden a bug [2]. Towards cleaning up
> that mess introduce alloc_dev_dax_id() to minimize the holding of a
> dax_region reference to only what dev_dax_release() needs, the
> dax_region->ida.
>
> Part of the reason for the mess was the design to dereference a
> dax_region in all cases in free_dev_dax_id() even if the id was
> statically assigned by the upper level dax_region driver. Remove the
> need to call "is_static(dax_region)" by tracking whether the id is
> dynamic directly in the dev_dax instance itself.
>
> With that flag the dax_region pinning and release per dev_dax instance
> can move to alloc_dev_dax_id() and free_dev_dax_id() respectively.
>
> A follow-on cleanup address the unnecessary references in the dax_region
> setup and drivers.
>
> Fixes: 0f3da14a4f05 ("device-dax: introduce 'seed' devices")
> Link: http://lore.kernel.org/r/20221203095858.612027-1-liuyongqiang13@huawei.com [1]
> Link: http://lore.kernel.org/r/3cf0890b-4eb0-e70e-cd9c-2ecc3d496263@hpe.com [2]
> Reported-by: Yongqiang Liu <liuyongqiang13@huawei.com>
> Reported-by: Paul Cassella <cassella@hpe.com>
> Reported-by: Ira Weiny <ira.weiny@intel.com>
> Signed-off-by: Dan Williams <dan.j.williams@intel.com>
> ---
> drivers/dax/bus.c | 56 +++++++++++++++++++++++++++------------------
> drivers/dax/dax-private.h | 4 ++-
> 2 files changed, 37 insertions(+), 23 deletions(-)
>
> diff --git a/drivers/dax/bus.c b/drivers/dax/bus.c
> index c99ea08aafc3..a4cc3eca774f 100644
> --- a/drivers/dax/bus.c
> +++ b/drivers/dax/bus.c
> @@ -446,18 +446,34 @@ static void unregister_dev_dax(void *dev)
> put_device(dev);
> }
>
> +static void dax_region_free(struct kref *kref)
> +{
> + struct dax_region *dax_region;
> +
> + dax_region = container_of(kref, struct dax_region, kref);
> + kfree(dax_region);
> +}
> +
> +void dax_region_put(struct dax_region *dax_region)
> +{
> + kref_put(&dax_region->kref, dax_region_free);
> +}
> +EXPORT_SYMBOL_GPL(dax_region_put);
> +
> /* a return value >= 0 indicates this invocation invalidated the id */
> static int __free_dev_dax_id(struct dev_dax *dev_dax)
> {
> - struct dax_region *dax_region = dev_dax->region;
> struct device *dev = &dev_dax->dev;
> + struct dax_region *dax_region;
> int rc = dev_dax->id;
>
> device_lock_assert(dev);
>
> - if (is_static(dax_region) || dev_dax->id < 0)
> + if (!dev_dax->dyn_id || dev_dax->id < 0)
> return -1;
> + dax_region = dev_dax->region;
> ida_free(&dax_region->ida, dev_dax->id);
> + dax_region_put(dax_region);
> dev_dax->id = -1;
> return rc;
> }
> @@ -473,6 +489,20 @@ static int free_dev_dax_id(struct dev_dax *dev_dax)
> return rc;
> }
>
> +static int alloc_dev_dax_id(struct dev_dax *dev_dax)
> +{
> + struct dax_region *dax_region = dev_dax->region;
> + int id;
> +
> + id = ida_alloc(&dax_region->ida, GFP_KERNEL);
> + if (id < 0)
> + return id;
> + kref_get(&dax_region->kref);
> + dev_dax->dyn_id = true;
> + dev_dax->id = id;
> + return id;
> +}
> +
> static ssize_t delete_store(struct device *dev, struct device_attribute *attr,
> const char *buf, size_t len)
> {
> @@ -560,20 +590,6 @@ static const struct attribute_group *dax_region_attribute_groups[] = {
> NULL,
> };
>
> -static void dax_region_free(struct kref *kref)
> -{
> - struct dax_region *dax_region;
> -
> - dax_region = container_of(kref, struct dax_region, kref);
> - kfree(dax_region);
> -}
> -
> -void dax_region_put(struct dax_region *dax_region)
> -{
> - kref_put(&dax_region->kref, dax_region_free);
> -}
> -EXPORT_SYMBOL_GPL(dax_region_put);
> -
> static void dax_region_unregister(void *region)
> {
> struct dax_region *dax_region = region;
> @@ -1297,12 +1313,10 @@ static const struct attribute_group *dax_attribute_groups[] = {
> static void dev_dax_release(struct device *dev)
> {
> struct dev_dax *dev_dax = to_dev_dax(dev);
> - struct dax_region *dax_region = dev_dax->region;
> struct dax_device *dax_dev = dev_dax->dax_dev;
>
> put_dax(dax_dev);
> free_dev_dax_id(dev_dax);
> - dax_region_put(dax_region);
> kfree(dev_dax->pgmap);
> kfree(dev_dax);
> }
> @@ -1326,6 +1340,7 @@ struct dev_dax *devm_create_dev_dax(struct dev_dax_data *data)
> if (!dev_dax)
> return ERR_PTR(-ENOMEM);
>
> + dev_dax->region = dax_region;
Overall I like that this reference is not needed to be carried and/or
managed by the callers.
However, here you are referencing the dax_region from the dev_dax in an
unrelated place to where the reference matters (in id management).
Could alloc_dev_dax_id() change to:
static int alloc_dev_dax_id(struct dev_dax *dev_dax, struct dax_region *dax_region)
{
...
}
Then make this assignment next to where the kref is taken so it is clear
that this is the only user of the reference?
I did not pick up on the fact this reference was only needed to free the
id at all in reviewing the code and I think this would make it even more
clear.
Ira
> if (is_static(dax_region)) {
> if (dev_WARN_ONCE(parent, data->id < 0,
> "dynamic id specified to static region\n")) {
> @@ -1341,13 +1356,11 @@ struct dev_dax *devm_create_dev_dax(struct dev_dax_data *data)
> goto err_id;
> }
>
> - rc = ida_alloc(&dax_region->ida, GFP_KERNEL);
> + rc = alloc_dev_dax_id(dev_dax);
> if (rc < 0)
> goto err_id;
> - dev_dax->id = rc;
> }
>
> - dev_dax->region = dax_region;
> dev = &dev_dax->dev;
> device_initialize(dev);
> dev_set_name(dev, "dax%d.%d", dax_region->id, dev_dax->id);
> @@ -1388,7 +1401,6 @@ struct dev_dax *devm_create_dev_dax(struct dev_dax_data *data)
> dev_dax->target_node = dax_region->target_node;
> dev_dax->align = dax_region->align;
> ida_init(&dev_dax->ida);
> - kref_get(&dax_region->kref);
>
> inode = dax_inode(dax_dev);
> dev->devt = inode->i_rdev;
> diff --git a/drivers/dax/dax-private.h b/drivers/dax/dax-private.h
> index 1c974b7caae6..afcada6fd2ed 100644
> --- a/drivers/dax/dax-private.h
> +++ b/drivers/dax/dax-private.h
> @@ -52,7 +52,8 @@ struct dax_mapping {
> * @region - parent region
> * @dax_dev - core dax functionality
> * @target_node: effective numa node if dev_dax memory range is onlined
> - * @id: ida allocated id
> + * @dyn_id: is this a dynamic or statically created instance
> + * @id: ida allocated id when the dax_region is not static
> * @ida: mapping id allocator
> * @dev - device core
> * @pgmap - pgmap for memmap setup / lifetime (driver owned)
> @@ -64,6 +65,7 @@ struct dev_dax {
> struct dax_device *dax_dev;
> unsigned int align;
> int target_node;
> + bool dyn_id;
> int id;
> struct ida ida;
> struct device dev;
>
^ permalink raw reply [flat|nested] 18+ messages in thread
* Re: [PATCH 4/4] dax: Cleanup extra dax_region references
2023-06-03 6:14 ` [PATCH 4/4] dax: Cleanup extra dax_region references Dan Williams
@ 2023-06-04 2:58 ` Ira Weiny
2023-06-06 17:46 ` Fan Ni
2023-06-15 17:45 ` Dave Jiang
2 siblings, 0 replies; 18+ messages in thread
From: Ira Weiny @ 2023-06-04 2:58 UTC (permalink / raw)
To: Dan Williams, nvdimm; +Cc: Ira Weiny, linux-cxl
Dan Williams wrote:
> Now that free_dev_dax_id() internally manages the references it needs
> the extra references taken by the dax_region drivers are not needed.
>
> Reported-by: Ira Weiny <ira.weiny@intel.com>
Reviewed-by: Ira Weiny <ira.weiny@intel.com>
> Signed-off-by: Dan Williams <dan.j.williams@intel.com>
> ---
> drivers/dax/bus.c | 4 +---
> drivers/dax/bus.h | 1 -
> drivers/dax/cxl.c | 8 +-------
> drivers/dax/hmem/hmem.c | 8 +-------
> drivers/dax/pmem.c | 7 +------
> 5 files changed, 4 insertions(+), 24 deletions(-)
>
> diff --git a/drivers/dax/bus.c b/drivers/dax/bus.c
> index a4cc3eca774f..0ee96e6fc426 100644
> --- a/drivers/dax/bus.c
> +++ b/drivers/dax/bus.c
> @@ -454,11 +454,10 @@ static void dax_region_free(struct kref *kref)
> kfree(dax_region);
> }
>
> -void dax_region_put(struct dax_region *dax_region)
> +static void dax_region_put(struct dax_region *dax_region)
> {
> kref_put(&dax_region->kref, dax_region_free);
> }
> -EXPORT_SYMBOL_GPL(dax_region_put);
>
> /* a return value >= 0 indicates this invocation invalidated the id */
> static int __free_dev_dax_id(struct dev_dax *dev_dax)
> @@ -641,7 +640,6 @@ struct dax_region *alloc_dax_region(struct device *parent, int region_id,
> return NULL;
> }
>
> - kref_get(&dax_region->kref);
> if (devm_add_action_or_reset(parent, dax_region_unregister, dax_region))
> return NULL;
> return dax_region;
> diff --git a/drivers/dax/bus.h b/drivers/dax/bus.h
> index 8cd79ab34292..bdbf719df5c5 100644
> --- a/drivers/dax/bus.h
> +++ b/drivers/dax/bus.h
> @@ -9,7 +9,6 @@ struct dev_dax;
> struct resource;
> struct dax_device;
> struct dax_region;
> -void dax_region_put(struct dax_region *dax_region);
>
> /* dax bus specific ioresource flags */
> #define IORESOURCE_DAX_STATIC BIT(0)
> diff --git a/drivers/dax/cxl.c b/drivers/dax/cxl.c
> index ccdf8de85bd5..8bc9d04034d6 100644
> --- a/drivers/dax/cxl.c
> +++ b/drivers/dax/cxl.c
> @@ -13,7 +13,6 @@ static int cxl_dax_region_probe(struct device *dev)
> struct cxl_region *cxlr = cxlr_dax->cxlr;
> struct dax_region *dax_region;
> struct dev_dax_data data;
> - struct dev_dax *dev_dax;
>
> if (nid == NUMA_NO_NODE)
> nid = memory_add_physaddr_to_nid(cxlr_dax->hpa_range.start);
> @@ -28,13 +27,8 @@ static int cxl_dax_region_probe(struct device *dev)
> .id = -1,
> .size = range_len(&cxlr_dax->hpa_range),
> };
> - dev_dax = devm_create_dev_dax(&data);
> - if (IS_ERR(dev_dax))
> - return PTR_ERR(dev_dax);
>
> - /* child dev_dax instances now own the lifetime of the dax_region */
> - dax_region_put(dax_region);
> - return 0;
> + return PTR_ERR_OR_ZERO(devm_create_dev_dax(&data));
> }
>
> static struct cxl_driver cxl_dax_region_driver = {
> diff --git a/drivers/dax/hmem/hmem.c b/drivers/dax/hmem/hmem.c
> index e5fe8b39fb94..5d2ddef0f8f5 100644
> --- a/drivers/dax/hmem/hmem.c
> +++ b/drivers/dax/hmem/hmem.c
> @@ -16,7 +16,6 @@ static int dax_hmem_probe(struct platform_device *pdev)
> struct dax_region *dax_region;
> struct memregion_info *mri;
> struct dev_dax_data data;
> - struct dev_dax *dev_dax;
>
> /*
> * @region_idle == true indicates that an administrative agent
> @@ -38,13 +37,8 @@ static int dax_hmem_probe(struct platform_device *pdev)
> .id = -1,
> .size = region_idle ? 0 : range_len(&mri->range),
> };
> - dev_dax = devm_create_dev_dax(&data);
> - if (IS_ERR(dev_dax))
> - return PTR_ERR(dev_dax);
>
> - /* child dev_dax instances now own the lifetime of the dax_region */
> - dax_region_put(dax_region);
> - return 0;
> + return PTR_ERR_OR_ZERO(devm_create_dev_dax(&data));
> }
>
> static struct platform_driver dax_hmem_driver = {
> diff --git a/drivers/dax/pmem.c b/drivers/dax/pmem.c
> index f050ea78bb83..ae0cb113a5d3 100644
> --- a/drivers/dax/pmem.c
> +++ b/drivers/dax/pmem.c
> @@ -13,7 +13,6 @@ static struct dev_dax *__dax_pmem_probe(struct device *dev)
> int rc, id, region_id;
> resource_size_t offset;
> struct nd_pfn_sb *pfn_sb;
> - struct dev_dax *dev_dax;
> struct dev_dax_data data;
> struct nd_namespace_io *nsio;
> struct dax_region *dax_region;
> @@ -65,12 +64,8 @@ static struct dev_dax *__dax_pmem_probe(struct device *dev)
> .pgmap = &pgmap,
> .size = range_len(&range),
> };
> - dev_dax = devm_create_dev_dax(&data);
>
> - /* child dev_dax instances now own the lifetime of the dax_region */
> - dax_region_put(dax_region);
> -
> - return dev_dax;
> + return devm_create_dev_dax(&data);
> }
>
> static int dax_pmem_probe(struct device *dev)
>
^ permalink raw reply [flat|nested] 18+ messages in thread
* Re: [PATCH 1/4] dax: Fix dax_mapping_release() use after free
2023-06-03 6:13 ` [PATCH 1/4] dax: Fix dax_mapping_release() use after free Dan Williams
2023-06-04 2:40 ` Ira Weiny
@ 2023-06-05 20:45 ` Fan Ni
2023-06-15 17:33 ` Dave Jiang
2 siblings, 0 replies; 18+ messages in thread
From: Fan Ni @ 2023-06-05 20:45 UTC (permalink / raw)
To: Dan Williams
Cc: nvdimm@lists.linux.dev, linux-cxl@vger.kernel.org,
Adam Manzanares, dave@stgolabs.net, nmtadam.samsung@gmail.com,
nifan@outlook.com
On Fri, Jun 02, 2023 at 11:13:54PM -0700, Dan Williams wrote:
> A CONFIG_DEBUG_KOBJECT_RELEASE test of removing a device-dax region
> provider (like modprobe -r dax_hmem) yields:
>
> kobject: 'mapping0' (ffff93eb460e8800): kobject_release, parent 0000000000000000 (delayed 2000)
> [..]
> DEBUG_LOCKS_WARN_ON(1)
> WARNING: CPU: 23 PID: 282 at kernel/locking/lockdep.c:232 __lock_acquire+0x9fc/0x2260
> [..]
> RIP: 0010:__lock_acquire+0x9fc/0x2260
> [..]
> Call Trace:
> <TASK>
> [..]
> lock_acquire+0xd4/0x2c0
> ? ida_free+0x62/0x130
> _raw_spin_lock_irqsave+0x47/0x70
> ? ida_free+0x62/0x130
> ida_free+0x62/0x130
> dax_mapping_release+0x1f/0x30
> device_release+0x36/0x90
> kobject_delayed_cleanup+0x46/0x150
>
> Due to attempting ida_free() on an ida object that has already been
> freed. Devices typically only hold a reference on their parent while
> registered. If a child needs a parent object to complete its release it
> needs to hold a reference that it drops from its release callback.
> Arrange for a dax_mapping to pin its parent dev_dax instance until
> dax_mapping_release().
>
> Fixes: 0b07ce872a9e ("device-dax: introduce 'mapping' devices")
> Signed-off-by: Dan Williams <dan.j.williams@intel.com>
> ---
Reviewed-by: Fan Ni <fan.ni@samsung.com>
> drivers/dax/bus.c | 5 ++++-
> 1 file changed, 4 insertions(+), 1 deletion(-)
>
> diff --git a/drivers/dax/bus.c b/drivers/dax/bus.c
> index 227800053309..aee695f86b44 100644
> --- a/drivers/dax/bus.c
> +++ b/drivers/dax/bus.c
> @@ -635,10 +635,12 @@ EXPORT_SYMBOL_GPL(alloc_dax_region);
> static void dax_mapping_release(struct device *dev)
> {
> struct dax_mapping *mapping = to_dax_mapping(dev);
> - struct dev_dax *dev_dax = to_dev_dax(dev->parent);
> + struct device *parent = dev->parent;
> + struct dev_dax *dev_dax = to_dev_dax(parent);
>
> ida_free(&dev_dax->ida, mapping->id);
> kfree(mapping);
> + put_device(parent);
> }
>
> static void unregister_dax_mapping(void *data)
> @@ -778,6 +780,7 @@ static int devm_register_dax_mapping(struct dev_dax *dev_dax, int range_id)
> dev = &mapping->dev;
> device_initialize(dev);
> dev->parent = &dev_dax->dev;
> + get_device(dev->parent);
> dev->type = &dax_mapping_type;
> dev_set_name(dev, "mapping%d", mapping->id);
> rc = device_add(dev);
>
>
^ permalink raw reply [flat|nested] 18+ messages in thread
* Re: [PATCH 2/4] dax: Use device_unregister() in unregister_dax_mapping()
2023-06-03 6:13 ` [PATCH 2/4] dax: Use device_unregister() in unregister_dax_mapping() Dan Williams
2023-06-04 2:41 ` Ira Weiny
@ 2023-06-05 20:46 ` Fan Ni
2023-06-15 17:34 ` Dave Jiang
2 siblings, 0 replies; 18+ messages in thread
From: Fan Ni @ 2023-06-05 20:46 UTC (permalink / raw)
To: Dan Williams
Cc: nvdimm@lists.linux.dev, linux-cxl@vger.kernel.org,
Adam Manzanares, dave@stgolabs.net, nmtadam.samsung@gmail.com,
nifan@outlook.com
On Fri, Jun 02, 2023 at 11:13:59PM -0700, Dan Williams wrote:
> Replace an open-coded device_unregister() sequence with the helper.
>
> Signed-off-by: Dan Williams <dan.j.williams@intel.com>
> ---
> drivers/dax/bus.c | 3 +--
> 1 file changed, 1 insertion(+), 2 deletions(-)
>
> diff --git a/drivers/dax/bus.c b/drivers/dax/bus.c
> index aee695f86b44..c99ea08aafc3 100644
> --- a/drivers/dax/bus.c
> +++ b/drivers/dax/bus.c
> @@ -657,8 +657,7 @@ static void unregister_dax_mapping(void *data)
> dev_dax->ranges[mapping->range_id].mapping = NULL;
> mapping->range_id = -1;
>
> - device_del(dev);
> - put_device(dev);
> + device_unregister(dev);
> }
>
> static struct dev_dax_range *get_dax_range(struct device *dev)
>
>
Reviewed-by: Fan Ni <fan.ni@samsung.com>
^ permalink raw reply [flat|nested] 18+ messages in thread
* Re: [PATCH 4/4] dax: Cleanup extra dax_region references
2023-06-03 6:14 ` [PATCH 4/4] dax: Cleanup extra dax_region references Dan Williams
2023-06-04 2:58 ` Ira Weiny
@ 2023-06-06 17:46 ` Fan Ni
2023-06-06 20:42 ` Ira Weiny
2023-06-15 17:45 ` Dave Jiang
2 siblings, 1 reply; 18+ messages in thread
From: Fan Ni @ 2023-06-06 17:46 UTC (permalink / raw)
To: Dan Williams
Cc: nvdimm, Ira Weiny, linux-cxl, a.manzanares, dave, nmtadam.samsung,
nifan, fan.ni
The 06/02/2023 23:14, Dan Williams wrote:
> Now that free_dev_dax_id() internally manages the references it needs
> the extra references taken by the dax_region drivers are not needed.
>
> Reported-by: Ira Weiny <ira.weiny@intel.com>
> Signed-off-by: Dan Williams <dan.j.williams@intel.com>
Reviewed-by: Fan Ni <fan.ni@samsung.com>
One minor comment as below.
> ---
> drivers/dax/bus.c | 4 +---
> drivers/dax/bus.h | 1 -
> drivers/dax/cxl.c | 8 +-------
> drivers/dax/hmem/hmem.c | 8 +-------
> drivers/dax/pmem.c | 7 +------
> 5 files changed, 4 insertions(+), 24 deletions(-)
>
> diff --git a/drivers/dax/bus.c b/drivers/dax/bus.c
> index a4cc3eca774f..0ee96e6fc426 100644
> --- a/drivers/dax/bus.c
> +++ b/drivers/dax/bus.c
> @@ -454,11 +454,10 @@ static void dax_region_free(struct kref *kref)
> kfree(dax_region);
> }
>
> -void dax_region_put(struct dax_region *dax_region)
> +static void dax_region_put(struct dax_region *dax_region)
> {
> kref_put(&dax_region->kref, dax_region_free);
> }
> -EXPORT_SYMBOL_GPL(dax_region_put);
>
> /* a return value >= 0 indicates this invocation invalidated the id */
> static int __free_dev_dax_id(struct dev_dax *dev_dax)
> @@ -641,7 +640,6 @@ struct dax_region *alloc_dax_region(struct device *parent, int region_id,
> return NULL;
> }
>
> - kref_get(&dax_region->kref);
> if (devm_add_action_or_reset(parent, dax_region_unregister, dax_region))
> return NULL;
> return dax_region;
> diff --git a/drivers/dax/bus.h b/drivers/dax/bus.h
> index 8cd79ab34292..bdbf719df5c5 100644
> --- a/drivers/dax/bus.h
> +++ b/drivers/dax/bus.h
> @@ -9,7 +9,6 @@ struct dev_dax;
> struct resource;
> struct dax_device;
> struct dax_region;
> -void dax_region_put(struct dax_region *dax_region);
>
> /* dax bus specific ioresource flags */
> #define IORESOURCE_DAX_STATIC BIT(0)
> diff --git a/drivers/dax/cxl.c b/drivers/dax/cxl.c
> index ccdf8de85bd5..8bc9d04034d6 100644
> --- a/drivers/dax/cxl.c
> +++ b/drivers/dax/cxl.c
> @@ -13,7 +13,6 @@ static int cxl_dax_region_probe(struct device *dev)
> struct cxl_region *cxlr = cxlr_dax->cxlr;
> struct dax_region *dax_region;
> struct dev_dax_data data;
> - struct dev_dax *dev_dax;
>
> if (nid == NUMA_NO_NODE)
> nid = memory_add_physaddr_to_nid(cxlr_dax->hpa_range.start);
> @@ -28,13 +27,8 @@ static int cxl_dax_region_probe(struct device *dev)
> .id = -1,
> .size = range_len(&cxlr_dax->hpa_range),
> };
> - dev_dax = devm_create_dev_dax(&data);
> - if (IS_ERR(dev_dax))
> - return PTR_ERR(dev_dax);
>
> - /* child dev_dax instances now own the lifetime of the dax_region */
> - dax_region_put(dax_region);
> - return 0;
> + return PTR_ERR_OR_ZERO(devm_create_dev_dax(&data));
> }
>
> static struct cxl_driver cxl_dax_region_driver = {
> diff --git a/drivers/dax/hmem/hmem.c b/drivers/dax/hmem/hmem.c
> index e5fe8b39fb94..5d2ddef0f8f5 100644
> --- a/drivers/dax/hmem/hmem.c
> +++ b/drivers/dax/hmem/hmem.c
> @@ -16,7 +16,6 @@ static int dax_hmem_probe(struct platform_device *pdev)
> struct dax_region *dax_region;
> struct memregion_info *mri;
> struct dev_dax_data data;
> - struct dev_dax *dev_dax;
>
> /*
> * @region_idle == true indicates that an administrative agent
> @@ -38,13 +37,8 @@ static int dax_hmem_probe(struct platform_device *pdev)
> .id = -1,
> .size = region_idle ? 0 : range_len(&mri->range),
> };
> - dev_dax = devm_create_dev_dax(&data);
> - if (IS_ERR(dev_dax))
> - return PTR_ERR(dev_dax);
>
> - /* child dev_dax instances now own the lifetime of the dax_region */
> - dax_region_put(dax_region);
> - return 0;
> + return PTR_ERR_OR_ZERO(devm_create_dev_dax(&data));
> }
>
> static struct platform_driver dax_hmem_driver = {
> diff --git a/drivers/dax/pmem.c b/drivers/dax/pmem.c
> index f050ea78bb83..ae0cb113a5d3 100644
> --- a/drivers/dax/pmem.c
> +++ b/drivers/dax/pmem.c
> @@ -13,7 +13,6 @@ static struct dev_dax *__dax_pmem_probe(struct device *dev)
> int rc, id, region_id;
> resource_size_t offset;
> struct nd_pfn_sb *pfn_sb;
> - struct dev_dax *dev_dax;
> struct dev_dax_data data;
> struct nd_namespace_io *nsio;
> struct dax_region *dax_region;
> @@ -65,12 +64,8 @@ static struct dev_dax *__dax_pmem_probe(struct device *dev)
> .pgmap = &pgmap,
> .size = range_len(&range),
> };
> - dev_dax = devm_create_dev_dax(&data);
>
> - /* child dev_dax instances now own the lifetime of the dax_region */
> - dax_region_put(dax_region);
> -
> - return dev_dax;
> + return devm_create_dev_dax(&data);
Not related to the patch, but why we do not need to check the returned
value of devm_create_dev_dax as above? Or do we really need the check as
the function already returns ERR_PTR if failed?
Fan
> }
>
> static int dax_pmem_probe(struct device *dev)
>
--
Fan Ni <nifan@outlook.com>
^ permalink raw reply [flat|nested] 18+ messages in thread
* Re: [PATCH 4/4] dax: Cleanup extra dax_region references
2023-06-06 17:46 ` Fan Ni
@ 2023-06-06 20:42 ` Ira Weiny
0 siblings, 0 replies; 18+ messages in thread
From: Ira Weiny @ 2023-06-06 20:42 UTC (permalink / raw)
To: Fan Ni, Dan Williams
Cc: nvdimm, Ira Weiny, linux-cxl, a.manzanares, dave, nmtadam.samsung,
nifan, fan.ni
Fan Ni wrote:
> The 06/02/2023 23:14, Dan Williams wrote:
> > Now that free_dev_dax_id() internally manages the references it needs
> > the extra references taken by the dax_region drivers are not needed.
> >
> > Reported-by: Ira Weiny <ira.weiny@intel.com>
> > Signed-off-by: Dan Williams <dan.j.williams@intel.com>
>
> Reviewed-by: Fan Ni <fan.ni@samsung.com>
> One minor comment as below.
>
[snip]
> > static struct platform_driver dax_hmem_driver = {
> > diff --git a/drivers/dax/pmem.c b/drivers/dax/pmem.c
> > index f050ea78bb83..ae0cb113a5d3 100644
> > --- a/drivers/dax/pmem.c
> > +++ b/drivers/dax/pmem.c
> > @@ -13,7 +13,6 @@ static struct dev_dax *__dax_pmem_probe(struct device *dev)
> > int rc, id, region_id;
> > resource_size_t offset;
> > struct nd_pfn_sb *pfn_sb;
> > - struct dev_dax *dev_dax;
> > struct dev_dax_data data;
> > struct nd_namespace_io *nsio;
> > struct dax_region *dax_region;
> > @@ -65,12 +64,8 @@ static struct dev_dax *__dax_pmem_probe(struct device *dev)
> > .pgmap = &pgmap,
> > .size = range_len(&range),
> > };
> > - dev_dax = devm_create_dev_dax(&data);
> >
> > - /* child dev_dax instances now own the lifetime of the dax_region */
> > - dax_region_put(dax_region);
> > -
> > - return dev_dax;
> > + return devm_create_dev_dax(&data);
>
> Not related to the patch, but why we do not need to check the returned
> value of devm_create_dev_dax as above?
__dax_pmem_probe() returns struct dev_dax * so we just pass the result on.
> Or do we really need the check as
> the function already returns ERR_PTR if failed?
Yea the caller of __dax_pmem_probe() needs to handle it.
Ira
^ permalink raw reply [flat|nested] 18+ messages in thread
* Re: [PATCH 1/4] dax: Fix dax_mapping_release() use after free
2023-06-03 6:13 ` [PATCH 1/4] dax: Fix dax_mapping_release() use after free Dan Williams
2023-06-04 2:40 ` Ira Weiny
2023-06-05 20:45 ` Fan Ni
@ 2023-06-15 17:33 ` Dave Jiang
2 siblings, 0 replies; 18+ messages in thread
From: Dave Jiang @ 2023-06-15 17:33 UTC (permalink / raw)
To: Dan Williams, nvdimm; +Cc: linux-cxl
On 6/2/23 23:13, Dan Williams wrote:
> A CONFIG_DEBUG_KOBJECT_RELEASE test of removing a device-dax region
> provider (like modprobe -r dax_hmem) yields:
>
> kobject: 'mapping0' (ffff93eb460e8800): kobject_release, parent 0000000000000000 (delayed 2000)
> [..]
> DEBUG_LOCKS_WARN_ON(1)
> WARNING: CPU: 23 PID: 282 at kernel/locking/lockdep.c:232 __lock_acquire+0x9fc/0x2260
> [..]
> RIP: 0010:__lock_acquire+0x9fc/0x2260
> [..]
> Call Trace:
> <TASK>
> [..]
> lock_acquire+0xd4/0x2c0
> ? ida_free+0x62/0x130
> _raw_spin_lock_irqsave+0x47/0x70
> ? ida_free+0x62/0x130
> ida_free+0x62/0x130
> dax_mapping_release+0x1f/0x30
> device_release+0x36/0x90
> kobject_delayed_cleanup+0x46/0x150
>
> Due to attempting ida_free() on an ida object that has already been
> freed. Devices typically only hold a reference on their parent while
> registered. If a child needs a parent object to complete its release it
> needs to hold a reference that it drops from its release callback.
> Arrange for a dax_mapping to pin its parent dev_dax instance until
> dax_mapping_release().
>
> Fixes: 0b07ce872a9e ("device-dax: introduce 'mapping' devices")
> Signed-off-by: Dan Williams <dan.j.williams@intel.com>
Reviewed-by: Dave Jiang <dave.jiang@intel.com>
> ---
> drivers/dax/bus.c | 5 ++++-
> 1 file changed, 4 insertions(+), 1 deletion(-)
>
> diff --git a/drivers/dax/bus.c b/drivers/dax/bus.c
> index 227800053309..aee695f86b44 100644
> --- a/drivers/dax/bus.c
> +++ b/drivers/dax/bus.c
> @@ -635,10 +635,12 @@ EXPORT_SYMBOL_GPL(alloc_dax_region);
> static void dax_mapping_release(struct device *dev)
> {
> struct dax_mapping *mapping = to_dax_mapping(dev);
> - struct dev_dax *dev_dax = to_dev_dax(dev->parent);
> + struct device *parent = dev->parent;
> + struct dev_dax *dev_dax = to_dev_dax(parent);
>
> ida_free(&dev_dax->ida, mapping->id);
> kfree(mapping);
> + put_device(parent);
> }
>
> static void unregister_dax_mapping(void *data)
> @@ -778,6 +780,7 @@ static int devm_register_dax_mapping(struct dev_dax *dev_dax, int range_id)
> dev = &mapping->dev;
> device_initialize(dev);
> dev->parent = &dev_dax->dev;
> + get_device(dev->parent);
> dev->type = &dax_mapping_type;
> dev_set_name(dev, "mapping%d", mapping->id);
> rc = device_add(dev);
>
^ permalink raw reply [flat|nested] 18+ messages in thread
* Re: [PATCH 2/4] dax: Use device_unregister() in unregister_dax_mapping()
2023-06-03 6:13 ` [PATCH 2/4] dax: Use device_unregister() in unregister_dax_mapping() Dan Williams
2023-06-04 2:41 ` Ira Weiny
2023-06-05 20:46 ` Fan Ni
@ 2023-06-15 17:34 ` Dave Jiang
2 siblings, 0 replies; 18+ messages in thread
From: Dave Jiang @ 2023-06-15 17:34 UTC (permalink / raw)
To: Dan Williams, nvdimm; +Cc: linux-cxl
On 6/2/23 23:13, Dan Williams wrote:
> Replace an open-coded device_unregister() sequence with the helper.
>
> Signed-off-by: Dan Williams <dan.j.williams@intel.com>
Reviewed-by: Dave Jiang <dave.jiang@intel.com>
> ---
> drivers/dax/bus.c | 3 +--
> 1 file changed, 1 insertion(+), 2 deletions(-)
>
> diff --git a/drivers/dax/bus.c b/drivers/dax/bus.c
> index aee695f86b44..c99ea08aafc3 100644
> --- a/drivers/dax/bus.c
> +++ b/drivers/dax/bus.c
> @@ -657,8 +657,7 @@ static void unregister_dax_mapping(void *data)
> dev_dax->ranges[mapping->range_id].mapping = NULL;
> mapping->range_id = -1;
>
> - device_del(dev);
> - put_device(dev);
> + device_unregister(dev);
> }
>
> static struct dev_dax_range *get_dax_range(struct device *dev)
>
>
^ permalink raw reply [flat|nested] 18+ messages in thread
* Re: [PATCH 4/4] dax: Cleanup extra dax_region references
2023-06-03 6:14 ` [PATCH 4/4] dax: Cleanup extra dax_region references Dan Williams
2023-06-04 2:58 ` Ira Weiny
2023-06-06 17:46 ` Fan Ni
@ 2023-06-15 17:45 ` Dave Jiang
2 siblings, 0 replies; 18+ messages in thread
From: Dave Jiang @ 2023-06-15 17:45 UTC (permalink / raw)
To: Dan Williams, nvdimm; +Cc: Ira Weiny, linux-cxl
On 6/2/23 23:14, Dan Williams wrote:
> Now that free_dev_dax_id() internally manages the references it needs
> the extra references taken by the dax_region drivers are not needed.
>
> Reported-by: Ira Weiny <ira.weiny@intel.com>
> Signed-off-by: Dan Williams <dan.j.williams@intel.com>
Reviewed-by: Dave Jiang <dave.jiang@intel.com>
> ---
> drivers/dax/bus.c | 4 +---
> drivers/dax/bus.h | 1 -
> drivers/dax/cxl.c | 8 +-------
> drivers/dax/hmem/hmem.c | 8 +-------
> drivers/dax/pmem.c | 7 +------
> 5 files changed, 4 insertions(+), 24 deletions(-)
>
> diff --git a/drivers/dax/bus.c b/drivers/dax/bus.c
> index a4cc3eca774f..0ee96e6fc426 100644
> --- a/drivers/dax/bus.c
> +++ b/drivers/dax/bus.c
> @@ -454,11 +454,10 @@ static void dax_region_free(struct kref *kref)
> kfree(dax_region);
> }
>
> -void dax_region_put(struct dax_region *dax_region)
> +static void dax_region_put(struct dax_region *dax_region)
> {
> kref_put(&dax_region->kref, dax_region_free);
> }
> -EXPORT_SYMBOL_GPL(dax_region_put);
>
> /* a return value >= 0 indicates this invocation invalidated the id */
> static int __free_dev_dax_id(struct dev_dax *dev_dax)
> @@ -641,7 +640,6 @@ struct dax_region *alloc_dax_region(struct device *parent, int region_id,
> return NULL;
> }
>
> - kref_get(&dax_region->kref);
> if (devm_add_action_or_reset(parent, dax_region_unregister, dax_region))
> return NULL;
> return dax_region;
> diff --git a/drivers/dax/bus.h b/drivers/dax/bus.h
> index 8cd79ab34292..bdbf719df5c5 100644
> --- a/drivers/dax/bus.h
> +++ b/drivers/dax/bus.h
> @@ -9,7 +9,6 @@ struct dev_dax;
> struct resource;
> struct dax_device;
> struct dax_region;
> -void dax_region_put(struct dax_region *dax_region);
>
> /* dax bus specific ioresource flags */
> #define IORESOURCE_DAX_STATIC BIT(0)
> diff --git a/drivers/dax/cxl.c b/drivers/dax/cxl.c
> index ccdf8de85bd5..8bc9d04034d6 100644
> --- a/drivers/dax/cxl.c
> +++ b/drivers/dax/cxl.c
> @@ -13,7 +13,6 @@ static int cxl_dax_region_probe(struct device *dev)
> struct cxl_region *cxlr = cxlr_dax->cxlr;
> struct dax_region *dax_region;
> struct dev_dax_data data;
> - struct dev_dax *dev_dax;
>
> if (nid == NUMA_NO_NODE)
> nid = memory_add_physaddr_to_nid(cxlr_dax->hpa_range.start);
> @@ -28,13 +27,8 @@ static int cxl_dax_region_probe(struct device *dev)
> .id = -1,
> .size = range_len(&cxlr_dax->hpa_range),
> };
> - dev_dax = devm_create_dev_dax(&data);
> - if (IS_ERR(dev_dax))
> - return PTR_ERR(dev_dax);
>
> - /* child dev_dax instances now own the lifetime of the dax_region */
> - dax_region_put(dax_region);
> - return 0;
> + return PTR_ERR_OR_ZERO(devm_create_dev_dax(&data));
> }
>
> static struct cxl_driver cxl_dax_region_driver = {
> diff --git a/drivers/dax/hmem/hmem.c b/drivers/dax/hmem/hmem.c
> index e5fe8b39fb94..5d2ddef0f8f5 100644
> --- a/drivers/dax/hmem/hmem.c
> +++ b/drivers/dax/hmem/hmem.c
> @@ -16,7 +16,6 @@ static int dax_hmem_probe(struct platform_device *pdev)
> struct dax_region *dax_region;
> struct memregion_info *mri;
> struct dev_dax_data data;
> - struct dev_dax *dev_dax;
>
> /*
> * @region_idle == true indicates that an administrative agent
> @@ -38,13 +37,8 @@ static int dax_hmem_probe(struct platform_device *pdev)
> .id = -1,
> .size = region_idle ? 0 : range_len(&mri->range),
> };
> - dev_dax = devm_create_dev_dax(&data);
> - if (IS_ERR(dev_dax))
> - return PTR_ERR(dev_dax);
>
> - /* child dev_dax instances now own the lifetime of the dax_region */
> - dax_region_put(dax_region);
> - return 0;
> + return PTR_ERR_OR_ZERO(devm_create_dev_dax(&data));
> }
>
> static struct platform_driver dax_hmem_driver = {
> diff --git a/drivers/dax/pmem.c b/drivers/dax/pmem.c
> index f050ea78bb83..ae0cb113a5d3 100644
> --- a/drivers/dax/pmem.c
> +++ b/drivers/dax/pmem.c
> @@ -13,7 +13,6 @@ static struct dev_dax *__dax_pmem_probe(struct device *dev)
> int rc, id, region_id;
> resource_size_t offset;
> struct nd_pfn_sb *pfn_sb;
> - struct dev_dax *dev_dax;
> struct dev_dax_data data;
> struct nd_namespace_io *nsio;
> struct dax_region *dax_region;
> @@ -65,12 +64,8 @@ static struct dev_dax *__dax_pmem_probe(struct device *dev)
> .pgmap = &pgmap,
> .size = range_len(&range),
> };
> - dev_dax = devm_create_dev_dax(&data);
>
> - /* child dev_dax instances now own the lifetime of the dax_region */
> - dax_region_put(dax_region);
> -
> - return dev_dax;
> + return devm_create_dev_dax(&data);
> }
>
> static int dax_pmem_probe(struct device *dev)
>
>
^ permalink raw reply [flat|nested] 18+ messages in thread
* Re: [PATCH 3/4] dax: Introduce alloc_dev_dax_id()
2023-06-04 2:57 ` Ira Weiny
@ 2023-06-16 1:22 ` Dan Williams
2023-06-16 22:11 ` Ira Weiny
0 siblings, 1 reply; 18+ messages in thread
From: Dan Williams @ 2023-06-16 1:22 UTC (permalink / raw)
To: Ira Weiny, Dan Williams, nvdimm
Cc: Yongqiang Liu, Paul Cassella, Ira Weiny, linux-cxl
Ira Weiny wrote:
> Dan Williams wrote:
> > The reference counting of dax_region objects is needlessly complicated,
> > has lead to confusion [1], and has hidden a bug [2]. Towards cleaning up
> > that mess introduce alloc_dev_dax_id() to minimize the holding of a
> > dax_region reference to only what dev_dax_release() needs, the
> > dax_region->ida.
> >
> > Part of the reason for the mess was the design to dereference a
> > dax_region in all cases in free_dev_dax_id() even if the id was
> > statically assigned by the upper level dax_region driver. Remove the
> > need to call "is_static(dax_region)" by tracking whether the id is
> > dynamic directly in the dev_dax instance itself.
> >
> > With that flag the dax_region pinning and release per dev_dax instance
> > can move to alloc_dev_dax_id() and free_dev_dax_id() respectively.
> >
> > A follow-on cleanup address the unnecessary references in the dax_region
> > setup and drivers.
> >
> > Fixes: 0f3da14a4f05 ("device-dax: introduce 'seed' devices")
> > Link: http://lore.kernel.org/r/20221203095858.612027-1-liuyongqiang13@huawei.com [1]
> > Link: http://lore.kernel.org/r/3cf0890b-4eb0-e70e-cd9c-2ecc3d496263@hpe.com [2]
> > Reported-by: Yongqiang Liu <liuyongqiang13@huawei.com>
> > Reported-by: Paul Cassella <cassella@hpe.com>
> > Reported-by: Ira Weiny <ira.weiny@intel.com>
> > Signed-off-by: Dan Williams <dan.j.williams@intel.com>
[..]
> > @@ -1326,6 +1340,7 @@ struct dev_dax *devm_create_dev_dax(struct dev_dax_data *data)
> > if (!dev_dax)
> > return ERR_PTR(-ENOMEM);
> >
> > + dev_dax->region = dax_region;
>
> Overall I like that this reference is not needed to be carried and/or
> managed by the callers.
>
> However, here you are referencing the dax_region from the dev_dax in an
> unrelated place to where the reference matters (in id management).
>
> Could alloc_dev_dax_id() change to:
>
> static int alloc_dev_dax_id(struct dev_dax *dev_dax, struct dax_region *dax_region)
> {
> ...
> }
>
> Then make this assignment next to where the kref is taken so it is clear
> that this is the only user of the reference?
>
> I did not pick up on the fact this reference was only needed to free the
> id at all in reviewing the code and I think this would make it even more
> clear.
I hesitate only for symmetry reasons. I.e. that there are many interfaces in
this file, in addition to free_dev_dax_id(), where @dax_region is
implicitly retrieved from the @dev_dax.
^ permalink raw reply [flat|nested] 18+ messages in thread
* Re: [PATCH 3/4] dax: Introduce alloc_dev_dax_id()
2023-06-16 1:22 ` Dan Williams
@ 2023-06-16 22:11 ` Ira Weiny
0 siblings, 0 replies; 18+ messages in thread
From: Ira Weiny @ 2023-06-16 22:11 UTC (permalink / raw)
To: Dan Williams, Ira Weiny, nvdimm
Cc: Yongqiang Liu, Paul Cassella, Ira Weiny, linux-cxl
Dan Williams wrote:
> Ira Weiny wrote:
> > Dan Williams wrote:
> > > The reference counting of dax_region objects is needlessly complicated,
> > > has lead to confusion [1], and has hidden a bug [2]. Towards cleaning up
> > > that mess introduce alloc_dev_dax_id() to minimize the holding of a
> > > dax_region reference to only what dev_dax_release() needs, the
> > > dax_region->ida.
> > >
> > > Part of the reason for the mess was the design to dereference a
> > > dax_region in all cases in free_dev_dax_id() even if the id was
> > > statically assigned by the upper level dax_region driver. Remove the
> > > need to call "is_static(dax_region)" by tracking whether the id is
> > > dynamic directly in the dev_dax instance itself.
> > >
> > > With that flag the dax_region pinning and release per dev_dax instance
> > > can move to alloc_dev_dax_id() and free_dev_dax_id() respectively.
> > >
> > > A follow-on cleanup address the unnecessary references in the dax_region
> > > setup and drivers.
> > >
> > > Fixes: 0f3da14a4f05 ("device-dax: introduce 'seed' devices")
> > > Link: http://lore.kernel.org/r/20221203095858.612027-1-liuyongqiang13@huawei.com [1]
> > > Link: http://lore.kernel.org/r/3cf0890b-4eb0-e70e-cd9c-2ecc3d496263@hpe.com [2]
> > > Reported-by: Yongqiang Liu <liuyongqiang13@huawei.com>
> > > Reported-by: Paul Cassella <cassella@hpe.com>
> > > Reported-by: Ira Weiny <ira.weiny@intel.com>
> > > Signed-off-by: Dan Williams <dan.j.williams@intel.com>
> [..]
> > > @@ -1326,6 +1340,7 @@ struct dev_dax *devm_create_dev_dax(struct dev_dax_data *data)
> > > if (!dev_dax)
> > > return ERR_PTR(-ENOMEM);
> > >
> > > + dev_dax->region = dax_region;
> >
> > Overall I like that this reference is not needed to be carried and/or
> > managed by the callers.
> >
> > However, here you are referencing the dax_region from the dev_dax in an
> > unrelated place to where the reference matters (in id management).
> >
> > Could alloc_dev_dax_id() change to:
> >
> > static int alloc_dev_dax_id(struct dev_dax *dev_dax, struct dax_region *dax_region)
> > {
> > ...
> > }
> >
> > Then make this assignment next to where the kref is taken so it is clear
> > that this is the only user of the reference?
> >
> > I did not pick up on the fact this reference was only needed to free the
> > id at all in reviewing the code and I think this would make it even more
> > clear.
>
> I hesitate only for symmetry reasons. I.e. that there are many interfaces in
> this file, in addition to free_dev_dax_id(), where @dax_region is
> implicitly retrieved from the @dev_dax.
Ok but the reason we need this extra reference and for the dax_region to
live this long is because the ida within the dax_region. Otherwise the
normal device references would be enough, right?
Regardless, I've convinced myself this is ok.
Reviewed-by: Ira Weiny <ira.weiny@intel.com>
Ira
^ permalink raw reply [flat|nested] 18+ messages in thread
end of thread, other threads:[~2023-06-16 22:12 UTC | newest]
Thread overview: 18+ messages (download: mbox.gz follow: Atom feed
-- links below jump to the message on this page --
2023-06-03 6:13 [PATCH 0/4] dax: Fix use after free and other cleanups Dan Williams
2023-06-03 6:13 ` [PATCH 1/4] dax: Fix dax_mapping_release() use after free Dan Williams
2023-06-04 2:40 ` Ira Weiny
2023-06-05 20:45 ` Fan Ni
2023-06-15 17:33 ` Dave Jiang
2023-06-03 6:13 ` [PATCH 2/4] dax: Use device_unregister() in unregister_dax_mapping() Dan Williams
2023-06-04 2:41 ` Ira Weiny
2023-06-05 20:46 ` Fan Ni
2023-06-15 17:34 ` Dave Jiang
2023-06-03 6:14 ` [PATCH 3/4] dax: Introduce alloc_dev_dax_id() Dan Williams
2023-06-04 2:57 ` Ira Weiny
2023-06-16 1:22 ` Dan Williams
2023-06-16 22:11 ` Ira Weiny
2023-06-03 6:14 ` [PATCH 4/4] dax: Cleanup extra dax_region references Dan Williams
2023-06-04 2:58 ` Ira Weiny
2023-06-06 17:46 ` Fan Ni
2023-06-06 20:42 ` Ira Weiny
2023-06-15 17:45 ` Dave Jiang
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox