public inbox for linux-kernel@vger.kernel.org
 help / color / mirror / Atom feed
From: Mike Rapoport <rppt@kernel.org>
To: dan.j.williams@intel.com
Cc: "Dave Jiang" <dave.jiang@intel.com>,
	"Ira Weiny" <ira.weiny@intel.com>,
	"Vishal Verma" <vishal.l.verma@intel.com>,
	jane.chu@oracle.com, "Michał Cłapiński" <mclapinski@google.com>,
	"Pasha Tatashin" <pasha.tatashin@soleen.com>,
	"Tyler Hicks" <code@tyhicks.com>,
	linux-kernel@vger.kernel.org, nvdimm@lists.linux.dev
Subject: Re: [PATCH v2 1/1] nvdimm: allow exposing RAM carveouts as NVDIMM DIMM devices
Date: Wed, 22 Oct 2025 17:47:41 +0300	[thread overview]
Message-ID: <aPjujSjgLSWsAtsb@kernel.org> (raw)
In-Reply-To: <68f2da6bd013e_2a201008c@dwillia2-mobl4.notmuch>

On Fri, Oct 17, 2025 at 05:08:11PM -0700, dan.j.williams@intel.com wrote:
> Mike Rapoport wrote:
> > From: "Mike Rapoport (Microsoft)" <rppt@kernel.org>
> > 
> > There are use cases, for example virtual machine hosts, that create
> > "persistent" memory regions using memmap= option on x86 or dummy
> > pmem-region device tree nodes on DT based systems.
> > 
> > Both these options are inflexible because they create static regions and
> > the layout of the "persistent" memory cannot be adjusted without reboot
> > and sometimes they even require firmware update.
> > 
> > Add a ramdax driver that allows creation of DIMM devices on top of
> > E820_TYPE_PRAM regions and devicetree pmem-region nodes.
> > 
> > The DIMMs support label space management on the "device" and provide a
> > flexible way to access RAM using fsdax and devdax.
> > 
> > Signed-off-by: Mike Rapoport (Mircosoft) <rppt@kernel.org>
> > ---
> >  drivers/nvdimm/Kconfig  |  17 +++
> >  drivers/nvdimm/Makefile |   1 +
> >  drivers/nvdimm/ramdax.c | 272 ++++++++++++++++++++++++++++++++++++++++
> >  3 files changed, 290 insertions(+)
> >  create mode 100644 drivers/nvdimm/ramdax.c
> > 
> > diff --git a/drivers/nvdimm/Kconfig b/drivers/nvdimm/Kconfig
> > index fde3e17c836c..9ac96a7cd773 100644
> > --- a/drivers/nvdimm/Kconfig
> > +++ b/drivers/nvdimm/Kconfig
> > @@ -97,6 +97,23 @@ config OF_PMEM
> >  
> >  	  Select Y if unsure.
> >  
> > +config RAMDAX
> > +	tristate "Support persistent memory interfaces on RAM carveouts"
> > +	depends on OF || X86
> 
> I see no compile time dependency for CONFIG_OF. The one call to
> dev_of_node() looks like it still builds in the CONFIG_OF=n case. For
> CONFIG_X86 the situation is different because the kernel needs
> infrastructure to build the device.
> 
> So maybe change the dependency to drop OF and make it:
> 
> 	depends on X86_PMEM_LEGACY if X86

We can't put if in a depends statement :(
My intention with "depends on OF || X86" was that if it's not really
possible to use this driver if it's not X86 or OF because there's nothing
to define a platform device for ramdax to bind.

Maybe what we actually need is

	select X86_PMEM_LEGACY_DEVICE if X86
	default n

so that it could be only explicitly enabled in the configuration and if it
is, it will also enable X86_PMEM_LEGACY_DEVICE on x86.
With default set to no it won't be build "accidentailly", but OTOH cloud
providers can disable X86_PMEM_LEGACY and enable RAMDAX and distros can
build them as modules on x86 and architectures that support OF. 

What do you think?

> > +	select X86_PMEM_LEGACY_DEVICE
> 
> ...and drop this select.
> 
> > +	default LIBNVDIMM
> > +	help
> > +	  Allows creation of DAX devices on RAM carveouts.
> > +
> > +	  Memory ranges that are manually specified by the
> > +	  'memmap=nn[KMG]!ss[KMG]' kernel command line or defined by dummy
> > +	  pmem-region device tree nodes would be managed by this driver as DIMM
> > +	  devices with support for dynamic layout of namespaces.
> > +	  The driver can be bound to e820_pmem or pmem-region platform
> > +	  devices using 'driver_override' device attribute.
> 
> Maybe some notes for details like:
> 
> * 128K stolen at the end of the memmap range
> * supports 509 namespaces (see 'ndctl create-namespace --help')
> * must be force bound via driver_override

Sure.
 
> [..]
> > +static int ramdax_probe(struct platform_device *pdev)
> > +{
> > +	static struct nvdimm_bus_descriptor nd_desc;
> > +	struct device *dev = &pdev->dev;
> > +	struct nvdimm_bus *nvdimm_bus;
> > +	struct device_node *np;
> > +	int rc = -ENXIO;
> > +
> > +	nd_desc.provider_name = "ramdax";
> > +	nd_desc.module = THIS_MODULE;
> > +	nd_desc.ndctl = ramdax_ctl;
> > +	nvdimm_bus = nvdimm_bus_register(dev, &nd_desc);
> > +	if (!nvdimm_bus)
> > +		goto err;
> > +
> > +	np = dev_of_node(&pdev->dev);
> > +	if (np)
> > +		rc = ramdax_probe_of(pdev, nvdimm_bus, np);
> 
> Hmm, I do not see any confirmation that this node is actually a
> "pmem-region". If you attach the kernel to the wrong device I think you
> get fireworks that could be avoided with a manual of_match_node() check
> of the same device_id list as the of_pmem driver.
> 
> That still would not require a "depends on OF" given of_match_node()
> compiles away in the CONFIG_OF=n case.

With how driver_override is implemented it's possible to get fireworks with
any platform device :)

I'll add a manual check for of_match_node() to be on the safer side.

> [..]
> 
> This looks good to me. With the above comments addressed you can add:
> 
> Reviewed-by: Dan Williams <dan.j.williams@intel.com>

-- 
Sincerely yours,
Mike.

  reply	other threads:[~2025-10-22 14:47 UTC|newest]

Thread overview: 6+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2025-10-15  8:00 [PATCH v2 0/1] nvdimm: allow exposing RAM as libnvdimm DIMMs Mike Rapoport
2025-10-15  8:00 ` [PATCH v2 1/1] nvdimm: allow exposing RAM carveouts as NVDIMM DIMM devices Mike Rapoport
2025-10-18  0:08   ` dan.j.williams
2025-10-22 14:47     ` Mike Rapoport [this message]
2025-10-22 23:29       ` dan.j.williams
2025-10-23  7:39         ` Mike Rapoport

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=aPjujSjgLSWsAtsb@kernel.org \
    --to=rppt@kernel.org \
    --cc=code@tyhicks.com \
    --cc=dan.j.williams@intel.com \
    --cc=dave.jiang@intel.com \
    --cc=ira.weiny@intel.com \
    --cc=jane.chu@oracle.com \
    --cc=linux-kernel@vger.kernel.org \
    --cc=mclapinski@google.com \
    --cc=nvdimm@lists.linux.dev \
    --cc=pasha.tatashin@soleen.com \
    --cc=vishal.l.verma@intel.com \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox