From: Alexander Duyck <alexander.h.duyck@linux.intel.com>
To: Luis Chamberlain <mcgrof@kernel.org>
Cc: len.brown@intel.com, Dmitry Torokhov <dmitry.torokhov@gmail.com>,
bvanassche@acm.org, linux-pm@vger.kernel.org,
gregkh@linuxfoundation.org, linux-nvdimm@lists.01.org,
jiangshanlai@gmail.com, linux-kernel@vger.kernel.org,
brendanhiggins@google.com, pavel@ucw.cz, zwisler@kernel.org,
tj@kernel.org, akpm@linux-foundation.org, rafael@kernel.org
Subject: Re: [driver-core PATCH v7 4/9] driver core: Probe devices asynchronously instead of the driver
Date: Mon, 03 Dec 2018 08:44:43 -0800 [thread overview]
Message-ID: <c60069cf68cd0c674f31f7690406bb97dd39c8da.camel@linux.intel.com> (raw)
In-Reply-To: <20181201024847.GH28501@garbanzo.do-not-panic.com>
On Fri, 2018-11-30 at 18:48 -0800, Luis Chamberlain wrote:
> On Wed, Nov 28, 2018 at 04:32:26PM -0800, Alexander Duyck wrote:
> > Probe devices asynchronously instead of the driver.
> > +static void __driver_attach_async_helper(void *_dev, async_cookie_t cookie)
> > +{
> > + struct device *dev = _dev;
> > + struct device_driver *drv;
> > +
> > + __device_driver_lock(dev, dev->parent);
> > +
> > + /*
> > + * If someone attempted to bind a driver either successfully or
> > + * unsuccessfully before we got here we should just skip the driver
> > + * probe call.
> > + */
> > + drv = dev_get_drv_async(dev);
> > + if (drv && !dev->driver)
> > + driver_probe_device(drv, dev);
>
> I believe this should mean drivers which have async work on probe can
> deadlock. For instance, if a driver does call async_schedule() or a
> derivative call does this for it, the kernel will call
> async_synchronize_full() and I believe we deadlock.
>
> Are we sure most subsystems which would use async probe will not have
> an async_schedule() call?
>
> Luis
So the async_schedule call isn't a problem. I would only be an issue if
they are calling async_sychronize_full while we are holding a lock
and/or mutex. To mitigate that I believe many drivers are just using
the domain version of things instead of using the global async calls.
An issue like what you have described would already exist if there is
code like that floating around out there. As is this patch isn't
changing the fact that a driver can load asynchronously. All it is
doing is allowing each device to be handled asynchronously instead of
having just one thread work its way though all the devices one at a
time.
The earlier bug we were addressing in patch 1/9 was something like what
you were describing where we were performing an async_synchronize_full
while holding the device lock. I would think the requirement if you are
going to are going to use async within a driver is to use the domain
specific version instead of just synchronizing entire domains, or if
you must synchronize the entire domain you should not be doing so while
holding any locks and/or mutexs.
One of the reasons why I am using a flag to perform the synchronization
between the device_add and device_del in patch 2 is because technically
any driver can be turned into an asynchronous probing driver by just
adding the kernel parameter <driver>.async_probe. That flag is somewhat
hidden here as dev_get_drv_async was checking for the async_probe flag
in this version of the patch. In the future I plan to replace the
"async_probe" flag with a "dead" flag to indicate that the device is in
the process of doing through a device_del which should accomplish the
same thing.
- Alex
_______________________________________________
Linux-nvdimm mailing list
Linux-nvdimm@lists.01.org
https://lists.01.org/mailman/listinfo/linux-nvdimm
WARNING: multiple messages have this Message-ID (diff)
From: Alexander Duyck <alexander.h.duyck@linux.intel.com>
To: Luis Chamberlain <mcgrof@kernel.org>
Cc: linux-kernel@vger.kernel.org, gregkh@linuxfoundation.org,
linux-nvdimm@lists.01.org, tj@kernel.org,
akpm@linux-foundation.org, linux-pm@vger.kernel.org,
jiangshanlai@gmail.com, rafael@kernel.org, len.brown@intel.com,
pavel@ucw.cz, zwisler@kernel.org, dan.j.williams@intel.com,
dave.jiang@intel.com, bvanassche@acm.org,
Dmitry Torokhov <dmitry.torokhov@gmail.com>,
brendanhiggins@google.com
Subject: Re: [driver-core PATCH v7 4/9] driver core: Probe devices asynchronously instead of the driver
Date: Mon, 03 Dec 2018 08:44:43 -0800 [thread overview]
Message-ID: <c60069cf68cd0c674f31f7690406bb97dd39c8da.camel@linux.intel.com> (raw)
In-Reply-To: <20181201024847.GH28501@garbanzo.do-not-panic.com>
On Fri, 2018-11-30 at 18:48 -0800, Luis Chamberlain wrote:
> On Wed, Nov 28, 2018 at 04:32:26PM -0800, Alexander Duyck wrote:
> > Probe devices asynchronously instead of the driver.
> > +static void __driver_attach_async_helper(void *_dev, async_cookie_t cookie)
> > +{
> > + struct device *dev = _dev;
> > + struct device_driver *drv;
> > +
> > + __device_driver_lock(dev, dev->parent);
> > +
> > + /*
> > + * If someone attempted to bind a driver either successfully or
> > + * unsuccessfully before we got here we should just skip the driver
> > + * probe call.
> > + */
> > + drv = dev_get_drv_async(dev);
> > + if (drv && !dev->driver)
> > + driver_probe_device(drv, dev);
>
> I believe this should mean drivers which have async work on probe can
> deadlock. For instance, if a driver does call async_schedule() or a
> derivative call does this for it, the kernel will call
> async_synchronize_full() and I believe we deadlock.
>
> Are we sure most subsystems which would use async probe will not have
> an async_schedule() call?
>
> Luis
So the async_schedule call isn't a problem. I would only be an issue if
they are calling async_sychronize_full while we are holding a lock
and/or mutex. To mitigate that I believe many drivers are just using
the domain version of things instead of using the global async calls.
An issue like what you have described would already exist if there is
code like that floating around out there. As is this patch isn't
changing the fact that a driver can load asynchronously. All it is
doing is allowing each device to be handled asynchronously instead of
having just one thread work its way though all the devices one at a
time.
The earlier bug we were addressing in patch 1/9 was something like what
you were describing where we were performing an async_synchronize_full
while holding the device lock. I would think the requirement if you are
going to are going to use async within a driver is to use the domain
specific version instead of just synchronizing entire domains, or if
you must synchronize the entire domain you should not be doing so while
holding any locks and/or mutexs.
One of the reasons why I am using a flag to perform the synchronization
between the device_add and device_del in patch 2 is because technically
any driver can be turned into an asynchronous probing driver by just
adding the kernel parameter <driver>.async_probe. That flag is somewhat
hidden here as dev_get_drv_async was checking for the async_probe flag
in this version of the patch. In the future I plan to replace the
"async_probe" flag with a "dead" flag to indicate that the device is in
the process of doing through a device_del which should accomplish the
same thing.
- Alex
next prev parent reply other threads:[~2018-12-03 16:44 UTC|newest]
Thread overview: 53+ messages / expand[flat|nested] mbox.gz Atom feed top
2018-11-29 0:32 [driver-core PATCH v7 0/9] Add NUMA aware async_schedule calls Alexander Duyck
2018-11-29 0:32 ` Alexander Duyck
2018-11-29 0:32 ` Alexander Duyck
2018-11-29 0:32 ` [driver-core PATCH v7 1/9] driver core: Move async_synchronize_full call Alexander Duyck
2018-11-29 0:32 ` Alexander Duyck
2018-11-29 0:32 ` Alexander Duyck
2018-11-30 23:21 ` Luis Chamberlain
2018-11-30 23:21 ` Luis Chamberlain
2018-11-30 23:21 ` Luis Chamberlain
2018-11-29 0:32 ` [driver-core PATCH v7 2/9] driver core: Establish clear order of operations for deferred probe and remove Alexander Duyck
2018-11-29 0:32 ` Alexander Duyck
2018-11-29 1:57 ` Dan Williams
2018-11-29 18:07 ` Alexander Duyck
2018-11-29 18:07 ` Alexander Duyck
2018-11-29 18:07 ` Alexander Duyck
2018-11-29 18:55 ` Dan Williams
2018-11-29 18:55 ` Dan Williams
2018-11-29 18:55 ` Dan Williams
2018-11-29 21:53 ` Alexander Duyck
2018-11-29 21:53 ` Alexander Duyck
2018-11-29 21:53 ` Alexander Duyck
2018-11-29 22:00 ` Dan Williams
2018-11-29 22:00 ` Dan Williams
2018-11-29 22:00 ` Dan Williams
2018-11-30 23:40 ` Luis Chamberlain
2018-11-29 0:32 ` [driver-core PATCH v7 3/9] device core: Consolidate locking and unlocking of parent and device Alexander Duyck
2018-11-29 0:32 ` Alexander Duyck
2018-12-01 0:01 ` Luis Chamberlain
2018-12-01 0:01 ` Luis Chamberlain
2018-12-01 0:01 ` Luis Chamberlain
2018-11-29 0:32 ` [driver-core PATCH v7 4/9] driver core: Probe devices asynchronously instead of the driver Alexander Duyck
2018-11-29 0:32 ` Alexander Duyck
2018-11-29 0:32 ` Alexander Duyck
2018-12-01 2:48 ` Luis Chamberlain
2018-12-01 2:48 ` Luis Chamberlain
2018-12-01 2:48 ` Luis Chamberlain
2018-12-03 16:44 ` Alexander Duyck [this message]
2018-12-03 16:44 ` Alexander Duyck
2018-11-29 0:32 ` [driver-core PATCH v7 5/9] workqueue: Provide queue_work_node to queue work near a given NUMA node Alexander Duyck
2018-11-29 0:32 ` Alexander Duyck
2018-11-29 0:32 ` Alexander Duyck
2018-11-29 0:32 ` [driver-core PATCH v7 6/9] async: Add support for queueing on specific " Alexander Duyck
2018-11-29 0:32 ` Alexander Duyck
2018-11-29 0:32 ` Alexander Duyck
2018-11-29 0:32 ` [driver-core PATCH v7 7/9] driver core: Attach devices on CPU local to device node Alexander Duyck
2018-11-29 0:32 ` Alexander Duyck
2018-11-29 0:32 ` Alexander Duyck
2018-11-29 0:32 ` [driver-core PATCH v7 8/9] PM core: Use new async_schedule_dev command Alexander Duyck
2018-11-29 0:32 ` Alexander Duyck
2018-11-29 0:32 ` Alexander Duyck
2018-11-29 0:32 ` [driver-core PATCH v7 9/9] libnvdimm: Schedule device registration on node local to the device Alexander Duyck
2018-11-29 0:32 ` Alexander Duyck
2018-11-29 0:32 ` Alexander Duyck
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=c60069cf68cd0c674f31f7690406bb97dd39c8da.camel@linux.intel.com \
--to=alexander.h.duyck@linux.intel.com \
--cc=akpm@linux-foundation.org \
--cc=brendanhiggins@google.com \
--cc=bvanassche@acm.org \
--cc=dmitry.torokhov@gmail.com \
--cc=gregkh@linuxfoundation.org \
--cc=jiangshanlai@gmail.com \
--cc=len.brown@intel.com \
--cc=linux-kernel@vger.kernel.org \
--cc=linux-nvdimm@lists.01.org \
--cc=linux-pm@vger.kernel.org \
--cc=mcgrof@kernel.org \
--cc=pavel@ucw.cz \
--cc=rafael@kernel.org \
--cc=tj@kernel.org \
--cc=zwisler@kernel.org \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.