linux-hotplug.vger.kernel.org archive mirror
From: Nishanth Aravamudan <nacc@us.ibm.com>
To: linux-hotplug@vger.kernel.org
Subject: Re: udev hangs under high loads
Date: Mon, 25 Oct 2004 20:55:43 +0000	[thread overview]
Message-ID: <20041025205543.GA1800@us.ibm.com> (raw)
In-Reply-To: <20041023054119.GA11915@kroah.com>

On Mon, Oct 25, 2004 at 07:53:45PM +0200, Kay Sievers wrote:
> On Mon, Oct 25, 2004 at 10:27:18AM -0700, Nishanth Aravamudan wrote:
> > On Mon, Oct 25, 2004 at 04:51:14AM +0200, Kay Sievers wrote:
> > > On Sun, Oct 24, 2004 at 07:08:41AM +0200, Kay Sievers wrote:
> > > > On Sat, Oct 23, 2004 at 08:31:04AM +0200, Kay Sievers wrote:
> > > > > On Fri, Oct 22, 2004 at 10:41:19PM -0700, Greg KH wrote:
> > > > > > One of my coworkers (hi Nish) is trying to get 10000 disk support tested
> > > > > > and working properly with linux and ran into a nasty udev issue
> > > > > > yesterday.  It seems that under a very high load, with a whole bunch of
> > > > > > hotplug events happening (for when the disks get added to the system),
> > > > > > udev hangs.
> > > > > > 
> > > > > > It hangs in the "grabbing the database lock" portion of the code (don't
> > > > > > have actual logs of where it hangs, will try to get that next week.)
> > > > > > But the interesting thing is we almost handle everything properly.  
> > > > > > udev creates the node, tries to write to the database.  Then the timer
> > > > > > expires and we report this.  After the timer expires, udev is done for.
> > > > > > It just sits and spins, doing a nanosleep constantly.  Having 500 of those
> > > > > > instances all running at once, all at a nice level of -10, is a sure way
> > > > > > to bring a box (even a relatively big one) down hard.
> > > > > > 
> > > > > > So, while I'll agree finding the root tdb locking bug is a good idea, I
> > > > > > wanted to point out that perhaps we should just exit udev if our timeout
> > > > > > expires, instead of still waiting around.  Or do you have a better
> > > > > > solution?
> > > > > 
> > > > > Maybe the time the udev process holds the db lock is too long for that
> > > > > setup, and serializing the concurrent locks takes longer than the
> > > > > timeout of the process?
> > > > > 
> > > > > Here is a patch that tries to limit the access time to the db.
> > > > > The current udev opens the db, reads all the rules, writes to the
> > > > > db, executes the scripts and then closes the db. With this patch we open
> > > > > the db after the rules and close it directly after writing to it.
> > > > > 
> > > > > A rate limit in udevd may help here too, to keep that under control.
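
The lock-narrowing idea in the quoted paragraph (open the db only around the
actual write, not across rule parsing and script execution) can be sketched in
shell with flock(1). This is purely an illustration, not udev's actual code;
all paths and the record text are made up:

```shell
# Hypothetical sketch: hold the db lock only for the write itself,
# not across rule parsing or script execution (per the patch above).
lock=/tmp/udevdb.lock            # made-up lock file for illustration
db=/tmp/udevdb-records.txt       # made-up db file for illustration

# ... parse rules and create the node here, with NO lock held ...

(
  flock -x 9                     # take the lock only now
  echo "N:hdc" >> "$db"          # the one serialized operation: the db write
) 9>"$lock"                      # subshell exit releases the lock immediately

# ... run the scripts here, again with NO lock held ...
```

The point is that the critical section shrinks to a single append, so even
many concurrent events spend almost no time waiting on each other.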
> > > > > 
> > > > > > Try testing this out on your own using the scsi_debug module and adding
> > > > > > a few hundred disks.  It also helps if you have scsi generic support
> > > > > > enabled, as that creates 2 udev events for every disk created.
> > > > > 
> > > > > I expect that this is completely different on an SMP machine, but I will
> > > > > try again. I once tried it with 200 disks and it worked well.
> > > > 
> > > > Here is another idea. I've ripped out the tdb code completely. We are
> > > > maintaining a directory of files at /dev/.udevdb/* now. Every node will
> > > > have a corresponding file (the slashes in DEVPATH are replaced by another
> > > > char), that carries all necessary data.  The files are human readable:
> > > > 
> > > >   [root@pim default]# ls -l /dev/.udevdb/*hda*
> > > >   -rw-r--r--  1 root root 26 Oct 24 06:32 /dev/.udevdb/block@hda
> > > >   -rw-r--r--  1 root root 32 Oct 24 06:32 /dev/.udevdb/block@hda@hda1
> > > >   -rw-r--r--  1 root root 32 Oct 24 06:32 /dev/.udevdb/block@hda@hda2
> > > >   -rw-r--r--  1 root root 32 Oct 24 06:32 /dev/.udevdb/block@hda@hda3
> > > >   -rw-r--r--  1 root root 54 Oct 24 06:32 /dev/.udevdb/block@hdc
> > > > 
> > > >   [root@pim default]# cat /dev/.udevdb/block@hdc
> > > >   P:/block/hdc
> > > >   N:hdc
> > > >   S:cdrom dvd cdwriter dvdwriter
> > > >   A:0
> > > > 
> > > > This way we have _no_ locking at all in userspace, every event operation
> > > > is completely independent from other events. No need to synchronize anything,
> > > > which is expected to be much faster and more reliable on your 10,000-disk
> > > > system. It may also work over NFS.
> > > > 
> > > > The patch is complete, but not well tested. I just stopped working on
> > > > it after the regression test was successful and my box was able to
> > > > reboot and remove the symlinks grabbed from the files :)
> > > 
> > > New patch with a fix for a bug introduced in udevinfo.
> > 
> > I am going to rerun my test with this patch applied. I will respond soon
> > with the results.
> 
> Nice. It was just a bug with the -r option in udevinfo. If you don't use
> this db-query program, nothing has changed.
 
I do not use that program, but in any case the patch seems to have fixed
the issues I was experiencing. I am able to add 1000 disks with no
problem. I am going to try 10000 to make sure it is still successful.
Thanks so much for your help!
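
For reference, the per-device file format shown upthread can be reproduced
with a few shell commands. The directory here is a stand-in for /dev/.udevdb,
and the field meanings are my reading of the quoted `cat` output (the A: field
is copied as-is from the example, without guessing at its meaning):

```shell
# Sketch of the flat-file udevdb layout described above; each device
# gets its own small file, so no cross-event locking is needed.
db=/tmp/udevdb-demo              # stand-in for /dev/.udevdb
mkdir -p "$db"

devpath=/block/hdc
# slashes in DEVPATH become '@' to form a flat file name: block@hdc
file="$db/$(printf '%s' "${devpath#/}" | tr / @)"

# one field per line, matching the quoted example
# (P = devpath, N = node name, S = symlinks; A copied verbatim from above)
printf 'P:%s\nN:%s\nS:%s\nA:%s\n' \
    "$devpath" hdc "cdrom dvd cdwriter dvdwriter" 0 > "$file"

# removing a device is just unlinking its file -- no shared lock involved
cat "$file"
```

Because every event touches only its own file, creation and removal never
contend with each other, which is the "no locking at all" property claimed
upthread.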

> Are you able to hotplug-remove the disks from the system? It would be nice
> to know whether the nodes and symlinks are all correctly removed when you
> use custom names, given by a rule, for your thousands-of-disks setup.

The way scsi_debug is designed, I can remove devices by echoing a
negative number into /sys/bus/pseudo/drivers/scsi_debug/add_host. Doing
so seems to correctly remove the device nodes and symlinks. I am only
now learning to use udev, so I will try the rule you mentioned and
report those results once the 10000 disk test is finished.
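
The add/remove dance described above looks like this in shell. The host count
is an arbitrary example, and the block is guarded so it is a no-op when the
scsi_debug module is not loaded:

```shell
# Add and then remove simulated SCSI hosts via scsi_debug's sysfs knob.
# A negative count removes hosts (and their fake disks), as noted above.
ctl=/sys/bus/pseudo/drivers/scsi_debug/add_host
n=4                              # arbitrary number of hosts for this example

if [ -w "$ctl" ]; then
    echo  "$n" > "$ctl"          # hotplug-add n hosts' worth of disks
    echo "-$n" > "$ctl"          # a negative value hot-removes them again
else
    echo "scsi_debug not loaded; would write $n then -$n to $ctl"
fi
```

Each write generates the corresponding hotplug events, so this is a cheap way
to stress udev's event handling without real hardware.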

Thanks again!

-Nish


_______________________________________________
Linux-hotplug-devel mailing list  http://linux-hotplug.sourceforge.net
Linux-hotplug-devel@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/linux-hotplug-devel


Thread overview: 17+ messages
2004-10-23  5:41 udev hangs under high loads Greg KH
2004-10-23  6:31 ` Kay Sievers
2004-10-24  5:08 ` Kay Sievers
2004-10-24  5:53 ` Olaf Hering
2004-10-24 15:53 ` Kay Sievers
2004-10-24 15:57 ` Olaf Hering
2004-10-24 19:27 ` Kevin P. Fleming
2004-10-24 19:46 ` Kay Sievers
2004-10-24 19:51 ` Marco d'Itri
2004-10-24 19:56 ` Kay Sievers
2004-10-25  2:51 ` Kay Sievers
2004-10-25  7:01 ` Greg KH
2004-10-25 17:07 ` Kay Sievers
2004-10-25 17:12 ` Kevin P. Fleming
2004-10-25 17:27 ` Nishanth Aravamudan
2004-10-25 17:53 ` Kay Sievers
2004-10-25 20:55 ` Nishanth Aravamudan [this message]
