From: "Paul E. McKenney" <paulmck@linux.vnet.ibm.com>
To: Andrew Morton <akpm@linux-foundation.org>
Cc: Michal Piotrowski <michal.k.k.piotrowski@gmail.com>,
linux-kernel@vger.kernel.org, Oleg Nesterov <oleg@tv-sign.ru>,
Gautham R Shenoy <ego@in.ibm.com>
Subject: Re: 2.6.21-rc3-mm1
Date: Sat, 10 Mar 2007 07:45:11 -0800 [thread overview]
Message-ID: <20070310154511.GB9645@linux.vnet.ibm.com> (raw)
In-Reply-To: <20070309181851.e96d468a.akpm@linux-foundation.org>
On Fri, Mar 09, 2007 at 06:18:51PM -0800, Andrew Morton wrote:
> > On Thu, 08 Mar 2007 21:50:29 +0100 Michal Piotrowski <michal.k.k.piotrowski@gmail.com> wrote:
> > Andrew Morton napisał(a):
> > > Temporarily at
> > >
> > > http://userweb.kernel.org/~akpm/2.6.21-rc3-mm1/
> > >
> > > Will appear later at
> > >
> > > ftp://ftp.kernel.org/pub/linux/kernel/people/akpm/patches/2.6/2.6.21-rc3/2.6.21-rc3-mm1/
> > >
> >
> > cpu_hotplug (AutoTest) hangs at this
> >
> > =============================================
> > [ INFO: possible recursive locking detected ]
> > 2.6.21-rc3-mm1 #2
> > ---------------------------------------------
> > sh/7213 is trying to acquire lock:
> > (sched_hotcpu_mutex){--..}, at: [<c033883a>] mutex_lock+0x1c/0x1f
> >
> > but task is already holding lock:
> > (sched_hotcpu_mutex){--..}, at: [<c033883a>] mutex_lock+0x1c/0x1f
> >
> > other info that might help us debug this:
> > 4 locks held by sh/7213:
> > #0: (cpu_add_remove_lock){--..}, at: [<c033883a>] mutex_lock+0x1c/0x1f
> > #1: (sched_hotcpu_mutex){--..}, at: [<c033883a>] mutex_lock+0x1c/0x1f
> > #2: (cache_chain_mutex){--..}, at: [<c033883a>] mutex_lock+0x1c/0x1f
> > #3: (workqueue_mutex){--..}, at: [<c033883a>] mutex_lock+0x1c/0x1f
>
> That's pretty useless, isn't it? We need to know the mutex_lock() caller
> here.
>
> > stack backtrace
> > [<c0105256>] show_trace_log_lvl+0x1a/0x2f
> > [<c010597b>] show_trace+0x12/0x14
> > [<c0105a3d>] dump_stack+0x16/0x18
> > [<c013fc73>] __lock_acquire+0x1aa/0xceb
> > [<c014082d>] lock_acquire+0x79/0x93
> > [<c03385dc>] __mutex_lock_slowpath+0x107/0x349
> > [<c033883a>] mutex_lock+0x1c/0x1f
> > [<c011d924>] sched_getaffinity+0x14/0x91
> > [<c015796d>] __synchronize_sched+0x11/0x5f
> > [<c011d257>] detach_destroy_domains+0x2c/0x30
> > [<c011fc1a>] update_sched_domains+0x27/0x3a
> > [<c012fe7a>] notifier_call_chain+0x2b/0x4a
> > [<c012fec6>] __raw_notifier_call_chain+0x19/0x1e
> > [<c0145756>] _cpu_down+0x70/0x282
> > [<c014598e>] cpu_down+0x26/0x38
> > [<c0272714>] store_online+0x27/0x5a
> > [<c026f610>] sysdev_store+0x20/0x25
> > [<c01b7a8e>] sysfs_write_file+0xc1/0xe9
> > [<c0180052>] vfs_write+0xd1/0x15a
> > [<c0180682>] sys_write+0x3d/0x72
> > [<c0104270>] syscall_call+0x7/0xb
> >
> > l *0xc033883a
> > 0xc033883a is in mutex_lock (/mnt/md0/devel/linux-mm/kernel/mutex.c:92).
> > 87 /*
> > 88 * The locking fastpath is the 1->0 transition from
> > 89 * 'unlocked' into 'locked' state.
> > 90 */
> > 91 __mutex_fastpath_lock(&lock->count, __mutex_lock_slowpath);
> > 92 }
> > 93
> > 94 EXPORT_SYMBOL(mutex_lock);
> > 95
> > 96 static void fastcall noinline __sched
> >
> > I didn't test other -mm's with this test.
> >
> > http://www.stardust.webpages.pl/files/tbf/bitis-gabonica/2.6.21-rc3-mm1/console.log
> > http://www.stardust.webpages.pl/files/tbf/bitis-gabonica/2.6.21-rc3-mm1/mm-config
>
> I can't immediately spot the bug. Probably it's caused by rcu-preempt's
> changes to synchronize_sched(): that function now does a heap more than it
> used to, including taking sched_hotcpu_muex.
>
> So, what to do about this. Paul, I'm thinking that I should drop
> rcu-preempt for now - I don't think we ended up being able to identify any
> particular benefit which it brings to current mainline, and I suspect that
> things will become simpler if/when we start using the process freezer for
> CPU hotplug.
It certainly makes sense for Michal to try backing out rcu-preempt using
your broken-out list of patches. If that makes the problem go away,
then I would certainly have a hard time arguing with you. We are working
on getting measurements showing benefit of rcu-preempt, but aren't there
yet.
Thanx, Paul
next prev parent reply other threads:[~2007-03-10 16:08 UTC|newest]
Thread overview: 44+ messages / expand[flat|nested] mbox.gz Atom feed top
2007-03-08 4:18 2.6.21-rc3-mm1 Andrew Morton
2007-03-08 9:59 ` ext4, wireless (was Re: 2.6.21-rc3-mm1) Jeff Garzik
2007-03-08 14:50 ` 2.6.21-rc3-mm1 John W. Linville
2007-03-08 16:37 ` 2.6.21-rc3-mm1 Dave Jones
2007-03-08 17:56 ` 2.6.21-rc3-mm1 Valdis.Kletnieks
2007-03-08 18:34 ` 2.6.21-rc3-mm1 John W. Linville
2007-03-08 20:27 ` 2.6.21-rc3-mm1 Valdis.Kletnieks
2007-03-08 20:50 ` 2.6.21-rc3-mm1 Michal Piotrowski
2007-03-10 2:18 ` 2.6.21-rc3-mm1 Andrew Morton
2007-03-10 15:45 ` Paul E. McKenney [this message]
[not found] ` <6bffcb0e0703111002t75a8ac51vcce7d52684e04a9d@mail.gmail.com>
2007-03-12 4:41 ` 2.6.21-rc3-mm1 Paul E. McKenney
2007-03-09 11:40 ` 2.6.21-rc3-mm1 Frederik Deweerdt
2007-03-15 9:22 ` ipw2200: can't load firmware (was Re: 2.6.21-rc3-mm1) Frederik Deweerdt
2007-03-10 8:33 ` 2.6.21-rc3-mm1 Mariusz Kozlowski
2007-03-10 8:48 ` 2.6.21-rc3-mm1 Mariusz Kozlowski
2007-03-10 8:58 ` 2.6.21-rc3-mm1 Andrew Morton
2007-03-10 9:18 ` 2.6.21-rc3-mm1 Greg KH
2007-03-10 12:43 ` 2.6.21-rc3-mm1 Jiri Kosina
2007-03-10 15:36 ` 2.6.21-rc3-mm1 Mariusz Kozlowski
2007-03-10 16:00 ` [linux-usb-devel] 2.6.21-rc3-mm1 Alan Stern
2007-03-10 16:36 ` Mariusz Kozlowski
2007-03-10 19:02 ` Alan Stern
2007-03-12 20:50 ` Mariusz Kozlowski
2007-03-12 20:53 ` Jiri Kosina
2007-03-10 13:32 ` 2.6.21-rc3-mm1 Mariusz Kozlowski
2007-03-10 13:32 ` 2.6.21-rc3-mm1 Mariusz Kozlowski
2007-03-12 18:14 ` 2.6.21-rc3-mm1 Radoslaw Szkodzinski
2007-03-14 19:06 ` 2.6.21-rc3-mm1 Mariusz Kozlowski
2007-03-15 1:07 ` 2.6.21-rc3-mm1 Andrew Morton
2007-03-15 6:09 ` 2.6.21-rc3-mm1 Mariusz Kozlowski
2007-03-15 10:16 ` 2.6.21-rc3-mm1 Mel Gorman
2007-03-15 15:37 ` 2.6.21-rc3-mm1 Mariusz Kozlowski
2007-03-15 19:59 ` 2.6.21-rc3-mm1 Mel Gorman
2007-03-16 6:43 ` 2.6.21-rc3-mm1 Mariusz Kozlowski
2007-03-16 10:03 ` 2.6.21-rc3-mm1 Mel Gorman
2007-03-17 18:26 ` [PATCH] Bias the location of pages freed for min_free_kbytes in the same MAX_ORDER_NR_PAGES blocks Mel Gorman
2007-03-18 8:22 ` Andrew Morton
2007-03-18 11:35 ` Mel Gorman
2007-03-18 18:12 ` Andrew Morton
2007-03-18 19:05 ` Mel Gorman
2007-03-18 19:28 ` Andrew Morton
2007-03-18 20:08 ` Mel Gorman
2007-03-18 20:45 ` Andrew Morton
2007-03-18 22:21 ` Mel Gorman
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=20070310154511.GB9645@linux.vnet.ibm.com \
--to=paulmck@linux.vnet.ibm.com \
--cc=akpm@linux-foundation.org \
--cc=ego@in.ibm.com \
--cc=linux-kernel@vger.kernel.org \
--cc=michal.k.k.piotrowski@gmail.com \
--cc=oleg@tv-sign.ru \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.