* [PATCH] kmemleak: Fix scheduling-while-atomic bug [not found] <200907010300.n6130rRf026194@hera.kernel.org> @ 2009-07-01 7:53 ` Ingo Molnar 2009-07-01 8:10 ` Pekka Enberg 2009-07-01 9:18 ` Catalin Marinas 0 siblings, 2 replies; 21+ messages in thread From: Ingo Molnar @ 2009-07-01 7:53 UTC (permalink / raw) To: Linux Kernel Mailing List, Andrew Morton, Linus Torvalds, Catalin Marinas, Peter Zijlstra Cc: git-commits-head * Linux Kernel Mailing List <linux-kernel@vger.kernel.org> wrote: > Gitweb: http://git.kernel.org/linus/acf4968ec9dea49387ca8b3d36dfaa0850bdb2d5 > Commit: acf4968ec9dea49387ca8b3d36dfaa0850bdb2d5 > Parent: 4698c1f2bbe44ce852ef1a6716973c1f5401a4c4 > Author: Catalin Marinas <catalin.marinas@arm.com> > AuthorDate: Fri Jun 26 17:38:29 2009 +0100 > Committer: Catalin Marinas <catalin.marinas@arm.com> > CommitDate: Fri Jun 26 17:38:29 2009 +0100 > > kmemleak: Slightly change the policy on newly allocated objects I think one of the kmemleak fixes that went upstream yesterday caused the following scheduling-while-holding-the-tasklist-lock regression/crash on x86: BUG: sleeping function called from invalid context at mm/kmemleak.c:795 in_atomic(): 1, irqs_disabled(): 0, pid: 1737, name: kmemleak 2 locks held by kmemleak/1737: #0: (scan_mutex){......}, at: [<c10c4376>] kmemleak_scan_thread+0x45/0x86 #1: (tasklist_lock){......}, at: [<c10c3bb4>] kmemleak_scan+0x1a9/0x39c Pid: 1737, comm: kmemleak Not tainted 2.6.31-rc1-tip #59266 Call Trace: [<c105ac0f>] ? __debug_show_held_locks+0x1e/0x20 [<c102e490>] __might_sleep+0x10a/0x111 [<c10c38d5>] scan_yield+0x17/0x3b [<c10c3970>] scan_block+0x39/0xd4 [<c10c3bc6>] kmemleak_scan+0x1bb/0x39c [<c10c4331>] ? kmemleak_scan_thread+0x0/0x86 [<c10c437b>] kmemleak_scan_thread+0x4a/0x86 [<c104d73e>] kthread+0x6e/0x73 [<c104d6d0>] ? 
kthread+0x0/0x73
 [<c100959f>] kernel_thread_helper+0x7/0x10
kmemleak: 834 new suspected memory leaks (see /sys/kernel/debug/kmemleak)

The bit causing it is highly dubious:

  static void scan_yield(void)
  {
          might_sleep();

          if (time_is_before_eq_jiffies(next_scan_yield)) {
                  schedule();
                  next_scan_yield = jiffies + jiffies_scan_yield;
          }
  }

It is called deep inside the codepath and in a conditional way, and
that is what crapped up when one of the new scan_block() uses grew a
tasklist_lock dependency.

Also, we dont need another 'yield' primitive in the MM code, we have
priorities and other scheduling mechanisms to throttle background
scanning just fine.

The minimal fix below removes scan_yield() and adds a cond_resched()
to the outmost (safe) place of the scanning thread. This solves the
regression.

	Ingo

----------------->
>From 5ba1a8143c502f40b976a0ea1df5e5a10056fcc6 Mon Sep 17 00:00:00 2001
From: Ingo Molnar <mingo@elte.hu>
Date: Wed, 1 Jul 2009 09:43:53 +0200
Subject: [PATCH] kmemleak: Fix scheduling-while-atomic bug

One of the kmemleak changes caused the following
scheduling-while-holding-the-tasklist-lock regression on x86:

BUG: sleeping function called from invalid context at mm/kmemleak.c:795
in_atomic(): 1, irqs_disabled(): 0, pid: 1737, name: kmemleak
2 locks held by kmemleak/1737:
 #0:  (scan_mutex){......}, at: [<c10c4376>] kmemleak_scan_thread+0x45/0x86
 #1:  (tasklist_lock){......}, at: [<c10c3bb4>] kmemleak_scan+0x1a9/0x39c
Pid: 1737, comm: kmemleak Not tainted 2.6.31-rc1-tip #59266
Call Trace:
 [<c105ac0f>] ? __debug_show_held_locks+0x1e/0x20
 [<c102e490>] __might_sleep+0x10a/0x111
 [<c10c38d5>] scan_yield+0x17/0x3b
 [<c10c3970>] scan_block+0x39/0xd4
 [<c10c3bc6>] kmemleak_scan+0x1bb/0x39c
 [<c10c4331>] ? kmemleak_scan_thread+0x0/0x86
 [<c10c437b>] kmemleak_scan_thread+0x4a/0x86
 [<c104d73e>] kthread+0x6e/0x73
 [<c104d6d0>] ?
kthread+0x0/0x73 [<c100959f>] kernel_thread_helper+0x7/0x10 kmemleak: 834 new suspected memory leaks (see /sys/kernel/debug/kmemleak) The bit causing it is highly dubious: static void scan_yield(void) { might_sleep(); if (time_is_before_eq_jiffies(next_scan_yield)) { schedule(); next_scan_yield = jiffies + jiffies_scan_yield; } } It called deep inside the codepath and in a conditional way, and that is what crapped up when one of the new scan_block() uses grew a tasklist_lock dependency. This minimal patch removes that yielding stuff and adds the proper cond_resched(). The background scanning thread could probably also be reniced to +10. Signed-off-by: Ingo Molnar <mingo@elte.hu> --- mm/kmemleak.c | 31 +------------------------------ 1 files changed, 1 insertions(+), 30 deletions(-) diff --git a/mm/kmemleak.c b/mm/kmemleak.c index eeece2d..e766e1d 100644 --- a/mm/kmemleak.c +++ b/mm/kmemleak.c @@ -105,7 +105,6 @@ #define MAX_TRACE 16 /* stack trace length */ #define REPORTS_NR 50 /* maximum number of reported leaks */ #define MSECS_MIN_AGE 5000 /* minimum object age for reporting */ -#define MSECS_SCAN_YIELD 10 /* CPU yielding period */ #define SECS_FIRST_SCAN 60 /* delay before the first scan */ #define SECS_SCAN_WAIT 600 /* subsequent auto scanning delay */ @@ -186,10 +185,7 @@ static atomic_t kmemleak_error = ATOMIC_INIT(0); static unsigned long min_addr = ULONG_MAX; static unsigned long max_addr; -/* used for yielding the CPU to other tasks during scanning */ -static unsigned long next_scan_yield; static struct task_struct *scan_thread; -static unsigned long jiffies_scan_yield; /* used to avoid reporting of recently allocated objects */ static unsigned long jiffies_min_age; static unsigned long jiffies_last_scan; @@ -786,21 +782,6 @@ void kmemleak_no_scan(const void *ptr) EXPORT_SYMBOL(kmemleak_no_scan); /* - * Yield the CPU so that other tasks get a chance to run. 
The yielding is - * rate-limited to avoid excessive number of calls to the schedule() function - * during memory scanning. - */ -static void scan_yield(void) -{ - might_sleep(); - - if (time_is_before_eq_jiffies(next_scan_yield)) { - schedule(); - next_scan_yield = jiffies + jiffies_scan_yield; - } -} - -/* * Memory scanning is a long process and it needs to be interruptable. This * function checks whether such interrupt condition occured. */ @@ -840,15 +821,6 @@ static void scan_block(void *_start, void *_end, if (scan_should_stop()) break; - /* - * When scanning a memory block with a corresponding - * kmemleak_object, the CPU yielding is handled in the calling - * code since it holds the object->lock to avoid the block - * freeing. - */ - if (!scanned) - scan_yield(); - object = find_and_get_object(pointer, 1); if (!object) continue; @@ -1014,7 +986,7 @@ static void kmemleak_scan(void) */ object = list_entry(gray_list.next, typeof(*object), gray_list); while (&object->gray_list != &gray_list) { - scan_yield(); + cond_resched(); /* may add new objects to the list */ if (!scan_should_stop()) @@ -1385,7 +1357,6 @@ void __init kmemleak_init(void) int i; unsigned long flags; - jiffies_scan_yield = msecs_to_jiffies(MSECS_SCAN_YIELD); jiffies_min_age = msecs_to_jiffies(MSECS_MIN_AGE); jiffies_scan_wait = msecs_to_jiffies(SECS_SCAN_WAIT * 1000); ^ permalink raw reply related [flat|nested] 21+ messages in thread
* Re: [PATCH] kmemleak: Fix scheduling-while-atomic bug 2009-07-01 7:53 ` [PATCH] kmemleak: Fix scheduling-while-atomic bug Ingo Molnar @ 2009-07-01 8:10 ` Pekka Enberg 2009-07-01 9:18 ` Catalin Marinas 1 sibling, 0 replies; 21+ messages in thread From: Pekka Enberg @ 2009-07-01 8:10 UTC (permalink / raw) To: Ingo Molnar Cc: Linux Kernel Mailing List, Andrew Morton, Linus Torvalds, Catalin Marinas, Peter Zijlstra, git-commits-head Hi Ingo, On Wed, Jul 1, 2009 at 10:53 AM, Ingo Molnar<mingo@elte.hu> wrote: > From 5ba1a8143c502f40b976a0ea1df5e5a10056fcc6 Mon Sep 17 00:00:00 2001 > From: Ingo Molnar <mingo@elte.hu> > Date: Wed, 1 Jul 2009 09:43:53 +0200 > Subject: [PATCH] kmemleak: Fix scheduling-while-atomic bug > > One of the kmemleak changes caused the following > scheduling-while-holding-the-tasklist-lock regression on x86: > > BUG: sleeping function called from invalid context at mm/kmemleak.c:795 > in_atomic(): 1, irqs_disabled(): 0, pid: 1737, name: kmemleak > 2 locks held by kmemleak/1737: > #0: (scan_mutex){......}, at: [<c10c4376>] kmemleak_scan_thread+0x45/0x86 > #1: (tasklist_lock){......}, at: [<c10c3bb4>] kmemleak_scan+0x1a9/0x39c > Pid: 1737, comm: kmemleak Not tainted 2.6.31-rc1-tip #59266 > Call Trace: > [<c105ac0f>] ? __debug_show_held_locks+0x1e/0x20 > [<c102e490>] __might_sleep+0x10a/0x111 > [<c10c38d5>] scan_yield+0x17/0x3b > [<c10c3970>] scan_block+0x39/0xd4 > [<c10c3bc6>] kmemleak_scan+0x1bb/0x39c > [<c10c4331>] ? kmemleak_scan_thread+0x0/0x86 > [<c10c437b>] kmemleak_scan_thread+0x4a/0x86 > [<c104d73e>] kthread+0x6e/0x73 > [<c104d6d0>] ? 
kthread+0x0/0x73 > [<c100959f>] kernel_thread_helper+0x7/0x10 > kmemleak: 834 new suspected memory leaks (see /sys/kernel/debug/kmemleak) > > The bit causing it is highly dubious: > > static void scan_yield(void) > { > might_sleep(); > > if (time_is_before_eq_jiffies(next_scan_yield)) { > schedule(); > next_scan_yield = jiffies + jiffies_scan_yield; > } > } > > It called deep inside the codepath and in a conditional way, > and that is what crapped up when one of the new scan_block() > uses grew a tasklist_lock dependency. > > This minimal patch removes that yielding stuff and adds the > proper cond_resched(). > > The background scanning thread could probably also be reniced > to +10. > > Signed-off-by: Ingo Molnar <mingo@elte.hu> Looks good to me, thanks! Acked-by: Pekka Enberg <penberg@cs.helsinki.fi> ^ permalink raw reply [flat|nested] 21+ messages in thread
* Re: [PATCH] kmemleak: Fix scheduling-while-atomic bug 2009-07-01 7:53 ` [PATCH] kmemleak: Fix scheduling-while-atomic bug Ingo Molnar 2009-07-01 8:10 ` Pekka Enberg @ 2009-07-01 9:18 ` Catalin Marinas 2009-07-01 9:30 ` Ingo Molnar 1 sibling, 1 reply; 21+ messages in thread From: Catalin Marinas @ 2009-07-01 9:18 UTC (permalink / raw) To: Ingo Molnar Cc: Linux Kernel Mailing List, Andrew Morton, Linus Torvalds, Peter Zijlstra, git-commits-head On Wed, 2009-07-01 at 09:53 +0200, Ingo Molnar wrote: > * Linux Kernel Mailing List <linux-kernel@vger.kernel.org> wrote: > > > Gitweb: http://git.kernel.org/linus/acf4968ec9dea49387ca8b3d36dfaa0850bdb2d5 > > Commit: acf4968ec9dea49387ca8b3d36dfaa0850bdb2d5 > > Parent: 4698c1f2bbe44ce852ef1a6716973c1f5401a4c4 > > Author: Catalin Marinas <catalin.marinas@arm.com> > > AuthorDate: Fri Jun 26 17:38:29 2009 +0100 > > Committer: Catalin Marinas <catalin.marinas@arm.com> > > CommitDate: Fri Jun 26 17:38:29 2009 +0100 > > > > kmemleak: Slightly change the policy on newly allocated objects > > I think one of the kmemleak fixes that went upstream yesterday > caused the following scheduling-while-holding-the-tasklist-lock > regression/crash on x86: Thanks for the patch. The bug was always there, only that the task stack scanning is now enabled by default (and I probably have a small number of tasks that the rescheduling didn't happen during stack scanning). > The minimal fix below removes scan_yield() and adds a cond_resched() > to the outmost (safe) place of the scanning thread. This solves the > regression. With CONFIG_PREEMPT disabled it won't reschedule during the bss scanning but I don't see this as a real issue (task stacks scanning probably takes longer anyway). > The background scanning thread could probably also be reniced > to +10. 
diff --git a/mm/kmemleak.c b/mm/kmemleak.c index e766e1d..6006553 100644 --- a/mm/kmemleak.c +++ b/mm/kmemleak.c @@ -1039,6 +1039,7 @@ static int kmemleak_scan_thread(void *arg) static int first_run = 1; pr_info("Automatic memory scanning thread started\n"); + set_user_nice(current, 10); /* * Wait before the first scan to allow the system to fully initialize. -- Catalin ^ permalink raw reply related [flat|nested] 21+ messages in thread
* Re: [PATCH] kmemleak: Fix scheduling-while-atomic bug 2009-07-01 9:18 ` Catalin Marinas @ 2009-07-01 9:30 ` Ingo Molnar 2009-07-01 9:46 ` Catalin Marinas ` (2 more replies) 0 siblings, 3 replies; 21+ messages in thread From: Ingo Molnar @ 2009-07-01 9:30 UTC (permalink / raw) To: Catalin Marinas Cc: Linux Kernel Mailing List, Andrew Morton, Linus Torvalds, Peter Zijlstra, git-commits-head * Catalin Marinas <catalin.marinas@arm.com> wrote: > > The minimal fix below removes scan_yield() and adds a > > cond_resched() to the outmost (safe) place of the scanning > > thread. This solves the regression. > > With CONFIG_PREEMPT disabled it won't reschedule during the bss > scanning but I don't see this as a real issue (task stacks > scanning probably takes longer anyway). Yeah. I suspect one more cond_resched() could be added - i just didnt see an obvious place for it, given that scan_block() is being called with asymetric held-locks contexts. Ingo ^ permalink raw reply [flat|nested] 21+ messages in thread
* Re: [PATCH] kmemleak: Fix scheduling-while-atomic bug 2009-07-01 9:30 ` Ingo Molnar @ 2009-07-01 9:46 ` Catalin Marinas 2009-07-01 11:04 ` Ingo Molnar 2009-07-02 9:48 ` [PATCH] kmemleak: Fix scheduling-while-atomic bug Catalin Marinas 2009-07-08 13:33 ` [PATCH] kmemleak: Fix scheduling-while-atomic bug Catalin Marinas 2 siblings, 1 reply; 21+ messages in thread From: Catalin Marinas @ 2009-07-01 9:46 UTC (permalink / raw) To: Ingo Molnar Cc: Linux Kernel Mailing List, Andrew Morton, Linus Torvalds, Peter Zijlstra, git-commits-head On Wed, 2009-07-01 at 11:30 +0200, Ingo Molnar wrote: > * Catalin Marinas <catalin.marinas@arm.com> wrote: > > > > The minimal fix below removes scan_yield() and adds a > > > cond_resched() to the outmost (safe) place of the scanning > > > thread. This solves the regression. > > > > With CONFIG_PREEMPT disabled it won't reschedule during the bss > > scanning but I don't see this as a real issue (task stacks > > scanning probably takes longer anyway). > > Yeah. I suspect one more cond_resched() could be added - i just > didnt see an obvious place for it, given that scan_block() is being > called with asymetric held-locks contexts. Yes, scan_block shouldn't call cond_resched(). The code is cleaner if functions don't have too many side-effects. I can see about 1 sec of bss scanning on an ARM board but with processor at < 500MHz and slow memory system. On a standard x86 systems BSS scanning may not be noticeable (and I think PREEMPT enabling is quite common these days). Since we are at locking, I just noticed this on my x86 laptop when running cat /sys/kernel/debug/kmemleak (I haven't got it on an ARM board): ================================================ [ BUG: lock held when returning to user space! ] ------------------------------------------------ cat/3687 is leaving the kernel with locks still held! 
1 lock held by cat/3687:
 #0:  (scan_mutex){+.+.+.}, at: [<c01e0c5c>] kmemleak_open+0x3c/0x70

kmemleak_open() acquires scan_mutex and unconditionally releases it in
kmemleak_release(). The mutex seems to be released as a subsequent
acquiring works fine.

Is this caused just because cat may have exited without closing the
file descriptor (which should be done automatically anyway)?

Thanks.

-- 
Catalin

^ permalink raw reply	[flat|nested] 21+ messages in thread
* Re: [PATCH] kmemleak: Fix scheduling-while-atomic bug 2009-07-01 9:46 ` Catalin Marinas @ 2009-07-01 11:04 ` Ingo Molnar 2009-07-02 12:48 ` Exiting with locks still held (was Re: [PATCH] kmemleak: Fix scheduling-while-atomic bug) Catalin Marinas 0 siblings, 1 reply; 21+ messages in thread From: Ingo Molnar @ 2009-07-01 11:04 UTC (permalink / raw) To: Catalin Marinas Cc: Linux Kernel Mailing List, Andrew Morton, Linus Torvalds, Peter Zijlstra, git-commits-head * Catalin Marinas <catalin.marinas@arm.com> wrote: > On Wed, 2009-07-01 at 11:30 +0200, Ingo Molnar wrote: > > * Catalin Marinas <catalin.marinas@arm.com> wrote: > > > > > > The minimal fix below removes scan_yield() and adds a > > > > cond_resched() to the outmost (safe) place of the scanning > > > > thread. This solves the regression. > > > > > > With CONFIG_PREEMPT disabled it won't reschedule during the bss > > > scanning but I don't see this as a real issue (task stacks > > > scanning probably takes longer anyway). > > > > Yeah. I suspect one more cond_resched() could be added - i just > > didnt see an obvious place for it, given that scan_block() is being > > called with asymetric held-locks contexts. > > Yes, scan_block shouldn't call cond_resched(). The code is cleaner if > functions don't have too many side-effects. I can see about 1 sec of bss > scanning on an ARM board but with processor at < 500MHz and slow memory > system. On a standard x86 systems BSS scanning may not be noticeable > (and I think PREEMPT enabling is quite common these days). > > Since we are at locking, I just noticed this on my x86 laptop when > running cat /sys/kernel/debug/kmemleak (I haven't got it on an ARM > board): > > ================================================ > [ BUG: lock held when returning to user space! ] > ------------------------------------------------ > cat/3687 is leaving the kernel with locks still held! 
> 1 lock held by cat/3687: > #0: (scan_mutex){+.+.+.}, at: [<c01e0c5c>] kmemleak_open+0x3c/0x70 > > kmemleak_open() acquires scan_mutex and unconditionally releases > it in kmemleak_release(). The mutex seems to be released as a > subsequent acquiring works fine. > > Is this caused just because cat may have exited without closing > the file descriptor (which should be done automatically anyway)? This lockdep warning has a 0% false positives track record so far: all previous cases it triggered showed some real (and fatal) bug in the underlying code. The above one probably means scan_mutex is leaked out of a /proc syscall - that would be a bug in kmemleak. Ingo ^ permalink raw reply [flat|nested] 21+ messages in thread
* Exiting with locks still held (was Re: [PATCH] kmemleak: Fix scheduling-while-atomic bug) 2009-07-01 11:04 ` Ingo Molnar @ 2009-07-02 12:48 ` Catalin Marinas 2009-07-02 12:54 ` Pekka Enberg 2009-07-03 7:04 ` Ingo Molnar 0 siblings, 2 replies; 21+ messages in thread From: Catalin Marinas @ 2009-07-02 12:48 UTC (permalink / raw) To: Ingo Molnar Cc: Linux Kernel Mailing List, Andrew Morton, Linus Torvalds, Peter Zijlstra, git-commits-head Hi Ingo, On Wed, 2009-07-01 at 13:04 +0200, Ingo Molnar wrote: > * Catalin Marinas <catalin.marinas@arm.com> wrote: > > Since we are at locking, I just noticed this on my x86 laptop when > > running cat /sys/kernel/debug/kmemleak (I haven't got it on an ARM > > board): > > > > ================================================ > > [ BUG: lock held when returning to user space! ] > > ------------------------------------------------ > > cat/3687 is leaving the kernel with locks still held! > > 1 lock held by cat/3687: > > #0: (scan_mutex){+.+.+.}, at: [<c01e0c5c>] kmemleak_open+0x3c/0x70 > > > > kmemleak_open() acquires scan_mutex and unconditionally releases > > it in kmemleak_release(). The mutex seems to be released as a > > subsequent acquiring works fine. > > > > Is this caused just because cat may have exited without closing > > the file descriptor (which should be done automatically anyway)? > > This lockdep warning has a 0% false positives track record so far: > all previous cases it triggered showed some real (and fatal) bug in > the underlying code. In this particular case, there is no fatal problem as the mutex is released shortly after this message. > The above one probably means scan_mutex is leaked out of a /proc > syscall - that would be a bug in kmemleak. It could be but I can't figure out a solution. If there is only one task opening and closing the kmemleak file, everything is fine. In combination with shell piping I think I get the kmemleak file descriptor released from a different task than the one that opened it. 
For example, the badly written code below opens kmemleak and acquires
the scan_mutex in the parent task but releases it in the child (it needs
a few tries to trigger it). With waitpid() in parent everything is fine.

  #include <stdio.h>
  #include <sys/types.h>
  #include <sys/stat.h>
  #include <fcntl.h>
  #include <unistd.h>
  #include <sys/wait.h>

  int main(void)
  {
          int fd = open("/sys/kernel/debug/kmemleak", O_RDONLY);

          printf("fd = %d\n", fd);
          if (fd < 0)
                  return 2;

          if (!fork()) {
                  /* child */
                  sleep(2);
                  close(fd);
                  printf("fd closed in child\n");
          }

          return 0;
  }

Running this gives (the ### lines are printed in the
kmemleak_open/release functions):

  # ./cat-kmemleak
  ### kmemleak_open current->pid = 1409
  fd = 3
  =====================================
  [ BUG: lock held at task exit time! ]
  -------------------------------------
  cat-kmemleak/1409 is exiting with locks still held!
  1 lock held by cat-kmemleak/1409:
   #0:  (scan_mutex){+.+.+.}, at: [<c00662b1>] kmemleak_open+0x31/0x68

  stack backtrace:
  [<c0024025>] (unwind_backtrace+0x1/0x80) from [<c01cddd7>] (dump_stack+0xb/0xc)
  [<c01cddd7>] (dump_stack+0xb/0xc) from [<c0043d2d>] (debug_check_no_locks_held+0x49/0x64)
  [<c0043d2d>] (debug_check_no_locks_held+0x49/0x64) from [<c0031423>] (do_exit+0x3fb/0x43c)
  [<c0031423>] (do_exit+0x3fb/0x43c) from [<c00314c5>] (do_group_exit+0x61/0x80)
  [<c00314c5>] (do_group_exit+0x61/0x80) from [<c00314f3>] (sys_exit_group+0xf/0x14)
  [<c00314f3>] (sys_exit_group+0xf/0x14) from [<c001fc41>] (ret_fast_syscall+0x1/0x40)
  ### kmemleak_release current->pid = 1410
  fd closed in child

Any suggestions? Thanks.

-- 
Catalin

^ permalink raw reply	[flat|nested] 21+ messages in thread
* Re: Exiting with locks still held (was Re: [PATCH] kmemleak: Fix scheduling-while-atomic bug) 2009-07-02 12:48 ` Exiting with locks still held (was Re: [PATCH] kmemleak: Fix scheduling-while-atomic bug) Catalin Marinas @ 2009-07-02 12:54 ` Pekka Enberg 2009-07-02 13:06 ` Catalin Marinas 2009-07-02 14:13 ` Catalin Marinas 2009-07-03 7:04 ` Ingo Molnar 1 sibling, 2 replies; 21+ messages in thread From: Pekka Enberg @ 2009-07-02 12:54 UTC (permalink / raw) To: Catalin Marinas Cc: Ingo Molnar, Linux Kernel Mailing List, Andrew Morton, Linus Torvalds, Peter Zijlstra, git-commits-head Hi Catalin, On Thu, Jul 2, 2009 at 3:48 PM, Catalin Marinas<catalin.marinas@arm.com> wrote: > It could be but I can't figure out a solution. If there is only one task > opening and closing the kmemleak file, everything is fine. In > combination with shell piping I think I get the kmemleak file descriptor > released from a different task than the one that opened it. > > For example, the badly written code below opens kmemleak and acquires > the scan_mutex in the parent task but releases it in the child (it needs > a few tries to trigger it). With waitpid() in parent everything is fine. > > #include <stdio.h> > #include <sys/types.h> > #include <sys/stat.h> > #include <fcntl.h> > #include <unistd.h> > #include <sys/wait.h> > > int main(void) > { > int fd = open("/sys/kernel/debug/kmemleak", O_RDONLY); > > printf("fd = %d\n", fd); > if (fd < 0) > return 2; > > if (!fork()) { > /* child */ > sleep(2); > close(fd); > printf("fd closed in child\n"); > } > > return 0; > } > > Running this gives (the ### lines are printed in the > kmemleak_open/release functions): > > # ./cat-kmemleak > ### kmemleak_open current->pid = 1409 > fd = 3 > ===================================== > [ BUG: lock held at task exit time! ] > ------------------------------------- > cat-kmemleak/1409 is exiting with locks still held! 
> 1 lock held by cat-kmemleak/1409: > #0: (scan_mutex){+.+.+.}, at: [<c00662b1>] kmemleak_open+0x31/0x68 > > stack backtrace: > [<c0024025>] (unwind_backtrace+0x1/0x80) from [<c01cddd7>] (dump_stack+0xb/0xc) > [<c01cddd7>] (dump_stack+0xb/0xc) from [<c0043d2d>] (debug_check_no_locks_held+0x49/0x64) > [<c0043d2d>] (debug_check_no_locks_held+0x49/0x64) from [<c0031423>] (do_exit+0x3fb/0x43c) > [<c0031423>] (do_exit+0x3fb/0x43c) from [<c00314c5>] (do_group_exit+0x61/0x80) > [<c00314c5>] (do_group_exit+0x61/0x80) from [<c00314f3>] (sys_exit_group+0xf/0x14) > [<c00314f3>] (sys_exit_group+0xf/0x14) from [<c001fc41>] (ret_fast_syscall+0x1/0x40) > > ### kmemleak_release current->pid = 1410 > fd closed in child > > Any suggestions? Thanks. Well, you are not supposed to hold on to locks when returning from a system call ("sys_open") anyway. You can probably do the exclusion with a kmemcheck specific flag? Pekka ^ permalink raw reply [flat|nested] 21+ messages in thread
* Re: Exiting with locks still held (was Re: [PATCH] kmemleak: Fix scheduling-while-atomic bug) 2009-07-02 12:54 ` Pekka Enberg @ 2009-07-02 13:06 ` Catalin Marinas 2009-07-02 14:13 ` Catalin Marinas 1 sibling, 0 replies; 21+ messages in thread From: Catalin Marinas @ 2009-07-02 13:06 UTC (permalink / raw) To: Pekka Enberg Cc: Ingo Molnar, Linux Kernel Mailing List, Andrew Morton, Linus Torvalds, Peter Zijlstra, git-commits-head On Thu, 2009-07-02 at 15:54 +0300, Pekka Enberg wrote: > On Thu, Jul 2, 2009 at 3:48 PM, Catalin Marinas<catalin.marinas@arm.com> wrote: > > It could be but I can't figure out a solution. If there is only one task > > opening and closing the kmemleak file, everything is fine. In > > combination with shell piping I think I get the kmemleak file descriptor > > released from a different task than the one that opened it. > > > > For example, the badly written code below opens kmemleak and acquires > > the scan_mutex in the parent task but releases it in the child (it needs > > a few tries to trigger it). With waitpid() in parent everything is fine. [...] > Well, you are not supposed to hold on to locks when returning from a > system call ("sys_open") anyway. You can probably do the exclusion > with a kmemcheck specific flag? Acquiring the mutex in "open" and releasing it in "release" was easier. I'll see if I can move the mutex to the seq_read functions which actually need it. -- Catalin ^ permalink raw reply [flat|nested] 21+ messages in thread
* Re: Exiting with locks still held (was Re: [PATCH] kmemleak: Fix scheduling-while-atomic bug) 2009-07-02 12:54 ` Pekka Enberg 2009-07-02 13:06 ` Catalin Marinas @ 2009-07-02 14:13 ` Catalin Marinas 2009-07-02 17:39 ` Linus Torvalds 1 sibling, 1 reply; 21+ messages in thread From: Catalin Marinas @ 2009-07-02 14:13 UTC (permalink / raw) To: Pekka Enberg Cc: Ingo Molnar, Linux Kernel Mailing List, Andrew Morton, Linus Torvalds, Peter Zijlstra, git-commits-head On Thu, 2009-07-02 at 15:54 +0300, Pekka Enberg wrote: > On Thu, Jul 2, 2009 at 3:48 PM, Catalin Marinas<catalin.marinas@arm.com> wrote: > > For example, the badly written code below opens kmemleak and acquires > > the scan_mutex in the parent task but releases it in the child (it needs > > a few tries to trigger it). With waitpid() in parent everything is fine. [...] > Well, you are not supposed to hold on to locks when returning from a > system call ("sys_open") anyway. There's a fix to this. I'll test it a bit more before pushing upstream: kmemleak: Do not acquire scan_mutex in kmemleak_open() From: Catalin Marinas <catalin.marinas@arm.com> Initially, the scan_mutex was acquired in kmemleak_open() and released in kmemleak_release() (corresponding to /sys/kernel/debug/kmemleak operations). This was causing some lockdep reports when the file was closed from a different task than the one opening it. This patch moves the scan_mutex acquiring in kmemleak_write() or kmemleak_seq_show(). 
Signed-off-by: Catalin Marinas <catalin.marinas@arm.com> --- mm/kmemleak.c | 58 +++++++++++++++++++++++++++------------------------------ 1 files changed, 27 insertions(+), 31 deletions(-) diff --git a/mm/kmemleak.c b/mm/kmemleak.c index 958b462..678baad 100644 --- a/mm/kmemleak.c +++ b/mm/kmemleak.c @@ -1157,11 +1157,22 @@ static int kmemleak_seq_show(struct seq_file *seq, void *v) { struct kmemleak_object *object = v; unsigned long flags; + int ret; + + /* + * scan_mutex needs to be acquired here since unreferenced_object() is + * not reliable during memory scanning. + */ + ret = mutex_lock_interruptible(&scan_mutex); + if (ret < 0) + return ret; spin_lock_irqsave(&object->lock, flags); if ((object->flags & OBJECT_REPORTED) && unreferenced_object(object)) print_unreferenced(seq, object); spin_unlock_irqrestore(&object->lock, flags); + + mutex_unlock(&scan_mutex); return 0; } @@ -1174,36 +1185,15 @@ static const struct seq_operations kmemleak_seq_ops = { static int kmemleak_open(struct inode *inode, struct file *file) { - int ret = 0; - if (!atomic_read(&kmemleak_enabled)) return -EBUSY; - ret = mutex_lock_interruptible(&scan_mutex); - if (ret < 0) - goto out; - if (file->f_mode & FMODE_READ) { - ret = seq_open(file, &kmemleak_seq_ops); - if (ret < 0) - goto scan_unlock; - } - return ret; - -scan_unlock: - mutex_unlock(&scan_mutex); -out: - return ret; + return seq_open(file, &kmemleak_seq_ops); } static int kmemleak_release(struct inode *inode, struct file *file) { - int ret = 0; - - if (file->f_mode & FMODE_READ) - seq_release(inode, file); - mutex_unlock(&scan_mutex); - - return ret; + return seq_release(inode, file); } /* @@ -1223,15 +1213,17 @@ static ssize_t kmemleak_write(struct file *file, const char __user *user_buf, { char buf[64]; int buf_size; - - if (!atomic_read(&kmemleak_enabled)) - return -EBUSY; + int ret; buf_size = min(size, (sizeof(buf) - 1)); if (strncpy_from_user(buf, user_buf, buf_size) < 0) return -EFAULT; buf[buf_size] = 0; + ret = 
mutex_lock_interruptible(&scan_mutex); + if (ret < 0) + return ret; + if (strncmp(buf, "off", 3) == 0) kmemleak_disable(); else if (strncmp(buf, "stack=on", 8) == 0) @@ -1244,11 +1236,10 @@ static ssize_t kmemleak_write(struct file *file, const char __user *user_buf, stop_scan_thread(); else if (strncmp(buf, "scan=", 5) == 0) { unsigned long secs; - int err; - err = strict_strtoul(buf + 5, 0, &secs); - if (err < 0) - return err; + ret = strict_strtoul(buf + 5, 0, &secs); + if (ret < 0) + goto out; stop_scan_thread(); if (secs) { jiffies_scan_wait = msecs_to_jiffies(secs * 1000); @@ -1257,7 +1248,12 @@ static ssize_t kmemleak_write(struct file *file, const char __user *user_buf, } else if (strncmp(buf, "scan", 4) == 0) kmemleak_scan(); else - return -EINVAL; + ret = -EINVAL; + +out: + mutex_unlock(&scan_mutex); + if (ret < 0) + return ret; /* ignore the rest of the buffer, only one command at a time */ *ppos += size; -- Catalin ^ permalink raw reply related [flat|nested] 21+ messages in thread
* Re: Exiting with locks still held (was Re: [PATCH] kmemleak: Fix scheduling-while-atomic bug) 2009-07-02 14:13 ` Catalin Marinas @ 2009-07-02 17:39 ` Linus Torvalds 2009-07-03 10:18 ` Catalin Marinas 0 siblings, 1 reply; 21+ messages in thread From: Linus Torvalds @ 2009-07-02 17:39 UTC (permalink / raw) To: Catalin Marinas Cc: Pekka Enberg, Ingo Molnar, Linux Kernel Mailing List, Andrew Morton, Peter Zijlstra, git-commits-head On Thu, 2 Jul 2009, Catalin Marinas wrote: > > Initially, the scan_mutex was acquired in kmemleak_open() and released > in kmemleak_release() (corresponding to /sys/kernel/debug/kmemleak > operations). This was causing some lockdep reports when the file was > closed from a different task than the one opening it. This patch moves > the scan_mutex acquiring in kmemleak_write() or kmemleak_seq_show(). This is better, but not really how you are supposed to do it. The whole seq-file thing is very much _designed_ for taking a lock at the beginning of the operation, and releasing it at the end. It's a very common pattern. But you should _not_ do it in the "show" routine. If you do, you're always going to be racy wrt lseek() and friends. What you _should_ do is to take the lock in the "seq_start" routine, and release it in "seq_stop". The "seq_show" routine may be called multiple times in between. For a trivial example, see the drivers/char/misc.c file. Note how it needs to hold the lock over the whole list traversal, and how seqfiles allows it to do that quite naturally. Linus ^ permalink raw reply [flat|nested] 21+ messages in thread
* Re: Exiting with locks still held (was Re: [PATCH] kmemleak: Fix scheduling-while-atomic bug) 2009-07-02 17:39 ` Linus Torvalds @ 2009-07-03 10:18 ` Catalin Marinas 0 siblings, 0 replies; 21+ messages in thread From: Catalin Marinas @ 2009-07-03 10:18 UTC (permalink / raw) To: Linus Torvalds Cc: Pekka Enberg, Ingo Molnar, Linux Kernel Mailing List, Andrew Morton, Peter Zijlstra, git-commits-head On Thu, 2009-07-02 at 10:39 -0700, Linus Torvalds wrote: > On Thu, 2 Jul 2009, Catalin Marinas wrote: > > > > Initially, the scan_mutex was acquired in kmemleak_open() and released > > in kmemleak_release() (corresponding to /sys/kernel/debug/kmemleak > > operations). This was causing some lockdep reports when the file was > > closed from a different task than the one opening it. This patch moves > > the scan_mutex acquiring in kmemleak_write() or kmemleak_seq_show(). > > This is better, but not really how you are supposed to do it. > > The whole seq-file thing is very much _designed_ for taking a lock at the > beginning of the operation, and releasing it at the end. It's a very > common pattern. Thanks for this. Here's the updated patch (the difference from misc.c is that it uses mutex_lock_interruptible): kmemleak: Do not acquire scan_mutex in kmemleak_open() From: Catalin Marinas <catalin.marinas@arm.com> Initially, the scan_mutex was acquired in kmemleak_open() and released in kmemleak_release() (corresponding to /sys/kernel/debug/kmemleak operations). This was causing some lockdep reports when the file was closed from a different task than the one opening it. This patch moves the scan_mutex acquiring in kmemleak_write() or kmemleak_seq_start() with releasing in kmemleak_seq_stop(). 
Signed-off-by: Catalin Marinas <catalin.marinas@arm.com> --- mm/kmemleak.c | 63 +++++++++++++++++++++++++++------------------------------ 1 files changed, 30 insertions(+), 33 deletions(-) diff --git a/mm/kmemleak.c b/mm/kmemleak.c index 23bf5f0..c6e0aae 100644 --- a/mm/kmemleak.c +++ b/mm/kmemleak.c @@ -1103,6 +1103,11 @@ static void *kmemleak_seq_start(struct seq_file *seq, loff_t *pos) { struct kmemleak_object *object; loff_t n = *pos; + int err; + + err = mutex_lock_interruptible(&scan_mutex); + if (err < 0) + return ERR_PTR(err); rcu_read_lock(); list_for_each_entry_rcu(object, &object_list, object_list) { @@ -1146,8 +1151,15 @@ static void *kmemleak_seq_next(struct seq_file *seq, void *v, loff_t *pos) */ static void kmemleak_seq_stop(struct seq_file *seq, void *v) { - if (v) - put_object(v); + if (!IS_ERR(v)) { + /* + * kmemleak_seq_start may return ERR_PTR if the scan_mutex + * waiting was interrupted, so only release it if !IS_ERR. + */ + mutex_unlock(&scan_mutex); + if (v) + put_object(v); + } } /* @@ -1174,36 +1186,15 @@ static const struct seq_operations kmemleak_seq_ops = { static int kmemleak_open(struct inode *inode, struct file *file) { - int ret = 0; - if (!atomic_read(&kmemleak_enabled)) return -EBUSY; - ret = mutex_lock_interruptible(&scan_mutex); - if (ret < 0) - goto out; - if (file->f_mode & FMODE_READ) { - ret = seq_open(file, &kmemleak_seq_ops); - if (ret < 0) - goto scan_unlock; - } - return ret; - -scan_unlock: - mutex_unlock(&scan_mutex); -out: - return ret; + return seq_open(file, &kmemleak_seq_ops); } static int kmemleak_release(struct inode *inode, struct file *file) { - int ret = 0; - - if (file->f_mode & FMODE_READ) - seq_release(inode, file); - mutex_unlock(&scan_mutex); - - return ret; + return seq_release(inode, file); } /* @@ -1223,15 +1214,17 @@ static ssize_t kmemleak_write(struct file *file, const char __user *user_buf, { char buf[64]; int buf_size; - - if (!atomic_read(&kmemleak_enabled)) - return -EBUSY; + int ret; buf_size = 
min(size, (sizeof(buf) - 1)); if (strncpy_from_user(buf, user_buf, buf_size) < 0) return -EFAULT; buf[buf_size] = 0; + ret = mutex_lock_interruptible(&scan_mutex); + if (ret < 0) + return ret; + if (strncmp(buf, "off", 3) == 0) kmemleak_disable(); else if (strncmp(buf, "stack=on", 8) == 0) @@ -1244,11 +1237,10 @@ static ssize_t kmemleak_write(struct file *file, const char __user *user_buf, stop_scan_thread(); else if (strncmp(buf, "scan=", 5) == 0) { unsigned long secs; - int err; - err = strict_strtoul(buf + 5, 0, &secs); - if (err < 0) - return err; + ret = strict_strtoul(buf + 5, 0, &secs); + if (ret < 0) + goto out; stop_scan_thread(); if (secs) { jiffies_scan_wait = msecs_to_jiffies(secs * 1000); @@ -1257,7 +1249,12 @@ static ssize_t kmemleak_write(struct file *file, const char __user *user_buf, } else if (strncmp(buf, "scan", 4) == 0) kmemleak_scan(); else - return -EINVAL; + ret = -EINVAL; + +out: + mutex_unlock(&scan_mutex); + if (ret < 0) + return ret; /* ignore the rest of the buffer, only one command at a time */ *ppos += size; -- Catalin ^ permalink raw reply related [flat|nested] 21+ messages in thread
* Re: Exiting with locks still held (was Re: [PATCH] kmemleak: Fix scheduling-while-atomic bug) 2009-07-02 12:48 ` Exiting with locks still held (was Re: [PATCH] kmemleak: Fix scheduling-while-atomic bug) Catalin Marinas 2009-07-02 12:54 ` Pekka Enberg @ 2009-07-03 7:04 ` Ingo Molnar 1 sibling, 0 replies; 21+ messages in thread From: Ingo Molnar @ 2009-07-03 7:04 UTC (permalink / raw) To: Catalin Marinas Cc: Linux Kernel Mailing List, Andrew Morton, Linus Torvalds, Peter Zijlstra, git-commits-head * Catalin Marinas <catalin.marinas@arm.com> wrote: > Hi Ingo, > > On Wed, 2009-07-01 at 13:04 +0200, Ingo Molnar wrote: > > * Catalin Marinas <catalin.marinas@arm.com> wrote: > > > Since we are at locking, I just noticed this on my x86 laptop when > > > running cat /sys/kernel/debug/kmemleak (I haven't got it on an ARM > > > board): > > > > > > ================================================ > > > [ BUG: lock held when returning to user space! ] > > > ------------------------------------------------ > > > cat/3687 is leaving the kernel with locks still held! > > > 1 lock held by cat/3687: > > > #0: (scan_mutex){+.+.+.}, at: [<c01e0c5c>] kmemleak_open+0x3c/0x70 > > > > > > kmemleak_open() acquires scan_mutex and unconditionally releases > > > it in kmemleak_release(). The mutex seems to be released as a > > > subsequent acquiring works fine. > > > > > > Is this caused just because cat may have exited without closing > > > the file descriptor (which should be done automatically anyway)? > > > > This lockdep warning has a 0% false positives track record so > > far: all previous cases it triggered showed some real (and > > fatal) bug in the underlying code. > > In this particular case, there is no fatal problem as the mutex is > released shortly after this message. Maybe - but holding locks in user-space is almost always bad. What happens if user-space opens a second file descriptor before closing the first one? 
We either lock up (which is bad and fatal) or you already have some _other_ exclusion mechanism that prevents this case, which calls into question the need to hold this particular mutex in user-space to begin with. I've yet to see a valid 'need to hold this kernel lock in user-space' case, and this does not seem to be such a case either. Ingo ^ permalink raw reply [flat|nested] 21+ messages in thread
* Re: [PATCH] kmemleak: Fix scheduling-while-atomic bug 2009-07-01 9:30 ` Ingo Molnar 2009-07-01 9:46 ` Catalin Marinas @ 2009-07-02 9:48 ` Catalin Marinas 2009-07-03 7:00 ` [PATCH] kmemleak: Mark nice +10 Ingo Molnar 2009-07-08 13:33 ` [PATCH] kmemleak: Fix scheduling-while-atomic bug Catalin Marinas 2 siblings, 1 reply; 21+ messages in thread From: Catalin Marinas @ 2009-07-02 9:48 UTC (permalink / raw) To: Ingo Molnar Cc: Linux Kernel Mailing List, Andrew Morton, Linus Torvalds, Peter Zijlstra, git-commits-head On Wed, 2009-07-01 at 11:30 +0200, Ingo Molnar wrote: > * Catalin Marinas <catalin.marinas@arm.com> wrote: > > > > The minimal fix below removes scan_yield() and adds a > > > cond_resched() to the outmost (safe) place of the scanning > > > thread. This solves the regression. > > > > With CONFIG_PREEMPT disabled it won't reschedule during the bss > > scanning but I don't see this as a real issue (task stacks > > scanning probably takes longer anyway). > > Yeah. I suspect one more cond_resched() could be added - i just > didnt see an obvious place for it, given that scan_block() is being > called with asymetric held-locks contexts. Now that your patch was merged, I propose adding a few more cond_resched() calls, useful for the !PREEMPT case: kmemleak: Add more cond_resched() calls in the scanning thread From: Catalin Marinas <catalin.marinas@arm.com> Following recent patch to no longer reschedule in the scan_block() function, we need a few more cond_resched() calls in the kmemleak_scan() function (useful if PREEMPT is disabled). 
Signed-off-by: Catalin Marinas <catalin.marinas@arm.com> --- mm/kmemleak.c | 7 ++++++- 1 files changed, 6 insertions(+), 1 deletions(-) diff --git a/mm/kmemleak.c b/mm/kmemleak.c index 013f188..b4f675e 100644 --- a/mm/kmemleak.c +++ b/mm/kmemleak.c @@ -932,13 +932,16 @@ static void kmemleak_scan(void) /* data/bss scanning */ scan_block(_sdata, _edata, NULL); + cond_resched(); scan_block(__bss_start, __bss_stop, NULL); #ifdef CONFIG_SMP /* per-cpu sections scanning */ - for_each_possible_cpu(i) + for_each_possible_cpu(i) { + cond_resched(); scan_block(__per_cpu_start + per_cpu_offset(i), __per_cpu_end + per_cpu_offset(i), NULL); + } #endif /* @@ -960,6 +963,7 @@ static void kmemleak_scan(void) /* only scan if page is in use */ if (page_count(page) == 0) continue; + cond_resched(); scan_block(page, page + 1, NULL); } } @@ -969,6 +973,7 @@ static void kmemleak_scan(void) * not enabled by default. */ if (kmemleak_stack_scan) { + cond_resched(); read_lock(&tasklist_lock); for_each_process(task) scan_block(task_stack_page(task), -- Catalin ^ permalink raw reply related [flat|nested] 21+ messages in thread
* [PATCH] kmemleak: Mark nice +10 2009-07-02 9:48 ` [PATCH] kmemleak: Fix scheduling-while-atomic bug Catalin Marinas @ 2009-07-03 7:00 ` Ingo Molnar 2009-07-03 8:09 ` Catalin Marinas 0 siblings, 1 reply; 21+ messages in thread From: Ingo Molnar @ 2009-07-03 7:00 UTC (permalink / raw) To: Catalin Marinas Cc: Linux Kernel Mailing List, Andrew Morton, Linus Torvalds, Peter Zijlstra, git-commits-head * Catalin Marinas <catalin.marinas@arm.com> wrote: > On Wed, 2009-07-01 at 11:30 +0200, Ingo Molnar wrote: > > * Catalin Marinas <catalin.marinas@arm.com> wrote: > > > > > > The minimal fix below removes scan_yield() and adds a > > > > cond_resched() to the outmost (safe) place of the scanning > > > > thread. This solves the regression. > > > > > > With CONFIG_PREEMPT disabled it won't reschedule during the bss > > > scanning but I don't see this as a real issue (task stacks > > > scanning probably takes longer anyway). > > > > Yeah. I suspect one more cond_resched() could be added - i just > > didnt see an obvious place for it, given that scan_block() is being > > called with asymetric held-locks contexts. > > Now that your patch was merged, I propose adding a few more > cond_resched() calls, useful for the !PREEMPT case: note, please also merge the renicing fix you sent. I have it tested in tip:out-of-tree, attached below. Ingo >From f6a529517732a9d0e1ad0cd43ad7d2d96de4a4f5 Mon Sep 17 00:00:00 2001 From: Catalin Marinas <catalin.marinas@arm.com> Date: Wed, 1 Jul 2009 10:18:57 +0100 Subject: [PATCH] kmemleak: Mark nice +10 > The background scanning thread could probably also be reniced > to +10. 
Signed-off-by: Ingo Molnar <mingo@elte.hu> --- mm/kmemleak.c | 1 + 1 files changed, 1 insertions(+), 0 deletions(-) diff --git a/mm/kmemleak.c b/mm/kmemleak.c index e766e1d..6006553 100644 --- a/mm/kmemleak.c +++ b/mm/kmemleak.c @@ -1039,6 +1039,7 @@ static int kmemleak_scan_thread(void *arg) static int first_run = 1; pr_info("Automatic memory scanning thread started\n"); + set_user_nice(current, 10); /* * Wait before the first scan to allow the system to fully initialize. ^ permalink raw reply related [flat|nested] 21+ messages in thread
* Re: [PATCH] kmemleak: Mark nice +10 2009-07-03 7:00 ` [PATCH] kmemleak: Mark nice +10 Ingo Molnar @ 2009-07-03 8:09 ` Catalin Marinas 0 siblings, 0 replies; 21+ messages in thread From: Catalin Marinas @ 2009-07-03 8:09 UTC (permalink / raw) To: Ingo Molnar Cc: Linux Kernel Mailing List, Andrew Morton, Linus Torvalds, Peter Zijlstra, git-commits-head On Fri, 2009-07-03 at 09:00 +0200, Ingo Molnar wrote: > * Catalin Marinas <catalin.marinas@arm.com> wrote: > > > On Wed, 2009-07-01 at 11:30 +0200, Ingo Molnar wrote: > > > * Catalin Marinas <catalin.marinas@arm.com> wrote: > > > > > > > > The minimal fix below removes scan_yield() and adds a > > > > > cond_resched() to the outmost (safe) place of the scanning > > > > > thread. This solves the regression. > > > > > > > > With CONFIG_PREEMPT disabled it won't reschedule during the bss > > > > scanning but I don't see this as a real issue (task stacks > > > > scanning probably takes longer anyway). > > > > > > Yeah. I suspect one more cond_resched() could be added - i just > > > didnt see an obvious place for it, given that scan_block() is being > > > called with asymetric held-locks contexts. > > > > Now that your patch was merged, I propose adding a few more > > cond_resched() calls, useful for the !PREEMPT case: > > note, please also merge the renicing fix you sent. I have it tested > in tip:out-of-tree, attached below. I have this patch in my kmemleak branch (http://www.linux-arm.org/git?p=linux-2.6.git;a=shortlog;h=kmemleak) which I plan to push to Linus, only that I was waiting to accumulate a few more patches (to avoid sending too many pull requests). I'll fix the scan_mutex lock as well, following comments and send a pull request tonight. Thanks. -- Catalin ^ permalink raw reply [flat|nested] 21+ messages in thread
* Re: [PATCH] kmemleak: Fix scheduling-while-atomic bug 2009-07-01 9:30 ` Ingo Molnar 2009-07-01 9:46 ` Catalin Marinas 2009-07-02 9:48 ` [PATCH] kmemleak: Fix scheduling-while-atomic bug Catalin Marinas @ 2009-07-08 13:33 ` Catalin Marinas 2009-08-23 2:48 ` Ming Lei 2 siblings, 1 reply; 21+ messages in thread From: Catalin Marinas @ 2009-07-08 13:33 UTC (permalink / raw) To: Ingo Molnar Cc: Linux Kernel Mailing List, Andrew Morton, Linus Torvalds, Peter Zijlstra Hi Ingo, On Wed, 2009-07-01 at 11:30 +0200, Ingo Molnar wrote: > * Catalin Marinas <catalin.marinas@arm.com> wrote: > > > > The minimal fix below removes scan_yield() and adds a > > > cond_resched() to the outmost (safe) place of the scanning > > > thread. This solves the regression. > > > > With CONFIG_PREEMPT disabled it won't reschedule during the bss > > scanning but I don't see this as a real issue (task stacks > > scanning probably takes longer anyway). > > Yeah. I suspect one more cond_resched() could be added - i just > didnt see an obvious place for it, given that scan_block() is being > called with asymetric held-locks contexts. I tried with a few cond_resched() calls in the kmemleak_scan() function but it looks like a significant (well, 1-2 seconds, depending on the hardware) amount of time is spent in scanning the data and bss sections. The patch below makes the system with !PREEMPT more responsive during the scanning episodes. Are you ok with this approach? Thanks. kmemleak: Add more cond_resched() calls in the scanning thread From: Catalin Marinas <catalin.marinas@arm.com> Following recent fix to no longer reschedule in the scan_block() function, the system may become unresponsive with !PREEMPT. This patch re-adds the cond_resched() call to scan_block() but conditioned by the allow_resched parameter. 
Signed-off-by: Catalin Marinas <catalin.marinas@arm.com> Cc: Ingo Molnar <mingo@elte.hu> --- mm/kmemleak.c | 19 +++++++++++-------- 1 files changed, 11 insertions(+), 8 deletions(-) diff --git a/mm/kmemleak.c b/mm/kmemleak.c index 6006553..93f1481 100644 --- a/mm/kmemleak.c +++ b/mm/kmemleak.c @@ -807,7 +807,7 @@ static int scan_should_stop(void) * found to the gray list. */ static void scan_block(void *_start, void *_end, - struct kmemleak_object *scanned) + struct kmemleak_object *scanned, int allow_resched) { unsigned long *ptr; unsigned long *start = PTR_ALIGN(_start, BYTES_PER_POINTER); @@ -818,6 +818,8 @@ static void scan_block(void *_start, void *_end, unsigned long pointer = *ptr; struct kmemleak_object *object; + if (allow_resched) + cond_resched(); if (scan_should_stop()) break; @@ -881,12 +883,12 @@ static void scan_object(struct kmemleak_object *object) goto out; if (hlist_empty(&object->area_list)) scan_block((void *)object->pointer, - (void *)(object->pointer + object->size), object); + (void *)(object->pointer + object->size), object, 0); else hlist_for_each_entry(area, elem, &object->area_list, node) scan_block((void *)(object->pointer + area->offset), (void *)(object->pointer + area->offset - + area->length), object); + + area->length), object, 0); out: spin_unlock_irqrestore(&object->lock, flags); } @@ -931,14 +933,14 @@ static void kmemleak_scan(void) rcu_read_unlock(); /* data/bss scanning */ - scan_block(_sdata, _edata, NULL); - scan_block(__bss_start, __bss_stop, NULL); + scan_block(_sdata, _edata, NULL, 1); + scan_block(__bss_start, __bss_stop, NULL, 1); #ifdef CONFIG_SMP /* per-cpu sections scanning */ for_each_possible_cpu(i) scan_block(__per_cpu_start + per_cpu_offset(i), - __per_cpu_end + per_cpu_offset(i), NULL); + __per_cpu_end + per_cpu_offset(i), NULL, 1); #endif /* @@ -960,7 +962,7 @@ static void kmemleak_scan(void) /* only scan if page is in use */ if (page_count(page) == 0) continue; - scan_block(page, page + 1, NULL); + 
scan_block(page, page + 1, NULL, 1); } } @@ -972,7 +974,8 @@ static void kmemleak_scan(void) read_lock(&tasklist_lock); for_each_process(task) scan_block(task_stack_page(task), - task_stack_page(task) + THREAD_SIZE, NULL); + task_stack_page(task) + THREAD_SIZE, + NULL, 0); read_unlock(&tasklist_lock); } -- Catalin ^ permalink raw reply related [flat|nested] 21+ messages in thread
* Re: [PATCH] kmemleak: Fix scheduling-while-atomic bug 2009-07-08 13:33 ` [PATCH] kmemleak: Fix scheduling-while-atomic bug Catalin Marinas @ 2009-08-23 2:48 ` Ming Lei 2009-08-23 14:59 ` Catalin Marinas 0 siblings, 1 reply; 21+ messages in thread From: Ming Lei @ 2009-08-23 2:48 UTC (permalink / raw) To: Catalin Marinas Cc: Ingo Molnar, Linux Kernel Mailing List, Andrew Morton, Linus Torvalds, Peter Zijlstra 2009/7/8 Catalin Marinas <catalin.marinas@arm.com>: > Hi Ingo, > > On Wed, 2009-07-01 at 11:30 +0200, Ingo Molnar wrote: >> * Catalin Marinas <catalin.marinas@arm.com> wrote: >> >> > > The minimal fix below removes scan_yield() and adds a >> > > cond_resched() to the outmost (safe) place of the scanning >> > > thread. This solves the regression. >> > >> > With CONFIG_PREEMPT disabled it won't reschedule during the bss >> > scanning but I don't see this as a real issue (task stacks >> > scanning probably takes longer anyway). >> >> Yeah. I suspect one more cond_resched() could be added - i just >> didnt see an obvious place for it, given that scan_block() is being >> called with asymetric held-locks contexts. > > I tried with a few cond_resched() calls in the kmemleak_scan() function > but it looks like a significant (well, 1-2 seconds, depending on the > hardware) amount of time is spent in scanning the data and bss sections. > The patch below makes the system with !PREEMPT more responsive during > the scanning episodes. > > Are you ok with this approach? Thanks. > > > kmemleak: Add more cond_resched() calls in the scanning thread > > From: Catalin Marinas <catalin.marinas@arm.com> > > Following recent fix to no longer reschedule in the scan_block() > function, the system may become unresponsive with !PREEMPT. This patch > re-adds the cond_resched() call to scan_block() but conditioned by the > allow_resched parameter. 
> > Signed-off-by: Catalin Marinas <catalin.marinas@arm.com> > Cc: Ingo Molnar <mingo@elte.hu> > --- > mm/kmemleak.c | 19 +++++++++++-------- > 1 files changed, 11 insertions(+), 8 deletions(-) > > diff --git a/mm/kmemleak.c b/mm/kmemleak.c > index 6006553..93f1481 100644 > --- a/mm/kmemleak.c > +++ b/mm/kmemleak.c > @@ -807,7 +807,7 @@ static int scan_should_stop(void) > * found to the gray list. > */ > static void scan_block(void *_start, void *_end, > - struct kmemleak_object *scanned) > + struct kmemleak_object *scanned, int allow_resched) > { > unsigned long *ptr; > unsigned long *start = PTR_ALIGN(_start, BYTES_PER_POINTER); > @@ -818,6 +818,8 @@ static void scan_block(void *_start, void *_end, > unsigned long pointer = *ptr; > struct kmemleak_object *object; > > + if (allow_resched) > + cond_resched(); > if (scan_should_stop()) > break; > > @@ -881,12 +883,12 @@ static void scan_object(struct kmemleak_object *object) > goto out; > if (hlist_empty(&object->area_list)) > scan_block((void *)object->pointer, > - (void *)(object->pointer + object->size), object); > + (void *)(object->pointer + object->size), object, 0); > else > hlist_for_each_entry(area, elem, &object->area_list, node) > scan_block((void *)(object->pointer + area->offset), > (void *)(object->pointer + area->offset > - + area->length), object); > + + area->length), object, 0); > out: > spin_unlock_irqrestore(&object->lock, flags); > } > @@ -931,14 +933,14 @@ static void kmemleak_scan(void) > rcu_read_unlock(); > > /* data/bss scanning */ > - scan_block(_sdata, _edata, NULL); > - scan_block(__bss_start, __bss_stop, NULL); > + scan_block(_sdata, _edata, NULL, 1); > + scan_block(__bss_start, __bss_stop, NULL, 1); > > #ifdef CONFIG_SMP > /* per-cpu sections scanning */ > for_each_possible_cpu(i) > scan_block(__per_cpu_start + per_cpu_offset(i), > - __per_cpu_end + per_cpu_offset(i), NULL); > + __per_cpu_end + per_cpu_offset(i), NULL, 1); > #endif > > /* > @@ -960,7 +962,7 @@ static void 
kmemleak_scan(void) > /* only scan if page is in use */ > if (page_count(page) == 0) > continue; > - scan_block(page, page + 1, NULL); > + scan_block(page, page + 1, NULL, 1); > } > } > > @@ -972,7 +974,8 @@ static void kmemleak_scan(void) > read_lock(&tasklist_lock); > for_each_process(task) > scan_block(task_stack_page(task), > - task_stack_page(task) + THREAD_SIZE, NULL); > + task_stack_page(task) + THREAD_SIZE, > + NULL, 0); > read_unlock(&tasklist_lock); > } > This still exists in 2.6.31-rc7 if kmemcheck is enabled; hasn't the patch been merged into mainline? Thanks. [ 0.006569] ACPI: Core revision 20090521 [ 0.011476] BUG: scheduling while atomic: swapper/0/0x10000002 [ 0.011558] no locks held by swapper/0. [ 0.011561] Modules linked in: [ 0.011566] Pid: 0, comm: swapper Not tainted 2.6.31-rc7 #141 [ 0.011569] Call Trace: [ 0.011577] [<ffffffff81070bb7>] ? __debug_show_held_locks+0x22/0x24 [ 0.011584] [<ffffffff8103d4ba>] __schedule_bug+0x77/0x7c [ 0.011590] [<ffffffff812da32f>] schedule+0xd6/0xa3f [ 0.011596] [<ffffffff810e4b9b>] ? kmem_cache_free+0xe2/0x158 [ 0.011600] [<ffffffff810717e2>] ? trace_hardirqs_on_caller+0x12d/0x158 [ 0.011604] [<ffffffff8107181a>] ? trace_hardirqs_on+0xd/0xf [ 0.011609] [<ffffffff8103e3f0>] __cond_resched+0x29/0x47 [ 0.011614] [<ffffffff812dae24>] _cond_resched+0x29/0x34 [ 0.011620] [<ffffffff811da90a>] acpi_ps_complete_op+0x246/0x25c [ 0.011625] [<ffffffff811db024>] acpi_ps_parse_loop+0x704/0x860 [ 0.011630] [<ffffffff811da200>] acpi_ps_parse_aml+0x9f/0x2de [ 0.011635] [<ffffffff811d9181>] acpi_ns_one_complete_parse+0x101/0x11c [ 0.011640] [<ffffffff811d91bd>] acpi_ns_parse_table+0x21/0x3c [ 0.011645] [<ffffffff811d67e3>] acpi_ns_load_table+0x4f/0x94 [ 0.011650] [<ffffffff811dd14a>] acpi_load_tables+0x72/0x133 [ 0.011656] [<ffffffff8153c592>] acpi_early_init+0x60/0xf5 [ 0.011661] [<ffffffff81519cb0>] start_kernel+0x38b/0x3a0 [ 0.011666] [<ffffffff81519140>] ?
early_idt_handler+0x0/0x71 [ 0.011670] [<ffffffff815192a3>] x86_64_start_reservations+0xaa/0xae [ 0.011675] [<ffffffff8151939e>] x86_64_start_kernel+0xf7/0x106
trace_hardirqs_on+0xd/0xf [ 0.093166] [<ffffffff8103e3f0>] __cond_resched+0x29/0x47 [ 0.093170] [<ffffffff812dae24>] _cond_resched+0x29/0x34 [ 0.093175] [<ffffffff811da90a>] acpi_ps_complete_op+0x246/0x25c [ 0.093180] [<ffffffff811db024>] acpi_ps_parse_loop+0x704/0x860 [ 0.093185] [<ffffffff811da200>] acpi_ps_parse_aml+0x9f/0x2de [ 0.093190] [<ffffffff811d9181>] acpi_ns_one_complete_parse+0x101/0x11c [ 0.093195] [<ffffffff811d91d1>] acpi_ns_parse_table+0x35/0x3c [ 0.093199] [<ffffffff811d67e3>] acpi_ns_load_table+0x4f/0x94 [ 0.093203] [<ffffffff811dd14a>] acpi_load_tables+0x72/0x133 [ 0.093208] [<ffffffff8153c592>] acpi_early_init+0x60/0xf5 [ 0.093212] [<ffffffff81519cb0>] start_kernel+0x38b/0x3a0 [ 0.093217] [<ffffffff81519140>] ? early_idt_handler+0x0/0x71 [ 0.093221] [<ffffffff815192a3>] x86_64_start_reservations+0xaa/0xae [ 0.093226] [<ffffffff8151939e>] x86_64_start_kernel+0xf7/0x106 [ 0.102024] BUG: scheduling while atomic: swapper/0/0x10000002 [ 0.102104] no locks held by swapper/0. [ 0.102106] Modules linked in: [ 0.102110] Pid: 0, comm: swapper Not tainted 2.6.31-rc7 #141 [ 0.102112] Call Trace: [ 0.102117] [<ffffffff81070bb7>] ? __debug_show_held_locks+0x22/0x24 [ 0.102122] [<ffffffff8103d4ba>] __schedule_bug+0x77/0x7c [ 0.102126] [<ffffffff812da32f>] schedule+0xd6/0xa3f [ 0.102130] [<ffffffff810717e2>] ? trace_hardirqs_on_caller+0x12d/0x158 [ 0.102136] [<ffffffff811bbd5f>] ? acpi_os_release_object+0xe/0x12 [ 0.102141] [<ffffffff811db561>] ? acpi_ps_free_op+0x27/0x29 [ 0.102145] [<ffffffff811db28b>] ? 
acpi_ps_delete_parse_tree+0x43/0x60 [ 0.102150] [<ffffffff8103e3f0>] __cond_resched+0x29/0x47 [ 0.102154] [<ffffffff812dae24>] _cond_resched+0x29/0x34 [ 0.102159] [<ffffffff811da90a>] acpi_ps_complete_op+0x246/0x25c [ 0.102164] [<ffffffff811db024>] acpi_ps_parse_loop+0x704/0x860 [ 0.102169] [<ffffffff811da200>] acpi_ps_parse_aml+0x9f/0x2de [ 0.102173] [<ffffffff811d9181>] acpi_ns_one_complete_parse+0x101/0x11c [ 0.102179] [<ffffffff811d91d1>] acpi_ns_parse_table+0x35/0x3c [ 0.102183] [<ffffffff811d67e3>] acpi_ns_load_table+0x4f/0x94 [ 0.102187] [<ffffffff811dd14a>] acpi_load_tables+0x72/0x133 [ 0.102192] [<ffffffff8153c592>] acpi_early_init+0x60/0xf5 [ 0.102196] [<ffffffff81519cb0>] start_kernel+0x38b/0x3a0 [ 0.102201] [<ffffffff81519140>] ? early_idt_handler+0x0/0x71 [ 0.102205] [<ffffffff815192a3>] x86_64_start_reservations+0xaa/0xae [ 0.102210] [<ffffffff8151939e>] x86_64_start_kernel+0xf7/0x106 [ 0.111000] BUG: scheduling while atomic: swapper/0/0x10000002 [ 0.111078] no locks held by swapper/0. [ 0.111081] Modules linked in: [ 0.111085] Pid: 0, comm: swapper Not tainted 2.6.31-rc7 #141 [ 0.111087] Call Trace: [ 0.111092] [<ffffffff81070bb7>] ? __debug_show_held_locks+0x22/0x24 [ 0.111097] [<ffffffff8103d4ba>] __schedule_bug+0x77/0x7c [ 0.111101] [<ffffffff812da32f>] schedule+0xd6/0xa3f [ 0.111105] [<ffffffff810717e2>] ? trace_hardirqs_on_caller+0x12d/0x158 [ 0.111111] [<ffffffff811bbd5f>] ? acpi_os_release_object+0xe/0x12 [ 0.111116] [<ffffffff811db561>] ? acpi_ps_free_op+0x27/0x29 [ 0.111120] [<ffffffff811db28b>] ? 
acpi_ps_delete_parse_tree+0x43/0x60 [ 0.111125] [<ffffffff8103e3f0>] __cond_resched+0x29/0x47 [ 0.111129] [<ffffffff812dae24>] _cond_resched+0x29/0x34 [ 0.111134] [<ffffffff811da90a>] acpi_ps_complete_op+0x246/0x25c [ 0.111139] [<ffffffff811db024>] acpi_ps_parse_loop+0x704/0x860 [ 0.111144] [<ffffffff811da200>] acpi_ps_parse_aml+0x9f/0x2de [ 0.111149] [<ffffffff811d9181>] acpi_ns_one_complete_parse+0x101/0x11c [ 0.111154] [<ffffffff811d91d1>] acpi_ns_parse_table+0x35/0x3c [ 0.111158] [<ffffffff811d67e3>] acpi_ns_load_table+0x4f/0x94 [ 0.111163] [<ffffffff811dd14a>] acpi_load_tables+0x72/0x133 [ 0.111167] [<ffffffff8153c592>] acpi_early_init+0x60/0xf5 [ 0.111172] [<ffffffff81519cb0>] start_kernel+0x38b/0x3a0 [ 0.111176] [<ffffffff81519140>] ? early_idt_handler+0x0/0x71 [ 0.111181] [<ffffffff815192a3>] x86_64_start_reservations+0xaa/0xae [ 0.111185] [<ffffffff8151939e>] x86_64_start_kernel+0xf7/0x106 [ 0.120108] BUG: scheduling while atomic: swapper/0/0x10000002 [ 0.120186] no locks held by swapper/0. [ 0.120189] Modules linked in: [ 0.120193] Pid: 0, comm: swapper Not tainted 2.6.31-rc7 #141 [ 0.120195] Call Trace: [ 0.120200] [<ffffffff81070bb7>] ? __debug_show_held_locks+0x22/0x24 [ 0.120205] [<ffffffff8103d4ba>] __schedule_bug+0x77/0x7c [ 0.120209] [<ffffffff812da32f>] schedule+0xd6/0xa3f [ 0.120213] [<ffffffff810717e2>] ? trace_hardirqs_on_caller+0x12d/0x158 [ 0.120219] [<ffffffff811bbd5f>] ? acpi_os_release_object+0xe/0x12 [ 0.120223] [<ffffffff811db561>] ? acpi_ps_free_op+0x27/0x29 [ 0.120228] [<ffffffff811db28b>] ? 
acpi_ps_delete_parse_tree+0x43/0x60 [ 0.120232] [<ffffffff8103e3f0>] __cond_resched+0x29/0x47 [ 0.120237] [<ffffffff812dae24>] _cond_resched+0x29/0x34 [ 0.120241] [<ffffffff811da90a>] acpi_ps_complete_op+0x246/0x25c [ 0.120246] [<ffffffff811db024>] acpi_ps_parse_loop+0x704/0x860 [ 0.120251] [<ffffffff811da200>] acpi_ps_parse_aml+0x9f/0x2de [ 0.120256] [<ffffffff811d9181>] acpi_ns_one_complete_parse+0x101/0x11c [ 0.120261] [<ffffffff811d91d1>] acpi_ns_parse_table+0x35/0x3c [ 0.120266] [<ffffffff811d67e3>] acpi_ns_load_table+0x4f/0x94 [ 0.120270] [<ffffffff811dd14a>] acpi_load_tables+0x72/0x133 [ 0.120275] [<ffffffff8153c592>] acpi_early_init+0x60/0xf5 [ 0.120279] [<ffffffff81519cb0>] start_kernel+0x38b/0x3a0 [ 0.120283] [<ffffffff81519140>] ? early_idt_handler+0x0/0x71 [ 0.120288] [<ffffffff815192a3>] x86_64_start_reservations+0xaa/0xae [ 0.120293] [<ffffffff8151939e>] x86_64_start_kernel+0xf7/0x106 [ 0.129041] BUG: scheduling while atomic: swapper/0/0x10000002 [ 0.129120] no locks held by swapper/0. [ 0.129122] Modules linked in: [ 0.129126] Pid: 0, comm: swapper Not tainted 2.6.31-rc7 #141 [ 0.129129] Call Trace: [ 0.129133] [<ffffffff81070bb7>] ? __debug_show_held_locks+0x22/0x24 [ 0.129138] [<ffffffff8103d4ba>] __schedule_bug+0x77/0x7c [ 0.129143] [<ffffffff812da32f>] schedule+0xd6/0xa3f [ 0.129147] [<ffffffff810e4b9b>] ? kmem_cache_free+0xe2/0x158 [ 0.129152] [<ffffffff810717e2>] ? trace_hardirqs_on_caller+0x12d/0x158 [ 0.129156] [<ffffffff8107181a>] ? 
trace_hardirqs_on+0xd/0xf [ 0.129161] [<ffffffff8103e3f0>] __cond_resched+0x29/0x47 [ 0.129165] [<ffffffff812dae24>] _cond_resched+0x29/0x34 [ 0.129170] [<ffffffff811da90a>] acpi_ps_complete_op+0x246/0x25c [ 0.129175] [<ffffffff811db024>] acpi_ps_parse_loop+0x704/0x860 [ 0.129180] [<ffffffff811da200>] acpi_ps_parse_aml+0x9f/0x2de [ 0.129185] [<ffffffff811d9181>] acpi_ns_one_complete_parse+0x101/0x11c [ 0.129190] [<ffffffff811d91d1>] acpi_ns_parse_table+0x35/0x3c [ 0.129195] [<ffffffff811d67e3>] acpi_ns_load_table+0x4f/0x94 [ 0.129199] [<ffffffff811dd14a>] acpi_load_tables+0x72/0x133 [ 0.129203] [<ffffffff8153c592>] acpi_early_init+0x60/0xf5 [ 0.129208] [<ffffffff81519cb0>] start_kernel+0x38b/0x3a0 [ 0.129212] [<ffffffff81519140>] ? early_idt_handler+0x0/0x71 [ 0.129217] [<ffffffff815192a3>] x86_64_start_reservations+0xaa/0xae [ 0.129221] [<ffffffff8151939e>] x86_64_start_kernel+0xf7/0x106 -- Lei Ming ^ permalink raw reply [flat|nested] 21+ messages in thread
* Re: [PATCH] kmemleak: Fix scheduling-while-atomic bug 2009-08-23 2:48 ` Ming Lei @ 2009-08-23 14:59 ` Catalin Marinas 2009-08-24 0:10 ` Ming Lei 0 siblings, 1 reply; 21+ messages in thread From: Catalin Marinas @ 2009-08-23 14:59 UTC (permalink / raw) To: Ming Lei Cc: Ingo Molnar, Linux Kernel Mailing List, Andrew Morton, Linus Torvalds, Peter Zijlstra Hi, On Sun, 2009-08-23 at 10:48 +0800, Ming Lei wrote: > 2009/7/8 Catalin Marinas <catalin.marinas@arm.com>: > > kmemleak: Add more cond_resched() calls in the scanning thread > > > > From: Catalin Marinas <catalin.marinas@arm.com> > > > > Following recent fix to no longer reschedule in the scan_block() > > function, the system may become unresponsive with !PREEMPT. This patch > > re-adds the cond_resched() call to scan_block() but conditioned by the > > allow_resched parameter. > > > > Signed-off-by: Catalin Marinas <catalin.marinas@arm.com> > > Cc: Ingo Molnar <mingo@elte.hu> [...] > This still exists in 2.6.31-rc7 if kmemcheck is enabled, and hasn't the patch been > merged into mainline? Is this kmemcheck or kmemleak (two different things with similar names)? > [ 0.006569] ACPI: Core revision 20090521 > [ 0.011476] BUG: scheduling while atomic: swapper/0/0x10000002 > [ 0.011558] no locks held by swapper/0. > [ 0.011561] Modules linked in: > [ 0.011566] Pid: 0, comm: swapper Not tainted 2.6.31-rc7 #141 > [ 0.011569] Call Trace: > [ 0.011577] [<ffffffff81070bb7>] ? __debug_show_held_locks+0x22/0x24 > [ 0.011584] [<ffffffff8103d4ba>] __schedule_bug+0x77/0x7c > [ 0.011590] [<ffffffff812da32f>] schedule+0xd6/0xa3f > [ 0.011596] [<ffffffff810e4b9b>] ? kmem_cache_free+0xe2/0x158 > [ 0.011600] [<ffffffff810717e2>] ? trace_hardirqs_on_caller+0x12d/0x158 > [ 0.011604] [<ffffffff8107181a>] ? 
trace_hardirqs_on+0xd/0xf > [ 0.011609] [<ffffffff8103e3f0>] __cond_resched+0x29/0x47 > [ 0.011614] [<ffffffff812dae24>] _cond_resched+0x29/0x34 > [ 0.011620] [<ffffffff811da90a>] acpi_ps_complete_op+0x246/0x25c > [ 0.011625] [<ffffffff811db024>] acpi_ps_parse_loop+0x704/0x860 > [ 0.011630] [<ffffffff811da200>] acpi_ps_parse_aml+0x9f/0x2de > [ 0.011635] [<ffffffff811d9181>] acpi_ns_one_complete_parse+0x101/0x11c > [ 0.011640] [<ffffffff811d91bd>] acpi_ns_parse_table+0x21/0x3c > [ 0.011645] [<ffffffff811d67e3>] acpi_ns_load_table+0x4f/0x94 > [ 0.011650] [<ffffffff811dd14a>] acpi_load_tables+0x72/0x133 > [ 0.011656] [<ffffffff8153c592>] acpi_early_init+0x60/0xf5 > [ 0.011661] [<ffffffff81519cb0>] start_kernel+0x38b/0x3a0 > [ 0.011666] [<ffffffff81519140>] ? early_idt_handler+0x0/0x71 > [ 0.011670] [<ffffffff815192a3>] x86_64_start_reservations+0xaa/0xae > [ 0.011675] [<ffffffff8151939e>] x86_64_start_kernel+0xf7/0x106 I looked at the traces and there are no kmemleak calls. It's either that kmemleak is disabled or the error is not on a kmemleak path. It looks more like an ACPI bug to me. Can you try only with slab debugging enabled (without kmemleak or kmemcheck)? IIRC someone else reported a similar issue a few weeks ago. -- Catalin ^ permalink raw reply [flat|nested] 21+ messages in thread
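The fix quoted above — re-adding cond_resched() to scan_block(), but gated by an allow_resched parameter so that callers holding a spinlock such as tasklist_lock can forbid rescheduling — can be sketched as a small userspace model. This is a hypothetical illustration, not the actual mm/kmemleak.c code; model_cond_resched() and the byte-wise loop are stand-ins:

```c
#include <assert.h>
#include <stdbool.h>
#include <stddef.h>

/* Hypothetical userspace model of the described fix: scan_block() takes
 * an allow_resched flag, so callers that hold a spinlock can forbid
 * rescheduling while other callers still yield the CPU periodically on
 * !PREEMPT kernels. model_cond_resched() stands in for cond_resched(). */

static int resched_count;               /* counts simulated reschedules */

static void model_cond_resched(void)
{
    resched_count++;                    /* the kernel may schedule() here */
}

/* Scan a memory block for pointer values (the scanning itself elided). */
static void scan_block(const void *start, const void *end, bool allow_resched)
{
    const unsigned char *p;

    for (p = start; p < (const unsigned char *)end; p++) {
        if (allow_resched)
            model_cond_resched();       /* only when the caller says it is safe */
        /* ... check whether *p begins a pointer to a tracked object ... */
    }
}
```

A caller scanning the task stacks under tasklist_lock would pass false; the plain memory-block scan would pass true.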
* Re: [PATCH] kmemleak: Fix scheduling-while-atomic bug 2009-08-23 14:59 ` Catalin Marinas @ 2009-08-24 0:10 ` Ming Lei 2009-08-24 10:02 ` ACPI scheduling while atomic (was: Re: [PATCH] kmemleak: Fix scheduling-while-atomic bug) Catalin Marinas 0 siblings, 1 reply; 21+ messages in thread From: Ming Lei @ 2009-08-24 0:10 UTC (permalink / raw) To: Catalin Marinas Cc: Ingo Molnar, Linux Kernel Mailing List, Andrew Morton, Linus Torvalds, Peter Zijlstra [-- Attachment #1: Type: text/plain, Size: 3164 bytes --] 2009/8/23 Catalin Marinas <catalin.marinas@arm.com>: > Hi, > > On Sun, 2009-08-23 at 10:48 +0800, Ming Lei wrote: >> 2009/7/8 Catalin Marinas <catalin.marinas@arm.com>: >> > kmemleak: Add more cond_resched() calls in the scanning thread >> > >> > From: Catalin Marinas <catalin.marinas@arm.com> >> > >> > Following recent fix to no longer reschedule in the scan_block() >> > function, the system may become unresponsive with !PREEMPT. This patch >> > re-adds the cond_resched() call to scan_block() but conditioned by the >> > allow_resched parameter. >> > >> > Signed-off-by: Catalin Marinas <catalin.marinas@arm.com> >> > Cc: Ingo Molnar <mingo@elte.hu> > [...] >> This still exists in 2.6.31-rc7 if kmemcheck is enabled, and hasn't the patch been >> merged into mainline? > > Is this kmemcheck or kmemleak (two different things with similar names)? Sorry, this is kmemleak. > >> [ 0.006569] ACPI: Core revision 20090521 >> [ 0.011476] BUG: scheduling while atomic: swapper/0/0x10000002 >> [ 0.011558] no locks held by swapper/0. >> [ 0.011561] Modules linked in: >> [ 0.011566] Pid: 0, comm: swapper Not tainted 2.6.31-rc7 #141 >> [ 0.011569] Call Trace: >> [ 0.011577] [<ffffffff81070bb7>] ? __debug_show_held_locks+0x22/0x24 >> [ 0.011584] [<ffffffff8103d4ba>] __schedule_bug+0x77/0x7c >> [ 0.011590] [<ffffffff812da32f>] schedule+0xd6/0xa3f >> [ 0.011596] [<ffffffff810e4b9b>] ? kmem_cache_free+0xe2/0x158 >> [ 0.011600] [<ffffffff810717e2>] ? 
trace_hardirqs_on_caller+0x12d/0x158 >> [ 0.011604] [<ffffffff8107181a>] ? trace_hardirqs_on+0xd/0xf >> [ 0.011609] [<ffffffff8103e3f0>] __cond_resched+0x29/0x47 >> [ 0.011614] [<ffffffff812dae24>] _cond_resched+0x29/0x34 >> [ 0.011620] [<ffffffff811da90a>] acpi_ps_complete_op+0x246/0x25c >> [ 0.011625] [<ffffffff811db024>] acpi_ps_parse_loop+0x704/0x860 >> [ 0.011630] [<ffffffff811da200>] acpi_ps_parse_aml+0x9f/0x2de >> [ 0.011635] [<ffffffff811d9181>] acpi_ns_one_complete_parse+0x101/0x11c >> [ 0.011640] [<ffffffff811d91bd>] acpi_ns_parse_table+0x21/0x3c >> [ 0.011645] [<ffffffff811d67e3>] acpi_ns_load_table+0x4f/0x94 >> [ 0.011650] [<ffffffff811dd14a>] acpi_load_tables+0x72/0x133 >> [ 0.011656] [<ffffffff8153c592>] acpi_early_init+0x60/0xf5 >> [ 0.011661] [<ffffffff81519cb0>] start_kernel+0x38b/0x3a0 >> [ 0.011666] [<ffffffff81519140>] ? early_idt_handler+0x0/0x71 >> [ 0.011670] [<ffffffff815192a3>] x86_64_start_reservations+0xaa/0xae >> [ 0.011675] [<ffffffff8151939e>] x86_64_start_kernel+0xf7/0x106 > > I looked at the traces and there are no kmemleak calls. It's either that > kmemleak is disabled or the error is not on a kmemleak path. It looks > more like an ACPI bug to me. > > Can you try only with slab debugging enabled (without kmemleak or > kmemcheck)? IIRC someone else reported a similar issue a few weeks ago. If I disable only kmemleak, the kernel does not print the warning, so I suppose it is related to kmemleak. The .config is attached; I hope it is useful for fixing the warning. Thanks. -- Lei Ming [-- Attachment #2: config.tar.gz --] [-- Type: application/x-gzip, Size: 15594 bytes --] ^ permalink raw reply [flat|nested] 21+ messages in thread
* ACPI scheduling while atomic (was: Re: [PATCH] kmemleak: Fix scheduling-while-atomic bug) 2009-08-24 0:10 ` Ming Lei @ 2009-08-24 10:02 ` Catalin Marinas 0 siblings, 0 replies; 21+ messages in thread From: Catalin Marinas @ 2009-08-24 10:02 UTC (permalink / raw) To: Ming Lei Cc: Ingo Molnar, Linux Kernel Mailing List, Andrew Morton, Linus Torvalds, Peter Zijlstra, linux-acpi On Mon, 2009-08-24 at 08:10 +0800, Ming Lei wrote: > 2009/8/23 Catalin Marinas <catalin.marinas@arm.com>: > > On Sun, 2009-08-23 at 10:48 +0800, Ming Lei wrote: > >> [ 0.006569] ACPI: Core revision 20090521 > >> [ 0.011476] BUG: scheduling while atomic: swapper/0/0x10000002 > >> [ 0.011558] no locks held by swapper/0. > >> [ 0.011561] Modules linked in: > >> [ 0.011566] Pid: 0, comm: swapper Not tainted 2.6.31-rc7 #141 > >> [ 0.011569] Call Trace: > >> [ 0.011577] [<ffffffff81070bb7>] ? __debug_show_held_locks+0x22/0x24 > >> [ 0.011584] [<ffffffff8103d4ba>] __schedule_bug+0x77/0x7c > >> [ 0.011590] [<ffffffff812da32f>] schedule+0xd6/0xa3f > >> [ 0.011596] [<ffffffff810e4b9b>] ? kmem_cache_free+0xe2/0x158 > >> [ 0.011600] [<ffffffff810717e2>] ? trace_hardirqs_on_caller+0x12d/0x158 > >> [ 0.011604] [<ffffffff8107181a>] ? 
trace_hardirqs_on+0xd/0xf > >> [ 0.011609] [<ffffffff8103e3f0>] __cond_resched+0x29/0x47 > >> [ 0.011614] [<ffffffff812dae24>] _cond_resched+0x29/0x34 > >> [ 0.011620] [<ffffffff811da90a>] acpi_ps_complete_op+0x246/0x25c > >> [ 0.011625] [<ffffffff811db024>] acpi_ps_parse_loop+0x704/0x860 > >> [ 0.011630] [<ffffffff811da200>] acpi_ps_parse_aml+0x9f/0x2de > >> [ 0.011635] [<ffffffff811d9181>] acpi_ns_one_complete_parse+0x101/0x11c > >> [ 0.011640] [<ffffffff811d91bd>] acpi_ns_parse_table+0x21/0x3c > >> [ 0.011645] [<ffffffff811d67e3>] acpi_ns_load_table+0x4f/0x94 > >> [ 0.011650] [<ffffffff811dd14a>] acpi_load_tables+0x72/0x133 > >> [ 0.011656] [<ffffffff8153c592>] acpi_early_init+0x60/0xf5 > >> [ 0.011661] [<ffffffff81519cb0>] start_kernel+0x38b/0x3a0 > >> [ 0.011666] [<ffffffff81519140>] ? early_idt_handler+0x0/0x71 > >> [ 0.011670] [<ffffffff815192a3>] x86_64_start_reservations+0xaa/0xae > >> [ 0.011675] [<ffffffff8151939e>] x86_64_start_kernel+0xf7/0x106 > > > > I looked at the traces and there are no kmemleak calls. It's either that > > kmemleak is disabled or the error is not on a kmemleak path. It looks > > more like an ACPI bug to me. > > > > Can you try only with slab debugging enabled (without kmemleak or > > kmemcheck)? IIRC someone else reported a similar issue a few weeks ago. > > If I only disabled kmemleak, the kernel does not print the warning, so I suppose > it is related with kmemleak. I can't really see how kmemleak gets involved in here. Maybe you could just enable some of the features turned on by kmemleak like STACKTRACE and see if you still get the error. For x86-64 you may need this patch - http://lkml.org/lkml/2009/8/16/269 > Attachment is the .config, hope it is usable to help to fix the warning. Thanks but I don't have such hardware to test and I don't get this error on x86-32. Looking at the code, the acpi_ps_complete_op() function calls the ACPI_PREEMPTION_POINT() macro which expands to a cond_resched() call if IRQs aren't disabled. 
IRQs are probably enabled at that point, but the context seems to be atomic. I cc'ed the ACPI people; maybe they can help with some suggestions. Thanks. -- Catalin ^ permalink raw reply [flat|nested] 21+ messages in thread
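The analysis above can be made concrete with a small model. Per the message, ACPI_PREEMPTION_POINT() expands to a cond_resched() call guarded only by an IRQs-disabled check, with no check of the preemption count. The sketch below is a hypothetical userspace model of why that guard is insufficient in an atomic context; every name other than the documented guard-then-resched behaviour is a stand-in, and unlike the real cond_resched() the model "schedules" on every call:

```c
#include <assert.h>
#include <stdbool.h>

/* Simplified userspace model: the preemption point checks only whether
 * IRQs are disabled before rescheduling. An atomic (preemption-disabled)
 * context with IRQs enabled therefore still reaches schedule() -- the
 * "scheduling while atomic" BUG in the traces above. All kernel
 * primitives (irqs_disabled, preempt_count, cond_resched) are stubbed. */

static bool irqs_off;        /* stub for irqs_disabled() */
static int preempt_depth;    /* stub for preempt_count() */
static int sched_calls;      /* times our fake schedule() ran */
static int atomic_bugs;      /* times it ran while atomic */

static void model_cond_resched(void)
{
    if (preempt_depth > 0)
        atomic_bugs++;       /* the kernel would print __schedule_bug() here */
    sched_calls++;
}

static void acpi_preemption_point(void)
{
    if (!irqs_off)           /* the macro's only guard */
        model_cond_resched();
}
```

A guard that also tested the preemption count (or used a preempt-aware primitive) would skip the reschedule in the atomic case; the model shows why the IRQ check alone does not.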
Thread overview: 21+ messages
[not found] <200907010300.n6130rRf026194@hera.kernel.org>
2009-07-01 7:53 ` [PATCH] kmemleak: Fix scheduling-while-atomic bug Ingo Molnar
2009-07-01 8:10 ` Pekka Enberg
2009-07-01 9:18 ` Catalin Marinas
2009-07-01 9:30 ` Ingo Molnar
2009-07-01 9:46 ` Catalin Marinas
2009-07-01 11:04 ` Ingo Molnar
2009-07-02 12:48 ` Exiting with locks still held (was Re: [PATCH] kmemleak: Fix scheduling-while-atomic bug) Catalin Marinas
2009-07-02 12:54 ` Pekka Enberg
2009-07-02 13:06 ` Catalin Marinas
2009-07-02 14:13 ` Catalin Marinas
2009-07-02 17:39 ` Linus Torvalds
2009-07-03 10:18 ` Catalin Marinas
2009-07-03 7:04 ` Ingo Molnar
2009-07-02 9:48 ` [PATCH] kmemleak: Fix scheduling-while-atomic bug Catalin Marinas
2009-07-03 7:00 ` [PATCH] kmemleak: Mark nice +10 Ingo Molnar
2009-07-03 8:09 ` Catalin Marinas
2009-07-08 13:33 ` [PATCH] kmemleak: Fix scheduling-while-atomic bug Catalin Marinas
2009-08-23 2:48 ` Ming Lei
2009-08-23 14:59 ` Catalin Marinas
2009-08-24 0:10 ` Ming Lei
2009-08-24 10:02 ` ACPI scheduling while atomic (was: Re: [PATCH] kmemleak: Fix scheduling-while-atomic bug) Catalin Marinas