All of lore.kernel.org
 help / color / mirror / Atom feed
From: Ingo Molnar <mingo@elte.hu>
To: "Rafael J. Wysocki" <rjw@sisk.pl>
Cc: Linus Torvalds <torvalds@linux-foundation.org>,
	Maciej Rutecki <maciej.rutecki@gmail.com>,
	Linux Kernel Mailing List <linux-kernel@vger.kernel.org>,
	Andrew Morton <akpm@linux-foundation.org>,
	Thomas Gleixner <tglx@linutronix.de>
Subject: Re: [Linux 2.6.29-rc2] BUG: using smp_processor_id() in preemptible
Date: Mon, 26 Jan 2009 18:41:13 +0100	[thread overview]
Message-ID: <20090126174113.GF6238@elte.hu> (raw)
In-Reply-To: <200901261753.35416.rjw@sisk.pl>


* Rafael J. Wysocki <rjw@sisk.pl> wrote:

> On Monday 26 January 2009, Linus Torvalds wrote:
> > 
> > On Sat, 17 Jan 2009, Maciej Rutecki wrote:
> > >
> > > During suspend to ram:
> > > [  131.287012] BUG: using smp_processor_id() in preemptible [00000000]
> > > code: suspend_to_ram./2958
> > > [  131.287012] caller is retrigger_next_event+0x13/0xb0
> > > [  131.287012] Pid: 2958, comm: suspend_to_ram. Not tainted 2.6.29-rc2 #1
> > > [  131.287012] Call Trace:
> > > [  131.287012]  [<c025b41f>] debug_smp_processor_id+0xbf/0xd0
> > > [  131.287012]  [<c01473b3>] retrigger_next_event+0x13/0xb0
> > > [  131.287012]  [<c01489b7>] raw_notifier_call_chain+0x17/0x20
> > > [  131.287012]  [<c014b938>] timekeeping_resume+0xe8/0x110
> > > [  131.287012]  [<c02cc651>] __sysdev_resume+0x11/0x50
> > > [  131.287012]  [<c02cc6d7>] sysdev_resume+0x47/0x80
> > > [  131.287012]  [<c02d2478>] device_power_up+0x8/0x10
> > 
> > Very scary.
> > 
> > device_power_up() calls sysdev_resume _before_ it enables interrupts so it 
> > sounds like something else has - very incorrectly - enabled interrupts too 
> > early in your resume sequence. 
> > 
> > The patch that Andrew sent out and that apparently fixed things for you 
> > should absolutely not have made any difference. This is suspend_enter():
> > 
> > 	        arch_suspend_disable_irqs();
> > 	        BUG_ON(!irqs_disabled());
> > 
> > 	        if ((error = device_power_down(PMSG_SUSPEND))) {
> > 	                printk(KERN_ERR "PM: Some devices failed to power down\n");
> > 	                goto Done;
> > 	        }
> > 	
> > 	        if (!suspend_test(TEST_CORE))
> > 	                error = suspend_ops->enter(state);
> > 
> > 	        device_power_up(PMSG_RESUME);
> > 	 Done:
> > 	        arch_suspend_enable_irqs();
> > 
> > and notice how the whole thing is surrounded by that 
> > arch_suspend_disable/enable_irqs().
> > 
> > So it looks like some sysdev driver (device_power_up does the sysdev 
> > drivers first, so it can't be the regular low-level PCI drivers) is 
> > enabling interrupts in its resume function. Scary and very wrong.
> > 
> > It could easily be ACPI, of course. There was some other case where ACPI 
> > did that, iirc.
> 
> There is a known bug in the USB controllers' suspend that enables interrupts
> from within ->suspend_late().  It should be fixed by the next USB merge
> AFAICS.

the patch from Andrew looks wrong, as it hides the only place in the 
kernel that was able to report the resume bug. Nevertheless related to 
that bug we've got a new debug check queued up in timers/urgent:

 void hres_timers_resume(void)
 {
-       /* Retrigger the CPU local events: */
+       WARN_ONCE(!irqs_disabled(),
+                 KERN_INFO "hres_timers_resume() called with IRQs enabled!");
+
        retrigger_next_event(NULL);
 }

as the buggy 'irqs are enabled' condition was not detected reliably. (it 
was only detected with certain lockdep options turned on - and even then 
it did not seem to be 100% triggerable)

i sent it to Linus earlier today.

	Ingo

  reply	other threads:[~2009-01-26 17:42 UTC|newest]

Thread overview: 19+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2009-01-17  8:58 [Linux 2.6.29-rc2] BUG: using smp_processor_id() in preemptible Maciej Rutecki
2009-01-25  2:37 ` Andrew Morton
2009-01-25 11:57   ` Thomas Gleixner
2009-01-25 14:19   ` Maciej Rutecki
2009-01-25 16:30   ` Maciej Rutecki
2009-01-26 16:07 ` Linus Torvalds
2009-01-26 16:53   ` Rafael J. Wysocki
2009-01-26 17:41     ` Ingo Molnar [this message]
2009-01-26 19:21       ` Rafael J. Wysocki
2009-01-26 20:35         ` Ingo Molnar
2009-01-26 20:48           ` Rafael J. Wysocki
2009-01-26 21:35             ` Ingo Molnar
2009-01-27 15:28               ` Rafael J. Wysocki
2009-01-27 15:49                 ` Ingo Molnar
2009-01-27 21:18                   ` Rafael J. Wysocki
2009-01-29 15:07                     ` Ingo Molnar
2009-01-29 22:29                       ` Rafael J. Wysocki
2009-01-30 14:06                         ` Ingo Molnar
2009-01-30 21:30                           ` Rafael J. Wysocki

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=20090126174113.GF6238@elte.hu \
    --to=mingo@elte.hu \
    --cc=akpm@linux-foundation.org \
    --cc=linux-kernel@vger.kernel.org \
    --cc=maciej.rutecki@gmail.com \
    --cc=rjw@sisk.pl \
    --cc=tglx@linutronix.de \
    --cc=torvalds@linux-foundation.org \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.