From: Suresh Siddha <suresh.b.siddha@intel.com>
To: Andrew Morton <akpm@linux-foundation.org>
Cc: mingo@elte.hu, g.liakhovetski@gmx.de, rjw@sisk.pl,
	linux-kernel@vger.kernel.org, torvalds@linux-foundation.org,
	bunk@kernel.org, gregkh@suse.de
Subject: Re: 2.6.25-rc3-git3: Reported regressions from 2.6.24
Date: Thu, 6 Mar 2008 15:13:31 -0800
Message-ID: <20080306231331.GJ28006@linux-os.sc.intel.com>
In-Reply-To: <20080306145739.817dea8e.akpm@linux-foundation.org>

On Thu, Mar 06, 2008 at 02:57:39PM -0800, Andrew Morton wrote:
> On Thu, 6 Mar 2008 13:36:32 -0800
> Andrew Morton <akpm@linux-foundation.org> wrote:
> 
> > On Thu, 6 Mar 2008 21:59:51 +0100
> > Ingo Molnar <mingo@elte.hu> wrote:
> > 
> > > 
> > > * Andrew Morton <akpm@linux-foundation.org> wrote:
> > > 
> > > > I'd love to poke around in kgdb (what does kthread_stop_info.k point 
> > > > at?) but it seems that -mm's copy of kgdb got taken away when I wasn't 
> > > > looking. Can I have it back please?
> > > 
> > > it's in the full x86.git or you can pick up the kgdb-light tree:
> > > 
> > >   http://people.redhat.com/mingo/kgdb-light.git/README
> > > 
> > 
> > We'll see.
> > 
> > Meanwhile, further investigation shows that cpu_callback() (the one in
> > kernel/softlockup.c) is waiting on this thread:
> > 
> > watchdog/1    R  running task        0     8      2 task_struct:ffff81025f1089e0
> 
> Note the "/1".
> 
> >  ffff81025f10deb0 0000000000000046 0000000000000000 0000000000000246
> >  ffff81025f10de20 ffff81025f1089e0 ffff81025f1080c0 ffff81025f108d30
> >  000000015f10de50 00000000ffff2adf ffffffffffffffff ffffffffffffffff
> > Call Trace:
> >  [<ffffffff80263290>] ? watchdog+0x0/0x1dc
> >  [<ffffffff802632d6>] watchdog+0x46/0x1dc
> >  [<ffffffff80263290>] ? watchdog+0x0/0x1dc
> >  [<ffffffff8024704d>] kthread+0x44/0x6b
> >  [<ffffffff8020cd88>] child_rip+0xa/0x12
> >  [<ffffffff80247009>] ? kthread+0x0/0x6b
> >  [<ffffffff8020cd7e>] ? child_rip+0x0/0x12
> > 
> > kthread_stop_info.k=ffff81025f1089e0
> > 
> > (gdb) l *0xffffffff802632d6
> > 0xffffffff802632d6 is in watchdog (kernel/softlockup.c:229).
> > 224              */
> > 225             while (!kthread_should_stop()) {
> > 226                     touch_softlockup_watchdog();
> > 227                     schedule();
> > 228     
> > 229                     if (kthread_should_stop())
> > 230                             break;
> > 231     
> > 232                     if (this_cpu == check_cpu) {
> > 233                             if (sysctl_hung_task_timeout_secs)
> > 
> > so this watchdog thread seems to be runnable, but not running.  What would
> > cause this?  
> 
> At the start of the sysrq-T trace we have:
> 
> sd 1:0:0:0: [sdb] Stopping disk
> sd 0:0:0:0: [sda] Synchronizing SCSI cache
> sd 0:0:0:0: [sda] Stopping disk
> ACPI: PCI interrupt for device 0000:05:00.1 disabled
> ACPI: PCI interrupt for device 0000:05:00.0 disabled
> ACPI: Preparing to enter system sleep state S5
> Disabling non-boot CPUs ...
> CPU 1 is now offline
> SysRq : Show State
>   task                        PC stack   pid father

I have been looking into a similar issue, which prevents my system from going
into standby.

> 
> So CPU 1 is offline.  But the comatose watchdog thread is pinned to CPU 1. 
> Could this be related to the problem?  By what means is a task which is
> pinned to a going-away CPU handled?  How is this guy supposed to ever run
> again?

move_task_off_dead_cpu() should move that thread to another online cpu. But
for some reason it isn't running.

thanks,
suresh
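
Editorial sketch: the behaviour Suresh expects from move_task_off_dead_cpu()
can be pictured with a user-space toy model. This is not kernel code. The
struct toy_task type, the TOY_NR_CPUS constant and both helper functions are
hypothetical names invented for illustration, and the real scheduler also has
to deal with affinity masks, locking and per-CPU kthreads, none of which is
modelled here. The only idea shown is that a task queued on a CPU that has
gone offline is re-queued on some CPU that is still online, so that it can be
scheduled again.

```c
#define TOY_NR_CPUS 2   /* assumption: a two-CPU box, as in the report */

/* Hypothetical stand-in for a task; not the kernel's task_struct. */
struct toy_task {
	const char *name;
	int cpu;            /* CPU whose run queue currently holds the task */
};

/* Return the first online CPU other than dead_cpu, or -1 if there is none. */
int toy_pick_online_cpu(const int online[TOY_NR_CPUS], int dead_cpu)
{
	for (int cpu = 0; cpu < TOY_NR_CPUS; cpu++)
		if (cpu != dead_cpu && online[cpu])
			return cpu;
	return -1;
}

/*
 * Re-queue every task found on dead_cpu onto an online CPU.
 * Returns the number of tasks moved; 0 if no destination CPU exists,
 * in which case the tasks are left where they were.
 */
int migrate_tasks_off(struct toy_task *tasks, int ntasks,
		      const int online[TOY_NR_CPUS], int dead_cpu)
{
	int dest = toy_pick_online_cpu(online, dead_cpu);
	int moved = 0;

	if (dest < 0)
		return 0;

	for (int i = 0; i < ntasks; i++) {
		if (tasks[i].cpu == dead_cpu) {
			tasks[i].cpu = dest;
			moved++;
		}
	}
	return moved;
}
```

In this model, a "watchdog/1" entry sitting on CPU 1 would end up on CPU 0
once migrate_tasks_off() is called with CPU 1 marked offline. A task that
stays runnable but never runs, as in the trace quoted above, corresponds to
that call never happening or never taking effect for the task.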
