All of lore.kernel.org
 help / color / mirror / Atom feed
From: Matt Mackall <mpm@selenic.com>
To: "Mariusz Kozłowski" <m.kozlowski@tuxland.pl>
Cc: Adrian Bunk <bunk@stusta.de>,
	Andrew Morton <akpm@linux-foundation.org>,
	linux-kernel@vger.kernel.org,
	Thomas Gleixner <tglx@linutronix.de>, Ingo Molnar <mingo@elte.hu>,
	john stultz <johnstul@us.ibm.com>,
	David Miller <davem@davemloft.net>
Subject: Re: 2.6.21-rc5-mm1
Date: Thu, 29 Mar 2007 14:46:04 -0500	[thread overview]
Message-ID: <20070329194604.GH4892@waste.org> (raw)
In-Reply-To: <200703292055.26117.m.kozlowski@tuxland.pl>

On Thu, Mar 29, 2007 at 08:55:25PM +0200, Mariusz Kozłowski wrote:
> > > > > > > 	I run 2.6.21-rc4-mm1 with no hangs for a week.
> > > > > > > Then when 2.6.21-rc5-mm1 showed up so I switched to it. Unfortunately
> > > > > > > today my laptop hunged twice in a similar way as described here:
> > > > > > > 
> > > > > > > http://www.ussg.iu.edu/hypermail/linux/kernel/0703.0/index.html#1165
> > > > > > 
> > > > > > It's not good that we went backwards between those two releases.
> > > > > > 
> > > > > > > The difference is that it happened when I closed the lid in my laptop.
> > > > > > > When reopend it the box was frozen (ACPI?). Again disk I/O was dead
> > > > > > > so nothing was found in syslog.
> > > > > > 
> > > > > > Adrian, does this look like any of the bugs whcih you're monitoring?
> > > > > >...
> > > > > 
> > > > > Is it also present in 2.6.21-rc5?
> > > > 
> > > > Don't know. I usualy test -mm series. Will test tommorow and let you know
> > > > after some reasonable uptime.
> > > > 
> > > > > Is it also present with CONFIG_NO_HZ=n?
> > > > 
> > > > Don't know. Did not try recently. Will let you know.
> > > > 
> > > > It takes time as these hangs are not easy to trigger. With 2.6.21-rc2-mm1
> > > > it was easy -> push the system and watch it die in minutes. With
> > > > 2.6.21-rc5-mm1 it takes hours (3 hangs in ~15 hours) and not sure how to
> > > > trigger it. It just happens from time to time.
> > > 
> > > Ok. CONGIG_NO_HZ=n and uptime ~12 hours, netconsole loaded, and no hangs
> > > ... until I moved my laptop. The same scenario happened yesterday during the
> > > last hang. I started playing and repeating some steps which I naturally do when
> > > I move my laptop.
> > > 
> > > An hour later I came to this -> steps to hang 2.6.21-rc5-mm1 on my laptop:
> > > 1. boot the system and login as root
> > > 2. load netconsole (insmod netconole.ko netconsole=blah... blah...)
> > > 3. unplug the cable (link goes down)
> > > 4. system is frozen until forced to reboot
> > > 
> > > This is verified and repeatable _every_ single time I tried. Unfortunately
> > > the last thing seen on the screen before system is frozen is 'eth0: link down'.
> > > So my guess is that when hunting for hangs I found something else that can hang
> > > my laptop (netconsole that is).
> > 
> > Which NIC do you have? Odds are that the driver is doing that printk
> > while holding a driver-internal spinlock that causes it to deadlock
> > when netpoll tries to send that message out.
> 
> 8139too was in use. This is from lspci:
> 
> 00:12.0 Ethernet controller: Realtek Semiconductor Co., Ltd. RTL-8139/8139C/8139C+ (rev 10)
>         Subsystem: Sony Corporation Unknown device 8158
>         Flags: bus master, medium devsel, latency 64, IRQ 11
>         I/O ports at 9c00 [size=256]
>         Memory at f0404c00 (32-bit, non-prefetchable) [size=512]
>         Capabilities: [50] Power Management version 2

Yep, that's a known problem with this driver:

http://lkml.org/lkml/2007/2/17/222

This was a known theoretical problem with netconsole from the start
but I'm actually not aware of any other drivers that have run into it.
My preferred fix is to create a netconsole_disable/enable API for
drivers that have this sort of reentrancy problem.

-- 
Mathematics is the supreme nostalgia of our time.

  reply	other threads:[~2007-03-29 19:59 UTC|newest]

Thread overview: 42+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2007-03-27  4:57 2.6.21-rc5-mm1 Andrew Morton
2007-03-27  6:20 ` 2.6.21-rc5-mm1 David Miller
2007-03-27 11:15 ` 2.6.21-rc5-mm1 Bartlomiej Zolnierkiewicz
2007-03-27 16:52 ` [-mm patch] fix arch/i386/kernel/marker.c compilation Adrian Bunk
2007-03-27 17:11   ` Mathieu Desnoyers
2007-03-27 17:29     ` Christoph Hellwig
2007-03-28 12:11     ` [-mm patch] no longer #include <asm/kdebug.h> Adrian Bunk
2007-03-27 17:39 ` 2.6.21-rc5-mm1 Badari Pulavarty
2007-03-27 16:48   ` 2.6.21-rc5-mm1 Andrew Morton
2007-03-27 17:57     ` 2.6.21-rc5-mm1 Badari Pulavarty
2007-03-27 20:12       ` 2.6.21-rc5-mm1 Andrew Morton
2007-03-27 20:30         ` 2.6.21-rc5-mm1 Eric Dumazet
2007-03-27 21:29         ` 2.6.21-rc5-mm1 Eric Dumazet
2007-03-27 21:40           ` 2.6.21-rc5-mm1 Andrew Morton
2007-03-27 23:29             ` 2.6.21-rc5-mm1 Badari Pulavarty
2007-03-27 22:15         ` 2.6.21-rc5-mm1 Badari Pulavarty
2007-03-27 17:54 ` 2.6.21-rc5-mm1 Badari Pulavarty
2007-03-27 20:28   ` 2.6.21-rc5-mm1 Andrew Morton
2007-03-27 21:23 ` 2.6.21-rc5-mm1 Sam Ravnborg
2007-03-27 21:26 ` 2.6.21-rc5-mm1: i386/x86_64 register_die_notifier() change Adrian Bunk
2007-03-27 21:29   ` [discuss] " Andi Kleen
2007-03-28 12:11 ` [-mm patch] drivers/mtd/ubi/: make code static Adrian Bunk
2007-03-28 12:11   ` Adrian Bunk
2007-03-28 12:58   ` Artem Bityutskiy
2007-03-28 12:58     ` Artem Bityutskiy
2007-03-28 16:44 ` 2.6.21-rc5-mm1 Mariusz Kozłowski
2007-03-28 19:59   ` 2.6.21-rc5-mm1 Mariusz Kozłowski
2007-03-28 20:02   ` 2.6.21-rc5-mm1 Andrew Morton
2007-03-28 20:25     ` 2.6.21-rc5-mm1 Adrian Bunk
2007-03-28 20:49       ` 2.6.21-rc5-mm1 Mariusz Kozłowski
2007-03-29 18:01         ` 2.6.21-rc5-mm1 Mariusz Kozłowski
2007-03-29 17:52           ` 2.6.21-rc5-mm1 Andrew Morton
2007-03-29 18:45             ` 2.6.21-rc5-mm1 Mariusz Kozłowski
2007-03-29 18:38               ` 2.6.21-rc5-mm1 Ingo Molnar
2007-03-29 19:02                 ` 2.6.21-rc5-mm1 Mariusz Kozłowski
2007-03-29 19:40                   ` 2.6.21-rc5-mm1 Matt Mackall
2007-03-29 18:21           ` 2.6.21-rc5-mm1 Matt Mackall
2007-03-29 18:55             ` 2.6.21-rc5-mm1 Mariusz Kozłowski
2007-03-29 19:46               ` Matt Mackall [this message]
2007-03-28 20:31     ` 2.6.21-rc5-mm1 john stultz
2007-03-31  6:28       ` 2.6.21-rc5-mm1 Mariusz Kozłowski
     [not found] ` <48f7fe350703291332q5f2da2dar7c52afd34f79072a@mail.gmail.com>
2007-03-29 21:33   ` 2.6.21-rc5-mm1 Andrew Morton

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=20070329194604.GH4892@waste.org \
    --to=mpm@selenic.com \
    --cc=akpm@linux-foundation.org \
    --cc=bunk@stusta.de \
    --cc=davem@davemloft.net \
    --cc=johnstul@us.ibm.com \
    --cc=linux-kernel@vger.kernel.org \
    --cc=m.kozlowski@tuxland.pl \
    --cc=mingo@elte.hu \
    --cc=tglx@linutronix.de \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.