All of lore.kernel.org
 help / color / mirror / Atom feed
From: Ingo Molnar <mingo@elte.hu>
To: Jason Wessel <jason.wessel@windriver.com>
Cc: Andrew Morton <akpm@linux-foundation.org>,
	tglx@linutronix.de, penberg@cs.helsinki.fi,
	linux-usb@vger.kernel.org, linux-kernel@vger.kernel.org,
	linux-mm@kvack.org, jmorris@namei.org, sds@tycho.nsa.gov
Subject: Re: 2.6.25-mm1: not looking good
Date: Fri, 18 Apr 2008 09:37:32 +0200	[thread overview]
Message-ID: <20080418073732.GA22724@elte.hu> (raw)
In-Reply-To: <48080FE7.1070400@windriver.com>


* Jason Wessel <jason.wessel@windriver.com> wrote:

> > [...] The final initcall is init_kgdbts() and disabling KGDB 
> > prevents the hang.

> That enables verbose logging of exactly what is going on and will show 
> where wheels fall off the cart.  If the kernel is dying silently it 
> means the early exception code has completely failed in some way on 
> the kernel architecture that was selected, and of course the .config 
> is always useful in this case.

incidentally, just today, in overnight testing i triggered a similar 
hang in the KGDB self-test:

  http://redhat.com/~mingo/misc/config-Thu_Apr_17_23_46_36_CEST_2008.bad

to get a similar tree to the one i tested, pick up sched-devel/latest 
from:

   http://people.redhat.com/mingo/sched-devel.git/README 

pick up that failing .config, do 'make oldconfig' and accept all the 
defaults to get a comparable kernel to mine. (kgdb is embedded in 
sched-devel.git.)

the hang was at:

[   12.504057] Calling initcall 0xffffffff80b800c1: init_kgdbts+0x0/0x1b()
[   12.511298] kgdb: Registered I/O driver kgdbts.
[   12.515062] kgdbts:RUN plant and detach test
[   12.520283] kgdbts:RUN sw breakpoint test
[   12.524651] kgdbts:RUN bad memory access test
[   12.529052] kgdbts:RUN singlestep breakpoint test

full log:

  http://redhat.com/~mingo/misc/log-Thu_Apr_17_23_46_36_CEST_2008.bad

note that this was a 64-bit config too - our tests do a perfect mix of 
50% 32-bit and 50% 64-bit kernels. So single-stepping of the kernel 
broke in some circumstances.

find the boot log below. (it also includes all command line parameters) 

This is the first time ever i saw the self-test in KGDB hanging, so it's 
some recent non-KGDB change that provoked it or made it more likely. The 
KGDB self-test runs very frequently in my bootup tests:

[   12.508236] kgdb: Registered I/O driver kgdbts.
[   12.511245] kgdbts:RUN plant and detach test
[   12.517418] kgdbts:RUN sw breakpoint test
[   12.521056] kgdbts:RUN bad memory access test
[   12.525515] kgdbts:RUN singlestep breakpoint test
[   12.531483] kgdbts:RUN hw breakpoint test
[   12.536142] kgdbts:RUN hw write breakpoint test
[   12.541007] kgdbts:RUN access write breakpoint test
[   12.546223] kgdbts:RUN do_fork for 100 breakpoints

so the latest kgdb-light tree literally survived thousands of such tests 
since it was changed last.

unfortunately, the condition was not reproducible - i booted it once 
more and then it came up just fine - using the same bzImage.

there's no recent change in x86.git related to the TF flag that i could 
think of to cause something like this. I checked changes to traps_64.c 
and entry_64.S, and nothing suspicious.

	Ingo

WARNING: multiple messages have this Message-ID (diff)
From: Ingo Molnar <mingo@elte.hu>
To: Jason Wessel <jason.wessel@windriver.com>
Cc: Andrew Morton <akpm@linux-foundation.org>,
	tglx@linutronix.de, penberg@cs.helsinki.fi,
	linux-usb@vger.kernel.org, linux-kernel@vger.kernel.org,
	linux-mm@kvack.org, jmorris@namei.org, sds@tycho.nsa.gov
Subject: Re: 2.6.25-mm1: not looking good
Date: Fri, 18 Apr 2008 09:37:32 +0200	[thread overview]
Message-ID: <20080418073732.GA22724@elte.hu> (raw)
In-Reply-To: <48080FE7.1070400@windriver.com>

* Jason Wessel <jason.wessel@windriver.com> wrote:

> > [...] The final initcall is init_kgdbts() and disabling KGDB 
> > prevents the hang.

> That enables verbose logging of exactly what is going on and will show 
> where wheels fall off the cart.  If the kernel is dying silently it 
> means the early exception code has completely failed in some way on 
> the kernel architecture that was selected, and of course the .config 
> is always useful in this case.

incidentally, just today, in overnight testing i triggered a similar 
hang in the KGDB self-test:

  http://redhat.com/~mingo/misc/config-Thu_Apr_17_23_46_36_CEST_2008.bad

to get a similar tree to the one i tested, pick up sched-devel/latest 
from:

   http://people.redhat.com/mingo/sched-devel.git/README 

pick up that failing .config, do 'make oldconfig' and accept all the 
defaults to get a comparable kernel to mine. (kgdb is embedded in 
sched-devel.git.)

the hang was at:

[   12.504057] Calling initcall 0xffffffff80b800c1: init_kgdbts+0x0/0x1b()
[   12.511298] kgdb: Registered I/O driver kgdbts.
[   12.515062] kgdbts:RUN plant and detach test
[   12.520283] kgdbts:RUN sw breakpoint test
[   12.524651] kgdbts:RUN bad memory access test
[   12.529052] kgdbts:RUN singlestep breakpoint test

full log:

  http://redhat.com/~mingo/misc/log-Thu_Apr_17_23_46_36_CEST_2008.bad

note that this was a 64-bit config too - our tests do a perfect mix of 
50% 32-bit and 50% 64-bit kernels. So single-stepping of the kernel 
broke in some circumstances.

find the boot log below. (it also includes all command line parameters) 

This is the first time ever i saw the self-test in KGDB hanging, so it's 
some recent non-KGDB change that provoked it or made it more likely. The 
KGDB self-test runs very frequently in my bootup tests:

[   12.508236] kgdb: Registered I/O driver kgdbts.
[   12.511245] kgdbts:RUN plant and detach test
[   12.517418] kgdbts:RUN sw breakpoint test
[   12.521056] kgdbts:RUN bad memory access test
[   12.525515] kgdbts:RUN singlestep breakpoint test
[   12.531483] kgdbts:RUN hw breakpoint test
[   12.536142] kgdbts:RUN hw write breakpoint test
[   12.541007] kgdbts:RUN access write breakpoint test
[   12.546223] kgdbts:RUN do_fork for 100 breakpoints

so the latest kgdb-light tree literally survived thousands of such tests 
since it was changed last.

unfortunately, the condition was not reproducible - i booted it once 
more and then it came up just fine - using the same bzImage.

there's no recent change in x86.git related to the TF flag that i could 
think of to cause something like this. I checked changes to traps_64.c 
and entry_64.S, and nothing suspicious.

	Ingo

--
To unsubscribe, send a message with 'unsubscribe linux-mm' in
the body to majordomo@kvack.org.  For more info on Linux MM,
see: http://www.linux-mm.org/ .
Don't email: <a href=mailto:"dont@kvack.org"> email@kvack.org </a>

  reply	other threads:[~2008-04-18  7:38 UTC|newest]

Thread overview: 112+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2008-04-17 23:03 2.6.25-mm1: not looking good Andrew Morton
2008-04-17 23:03 ` Andrew Morton
2008-04-17 23:24 ` Greg KH
2008-04-17 23:24   ` Greg KH
2008-04-18  0:48   ` Kay Sievers
2008-04-18  0:48     ` Kay Sievers
2008-04-18  1:12     ` Andrew Morton
2008-04-18  1:12       ` Andrew Morton
2008-04-18  4:07     ` Andrew Morton
2008-04-18  4:07       ` Andrew Morton
2008-04-17 23:24 ` Dan Williams
2008-04-17 23:24   ` Dan Williams
2008-04-17 23:40 ` Andrew Morton
2008-04-17 23:40   ` Andrew Morton
2008-04-18  0:14   ` Andrew Morton
2008-04-18  0:14     ` Andrew Morton
2008-04-18  3:05     ` Jason Wessel
2008-04-18  3:05       ` Jason Wessel
2008-04-18  7:37       ` Ingo Molnar [this message]
2008-04-18  7:37         ` Ingo Molnar
2008-04-18 11:46         ` Vegard Nossum
2008-04-18 11:46           ` Vegard Nossum
2008-04-18 12:34           ` Ingo Molnar
2008-04-18 12:34             ` Ingo Molnar
2008-04-18 12:41             ` Vegard Nossum
2008-04-18 12:41               ` Vegard Nossum
2008-04-18 13:02               ` Jason Wessel
2008-04-18 13:02                 ` Jason Wessel
2008-04-18 13:22                 ` Vegard Nossum
2008-04-18 13:22                   ` Vegard Nossum
2008-04-18 13:27                   ` Jason Wessel
2008-04-18 13:27                     ` Jason Wessel
2008-04-18 14:47                     ` Vegard Nossum
2008-04-18 14:47                       ` Vegard Nossum
2008-04-18 16:02                       ` Vegard Nossum
2008-04-18 16:02                         ` Vegard Nossum
2008-04-18 21:54         ` Jason Wessel
2008-04-18 21:54           ` Jason Wessel
2008-04-17 23:55 ` Paul Moore
2008-04-17 23:55   ` Paul Moore
2008-04-18  0:04   ` Andrew Morton
2008-04-18  0:04     ` Andrew Morton
2008-04-18 14:55     ` Paul Moore
2008-04-18 14:55       ` Paul Moore
2008-04-18  1:35   ` Andrew Morton
2008-04-18  1:35     ` Andrew Morton
2008-04-18 14:57     ` Paul Moore
2008-04-18 14:57       ` Paul Moore
2008-04-18  5:49 ` Arjan van de Ven
2008-04-18  5:49   ` Arjan van de Ven
2008-04-18  6:10   ` Andrew Morton
2008-04-18  6:10     ` Andrew Morton
2008-04-18  7:19     ` Ingo Molnar
2008-04-18  7:19       ` Ingo Molnar
2008-04-18  7:28       ` Andrew Morton
2008-04-18  7:28         ` Andrew Morton
2008-04-18  9:28         ` Ingo Molnar
2008-04-18  9:28           ` Ingo Molnar
2008-04-18 13:58         ` Stack protector build failure (was Re: 2.6.25-mm1: not looking good) Arjan van de Ven
2008-04-18 16:57           ` Arjan van de Ven
2008-04-18  6:40 ` 2.6.25-mm1: not looking good Pekka Enberg
2008-04-18  6:40   ` Pekka Enberg
2008-04-18  6:56   ` Andrew Morton
2008-04-18  6:56     ` Andrew Morton
2008-04-18  7:24   ` Ingo Molnar
2008-04-18  7:24     ` Ingo Molnar
2008-04-18  7:25     ` Pekka Enberg
2008-04-18  7:25       ` Pekka Enberg
2008-04-18 10:32     ` James Morris
2008-04-18 10:32       ` James Morris
2008-04-18  7:09 ` Ingo Molnar
2008-04-18  7:09   ` Ingo Molnar
2008-04-18  7:50 ` Andrew Morton
2008-04-18  7:50   ` Andrew Morton
2008-04-18  7:53   ` Andrew Morton
2008-04-18  7:53   ` Andrew Morton
2008-04-18  7:53     ` Andrew Morton
2008-04-18  7:57     ` Andrew Morton
2008-04-18  7:57     ` Andrew Morton
2008-04-18  7:57       ` Andrew Morton
2008-04-18  9:22       ` Ingo Molnar
2008-04-18  9:22         ` Ingo Molnar
2008-04-18  9:22         ` Ingo Molnar
2008-04-18 12:18         ` Ingo Molnar
2008-04-18 12:18           ` Ingo Molnar
2008-04-18 12:18           ` Ingo Molnar
2008-04-18  9:42     ` Pavel Machek
2008-04-18  9:42       ` Pavel Machek
2008-04-18 15:22       ` Alan Stern
2008-04-18 15:22         ` Alan Stern
2008-04-18 15:22         ` Alan Stern
2008-04-18  9:42     ` Pavel Machek
2008-04-18 11:07     ` Pavel Machek
2008-04-18 11:07       ` Pavel Machek
2008-04-18 11:07       ` Pavel Machek
2008-04-28 16:42 ` 2.6.25-mm1: Failing to probe IDE interface Mel Gorman
2008-04-28 16:59   ` Andrew Morton
2008-04-28 16:59     ` Andrew Morton
2008-04-29  9:39     ` Mel Gorman
2008-04-29  9:39       ` Mel Gorman
2008-04-28 18:44   ` Bartlomiej Zolnierkiewicz
2008-04-28 18:44     ` Bartlomiej Zolnierkiewicz
2008-04-29  9:43     ` Mel Gorman
2008-04-29  9:43       ` Mel Gorman
2008-04-29 15:49       ` Mel Gorman
2008-04-29 15:49         ` Mel Gorman
2008-04-29 16:58         ` Mel Gorman
2008-04-29 16:58           ` Mel Gorman
2008-04-29 21:37           ` Bartlomiej Zolnierkiewicz
2008-04-29 21:37             ` Bartlomiej Zolnierkiewicz
2008-04-30 11:16             ` Mel Gorman
2008-04-30 11:16               ` Mel Gorman

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=20080418073732.GA22724@elte.hu \
    --to=mingo@elte.hu \
    --cc=akpm@linux-foundation.org \
    --cc=jason.wessel@windriver.com \
    --cc=jmorris@namei.org \
    --cc=linux-kernel@vger.kernel.org \
    --cc=linux-mm@kvack.org \
    --cc=linux-usb@vger.kernel.org \
    --cc=penberg@cs.helsinki.fi \
    --cc=sds@tycho.nsa.gov \
    --cc=tglx@linutronix.de \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.