public inbox for linux-kernel@vger.kernel.org
 help / color / mirror / Atom feed
From: Thomas Gleixner <tglx@linutronix.de>
To: "Rafael J. Wysocki" <rjw@sisk.pl>
Cc: Linus Torvalds <torvalds@linux-foundation.org>,
	Andi Kleen <ak@suse.de>,
	Andrew Morton <akpm@linux-foundation.org>,
	LKML <linux-kernel@vger.kernel.org>, Ingo Molnar <mingo@elte.hu>
Subject: Re: [REGRESSION from 2.6.23-rc8] (was: Re: 2.6.23-rc4-mm1 and -rc6-mm1: boot failure on HP nx6325, related to clockevents)
Date: Wed, 26 Sep 2007 23:34:36 +0200	[thread overview]
Message-ID: <1190842476.23376.57.camel@chaos> (raw)
In-Reply-To: <200709262300.28336.rjw@sisk.pl>

Rafael,

On Wed, 2007-09-26 at 23:00 +0200, Rafael J. Wysocki wrote:
> > > > First, with the "x86-64: Disable local APIC timer use on AMD systems with C1E"
> > > > patch and my collection of suspend patches applied, the box doesn't boot
> > > > (the suspend patches don't even thouch the boot code, so they should be
> > > > irrelevant here).  However, it boots if patch-2.6.23-rc7-hrt1.patch (adjusted
> > > > for 2.6.23-rc8) is applied in addition.  Is this expected?
> > > 
> > > No. That's odd. It is nothing else than adding "noapictimer" to the
> > > kernel command line.
> > 
> > Seems to be reproducible, though.  I'll investigate further.
> 
> So far, the results are the following:
> 
> 1) current Linus' tree doesn't boot with any command line (regression)
> 
> [  Linus, please revert commit e66485d747505e9d960b864fc6c37f8b2afafaf0
> 
>    x86-64: Disable local APIC timer use on AMD systems with C1E
> 
>    It's not necessary for 2.6.23 and actually kills the box that it's supposed to fix. ]
> 
> 2) 2.6.23-rc8 w/ the "x86-64: Disable local APIC timer use on AMD systems with C1E"
>    patch applied behaves like the current -git
> 
> 3) 2.6.23-rc8 w/o this patch doesn't boot with either "noapictimer" _or_

OK, this explains 2) and 3). I just looked into the code and the logic
vs. noapictimer on SMP is completely broken.

On i386 the noapictimer option not only disables the local APIC timer,
it also registers the CPUs for broadcasting via IPI on SMP systems. 

The x8664 code uses the broadcast only when the local apic timer is
active, i.e. "noapictimer" is not on the command line. This defeats the
whole purpose of "noapictimer". It should be there to make boxen work,
where the local APIC timer actually has a hardware problem, e.g. the
nx6325.

The current implementation of x86_64 only fixes the ACPI c-states
related problem where the APIC timer stops in C3(2), nothing else.

On nx6325 and other AMD X2 equipped systems which have the C1E enabled
we run into the following:

PIT keeps jiffies (and the system) running, but the local APIC timer
interrupts can get out of sync due to this C1E effect. 

I don't think this is a critical problem, but it is wrong nevertheless.

I think it's safe to revert the C1E patch and postpone the fix to the
clock events conversion.

>   "apicmaintimer"

on your box is not going to work. See the C1E patch. "apicmaintimer"
switches off PIT and then waits for ever for the local APIC timer
interrupts.

> 4) 2.6.22 behaves like 2.6.23-rc8

No surprise

> 5) 2.6.23-rc8 with (adjusted) patch-2.6.23-rc7-hrt1.patch boots only with
>    "noapictimer"
> 
> 6) 2.6.23-rc8 with (adjusted) patch-2.6.23-rc7-hrt1.patch and with the
>    "x86-64: Disable local APIC timer use on AMD systems with C1E" patch boots
>    without any extra command line options

That's consistent behaviour.

> Tested for a couple of times with each kernel, the results seem to be
> reproducible 100% of the time.

Thanks for going through this debug marathon.

	tglx



  reply	other threads:[~2007-09-26 21:34 UTC|newest]

Thread overview: 37+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2007-09-23 10:57 2.6.23-rc4-mm1 and -rc6-mm1: boot failure on HP nx6325, related to clockevents Rafael J. Wysocki
2007-09-23 19:10 ` Thomas Gleixner
2007-09-23 20:08   ` Rafael J. Wysocki
2007-09-23 19:59     ` Thomas Gleixner
2007-09-23 20:52       ` Rafael J. Wysocki
2007-09-24  8:07         ` Thomas Gleixner
2007-09-24 12:57           ` Rafael J. Wysocki
2007-09-24 13:05             ` Thomas Gleixner
2007-09-24 13:52               ` Rafael J. Wysocki
2007-09-24 14:23                 ` Thomas Gleixner
2007-09-24 15:18                   ` Rafael J. Wysocki
2007-09-24 16:46                     ` Thomas Gleixner
2007-09-24 19:11                       ` Rafael J. Wysocki
2007-09-24 19:13                         ` Thomas Gleixner
2007-09-25 13:16                           ` Rafael J. Wysocki
2007-09-25 13:17                             ` Thomas Gleixner
2007-09-25 20:07                               ` Rafael J. Wysocki
2007-09-25 20:46                                 ` Thomas Gleixner
2007-09-25 21:28                                   ` Rafael J. Wysocki
2007-09-25 21:24                                     ` Thomas Gleixner
2007-09-26 15:25                                       ` Rafael J. Wysocki
2007-09-26 18:51                                         ` Thomas Gleixner
2007-09-26 19:49                                           ` Rafael J. Wysocki
2007-09-26 21:00                                             ` [REGRESSION from 2.6.23-rc8] (was: Re: 2.6.23-rc4-mm1 and -rc6-mm1: boot failure on HP nx6325, related to clockevents) Rafael J. Wysocki
2007-09-26 21:34                                               ` Thomas Gleixner [this message]
2007-09-26 22:22                                                 ` Linus Torvalds
2007-09-26 22:35                                                   ` Thomas Gleixner
2007-09-26 23:30                                                     ` [REGRESSION from 2.6.23-rc8] Chuck Ebbert
2007-09-28 15:07                                                     ` Chuck Ebbert
2007-09-28 15:10                                                       ` Thomas Gleixner
2007-09-30  9:58                                                   ` [REGRESSION from 2.6.23-rc8] (was: Re: 2.6.23-rc4-mm1 and -rc6-mm1: boot failure on HP nx6325, related to clockevents) Andi Kleen
2007-09-30 14:06                                                     ` Thomas Gleixner
2007-09-30 15:03                                                       ` Andi Kleen
2007-09-26 23:30                                                 ` Rafael J. Wysocki
2007-09-26 23:21                                                   ` Thomas Gleixner
2007-09-27 15:58                                                     ` Rafael J. Wysocki
2007-09-30  9:52                                                 ` Andi Kleen

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=1190842476.23376.57.camel@chaos \
    --to=tglx@linutronix.de \
    --cc=ak@suse.de \
    --cc=akpm@linux-foundation.org \
    --cc=linux-kernel@vger.kernel.org \
    --cc=mingo@elte.hu \
    --cc=rjw@sisk.pl \
    --cc=torvalds@linux-foundation.org \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox