From: Konstantin Khlebnikov <khlebnikov@openvz.org>
To: "linux-kernel@vger.kernel.org" <linux-kernel@vger.kernel.org>,
Thomas Gleixner <tglx@linutronix.de>, Greg KH <gregkh@suse.de>
Subject: [stable-longterm] x86: HPET: Chose a paranoid safe value for the ETIME check
Date: Thu, 28 Jul 2011 19:31:21 +0400
Message-ID: <4E3180C9.4000601@openvz.org>
In-Reply-To: <tip-f1c18071ad70e2a78ab31fc26a18fcfa954a05c6@git.kernel.org>
I think commits v2.6.37-rc5-64-gf1c1807 and v2.6.36-rc4-167-g995bd3b should be merged into the longterm/stable kernels.
We also found the same bug in the RHEL6 kernel (a nohz kernel gets stuck and waits for the HPET counter to wrap around).
This HPET bug is really annoying, while the fix is tiny.
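To illustrate the failure mode: if the comparator ends up at or behind the counter, the event only fires after the 32-bit counter wraps. The following is a minimal stand-alone sketch of the signed-difference check used in the patch quoted below. The function name hpet_etime_check() and its two parameters are hypothetical (the kernel reads the counter with hpet_readl() inside hpet_next_event()), so treat it as a model, not the kernel code.

```c
#include <errno.h>
#include <stdint.h>

#ifndef ETIME
#define ETIME 62	/* fallback for platforms whose errno.h lacks ETIME */
#endif

#define HPET_MIN_CYCLES 128

/*
 * Hypothetical model of the ETIME decision.
 * cmp: value written to the comparator.
 * now: counter value read back after the write.
 *
 * The unsigned subtraction wraps modulo 2^32, and the cast to a signed
 * 32-bit value (two's complement, as on gcc/clang) makes "counter already
 * past the event" show up as a negative difference.
 */
static int hpet_etime_check(uint32_t cmp, uint32_t now)
{
	int32_t res = (int32_t)(cmp - now);

	/* Too close to the event, or already behind it: ask for a retry. */
	return res < HPET_MIN_CYCLES ? -ETIME : 0;
}
```

With these definitions, a comparator 512 cycles ahead of the counter is accepted even when the counter is about to wrap (cmp = 0x100, now = 0xFFFFFF00), while one only 100 cycles ahead, or already behind, returns -ETIME.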
tip-bot for Thomas Gleixner wrote:
> Commit-ID: f1c18071ad70e2a78ab31fc26a18fcfa954a05c6
> Gitweb: http://git.kernel.org/tip/f1c18071ad70e2a78ab31fc26a18fcfa954a05c6
> Author: Thomas Gleixner <tglx@linutronix.de>
> AuthorDate: Mon, 13 Dec 2010 12:43:23 +0100
> Committer: Thomas Gleixner <tglx@linutronix.de>
> CommitDate: Mon, 13 Dec 2010 13:42:44 +0100
>
> x86: HPET: Chose a paranoid safe value for the ETIME check
>
> commit 995bd3bb5 (x86: Hpet: Avoid the comparator readback penalty)
> chose 8 HPET cycles as a safe value for the ETIME check, as we had the
> confirmation that the posted write to the comparator register is
> delayed by two HPET clock cycles on Intel chipsets which showed
> readback problems.
>
> After that patch hit mainline we got reports from machines with newer
> AMD chipsets which seem to have an even longer delay. See
> http://thread.gmane.org/gmane.linux.kernel/1054283 and
> http://thread.gmane.org/gmane.linux.kernel/1069458 for further
> information.
>
> Boris tried to come up with an ACPI based selection of the minimum
> HPET cycles, but this failed on a couple of test machines. And of
> course we did not get any useful information from the hardware folks.
>
> For now our only option is to chose a paranoid high and safe value for
> the minimum HPET cycles used by the ETIME check. Adjust the minimum ns
> value for the HPET clockevent accordingly.
>
> Reported-Bistected-and-Tested-by: Markus Trippelsdorf <markus@trippelsdorf.de>
> Signed-off-by: Thomas Gleixner <tglx@linutronix.de>
> LKML-Reference: <alpine.LFD.2.00.1012131222420.2653@localhost6.localdomain6>
> Cc: Simon Kirby <sim@hostway.ca>
> Cc: Borislav Petkov <bp@alien8.de>
> Cc: Andreas Herrmann <Andreas.Herrmann3@amd.com>
> Cc: John Stultz <johnstul@us.ibm.com>
> ---
> arch/x86/kernel/hpet.c | 26 ++++++++++++++++----------
> 1 files changed, 16 insertions(+), 10 deletions(-)
>
> diff --git a/arch/x86/kernel/hpet.c b/arch/x86/kernel/hpet.c
> index ae03cab..4ff5968 100644
> --- a/arch/x86/kernel/hpet.c
> +++ b/arch/x86/kernel/hpet.c
> @@ -27,6 +27,9 @@
> #define HPET_DEV_FSB_CAP 0x1000
> #define HPET_DEV_PERI_CAP 0x2000
>
> +#define HPET_MIN_CYCLES 128
> +#define HPET_MIN_PROG_DELTA (HPET_MIN_CYCLES + (HPET_MIN_CYCLES >> 1))
> +
> #define EVT_TO_HPET_DEV(evt) container_of(evt, struct hpet_dev, evt)
>
> /*
> @@ -299,8 +302,9 @@ static void hpet_legacy_clockevent_register(void)
> /* Calculate the min / max delta */
> hpet_clockevent.max_delta_ns = clockevent_delta2ns(0x7FFFFFFF,
> &hpet_clockevent);
> - /* 5 usec minimum reprogramming delta. */
> - hpet_clockevent.min_delta_ns = 5000;
> + /* Setup minimum reprogramming delta. */
> + hpet_clockevent.min_delta_ns = clockevent_delta2ns(HPET_MIN_PROG_DELTA,
> + &hpet_clockevent);
>
> /*
> * Start hpet with the boot cpu mask and make it
> @@ -393,22 +397,24 @@ static int hpet_next_event(unsigned long delta,
> * the wraparound into account) nor a simple count down event
> * mode. Further the write to the comparator register is
> * delayed internally up to two HPET clock cycles in certain
> - * chipsets (ATI, ICH9,10). We worked around that by reading
> - * back the compare register, but that required another
> - * workaround for ICH9,10 chips where the first readout after
> - * write can return the old stale value. We already have a
> - * minimum delta of 5us enforced, but a NMI or SMI hitting
> + * chipsets (ATI, ICH9,10). Some newer AMD chipsets have even
> + * longer delays. We worked around that by reading back the
> + * compare register, but that required another workaround for
> + * ICH9,10 chips where the first readout after write can
> + * return the old stale value. We already had a minimum
> + * programming delta of 5us enforced, but a NMI or SMI hitting
> * between the counter readout and the comparator write can
> * move us behind that point easily. Now instead of reading
> * the compare register back several times, we make the ETIME
> * decision based on the following: Return ETIME if the
> - * counter value after the write is less than 8 HPET cycles
> + * counter value after the write is less than HPET_MIN_CYCLES
> * away from the event or if the counter is already ahead of
> - * the event.
> + * the event. The minimum programming delta for the generic
> + * clockevents code is set to 1.5 * HPET_MIN_CYCLES.
> */
> res = (s32)(cnt - hpet_readl(HPET_COUNTER));
>
> - return res < 8 ? -ETIME : 0;
> + return res < HPET_MIN_CYCLES ? -ETIME : 0;
> }
>
> static void hpet_legacy_set_mode(enum clock_event_mode mode,
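For a rough sense of scale, the new minimum programming delta can be converted to nanoseconds. The sketch below is only an approximation: the helper hpet_cycles_to_ns() and the 14.318180 MHz rate are assumptions for illustration. The kernel uses clockevent_delta2ns() with the clockevent's mult/shift values, and real hardware reports its own HPET period.

```c
#include <stdint.h>

#define HPET_MIN_CYCLES		128
#define HPET_MIN_PROG_DELTA	(HPET_MIN_CYCLES + (HPET_MIN_CYCLES >> 1))

/* ASSUMPTION: an HPET running at 14.318180 MHz; actual rates vary by chipset. */
#define ASSUMED_HPET_HZ		14318180ULL

/*
 * Hypothetical helper: plain integer conversion from HPET cycles to
 * nanoseconds at a given counter frequency. This stands in for
 * clockevent_delta2ns() and ignores its mult/shift rounding.
 */
static uint64_t hpet_cycles_to_ns(uint64_t cycles, uint64_t hz)
{
	return cycles * 1000000000ULL / hz;
}
```

At the assumed rate, hpet_cycles_to_ns(HPET_MIN_PROG_DELTA, ASSUMED_HPET_HZ) gives roughly 13.4 usec for the 192-cycle delta, compared with the fixed 5 usec minimum that the patch removes.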
Thread overview: 6+ messages
2010-12-13 10:54 [2.6.37-rc5] Timer or ICE1724 issues, HZ=250, NO_HZ=y Simon Kirby
2010-12-13 11:19 ` Markus Trippelsdorf
2010-12-13 11:24 ` Thomas Gleixner
2010-12-13 12:46 ` [tip:x86/urgent] x86: HPET: Chose a paranoid safe value for the ETIME check tip-bot for Thomas Gleixner
2011-07-28 15:31 ` Konstantin Khlebnikov [this message]
2011-07-28 15:52 ` [stable-longterm] " Greg KH