From mboxrd@z Thu Jan 1 00:00:00 1970 Return-Path: Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1757486AbYG1Tja (ORCPT ); Mon, 28 Jul 2008 15:39:30 -0400 Received: (majordomo@vger.kernel.org) by vger.kernel.org id S1751898AbYG1TjV (ORCPT ); Mon, 28 Jul 2008 15:39:21 -0400 Received: from relay1.sgi.com ([192.48.171.29]:44155 "EHLO relay.sgi.com" rhost-flags-OK-OK-OK-FAIL) by vger.kernel.org with ESMTP id S1751529AbYG1TjV (ORCPT ); Mon, 28 Jul 2008 15:39:21 -0400 Message-ID: <488E2066.9010908@sgi.com> Date: Mon, 28 Jul 2008 12:39:18 -0700 From: Mike Travis User-Agent: Thunderbird 2.0.0.6 (X11/20070801) MIME-Version: 1.0 To: Ingo Molnar CC: Andrew Morton , "Eric W. Biederman" , Hugh Dickins , Jack Steiner , Jeremy Fitzhardinge , "H. Peter Anvin" , linux-kernel@vger.kernel.org, Thomas Gleixner Subject: Re: [crash] Re: [PATCH 0/4] x86_64: Optimize percpu accesses References: <20080725211117.586723000@polaris-admin.engr.sgi.com> <20080728155248.GA7123@elte.hu> In-Reply-To: <20080728155248.GA7123@elte.hu> Content-Type: text/plain; charset=ISO-8859-1 Content-Transfer-Encoding: 7bit Sender: linux-kernel-owner@vger.kernel.org List-ID: X-Mailing-List: linux-kernel@vger.kernel.org Ingo Molnar wrote: > ok, i have integrated tip/x86/percpu-zerobased into tip/master briefly, > but it blew up almost immediately in testing, on two boxes. > > one bad config is: > > http://redhat.com/~mingo/misc/config-Mon_Jul_28_17_35_00_CEST_2008.bad > > failure pattern: it booted up fine to userspace and seemed function, but > then produced a spontaneous reboot while building a kernel, without any > log entries. > > other bad config is: > > http://redhat.com/~mingo/misc/config-Mon_Jul_28_17_30_39_CEST_2008.bad > > failure pattern: early crash at: > > PANIC: early exception 0e rip 10:fffffff817dfc1a error 0 cr2 28 > > which corresponds to: > > ffffffff817dfc0f : > ffffffff817dfc0f: 48 8b 05 aa cf 04 00 mov 315306(%rip),%rax > # ffffffff8182cbc0 > ffffffff817dfc16: 55 push %rbp > ffffffff817dfc17: 48 89 e5 mov %rsp,%rbp > ffffffff817dfc1a: 48 8b 40 28 mov 0x28(%rax),%rax [*] > ffffffff817dfc1e: 48 85 c0 test %rax,%rax > > i.e. RAX was zero. > > i've pushed out the tip/tmp.x86/percpu-zerobased.bad branch which shows > the exact kernel that failed. It was generated by: > > git-checkout tip/master > git-merge tip/x86/percpu-zerobased > > Ingo Ok, thanks, I'll take a look. There were some questions that I had (and I should have RFC'd the patch since there are still questions.) Mike