From: Michael Ellerman <michael@ellerman.id.au>
To: Paul Collins <paul@burly.ondioline.org>
Cc: "J. Bruce Fields" <bfields@fieldses.org>,
	Neil Brown <neilb@suse.de>,
	nfsv4@linux-nfs.org, linux-kernel@vger.kernel.org,
	linuxppc-dev@ozlabs.org
Subject: Re: nfsd, v4: oops in find_acceptable_alias, ppc32 Linux, post-2.6.27-rc1
Date: Tue, 05 Aug 2008 04:34:22 +0000
Message-ID: <1217910862.7951.22.camel@localhost>
In-Reply-To: <8763qg5don.fsf@burly.wgtn.ondioline.org>

On Tue, 2008-08-05 at 15:43 +1200, Paul Collins wrote:
> Michael Ellerman <michael@ellerman.id.au> writes:
> 
> > On Mon, 2008-08-04 at 16:59 -0400, J. Bruce Fields wrote:
> >> On Tue, Aug 05, 2008 at 08:51:23AM +1200, Paul Collins wrote:
> >> > Michael Ellerman <michael@ellerman.id.au> writes:
> >> > 
> >> > > On Mon, 2008-08-04 at 22:00 +1200, Paul Collins wrote:
> >> > >> Paul Collins <paul@burly.ondioline.org> writes:
> >> > >> 
> >> > >> > Neil Brown <neilb@suse.de> writes:
> >> > >> >> Could you try removing the 'static' declaration for nfsd_acceptable
> >> > >> >> and recompile?
> >> > >> >> Or maybe try a different compiler?
> >> > >> >
> >> > >> > I will give these a try this evening.
> >> > >> 
> >> > >> I built myself a nice new cross compiler:
> >> > >> 
> >> > >>         powerpc-linux-gnu-gcc-4.1 (GCC) 4.1.3 20080623 (prerelease) (Debian 4.1.2-23)
> >> > >> 
> >> > >> and rebuilt 94ad374a0751f40d25e22e036c37f7263569d24c.  Running that on
> >> > >> the server and 2.6.26 on the client, I got yet another Oops.  This one
> >> > >> locked the machine up pretty good, so all I have is a picture:
> >> > >> 
> >> > >> http://ondioline.org/~paul/DSCN1608.JPG
> >> > >
> >> > > Wow.
> >> > >
> >> > > Can you try building a kernel on the server? ie. not over NFS.
> >> > 
> >> > Built kernels on the server with native gcc 4.2.4 and 4.3.1 and repeated
> >> > the build test.
> >>  
> >> But the build test itself was over nfs?  (And you can't reproduce the
> >> same problem without nfs?)
> >
> > Yeah, I'm not clear on that either. What I was aiming at was can you get
> > it to oops somewhere else by not building over NFS - in which case we
> > can rule NFS (more or less) out.
> 
> I think I may be able to rule NFS out now.  I just got this Oops when Xorg
> started on boot.

Cool, that looks fairly convincing.

> In case anyone wants to disassemble it, I've uploaded the kernel to
> http://ondioline.org/~paul/vmlinux-2.6.27-rc1-00158-g643fbd8 and the
> config to http://ondioline.org/~paul/config-2.6.27-rc1-00158-g643fbd8
> 
> I've rebuilt a whole bunch of times in the course of this little
> project, but all four Oopses in this message are from the very
> vmlinux linked above.
> 
> I have a couple of patches applied locally (a console font and a
> Bluetooth HID quirk), so this is really Linus revision
> 94ad374a0751f40d25e22e036c37f7263569d24c.

And you're _sure_ none of them has a "break-everything" hunk in it? :)


I see you have FTRACE enabled. That's new and could potentially bugger
things up without the compiler knowing, so can you turn that off?

And can you enable CONFIG_CODE_PATCHING_SELFTEST and
CONFIG_FTR_FIXUP_SELFTEST? That will enable tests of some code I changed
that /could/ (maybe) cause random blow ups.

Also, how old is the machine? Any chance you're just seeing random
memory corruption?

cheers

-- 
Michael Ellerman
OzLabs, IBM Australia Development Lab

wwweb: http://michael.ellerman.id.au
phone: +61 2 6212 1183 (tie line 70 21183)

We do not inherit the earth from our ancestors,
we borrow it from our children. - S.M.A.R.T Person

