public inbox for linux-kernel@vger.kernel.org
 help / color / mirror / Atom feed
From: Nathan Scott <nathans@sgi.com>
To: Joerg Sommrey <jo@sommrey.de>, Christoph Hellwig <hch@infradead.org>
Cc: Linux kernel mailing list <linux-kernel@vger.kernel.org>
Subject: Re: Oops on 2.6.9-ac16: xfs, dm and md may be involved
Date: Thu, 23 Dec 2004 10:11:43 +1100	[thread overview]
Message-ID: <20041223101143.A702917@wobbly.melbourne.sgi.com> (raw)
In-Reply-To: <20041222195203.GA24857@sommrey.de>; from jo@sommrey.de on Wed, Dec 22, 2004 at 08:52:03PM +0100

On Wed, Dec 22, 2004 at 08:52:03PM +0100, Joerg Sommrey wrote:
> On Wed, Dec 22, 2004 at 06:26:06PM +0000, Christoph Hellwig wrote:
> > On Tue, Dec 21, 2004 at 07:57:54PM +0100, Joerg Sommrey wrote:
> > > Hello,
> > > 
> > > last night my box died with a kernel oops.  There was a backup
> > > running at that time. The setup:
> > > - 2 SATA disks + 1 SCSI disk
> > > - SATA partitions build up md-raid-arrays (level 0 and 1)
> > > - md-raid-devices and SCSI partitions are physical volumes for dm
> > > - dm logical volumes are used for xfs filesystems
> > > - backup is done on dm-snapshots of those filesystems
> > 
> > Given the strange backtrace and this enormous stack of drivers I bet
> > you're seeing a stack overflow.  

Hmm, I'm not real sure of that Christoph - this was inside a
kernel thread (xfsbufd) where there is almost nothing on the
stack at the point we dove into driver land.  Looked like a
genuine bug to me.  There were plenty of calls on the trace,
but I think several of those were badly guessed by the stack
dump code.  And a couple of registers having a memory poison
pattern looked a bit suspect.

> Does this mean that this kind of stuff just doesn't work?  I was running
> a 4K-stack kernel with this "stack of drivers" for quiet some time without
> problems.  The problems started around 2.6.9-pre-something.  Converting
> to 8K-stacks didn't help.  Is this only xfs related?

Certainly wasn't XFS using stack in the initial oops, perhaps
the lower layers, but I'm a bit sceptical.  Almost certainly
this is a device mapper snapshot problem, the DM folks should
be able to analyse it further.

cheers.

-- 
Nathan

  reply	other threads:[~2004-12-22 23:12 UTC|newest]

Thread overview: 7+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2004-12-21 18:57 Oops on 2.6.9-ac16: xfs, dm and md may be involved Joerg Sommrey
2004-12-21 23:13 ` Nathan Scott
2004-12-22 18:26 ` Christoph Hellwig
2004-12-22 19:52   ` Joerg Sommrey
2004-12-22 23:11     ` Nathan Scott [this message]
2004-12-23 15:36       ` Joerg Sommrey
2005-01-04 22:44       ` Joerg Sommrey

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=20041223101143.A702917@wobbly.melbourne.sgi.com \
    --to=nathans@sgi.com \
    --cc=hch@infradead.org \
    --cc=jo@sommrey.de \
    --cc=linux-kernel@vger.kernel.org \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox