From: Dave Chinner <david@fromorbit.com>
To: Mikulas Patocka <mpatocka@redhat.com>
Cc: devel@driverdev.osuosl.org, thornber@redhat.com,
snitzer@redhat.com, cesarb@cesarb.net,
gregkh@linuxfoundation.org, linux-kernel@vger.kernel.org,
Akira Hayakawa <ruby.wktk@gmail.com>,
tj@kernel.org, dm-devel@redhat.com, agk@redhat.com,
joe@perches.com, akpm@linux-foundation.org, xfs@oss.sgi.com,
ejt@redhat.com, dan.carpenter@oracle.com, m.chehab@samsung.com
Subject: Re: A review of dm-writeboost
Date: Wed, 16 Oct 2013 17:07:50 +1100
Message-ID: <20131016060750.GE4446@dastard>
In-Reply-To: <alpine.LRH.2.02.1310151950530.4664@file01.intranet.prod.int.rdu2.redhat.com>
[cc xfs@oss.sgi.com]
On Tue, Oct 15, 2013 at 08:01:45PM -0400, Mikulas Patocka wrote:
> On Mon, 14 Oct 2013, Akira Hayakawa wrote:
> > But, XFS stalls ...
> > -------------------
> > For testing,
> > I manually turns `blockup` to 1
> > when compiling Ruby is in progress
> > on XFS on a writeboost device.
> > As soon as I do it,
> > XFS starts to dump error message
> > like "metadata I/O error: ... ("xlog_iodone") error ..."
> > and after few seconds it then starts to dump
> > like "BUG: soft lockup -CPU#3 stuck for 22s!".
> > The system stalls and doesn't accept the keyboard.
> >
> > I think this behavior is caused by
> > the device always returning -EIO after turning
> > the variable to 1.
> > But why XFS goes stalling on I/O error?
>
> Because it is bloated and buggy.
How did I know you'd take that cheap shot, Mikulas? You are so
predictable...
> We have bug 924301 for XFS crash on I/O
> error...
Which is a problem with memory corruption after filling a dm
snapshot volume to 100%: shortly after XFS has shut down, the
kernel panics from memory corruption. Can't be reproduced without
filling the dm-snapshot volume to 100%, can't be reproduced with any
other filesystem. Crashes are also occurring randomly in printk and
the worker thread infrastructure. Memory and list poisoning clearly
indicate worker thread lists have freed objects on them. There are
lockdep messages from the DM snapshot code, etc.
There's actually very little to point at XFS problems other than the
first hang that was reported where XFS was stuck in a tight loop due
to memory corruption. It reminds me of a very similar bug report
and triage we went through last week:
http://oss.sgi.com/pipermail/xfs/2013-October/030681.html
Further analysis and bisects pointed to the zram driver being buggy,
not XFS:
http://oss.sgi.com/pipermail/xfs/2013-October/030707.html
XFS has historically exposed bugs in block device drivers that no
other filesystem exposes, and so when a new block device driver gets
tested with XFS and we start seeing memory corruption symptoms, it's
a fair bet that it's not XFS that is causing it....
Just sayin'.
---
Akira, can you please post the entire set of messages you are
getting when XFS is showing problems? That way I can try to confirm
whether it's a regression in XFS or something else.
Cheers,
Dave.
--
Dave Chinner
david@fromorbit.com