linux-ext4.vger.kernel.org archive mirror
 help / color / mirror / Atom feed
From: Jan Kara <jack@suse.cz>
To: Dmitry Monakhov <dmonakhov@openvz.org>
Cc: Kazuya Mio <k-mio@sx.jp.nec.com>, Jan Kara <jack@suse.cz>,
	ext4 <linux-ext4@vger.kernel.org>, Theodore Tso <tytso@mit.edu>,
	Andreas Dilger <adilger@dilger.ca>
Subject: Re: [BUG] aborted ext4 leads to inifinity loop in balance_dirty_pages
Date: Mon, 7 Nov 2011 18:29:39 +0100	[thread overview]
Message-ID: <20111107172939.GH15796@quack.suse.cz> (raw)
In-Reply-To: <87y5vsl5ue.fsf@dmbot.sw.ru>

On Mon 07-11-11 12:00:41, Dmitry Monakhov wrote:
> On Fri, 28 Oct 2011 14:34:31 +0900, Kazuya Mio <k-mio@sx.jp.nec.com> wrote:
> > 2011/10/25 22:40, Jan Kara wrote:
> > >   Please no. Generally this boils down to what do we do with dirty data
> > > when there's error in writing them out. Currently we just throw them away
> > > (e.g. in media error case) but I don't think that's a generally good thing
> > > because e.g. admin may want to copy the data to other working storage or
> > > so. So I think we should rather keep the data and provide a mechanism for
> > > userspace to ask kernel to get rid of the data (so that we don't eventually
> > > run OOM).
> > 
> > I see. I agree with you.
> > 
> > >> Do you have any ideas?
> > >   So the question is what would you like to achieve. If you just want to
> > > unblock a thread then a solution would be to make a thread at
> > > balance_dirty_pages() killable. If generally you want to get rid of dirty
> > > memory, then I don't have a really good answer but throwing dirty data away
> > > seems like a bad answer to me.
> > 
> > The problem is that we cannot unmount the corrupted filesystem due to
> > un-killable dd process. We must bring down the system to resume the service
> > with no dirty pages. I think it is important for the service continuity
> > to be able to kill the thread handling in balance_dirty_pages().
> In fact you are very lucky because dd is just deadlocked, in many cases
> journal abort result in BUG_ON triggering(if IO load is high enough).
  Can you provide the exact kernel message? I'd be interested...

> This is because transaction abort check is racy. Right now i've no good
> fix which has reasonable performance. My latest idea is to protect
> transaction abort check via SRCU.
  Yeah, the code does not seem to care about races too much but I don't see
which BUG_ON would be triggered...

								Honza
-- 
Jan Kara <jack@suse.cz>
SUSE Labs, CR

  reply	other threads:[~2011-11-07 17:29 UTC|newest]

Thread overview: 14+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2011-10-25 12:04 [BUG] aborted ext4 leads to inifinity loop in balance_dirty_pages Kazuya Mio
2011-10-25 13:40 ` Jan Kara
2011-10-28  5:34   ` Kazuya Mio
2011-11-01 23:13     ` Jan Kara
2011-11-02  5:24       ` Kazuya Mio
2011-11-07  8:00     ` Dmitry Monakhov
2011-11-07 17:29       ` Jan Kara [this message]
2011-11-07 17:45         ` Dmitry Monakhov
2011-11-07 21:23           ` Jan Kara
2011-11-08  0:03     ` Jan Kara
2011-11-09  8:28       ` Kazuya Mio
2011-11-09 11:15         ` Jan Kara
2011-11-14 10:06       ` Kazuya Mio
2011-11-14 11:11         ` Jan Kara

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=20111107172939.GH15796@quack.suse.cz \
    --to=jack@suse.cz \
    --cc=adilger@dilger.ca \
    --cc=dmonakhov@openvz.org \
    --cc=k-mio@sx.jp.nec.com \
    --cc=linux-ext4@vger.kernel.org \
    --cc=tytso@mit.edu \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).