linux-ext4.vger.kernel.org archive mirror
 help / color / mirror / Atom feed
From: Benjamin LaHaise <bcrl@kvack.org>
To: Jan Kara <jack@suse.cz>
Cc: linux-ext4@vger.kernel.org
Subject: Re: high write latency bug in ext3 / jbd in 3.4
Date: Tue, 28 Jan 2014 11:06:26 -0500	[thread overview]
Message-ID: <20140128160626.GM19273@kvack.org> (raw)
In-Reply-To: <20140127235518.GB7020@quack.suse.cz>

Hi Jan,

On Tue, Jan 28, 2014 at 12:55:18AM +0100, Jan Kara wrote:
>   Hello,
> 
> On Mon 13-01-14 15:13:20, Benjamin LaHaise wrote:
...
>   I'm not sure if you haven't switched to ext4 as others have suggested in
> this thread. If not:
>   1) Since the stall is so long, can you run
>      'echo w >/proc/sysrq-trigger'
>      when the stall happens and send the stack traces from kernel log?

Unfortunately, I didn't capture that output while testing.  I ended up 
migrating to using the ext4 codebase for our ext3 filesystems.  With a 
couple of tweaks to the inode allocator, I was able to resolve the 
regression moving to ext4 had caused.  If there is actually some desire 
to fix this bug, I can certainly go back and reproduce it.

>   2) Are you running with 'barrier' option?

I didn't change the barrier setting from the default.

> > Does anyone have any ideas on where to look in ext3 or jbd for something 
> > that might be causing this behaviour?  If I use ext4 to mount the ext3 
> > filesystem being tested, the problem goes away.  Testing on newer kernels 
> > is not very easy to do (the system has other dependencyies on the 3.4 
> > kernel).  Thoughts?
>   My suspicion is we are hanging on writing the 'commit' block of a
> transaction. That issues a cache flush to the storage and that can take
> quite a bit of time if we are unlucky.

I actually control both ends of the SAN (the two systems are connected via 
fibre channel), and while the hang occurs, no I/O shows up as being queued 
on the head end.  It is as if the system is waiting on a write that hasn't 
been submitted yet.

		-ben

> 								Honza
> -- 
> Jan Kara <jack@suse.cz>
> SUSE Labs, CR

-- 
"Thought is the essence of where you are now."

      reply	other threads:[~2014-01-28 16:06 UTC|newest]

Thread overview: 11+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2014-01-13 20:13 high write latency bug in ext3 / jbd in 3.4 Benjamin LaHaise
2014-01-13 21:01 ` Andreas Dilger
2014-01-13 21:16   ` Benjamin LaHaise
2014-01-13 21:39     ` Eric Sandeen
2014-01-13 22:52     ` Theodore Ts'o
2014-01-14  0:55       ` Andreas Dilger
2014-01-14  1:01         ` Eric Sandeen
2014-01-14  1:21       ` Benjamin LaHaise
2014-01-14  3:52         ` Theodore Ts'o
2014-01-27 23:55 ` Jan Kara
2014-01-28 16:06   ` Benjamin LaHaise [this message]

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=20140128160626.GM19273@kvack.org \
    --to=bcrl@kvack.org \
    --cc=jack@suse.cz \
    --cc=linux-ext4@vger.kernel.org \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).