From: Benjamin LaHaise <bcrl@kvack.org>
To: Jan Kara <jack@suse.cz>
Cc: linux-ext4@vger.kernel.org
Subject: Re: high write latency bug in ext3 / jbd in 3.4
Date: Tue, 28 Jan 2014 11:06:26 -0500
Message-ID: <20140128160626.GM19273@kvack.org>
In-Reply-To: <20140127235518.GB7020@quack.suse.cz>

Hi Jan,
On Tue, Jan 28, 2014 at 12:55:18AM +0100, Jan Kara wrote:
> Hello,
>
> On Mon 13-01-14 15:13:20, Benjamin LaHaise wrote:
...
> I'm not sure whether you have switched to ext4 as others have suggested in
> this thread. If not:
> 1) Since the stall is so long, can you run
> 'echo w >/proc/sysrq-trigger'
> when the stall happens and send the stack traces from kernel log?
Unfortunately, I didn't capture that output while testing. I ended up
switching to the ext4 codebase for our ext3 filesystems. With a
couple of tweaks to the inode allocator, I was able to resolve the
regression that moving to ext4 had caused. If there is actually some desire
to fix this bug, I can certainly go back and reproduce it.
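For anyone reproducing this, the capture requested above could be automated
roughly as follows. This is a minimal Python sketch under stated assumptions,
not something that was run in this thread: the names probe_once,
write_and_sync and dump_blocked_tasks, the probe file, and the threshold are
all hypothetical. Only /proc/sysrq-trigger and the 'w' command come from the
suggestion quoted above. The timer fires while the write is still blocked,
which is when the stack traces are useful.

```python
import os
import threading
import time

# Real kernel interface; writing to it requires root and sysrq enabled.
SYSRQ_TRIGGER = "/proc/sysrq-trigger"


def dump_blocked_tasks(trigger=SYSRQ_TRIGGER):
    """Equivalent of: echo w > /proc/sysrq-trigger (traces go to the kernel log)."""
    with open(trigger, "w") as f:
        f.write("w")


def write_and_sync(path, payload=b"x" * 4096):
    """Write a small probe file and fsync it so the write reaches the journal."""
    fd = os.open(path, os.O_WRONLY | os.O_CREAT | os.O_TRUNC, 0o600)
    try:
        os.write(fd, payload)
        os.fsync(fd)
    finally:
        os.close(fd)


def probe_once(action, threshold_s, on_stall):
    """Run action(); if it is still running after threshold_s, call on_stall().

    on_stall runs in a separate timer thread, so it can fire while action()
    is blocked in the kernel. Returns the elapsed time in seconds.
    """
    timer = threading.Timer(threshold_s, on_stall)
    timer.daemon = True
    start = time.monotonic()
    timer.start()
    try:
        action()
    finally:
        # No-op if the timer has already fired.
        timer.cancel()
    return time.monotonic() - start
```

A loop calling probe_once(lambda: write_and_sync("/mnt/test/probe"), 5.0,
dump_blocked_tasks) once a second, run as root on the affected filesystem,
would be one way to use it; the mount point and the 5 second threshold are
placeholders.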
> 2) Are you running with 'barrier' option?
I didn't change the barrier setting from the default.
> > Does anyone have any ideas on where to look in ext3 or jbd for something
> > that might be causing this behaviour? If I use ext4 to mount the ext3
> > filesystem being tested, the problem goes away. Testing on newer kernels
> > is not very easy to do (the system has other dependencies on the 3.4
> > kernel). Thoughts?
> My suspicion is we are hanging on writing the 'commit' block of a
> transaction. That issues a cache flush to the storage and that can take
> quite a bit of time if we are unlucky.
I actually control both ends of the SAN (the two systems are connected via
fibre channel), and while the hang occurs, no I/O shows up as being queued
on the head end. It is as if the system is waiting on a write that hasn't
been submitted yet.
-ben
> Honza
> --
> Jan Kara <jack@suse.cz>
> SUSE Labs, CR
--
"Thought is the essence of where you are now."