From: Benjamin LaHaise <bcrl@kvack.org>
To: linux-ext4@vger.kernel.org
Subject: high write latency bug in ext3 / jbd in 3.4
Date: Mon, 13 Jan 2014 15:13:20 -0500
Message-ID: <20140113201320.GD1214@kvack.org>
Hello all,
I've recently encountered a bug in ext3 where the occasional write is
showing extremely high latency, on the order of 2.2 to 11 seconds compared
to a more typical 200-300ms. This is happening on a 3.4.67 kernel. When
this occurs, the system is writing to disk at between 290 and 330MB/s.
The test takes anywhere from 3 to 12 minutes into a run to trigger the
high latency write. During one of these high latency writes, vmstat reports
0 blocks being written to disk. The disk array being written to is able to
write quite a bit faster (about 770MB/s).
The setup is a bit complicated, but is completely reproducible. The
workload consists of about 8 worker threads creating and then writing out
spool files that are a little under 8MB in size. After each write, the file
and the directory it is in are then fsync()d. The latency measured is from
the beginning open() of a spool file until the final fsync() completes.
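For anyone wanting to approximate the workload, here is a rough sketch of the write path and the latency measurement described above. It is not the actual test program: the function names, the exact file size, the 1MB write chunk and the file naming scheme are all assumptions made for illustration.

```python
import os
import time
from concurrent.futures import ThreadPoolExecutor

# "A little under 8MB": the exact size is an assumption.
SPOOL_SIZE = 8 * 1024 * 1024 - 4096
# Write granularity is also an assumption; the post does not state it.
CHUNK = 1024 * 1024


def write_spool_file(directory, name, size=SPOOL_SIZE, chunk=CHUNK):
    """Create and write one spool file, then fsync the file and its directory.

    Returns the elapsed seconds from just before open() until the final
    fsync() has returned, which is the interval the post measures.
    """
    path = os.path.join(directory, name)
    buf = b"\0" * chunk
    start = time.monotonic()

    fd = os.open(path, os.O_WRONLY | os.O_CREAT | os.O_TRUNC, 0o644)
    try:
        remaining = size
        while remaining > 0:
            # os.write() may write fewer bytes than requested, so loop.
            written = os.write(fd, buf[:min(chunk, remaining)])
            remaining -= written
        os.fsync(fd)
    finally:
        os.close(fd)

    # fsync the containing directory so the new directory entry is durable.
    dfd = os.open(directory, os.O_RDONLY | os.O_DIRECTORY)
    try:
        os.fsync(dfd)
    finally:
        os.close(dfd)

    return time.monotonic() - start


def run_workers(directory, workers=8, files_per_worker=4, size=SPOOL_SIZE):
    """Run several worker threads, each writing spool files back to back.

    Returns a flat list of per-file latencies in seconds.
    """
    def worker(worker_id):
        return [
            write_spool_file(directory, f"spool-{worker_id}-{i}", size)
            for i in range(files_per_worker)
        ]

    with ThreadPoolExecutor(max_workers=workers) as pool:
        per_worker = list(pool.map(worker, range(workers)))
    return [latency for latencies in per_worker for latency in latencies]
```

Calling run_workers() on a directory of the filesystem under test and looking at max() of the returned list should show whether any single open-to-fsync interval lands in the multi-second range. The real test runs for minutes, so files_per_worker would need to be much larger than the default here.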
Poking around the system with latencytop shows that sleep_on_buffer() is
where all the latency is coming from, leading to log_wait_commit() showing
the very high latency for the fsync()s. This leads me to believe that jbd
is somehow not properly flushing a buffer being waited on in a timely
fashion. Changing the I/O elevator in use has no effect.
Does anyone have any ideas on where to look in ext3 or jbd for something
that might be causing this behaviour? If I use ext4 to mount the ext3
filesystem being tested, the problem goes away. Testing on newer kernels
is not very easy to do (the system has other dependencies on the 3.4
kernel). Thoughts?
-ben
--
"Thought is the essence of where you are now."