From: Jan Kara <jack@suse.cz>
To: Christoph Hellwig <hch@infradead.org>
Cc: Jan Kara <jack@suse.cz>, Sergey Meirovich <rathamahata@gmail.com>,
linux-scsi <linux-scsi@vger.kernel.org>,
Linux Kernel Mailing List <linux-kernel@vger.kernel.org>,
Gluk <git.user@gmail.com>
Subject: Re: Terrible performance of sequential O_DIRECT 4k writes in SAN environment. ~3 times slower then Solars 10 with the same HBA/Storage.
Date: Wed, 8 Jan 2014 02:17:13 +0100 [thread overview]
Message-ID: <20140108011713.GA5212@quack.suse.cz> (raw)
In-Reply-To: <20140107155830.GA28395@infradead.org>
On Tue 07-01-14 07:58:30, Christoph Hellwig wrote:
> On Mon, Jan 06, 2014 at 09:10:32PM +0100, Jan Kara wrote:
> > This is likely a problem of Linux direct IO implementation. The thing is
> > that in Linux when you are doing appending direct IO (i.e., direct IO which
> > changes file size), the IO is performed synchronously so that we have our
> > life simpler with inode size update etc. (and frankly our current locking
> > rules make inode size update on IO completion almost impossible). Since
> > appending direct IO isn't very common, we seem to get away with this
> > simplification just fine...
>
> Shouldn't be too much of a problem at least for XFS and maybe even ext4
> with the workqueue based I/O end handler. For XFS we protect size
> updates by the ilock which we already taken in that handler, not sure
> what ext4 would do there.
Well, I was specifically worried about i_mutex locking. In particular:
Before we report appending IO completion we need to update i_size.
To update i_size we need to grab i_mutex.
Now this is unpleasant because inode_dio_wait() happens under i_mutex so
the above would create lock inversion. And we cannot really do
inode_dio_done() before grabbing i_mutex as that would open interesting
races between truncate decreasing i_size and DIO increasing it.
Honza
--
Jan Kara <jack@suse.cz>
SUSE Labs, CR
next prev parent reply other threads:[~2014-01-08 1:17 UTC|newest]
Thread overview: 32+ messages / expand[flat|nested] mbox.gz Atom feed top
2014-01-06 9:38 Terrible performance of sequential O_DIRECT 4k writes in SAN environment. ~3 times slower then Solars 10 with the same HBA/Storage Sergey Meirovich
2014-01-06 20:10 ` Jan Kara
2014-01-07 9:13 ` Sergey Meirovich
2014-01-07 15:58 ` Christoph Hellwig
2014-01-07 18:37 ` Sergey Meirovich
2014-01-08 14:03 ` Christoph Hellwig
2014-01-08 14:43 ` Sergey Meirovich
2014-01-08 15:26 ` Christoph Hellwig
2014-01-08 17:30 ` Sergey Meirovich
2014-01-08 20:55 ` Jan Kara
2014-01-09 10:11 ` Sergey Meirovich
2014-01-10 9:36 ` Jan Kara
2014-01-10 10:36 ` Sergey Meirovich
2014-01-10 10:48 ` Jan Kara
2014-01-10 14:32 ` Sergey Meirovich
2014-01-10 18:14 ` Sergey Meirovich
2014-01-14 13:30 ` Sergey Meirovich
2014-01-14 13:30 ` Sergey Meirovich
2014-01-15 22:07 ` Dave Chinner
2014-01-15 22:07 ` Dave Chinner
2014-01-20 13:58 ` Christoph Hellwig
2014-01-20 13:58 ` Christoph Hellwig
2014-01-20 22:18 ` Dave Chinner
2014-01-20 22:18 ` Dave Chinner
2014-01-08 1:17 ` Jan Kara [this message]
2014-01-08 14:03 ` Christoph Hellwig
2014-01-07 20:57 ` James Smart
2014-01-08 13:57 ` Sergey Meirovich
2014-01-09 19:54 ` Douglas Gilbert
2014-01-09 21:26 ` Sergey Meirovich
2014-01-09 21:43 ` Sergey Meirovich
-- strict thread matches above, loose matches on Subject: below --
2014-01-06 13:16 Sergey Meirovich
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=20140108011713.GA5212@quack.suse.cz \
--to=jack@suse.cz \
--cc=git.user@gmail.com \
--cc=hch@infradead.org \
--cc=linux-kernel@vger.kernel.org \
--cc=linux-scsi@vger.kernel.org \
--cc=rathamahata@gmail.com \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.