From: Nick Piggin <npiggin@suse.de>
To: Dmitriy Monakhov <dmonakhov@sw.ru>
Cc: linux-kernel@vger.kernel.org,
Andrew Morton <akpm@linux-foundation.org>,
devel@openvz.org
Subject: Re: [PATCH 2/2] mm: incorrect direct io error handling (v6)
Date: Mon, 12 Mar 2007 10:09:17 +0100 [thread overview]
Message-ID: <20070312090917.GD28546@wotan.suse.de> (raw)
In-Reply-To: <871wjurgcd.fsf@sw.ru>
On Mon, Mar 12, 2007 at 11:55:30AM +0300, Dmitriy Monakhov wrote:
> Nick Piggin <npiggin@suse.de> writes:
>
> > On Mon, Mar 12, 2007 at 10:58:10AM +0300, Dmitriy Monakhov wrote:
> >> @@ -2240,6 +2241,29 @@ ssize_t generic_file_aio_write(struct kiocb *iocb, const struct iovec *iov,
> >> mutex_lock(&inode->i_mutex);
> >> ret = __generic_file_aio_write_nolock(iocb, iov, nr_segs,
> >> &iocb->ki_pos);
> >> + /*
> >> + * If __generic_file_aio_write_nolock has failed.
> >> + * This may happen because of:
> >> + * 1) Bad segment found (failed before actual write attempt)
> >> + * 2) Segments are good, but actual write operation failed
> >> + * and may have instantiated a few blocks outside i_size.
> >> + * a) in case of buffered write these blocks was already
> >> + * trimmed by generic_file_buffered_write()
> >> + * b) in case of O_DIRECT these blocks weren't trimmed yet.
> >> + *
> >> + * In case of (2b) these blocks have to be trimmed off again.
> >> + */
> >> + if (unlikely( ret < 0 && file->f_flags & O_DIRECT)) {
> >> + unsigned long nr_segs_avail = nr_segs;
> >> + size_t count = 0;
> >> + if (!generic_segment_checks(iov, &nr_segs_avail, &count,
> >> + VERIFY_READ)) {
> >> + /*It is (2b) case, because segments are good*/
> >> + loff_t isize = i_size_read(inode);
> >> + if (pos + count > isize)
> >> + vmtruncate(inode, isize);
> >> + }
> >> + }
> >
> > OK, but wouldn't this be better to be done in the actual direct IO
> > functions themselves? Thus you could be sure that you have the 2b case,
> > and the code would be less fragile to something changing?
> Ohh, We can't just call vmtruncate() after generic_file_direct_write()
> failure while __generic_file_aio_write_nolock() becase where is no guarantee
> what i_mutex held. In fact all existing fs always invoke
> __generic_file_aio_write_nolock() with i_mutex held in case of S_ISREG files,
> but this was't explicitly demanded and documented. I've proposed to do it in
> previous versions of this patch, because it this just document current state
> of affairs, but David Chinner wasn't agree with it.
It seemed like it was documented in the comments that you altered in this
patch...
How would such a filesystem that did not hold i_mutex propose to fix the
problem?
The burden should be on those filesystems that might not want to hold
i_mutex here, to solve the problem nicely, rather than generic code to take
this ugly code.
next prev parent reply other threads:[~2007-03-12 9:09 UTC|newest]
Thread overview: 6+ messages / expand[flat|nested] mbox.gz Atom feed top
2007-03-12 7:58 [PATCH 2/2] mm: incorrect direct io error handling (v6) Dmitriy Monakhov
2007-03-12 8:20 ` Nick Piggin
2007-03-12 8:55 ` Dmitriy Monakhov
2007-03-12 9:09 ` Nick Piggin [this message]
2007-03-12 9:23 ` Dmitriy Monakhov
2007-03-12 12:14 ` Nick Piggin
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=20070312090917.GD28546@wotan.suse.de \
--to=npiggin@suse.de \
--cc=akpm@linux-foundation.org \
--cc=devel@openvz.org \
--cc=dmonakhov@sw.ru \
--cc=linux-kernel@vger.kernel.org \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox