Re: [REVIEW] Prevent direct I/O from mapping extents beyond eof

public inbox for linux-xfs@vger.kernel.org
 help / color / mirror / Atom feed

* Re: [REVIEW] Prevent direct I/O from mapping extents beyond eof
       [not found] <48A50152.8020104@sgi.com>
@ 2008-08-15 22:09 ` Christoph Hellwig
  2008-08-15 22:27   ` Andrew Morton
  0 siblings, 1 reply; 3+ messages in thread
From: Christoph Hellwig @ 2008-08-15 22:09 UTC (permalink / raw)
  To: Lachlan McIlroy; +Cc: xfs-dev, xfs-oss, akpm, linux-fsdevel

On Fri, Aug 15, 2008 at 02:08:50PM +1000, Lachlan McIlroy wrote:
> With the help from some tracing I found that we try to map extents beyond
> eof when doing a direct I/O read.  It appears that the way to inform the
> generic direct I/O path (ie do_direct_IO()) that we have breached eof is
> to return an unmapped buffer from xfs_get_blocks_direct().  This will cause
> do_direct_IO() to jump to the hole handling code where is will check for
> eof and then abort.
>
> This problem was found because a direct I/O read was trying to map beyond
> eof and was encountering delayed allocations.  The delayed allocations beyond
> eof are speculative allocations and they didn't get converted when the direct
> I/O flushed the file because there was only enough space in the current AG
> to convert and write out the dirty pages within eof.  Note that
> xfs_iomap_write_allocate() wont necessarily convert all the delayed allocation
> passed to it - it will return after allocating the first extent - so if the
> delayed allocation extends beyond eof then it will stay that way.
>
> This change will detect a direct I/O read beyond eof:

The change looks good to me, but I really think the direct I/O could
should never send down requests like this down to the filesystems.  akpm
and -fsdevel Cc'ed.

> --- a/fs/xfs/linux-2.6/xfs_aops.c	2008-08-15 13:30:03.000000000 +1000
> +++ b/fs/xfs/linux-2.6/xfs_aops.c	2008-08-11 16:51:07.000000000 +1000
> @@ -1338,6 +1338,10 @@ __xfs_get_blocks(
> 	offset = (xfs_off_t)iblock << inode->i_blkbits;
> 	ASSERT(bh_result->b_size >= (1 << inode->i_blkbits));
> 	size = bh_result->b_size;
> +
> +	if (!create && direct && offset >= i_size_read(inode))
> +		return 0;
> +
> 	error = xfs_iomap(XFS_I(inode), offset, size,
> 			     create ? flags : BMAPI_READ, &iomap, &niomap);
> 	if (error)
>
>
---end quoted text---

^ permalink raw reply	[flat|nested] 3+ messages in thread

* Re: [REVIEW] Prevent direct I/O from mapping extents beyond eof
  2008-08-15 22:09 ` [REVIEW] Prevent direct I/O from mapping extents beyond eof Christoph Hellwig
@ 2008-08-15 22:27   ` Andrew Morton
  2008-08-15 22:40     ` Christoph Hellwig
  0 siblings, 1 reply; 3+ messages in thread
From: Andrew Morton @ 2008-08-15 22:27 UTC (permalink / raw)
  To: Christoph Hellwig; +Cc: lachlan, xfs-dev, xfs, linux-fsdevel

On Fri, 15 Aug 2008 18:09:58 -0400
Christoph Hellwig <hch@infradead.org> wrote:

> On Fri, Aug 15, 2008 at 02:08:50PM +1000, Lachlan McIlroy wrote:
> > With the help from some tracing I found that we try to map extents beyond
> > eof when doing a direct I/O read.  It appears that the way to inform the
> > generic direct I/O path (ie do_direct_IO()) that we have breached eof is
> > to return an unmapped buffer from xfs_get_blocks_direct().  This will cause
> > do_direct_IO() to jump to the hole handling code where is will check for
> > eof and then abort.
> >
> > This problem was found because a direct I/O read was trying to map beyond
> > eof and was encountering delayed allocations.  The delayed allocations beyond
> > eof are speculative allocations and they didn't get converted when the direct
> > I/O flushed the file because there was only enough space in the current AG
> > to convert and write out the dirty pages within eof.  Note that
> > xfs_iomap_write_allocate() wont necessarily convert all the delayed allocation
> > passed to it - it will return after allocating the first extent - so if the
> > delayed allocation extends beyond eof then it will stay that way.
> >
> > This change will detect a direct I/O read beyond eof:
> 
> The change looks good to me, but I really think the direct I/O could
> should never send down requests like this down to the filesystems.  akpm
> and -fsdevel Cc'ed.

Oh gee, I forget, and so many people have done drivebys on that code...

We _could_ add additional i_size checking into direct-io.c but bear in
mind that it would be best-effort unreliable stuff.  The code will
still be tripped up by concurrent extends and concurrent truncates.

So we'll still end up calling the fs for blocks outside i_size, only
less commonly.  I think.

^ permalink raw reply	[flat|nested] 3+ messages in thread

* Re: [REVIEW] Prevent direct I/O from mapping extents beyond eof
  2008-08-15 22:27   ` Andrew Morton
@ 2008-08-15 22:40     ` Christoph Hellwig
  0 siblings, 0 replies; 3+ messages in thread
From: Christoph Hellwig @ 2008-08-15 22:40 UTC (permalink / raw)
  To: Andrew Morton; +Cc: Christoph Hellwig, lachlan, xfs-dev, xfs, linux-fsdevel

On Fri, Aug 15, 2008 at 03:27:56PM -0700, Andrew Morton wrote:
> Oh gee, I forget, and so many people have done drivebys on that code...
> 
> We _could_ add additional i_size checking into direct-io.c but bear in
> mind that it would be best-effort unreliable stuff.  The code will
> still be tripped up by concurrent extends and concurrent truncates.
> 
> So we'll still end up calling the fs for blocks outside i_size, only
> less commonly.  I think.

Yeah, guess we should put in this patch then.

^ permalink raw reply	[flat|nested] 3+ messages in thread

end of thread, other threads:[~2008-08-15 22:39 UTC | newest]

Thread overview: 3+ messages (download: mbox.gz follow: Atom feed
-- links below jump to the message on this page --
     [not found] <48A50152.8020104@sgi.com>
2008-08-15 22:09 ` [REVIEW] Prevent direct I/O from mapping extents beyond eof Christoph Hellwig
2008-08-15 22:27   ` Andrew Morton
2008-08-15 22:40     ` Christoph Hellwig

This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox