linux-btrfs.vger.kernel.org archive mirror
 help / color / mirror / Atom feed
From: Dave Chinner <david@fromorbit.com>
To: Sunil Mushran <sunil.mushran@oracle.com>
Cc: Andreas Dilger <adilger@dilger.ca>,
	Christoph Hellwig <hch@infradead.org>,
	Josef Bacik <josef@redhat.com>,
	linux-fsdevel@vger.kernel.org, linux-kernel@vger.kernel.org,
	linux-btrfs@vger.kernel.org, xfs@oss.sgi.com,
	viro@ZenIV.linux.org.uk, dchinner@redhat.com
Subject: Re: [PATCH] xfstests 255: add a seek_data/seek_hole tester
Date: Wed, 31 Aug 2011 13:29:32 +1000	[thread overview]
Message-ID: <20110831032932.GI32358@dastard> (raw)
In-Reply-To: <4E5D8B8E.8030401@oracle.com>

On Tue, Aug 30, 2011 at 06:17:02PM -0700, Sunil Mushran wrote:
> On 08/25/2011 06:35 PM, Dave Chinner wrote:
> >Agreed, that's the way I'd interpret it, too. So perhaps we need to
> >ensure that this interpretation is actually tested by this test?
> >
> >How about some definitions to work by:
> >
> >Data: a range of the file that contains valid data, regardless of
> >whether it exists in memory or on disk. The valid data can be
> >preceeded and/or followed by an arbitrary number of zero bytes
> >dependent on the underlying implementation of hole detection.
> >
> >Hole: a range of the file that contains no data or is made up
> >entirely of  NULL (zero) data. Holes include preallocated ranges of
> >files that have not had actual data written to them.
> >
> >Does that make sense? It has sufficient flexibility in it for the
> >existing generic "non-implementation", allows for filesystems to
> >define their own hole detection boundaries (e.g. filesystem block
> >size), and effectively defines how preallocated ranges from
> >fallocate() should be treated (i.e. as holes). If we can agree on
> >those definitions, I think that we should document them in both the
> >kernel and the man page that defines SEEK_HOLE/SEEK_DATA so everyone
> >is on the same page...
> 
> We should not tie in the definition to existing fs technologies.

Such as? If we don't use well known, well defined terminology, we
end up with ambiguous, vague functionality and inconsistent
implementations.

> Instead
> we should let the fs weigh the cost of providing accurate information
> with the possible gain in performance.
> 
> Data:
> A range in a file that could contain something other than nulls.
> If in doubt, it is data.
> 
> Hole:
> A range in a file that only contains nulls.

And that's -exactly- the ambiguous, vague definition that has raised
all these questions in the first place. I was in doubt about whether
unwritten extents can be considered a hole, and by your definition
that means it should be data. But Andreas seems to be in no doubt it
should be considered a hole.

Hence if I implement XFS support and Andreas implements ext4 support
by your defintion, we end with vastly different behaviour even
though the two filesystems use the same underlying technology for
preallocated ranges. That's exactly the inconsistency in
implementation that I'd like us to avoid.

IOWs, the definition needs to be clear enough to prevent these
inconsistencies from occurring. Indeed, the phrase "preallocated
ranges that have not had data written to them" is as independent of
filesystem implementation or technologies as possible. However,
because Linux supports preallocation (unlike our reference
platform), and we encourage developers to use it where appropriate,
it is best that we define how we expect such ranges to behave
clearly. That makes life easier for everyone.

Cheers,

Dave.
-- 
Dave Chinner
david@fromorbit.com

  reply	other threads:[~2011-08-31  3:29 UTC|newest]

Thread overview: 47+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2011-06-28 15:33 [PATCH 1/4] fs: add SEEK_HOLE and SEEK_DATA flags Josef Bacik
2011-06-28 15:33 ` [PATCH 2/4] Btrfs: implement our own ->llseek Josef Bacik
2011-06-28 15:33 ` [PATCH 3/4] Ext4: handle SEEK_HOLE/SEEK_DATA generically Josef Bacik
2011-06-28 15:33 ` [PATCH 4/4] fs: handle SEEK_HOLE/SEEK_DATA properly in all fs's that define their own llseek Josef Bacik
2011-06-28 15:33 ` [PATCH] xfstests 255: add a seek_data/seek_hole tester Josef Bacik
2011-06-29  6:53   ` Dave Chinner
2011-06-29  7:40     ` Christoph Hellwig
2011-06-29 10:42       ` Pádraig Brady
2011-06-29 17:29         ` Sunil Mushran
2011-06-29 17:36           ` Christoph Hellwig
2011-06-29 17:40             ` Sunil Mushran
2011-06-29 21:29           ` Pádraig Brady
2011-07-01  9:37         ` Christoph Hellwig
2011-06-29 17:10       ` Sunil Mushran
2011-06-29 17:52         ` Josef Bacik
2011-06-29 13:19     ` Josef Bacik
2011-08-25  6:06   ` Christoph Hellwig
2011-08-25  6:40     ` Dave Chinner
2011-08-25  6:51       ` Andreas Dilger
2011-08-26  1:35         ` Dave Chinner
2011-08-26  6:24           ` Marco Stornelli
2011-08-26 14:41             ` Zach Brown
2011-08-27  8:30               ` Marco Stornelli
2011-08-28 10:17                 ` Marco Stornelli
2011-08-30 17:42                 ` Sunil Mushran
2011-08-31  1:17           ` Sunil Mushran
2011-08-31  3:29             ` Dave Chinner [this message]
2011-08-31  3:53               ` david
2011-08-31  4:43               ` Sunil Mushran
2011-08-31  9:05                 ` Pádraig Brady
2011-08-31  4:48               ` Dan Merillat
2011-07-29  9:58 ` [PATCH 1/4] fs: add SEEK_HOLE and SEEK_DATA flags Marco Stornelli
2011-08-20  9:41 ` Marco Stornelli
2011-08-20 10:03   ` Marco Stornelli
2011-08-20 15:36     ` Sunil Mushran
2011-08-20 16:32       ` Marco Stornelli
2011-08-22  6:08         ` Sunil Mushran
2011-08-22 10:56           ` Marco Stornelli
2011-08-22 15:57             ` Sunil Mushran
2011-08-22 17:56               ` Marco Stornelli
2011-08-22 21:22                 ` Sunil Mushran
2011-08-23 17:44                   ` Marco Stornelli
2011-08-31  0:35                 ` Dave Chinner
     [not found]   ` <CAGpXXZ+xjhadprkc_LiP3qUypLLkCxdeEmo8+K+6mOnBuNhmLg@mail.gmail.com>
2011-08-20 17:18     ` Greg Freemyer
  -- strict thread matches above, loose matches on Subject: below --
2011-06-27 18:02 Josef Bacik
2011-06-27 18:02 ` [PATCH] xfstests 255: add a seek_data/seek_hole tester Josef Bacik
2011-06-27 18:32   ` Andreas Dilger
2011-06-27 18:47     ` Josef Bacik

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=20110831032932.GI32358@dastard \
    --to=david@fromorbit.com \
    --cc=adilger@dilger.ca \
    --cc=dchinner@redhat.com \
    --cc=hch@infradead.org \
    --cc=josef@redhat.com \
    --cc=linux-btrfs@vger.kernel.org \
    --cc=linux-fsdevel@vger.kernel.org \
    --cc=linux-kernel@vger.kernel.org \
    --cc=sunil.mushran@oracle.com \
    --cc=viro@ZenIV.linux.org.uk \
    --cc=xfs@oss.sgi.com \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).