public inbox for linux-xfs@vger.kernel.org
 help / color / mirror / Atom feed
From: Dave Chinner <david@fromorbit.com>
To: Yongqiang Yang <xiaoqiangnk@gmail.com>
Cc: "Andreas Dilger" <adilger@dilger.ca>,
	"Eric Sandeen" <sandeen@sandeen.net>, xfs-oss <xfs@oss.sgi.com>,
	"coreutils@gnu.org" <coreutils@gnu.org>,
	"linux-ext4@vger.kernel.org" <linux-ext4@vger.kernel.org>,
	"Pádraig Brady" <P@draigbrady.com>,
	"Markus Trippelsdorf" <markus@trippelsdorf.de>
Subject: Re: Files full of zeros with coreutils-8.11 and xfs (FIEMAP related?)
Date: Mon, 18 Apr 2011 10:35:53 +1000	[thread overview]
Message-ID: <20110418003553.GR21395@dastard> (raw)
In-Reply-To: <BANLkTikEeXcvjgREoRCgriWAhZfnxJVtKQ@mail.gmail.com>

On Sat, Apr 16, 2011 at 02:05:51PM +0800, Yongqiang Yang wrote:
> On Sat, Apr 16, 2011 at 8:50 AM, Dave Chinner <david@fromorbit.com> wrote:
> > On Thu, Apr 14, 2011 at 11:01:04PM -0600, Andreas Dilger wrote:
> >> On 2011-04-14, at 6:09 PM, Dave Chinner <david@fromorbit.com>
> >> wrote:
> >> > No, this was explicitly laid out in the fiemap interface
> >> > discussions - it's up to the applicaiton to decide if it needs
> >> > to do a sync first. That's what the FIEMAP_FLAG_SYNC control
> >> > flag is for.  This forces the fiemap call to do a fsync _before_
> >> > getting the mapping. If you want to know the exact layout of the
> >> > file is, then you must use this flag.
> >> >
> >> > Even so, it is recognised that this is racy - any use of the
> >> > block map has a time-of-read-to-time-of-use race condition that
> >> > means you have to _verify_ the copy after it completes. FYI,
> >> > that's what xfs_fsr does when copying based on extent maps - if
> >> > the inode has changed in _any way_ during the copy, it aborts
> >> > the copy of that file.
> >> >
> >> > i.e. using fiemap for copying is at best a *hint* about the
> >> > regions that need copying, and it is in no way a guarantee that
> >> > you'll get all the information you need to make accurate copy
> >> > even if you do use the synchronous variant.
> >>
> >> I would tend to agree with Pádraig. If there is data in the
> >> mapping (regardless of whether it is on disk or not), the FIEMAP
> >> should return this to the caller.  The SYNC flag is only intended
> >> to flush the data to disk for tools that are doing
> >> direct-to-disk operations on the data.
> >
> > What you are suggesting is that FIEMAP needs to be page cache
> > coherent, and that is far, far away from the intended use of the
> > interface. Even consiering that you need to looking for active pages
> > in the page cache when mapping extents say to me that you are
> > doing something very wrong.
> >
> > Unwritten extents remain unwritten until the data is physically
> > written to them. Therefore, to change their state, you need to sync
> No, buffered writes change their state without sync.

They shouldn't.

> > the data covering the range.  _Lying_ about whether an extent is in
> > the unwritten state is a really bad precedence to set, especially as
> > it is then guaranteed to change state when a crash occurs (Why did
> > recovery zero out my file? FIEMAP said it contained data before my
> > system crashed!).
> 
> All filesystems have metadata in memory which is not flushed to
> permanent storage. e.g. if a extent exists in memory, but itself and
> corresponding data are not flushed to permanent storage.

Sure, but in the case of unwritten extents, XFS does not change the
metadata state in memory until *after the physical IO is completed*.
I'm pretty sure that btrfs is the same.

IOWs, despite the fact that a buffered write has occurred, no
metadata has changed state in memory, and the extents are still
unwritten in both memory and on disk....

Cheers,

Dave.
-- 
Dave Chinner
david@fromorbit.com

_______________________________________________
xfs mailing list
xfs@oss.sgi.com
http://oss.sgi.com/mailman/listinfo/xfs

  reply	other threads:[~2011-04-18  0:32 UTC|newest]

Thread overview: 61+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2011-04-14 10:26 Files full of zeros with coreutils-8.11 and xfs (FIEMAP related?) Markus Trippelsdorf
2011-04-14 12:06 ` Markus Trippelsdorf
2011-04-14 14:02   ` Markus Trippelsdorf
2011-04-14 14:59     ` Pádraig Brady
2011-04-14 15:50       ` Eric Sandeen
2011-04-14 15:52         ` Pádraig Brady
2011-04-14 15:56           ` Eric Sandeen
2011-04-14 16:03             ` Markus Trippelsdorf
2011-04-14 16:14               ` Eric Sandeen
2011-04-14 16:21               ` Yongqiang Yang
2011-04-14 16:28                 ` Markus Trippelsdorf
2011-04-14 16:31                   ` Eric Sandeen
2011-04-14 16:48                     ` Markus Trippelsdorf
2011-04-14 16:49                       ` Eric Sandeen
2011-04-14 16:04             ` Yongqiang Yang
2011-04-14 16:10               ` Yongqiang Yang
2011-05-05 11:29                 ` Pádraig Brady
2011-05-05 11:47                   ` Yongqiang Yang
2011-04-14 17:27           ` Jim Meyering
2011-04-14 19:13             ` Pádraig Brady
2011-04-14 19:39             ` Jim Meyering
2011-04-14 22:59         ` Dave Chinner
2011-04-14 23:29           ` Pádraig Brady
2011-04-15  0:09             ` Dave Chinner
2011-04-15  5:01               ` Andreas Dilger
2011-04-16  0:50                 ` Dave Chinner
2011-04-16  5:11                   ` Andreas Dilger
2011-04-16 12:21                     ` Theodore Tso
2011-04-18  0:40                       ` Dave Chinner
2011-04-18  2:45                         ` Andreas Dilger
2011-04-19  1:58                           ` Yongqiang Yang
2011-04-19  2:59                             ` Ted Ts'o
2011-04-19  3:05                               ` Eric Sandeen
2011-04-21 20:12                                 ` Jim Meyering
2011-04-19  3:30                               ` Yongqiang Yang
2011-04-19  4:14                               ` Dave Chinner
2011-04-19  5:27                               ` Christoph Hellwig
2011-04-19  3:44                             ` Dave Chinner
2011-04-19  6:53                               ` Yongqiang Yang
2011-04-19  7:45                                 ` Dave Chinner
2011-04-19  8:11                                   ` Yongqiang Yang
2011-04-19 14:05                                     ` Eric Sandeen
2011-04-19 14:09                                   ` Ted Ts'o
2011-04-19 14:13                                     ` Eric Sandeen
2011-04-19 16:01                                       ` Ted Ts'o
2011-04-20  1:53                                         ` Yongqiang Yang
2011-04-20 15:21                                         ` Christoph Hellwig
2011-04-20 17:21                                           ` Ted Ts'o
2011-04-19 21:08                                     ` Dave Chinner
2011-04-20 15:29                                       ` Christoph Hellwig
2011-04-16  6:05                   ` Yongqiang Yang
2011-04-18  0:35                     ` Dave Chinner [this message]
2011-04-15  8:53               ` Jim Meyering
2011-04-15 17:16                 ` Christoph Hellwig
2011-04-15 17:24                   ` Eric Blake
2011-04-15 17:26                     ` Christoph Hellwig
2011-04-15 22:28                       ` Andreas Dilger
2011-04-16  0:25                         ` Dave Chinner
2011-04-14 14:39 ` Eric Sandeen
2011-04-20 14:39 ` Jim Meyering
2011-04-21 20:01   ` Jim Meyering

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=20110418003553.GR21395@dastard \
    --to=david@fromorbit.com \
    --cc=P@draigbrady.com \
    --cc=adilger@dilger.ca \
    --cc=coreutils@gnu.org \
    --cc=linux-ext4@vger.kernel.org \
    --cc=markus@trippelsdorf.de \
    --cc=sandeen@sandeen.net \
    --cc=xfs@oss.sgi.com \
    --cc=xiaoqiangnk@gmail.com \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox