From: "Darrick J. Wong" <darrick.wong@oracle.com>
To: Dave Chinner <david@fromorbit.com>
Cc: Andreas Dilger <adilger@dilger.ca>,
linux-man@vger.kernel.org, linux-api@vger.kernel.org,
xfs <xfs@oss.sgi.com>,
linux-xfs@vger.kernel.org, mtk.manpages@gmail.com,
linux-fsdevel@vger.kernel.org, linux-btrfs@vger.kernel.org
Subject: Re: [PATCH 3/3] ioctl_xfs_ioc_getfsmap.2: document XFS_IOC_GETFSMAP ioctl
Date: Thu, 8 Sep 2016 23:07:16 -0700 [thread overview]
Message-ID: <20160909060716.GD8969@birch.djwong.org> (raw)
In-Reply-To: <20160908233806.GB30056@dastard>
On Fri, Sep 09, 2016 at 09:38:06AM +1000, Dave Chinner wrote:
> On Tue, Aug 30, 2016 at 12:09:49PM -0700, Darrick J. Wong wrote:
> > > I recall for FIEMAP that some filesystems may not have files aligned
> > > to sector offsets, and we just used byte offsets. Storage like
> > > NVDIMMs are cacheline granular, so I don't think it makes sense to
> > > tie this to old disk sector sizes. Alternately, the units could be
> > > in terms of fs blocks as returned by statvfs.st_bsize, but mixing
> > > units for fmv_block, fmv_offset, fmv_length is uneeded complexity.
> >
> > Ugh. I'd rather just change the units to bytes rather than force all
> > the users to multiply things. :)
>
> Yup, units need to be either in disk addresses (i.e. 512 byte units)
> or bytes. If people can't handle disk addresses (seems to be the
> case), the bytes it should be.
<nod>
> > I'd much rather just add more special owner codes for any other
> > filesystem that has distinguishable metadata types that are not
> > covered by the existing OWN_ codes. We /do/ have 2^64 possible
> > values, so it's not like we're going to run out.
>
> This is diagnositc information as much as anything, just like
> fiemap is diagnostic information. So if we have specific type
> information, it needs to be reported accurately to be useful.
>
> Hence I really don't care if the users and developers of other fs
> types don't understand what the special owner codes that a specific
> filesystem returns mean. i.e. it's not useful user information -
> only a tool that groks the specific filesystem is going to be able
> to anything useful with special owner codes. So, IMO, there's little
> point trying to make them generic or to even trying to define and
> explain them in the man page....
<shrug> I'm ok with describing generally what each special owner code
means. Maybe the manpage could be more explicit about "None of these
codes are useful unless you're a low level filesystem tool"?
> > > It seems like there are several fields in the structure that are used for
> > > only input or only output? Does it make more sense to have one structure
> > > used only for the input request, and then the array of values returned be
> > > in a different structure? I'm not necessarily requesting that it be changed,
> > > but it definitely is something I noticed a few times while reading this doc.
> >
> > I've been thinking about rearranging this a bit, since the flags
> > handling is very awkward with the current array structure. Each
> > rmap has its own flags; we may someday want to pass operation flags
> > into the ioctl; and we currently have one operation flag to pass back
> > to userspace. Each of those flags can be a separate field. I think
> > people will get confused about FMV_OF_* and FMV_HOF_* being referenced
> > in oflags, and iflags has no meaning for returned records.
>
> Yup, that's what I initially noticed when I glanced at this. The XFS
> getbmap interface is just plain nasty, and we shouldn't be copying
> that API pattern if we can help it.
Lol ok. :)
> > So, this instead?
> >
> > struct getfsmap_rec {
> > u32 device; /* device id */
> > u32 flags; /* mapping flags */
> > u64 block; /* physical addr, bytes */
> > u64 owner; /* inode or special owner code */
> > u64 offset; /* file offset of mapping, bytes */
> > u64 length; /* length of segment, bytes */
> > u64 reserved; /* will be set to zero */
> > }; /* 48 bytes */
> >
> > struct getfsmap_head {
> > u32 iflags; /* none defined yet */
> > u32 oflags; /* FMV_HOF_DEV_T */
> > u32 count; /* # entries in recs array */
> > u32 entries; /* # entries filled in (output) */
> > u64 reserved[2]; /* must be zero */
> >
> > struct getfsmap_rec keys[2]; /* low and high keys for the mapping search */
> > struct getfsmap_rec recs[0];
> > }; /* 32 bytes + 2*48 = 128 bytes */
> >
> > #define XFS_IOC_GETFSMAP _IOWR('X', 59, struct getfsmap_head)
> >
> > This also means that userspace can set up for the next ioctl
> > invocation with memcpy(&head->keys[0], &head->recs[head->entries - 1]).
> >
> > Yes, I think I like this better. Everyone else, please chime in. :)
>
> That's pretty much the structure I was going to suggest - it matches
> the fiemap pattern. i.e control parameters are separated from record
> data. I'd dump a bit more reserved space in the structure, though;
> we've got heaps of flag space for future expansion, but if we need
> to pass new parameters into/out of the kernel we'll quickly use the
> reserved space.
I padded struct fsmap with enough reserved space to make it an even 64 bytes,
and padded struct fsmap_head so that the space before keys is 64 bytes in
length. See v3 patch of the ioctl manpage.
--D
>
> Cheers,
>
> Dave.
> --
> Dave Chinner
> david@fromorbit.com
next prev parent reply other threads:[~2016-09-09 6:07 UTC|newest]
Thread overview: 12+ messages / expand[flat|nested] mbox.gz Atom feed top
2016-08-25 23:26 [PATCH v8 0/3] man-pages: fix reflink/dedupe ioctl manpages Darrick J. Wong
2016-08-25 23:26 ` [PATCH 1/3] man2/fallocate.2: document behavior with shared blocks Darrick J. Wong
2016-08-25 23:26 ` [PATCH 2/3] man2/ioctl_fideduperange.2: clarify operation some more Darrick J. Wong
2016-08-25 23:26 ` [PATCH 3/3] ioctl_xfs_ioc_getfsmap.2: document XFS_IOC_GETFSMAP ioctl Darrick J. Wong
2016-08-29 21:34 ` Andreas Dilger
2016-08-30 19:09 ` Darrick J. Wong
2016-09-08 23:38 ` Dave Chinner
2016-09-09 6:07 ` Darrick J. Wong [this message]
2016-09-10 0:00 ` Dave Chinner
2016-09-11 18:56 ` Darrick J. Wong
2016-09-04 5:36 ` [PATCH v2 3/3] ioctl_getfsmap.2: document the GETFSMAP ioctl Darrick J. Wong
2016-09-09 6:17 ` [PATCH v3 " Darrick J. Wong
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=20160909060716.GD8969@birch.djwong.org \
--to=darrick.wong@oracle.com \
--cc=adilger@dilger.ca \
--cc=david@fromorbit.com \
--cc=linux-api@vger.kernel.org \
--cc=linux-btrfs@vger.kernel.org \
--cc=linux-fsdevel@vger.kernel.org \
--cc=linux-man@vger.kernel.org \
--cc=linux-xfs@vger.kernel.org \
--cc=mtk.manpages@gmail.com \
--cc=xfs@oss.sgi.com \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).