From: Allison Henderson <allison.henderson@oracle.com>
To: "Darrick J. Wong" <darrick.wong@oracle.com>
Cc: linux-xfs@vger.kernel.org
Subject: Re: [PATCH v3 17/17] Add parent pointer ioctl
Date: Wed, 29 Nov 2017 11:52:36 -0700 [thread overview]
Message-ID: <13a1961e-73c5-4e1b-0583-4faf637102b9@oracle.com> (raw)
In-Reply-To: <20171128203537.GZ21412@magnolia>
On 11/28/2017 01:35 PM, Darrick J. Wong wrote:
> On Fri, Nov 17, 2017 at 11:21:45AM -0700, Allison Henderson wrote:
>> This patch adds a new file ioctl to retrieve the parent
>> pointer of a given inode
>>
>> Signed-off-by: Allison Henderson<allison.henderson@oracle.com>
>> ---
>> fs/xfs/libxfs/xfs_attr.c | 67 ++++++++++++++++++++++++++++++++++++++++++++++++
>> fs/xfs/libxfs/xfs_fs.h | 1 +
>> fs/xfs/xfs_attr.h | 2 ++
>> fs/xfs/xfs_attr_list.c | 3 +++
>> fs/xfs/xfs_ioctl.c | 48 +++++++++++++++++++++++++++++++++-
>> 5 files changed, 120 insertions(+), 1 deletion(-)
>>
>> diff --git a/fs/xfs/libxfs/xfs_attr.c b/fs/xfs/libxfs/xfs_attr.c
>> index 9d4d883..d2be842 100644
>> --- a/fs/xfs/libxfs/xfs_attr.c
>> +++ b/fs/xfs/libxfs/xfs_attr.c
>> @@ -134,6 +134,73 @@ xfs_attr_get_ilocked(
>> return xfs_attr_node_get(args);
>> }
>>
>> +/*
>> + * Get the parent pointer for a given inode
>> + * Caller will need to allocate a buffer pointed to by xpnir->p_name
>> + * and store the buffer size in xpnir->p_namelen. The parent
>> + * pointer will be stored in the given xfs_parent_name_irec
>> + *
>> + * Returns 0 on success and non zero on error
>> + */
>> +int
>> +xfs_attr_get_parent_pointer(struct xfs_inode *ip,
>> + struct xfs_parent_name_irec *xpnir)
> Please fix the parameter list here.
>
>> +{
>> + struct attrlist *alist;
>> + struct attrlist_ent *aent;
>> + struct attrlist_cursor_kern cursor;
>> + struct xfs_parent_name_rec *xpnr;
>> + char *namebuf;
>> + int error = 0;
>> + unsigned int flags = ATTR_PARENT;
>> +
>> + /* Allocate a buffer to store the attribute names */
>> + namebuf = kmem_zalloc_large(XFS_XATTR_LIST_MAX, KM_SLEEP);
>> + if (!namebuf)
>> + return -ENOMEM;
>> +
>> + /* Get all attribute names that have the ATTR_PARENT flag */
>> + memset(&cursor, 0, sizeof(struct attrlist_cursor_kern));
>> + error = xfs_attr_list(ip, namebuf, XFS_XATTR_LIST_MAX, flags, &cursor);
>> + if (error)
>> + goto out_kfree;
>> +
>> + alist = (struct attrlist *)namebuf;
>> +
>> + /* There should never be more than one parent pointer */
>> + ASSERT(alist->al_count == 1);
> As mentioned earlier, this is not true. Files can have multiple parents.
>
>> + aent = (struct attrlist_ent *) &namebuf[alist->al_offset[0]];
>> + xpnr = (struct xfs_parent_name_rec *)(aent->a_name);
>> +
>> + /*
>> + * The value of the parent pointer attribute should be the file name
>> + * So we check the value length of the attribute entry against the name
>> + * length of the parent name record to make sure the caller gave enough
>> + * buffer space to store the file name (plus a null terminator)
>> + */
>> + if (aent->a_valuelen >= xpnir->p_namelen) {
>> + error = -ERANGE;
>> + goto out_kfree;
>> + }
>> +
>> + xpnir->p_namelen = aent->a_valuelen + 1;
>> + memset((void *)(xpnir->p_name), 0, xpnir->p_namelen);
>> + error = xfs_attr_get(ip, (char *)xpnr,
>> + sizeof(struct xfs_parent_name_rec),
>> + (unsigned char *)(xpnir->p_name),
>> + (int *)&(xpnir->p_namelen), flags);
>> + if (error)
>> + goto out_kfree;
>> +
>> + xfs_init_parent_name_irec(xpnir, xpnr);
>> +
>> +out_kfree:
>> + kmem_free(namebuf);
>> +
>> + return error;
>> +}
>> +
>> /* Retrieve an extended attribute by name, and its value. */
>> int
>> xfs_attr_get(
>> diff --git a/fs/xfs/libxfs/xfs_fs.h b/fs/xfs/libxfs/xfs_fs.h
>> index b8108f8..2f9ca2c 100644
>> --- a/fs/xfs/libxfs/xfs_fs.h
>> +++ b/fs/xfs/libxfs/xfs_fs.h
>> @@ -512,6 +512,7 @@ typedef struct xfs_swapext
>> #define XFS_IOC_ZERO_RANGE _IOW ('X', 57, struct xfs_flock64)
>> #define XFS_IOC_FREE_EOFBLOCKS _IOR ('X', 58, struct xfs_fs_eofblocks)
>> /* XFS_IOC_GETFSMAP ------ hoisted 59 */
>> +#define XFS_IOC_GETPPOINTER _IOR ('X', 61, struct xfs_parent_name_irec)
> I don't think it's a good idea to expose internal data structures
> directly to userspace, because that inhibits our ability to change the
> in-core data structure.
Yes, this part I already have that separated in my local copy
> Furthermore, hardlinked files can have multiple parent pointers, so it's
> not going to suffice to return a single parent pointer entry. Given
> that there can be potentially 2^32 parents, we're going to need a data
> structure for the ioctl to store (in an opaque manner) the attribute
> iteration cursor and have space to pass back some number of parent
> pointers.
>
> (Yes, it's time to start talking about actual use cases...)
>
> At a bare minimum, this is what I pictured for the "return parents of
> the open file" ioctl:
>
> #define XFS_PPTR_MAXNAMELEN 255
>
> struct xfs_pptr {
> u64 pp_ino;
> u32 pp_gen;
> u8 pp_namelen;
> u8 pp_name[XFS_PPTR_MAXNAMELEN];
> };
>
> /* return parents of the handle, instead of the open fd */
> #define XFS_PPTR_FLAG_HANDLE (1u << 0)
>
> struct xfs_pptr_info {
> struct xfs_fsop_handlereq pi_handle;
> struct xfs_attrlist_cursor pi_cursor;
> u32 pi_flags;
> u32 pi_reserved;
> u32 pi_ptrs_size;
> u32 pi_ptrs_used;
> u64 pi_reserved2[6];
> struct xfs_pptr pi_ptrs[0];
> };
>
> #define XFS_PPTR_INFO_SIZEOF(ptrs) (sizeof(struct xfs_pptr_info) + \
> ((ptrs) * sizeof(struct xfs_pptr)));
>
> static inline struct xfs_pptr_info *
> xfs_pptr_alloc(
> size_t nr_ptrs)
> {
> struct xfs_pptr_info *ppi;
>
> ppi = malloc(XFS_PPTR_INFO_SIZEOF(nr_ptrs));
> if (!ppi)
> return NULL;
> memset(ppi, 0, XFS_PPTR_INFO_SIZEOF(nr_ptrs));
> ppi->pi_ptrs_size = nr_ptrs;
> return ppi;
> }
>
> With the following example userspace program (that does no checking
> whatsoever):
>
> int main(int argc, char *argv[])
> {
> struct xfs_pptr_info *ppi;
> struct xfs_pptr *pp;
> int fd;
>
> fd = open(argv[1], O_RDONLY);
> ppi = xfs_pptr_alloc(32);
>
> while (ioctl(fd, XFS_IOC_GETPPOINTER, ppi) == 0 && ppi->pi_ptrs_used) {
> for (i = 0; i < ppi->pi_ptrs_used; i++) {
> printf("%llu:%u -> %s\n",
> ppi->pi_ptrs[i].pp_ino,
> ppi->pi_ptrs[i].pp_gen,
> ppi->pi_ptrs[i].pp_name);
> }
> }
> }
>
> Notice here how we the userspace structure contains an opaque attribute
> list cursor, so we can keep coming back for more parent pointers until
> we run out of xattrs (and pi_ptrs_used == 0). The kernel will copy its
> internal cursor out to the userspace buffer as an opaque cookie prior to
> returning.
>
> From this simple implementation it shouldn't be difficult to finish the
> parents_by_handle/parentpaths_by_handle functions in libhandle, though
> given that they've never been implemented in Linux and we no longer care
> about Irix, you've some flexibility to change those library functions if
> that is convenient for setting up xfstests.
Wow, ok that makes a lot of sense. I will follow your model here and get
it fleshed out. Thank you!
> Speaking of xfstests... what are the initial test cases? I figured at
> least the following:
>
> 0) mkfs with protofile, make sure the parent records get created
> 1) create file, check parent records
> 2) hardlink file, check both parent records
> 3) delete one link of a hardlinked file, check parent records
> 4) hardlink a file a few thousand times, check that the iteration
> scheme laid out above actually works
> 5) rename a file within a directory, check the parent records
> 6) rename a file across directories, check the parent records
> 7) some sort of testing where we run out of space while updating pptrs
> 8) add some error injection knobs to make sure that pptr replay actually
> works correctly
>
> Can you think of other test cases?
I think that is a good start. This looks similar to what I've been
doing by hand to stabilize things as I go along. I'll have to work on
developing an inject knob for the last one.
> For xfs_scrub, we want to be able to query the parents of any (damaged)
> inode we find in the filesystem. If the inode is so damaged we can't
> open it (or it's a special file) then scrub has to construct a file
> handle and pass that in via pi_handle.
Alrighty, I will take a look at those routines and see if I can put
together something that reconstructs the parent pointers with out
opening the inode
> I /also/ wonder if there's any interest in having a fallback for
> non-pptr filesystems that walks the dentry->d_parent links (like
> d_paths() does) back to the root. Such a fallback will only work on an
> opened dir or a file opened by path (i.e. not a handle), however, which
> limits its appeal.
>
> --D
You mean a way to get the parent pointer even if they chose not to
enable the feature flag? I think its something we could investigate,
but I think you're right in that the limitations might not make it quite
as valuable. IMHO I think maybe getting the full version working first
might give people a chance to appreciate what it can do, and if it turns
out to be something that people end up using a lot, then it might
generate more demand for the "light" version. :-)
>> /*
>> * ioctl commands that replace IRIX syssgi()'s
>> diff --git a/fs/xfs/xfs_attr.h b/fs/xfs/xfs_attr.h
>> index 0829687..0ec3458 100644
>> --- a/fs/xfs/xfs_attr.h
>> +++ b/fs/xfs/xfs_attr.h
>> @@ -172,6 +172,8 @@ int xfs_attr_get(struct xfs_inode *ip, const unsigned char *name,
>> int flags);
>> int xfs_attr_set(struct xfs_inode *dp, const unsigned char *name,
>> size_t namelen, unsigned char *value, int valuelen, int flags);
>> +int xfs_attr_get_parent_pointer(struct xfs_inode *ip,
>> + struct xfs_parent_name_irec *xpnir);
>> int xfs_attr_set_args(struct xfs_da_args *args, int flags, bool roll_trans);
>> int xfs_attr_remove(struct xfs_inode *dp, const unsigned char *name,
>> size_t namelen, int flags);
>> diff --git a/fs/xfs/xfs_attr_list.c b/fs/xfs/xfs_attr_list.c
>> index 7740c8a..78fc477 100644
>> --- a/fs/xfs/xfs_attr_list.c
>> +++ b/fs/xfs/xfs_attr_list.c
>> @@ -534,6 +534,9 @@ xfs_attr_put_listent(
>> if (((context->flags & ATTR_ROOT) == 0) !=
>> ((flags & XFS_ATTR_ROOT) == 0))
>> return;
>> + if (((context->flags & ATTR_PARENT) == 0) !=
>> + ((flags & XFS_ATTR_PARENT) == 0))
>> + return;
>>
>> arraytop = sizeof(*alist) +
>> context->count * sizeof(alist->al_offset[0]);
>> diff --git a/fs/xfs/xfs_ioctl.c b/fs/xfs/xfs_ioctl.c
>> index 4664314..5492607 100644
>> --- a/fs/xfs/xfs_ioctl.c
>> +++ b/fs/xfs/xfs_ioctl.c
>> @@ -44,6 +44,7 @@
>> #include "xfs_btree.h"
>> #include <linux/fsmap.h>
>> #include "xfs_fsmap.h"
>> +#include "xfs_attr.h"
>>
>> #include <linux/capability.h>
>> #include <linux/cred.h>
>> @@ -1710,6 +1711,50 @@ xfs_ioc_getfsmap(
>> return 0;
>> }
>>
>> +/*
>> + * IOCTL routine to get the parent pointer of an inode and return it to user
>> + * space. Caller must pass an struct xfs_parent_name_irec with a name buffer
>> + * large enough to hold the file name. Returns 0 on success or non-zero on
>> + * failure
>> + */
>> +STATIC int
>> +xfs_ioc_get_parent_pointer(
>> + struct file *filp,
>> + void __user *arg)
>> +{
>> + struct inode *inode = file_inode(filp);
>> + struct xfs_inode *ip = XFS_I(inode);
>> + struct xfs_parent_name_irec xpnir;
>> + char *uname;
>> + char *kname;
>> + int error = 0;
>> +
>> + copy_from_user(&xpnir, arg, sizeof(struct xfs_parent_name_irec));
>> + uname = (char *)xpnir.p_name;
>> +
>> + /*
>> + * Use kernel space memory to get the parent pointer name.
>> + * We'll copy it to the user space name back when we're done
>> + */
>> + kname = kmem_zalloc_large(xpnir.p_namelen, KM_SLEEP);
> Please sanity-check the amount of memory we try to allocate.
>
>> + if (!kname)
>> + return -ENOMEM;
>> +
>> + xpnir.p_name = kname;
>> + error = xfs_attr_get_parent_pointer(ip, &xpnir);
>> +
>> + if (error)
>> + goto out;
>> +
>> + copy_to_user(uname, xpnir.p_name, xpnir.p_namelen);
>> + xpnir.p_name = uname;
>> + copy_to_user(arg, &xpnir, sizeof(struct xfs_parent_name_irec));
>> +
>> +out:
>> + kmem_free(kname);
>> + return error;
>> +}
>> +
>> int
>> xfs_ioc_swapext(
>> xfs_swapext_t *sxp)
>> @@ -1866,7 +1911,8 @@ xfs_file_ioctl(
>> return xfs_ioc_getxflags(ip, arg);
>> case XFS_IOC_SETXFLAGS:
>> return xfs_ioc_setxflags(ip, filp, arg);
>> -
>> + case XFS_IOC_GETPPOINTER:
>> + return xfs_ioc_get_parent_pointer(filp, arg);
>> case XFS_IOC_FSSETDM: {
>> struct fsdmidata dmi;
>>
>> --
>> 2.7.4
>>
>> --
>> To unsubscribe from this list: send the line "unsubscribe linux-xfs" in
>> the body of a message tomajordomo@vger.kernel.org
>> More majordomo info athttps://urldefense.proofpoint.com/v2/url?u=http-3A__vger.kernel.org_majordomo-2Dinfo.html&d=DwIBAg&c=RoP1YumCXCgaWHvlZYR8PZh8Bv7qIrMUB65eapI_JnE&r=LHZQ8fHvy6wDKXGTWcm97burZH5sQKHRDMaY1UthQxc&m=4f7DOEYDfWf_ZRdBfE0cU7L0QfDJjKolv1tc2HeLeck&s=6K6iOFwNgQv30L_9mpWjoAPsnvxojOglPp6hADhWRb8&e=
> --
> To unsubscribe from this list: send the line "unsubscribe linux-xfs" in
> the body of a message tomajordomo@vger.kernel.org
> More majordomo info athttps://urldefense.proofpoint.com/v2/url?u=http-3A__vger.kernel.org_majordomo-2Dinfo.html&d=DwIBAg&c=RoP1YumCXCgaWHvlZYR8PZh8Bv7qIrMUB65eapI_JnE&r=LHZQ8fHvy6wDKXGTWcm97burZH5sQKHRDMaY1UthQxc&m=4f7DOEYDfWf_ZRdBfE0cU7L0QfDJjKolv1tc2HeLeck&s=6K6iOFwNgQv30L_9mpWjoAPsnvxojOglPp6hADhWRb8&e=
next prev parent reply other threads:[~2017-11-29 18:52 UTC|newest]
Thread overview: 69+ messages / expand[flat|nested] mbox.gz Atom feed top
2017-11-17 18:21 [PATCH v3 00/17] Parent Pointers v4 Allison Henderson
2017-11-17 18:21 ` [PATCH v3 01/17] Add helper functions xfs_attr_set_args and xfs_attr_remove_args Allison Henderson
2017-11-28 19:54 ` Darrick J. Wong
2017-11-29 1:02 ` Dave Chinner
2017-11-29 18:52 ` Allison Henderson
2017-11-29 22:34 ` Allison Henderson
2017-11-17 18:21 ` [PATCH v3 02/17] Set up infastructure for deferred attribute operations Allison Henderson
2017-11-28 19:45 ` Darrick J. Wong
2017-11-29 1:19 ` Dave Chinner
2017-11-29 18:52 ` Allison Henderson
2017-11-29 18:51 ` Allison Henderson
2017-11-17 18:21 ` [PATCH v3 03/17] Add xfs_attr_set_defered and xfs_attr_remove_defered Allison Henderson
2017-11-28 19:19 ` Darrick J. Wong
2017-11-29 18:50 ` Allison Henderson
2017-11-17 18:21 ` [PATCH v3 04/17] Remove all strlen calls in all xfs_attr_* functions for attr names Allison Henderson
2017-11-28 19:10 ` Darrick J. Wong
2017-11-29 18:50 ` Allison Henderson
2017-11-17 18:21 ` [PATCH v3 05/17] xfs: get directory offset when adding directory name Allison Henderson
2017-11-28 19:07 ` Darrick J. Wong
2017-11-29 18:50 ` Allison Henderson
2017-11-17 18:21 ` [PATCH v3 06/17] xfs: get directory offset when removing " Allison Henderson
2017-11-28 19:05 ` Darrick J. Wong
2017-11-29 18:49 ` Allison Henderson
2017-11-17 18:21 ` [PATCH v3 07/17] xfs: get directory offset when replacing a " Allison Henderson
2017-11-28 19:04 ` Darrick J. Wong
2017-11-29 18:49 ` Allison Henderson
2017-11-17 18:21 ` [PATCH v3 08/17] xfs: add parent pointer support to attribute code Allison Henderson
2017-11-28 19:01 ` Darrick J. Wong
2017-11-29 18:48 ` Allison Henderson
2017-11-17 18:21 ` [PATCH v3 09/17] xfs: define parent pointer xattr format Allison Henderson
2017-11-28 18:59 ` Darrick J. Wong
2017-11-29 18:48 ` Allison Henderson
2017-11-17 18:21 ` [PATCH v3 10/17] xfs: extent transaction reservations for parent attributes Allison Henderson
2017-11-28 18:58 ` Darrick J. Wong
2017-11-29 18:48 ` Allison Henderson
2017-11-17 18:21 ` [PATCH v3 11/17] Add the extra space requirements for parent pointer attributes when calculating the minimum log size during mkfs Allison Henderson
2017-11-28 18:51 ` Darrick J. Wong
2017-11-29 18:47 ` Allison Henderson
2017-11-29 20:18 ` Darrick J. Wong
2017-11-17 18:21 ` [PATCH v3 12/17] xfs: parent pointer attribute creation Allison Henderson
2017-11-28 18:49 ` Darrick J. Wong
2017-11-28 18:54 ` Darrick J. Wong
2017-11-29 18:46 ` Allison Henderson
2017-11-17 18:21 ` [PATCH v3 13/17] xfs: add parent attributes to link Allison Henderson
2017-11-28 18:37 ` Darrick J. Wong
2017-11-29 18:45 ` Allison Henderson
2017-11-17 18:21 ` [PATCH v3 14/17] xfs: remove parent pointers in unlink Allison Henderson
2017-11-28 18:24 ` Darrick J. Wong
2017-11-29 18:44 ` Allison Henderson
2017-11-17 18:21 ` [PATCH v3 15/17] Add parent pointers to rename Allison Henderson
2017-11-28 18:20 ` Darrick J. Wong
2017-11-29 18:43 ` Allison Henderson
2017-11-17 18:21 ` [PATCH v3 16/17] Add the parent pointer support to the superblock version 5 Allison Henderson
2017-11-28 18:08 ` Darrick J. Wong
2017-11-29 18:41 ` Allison Henderson
2017-11-17 18:21 ` [PATCH v3 17/17] Add parent pointer ioctl Allison Henderson
2017-11-22 19:54 ` Allison Henderson
2017-11-22 21:07 ` Dave Chinner
2017-11-22 22:49 ` Allison Henderson
2017-11-22 21:13 ` Darrick J. Wong
2017-11-22 22:49 ` Allison Henderson
2017-11-28 20:35 ` Darrick J. Wong
2017-11-29 18:52 ` Allison Henderson [this message]
2017-11-29 21:37 ` Dave Chinner
2017-11-29 22:48 ` Allison Henderson
2017-11-30 0:02 ` Dave Chinner
2017-11-30 1:52 ` Allison Henderson
2017-11-30 21:11 ` Darrick J. Wong
2017-12-01 2:58 ` Dave Chinner
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=13a1961e-73c5-4e1b-0583-4faf637102b9@oracle.com \
--to=allison.henderson@oracle.com \
--cc=darrick.wong@oracle.com \
--cc=linux-xfs@vger.kernel.org \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).