* [RFC PATCH] c/r: fixed false complaint on "unlinked" pipes, socket, etc
@ 2010-02-05 19:57 Oren Laadan
[not found] ` <1265399854-26534-1-git-send-email-orenl-eQaUEPhvms7ENvBUuze7eA@public.gmane.org>
0 siblings, 1 reply; 4+ messages in thread
From: Oren Laadan @ 2010-02-05 19:57 UTC (permalink / raw)
To: Serge Hallyn; +Cc: containers-cunTk1MwBs9QetFLy7KEm3xJsTq8ys+cHZ5vskTnxNA
Commit a3a065e3f13da8a3470ed09c7f38aad256083726 changes the behavior
of dentry for pipe, sockets and anon_inodes (eventfd, timerfd, epoll,
signalfd and perf...):
"""
Filesystems outside the regular namespace do not have to clear
in order to have a working /proc/$pid/fd/XXX. Nothing in proc prevents the
fd link from being used if its dentry is not in the hash.
Also, it does not get put into the dcache hash if DCACHE_UNHASHED is clear;
that depends on the filesystem calling d_add or d_rehash.
So delete the misleading comments and needless code.
"""
That made checkpoint complain on half-closed pipes which are
DCACHE_UNHASHED and trigger the d_unlinked() test.
This patch proposes one fix - which is to mark such dentry with a
special flag DCACHE_SPECIAL_INODE, which indicates that this dentry
is an exception, and the d_unlinked() doesn't matter.
An alternative approach would be to add an unlinked() method to file
operations: if it's null, then checkpoint will test with d_unlinked()
otherwise it will test with the specific method. for these filesystems
we'll make this method return "ok" always. This may also be useful if
unlinking a file becomes a per-filesystem operation.
Comments ?
Oren.
---
checkpoint/files.c | 2 +-
fs/anon_inodes.c | 1 +
fs/pipe.c | 1 +
include/linux/dcache.h | 14 +++++++++++++-
net/socket.c | 1 +
5 files changed, 17 insertions(+), 2 deletions(-)
diff --git a/checkpoint/files.c b/checkpoint/files.c
index d1242f2..fa318f3 100644
--- a/checkpoint/files.c
+++ b/checkpoint/files.c
@@ -231,7 +231,7 @@ int checkpoint_file(struct ckpt_ctx *ctx, void *ptr)
file, file->f_op);
return -EBADF;
}
- if (d_unlinked(file->f_dentry)) {
+ if (!d_special_inode(file->f_dentry) && d_unlinked(file->f_dentry)) {
ckpt_err(ctx, -EBADF, "%(T)%(P)Unlinked files unsupported\n",
file);
return -EBADF;
diff --git a/fs/anon_inodes.c b/fs/anon_inodes.c
index 9f0bf13..a7bf417 100644
--- a/fs/anon_inodes.c
+++ b/fs/anon_inodes.c
@@ -118,6 +118,7 @@ struct file *anon_inode_getfile(const char *name,
atomic_inc(&anon_inode_inode->i_count);
path.dentry->d_op = &anon_inodefs_dentry_operations;
+ path.dentry->d_flags |= DCACHE_SPECIAL_INODE;
d_instantiate(path.dentry, anon_inode_inode);
error = -ENFILE;
diff --git a/fs/pipe.c b/fs/pipe.c
index 8c79493..2fd5ad6 100644
--- a/fs/pipe.c
+++ b/fs/pipe.c
@@ -1213,6 +1213,7 @@ struct file *create_write_pipe(int flags)
path.mnt = mntget(pipe_mnt);
path.dentry->d_op = &pipefs_dentry_operations;
+ path.dentry->d_flags |= DCACHE_SPECIAL_INODE;
d_instantiate(path.dentry, inode);
err = -ENFILE;
diff --git a/include/linux/dcache.h b/include/linux/dcache.h
index 30b93b2..af01318 100644
--- a/include/linux/dcache.h
+++ b/include/linux/dcache.h
@@ -186,6 +186,13 @@ d_iput: no no no yes
#define DCACHE_FSNOTIFY_PARENT_WATCHED 0x0080 /* Parent inode is watched by some fsnotify listener */
+#define DCACHE_SPECIAL_INODE 0x0100
+ /* This dentry corresponds to a special inode, like anon-inode,
+ * pipe, and sockets. It is set once by the file-system. Used by
+ * checkpoint/restart to ignore a positive d_unlinked() for such
+ * dentries (a closed pipe is not a unlinked file)
+ */
+
extern spinlock_t dcache_lock;
extern seqlock_t rename_lock;
@@ -347,7 +354,12 @@ extern struct dentry * dget_locked(struct dentry *);
*
* Returns true if the dentry passed is not currently hashed.
*/
-
+
+static inline int d_special_inode(struct dentry *dentry)
+{
+ return (dentry->d_flags & DCACHE_SPECIAL_INODE);
+}
+
static inline int d_unhashed(struct dentry *dentry)
{
return (dentry->d_flags & DCACHE_UNHASHED);
diff --git a/net/socket.c b/net/socket.c
index 3253c04..dd5983a 100644
--- a/net/socket.c
+++ b/net/socket.c
@@ -365,6 +365,7 @@ int sock_alloc_file(struct socket *sock, struct file **f, int flags)
path.mnt = mntget(sock_mnt);
path.dentry->d_op = &sockfs_dentry_operations;
+ path.dentry->d_flags |= DCACHE_SPECIAL_INODE;
d_instantiate(path.dentry, SOCK_INODE(sock));
SOCK_INODE(sock)->i_fop = &socket_file_ops;
--
1.6.3.3
^ permalink raw reply related [flat|nested] 4+ messages in thread
* Re: [RFC PATCH] c/r: fixed false complaint on "unlinked" pipes, socket, etc
[not found] ` <1265399854-26534-1-git-send-email-orenl-eQaUEPhvms7ENvBUuze7eA@public.gmane.org>
@ 2010-02-06 0:08 ` Serge E. Hallyn
2010-02-10 16:24 ` Dan Smith
1 sibling, 0 replies; 4+ messages in thread
From: Serge E. Hallyn @ 2010-02-06 0:08 UTC (permalink / raw)
To: Oren Laadan; +Cc: containers-cunTk1MwBs9QetFLy7KEm3xJsTq8ys+cHZ5vskTnxNA
Seems worth a shot. May get shot down by Al or someone, but it
seems a good place to start.
There may be opposition to the name, 'special', being too generic,
but I can't think of a better name (never_linked, ...)
-serge
Quoting Oren Laadan (orenl-eQaUEPhvms7ENvBUuze7eA@public.gmane.org):
> Commit a3a065e3f13da8a3470ed09c7f38aad256083726 changes the behavior
> of dentry for pipe, sockets and anon_inodes (eventfd, timerfd, epoll,
> signalfd and perf...):
>
> """
> Filesystems outside the regular namespace do not have to clear
> in order to have a working /proc/$pid/fd/XXX. Nothing in proc prevents the
> fd link from being used if its dentry is not in the hash.
>
> Also, it does not get put into the dcache hash if DCACHE_UNHASHED is clear;
> that depends on the filesystem calling d_add or d_rehash.
>
> So delete the misleading comments and needless code.
> """
>
> That made checkpoint complain on half-closed pipes which are
> DCACHE_UNHASHED and trigger the d_unlinked() test.
>
> This patch proposes one fix - which is to mark such dentry with a
> special flag DCACHE_SPECIAL_INODE, which indicates that this dentry
> is an exception, and the d_unlinked() doesn't matter.
>
> An alternative approach would be to add an unlinked() method to file
> operations: if it's null, then checkpoint will test with d_unlinked()
> otherwise it will test with the specific method. for these filesystems
> we'll make this method return "ok" always. This may also be useful if
> unlinking a file becomes a per-filesystem operation.
>
> Comments ?
>
>
> Oren.
>
>
> ---
> checkpoint/files.c | 2 +-
> fs/anon_inodes.c | 1 +
> fs/pipe.c | 1 +
> include/linux/dcache.h | 14 +++++++++++++-
> net/socket.c | 1 +
> 5 files changed, 17 insertions(+), 2 deletions(-)
>
> diff --git a/checkpoint/files.c b/checkpoint/files.c
> index d1242f2..fa318f3 100644
> --- a/checkpoint/files.c
> +++ b/checkpoint/files.c
> @@ -231,7 +231,7 @@ int checkpoint_file(struct ckpt_ctx *ctx, void *ptr)
> file, file->f_op);
> return -EBADF;
> }
> - if (d_unlinked(file->f_dentry)) {
> + if (!d_special_inode(file->f_dentry) && d_unlinked(file->f_dentry)) {
> ckpt_err(ctx, -EBADF, "%(T)%(P)Unlinked files unsupported\n",
> file);
> return -EBADF;
> diff --git a/fs/anon_inodes.c b/fs/anon_inodes.c
> index 9f0bf13..a7bf417 100644
> --- a/fs/anon_inodes.c
> +++ b/fs/anon_inodes.c
> @@ -118,6 +118,7 @@ struct file *anon_inode_getfile(const char *name,
> atomic_inc(&anon_inode_inode->i_count);
>
> path.dentry->d_op = &anon_inodefs_dentry_operations;
> + path.dentry->d_flags |= DCACHE_SPECIAL_INODE;
> d_instantiate(path.dentry, anon_inode_inode);
>
> error = -ENFILE;
> diff --git a/fs/pipe.c b/fs/pipe.c
> index 8c79493..2fd5ad6 100644
> --- a/fs/pipe.c
> +++ b/fs/pipe.c
> @@ -1213,6 +1213,7 @@ struct file *create_write_pipe(int flags)
> path.mnt = mntget(pipe_mnt);
>
> path.dentry->d_op = &pipefs_dentry_operations;
> + path.dentry->d_flags |= DCACHE_SPECIAL_INODE;
> d_instantiate(path.dentry, inode);
>
> err = -ENFILE;
> diff --git a/include/linux/dcache.h b/include/linux/dcache.h
> index 30b93b2..af01318 100644
> --- a/include/linux/dcache.h
> +++ b/include/linux/dcache.h
> @@ -186,6 +186,13 @@ d_iput: no no no yes
>
> #define DCACHE_FSNOTIFY_PARENT_WATCHED 0x0080 /* Parent inode is watched by some fsnotify listener */
>
> +#define DCACHE_SPECIAL_INODE 0x0100
> + /* This dentry corresponds to a special inode, like anon-inode,
> + * pipe, and sockets. It is set once by the file-system. Used by
> + * checkpoint/restart to ignore a positive d_unlinked() for such
> + * dentries (a closed pipe is not a unlinked file)
> + */
> +
> extern spinlock_t dcache_lock;
> extern seqlock_t rename_lock;
>
> @@ -347,7 +354,12 @@ extern struct dentry * dget_locked(struct dentry *);
> *
> * Returns true if the dentry passed is not currently hashed.
> */
> -
> +
> +static inline int d_special_inode(struct dentry *dentry)
> +{
> + return (dentry->d_flags & DCACHE_SPECIAL_INODE);
> +}
> +
> static inline int d_unhashed(struct dentry *dentry)
> {
> return (dentry->d_flags & DCACHE_UNHASHED);
> diff --git a/net/socket.c b/net/socket.c
> index 3253c04..dd5983a 100644
> --- a/net/socket.c
> +++ b/net/socket.c
> @@ -365,6 +365,7 @@ int sock_alloc_file(struct socket *sock, struct file **f, int flags)
> path.mnt = mntget(sock_mnt);
>
> path.dentry->d_op = &sockfs_dentry_operations;
> + path.dentry->d_flags |= DCACHE_SPECIAL_INODE;
> d_instantiate(path.dentry, SOCK_INODE(sock));
> SOCK_INODE(sock)->i_fop = &socket_file_ops;
>
> --
> 1.6.3.3
^ permalink raw reply [flat|nested] 4+ messages in thread
* Re: [RFC PATCH] c/r: fixed false complaint on "unlinked" pipes, socket, etc
[not found] ` <1265399854-26534-1-git-send-email-orenl-eQaUEPhvms7ENvBUuze7eA@public.gmane.org>
2010-02-06 0:08 ` Serge E. Hallyn
@ 2010-02-10 16:24 ` Dan Smith
[not found] ` <87tytpgitd.fsf-FLMGYpZoEPULwtHQx/6qkW3U47Q5hpJU@public.gmane.org>
1 sibling, 1 reply; 4+ messages in thread
From: Dan Smith @ 2010-02-10 16:24 UTC (permalink / raw)
To: Oren Laadan; +Cc: containers-cunTk1MwBs9QetFLy7KEm3xJsTq8ys+cHZ5vskTnxNA
OL> Comments ?
Works for me (although the issue remains that find_locks_with_owner()
thinks there are locks present on a socket).
--
Dan Smith
IBM Linux Technology Center
email: danms-r/Jw6+rmf7HQT0dZR+AlfA@public.gmane.org
^ permalink raw reply [flat|nested] 4+ messages in thread
* Re: [RFC PATCH] c/r: fixed false complaint on "unlinked" pipes, socket, etc
[not found] ` <87tytpgitd.fsf-FLMGYpZoEPULwtHQx/6qkW3U47Q5hpJU@public.gmane.org>
@ 2010-02-10 16:47 ` Oren Laadan
0 siblings, 0 replies; 4+ messages in thread
From: Oren Laadan @ 2010-02-10 16:47 UTC (permalink / raw)
To: Dan Smith
Cc: containers-cunTk1MwBs9QetFLy7KEm3xJsTq8ys+cHZ5vskTnxNA,
Dave Hansen
Heh ... following the discussion on the IRC, I actually prefer
a solution that will push tests that are not filesystm-agnostic
to the filesystem-specific .checkpoint method. Then also put
it in the generic .checkpoint method to catch the "default" case.
I think it's cleaner and does not introduce needless noise in
the kernel.
FWIW, not only for the d_unlinked() test, but also for the locks
issue; even though locks are supposed to be VFS generic and not
depend on specific filesystem, apparently they do. (Either that
or fix find_locks_with_owner() in a generic way).
Oren.
Dan Smith wrote:
> OL> Comments ?
>
> Works for me (although the issue remains that find_locks_with_owner()
> thinks there are locks present on a socket).
>
^ permalink raw reply [flat|nested] 4+ messages in thread
end of thread, other threads:[~2010-02-10 16:47 UTC | newest]
Thread overview: 4+ messages (download: mbox.gz follow: Atom feed
-- links below jump to the message on this page --
2010-02-05 19:57 [RFC PATCH] c/r: fixed false complaint on "unlinked" pipes, socket, etc Oren Laadan
[not found] ` <1265399854-26534-1-git-send-email-orenl-eQaUEPhvms7ENvBUuze7eA@public.gmane.org>
2010-02-06 0:08 ` Serge E. Hallyn
2010-02-10 16:24 ` Dan Smith
[not found] ` <87tytpgitd.fsf-FLMGYpZoEPULwtHQx/6qkW3U47Q5hpJU@public.gmane.org>
2010-02-10 16:47 ` Oren Laadan
This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.