netdev.vger.kernel.org archive mirror
* [PATCH] fix integer overflow in H-TCP congestion control
@ 2006-10-24 14:17 Gavin McCullagh
  2006-10-24 22:30 ` David Miller
  0 siblings, 1 reply; 15+ messages in thread
From: Gavin McCullagh @ 2006-10-24 14:17 UTC (permalink / raw)
  To: NetDev; +Cc: Douglas Leith, Baruch Even


When using H-TCP with a single flow on a 500Mbit connection (or less
actually), alpha can exceed 65000, so alpha needs to be a u32.

Signed-off-by: Gavin McCullagh <gavin.mccullagh@nuim.ie>
Signed-off-by: Doug Leith <doug.leith@nuim.ie>


diff --git a/net/ipv4/tcp_htcp.c b/net/ipv4/tcp_htcp.c
index 6edfe5e..8072b6d 100644
--- a/net/ipv4/tcp_htcp.c
+++ b/net/ipv4/tcp_htcp.c
@@ -23,7 +23,7 @@ module_param(use_bandwidth_switch, int, 
 MODULE_PARM_DESC(use_bandwidth_switch, "turn on/off bandwidth switcher");
 
 struct htcp {
-       u16     alpha;          /* Fixed point arith, << 7 */
+       u32     alpha;          /* Fixed point arith, << 7 */
        u8      beta;           /* Fixed point arith, << 7 */
        u8      modeswitch;     /* Delay modeswitch until we had at least one congestion event */
        u32     last_cong;      /* Time since last congestion event end */



* Re: [PATCH] fix integer overflow in H-TCP congestion control
  2006-10-24 14:17 [PATCH] fix integer overflow in H-TCP congestion control Gavin McCullagh
@ 2006-10-24 22:30 ` David Miller
  2006-10-25  8:47   ` Gavin McCullagh
  0 siblings, 1 reply; 15+ messages in thread
From: David Miller @ 2006-10-24 22:30 UTC (permalink / raw)
  To: gavin.mccullagh; +Cc: netdev, doug.leith, baruch


Your patch doesn't apply: your email client turned the tab
characters in the patch into spaces.

Please fix and resubmit, thank you.


* Re: [PATCH] fix integer overflow in H-TCP congestion control
  2006-10-24 22:30 ` David Miller
@ 2006-10-25  8:47   ` Gavin McCullagh
  2006-10-26  6:06     ` David Miller
  0 siblings, 1 reply; 15+ messages in thread
From: Gavin McCullagh @ 2006-10-25  8:47 UTC (permalink / raw)
  To: David Miller; +Cc: netdev, doug.leith, baruch


When using H-TCP with a single flow on a 500Mbit connection (or less
actually), alpha can exceed 65000, so alpha needs to be a u32.

Signed-off-by: Gavin McCullagh <gavin.mccullagh@nuim.ie>
Signed-off-by: Doug Leith <doug.leith@nuim.ie>


diff --git a/net/ipv4/tcp_htcp.c b/net/ipv4/tcp_htcp.c
index 6edfe5e..8072b6d 100644
--- a/net/ipv4/tcp_htcp.c
+++ b/net/ipv4/tcp_htcp.c
@@ -23,7 +23,7 @@ module_param(use_bandwidth_switch, int, 
 MODULE_PARM_DESC(use_bandwidth_switch, "turn on/off bandwidth switcher");
 
 struct htcp {
-	u16	alpha;		/* Fixed point arith, << 7 */
+	u32	alpha;		/* Fixed point arith, << 7 */
 	u8	beta;           /* Fixed point arith, << 7 */
 	u8	modeswitch;     /* Delay modeswitch until we had at least one congestion event */
 	u32	last_cong;	/* Time since last congestion event end */
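
To illustrate the truncation the patch description refers to, here is a minimal user-space sketch. It assumes only what the struct comment states (alpha is kept in fixed point, scaled by 1 << 7). The helper names and the sample value 520 are hypothetical, chosen so that the scaled value no longer fits in 16 bits.

```c
#include <assert.h>
#include <stdint.h>

#define ALPHA_SHIFT 7	/* from the struct comment: "Fixed point arith, << 7" */

/* Hypothetical helper: store a scaled alpha in a 16-bit field (old layout). */
static uint16_t store_alpha_u16(uint32_t alpha)
{
	/* The cast silently drops every bit above bit 15. */
	return (uint16_t)(alpha << ALPHA_SHIFT);
}

/* Hypothetical helper: store a scaled alpha in a 32-bit field (patched layout). */
static uint32_t store_alpha_u32(uint32_t alpha)
{
	uint32_t scaled = alpha << ALPHA_SHIFT;

	/* Sanity check for this sketch: shifting back recovers the input. */
	assert((scaled >> ALPHA_SHIFT) == alpha);
	return scaled;
}
```

With alpha = 520 the scaled value is 66560. The 16-bit field keeps only 1024, which reads back as an alpha of 8, while the 32-bit field keeps the full value.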



* Re: [PATCH] fix integer overflow in H-TCP congestion control
  2006-10-25  8:47   ` Gavin McCullagh
@ 2006-10-26  6:06     ` David Miller
  2006-10-31 18:48       ` [RFC, PATCH] dont insert sockets/pipes dentries into dentry_hashtable Eric Dumazet
  0 siblings, 1 reply; 15+ messages in thread
From: David Miller @ 2006-10-26  6:06 UTC (permalink / raw)
  To: gavin.mccullagh; +Cc: netdev, doug.leith, baruch

From: Gavin McCullagh <gavin.mccullagh@nuim.ie>
Date: Wed, 25 Oct 2006 09:47:26 +0100

> When using H-TCP with a single flow on a 500Mbit connection (or less
> actually), alpha can exceed 65000, so alpha needs to be a u32.
> 
> Signed-off-by: Gavin McCullagh <gavin.mccullagh@nuim.ie>
> Signed-off-by: Doug Leith <doug.leith@nuim.ie>

Applied, thank you.


* [RFC, PATCH] dont insert sockets/pipes dentries into dentry_hashtable.
  2006-10-26  6:06     ` David Miller
@ 2006-10-31 18:48       ` Eric Dumazet
  2006-11-01  7:19         ` David Miller
  2006-11-22 18:00         ` [PATCH] [NET] dont insert socket " Eric Dumazet
  0 siblings, 2 replies; 15+ messages in thread
From: Eric Dumazet @ 2006-10-31 18:48 UTC (permalink / raw)
  To: David Miller; +Cc: netdev, linux-kernel

[-- Attachment #1: Type: text/plain, Size: 1629 bytes --]

Hi David

Here is the patch I cooked after our mail exchange. (was [RFC] Any strong 
reason why socket dentries are hashed in global dentry_hashtable )

If necessary, I could split this patch into 4 elementary patches. I chose to
send it as one patch for initial discussion.

[RFC, PATCH] dont insert sockets/pipes dentries into dentry_hashtable.

We currently insert socket/pipe dentries into the global dentry hashtable.
This is *useless* because there is currently no way these entries can be used
for a lookup(). (/proc/xxx/fd/xxx uses a different mechanism)

Machines with a lot of sockets/pipes might suffer from longer chains in the
dentry hashtable.

The goals of this patch are :

[0] Socket/pipe dentries are no longer inserted into the hashtable.

[1] Introduce a DCACHE_DELETED flag, so that d_path() can distinguish deleted
dentries from others. (The previous code used d_unhashed().)

[2] Small optimization to bypass RCU freeing in d_free() for dentries that
were never hashed (like sockets and pipes). Such dentries don't have to wait
for an RCU grace period.

[3] Change socket code to use d_instantiate() instead of d_add()
   (No more need for a private d_delete function and dentry_operations)

[4] Change pipe code to use d_instantiate() instead of d_add()
   (No more need for a private d_delete function and dentry_operations)

Another step would be to eliminate dentries for sockets/pipes, but that's 
another story. (Or at least allocate them from a separate kmem_cache_t as 
they are not reclaimable, and they might be smaller than a full dentry)
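
Goal [2] depends on telling whether a dentry was ever linked into a hash chain. The sketch below is a user-space model, not kernel code. The two hlist structs mirror the shape of the kernel's list types, but `model_dentry`, `model_hash_insert` and `needs_rcu_free` are hypothetical names, and the model assumes the node is zero-initialised at allocation so that pprev stays NULL until the first insertion.

```c
#include <assert.h>
#include <stdbool.h>
#include <stddef.h>

struct hlist_node { struct hlist_node *next, **pprev; };
struct hlist_head { struct hlist_node *first; };

struct model_dentry {		/* hypothetical stand-in for struct dentry */
	struct hlist_node d_hash;
};

/* Link a node at the head of a chain; pprev is non-NULL from here on. */
static void model_hash_insert(struct hlist_head *h, struct model_dentry *d)
{
	struct hlist_node *n = &d->d_hash;

	assert(n->pprev == NULL);	/* the model only covers a first insertion */
	n->next = h->first;
	if (h->first)
		h->first->pprev = &n->next;
	h->first = n;
	n->pprev = &h->first;
}

/* Same test as in the patch: a NULL pprev means "never hashed". */
static bool needs_rcu_free(const struct model_dentry *d)
{
	return d->d_hash.pprev != NULL;
}
```

A dentry that was never passed to model_hash_insert() reports false, which corresponds to the immediate-free branch in the d_free() hunk of the attached patch.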

Signed-off-by: Eric Dumazet <dada1@cosmosbay.com>

[-- Attachment #2: dcache.patch --]
[-- Type: text/plain, Size: 3652 bytes --]

--- linux-2.6.19-rc4/include/linux/dcache.h	2006-10-31 17:38:09.000000000 +0100
+++ linux-2.6.19-rc4-ed/include/linux/dcache.h	2006-10-31 17:39:15.000000000 +0100
@@ -175,6 +175,7 @@
 #define DCACHE_UNHASHED		0x0010	
 
 #define DCACHE_INOTIFY_PARENT_WATCHED	0x0020 /* Parent inode is watched */
+#define DCACHE_DELETED		0x0040 /* dentry was deleted */
 
 extern spinlock_t dcache_lock;
 
--- linux-2.6.19-rc4/fs/dcache.c	2006-10-31 17:39:25.000000000 +0100
+++ linux-2.6.19-rc4-ed/fs/dcache.c	2006-10-31 18:37:26.000000000 +0100
@@ -68,15 +68,19 @@
 	.age_limit = 45,
 };
 
-static void d_callback(struct rcu_head *head)
+static void __d_free(struct dentry *dentry)
 {
-	struct dentry * dentry = container_of(head, struct dentry, d_u.d_rcu);
-
 	if (dname_external(dentry))
 		kfree(dentry->d_name.name);
 	kmem_cache_free(dentry_cache, dentry); 
 }
 
+static void d_callback(struct rcu_head *head)
+{
+	struct dentry * dentry = container_of(head, struct dentry, d_u.d_rcu);
+	__d_free(dentry);
+}
+
 /*
  * no dcache_lock, please.  The caller must decrement dentry_stat.nr_dentry
  * inside dcache_lock.
@@ -85,7 +89,11 @@
 {
 	if (dentry->d_op && dentry->d_op->d_release)
 		dentry->d_op->d_release(dentry);
- 	call_rcu(&dentry->d_u.d_rcu, d_callback);
+	/* if dentry was never inserted into hash, immediate free is OK */
+	if (dentry->d_hash.pprev == NULL)
+		__d_free(dentry);
+	else
+		call_rcu(&dentry->d_u.d_rcu, d_callback);
 }
 
 /*
@@ -1376,6 +1384,7 @@
 		return;
 	}
 
+	dentry->d_flags |= DCACHE_DELETED;
 	if (!d_unhashed(dentry))
 		__d_drop(dentry);
 
@@ -1749,7 +1758,7 @@
 
 	*--end = '\0';
 	buflen--;
-	if (!IS_ROOT(dentry) && d_unhashed(dentry)) {
+	if (!IS_ROOT(dentry) && (dentry->d_flags & DCACHE_DELETED)) {
 		buflen -= 10;
 		end -= 10;
 		if (buflen < 0)
--- linux-2.6.19-rc4/net/socket.c	2006-10-31 17:53:34.000000000 +0100
+++ linux-2.6.19-rc4-ed/net/socket.c	2006-10-31 18:39:06.000000000 +0100
@@ -304,13 +304,6 @@
 	.kill_sb =	kill_anon_super,
 };
 
-static int sockfs_delete_dentry(struct dentry *dentry)
-{
-	return 1;
-}
-static struct dentry_operations sockfs_dentry_operations = {
-	.d_delete = sockfs_delete_dentry,
-};
 
 /*
  *	Obtains the first available file descriptor and sets it up for use.
@@ -360,8 +353,9 @@
 	if (unlikely(!file->f_dentry))
 		return -ENOMEM;
 
-	file->f_dentry->d_op = &sockfs_dentry_operations;
-	d_add(file->f_dentry, SOCK_INODE(sock));
+	/* Dont insert socket dentry into global dentry hashtable */
+	d_instantiate(file->f_dentry, SOCK_INODE(sock));
+
 	file->f_vfsmnt = mntget(sock_mnt);
 	file->f_mapping = file->f_dentry->d_inode->i_mapping;
 
--- linux-2.6.19-rc4/fs/pipe.c	2006-10-31 18:53:21.000000000 +0100
+++ linux-2.6.19-rc4-ed/fs/pipe.c	2006-10-31 18:55:20.000000000 +0100
@@ -828,14 +828,6 @@
 }
 
 static struct vfsmount *pipe_mnt __read_mostly;
-static int pipefs_delete_dentry(struct dentry *dentry)
-{
-	return 1;
-}
-
-static struct dentry_operations pipefs_dentry_operations = {
-	.d_delete	= pipefs_delete_dentry,
-};
 
 static struct inode * get_pipe_inode(void)
 {
@@ -891,17 +883,15 @@
 	if (!inode)
 		goto err_file;
 
-	sprintf(name, "[%lu]", inode->i_ino);
+	this.len = sprintf(name, "[%lu]", inode->i_ino);
 	this.name = name;
-	this.len = strlen(name);
 	this.hash = inode->i_ino; /* will go */
 	err = -ENOMEM;
 	dentry = d_alloc(pipe_mnt->mnt_sb->s_root, &this);
 	if (!dentry)
 		goto err_inode;
-
-	dentry->d_op = &pipefs_dentry_operations;
-	d_add(dentry, inode);
+	/* Dont insert pipe dentry into global dentry hashtable */
+	d_instantiate(dentry, inode);
 	f->f_vfsmnt = mntget(pipe_mnt);
 	f->f_dentry = dentry;
 	f->f_mapping = inode->i_mapping;


* Re: [RFC, PATCH] dont insert sockets/pipes dentries into dentry_hashtable.
  2006-10-31 18:48       ` [RFC, PATCH] dont insert sockets/pipes dentries into dentry_hashtable Eric Dumazet
@ 2006-11-01  7:19         ` David Miller
  2006-11-01  8:21           ` Eric Dumazet
  2006-11-01 13:54           ` Eric Dumazet
  2006-11-22 18:00         ` [PATCH] [NET] dont insert socket " Eric Dumazet
  1 sibling, 2 replies; 15+ messages in thread
From: David Miller @ 2006-11-01  7:19 UTC (permalink / raw)
  To: dada1; +Cc: netdev, linux-kernel

From: Eric Dumazet <dada1@cosmosbay.com>
Date: Tue, 31 Oct 2006 19:48:48 +0100

> We currently insert sockets/pipes dentries into the global dentry
> hashtable.  This is *useless* because there is currently no way
> these entries can be used for a lookup(). (/proc/xxx/fd/xxx uses a
> different mechanism)

It turns out that while procfs uses a different "mechanism", those
procfs symlinks do point to the real socket dentry, so when you
readlink() on it you do d_path() on the real socket dentry.

If you unhash these things, I'm pretty sure you'll see an ugly
"(deleted)" at the end of the symlink string for /proc/$pid/fd/$X
files that are sockets or something like that.

Al Viro just suggested a way around this to me:

1) Just mark the dentry HASHED by hand in the dentry flags, but don't
   actually hash it.

2) Create a special dentry->d_op->d_delete method for sockets that returns
   0 and clears by hand the HASHED flag bit in the dentry (see what
   dput() does when this happens).

It's an abuse but it will work.


* Re: [RFC, PATCH] dont insert sockets/pipes dentries into dentry_hashtable.
  2006-11-01  7:19         ` David Miller
@ 2006-11-01  8:21           ` Eric Dumazet
  2006-11-01  8:34             ` David Miller
  2006-11-01  8:38             ` Al Viro
  2006-11-01 13:54           ` Eric Dumazet
  1 sibling, 2 replies; 15+ messages in thread
From: Eric Dumazet @ 2006-11-01  8:21 UTC (permalink / raw)
  To: David Miller; +Cc: netdev, linux-kernel

David Miller wrote:
> From: Eric Dumazet <dada1@cosmosbay.com>
> Date: Tue, 31 Oct 2006 19:48:48 +0100
> 
>> We currently insert sockets/pipes dentries into the global dentry
>> hashtable.  This is *useless* because there is currently no way
>> these entries can be used for a lookup(). (/proc/xxx/fd/xxx uses a
>> different mechanism)
> 
> It turns out that while procfs uses a different "mechanism", those
> procfs symlinks do point to the real socket dentry, so when you
> readlink() on it you do d_path() on the real socket dentry.
> 
> If you unhash these things, I'm pretty sure you'll see an ugly
> "(deleted)" at the end of the symlink string for /proc/$pid/fd/$X
> files that are sockets or something like that.

No no, my patch takes care of that.

You still see the right link for pipes and sockets on /proc/$pid/fd/XXX

And " (deleted)" is correctly added to deleted files.


> 
> Al Viro just suggested a way around this to me:
> 
> 1) Just mark the dentry HASHED by hand in the dentry flags, but don't
>    actually hash it.
> 
> 2) Create a special dentry->d_op->d_delete method for sockets that returns
>    0 and clears by hand the HASHED flag bit in the dentry (see what
>    dput() does when this happens).
> 
> It's an abuse but it will work.
> 
Why use a hack when it can be done properly?



* Re: [RFC, PATCH] dont insert sockets/pipes dentries into dentry_hashtable.
  2006-11-01  8:21           ` Eric Dumazet
@ 2006-11-01  8:34             ` David Miller
  2006-11-01  8:38             ` Al Viro
  1 sibling, 0 replies; 15+ messages in thread
From: David Miller @ 2006-11-01  8:34 UTC (permalink / raw)
  To: dada1; +Cc: netdev, linux-kernel

From: Eric Dumazet <dada1@cosmosbay.com>
Date: Wed, 01 Nov 2006 09:21:06 +0100

> No no, my patch takes care of that.
> 
> You still see the right link for pipes and sockets on /proc/$pid/fd/XXX
> 
> And " (deleted)" is correctly added to deleted files.

I see.  Excellent :-)


* Re: [RFC, PATCH] dont insert sockets/pipes dentries into dentry_hashtable.
  2006-11-01  8:21           ` Eric Dumazet
  2006-11-01  8:34             ` David Miller
@ 2006-11-01  8:38             ` Al Viro
  2006-11-01  8:42               ` Al Viro
  2006-11-01  9:04               ` Eric Dumazet
  1 sibling, 2 replies; 15+ messages in thread
From: Al Viro @ 2006-11-01  8:38 UTC (permalink / raw)
  To: Eric Dumazet; +Cc: David Miller, netdev, linux-kernel

On Wed, Nov 01, 2006 at 09:21:06AM +0100, Eric Dumazet wrote:

> And " (deleted)" is correctly added to deleted files.

The hell it will.

touch a
touch b
exec 5<a
mv b a
ls -l /proc/$$/fd/5

With your patch and without it, please.

PS: getting rid of socket dentries is a bad idea with the capital "Fuck, No".
For those who want to see where that path leads and are attracted to
trainwrecks in general I can recommend *BSD socket handling.  They have
paid quite painfully for lack of proper vnodes.  It's simply not worth
the resulting trouble.


* Re: [RFC, PATCH] dont insert sockets/pipes dentries into dentry_hashtable.
  2006-11-01  8:38             ` Al Viro
@ 2006-11-01  8:42               ` Al Viro
  2006-11-01  9:04               ` Eric Dumazet
  1 sibling, 0 replies; 15+ messages in thread
From: Al Viro @ 2006-11-01  8:42 UTC (permalink / raw)
  To: Eric Dumazet; +Cc: David Miller, netdev, linux-kernel

On Wed, Nov 01, 2006 at 08:38:11AM +0000, Al Viro wrote:
> On Wed, Nov 01, 2006 at 09:21:06AM +0100, Eric Dumazet wrote:
> 
> > And " (deleted)" is correctly added to deleted files.
> 
> The hell it will.
> 
> touch a
> touch b
> exec 5<a
> mv b a
> ls -l /proc/$$/fd/5
> 
> With your patch and without it, please.

While we are at it,

touch a
rm a
touch a
ls -l /proc/self/fd/0 <a

With and without your patch.  Note that you never remove DCACHE_DELETED
after you've set it.


* Re: [RFC, PATCH] dont insert sockets/pipes dentries into dentry_hashtable.
  2006-11-01  8:38             ` Al Viro
  2006-11-01  8:42               ` Al Viro
@ 2006-11-01  9:04               ` Eric Dumazet
  1 sibling, 0 replies; 15+ messages in thread
From: Eric Dumazet @ 2006-11-01  9:04 UTC (permalink / raw)
  To: Al Viro; +Cc: David Miller, netdev, linux-kernel

Al Viro wrote:
> On Wed, Nov 01, 2006 at 09:21:06AM +0100, Eric Dumazet wrote:
> 
>> And " (deleted)" is correctly added to deleted files.
> 
> The hell it will.
> 
> touch a
> touch b
> exec 5<a
> mv b a
> ls -l /proc/$$/fd/5
> 
> With your patch and without it, please.

Yes, I will, thanks.

> 
> PS: getting rid of socket dentries is a bad idea with the capital "Fuck, No".
> For those who want to see where that path leads and are attracted to
> trainwrecks in general I can recommend *BSD socket handling.  They have
> paid quite painfully for lack of proper vnodes.  It's simply not worth
> the resulting trouble.

I have one server with one million sockets, wasting 210 MB of RAM on dentries
that are only used when someone does some /proc/$pid/fd work.

The socket hot path no longer uses dentries, thanks to file->private_data.

Eric



* Re: [RFC, PATCH] dont insert sockets/pipes dentries into dentry_hashtable.
  2006-11-01  7:19         ` David Miller
  2006-11-01  8:21           ` Eric Dumazet
@ 2006-11-01 13:54           ` Eric Dumazet
  1 sibling, 0 replies; 15+ messages in thread
From: Eric Dumazet @ 2006-11-01 13:54 UTC (permalink / raw)
  To: David Miller, Al Viro; +Cc: netdev, linux-kernel

[-- Attachment #1: Type: text/plain, Size: 2052 bytes --]

David Miller wrote:
> It turns out that while procfs uses a different "mechanism", those
> procfs symlinks do point to the real socket dentry, so when you
> readlink() on it you do d_path() on the real socket dentry.
> 
> Al Viro just suggested a way around this to me:
> 
> 1) Just mark the dentry HASHED by hand in the dentry flags, but don't
>    actually hash it.
> 
> 2) Create a special dentry->d_op->d_delete method for sockets that returns
>    0 and clears by hand the HASHED flag bit in the dentry (see what
>    dput() does when this happens).
> 
> It's an abuse but it will work.
> 


Thank you David and Al for the feedback.

Here is a new version of the patch with your suggestions included.

If necessary, I could split this patch into 3 elementary patches. I chose to
send it as one patch for ease of discussion.

[RFC, PATCH] dont insert sockets/pipes dentries into dentry_hashtable.

We currently insert socket/pipe dentries into the global dentry hashtable.
This is *useless* because there is currently no way these entries can be used
for a lookup(). (/proc/xxx/fd/xxx uses a different mechanism)

Machines with a lot of sockets/pipes might suffer from longer chains in the
dentry hashtable.

Since an unhashed dentry means __d_path() adds a " (deleted)" suffix, the
trick for socket/pipe dentries is to:

- Right after d_alloc(), pretend they are hashed by clearing the
DCACHE_UNHASHED bit. __d_path() & friends work as intended.

- Call d_instantiate() instead of d_add(): the dentry is not inserted into the
hash table.

- When dput() must finally release the dentry, set the DCACHE_UNHASHED bit
again inside the custom d_delete() function provided by the socket/pipe code.
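
The three steps above can be modeled in user space. This is a sketch, not kernel code: `model_dentry`, `model_instantiate`, `model_dput_action` and the enum are hypothetical names, and the value of DCACHE_UNHASHED is taken from the dcache.h hunk in the earlier RFC patch in this thread. The decision order in model_dput_action() is an assumed reading of how dput() consults d_delete() before checking the unhashed flag.

```c
#include <assert.h>
#include <stddef.h>

#define DCACHE_UNHASHED 0x0010

struct model_dentry {			/* hypothetical stand-in for struct dentry */
	unsigned int d_flags;
	int (*d_delete)(struct model_dentry *);
};

enum dput_action { KEEP_CACHED, UNHASH_THEN_KILL, KILL_DIRECTLY };

/* Step 3: restore the truth and return 0 so the caller skips unhashing. */
static int sockfs_like_delete(struct model_dentry *d)
{
	d->d_flags |= DCACHE_UNHASHED;
	return 0;
}

/* Steps 1 and 2: pretend to be hashed; nothing is put in any hash table. */
static void model_instantiate(struct model_dentry *d)
{
	d->d_flags = DCACHE_UNHASHED;		/* state right after allocation */
	d->d_flags &= ~DCACHE_UNHASHED;		/* step 1: pretend hashed */
	d->d_delete = sockfs_like_delete;	/* step 2 adds no hash insertion */
}

/* Assumed decision order when the last reference is dropped. */
static enum dput_action model_dput_action(struct model_dentry *d)
{
	assert(d != NULL);
	if (d->d_delete && d->d_delete(d))
		return UNHASH_THEN_KILL;	/* would touch the hash chain */
	if (d->d_flags & DCACHE_UNHASHED)
		return KILL_DIRECTLY;		/* freed without unhashing */
	return KEEP_CACHED;
}
```

Under these assumptions, a callback that returned 1 would select the unhash path for a dentry that was never linked into a chain, which is the case the return value of 0 is meant to avoid.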


[patch 1/3] Small optimization to bypass RCU freeing in d_free() for dentries
that were never hashed (like sockets and pipes). Such dentries don't have to
wait for an RCU grace period.

[patch 2/3] Change socket code to use d_instantiate() instead of d_add()

[patch 3/3] Change pipe code to use d_instantiate() instead of d_add()

Signed-off-by: Eric Dumazet <dada1@cosmosbay.com>

[-- Attachment #2: donthash.patch --]
[-- Type: text/plain, Size: 3861 bytes --]

--- linux-2.6.19-rc4/fs/dcache.c	2006-11-01 12:38:23.000000000 +0100
+++ linux-2.6.19-rc4-ed/fs/dcache.c	2006-11-01 13:22:44.000000000 +0100
@@ -68,15 +68,19 @@
 	.age_limit = 45,
 };
 
-static void d_callback(struct rcu_head *head)
+static void __d_free(struct dentry *dentry)
 {
-	struct dentry * dentry = container_of(head, struct dentry, d_u.d_rcu);
-
 	if (dname_external(dentry))
 		kfree(dentry->d_name.name);
 	kmem_cache_free(dentry_cache, dentry); 
 }
 
+static void d_callback(struct rcu_head *head)
+{
+	struct dentry * dentry = container_of(head, struct dentry, d_u.d_rcu);
+	__d_free(dentry);
+}
+
 /*
  * no dcache_lock, please.  The caller must decrement dentry_stat.nr_dentry
  * inside dcache_lock.
@@ -85,7 +89,11 @@
 {
 	if (dentry->d_op && dentry->d_op->d_release)
 		dentry->d_op->d_release(dentry);
- 	call_rcu(&dentry->d_u.d_rcu, d_callback);
+	/* if dentry was never inserted into hash, immediate free is OK */
+	if (dentry->d_hash.pprev == NULL)
+		__d_free(dentry);
+	else
+		call_rcu(&dentry->d_u.d_rcu, d_callback);
 }
 
 /*
--- linux-2.6.19-rc4/net/socket.c	2006-11-01 12:40:27.000000000 +0100
+++ linux-2.6.19-rc4-ed/net/socket.c	2006-11-01 14:33:18.000000000 +0100
@@ -306,7 +306,14 @@
 
 static int sockfs_delete_dentry(struct dentry *dentry)
 {
-	return 1;
+	/*
+	 * At creation time, we pretended this dentry was hashed
+	 * (by clearing DCACHE_UNHASHED bit in d_flags)
+	 * At delete time, we restore the truth : not hashed.
+	 * (so that dput() can proceed correctly)
+	 */
+	dentry->d_flags |= DCACHE_UNHASHED;
+	return 0;
 }
 static struct dentry_operations sockfs_dentry_operations = {
 	.d_delete = sockfs_delete_dentry,
@@ -354,14 +361,20 @@
 
 	this.len = sprintf(name, "[%lu]", SOCK_INODE(sock)->i_ino);
 	this.name = name;
-	this.hash = SOCK_INODE(sock)->i_ino;
+	this.hash = 0;
 
 	file->f_dentry = d_alloc(sock_mnt->mnt_sb->s_root, &this);
 	if (unlikely(!file->f_dentry))
 		return -ENOMEM;
 
 	file->f_dentry->d_op = &sockfs_dentry_operations;
-	d_add(file->f_dentry, SOCK_INODE(sock));
+	/*
+	 * We dont want to push this dentry into global dentry hash table. 
+	 * We pretend dentry is already hashed, by unsetting DCACHE_UNHASHED
+	 * This hack permits a working /proc/$pid/fd/XXX on sockets
+	 */
+	file->f_dentry->d_flags &= ~DCACHE_UNHASHED;
+	d_instantiate(file->f_dentry, SOCK_INODE(sock));
 	file->f_vfsmnt = mntget(sock_mnt);
 	file->f_mapping = file->f_dentry->d_inode->i_mapping;
 
--- linux-2.6.19-rc4/fs/pipe.c	2006-11-01 12:56:05.000000000 +0100
+++ linux-2.6.19-rc4-ed/fs/pipe.c	2006-11-01 14:33:18.000000000 +0100
@@ -830,7 +830,14 @@
 static struct vfsmount *pipe_mnt __read_mostly;
 static int pipefs_delete_dentry(struct dentry *dentry)
 {
-	return 1;
+	/*
+	 * At creation time, we pretended this dentry was hashed
+	 * (by clearing DCACHE_UNHASHED bit in d_flags)
+	 * At delete time, we restore the truth : not hashed.
+	 * (so that dput() can proceed correctly)
+	 */
+	dentry->d_flags |= DCACHE_UNHASHED;
+	return 0;
 }
 
 static struct dentry_operations pipefs_dentry_operations = {
@@ -891,17 +898,22 @@
 	if (!inode)
 		goto err_file;
 
-	sprintf(name, "[%lu]", inode->i_ino);
+	this.len = sprintf(name, "[%lu]", inode->i_ino);
 	this.name = name;
-	this.len = strlen(name);
-	this.hash = inode->i_ino; /* will go */
+	this.hash = 0;
 	err = -ENOMEM;
 	dentry = d_alloc(pipe_mnt->mnt_sb->s_root, &this);
 	if (!dentry)
 		goto err_inode;
 
 	dentry->d_op = &pipefs_dentry_operations;
-	d_add(dentry, inode);
+	/*
+	 * We dont want to push this dentry into global dentry hash table. 
+	 * We pretend dentry is already hashed, by unsetting DCACHE_UNHASHED
+	 * This hack permits a working /proc/$pid/fd/XXX on pipes
+	 */
+	dentry->d_flags &= ~DCACHE_UNHASHED;
+	d_instantiate(dentry, inode);
 	f->f_vfsmnt = mntget(pipe_mnt);
 	f->f_dentry = dentry;
 	f->f_mapping = inode->i_mapping;


* [PATCH] [NET] dont insert socket dentries into dentry_hashtable.
  2006-10-31 18:48       ` [RFC, PATCH] dont insert sockets/pipes dentries into dentry_hashtable Eric Dumazet
  2006-11-01  7:19         ` David Miller
@ 2006-11-22 18:00         ` Eric Dumazet
  2006-11-28 23:35           ` David Miller
  1 sibling, 1 reply; 15+ messages in thread
From: Eric Dumazet @ 2006-11-22 18:00 UTC (permalink / raw)
  To: David Miller; +Cc: netdev, linux-kernel, Al Viro, Andrew Morton

[-- Attachment #1: Type: text/plain, Size: 847 bytes --]

We currently insert socket dentries into the global dentry hashtable.
This is *suboptimal* because there is currently no way these entries can be
used for a lookup(). (/proc/xxx/fd/xxx uses a different mechanism). Inserting
them into the dentry hashtable slows dcache lookups.


To let __d_path() still work correctly (i.e. not add " (deleted)" after
the dentry name), we do the following:

- Right after d_alloc(), pretend they are hashed by clearing the
DCACHE_UNHASHED bit.

- Call d_instantiate() instead of d_add(): the dentry is not inserted into the
hash table.

__d_path() & friends work as intended during the dentry lifetime.

- At dismantle time, when dput() must release the dentry, set the
DCACHE_UNHASHED bit again inside the custom d_delete() function provided by
the socket code, so that dput() can go straight to kill_it.

Signed-off-by: Eric Dumazet <dada1@cosmosbay.com>

[-- Attachment #2: socket_nohash_dentry.patch --]
[-- Type: text/plain, Size: 1466 bytes --]

--- linux-2.6.19-rc6/net/socket.c	2006-11-22 17:33:41.000000000 +0100
+++ linux-2.6.19-rc6-ed/net/socket.c	2006-11-22 18:28:12.000000000 +0100
@@ -306,7 +306,14 @@ static struct file_system_type sock_fs_t
 
 static int sockfs_delete_dentry(struct dentry *dentry)
 {
-	return 1;
+	/*
+	 * At creation time, we pretended this dentry was hashed
+	 * (by clearing DCACHE_UNHASHED bit in d_flags)
+	 * At delete time, we restore the truth : not hashed.
+	 * (so that dput() can proceed correctly)
+	 */
+	dentry->d_flags |= DCACHE_UNHASHED;
+	return 0;
 }
 static struct dentry_operations sockfs_dentry_operations = {
 	.d_delete = sockfs_delete_dentry,
@@ -354,14 +361,20 @@ static int sock_attach_fd(struct socket 
 
 	this.len = sprintf(name, "[%lu]", SOCK_INODE(sock)->i_ino);
 	this.name = name;
-	this.hash = SOCK_INODE(sock)->i_ino;
+	this.hash = 0;
 
 	file->f_dentry = d_alloc(sock_mnt->mnt_sb->s_root, &this);
 	if (unlikely(!file->f_dentry))
 		return -ENOMEM;
 
 	file->f_dentry->d_op = &sockfs_dentry_operations;
-	d_add(file->f_dentry, SOCK_INODE(sock));
+	/*
+	 * We dont want to push this dentry into global dentry hash table. 
+	 * We pretend dentry is already hashed, by unsetting DCACHE_UNHASHED
+	 * This permits a working /proc/$pid/fd/XXX on sockets
+	 */
+	file->f_dentry->d_flags &= ~DCACHE_UNHASHED;
+	d_instantiate(file->f_dentry, SOCK_INODE(sock));
 	file->f_vfsmnt = mntget(sock_mnt);
 	file->f_mapping = file->f_dentry->d_inode->i_mapping;
 


* Re: [PATCH] [NET] dont insert socket dentries into dentry_hashtable.
  2006-11-22 18:00         ` [PATCH] [NET] dont insert socket " Eric Dumazet
@ 2006-11-28 23:35           ` David Miller
  2006-11-29  0:13             ` Andrew Morton
  0 siblings, 1 reply; 15+ messages in thread
From: David Miller @ 2006-11-28 23:35 UTC (permalink / raw)
  To: dada1; +Cc: netdev, linux-kernel, viro, akpm


Andrew, I'm fine with these three patches, specifically:

[PATCH] dont insert pipe dentries into dentry_hashtable.
[PATCH] [DCACHE] : avoid RCU for never hashed dentries
[PATCH] [NET] dont insert socket dentries into dentry_hashtable.

Could you toss them into -mm if you haven't already?  This
makes better sense than me putting it into net-2.6.20 since
it touches FS stuff.

Thanks!


* Re: [PATCH] [NET] dont insert socket dentries into dentry_hashtable.
  2006-11-28 23:35           ` David Miller
@ 2006-11-29  0:13             ` Andrew Morton
  0 siblings, 0 replies; 15+ messages in thread
From: Andrew Morton @ 2006-11-29  0:13 UTC (permalink / raw)
  To: David Miller; +Cc: dada1, netdev, linux-kernel, viro

On Tue, 28 Nov 2006 15:35:31 -0800 (PST)
David Miller <davem@davemloft.net> wrote:

> 
> Andrew, I'm fine with these three patches, specifically:
> 
> [PATCH] dont insert pipe dentries into dentry_hashtable.
> [PATCH] [DCACHE] : avoid RCU for never hashed dentries
> [PATCH] [NET] dont insert socket dentries into dentry_hashtable.
> 
> Could you toss them into -mm if you haven't already?

They were in rc6-mm2.

>  This
> makes better sense than me putting it into net-2.6.20 since
> it touches FS stuff.
> 

No probs, they're all lined up and ready to go, thanks.

