* [PATCH net] ipv4: set transport header earlier
@ 2013-07-16  3:03 Eric Dumazet
  2013-07-16 13:24 ` Sergei Shtylyov
  2013-07-16 20:00 ` David Miller
  0 siblings, 2 replies; 4+ messages in thread
From: Eric Dumazet @ 2013-07-16  3:03 UTC (permalink / raw)
  To: David Miller; +Cc: netdev, Neal Cardwell, netfilter-devel, Tom Herbert
From: Eric Dumazet <edumazet@google.com>
commit 45f00f99d6e ("ipv4: tcp: clean up tcp_v4_early_demux()") added a
performance regression for non-GRO traffic, basically disabling
IP early demux.

The IPv6 stack resets the transport header in ip6_rcv() before calling
IP early demux in ip6_rcv_finish(), while IPv4 does this only in
ip_local_deliver_finish(), _after_ IP early demux.

GRO traffic happened to enable IP early demux because the transport header
is also set in inet_gro_receive().

Instead of reverting the faulty commit, we can make IPv4 and IPv6 behave the
same: transport_header should be set in ip_rcv() instead of
ip_local_deliver_finish().

ip_local_deliver_finish() can also use skb_network_header_len(), which is
faster than ip_hdrlen().

Signed-off-by: Eric Dumazet <edumazet@google.com>
Cc: Neal Cardwell <ncardwell@google.com>
Cc: Tom Herbert <therbert@google.com>
---
 net/ipv4/ip_input.c |    7 +++----
 1 file changed, 3 insertions(+), 4 deletions(-)
diff --git a/net/ipv4/ip_input.c b/net/ipv4/ip_input.c
index 3da817b..15e3e68 100644
--- a/net/ipv4/ip_input.c
+++ b/net/ipv4/ip_input.c
@@ -190,10 +190,7 @@ static int ip_local_deliver_finish(struct sk_buff *skb)
 {
 	struct net *net = dev_net(skb->dev);
 
-	__skb_pull(skb, ip_hdrlen(skb));
-
-	/* Point into the IP datagram, just past the header. */
-	skb_reset_transport_header(skb);
+	__skb_pull(skb, skb_network_header_len(skb));
 
 	rcu_read_lock();
 	{
@@ -437,6 +434,8 @@ int ip_rcv(struct sk_buff *skb, struct net_device *dev, struct packet_type *pt,
 		goto drop;
 	}
 
+	skb->transport_header = skb->network_header + iph->ihl*4;
+
 	/* Remove any debris in the socket control block */
 	memset(IPCB(skb), 0, sizeof(struct inet_skb_parm));
 
* Re: [PATCH net] ipv4: set transport header earlier
  2013-07-16  3:03 [PATCH net] ipv4: set transport header earlier Eric Dumazet
@ 2013-07-16 13:24 ` Sergei Shtylyov
  2013-07-16 13:56   ` Eric Dumazet
  2013-07-16 20:00 ` David Miller
  1 sibling, 1 reply; 4+ messages in thread
From: Sergei Shtylyov @ 2013-07-16 13:24 UTC (permalink / raw)
  To: Eric Dumazet
  Cc: David Miller, netdev, Neal Cardwell, netfilter-devel, Tom Herbert
Hello.
On 16-07-2013 7:03, Eric Dumazet wrote:
> From: Eric Dumazet <edumazet@google.com>
> commit 45f00f99d6e ("ipv4: tcp: clean up tcp_v4_early_demux()") added a
> performance regression for non GRO traffic, basically disabling
> IP early demux.
> IPv6 stack resets transport header in ip6_rcv() before calling
> IP early demux in ip6_rcv_finish(), while IPv4 does this only in
> ip_local_deliver_finish(), _after_ IP early demux.
> GRO traffic happened to enable IP early demux because transport header
> is also set in inet_gro_receive()
> Instead of reverting the faulty commit, we can make IPv4/IPv6 behave the
> same : transport_header should be set in ip_rcv() instead of
> ip_local_deliver_finish()
> ip_local_deliver_finish() can also use skb_network_header_len() which is
> faster than ip_hdrlen()
> Signed-off-by: Eric Dumazet <edumazet@google.com>
> Cc: Neal Cardwell <ncardwell@google.com>
> Cc: Tom Herbert <therbert@google.com>
> ---
>   net/ipv4/ip_input.c |    7 +++----
>   1 file changed, 3 insertions(+), 4 deletions(-)
> diff --git a/net/ipv4/ip_input.c b/net/ipv4/ip_input.c
> index 3da817b..15e3e68 100644
> --- a/net/ipv4/ip_input.c
> +++ b/net/ipv4/ip_input.c
[...]
> @@ -437,6 +434,8 @@ int ip_rcv(struct sk_buff *skb, struct net_device *dev, struct packet_type *pt,
>   		goto drop;
>   	}
>
> +	skb->transport_header = skb->network_header + iph->ihl*4;
    Spaces around * wouldn't hurt, to be consistent with the rest of the
statement and the Linux style in general.
WBR, Sergei
* Re: [PATCH net] ipv4: set transport header earlier
  2013-07-16 13:24 ` Sergei Shtylyov
@ 2013-07-16 13:56   ` Eric Dumazet
  0 siblings, 0 replies; 4+ messages in thread
From: Eric Dumazet @ 2013-07-16 13:56 UTC (permalink / raw)
  To: Sergei Shtylyov
  Cc: David Miller, netdev, Neal Cardwell, netfilter-devel, Tom Herbert
On Tue, 2013-07-16 at 17:24 +0400, Sergei Shtylyov wrote:
> > +	skb->transport_header = skb->network_header + iph->ihl*4;
> 
>     Spaces around * wouldn't hurt, to be consistent with the rest of the 
> statement and the Linux style in common.
I am well aware of this; I chose the convention used in this function
and file.
# grep ihl net/ipv4/ip_input.c
	opt->optlen = iph->ihl*4 - sizeof(struct iphdr);
	if (iph->ihl > 5 && ip_rcv_options(skb))
	if (iph->ihl < 5 || iph->version != 4)
	if (!pskb_may_pull(skb, iph->ihl*4))
	if (unlikely(ip_fast_csum((u8 *)iph, iph->ihl)))
	} else if (len < (iph->ihl*4))
Thanks
* Re: [PATCH net] ipv4: set transport header earlier
  2013-07-16  3:03 [PATCH net] ipv4: set transport header earlier Eric Dumazet
  2013-07-16 13:24 ` Sergei Shtylyov
@ 2013-07-16 20:00 ` David Miller
  1 sibling, 0 replies; 4+ messages in thread
From: David Miller @ 2013-07-16 20:00 UTC (permalink / raw)
  To: eric.dumazet; +Cc: netdev, ncardwell, netfilter-devel, therbert
From: Eric Dumazet <eric.dumazet@gmail.com>
Date: Mon, 15 Jul 2013 20:03:19 -0700
> From: Eric Dumazet <edumazet@google.com>
> 
> commit 45f00f99d6e ("ipv4: tcp: clean up tcp_v4_early_demux()") added a
> performance regression for non GRO traffic, basically disabling
> IP early demux.
> 
> IPv6 stack resets transport header in ip6_rcv() before calling
> IP early demux in ip6_rcv_finish(), while IPv4 does this only in
> ip_local_deliver_finish(), _after_ IP early demux.
> 
> GRO traffic happened to enable IP early demux because transport header
> is also set in inet_gro_receive()
> 
> Instead of reverting the faulty commit, we can make IPv4/IPv6 behave the
> same : transport_header should be set in ip_rcv() instead of
> ip_local_deliver_finish()
> 
> ip_local_deliver_finish() can also use skb_network_header_len() which is
> faster than ip_hdrlen()
> 
> Signed-off-by: Eric Dumazet <edumazet@google.com>
Applied and queued up for -stable, thanks Eric.