From: Eric Dumazet <eric.dumazet@gmail.com>
To: Changli Gao <xiaosuo@gmail.com>
Cc: "David S. Miller" <davem@davemloft.net>,
Alexey Kuznetsov <kuznet@ms2.inr.ac.ru>,
"Pekka Savola (ipv6)" <pekkas@netcore.fi>,
James Morris <jmorris@namei.org>,
Hideaki YOSHIFUJI <yoshfuji@linux-ipv6.org>,
Patrick McHardy <kaber@trash.net>,
netdev@vger.kernel.org
Subject: Re: [PATCH] net: code cleanups
Date: Thu, 30 Sep 2010 08:31:39 +0200 [thread overview]
Message-ID: <1285828299.5211.806.camel@edumazet-laptop> (raw)
In-Reply-To: <AANLkTimVngrNU+PLbaf8r42J3HrvURpY8KUPa+--KvLC@mail.gmail.com>
Le jeudi 30 septembre 2010 à 14:09 +0800, Changli Gao a écrit :
> On Thu, Sep 30, 2010 at 1:16 PM, Eric Dumazet <eric.dumazet@gmail.com> wrote:
> > Le jeudi 30 septembre 2010 à 10:24 +0800, Changli Gao a écrit :
> >> Compare operations are more readable, and compilers generate the same code
> >> for the both.
> >>
> >
> > You have a buggy compiler then.
>
> gcc version 4.4.3 (Gentoo 4.4.3-r2 p1.2)
>
> rth = rcu_dereference(rth->dst.rt_next)) {
> if ((((__force u32)rth->fl.fl4_dst ^ (__force u32)daddr) |
> ((__force u32)rth->fl.fl4_src ^ (__force u32)saddr) |
> (rth->fl.iif ^ iif) |
> 2f12: 44 3b 80 dc 00 00 00 cmp 0xdc(%rax),%r8d
> 2f19: 0f 85 a2 00 00 00 jne 2fc1 <ip_route_input_common+0x145
> >
> rth->fl.oif |
> 2f1f: 83 b8 d8 00 00 00 00 cmpl $0x0,0xd8(%rax)
> 2f26: 0f 85 95 00 00 00 jne 2fc1 <ip_route_input_common+0x145
> >
> tos &= IPTOS_RT_MASK;
> hash = rt_hash(daddr, saddr, iif, rt_genid(net));
>
> for (rth = rcu_dereference(rt_hash_table[hash].chain); rth;
> rth = rcu_dereference(rth->dst.rt_next)) {
> if ((((__force u32)rth->fl.fl4_dst ^ (__force u32)daddr) |
> 2f2c: 44 3b b8 e4 00 00 00 cmp 0xe4(%rax),%r15d
> 2f33: 0f 85 88 00 00 00 jne 2fc1
> <ip_route_input_common+0x145>
> ((__force u32)rth->fl.fl4_src ^ (__force u32)saddr) |
> 2f39: 44 3b b0 e8 00 00 00 cmp 0xe8(%rax),%r14d
> 2f40: 75 7f jne 2fc1
> <ip_route_input_common+0x145>
>
>
> >
> > I know this code is ugly, but please keep it as is, dont add conditional
> > branches on hot paths.
> >
>
> If the compiler doesn't generate conditional branches, we have to
> touch every necessary field of all the cache entries in one hash
> bucket. Is it better than condition branch? I think the compiler
> developers know it better.
Last famous words.
Are you aware of cache lines (64 bytes at least on typical cpus), and
that all fields are already in CPU L1 cache ? I (and others) worked hard
in the past.
>
> And the compiler reorders the conditional branches, is it expected?
>
Your compiler added conditional branches on a code not wanting them,
only because on _your_ cpu, these conditional branches might be cheap.
Now, try to compile for an i686 target and see the difference.
If there was no difference, your compiler would be _buggy_, because not
generating optimal assembly.
Here I get :
c141dda9: 8b 55 e8 mov -0x18(%ebp),%edx
c141ddac: 8b 81 9c 00 00 00 mov 0x9c(%ecx),%eax
c141ddb2: 33 91 a0 00 00 00 xor 0xa0(%ecx),%edx
c141ddb8: 31 f0 xor %esi,%eax
c141ddba: 09 d0 or %edx,%eax
c141ddbc: 8b 55 e0 mov -0x20(%ebp),%edx
c141ddbf: 33 91 94 00 00 00 xor 0x94(%ecx),%edx
c141ddc5: 09 d0 or %edx,%eax
c141ddc7: 0f b6 55 e7 movzbl -0x19(%ebp),%edx
c141ddcb: 0b 81 90 00 00 00 or 0x90(%ecx),%eax
c141ddd1: 32 91 a4 00 00 00 xor 0xa4(%ecx),%dl
c141ddd7: 0f b6 d2 movzbl %dl,%edx
c141ddda: 09 d0 or %edx,%eax
c141dddc: 0f 85 9d 00 00 00 jne c141de7f <ip_route_input_common+0x1b4>
As you can see, only one conditional branch.
Your patch is not welcomed, thanks.
prev parent reply other threads:[~2010-09-30 6:31 UTC|newest]
Thread overview: 7+ messages / expand[flat|nested] mbox.gz Atom feed top
2010-09-30 2:24 [PATCH] net: code cleanups Changli Gao
2010-09-30 2:49 ` Joe Perches
2010-09-30 3:07 ` Changli Gao
2010-09-30 3:13 ` Joe Perches
2010-09-30 5:16 ` Eric Dumazet
2010-09-30 6:09 ` Changli Gao
2010-09-30 6:31 ` Eric Dumazet [this message]
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=1285828299.5211.806.camel@edumazet-laptop \
--to=eric.dumazet@gmail.com \
--cc=davem@davemloft.net \
--cc=jmorris@namei.org \
--cc=kaber@trash.net \
--cc=kuznet@ms2.inr.ac.ru \
--cc=netdev@vger.kernel.org \
--cc=pekkas@netcore.fi \
--cc=xiaosuo@gmail.com \
--cc=yoshfuji@linux-ipv6.org \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).