From mboxrd@z Thu Jan  1 00:00:00 1970
From: Eric Dumazet <dada1@cosmosbay.com>
Subject: Re: [PATCH] convert hh_lock to seqlock
Date: Fri, 08 Dec 2006 00:02:44 +0100
Message-ID: <45789D94.3050509@cosmosbay.com>
References: <20061207113309.2f892cf1@localhost>
 <4578782B.4010908@cosmosbay.com> <20061207131553.45e4388b@localhost>
 <45789534.1050703@cosmosbay.com> <20061207145300.3a5a21a6@localhost>
Mime-Version: 1.0
Content-Type: text/plain; charset=UTF-8;
	format=flowed
Content-Transfer-Encoding: QUOTED-PRINTABLE
Cc: David Miller <davem@davemloft.net>, netdev@vger.kernel.org
Return-path: <netdev-owner@vger.kernel.org>
Received: from sp604005mt.neufgp.fr ([84.96.92.11]:44736 "EHLO smtp.Neuf.fr"
	rhost-flags-OK-OK-OK-FAIL) by vger.kernel.org with ESMTP
	id S1163909AbWLGXFt (ORCPT <rfc822;netdev@vger.kernel.org>);
	Thu, 7 Dec 2006 18:05:49 -0500
Received: from [192.168.30.10] ([84.7.37.114]) by sp604005mt.gpm.neuf.ld
 (Sun Java System Messaging Server 6.2-5.05 (built Feb 16 2006))
 with ESMTP id <0J9X00LABEO1YMN0@sp604005mt.gpm.neuf.ld> for
 netdev@vger.kernel.org; Fri, 08 Dec 2006 00:02:26 +0100 (CET)
In-reply-to: <20061207145300.3a5a21a6@localhost>
To: Stephen Hemminger <shemminger@osdl.org>
Sender: netdev-owner@vger.kernel.org
List-Id: netdev.vger.kernel.org

Stephen Hemminger a =C3=A9crit :
> On Thu, 07 Dec 2006 23:27:00 +0100
> Eric Dumazet <dada1@cosmosbay.com> wrote:
>=20
>> Stephen Hemminger a =C3=A9crit :
>>> On Thu, 07 Dec 2006 21:23:07 +0100
>>> Eric Dumazet <dada1@cosmosbay.com> wrote:
>>>
>>>> Stephen Hemminger a =C3=A9crit :
>>>>> The hard header cache is in the main output path, so using
>>>>> seqlock instead of reader/writer lock should reduce overhead.
>>>>>
>>>> Nice work Stephen, I am very interested.
>>>>
>>>> Did you benchmarked it ?
>>>>
>>>> I ask because I think hh_refcnt frequent changes may defeat the ga=
in you want=20
>>>> (ie avoiding cache line ping pongs between cpus). seqlock are defi=
nitly better=20
>>>> than rwlock, but if we really keep cache lines shared.
>>>>
>>>> So I would suggest reordering fields of hh_cache and adding one=20
>>>> ____cacheline_aligned_in_smp to keep hh_refcnt in another cache li=
ne.
>>>>
>>>> (hh_len, hh_lock and hh_data should be placed on a 'mostly read' c=
ache line)
>>>>
>>>> Thank you
>>>> Eric
>>> It doesn't make any visible performance difference for real network=
s;=20
>>> copies and device issues are much larger.
>> Hum, so 'my' machines must be unreal :)
>>
>>> The hh_refcnt is used only when creating destroying neighbor entrie=
s,
>>> so except under DoS attack it doesn't make a lot of difference.
>>> The hh_lock is used on each packet sent.
>> Some machines create/delete 10.000 entries per second in rt_cache.
>> I believe they are real. DoS ? you tell it, some people wont agree.
>=20
>=20
> That could be fixed by doing RCU, I did some of that previously, but =
it
> seemed better to hit the worst case first.  Even Robert doesn't see 1=
0,000
> rt cache entries per second.

What's the problem with my suggestion of keeping hh_refcnt on another c=
ache line ?
It is basically free (once your change from rwlock to seqlock put in), =
and no=20
change of algorithm.