From: Joe Eykholt <jeykholt@cisco.com>
To: Vasu Dev <vasu.dev@linux.intel.com>
Cc: Andi Kleen <andi@firstfloor.org>,
Robert Love <robert.w.love@intel.com>,
james.bottomley@hansenpartnership.com,
linux-scsi@vger.kernel.org, jgarzik@redhat.com,
davem@davemloft.net, james.smart@emulex.com,
michaelc@cs.wisc.edu, jeffrey.t.kirsher@intel.com
Subject: Re: [PATCH 2/3] libfc: A modular Fibre Channel library
Date: Thu, 11 Dec 2008 18:19:43 -0800 [thread overview]
Message-ID: <4941CA3F.5050904@cisco.com> (raw)
In-Reply-To: <4941C4AE.2000403@linux.intel.com>
Vasu Dev wrote:
> Andi Kleen wrote:
>> On Wed, Dec 10, 2008 at 10:42:28AM -0800, Vasu Dev wrote:
>>
>>> It had load balancing issue but now it is fixed, related latest
>>> submitted code with its updated comment is:-
>>>
>>> /*
>>> * The incoming frame exchange id(oxid) is ANDed with num of online
>>> * cpu bits to get cpu_idx and then this cpu_idx is used for
>>> selecting
>>> * a per cpu kernel thread from fcoe_percpu. In case the cpu is
>>> * offline or no kernel thread for derived cpu_idx then cpu_idx is
>>> * initialize to first online cpu index.
>>> */
>>> cpu_idx = oxid & (num_online_cpus() - 1);
>>>
>>
>> First note that num_online_cpus() is not guaranteed to be a power of two,
>> - 1 is not guaranteed to give a suitable mask. So you might actually
>> lose random bits.
>
> Correct, this will work best for only power of 2 online cpus and that
> would be the most common typical use case. I agree it won't load balance
> better in non power of 2 cpus case.
>
>> Also your load balancing scheme is unusual to say at least. e.g. when
>> you're just talking to a single frame exchange you would always
>> transfer data between CPUs instead of keeping it all on the CPU that
>> processes the interrupt. Normally the rule of thumb is to use local
>> data as much as possible. Or when you distribute like this at least
>> stay in the same socket.
>
> We cannot control what cpu to get interrupted for a FC frame in a
> typical generic NIC, so we may end up receiving mostly all FC frames on
> a single same cpu though system might have several other cpus available.
> In this scenario if frame are passed up to same cpu as suggested above
> then that won't do any load balancing, therefore some sort of load
> balancing is required based on some FC frame attributes here.
>
> As I said in my last response that "performance tuning is yet to be
> done" but you bring up some good related points now of cross socket
> frame migration and balancing on non power of 2 cpus system. These
> should be considered during pending performance tuning but for now I can
> add additional check to select cpu within same socket but not sure how
> to do that, any kernel call for this ? This might cause more locking
> contentions on libfc structs so really we have to experiment these thing
> during performance tuning. Thanks Andi for these hints on performance
> consideration.
>
> Vasu
Somehow the exchange IDs should be allocated so that responses can be
directed back to the same CPU that issued the requests. At one time
the mask was a computed mask such that it was always a power of 2.
If we made a separate exchange ID space per CPU and the LLD allocated the XIDs,
then it could also be sure it was doing the right thing with the responses.
If we used a separate exchange manager for each CPU, there would be less
lock contention in the EM code. As we diffy up the exchange ID space
between the multiple EMs, we could use the high-order bits to be the CPU
number. We would probably want to limit the number of EMs to something
like 8 or 16 even on systems with more CPUs.
Like you said, Vasu, this is for performance tuning time, whenever that is.
However, I think that might be a few releases away or a gradual evolution
since there might be some significant changes involved.
Joe
next prev parent reply other threads:[~2008-12-12 2:19 UTC|newest]
Thread overview: 21+ messages / expand[flat|nested] mbox.gz Atom feed top
2008-12-09 23:10 [PATCH 0/3] Open-FCoE Submission (round 2) Robert Love
2008-12-09 23:10 ` [PATCH 1/3] FC protocol definition header files Robert Love
2008-12-09 23:10 ` [PATCH 2/3] libfc: A modular Fibre Channel library Robert Love
2008-12-10 0:03 ` Andi Kleen
2008-12-10 18:42 ` Vasu Dev
2008-12-10 19:42 ` Andi Kleen
2008-12-12 1:55 ` Vasu Dev
2008-12-12 2:19 ` Joe Eykholt [this message]
2008-12-11 0:44 ` Chris Leech
2008-12-11 0:49 ` Chris Leech
2008-12-11 20:32 ` Zou, Yi
2008-12-11 23:33 ` Andi Kleen
2008-12-09 23:10 ` [PATCH 3/3] fcoe: Fibre Channel over Ethernet Robert Love
2009-02-05 2:24 ` Andrew Morton
2009-02-06 19:05 ` Robert Love
2009-02-06 19:13 ` Andrew Morton
2009-02-06 19:26 ` [PATCH] kernel-doc: preferred ending marker and examples Randy Dunlap
-- strict thread matches above, loose matches on Subject: below --
2008-11-18 22:20 [PATCH 0/3] Open-FCoE Submission Robert Love
2008-11-18 22:21 ` [PATCH 2/3] libfc: A modular Fibre Channel library Robert Love
2008-11-24 10:13 ` Andi Kleen
2008-11-25 20:47 ` Love, Robert W
2008-12-02 23:36 ` Love, Robert W
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=4941CA3F.5050904@cisco.com \
--to=jeykholt@cisco.com \
--cc=andi@firstfloor.org \
--cc=davem@davemloft.net \
--cc=james.bottomley@hansenpartnership.com \
--cc=james.smart@emulex.com \
--cc=jeffrey.t.kirsher@intel.com \
--cc=jgarzik@redhat.com \
--cc=linux-scsi@vger.kernel.org \
--cc=michaelc@cs.wisc.edu \
--cc=robert.w.love@intel.com \
--cc=vasu.dev@linux.intel.com \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.