netdev.vger.kernel.org archive mirror
 help / color / mirror / Atom feed
From: "p.kosyh" <p.kosyh@gmail.com>
To: netdev@vger.kernel.org
Subject: napi and softirq sticking (stuck) solution
Date: Mon, 14 Jul 2014 13:57:39 +0400	[thread overview]
Message-ID: <53C3A993.4050007@gmail.com> (raw)

Hello!

There is one problem (well known?) we have with napi and softirq 
sticking while irq balancing.
We are solved this problem so may be someone will find this information 
useful.

For example, we have some multi-queue ethernet devices. Each tx/rx-queue 
uses own irq.
Lets assume that at start we have not optimal irq affinity and some 
queues irq are
binded to the same CPU.

Then we have a heavy load traffic. So, some irqs are on (for example) 
CPU#1. And we have
100% softirq on that CPU#1. Ethernet driver is working in napi mode, 
because there are always
a lot of packets in queues to poll.

Here, we want to make affinity better!

irq affinity in our situation is managed in realtime by irq balancer. 
There are no many balancers.
We found, that irqbalance and irqd sometimes do fuzzy logic, so, we have 
developed own
balancer that works well. Here it is: http://birq.libcode.org

But we can reproduce problem without balancer, just echo affinity in 
smp_affinity proc entries under
heavy load.

Anyway, under heavy load, after changing smp_affinity we stays with 100% 
softirq at CPU#1, just
because we are still in polling mode (irq disabled) and napi object is 
always scheduled on same CPU#1.

So, under heavy traffic, the irq ballancing is not works at all.

To solve this problem we just break napi mode sometimes in network driver.
For example, e1000e/netdev.c

In e1000e_poll function:
=============
         if (time_is_before_jiffies(adapter->napi_stamp + 
usecs_to_jiffies(netdev_napi_limit)))
                 work_done = 0;
...
         /* If weight not fully consumed, exit the polling mode */
                 if (work_done < weight) {
=============

So, every 1 sec (for example) we are breaking napi mode, and softirq 
will move on another CPU (according smp_affinity).

The bad thing is that we have to patch every network driver. But without 
this we can not use Linux as good router.

So, i hope, this text will be useful.

Thank you.

             reply	other threads:[~2014-07-14  9:49 UTC|newest]

Thread overview: 2+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2014-07-14  9:57 p.kosyh [this message]
2014-07-14 10:24 ` napi and softirq sticking (stuck) solution Eric Dumazet

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=53C3A993.4050007@gmail.com \
    --to=p.kosyh@gmail.com \
    --cc=netdev@vger.kernel.org \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).