public inbox for netdev@vger.kernel.org
 help / color / mirror / Atom feed
From: Eric Dumazet <eric.dumazet@gmail.com>
To: Wei Gu <wei.gu@ericsson.com>
Cc: netdev <netdev@vger.kernel.org>,
	Alexander Duyck <alexander.h.duyck@intel.com>,
	Jeff Kirsher <jeffrey.t.kirsher@intel.com>
Subject: RE: Low performance Intel 10GE NIC (3.2.10) on 2.6.38 Kernel
Date: Thu, 07 Apr 2011 13:46:51 +0200	[thread overview]
Message-ID: <1302176811.3357.15.camel@edumazet-laptop> (raw)
In-Reply-To: <D12839161ADD3A4B8DA63D1A134D084026E48BA027@ESGSCCMS0001.eapac.ericsson.se>

Le jeudi 07 avril 2011 à 19:15 +0800, Wei Gu a écrit :
> Hi,
> I compile the ixgbe driver into the kernel and run the test again and also change the copy to clone in the fw hook
> This is the perf report while I was forwarding 150Kpps with
> The attached file include the basic info about my test system. Please let me know if I did some thing wrong.
> 
> +     71.91%          swapper  [kernel.kallsyms]            [k] poll_idle
> +     10.43%          swapper  [kernel.kallsyms]            [k] intel_idle
> -      8.00%     ksoftirqd/24  [kernel.kallsyms]            [k] _raw_spin_unlock_irqrestore
> \u2592   - _raw_spin_unlock_irqrestore
> \u2592      - 42.25% alloc_iova
> \u2592           intel_alloc_iova
> \u2592           __intel_map_single
> \u2592           intel_map_page
> \u2592         - dma_map_single_attrs.clone.3
> \u2592            + 59.89% ixgbe_alloc_rx_buffers
> \u2592            - 40.11% ixgbe_xmit_frame_ring
> \u2592                 ixgbe_xmit_frame
> \u2592                 dev_hard_start_xmit
> \u2592                 sch_direct_xmit
> \u2592                 dev_queue_xmit
> \u2592                 vlan_dev_hard_start_xmit
> \u2592                 hook_func
> \u2592                 nf_iterate
> \u2592                 nf_hook_slow
> \u2592                 NF_HOOK.clone.1
> \u2592                 ip_rcv
> \u2592                 __netif_receive_skb
> \u2592                 __netif_receive_skb
> \u2592                 netif_receive_skb
> \u2592                 napi_skb_finish
> \u2592                 napi_gro_receive
> \u2592                 ixgbe_clean_rx_irq
> \u2592                 ixgbe_clean_rxtx_many
> \u2592                 net_rx_action
> \u2592                 __do_softirq
> \u2592               + call_softirq
> \u2592      + 36.30% find_iova
> \u2592      + 20.89% add_unmap
> \u2592+      1.60%     kworker/24:1  [kernel.kallsyms]            [k] _raw_spin_unlock_irqrestore
> \u2592+      0.80%          swapper  [kernel.kallsyms]            [k] _raw_spin_unlock_irqrestore
> \u2592+      0.66%            snmpd  [kernel.kallsyms]            [k] snmp_fold_field
> \u2592+      0.53%     ksoftirqd/24  [kernel.kallsyms]            [k] clflush_cache_range
> 
> 
> If I zoom out to this ksoftirqd/24
> +     80.38%  ksoftirqd/24  [kernel.kallsyms]  [k] _raw_spin_unlock_irqrestore
> +      5.35%  ksoftirqd/24  [kernel.kallsyms]  [k] clflush_cache_range
> +      1.49%  ksoftirqd/24  [kernel.kallsyms]  [k] __domain_mapping
> +      0.84%  ksoftirqd/24  [kernel.kallsyms]  [k] kmem_cache_alloc
> +      0.55%  ksoftirqd/24  [kernel.kallsyms]  [k] _raw_spin_lock
> +      0.54%  ksoftirqd/24  [kernel.kallsyms]  [k] ixgbe_xmit_frame_ring
> +      0.52%  ksoftirqd/24  [kernel.kallsyms]  [k] ixgbe_clean_rx_irq
> +      0.50%  ksoftirqd/24  [kernel.kallsyms]  [k] domain_get_iommu
> +      0.49%  ksoftirqd/24  [kernel.kallsyms]  [k] dma_map_single_attrs.clone.3
> +      0.48%  ksoftirqd/24  [kernel.kallsyms]  [k] kmem_cache_free
> 
> Perf top
> 
> ---------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------
>    PerfTop:   10615 irqs/sec  kernel:99.7%  exact:  0.0% [1000Hz cpu-clock-msecs],  (all, 64 CPUs)
> ---------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------
> 
>              samples  pcnt function                        DSO
>              _______ _____ _______________________________ __________________________________________________________________________
> 
>             11786.00 54.9% intel_idle                      [kernel.kallsyms]
>              7180.00 33.4% _raw_spin_unlock_irqrestore     [kernel.kallsyms]
>               469.00  2.2% clflush_cache_range             [kernel.kallsyms]
>               138.00  0.6% __domain_mapping                [kernel.kallsyms]
>                81.00  0.4% dso__find_symbol                /root/rpmbuild/BUILD/kernel-2.6.38.el6/linux-2.6.38.x86_64/tools/perf/perf
>                73.00  0.3% _raw_spin_lock                  [kernel.kallsyms]
>                72.00  0.3% dso__load_sym.clone.0           /root/rpmbuild/BUILD/kernel-2.6.38.el6/linux-2.6.38.x86_64/tools/perf/perf
>                68.00  0.3% kmem_cache_alloc                [kernel.kallsyms]
>                53.00  0.2% symbol_filter                   /root/rpmbuild/BUILD/kernel-2.6.38.el6/linux-2.6.38.x86_64/tools/perf/perf
>                51.00  0.2% domain_get_iommu                [kernel.kallsyms]
>                44.00  0.2% ixgbe_clean_rx_irq              [kernel.kallsyms]
>                42.00  0.2% kmem_cache_free                 [kernel.kallsyms]
>                42.00  0.2% ixgbe_xmit_frame_ring           [kernel.kallsyms]
>                41.00  0.2% ixgbe_clean_tx_irq              [kernel.kallsyms]
>                40.00  0.2% dma_map_single_attrs.clone.3    [kernel.kallsyms]
> 
> 
> Top:
> 
> Tasks: 425 total,   2 running, 423 sleeping,   0 stopped,   0 zombie
> Cpu(s):  0.0%us,  0.0%sy,  0.0%ni, 96.0%id,  0.0%wa,  0.0%hi,  3.9%si,  0.0%st
> Mem:  264733684k total,  6374016k used, 258359668k free,    43720k buffers
> Swap:  4194300k total,        0k used,  4194300k free,   137308k cached
> 
>   PID USER      PR  NI  VIRT  RES  SHR S %CPU %MEM    TIME+   P COMMAND
>    79 root      20   0     0    0    0 R 38.8  0.0  29:22.85 24 ksoftirqd/24
>   233 root      20   0     0    0    0 S  7.6  0.0   4:06.60 24 kworker/24:1
>  1538 root      20   0     0    0    0 S  0.3  0.0   0:00.78 33 kworker/33:3
>  2271 root      20   0  200m 5564 1460 S  0.3  0.0   0:03.31  2 snmpd
> 
> 
> Thanks
> WeiGu

OK, please send your .config file




  reply	other threads:[~2011-04-07 11:46 UTC|newest]

Thread overview: 58+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
     [not found] <D12839161ADD3A4B8DA63D1A134D084026E48B9BEB@ESGSCCMS0001.eapac.ericsson.se>
2011-04-07  4:58 ` Question on "net: allocate skbs on local node" Eric Dumazet
2011-04-07  5:16   ` Eric Dumazet
2011-04-07  6:16     ` Eric Dumazet
2011-04-07  7:22       ` Low performance Intel 10GE NIC (3.2.10) on 2.6.38 Kernel Wei Gu
2011-04-07  8:07         ` Eric Dumazet
2011-04-07  8:39           ` Wei Gu
2011-04-07  9:06             ` Eric Dumazet
2011-04-07 11:15               ` Wei Gu
2011-04-07 11:46                 ` Eric Dumazet [this message]
2011-04-07 13:41                   ` Eric Dumazet
2011-04-07 15:58                   ` Alexander Duyck
2011-04-07 16:03                     ` Eric Dumazet
2011-04-07 16:20                       ` Alexander Duyck
2011-04-07 16:37                         ` Eric Dumazet
2011-04-08  8:59                         ` Wei Gu
2011-04-08  9:07                           ` Eric Dumazet
2011-04-08  9:15                             ` Wei Gu
2011-04-08  9:49                               ` Eric Dumazet
2011-04-08  9:59                                 ` Wei Gu
2011-04-08  9:41                             ` Wei Gu
2011-04-08 12:19                             ` Wei Gu
2011-04-08 12:56                               ` Eric Dumazet
2011-04-08 14:10                                 ` Wei Gu
2011-04-08 14:49                                   ` Stephen Hemminger
2011-04-09  3:51                                     ` Wei Gu
2011-04-08 15:07                                   ` Eric Dumazet
2011-04-09  3:27                                     ` Wei Gu
2011-04-09  6:36                                       ` Eric Dumazet
2011-04-10  7:02                                         ` Wei Gu
2011-04-11 14:50                                           ` Alexander Duyck
2011-04-11 15:00                                             ` Wei Gu
2011-04-11 15:14                                             ` Wei Gu
2011-04-11 15:42                                               ` Eric Dumazet
2011-04-12  1:22                                                 ` Wei Gu
2011-04-12  4:40                                                 ` Wei Gu
2011-04-12  4:56                                                   ` Eric Dumazet
2011-04-12  5:18                                                     ` Wei Gu
2011-04-14  5:42                                                 ` Wei Gu
2011-04-14  6:07                                                   ` Eric Dumazet
2011-04-14  6:33                                                     ` Eric Dumazet
2011-04-14  6:58                                                       ` Wei Gu
2011-04-14 16:42                                                         ` Alexander Duyck
2011-04-14 16:45                                                           ` Eric Dumazet
2011-04-14 16:56                                                           ` Peter Zijlstra
2011-04-14 16:57                                                             ` Eric Dumazet
2011-04-14 17:49                                                               ` Eric Dumazet
2011-04-14 19:08                                                                 ` Alexander Duyck
2011-04-15  2:10                                                               ` Wei Gu
2011-04-15  8:57                                                               ` Peter Zijlstra
2011-04-15  9:14                                                                 ` Wei Gu
2011-04-18 21:12                                                                   ` Jesse Brandeburg
2011-04-19  4:09                                                                     ` Wei Gu
2011-04-21  2:57                                                                     ` Wei Gu
2011-04-21  3:25                                                                     ` Wei Gu
2011-04-08 16:22                               ` Alexander Duyck
2011-04-09  3:36                                 ` Wei Gu
2011-04-09  4:40                                   ` Alexander H Duyck
2011-04-09  6:12                                     ` Wei Gu

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=1302176811.3357.15.camel@edumazet-laptop \
    --to=eric.dumazet@gmail.com \
    --cc=alexander.h.duyck@intel.com \
    --cc=jeffrey.t.kirsher@intel.com \
    --cc=netdev@vger.kernel.org \
    --cc=wei.gu@ericsson.com \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox