From: Eric Dumazet <eric.dumazet@gmail.com>
To: Wei Gu <wei.gu@ericsson.com>
Cc: netdev <netdev@vger.kernel.org>,
Alexander Duyck <alexander.h.duyck@intel.com>,
Jeff Kirsher <jeffrey.t.kirsher@intel.com>
Subject: RE: Low performance Intel 10GE NIC (3.2.10) on 2.6.38 Kernel
Date: Thu, 07 Apr 2011 13:46:51 +0200 [thread overview]
Message-ID: <1302176811.3357.15.camel@edumazet-laptop> (raw)
In-Reply-To: <D12839161ADD3A4B8DA63D1A134D084026E48BA027@ESGSCCMS0001.eapac.ericsson.se>
Le jeudi 07 avril 2011 à 19:15 +0800, Wei Gu a écrit :
> Hi,
> I compile the ixgbe driver into the kernel and run the test again and also change the copy to clone in the fw hook
> This is the perf report while I was forwarding 150Kpps with
> The attached file include the basic info about my test system. Please let me know if I did some thing wrong.
>
> + 71.91% swapper [kernel.kallsyms] [k] poll_idle
> + 10.43% swapper [kernel.kallsyms] [k] intel_idle
> - 8.00% ksoftirqd/24 [kernel.kallsyms] [k] _raw_spin_unlock_irqrestore
> \u2592 - _raw_spin_unlock_irqrestore
> \u2592 - 42.25% alloc_iova
> \u2592 intel_alloc_iova
> \u2592 __intel_map_single
> \u2592 intel_map_page
> \u2592 - dma_map_single_attrs.clone.3
> \u2592 + 59.89% ixgbe_alloc_rx_buffers
> \u2592 - 40.11% ixgbe_xmit_frame_ring
> \u2592 ixgbe_xmit_frame
> \u2592 dev_hard_start_xmit
> \u2592 sch_direct_xmit
> \u2592 dev_queue_xmit
> \u2592 vlan_dev_hard_start_xmit
> \u2592 hook_func
> \u2592 nf_iterate
> \u2592 nf_hook_slow
> \u2592 NF_HOOK.clone.1
> \u2592 ip_rcv
> \u2592 __netif_receive_skb
> \u2592 __netif_receive_skb
> \u2592 netif_receive_skb
> \u2592 napi_skb_finish
> \u2592 napi_gro_receive
> \u2592 ixgbe_clean_rx_irq
> \u2592 ixgbe_clean_rxtx_many
> \u2592 net_rx_action
> \u2592 __do_softirq
> \u2592 + call_softirq
> \u2592 + 36.30% find_iova
> \u2592 + 20.89% add_unmap
> \u2592+ 1.60% kworker/24:1 [kernel.kallsyms] [k] _raw_spin_unlock_irqrestore
> \u2592+ 0.80% swapper [kernel.kallsyms] [k] _raw_spin_unlock_irqrestore
> \u2592+ 0.66% snmpd [kernel.kallsyms] [k] snmp_fold_field
> \u2592+ 0.53% ksoftirqd/24 [kernel.kallsyms] [k] clflush_cache_range
>
>
> If I zoom out to this ksoftirqd/24
> + 80.38% ksoftirqd/24 [kernel.kallsyms] [k] _raw_spin_unlock_irqrestore
> + 5.35% ksoftirqd/24 [kernel.kallsyms] [k] clflush_cache_range
> + 1.49% ksoftirqd/24 [kernel.kallsyms] [k] __domain_mapping
> + 0.84% ksoftirqd/24 [kernel.kallsyms] [k] kmem_cache_alloc
> + 0.55% ksoftirqd/24 [kernel.kallsyms] [k] _raw_spin_lock
> + 0.54% ksoftirqd/24 [kernel.kallsyms] [k] ixgbe_xmit_frame_ring
> + 0.52% ksoftirqd/24 [kernel.kallsyms] [k] ixgbe_clean_rx_irq
> + 0.50% ksoftirqd/24 [kernel.kallsyms] [k] domain_get_iommu
> + 0.49% ksoftirqd/24 [kernel.kallsyms] [k] dma_map_single_attrs.clone.3
> + 0.48% ksoftirqd/24 [kernel.kallsyms] [k] kmem_cache_free
>
> Perf top
>
> ---------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------
> PerfTop: 10615 irqs/sec kernel:99.7% exact: 0.0% [1000Hz cpu-clock-msecs], (all, 64 CPUs)
> ---------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------
>
> samples pcnt function DSO
> _______ _____ _______________________________ __________________________________________________________________________
>
> 11786.00 54.9% intel_idle [kernel.kallsyms]
> 7180.00 33.4% _raw_spin_unlock_irqrestore [kernel.kallsyms]
> 469.00 2.2% clflush_cache_range [kernel.kallsyms]
> 138.00 0.6% __domain_mapping [kernel.kallsyms]
> 81.00 0.4% dso__find_symbol /root/rpmbuild/BUILD/kernel-2.6.38.el6/linux-2.6.38.x86_64/tools/perf/perf
> 73.00 0.3% _raw_spin_lock [kernel.kallsyms]
> 72.00 0.3% dso__load_sym.clone.0 /root/rpmbuild/BUILD/kernel-2.6.38.el6/linux-2.6.38.x86_64/tools/perf/perf
> 68.00 0.3% kmem_cache_alloc [kernel.kallsyms]
> 53.00 0.2% symbol_filter /root/rpmbuild/BUILD/kernel-2.6.38.el6/linux-2.6.38.x86_64/tools/perf/perf
> 51.00 0.2% domain_get_iommu [kernel.kallsyms]
> 44.00 0.2% ixgbe_clean_rx_irq [kernel.kallsyms]
> 42.00 0.2% kmem_cache_free [kernel.kallsyms]
> 42.00 0.2% ixgbe_xmit_frame_ring [kernel.kallsyms]
> 41.00 0.2% ixgbe_clean_tx_irq [kernel.kallsyms]
> 40.00 0.2% dma_map_single_attrs.clone.3 [kernel.kallsyms]
>
>
> Top:
>
> Tasks: 425 total, 2 running, 423 sleeping, 0 stopped, 0 zombie
> Cpu(s): 0.0%us, 0.0%sy, 0.0%ni, 96.0%id, 0.0%wa, 0.0%hi, 3.9%si, 0.0%st
> Mem: 264733684k total, 6374016k used, 258359668k free, 43720k buffers
> Swap: 4194300k total, 0k used, 4194300k free, 137308k cached
>
> PID USER PR NI VIRT RES SHR S %CPU %MEM TIME+ P COMMAND
> 79 root 20 0 0 0 0 R 38.8 0.0 29:22.85 24 ksoftirqd/24
> 233 root 20 0 0 0 0 S 7.6 0.0 4:06.60 24 kworker/24:1
> 1538 root 20 0 0 0 0 S 0.3 0.0 0:00.78 33 kworker/33:3
> 2271 root 20 0 200m 5564 1460 S 0.3 0.0 0:03.31 2 snmpd
>
>
> Thanks
> WeiGu
OK, please send your .config file
next prev parent reply other threads:[~2011-04-07 11:46 UTC|newest]
Thread overview: 58+ messages / expand[flat|nested] mbox.gz Atom feed top
[not found] <D12839161ADD3A4B8DA63D1A134D084026E48B9BEB@ESGSCCMS0001.eapac.ericsson.se>
2011-04-07 4:58 ` Question on "net: allocate skbs on local node" Eric Dumazet
2011-04-07 5:16 ` Eric Dumazet
2011-04-07 6:16 ` Eric Dumazet
2011-04-07 7:22 ` Low performance Intel 10GE NIC (3.2.10) on 2.6.38 Kernel Wei Gu
2011-04-07 8:07 ` Eric Dumazet
2011-04-07 8:39 ` Wei Gu
2011-04-07 9:06 ` Eric Dumazet
2011-04-07 11:15 ` Wei Gu
2011-04-07 11:46 ` Eric Dumazet [this message]
2011-04-07 13:41 ` Eric Dumazet
2011-04-07 15:58 ` Alexander Duyck
2011-04-07 16:03 ` Eric Dumazet
2011-04-07 16:20 ` Alexander Duyck
2011-04-07 16:37 ` Eric Dumazet
2011-04-08 8:59 ` Wei Gu
2011-04-08 9:07 ` Eric Dumazet
2011-04-08 9:15 ` Wei Gu
2011-04-08 9:49 ` Eric Dumazet
2011-04-08 9:59 ` Wei Gu
2011-04-08 9:41 ` Wei Gu
2011-04-08 12:19 ` Wei Gu
2011-04-08 12:56 ` Eric Dumazet
2011-04-08 14:10 ` Wei Gu
2011-04-08 14:49 ` Stephen Hemminger
2011-04-09 3:51 ` Wei Gu
2011-04-08 15:07 ` Eric Dumazet
2011-04-09 3:27 ` Wei Gu
2011-04-09 6:36 ` Eric Dumazet
2011-04-10 7:02 ` Wei Gu
2011-04-11 14:50 ` Alexander Duyck
2011-04-11 15:00 ` Wei Gu
2011-04-11 15:14 ` Wei Gu
2011-04-11 15:42 ` Eric Dumazet
2011-04-12 1:22 ` Wei Gu
2011-04-12 4:40 ` Wei Gu
2011-04-12 4:56 ` Eric Dumazet
2011-04-12 5:18 ` Wei Gu
2011-04-14 5:42 ` Wei Gu
2011-04-14 6:07 ` Eric Dumazet
2011-04-14 6:33 ` Eric Dumazet
2011-04-14 6:58 ` Wei Gu
2011-04-14 16:42 ` Alexander Duyck
2011-04-14 16:45 ` Eric Dumazet
2011-04-14 16:56 ` Peter Zijlstra
2011-04-14 16:57 ` Eric Dumazet
2011-04-14 17:49 ` Eric Dumazet
2011-04-14 19:08 ` Alexander Duyck
2011-04-15 2:10 ` Wei Gu
2011-04-15 8:57 ` Peter Zijlstra
2011-04-15 9:14 ` Wei Gu
2011-04-18 21:12 ` Jesse Brandeburg
2011-04-19 4:09 ` Wei Gu
2011-04-21 2:57 ` Wei Gu
2011-04-21 3:25 ` Wei Gu
2011-04-08 16:22 ` Alexander Duyck
2011-04-09 3:36 ` Wei Gu
2011-04-09 4:40 ` Alexander H Duyck
2011-04-09 6:12 ` Wei Gu
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=1302176811.3357.15.camel@edumazet-laptop \
--to=eric.dumazet@gmail.com \
--cc=alexander.h.duyck@intel.com \
--cc=jeffrey.t.kirsher@intel.com \
--cc=netdev@vger.kernel.org \
--cc=wei.gu@ericsson.com \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox