From: jamal <hadi@cyberus.ca>
To: Eric Dumazet <eric.dumazet@gmail.com>
Cc: Changli Gao <xiaosuo@gmail.com>,
"David S. Miller" <davem@davemloft.net>,
Tom Herbert <therbert@google.com>,
Stephen Hemminger <shemminger@vyatta.com>,
netdev@vger.kernel.org
Subject: Re: [PATCH v6] net: batch skb dequeueing from softnet input_pkt_queue
Date: Sat, 24 Apr 2010 10:10:52 -0400 [thread overview]
Message-ID: <1272118252.8918.13.camel@bigi> (raw)
In-Reply-To: <1272060153.8918.8.camel@bigi>
[-- Attachment #1: Type: text/plain, Size: 203 bytes --]
On Fri, 2010-04-23 at 18:02 -0400, jamal wrote:
> Ive done a setup with the last patch from Changli + net-next - I will
> post test results tomorrow AM.
ok, annotated results attached.
cheers,
jamal
[-- Attachment #2: summary-apr23.txt --]
[-- Type: text/plain, Size: 45513 bytes --]
sink cpu all cpuint cpuapp
nn-standalone 93.95% 84.5% 99.8% 79.8%
nn-rps 96.41% 85.4% 95.5% 82.5%
nn-cl 97.29% 84.0% 99.9% 79.6%
nn-cl-rps 97.76% 86.5% 96.5% 84.8%
nn-standalone: Basic net-next from Apr23
nn-rps: Basic net-next from Apr23 with rps mask ee and irq affinity to cpu0
nn-cl: Basic net-next from Apr23 + Changli patch
nn-cl-rps: Basic net-next from Apr23 + Changli patch + rps mask ee,irq aff cpu0
sink: the amount of traffic the system was able to sink in.
cpu all: avg % system cpu consumed in test
cpuint: avg %cpu consumed by the cpu where interrupts happened
cpuapp: avg %cpu consumed by a sample cpu which did app processing
Testing was as previously explained..
I repeated each test 4-5 times and took averages..
It seems the non-rps case has improved drammatically since the last
net-next i tested. The rps case has also improved but the gap between
rps and non-rps is smaller.
[There are just too many variables for me to pinpoint
to one item as being the contributor. For example sky2 driver may
have become worse (consumes more cycles) but i cant quantify it yet
(i just see sky2_rx_submit showing up higher in profiles than before).
Also call_function_single_interrupt shows up prominently on application
processing CPUs but improved by Changli's changes].
After doing the math, I dont trust my results after applying Changlis patch.
It seems both the rps and non-rps case have gotten better (and i dont
see Changlis contribution to non-rps). It also seems that the gap between
rps and non-rps is non-existent now. In other words, there is no benefit to
using rps (it consumes more cpu for the same throughput). So it is likely
that i need to repeat these tests; maybe i did something wrong in my setup...
And here are the profiles:
--------------------------
cpu0 always received all the interrupts regardless of the tests.
cpu1, 7 etc were processing apps..
I could not spot much difference between before and after Changli's
I: Test setup : nn-standalone: Basic net-next from Apr23
All cpus
-------------------------------------------------------------------------------
PerfTop: 3784 irqs/sec kernel:84.2% [1000Hz cycles], (all, 8 CPUs)
-------------------------------------------------------------------------------
samples pcnt function DSO
_______ _____ ___________________________ ________
3254.00 10.3% sky2_poll [sky2]
1853.00 5.9% _raw_spin_lock_irqsave [kernel]
872.00 2.8% fget [kernel]
870.00 2.8% copy_user_generic_string [kernel]
819.00 2.6% _raw_spin_unlock_irqrestore [kernel]
729.00 2.3% sys_epoll_ctl [kernel]
701.00 2.2% datagram_poll [kernel]
615.00 2.0% udp_recvmsg [kernel]
602.00 1.9% _raw_spin_lock_bh [kernel]
595.00 1.9% system_call [kernel]
592.00 1.9% kmem_cache_free [kernel]
574.00 1.8% schedule [kernel]
568.00 1.8% _raw_spin_lock [kernel]
-------------------------------------------------------------------------------
PerfTop: 3574 irqs/sec kernel:85.1% [1000Hz cycles], (all, 8 CPUs)
-------------------------------------------------------------------------------
samples pcnt function DSO
_______ _____ ___________________________ ________
5023.00 10.9% sky2_poll [sky2]
2762.00 6.0% _raw_spin_lock_irqsave [kernel]
1319.00 2.9% copy_user_generic_string [kernel]
1306.00 2.8% fget [kernel]
1198.00 2.6% _raw_spin_unlock_irqrestore [kernel]
1071.00 2.3% datagram_poll [kernel]
1061.00 2.3% sys_epoll_ctl [kernel]
927.00 2.0% _raw_spin_lock_bh [kernel]
917.00 2.0% system_call [kernel]
901.00 1.9% udp_recvmsg [kernel]
895.00 1.9% kmem_cache_free [kernel]
819.00 1.8% _raw_spin_lock [kernel]
802.00 1.7% schedule [kernel]
774.00 1.7% sys_epoll_wait [kernel]
720.00 1.6% kmem_cache_alloc [kernel]
-------------------------------------------------------------------------------
PerfTop: 1000 irqs/sec kernel:100.0% [1000Hz cycles], (all, cpu: 0)
-------------------------------------------------------------------------------
samples pcnt function DSO
_______ _____ ______________________ ________
751.00 36.1% sky2_poll [sky2]
108.00 5.2% __udp4_lib_lookup [kernel]
95.00 4.6% ip_route_input [kernel]
83.00 4.0% _raw_spin_lock [kernel]
79.00 3.8% _raw_spin_lock_irqsave [kernel]
77.00 3.7% __netif_receive_skb [kernel]
77.00 3.7% __alloc_skb [kernel]
66.00 3.2% ip_rcv [kernel]
60.00 2.9% __udp4_lib_rcv [kernel]
54.00 2.6% sock_queue_rcv_skb [kernel]
45.00 2.2% sky2_rx_submit [sky2]
42.00 2.0% __wake_up_common [kernel]
40.00 1.9% __kmalloc [kernel]
39.00 1.9% sock_def_readable [kernel]
30.00 1.4% ep_poll_callback [kernel]
-------------------------------------------------------------------------------
PerfTop: 1001 irqs/sec kernel:99.8% [1000Hz cycles], (all, cpu: 0)
-------------------------------------------------------------------------------
samples pcnt function DSO
_______ _____ ______________________ ________
3511.00 36.7% sky2_poll [sky2]
519.00 5.4% __udp4_lib_lookup [kernel]
431.00 4.5% ip_route_input [kernel]
353.00 3.7% _raw_spin_lock_irqsave [kernel]
351.00 3.7% __alloc_skb [kernel]
338.00 3.5% __netif_receive_skb [kernel]
337.00 3.5% _raw_spin_lock [kernel]
307.00 3.2% ip_rcv [kernel]
264.00 2.8% sky2_rx_submit [sky2]
254.00 2.7% sock_queue_rcv_skb [kernel]
246.00 2.6% __udp4_lib_rcv [kernel]
206.00 2.2% sock_def_readable [kernel]
177.00 1.9% __wake_up_common [kernel]
168.00 1.8% __kmalloc [kernel]
-------------------------------------------------------------------------------
PerfTop: 908 irqs/sec kernel:80.0% [1000Hz cycles], (all, cpu: 1)
-------------------------------------------------------------------------------
samples pcnt function DSO
_______ _____ ___________________________ ________
177.00 6.7% _raw_spin_lock_irqsave [kernel]
120.00 4.5% copy_user_generic_string [kernel]
110.00 4.2% fget [kernel]
108.00 4.1% datagram_poll [kernel]
98.00 3.7% _raw_spin_lock_bh [kernel]
91.00 3.4% sys_epoll_ctl [kernel]
89.00 3.4% kmem_cache_free [kernel]
77.00 2.9% system_call [kernel]
76.00 2.9% schedule [kernel]
76.00 2.9% _raw_spin_unlock_irqrestore [kernel]
63.00 2.4% fput [kernel]
61.00 2.3% sys_epoll_wait [kernel]
61.00 2.3% udp_recvmsg [kernel]
49.00 1.8% process_recv mcpudp
-------------------------------------------------------------------------------
PerfTop: 815 irqs/sec kernel:79.8% [1000Hz cycles], (all, cpu: 1)
-------------------------------------------------------------------------------
samples pcnt function DSO
_______ _____ ___________________________ _________________
491.00 8.0% _raw_spin_lock_irqsave [kernel.kallsyms]
285.00 4.7% copy_user_generic_string [kernel.kallsyms]
252.00 4.1% fget [kernel.kallsyms]
215.00 3.5% datagram_poll [kernel.kallsyms]
206.00 3.4% _raw_spin_unlock_irqrestore [kernel.kallsyms]
204.00 3.3% sys_epoll_ctl [kernel.kallsyms]
196.00 3.2% _raw_spin_lock_bh [kernel.kallsyms]
184.00 3.0% udp_recvmsg [kernel.kallsyms]
184.00 3.0% kmem_cache_free [kernel.kallsyms]
180.00 2.9% system_call [kernel.kallsyms]
168.00 2.7% sys_epoll_wait [kernel.kallsyms]
159.00 2.6% schedule [kernel.kallsyms]
144.00 2.4% fput [kernel.kallsyms]
II: Test setup
nn-rps: Basic net-next from Apr23 with rps mask ee and irq affinity to cpu0
-------------------------------------------------------------------------------
PerfTop: 3558 irqs/sec kernel:85.0% [1000Hz cycles], (all, 8 CPUs)
-------------------------------------------------------------------------------
samples pcnt function DSO
_______ _____ ______________________________ ________
3519.00 15.9% sky2_poll [sky2]
865.00 3.9% _raw_spin_lock_irqsave [kernel]
568.00 2.6% _raw_spin_unlock_irqrestore [kernel]
526.00 2.4% sky2_intr [sky2]
493.00 2.2% __netif_receive_skb [kernel]
477.00 2.2% _raw_spin_lock [kernel]
470.00 2.1% ip_rcv [kernel]
456.00 2.1% fget [kernel]
447.00 2.0% sys_epoll_ctl [kernel]
420.00 1.9% copy_user_generic_string [kernel]
387.00 1.8% ip_route_input [kernel]
359.00 1.6% system_call [kernel]
334.00 1.5% kmem_cache_free [kernel]
310.00 1.4% kmem_cache_alloc [kernel]
302.00 1.4% call_function_single_interrupt [kernel]
-------------------------------------------------------------------------------
PerfTop: 3546 irqs/sec kernel:85.8% [1000Hz cycles], (all, 8 CPUs)
-------------------------------------------------------------------------------
samples pcnt function DSO
_______ _____ ______________________________ ________
6592.00 16.2% sky2_poll [sky2]
1540.00 3.8% _raw_spin_lock_irqsave [kernel]
1014.00 2.5% _raw_spin_unlock_irqrestore [kernel]
885.00 2.2% fget [kernel]
881.00 2.2% _raw_spin_lock [kernel]
880.00 2.2% sky2_intr [sky2]
872.00 2.1% __netif_receive_skb [kernel]
858.00 2.1% ip_rcv [kernel]
802.00 2.0% sys_epoll_ctl [kernel]
710.00 1.7% copy_user_generic_string [kernel]
696.00 1.7% system_call [kernel]
692.00 1.7% ip_route_input [kernel]
634.00 1.6% schedule [kernel]
618.00 1.5% kmem_cache_free [kernel]
605.00 1.5% call_function_single_interrupt [kernel]
cpu0
-------------------------------------------------------------------------------
PerfTop: 971 irqs/sec kernel:96.5% [1000Hz cycles], (all, cpu: 0)
-------------------------------------------------------------------------------
samples pcnt function DSO
_______ _____ ___________________________ ________
4222.00 58.2% sky2_poll [sky2]
668.00 9.2% sky2_intr [sky2]
228.00 3.1% __alloc_skb [kernel]
183.00 2.5% get_rps_cpu [kernel]
138.00 1.9% sky2_rx_submit [sky2]
124.00 1.7% enqueue_to_backlog [kernel]
119.00 1.6% __kmalloc [kernel]
103.00 1.4% kmem_cache_alloc [kernel]
91.00 1.3% _raw_spin_lock [kernel]
90.00 1.2% _raw_spin_lock_irqsave [kernel]
73.00 1.0% swiotlb_sync_single [kernel]
72.00 1.0% irq_entries_start [kernel]
55.00 0.8% copy_user_generic_string [kernel]
53.00 0.7% _raw_spin_unlock_irqrestore [kernel]
48.00 0.7% fget [kernel]
-------------------------------------------------------------------------------
PerfTop: 998 irqs/sec kernel:94.8% [1000Hz cycles], (all, cpu: 0)
-------------------------------------------------------------------------------
samples pcnt function DSO
_______ _____ ___________________________ ________
6745.00 58.5% sky2_poll [sky2]
831.00 7.2% sky2_intr [sky2]
352.00 3.1% __alloc_skb [kernel]
281.00 2.4% get_rps_cpu [kernel]
226.00 2.0% sky2_rx_submit [sky2]
186.00 1.6% __kmalloc [kernel]
181.00 1.6% enqueue_to_backlog [kernel]
173.00 1.5% _raw_spin_lock_irqsave [kernel]
166.00 1.4% kmem_cache_alloc [kernel]
162.00 1.4% _raw_spin_lock [kernel]
99.00 0.9% swiotlb_sync_single [kernel]
98.00 0.9% irq_entries_start [kernel]
94.00 0.8% fget [kernel]
92.00 0.8% _raw_spin_unlock_irqrestore [kernel]
80.00 0.7% system_call [kernel]
cpu1
-------------------------------------------------------------------------------
PerfTop: 724 irqs/sec kernel:82.0% [1000Hz cycles], (all, cpu: 1)
-------------------------------------------------------------------------------
samples pcnt function DSO
_______ _____ ______________________________ _________________
204.00 5.3% _raw_spin_lock_irqsave [kernel.kallsyms]
153.00 4.0% _raw_spin_unlock_irqrestore [kernel.kallsyms]
147.00 3.8% call_function_single_interrupt [kernel.kallsyms]
139.00 3.6% __netif_receive_skb [kernel.kallsyms]
135.00 3.5% sys_epoll_ctl [kernel.kallsyms]
132.00 3.4% ip_rcv [kernel.kallsyms]
129.00 3.3% fget [kernel.kallsyms]
128.00 3.3% _raw_spin_lock [kernel.kallsyms]
122.00 3.2% system_call [kernel.kallsyms]
118.00 3.1% ip_route_input [kernel.kallsyms]
109.00 2.8% kmem_cache_free [kernel.kallsyms]
108.00 2.8% copy_user_generic_string [kernel.kallsyms]
90.00 2.3% schedule [kernel.kallsyms]
85.00 2.2% fput [kernel.kallsyms]
-------------------------------------------------------------------------------
PerfTop: 763 irqs/sec kernel:83.0% [1000Hz cycles], (all, cpu: 1)
-------------------------------------------------------------------------------
samples pcnt function DSO
_______ _____ ______________________________ _________________
428.00 6.2% _raw_spin_lock_irqsave [kernel.kallsyms]
302.00 4.4% _raw_spin_unlock_irqrestore [kernel.kallsyms]
269.00 3.9% __netif_receive_skb [kernel.kallsyms]
258.00 3.7% call_function_single_interrupt [kernel.kallsyms]
254.00 3.7% fget [kernel.kallsyms]
238.00 3.4% ip_rcv [kernel.kallsyms]
230.00 3.3% sys_epoll_ctl [kernel.kallsyms]
222.00 3.2% _raw_spin_lock [kernel.kallsyms]
220.00 3.2% ip_route_input [kernel.kallsyms]
197.00 2.9% system_call [kernel.kallsyms]
189.00 2.7% kmem_cache_free [kernel.kallsyms]
184.00 2.7% copy_user_generic_string [kernel.kallsyms]
144.00 2.1% ep_remove [kernel.kallsyms]
140.00 2.0% schedule [kernel.kallsyms]
-------------------------------------------------------------------------------
PerfTop: 546 irqs/sec kernel:83.3% [1000Hz cycles], (all, cpu: 1)
-------------------------------------------------------------------------------
samples pcnt function DSO
_______ _____ ______________________________ _________________
346.00 5.7% _raw_spin_lock_irqsave [kernel.kallsyms]
275.00 4.6% _raw_spin_unlock_irqrestore [kernel.kallsyms]
238.00 3.9% call_function_single_interrupt [kernel.kallsyms]
228.00 3.8% fget [kernel.kallsyms]
222.00 3.7% __netif_receive_skb [kernel.kallsyms]
219.00 3.6% sys_epoll_ctl [kernel.kallsyms]
209.00 3.5% _raw_spin_lock [kernel.kallsyms]
205.00 3.4% ip_rcv [kernel.kallsyms]
199.00 3.3% ip_route_input [kernel.kallsyms]
173.00 2.9% system_call [kernel.kallsyms]
170.00 2.8% copy_user_generic_string [kernel.kallsyms]
167.00 2.8% kmem_cache_free [kernel.kallsyms]
127.00 2.1% ep_remove [kernel.kallsyms]
123.00 2.0% dst_release [kernel.kalls
III: Test setup
nn-cl: Basic net-next from Apr23 + Changli patch
-------------------------------------------------------------------------------
PerfTop: 3789 irqs/sec kernel:84.1% [1000Hz cycles], (all, 8 CPUs)
-------------------------------------------------------------------------------
samples pcnt function DSO
_______ _____ ___________________________ ____________________
3514.00 10.2% sky2_poll [sky2]
1862.00 5.4% _raw_spin_lock_irqsave [kernel]
1274.00 3.7% system_call [kernel]
926.00 2.7% fget [kernel]
872.00 2.5% _raw_spin_unlock_irqrestore [kernel]
862.00 2.5% copy_user_generic_string [kernel]
766.00 2.2% sys_epoll_ctl [kernel]
765.00 2.2% datagram_poll [kernel]
671.00 2.0% _raw_spin_lock_bh [kernel]
668.00 1.9% kmem_cache_free [kernel]
602.00 1.8% udp_recvmsg [kernel]
586.00 1.7% _raw_spin_lock [kernel]
585.00 1.7% vread_tsc [kernel].vsyscall_fn
-------------------------------------------------------------------------------
PerfTop: 3794 irqs/sec kernel:83.6% [1000Hz cycles], (all, 8 CPUs)
-------------------------------------------------------------------------------
samples pcnt function DSO
_______ _____ ___________________________ ____________________
4756.00 9.8% sky2_poll [sky2]
2742.00 5.7% _raw_spin_lock_irqsave [kernel]
1826.00 3.8% system_call [kernel]
1285.00 2.7% fget [kernel]
1284.00 2.7% copy_user_generic_string [kernel]
1235.00 2.6% _raw_spin_unlock_irqrestore [kernel]
1096.00 2.3% sys_epoll_ctl [kernel]
1071.00 2.2% datagram_poll [kernel]
954.00 2.0% kmem_cache_free [kernel]
925.00 1.9% _raw_spin_lock_bh [kernel]
888.00 1.8% vread_tsc [kernel].vsyscall_fn
880.00 1.8% udp_recvmsg [kernel]
793.00 1.6% _raw_spin_lock [kernel]
790.00 1.6% schedule [kernel]
-------------------------------------------------------------------------------
PerfTop: 1001 irqs/sec kernel:99.9% [1000Hz cycles], (all, cpu: 0)
-------------------------------------------------------------------------------
samples pcnt function DSO
_______ _____ ______________________ ________
675.00 32.6% sky2_poll [sky2]
116.00 5.6% __udp4_lib_lookup [kernel]
111.00 5.4% ip_route_input [kernel]
81.00 3.9% _raw_spin_lock_irqsave [kernel]
81.00 3.9% _raw_spin_lock [kernel]
70.00 3.4% __alloc_skb [kernel]
67.00 3.2% ip_rcv [kernel]
66.00 3.2% __netif_receive_skb [kernel]
61.00 2.9% __udp4_lib_rcv [kernel]
57.00 2.8% sock_queue_rcv_skb [kernel]
47.00 2.3% sock_def_readable [kernel]
42.00 2.0% __kmalloc [kernel]
42.00 2.0% __wake_up_common [kernel]
38.00 1.8% sky2_rx_submit [sky2]
-------------------------------------------------------------------------------
PerfTop: 1001 irqs/sec kernel:100.0% [1000Hz cycles], (all, cpu: 0)
-------------------------------------------------------------------------------
samples pcnt function DSO
_______ _____ ______________________ ________
2526.00 32.8% sky2_poll [sky2]
406.00 5.3% ip_route_input [kernel]
399.00 5.2% __udp4_lib_lookup [kernel]
328.00 4.3% _raw_spin_lock_irqsave [kernel]
307.00 4.0% _raw_spin_lock [kernel]
296.00 3.8% ip_rcv [kernel]
287.00 3.7% __alloc_skb [kernel]
272.00 3.5% sock_queue_rcv_skb [kernel]
224.00 2.9% __udp4_lib_rcv [kernel]
224.00 2.9% __netif_receive_skb [kernel]
182.00 2.4% sock_def_readable [kernel]
163.00 2.1% __wake_up_common [kernel]
140.00 1.8% sky2_rx_submit [sky2]
-------------------------------------------------------------------------------
PerfTop: 1001 irqs/sec kernel:100.0% [1000Hz cycles], (all, cpu: 0)
-------------------------------------------------------------------------------
samples pcnt function DSO
_______ _____ ______________________ ________
4445.00 33.4% sky2_poll [sky2]
707.00 5.3% __udp4_lib_lookup [kernel]
662.00 5.0% ip_route_input [kernel]
567.00 4.3% _raw_spin_lock_irqsave [kernel]
512.00 3.8% __alloc_skb [kernel]
506.00 3.8% ip_rcv [kernel]
476.00 3.6% sock_queue_rcv_skb [kernel]
473.00 3.6% _raw_spin_lock [kernel]
415.00 3.1% __udp4_lib_rcv [kernel]
408.00 3.1% __netif_receive_skb [kernel]
306.00 2.3% sock_def_readable [kernel]
272.00 2.0% __wake_up_common [kernel]
260.00 2.0% __kmalloc [kernel]
216.00 1.6% _raw_read_lock [kernel]
214.00 1.6% sky2_rx_submit [sky2]
-------------------------------------------------------------------------------
PerfTop: 748 irqs/sec kernel:80.9% [1000Hz cycles], (all, cpu: 1)
-------------------------------------------------------------------------------
samples pcnt function DSO
_______ _____ ___________________________ ____________________
244.00 7.4% _raw_spin_lock_irqsave [kernel]
207.00 6.2% system_call [kernel]
127.00 3.8% _raw_spin_unlock_irqrestore [kernel]
124.00 3.7% copy_user_generic_string [kernel]
122.00 3.7% sys_epoll_ctl [kernel]
120.00 3.6% fget [kernel]
118.00 3.6% datagram_poll [kernel]
96.00 2.9% schedule [kernel]
94.00 2.8% _raw_spin_lock_bh [kernel]
86.00 2.6% vread_tsc [kernel].vsyscall_fn
82.00 2.5% udp_recvmsg [kernel]
76.00 2.3% fput [kernel]
73.00 2.2% kmem_cache_free [kernel]
67.00 2.0% sys_epoll_wait [kernel]
-------------------------------------------------------------------------------
PerfTop: 625 irqs/sec kernel:78.6% [1000Hz cycles], (all, cpu: 1)
-------------------------------------------------------------------------------
samples pcnt function DSO
_______ _____ ___________________________ ____________________
488.00 7.5% _raw_spin_lock_irqsave [kernel]
380.00 5.9% system_call [kernel]
274.00 4.2% copy_user_generic_string [kernel]
252.00 3.9% fget [kernel]
244.00 3.8% datagram_poll [kernel]
217.00 3.3% _raw_spin_unlock_irqrestore [kernel]
211.00 3.3% sys_epoll_ctl [kernel]
186.00 2.9% schedule [kernel]
185.00 2.9% _raw_spin_lock_bh [kernel]
173.00 2.7% udp_recvmsg [kernel]
169.00 2.6% vread_tsc [kernel].vsyscall_fn
164.00 2.5% kmem_cache_free [kernel]
143.00 2.2% fput [kernel]
133.00 2.1% sys_epoll_wait [kernel]
IV: Test setup
nn-cl-rps: Basic net-next from Apr23 + Changli patch + rps mask ee,irq aff
--------------------------------------------------------------------------
PerfTop: 3043 irqs/sec kernel:87.5% [1000Hz cycles], (all, 8 CPUs)
--------------------------------------------------------------------------
samples pcnt function DSO
_______ _____ __________________________ ____________________
2240.00 20.4% sky2_poll [sky2]
375.00 3.4% _raw_spin_lock_irqsave [kernel]
335.00 3.0% sky2_intr [sky2]
326.00 3.0% system_call [kernel]
239.00 2.2% _raw_spin_unlock_irqrestor [kernel]
224.00 2.0% ip_rcv [kernel]
201.00 1.8% __netif_receive_skb [kernel]
198.00 1.8% sys_epoll_ctl [kernel]
190.00 1.7% _raw_spin_lock [kernel]
182.00 1.7% fget [kernel]
169.00 1.5% copy_user_generic_string [kernel]
165.00 1.5% kmem_cache_free [kernel]
149.00 1.4% load_balance [kernel]
146.00 1.3% ip_route_input [kernel]
--------------------------------------------------------------------------
PerfTop: 3210 irqs/sec kernel:85.8% [1000Hz cycles], (all, 8 CPUs)
--------------------------------------------------------------------------
samples pcnt function DSO
_______ _____ __________________________ ____________________
6539.00 20.4% sky2_poll [sky2]
1106.00 3.4% _raw_spin_lock_irqsave [kernel]
1014.00 3.2% sky2_intr [sky2]
976.00 3.0% system_call [kernel]
684.00 2.1% _raw_spin_unlock_irqrestor [kernel]
611.00 1.9% ip_rcv [kernel]
601.00 1.9% fget [kernel]
593.00 1.8% _raw_spin_lock [kernel]
592.00 1.8% sys_epoll_ctl [kernel]
574.00 1.8% __netif_receive_skb [kernel]
526.00 1.6% copy_user_generic_string [kernel]
482.00 1.5% kmem_cache_free [kernel]
480.00 1.5% ip_route_input [kernel]
425.00 1.3% vread_tsc [kernel].vsyscall_fn
410.00 1.3% kmem_cache_alloc [kernel]
--------------------------------------------------------------------------
PerfTop: 999 irqs/sec kernel:97.2% [1000Hz cycles], (all, cpu: 0)
--------------------------------------------------------------------------
samples pcnt function DSO
_______ _____ ___________________________ ________
2035.00 60.5% sky2_poll [sky2]
302.00 9.0% sky2_intr [sky2]
109.00 3.2% __alloc_skb [kernel]
57.00 1.7% _raw_spin_lock [kernel]
57.00 1.7% get_rps_cpu [kernel]
52.00 1.5% __kmalloc [kernel]
51.00 1.5% enqueue_to_backlog [kernel]
49.00 1.5% _raw_spin_lock_irqsave [kernel]
44.00 1.3% kmem_cache_alloc [kernel]
34.00 1.0% sky2_rx_submit [sky2]
33.00 1.0% swiotlb_sync_single [kernel]
31.00 0.9% system_call [kernel]
28.00 0.8% irq_entries_start [kernel]
22.00 0.7% _raw_spin_unlock_irqrestore [kernel]
21.00 0.6% sky2_remove [sky2]
--------------------------------------------------------------------------
PerfTop: 1000 irqs/sec kernel:96.2% [1000Hz cycles], (all, cpu: 0)
--------------------------------------------------------------------------
samples pcnt function DSO
_______ _____ ___________________________ ________
5493.00 60.1% sky2_poll [sky2]
803.00 8.8% sky2_intr [sky2]
281.00 3.1% __alloc_skb [kernel]
233.00 2.6% get_rps_cpu [kernel]
136.00 1.5% enqueue_to_backlog [kernel]
132.00 1.4% __kmalloc [kernel]
126.00 1.4% _raw_spin_lock [kernel]
122.00 1.3% kmem_cache_alloc [kernel]
122.00 1.3% _raw_spin_lock_irqsave [kernel]
102.00 1.1% swiotlb_sync_single [kernel]
88.00 1.0% sky2_rx_submit [sky2]
77.00 0.8% system_call [kernel]
69.00 0.8% irq_entries_start [kernel]
55.00 0.6% _raw_spin_unlock_irqrestore [kernel]
54.00 0.6% copy_user_generic_string [kernel]
--------------------------------------------------------------------------
PerfTop: 999 irqs/sec kernel:97.5% [1000Hz cycles], (all, cpu: 0)
--------------------------------------------------------------------------
samples pcnt function DSO
_______ _____ ___________________________ ________
6699.00 60.1% sky2_poll [sky2]
988.00 8.9% sky2_intr [sky2]
327.00 2.9% __alloc_skb [kernel]
261.00 2.3% get_rps_cpu [kernel]
168.00 1.5% __kmalloc [kernel]
161.00 1.4% kmem_cache_alloc [kernel]
160.00 1.4% enqueue_to_backlog [kernel]
157.00 1.4% _raw_spin_lock [kernel]
125.00 1.1% _raw_spin_lock_irqsave [kernel]
122.00 1.1% swiotlb_sync_single [kernel]
114.00 1.0% sky2_rx_submit [sky2]
96.00 0.9% system_call [kernel]
85.00 0.8% irq_entries_start [kernel]
66.00 0.6% sky2_remove [sky2]
64.00 0.6% _raw_spin_unlock_irqrestore [kernel]
--------------------------------------------------------------------------
PerfTop: 420 irqs/sec kernel:84.8% [1000Hz cycles], (all, cpu: 2)
--------------------------------------------------------------------------
samples pcnt function DSO
_______ _____ __________________________ ____________________
188.00 4.8% _raw_spin_lock_irqsave [kernel]
175.00 4.5% system_call [kernel]
155.00 4.0% _raw_spin_unlock_irqrestor [kernel]
143.00 3.7% __netif_receive_skb [kernel]
124.00 3.2% ip_route_input [kernel]
122.00 3.1% fget [kernel]
118.00 3.0% ip_rcv [kernel]
115.00 2.9% sys_epoll_ctl [kernel]
107.00 2.7% call_function_single_inter [kernel]
98.00 2.5% vread_tsc [kernel].vsyscall_fn
97.00 2.5% _raw_spin_lock [kernel]
89.00 2.3% copy_user_generic_string [kernel]
--------------------------------------------------------------------------
PerfTop: 372 irqs/sec kernel:87.9% [1000Hz cycles], (all, cpu: 2)
--------------------------------------------------------------------------
samples pcnt function DSO
_______ _____ __________________________ ____________________
212.00 4.6% _raw_spin_lock_irqsave [kernel]
192.00 4.2% system_call [kernel]
187.00 4.1% __netif_receive_skb [kernel]
184.00 4.0% ip_rcv [kernel]
174.00 3.8% ip_route_input [kernel]
165.00 3.6% _raw_spin_unlock_irqrestor [kernel]
143.00 3.1% call_function_single_inter [kernel]
135.00 3.0% fget [kernel]
133.00 2.9% sys_epoll_ctl [kernel]
122.00 2.7% _raw_spin_lock [kernel]
112.00 2.5% __udp4_lib_lookup [kernel]
99.00 2.2% copy_user_generic_string [kernel]
93.00 2.0% vread_tsc [kernel].vsyscall_fn
90.00 2.0% kmem_cache_free [kernel]
89.00 1.9% ep_remove [kernel]
o
--------------------------------------------------------------------------
PerfTop: 269 irqs/sec kernel:85.1% [1000Hz cycles], (all, cpu: 7)
--------------------------------------------------------------------------
samples pcnt function DSO
_______ _____ __________________________ ____________________
23.00 4.6% _raw_spin_lock_irqsave [kernel]
21.00 4.2% system_call [kernel]
19.00 3.8% _raw_spin_unlock_irqrestor [kernel]
17.00 3.4% fget [kernel]
15.00 3.0% __netif_receive_skb [kernel]
14.00 2.8% dst_release [kernel]
13.00 2.6% call_function_single_inter [kernel]
11.00 2.2% kmem_cache_free [kernel]
10.00 2.0% vread_tsc [kernel].vsyscall_fn
10.00 2.0% copy_user_generic_string [kernel]
10.00 2.0% ktime_get [kernel]
10.00 2.0% ip_route_input [kernel]
10.00 2.0% schedule [kernel]
--------------------------------------------------------------------------
PerfTop: 253 irqs/sec kernel:84.6% [1000Hz cycles], (all, cpu: 7)
--------------------------------------------------------------------------
samples pcnt function DSO
_______ _____ __________________________ ____________________
109.00 4.9% system_call [kernel]
104.00 4.6% _raw_spin_lock_irqsave [kernel]
79.00 3.5% ip_rcv [kernel]
74.00 3.3% _raw_spin_unlock_irqrestor [kernel]
71.00 3.2% fget [kernel]
68.00 3.0% sys_epoll_ctl [kernel]
66.00 2.9% ip_route_input [kernel]
58.00 2.6% call_function_single_inter [kernel]
55.00 2.4% _raw_spin_lock [kernel]
54.00 2.4% copy_user_generic_string [kernel]
53.00 2.4% __netif_receive_skb [kernel]
51.00 2.3% schedule [kernel]
51.00 2.3% kmem_cache_free [kernel]
43.00 1.9% vread_tsc [kernel].vsyscall_fn
38.00 1.7% __udp4_lib_lookup [kernel]
--------------------------------------------------------------------------
PerfTop: 236 irqs/sec kernel:84.3% [1000Hz cycles], (all, cpu: 7)
--------------------------------------------------------------------------
samples pcnt function DSO
_______ _____ __________________________ ____________________
131.00 4.9% _raw_spin_lock_irqsave [kernel]
128.00 4.8% system_call [kernel]
101.00 3.8% _raw_spin_unlock_irqrestor [kernel]
89.00 3.3% fget [kernel]
85.00 3.2% sys_epoll_ctl [kernel]
81.00 3.0% ip_rcv [kernel]
76.00 2.8% ip_route_input [kernel]
66.00 2.5% call_function_single_inter [kernel]
65.00 2.4% _raw_spin_lock [kernel]
65.00 2.4% kmem_cache_free [kernel]
64.00 2.4% copy_user_generic_string [kernel]
57.00 2.1% __netif_receive_skb [kernel]
47.00 1.8% schedule [kernel]
45.00 1.7% vread_tsc [kernel].vsyscall_fn
--------------------------------------------------------------------------
PerfTop: 478 irqs/sec kernel:82.2% [1000Hz cycles], (all, cpu: 2)
--------------------------------------------------------------------------
samples pcnt function DSO
_______ _____ __________________________ ____________________
319.00 5.2% _raw_spin_lock_irqsave [kernel]
289.00 4.7% system_call [kernel]
246.00 4.0% _raw_spin_unlock_irqrestor [kernel]
199.00 3.2% ip_route_input [kernel]
198.00 3.2% __netif_receive_skb [kernel]
197.00 3.2% sys_epoll_ctl [kernel]
183.00 3.0% ip_rcv [kernel]
182.00 2.9% fget [kernel]
166.00 2.7% call_function_single_inter [kernel]
157.00 2.5% copy_user_generic_string [kernel]
149.00 2.4% kmem_cache_free [kernel]
146.00 2.4% vread_tsc [kernel].vsyscall_fn
133.00 2.1% _raw_spin_lock [kernel]
118.00 1.9% schedule [kernel]
112.00 1.8% __udp4_lib_lookup [kernel]
--------------------------------------------------------------------------
PerfTop: 535 irqs/sec kernel:83.0% [1000Hz cycles], (all, cpu: 2)
--------------------------------------------------------------------------
samples pcnt function DSO
_______ _____ __________________________ ____________________
345.00 5.2% _raw_spin_lock_irqsave [kernel]
291.00 4.4% system_call [kernel]
255.00 3.9% _raw_spin_unlock_irqrestor [kernel]
218.00 3.3% fget [kernel]
201.00 3.0% ip_route_input [kernel]
193.00 2.9% __netif_receive_skb [kernel]
193.00 2.9% sys_epoll_ctl [kernel]
180.00 2.7% ip_rcv [kernel]
173.00 2.6% call_function_single_inter [kernel]
163.00 2.5% copy_user_generic_string [kernel]
152.00 2.3% kmem_cache_free [kernel]
151.00 2.3% vread_tsc [kernel].vsyscall_fn
142.00 2.1% _raw_spin_lock [kernel]
131.00 2.0% schedule [kernel]
next prev parent reply other threads:[~2010-04-24 14:10 UTC|newest]
Thread overview: 108+ messages / expand[flat|nested] mbox.gz Atom feed top
2010-04-23 8:12 [PATCH v6] net: batch skb dequeueing from softnet input_pkt_queue Changli Gao
2010-04-23 9:27 ` Eric Dumazet
2010-04-23 22:02 ` jamal
2010-04-24 14:10 ` jamal [this message]
2010-04-26 14:03 ` Eric Dumazet
2010-04-26 14:55 ` Eric Dumazet
2010-04-26 21:06 ` jamal
[not found] ` <20100429174056.GA8044@gargoyle.fritz.box>
2010-04-29 17:56 ` Eric Dumazet
2010-04-29 18:10 ` OFT - reserving CPU's for networking Stephen Hemminger
2010-04-29 19:19 ` Thomas Gleixner
2010-04-29 20:02 ` Eric Dumazet
2010-04-30 18:15 ` Brian Bloniarz
2010-04-30 18:57 ` David Miller
2010-04-30 19:58 ` Thomas Gleixner
2010-04-30 21:01 ` Andi Kleen
2010-04-30 22:30 ` David Miller
2010-05-01 10:53 ` Andi Kleen
2010-05-01 22:03 ` David Miller
2010-05-01 22:58 ` Andi Kleen
2010-05-01 23:29 ` David Miller
2010-05-01 23:44 ` Ben Hutchings
2010-05-01 20:31 ` Martin Josefsson
2010-05-01 22:13 ` David Miller
[not found] ` <20100429182347.GA8512@gargoyle.fritz.box>
2010-04-29 19:12 ` [PATCH v6] net: batch skb dequeueing from softnet input_pkt_queue Eric Dumazet
[not found] ` <20100429214144.GA10663@gargoyle.fritz.box>
2010-04-30 5:25 ` Eric Dumazet
2010-04-30 23:38 ` David Miller
2010-05-01 11:00 ` Andi Kleen
2010-05-02 6:56 ` Eric Dumazet
2010-05-02 9:20 ` Andi Kleen
2010-05-02 10:54 ` Eric Dumazet
2010-05-02 14:13 ` Arjan van de Ven
2010-05-02 14:27 ` Eric Dumazet
2010-05-02 15:32 ` Eric Dumazet
2010-05-02 17:54 ` Arjan van de Ven
2010-05-02 19:22 ` Eric Dumazet
2010-05-02 22:06 ` Andi Kleen
2010-05-03 3:50 ` Arjan van de Ven
2010-05-03 5:17 ` Eric Dumazet
2010-05-03 10:22 ` Arjan van de Ven
2010-05-03 10:34 ` Andi Kleen
2010-05-03 14:09 ` Arjan van de Ven
2010-05-03 14:45 ` Brian Bloniarz
2010-05-04 1:10 ` Arjan van de Ven
2010-05-03 15:52 ` Andi Kleen
2010-05-04 1:11 ` Arjan van de Ven
2010-05-02 21:30 ` Andi Kleen
2010-05-02 15:46 ` Andi Kleen
2010-05-02 16:35 ` Eric Dumazet
2010-05-02 17:43 ` Arjan van de Ven
2010-05-02 17:47 ` Eric Dumazet
2010-05-02 21:25 ` Andi Kleen
2010-05-02 21:45 ` Eric Dumazet
2010-05-02 21:54 ` Andi Kleen
2010-05-02 22:08 ` Eric Dumazet
2010-05-03 20:15 ` jamal
2010-04-26 21:03 ` jamal
2010-04-23 10:26 ` Eric Dumazet
2010-04-27 22:08 ` David Miller
2010-04-27 22:18 ` [PATCH net-next-2.6] bnx2x: Remove two prefetch() Eric Dumazet
2010-04-27 22:19 ` David Miller
2010-04-28 13:14 ` Eilon Greenstein
2010-04-28 15:44 ` Eliezer Tamir
2010-04-28 16:53 ` David Miller
[not found] ` <w2ue8f3c3211004280842r9f2589e8qb8fd4b7933cd9756@mail.gmail.com>
2010-04-28 16:55 ` David Miller
2010-04-28 11:33 ` jamal
2010-04-28 12:33 ` Eric Dumazet
2010-04-28 12:36 ` jamal
2010-04-28 14:06 ` [PATCH net-next-2.6] net: speedup udp receive path Eric Dumazet
2010-04-28 14:19 ` Eric Dumazet
2010-04-28 14:34 ` Eric Dumazet
2010-04-28 21:36 ` David Miller
2010-04-28 22:22 ` [PATCH net-next-2.6] net: ip_queue_rcv_skb() helper Eric Dumazet
2010-04-28 22:39 ` David Miller
2010-04-28 23:44 ` [PATCH net-next-2.6] net: speedup udp receive path jamal
2010-04-29 0:00 ` jamal
2010-04-29 4:09 ` Eric Dumazet
2010-04-29 11:35 ` jamal
2010-04-29 12:12 ` Changli Gao
2010-04-29 12:45 ` Eric Dumazet
2010-04-29 13:17 ` jamal
2010-04-29 13:21 ` Eric Dumazet
2010-04-29 13:37 ` jamal
2010-04-29 13:49 ` Eric Dumazet
2010-04-29 13:56 ` jamal
2010-04-29 20:36 ` jamal
2010-04-29 21:01 ` [PATCH net-next-2.6] net: sock_def_readable() and friends RCU conversion Eric Dumazet
2010-04-30 13:55 ` Brian Bloniarz
2010-04-30 17:26 ` Eric Dumazet
2010-04-30 23:35 ` David Miller
2010-05-01 4:56 ` Eric Dumazet
2010-05-01 7:02 ` Eric Dumazet
2010-05-01 8:03 ` Eric Dumazet
2010-05-01 22:00 ` David Miller
2010-04-30 19:30 ` [PATCH net-next-2.6] net: speedup udp receive path jamal
2010-04-30 20:40 ` Eric Dumazet
2010-05-01 0:06 ` jamal
2010-05-01 5:57 ` Eric Dumazet
2010-05-01 6:14 ` Eric Dumazet
2010-05-01 10:24 ` Changli Gao
2010-05-01 10:47 ` Eric Dumazet
2010-05-01 11:29 ` jamal
2010-05-01 11:23 ` jamal
2010-05-01 11:42 ` Eric Dumazet
2010-05-01 11:56 ` jamal
2010-05-01 13:22 ` Eric Dumazet
2010-05-01 13:49 ` jamal
2010-05-03 20:10 ` jamal
2010-04-29 23:07 ` Changli Gao
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=1272118252.8918.13.camel@bigi \
--to=hadi@cyberus.ca \
--cc=davem@davemloft.net \
--cc=eric.dumazet@gmail.com \
--cc=netdev@vger.kernel.org \
--cc=shemminger@vyatta.com \
--cc=therbert@google.com \
--cc=xiaosuo@gmail.com \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).