From: "J. Bruce Fields" <bfields-uC3wQj2KruNg9hUCZPvPmw@public.gmane.org>
To: Yan Burman <yanb-VPRAkNaXOzVWk0Htik3J/w@public.gmane.org>
Cc: Wendy Cheng
<s.wendy.cheng-Re5JQEeQqe8AvxtiuMwx3w@public.gmane.org>,
"Atchley, Scott" <atchleyes-1Heg1YXhbW8@public.gmane.org>,
Tom Tucker
<tom-7bPotxP6k4+P2YhJcF5u+vpXobYPEAuW@public.gmane.org>,
"linux-rdma-u79uwXL29TY76Z2rM5mHXA@public.gmane.org"
<linux-rdma-u79uwXL29TY76Z2rM5mHXA@public.gmane.org>,
"linux-nfs-u79uwXL29TY76Z2rM5mHXA@public.gmane.org"
<linux-nfs-u79uwXL29TY76Z2rM5mHXA@public.gmane.org>,
Or Gerlitz <ogerlitz-VPRAkNaXOzVWk0Htik3J/w@public.gmane.org>
Subject: Re: NFS over RDMA benchmark
Date: Wed, 24 Apr 2013 11:05:40 -0400 [thread overview]
Message-ID: <20130424150540.GB20275@fieldses.org> (raw)
In-Reply-To: <0EE9A1CDC8D6434DB00095CD7DB873462CF988C9-fViJhHBwANKuSA5JZHE7gA@public.gmane.org>
On Wed, Apr 24, 2013 at 12:35:03PM +0000, Yan Burman wrote:
>
>
> > -----Original Message-----
> > From: J. Bruce Fields [mailto:bfields-uC3wQj2KruNg9hUCZPvPmw@public.gmane.org]
> > Sent: Wednesday, April 24, 2013 00:06
> > To: Yan Burman
> > Cc: Wendy Cheng; Atchley, Scott; Tom Tucker; linux-rdma-u79uwXL29TY76Z2rM5mHXA@public.gmane.org;
> > linux-nfs-u79uwXL29TY76Z2rM5mHXA@public.gmane.org; Or Gerlitz
> > Subject: Re: NFS over RDMA benchmark
> >
> > On Thu, Apr 18, 2013 at 12:47:09PM +0000, Yan Burman wrote:
> > >
> > >
> > > > -----Original Message-----
> > > > From: Wendy Cheng [mailto:s.wendy.cheng-Re5JQEeQqe8AvxtiuMwx3w@public.gmane.org]
> > > > Sent: Wednesday, April 17, 2013 21:06
> > > > To: Atchley, Scott
> > > > Cc: Yan Burman; J. Bruce Fields; Tom Tucker;
> > > > linux-rdma-u79uwXL29TY76Z2rM5mHXA@public.gmane.org; linux-nfs-u79uwXL29TY76Z2rM5mHXA@public.gmane.org
> > > > Subject: Re: NFS over RDMA benchmark
> > > >
> > > > On Wed, Apr 17, 2013 at 10:32 AM, Atchley, Scott
> > > > <atchleyes-1Heg1YXhbW8@public.gmane.org>
> > > > wrote:
> > > > > On Apr 17, 2013, at 1:15 PM, Wendy Cheng <s.wendy.cheng-Re5JQEeQqe8AvxtiuMwx3w@public.gmane.org>
> > > > wrote:
> > > > >
> > > > >> On Wed, Apr 17, 2013 at 7:36 AM, Yan Burman <yanb-VPRAkNaXOzVWk0Htik3J/w@public.gmane.org>
> > > > wrote:
> > > > >>> Hi.
> > > > >>>
> > > > >>> I've been trying to do some benchmarks for NFS over RDMA and I
> > > > >>> seem to
> > > > only get about half of the bandwidth that the HW can give me.
> > > > >>> My setup consists of 2 servers each with 16 cores, 32Gb of
> > > > >>> memory, and
> > > > Mellanox ConnectX3 QDR card over PCI-e gen3.
> > > > >>> These servers are connected to a QDR IB switch. The backing
> > > > >>> storage on
> > > > the server is tmpfs mounted with noatime.
> > > > >>> I am running kernel 3.5.7.
> > > > >>>
> > > > >>> When running ib_send_bw, I get 4.3-4.5 GB/sec for block sizes 4-512K.
> > > > >>> When I run fio over rdma mounted nfs, I get 260-2200MB/sec for
> > > > >>> the
> > > > same block sizes (4-512K). running over IPoIB-CM, I get 200-980MB/sec.
> > > > >
> > > > > Yan,
> > > > >
> > > > > Are you trying to optimize single client performance or server
> > > > > performance
> > > > with multiple clients?
> > > > >
> > >
> > > I am trying to get maximum performance from a single server - I used 2
> > processes in fio test - more than 2 did not show any performance boost.
> > > I tried running fio from 2 different PCs on 2 different files, but the sum of
> > the two is more or less the same as running from single client PC.
> > >
> > > What I did see is that server is sweating a lot more than the clients and
> > more than that, it has 1 core (CPU5) in 100% softirq tasklet:
> > > cat /proc/softirqs
> >
> > Would any profiling help figure out which code it's spending time in?
> > (E.g. something simple as "perf top" might have useful output.)
> >
>
>
> Perf top for the CPU with high tasklet count gives:
>
> samples pcnt RIP function DSO
> _______ _____ ________________ ___________________________ ___________________________________________________________________
>
> 2787.00 24.1% ffffffff81062a00 mutex_spin_on_owner /root/vmlinux
I guess that means lots of contention on some mutex? If only we knew
which one.... perf should also be able to collect stack statistics, I
forget how.
--b.
> 978.00 8.4% ffffffff810297f0 clflush_cache_range /root/vmlinux
> 445.00 3.8% ffffffff812ea440 __domain_mapping /root/vmlinux
> 441.00 3.8% 0000000000018c30 svc_recv /lib/modules/3.5.7-dbg/kernel/net/sunrpc/sunrpc.ko
> 344.00 3.0% ffffffff813a1bc0 _raw_spin_lock_bh /root/vmlinux
> 333.00 2.9% ffffffff813a19e0 _raw_spin_lock_irqsave /root/vmlinux
> 288.00 2.5% ffffffff813a07d0 __schedule /root/vmlinux
> 249.00 2.1% ffffffff811a87e0 rb_prev /root/vmlinux
> 242.00 2.1% ffffffff813a19b0 _raw_spin_lock /root/vmlinux
> 184.00 1.6% 0000000000002e90 svc_rdma_sendto /lib/modules/3.5.7-dbg/kernel/net/sunrpc/xprtrdma/svcrdma.ko
> 177.00 1.5% ffffffff810ac820 get_page_from_freelist /root/vmlinux
> 174.00 1.5% ffffffff812e6da0 alloc_iova /root/vmlinux
> 165.00 1.4% ffffffff810b1390 put_page /root/vmlinux
> 148.00 1.3% 0000000000014760 sunrpc_cache_lookup /lib/modules/3.5.7-dbg/kernel/net/sunrpc/sunrpc.ko
> 128.00 1.1% 0000000000017f20 svc_xprt_enqueue /lib/modules/3.5.7-dbg/kernel/net/sunrpc/sunrpc.ko
> 126.00 1.1% ffffffff8139f820 __mutex_lock_slowpath /root/vmlinux
> 108.00 0.9% ffffffff811a81d0 rb_insert_color /root/vmlinux
> 107.00 0.9% 0000000000004690 svc_rdma_recvfrom /lib/modules/3.5.7-dbg/kernel/net/sunrpc/xprtrdma/svcrdma.ko
> 102.00 0.9% 0000000000002640 send_reply /lib/modules/3.5.7-dbg/kernel/net/sunrpc/xprtrdma/svcrdma.ko
> 99.00 0.9% ffffffff810e6490 kmem_cache_alloc /root/vmlinux
> 96.00 0.8% ffffffff810e5840 __slab_alloc /root/vmlinux
> 91.00 0.8% 0000000000006d30 mlx4_ib_post_send /lib/modules/3.5.7-dbg/kernel/drivers/infiniband/hw/mlx4/mlx4_ib.ko
> 88.00 0.8% 0000000000000dd0 svc_rdma_get_context /lib/modules/3.5.7-dbg/kernel/net/sunrpc/xprtrdma/svcrdma.ko
> 86.00 0.7% ffffffff813a1a10 _raw_spin_lock_irq /root/vmlinux
> 86.00 0.7% 0000000000001530 svc_rdma_send /lib/modules/3.5.7-dbg/kernel/net/sunrpc/xprtrdma/svcrdma.ko
> 85.00 0.7% ffffffff81060a80 prepare_creds /root/vmlinux
> 83.00 0.7% ffffffff810a5790 find_get_pages_contig /root/vmlinux
> 79.00 0.7% ffffffff810e4620 __slab_free /root/vmlinux
> 79.00 0.7% ffffffff813a1a40 _raw_spin_unlock_irqrestore /root/vmlinux
> 77.00 0.7% ffffffff81065610 finish_task_switch /root/vmlinux
> 76.00 0.7% ffffffff812e9270 pfn_to_dma_pte /root/vmlinux
> 75.00 0.6% ffffffff810976d0 __call_rcu /root/vmlinux
> 73.00 0.6% ffffffff811a2fa0 _atomic_dec_and_lock /root/vmlinux
> 73.00 0.6% 00000000000002e0 svc_rdma_has_wspace /lib/modules/3.5.7-dbg/kernel/net/sunrpc/xprtrdma/svcrdma.ko
> 67.00 0.6% ffffffff813a1a70 _raw_read_lock /root/vmlinux
> 65.00 0.6% 000000000000f590 svcauth_unix_set_client /lib/modules/3.5.7-dbg/kernel/net/sunrpc/sunrpc.ko
> 63.00 0.5% 00000000000180e0 svc_reserve /lib/modules/3.5.7-dbg/kernel/net/sunrpc/sunrpc.ko
> 60.00 0.5% 00000000000064d0 stamp_send_wqe /lib/modules/3.5.7-dbg/kernel/drivers/infiniband/hw/mlx4/mlx4_ib.ko
> 57.00 0.5% ffffffff810ac110 free_hot_cold_page /root/vmlinux
> 57.00 0.5% ffffffff811ae540 memcpy /root/vmlinux
> 56.00 0.5% ffffffff810ad1a0 __alloc_pages_nodemask /root/vmlinux
> 55.00 0.5% ffffffff81118200 splice_to_pipe /root/vmlinux
> 53.00 0.5% ffffffff810e3bc0 get_partial_node /root/vmlinux
> 49.00 0.4% ffffffff812eb840 __intel_map_single /root/vmlinux
>
>
> > --b.
> >
> > > CPU0 CPU1 CPU2 CPU3 CPU4 CPU5 CPU6
> > CPU7 CPU8 CPU9 CPU10 CPU11 CPU12 CPU13 CPU14
> > CPU15
> > > HI: 0 0 0 0 0 0 0 0 0
> > 0 0 0 0 0 0 0
> > > TIMER: 418767 46596 43515 44547 50099 34815
> > 40634 40337 39551 93442 73733 42631 42509 41592
> > 40351 61793
> > > NET_TX: 28719 309 1421 1294 1730 1243 832
> > 937 11 44 41 20 26 19 15 29
> > > NET_RX: 612070 19 22 21 6 235 3 2
> > 9 6 17 16 20 13 16 10
> > > BLOCK: 5941 0 0 0 0 0 0 0
> > 519 259 1238 272 253 174 215 2618
> > > BLOCK_IOPOLL: 0 0 0 0 0 0 0 0
> > 0 0 0 0 0 0 0 0
> > > TASKLET: 28 1 1 1 1 1540653 1 1
> > 29 1 1 1 1 1 1 2
> > > SCHED: 364965 26547 16807 18403 22919 8678
> > 14358 14091 16981 64903 47141 18517 19179 18036
> > 17037 38261
> > > HRTIMER: 13 0 1 1 0 0 0 0
> > 0 0 0 0 1 1 0 1
> > > RCU: 945823 841546 715281 892762 823564 42663
> > 863063 841622 333577 389013 393501 239103 221524 258159
> > 313426 234030
--
To unsubscribe from this list: send the line "unsubscribe linux-rdma" in
the body of a message to majordomo-u79uwXL29TY76Z2rM5mHXA@public.gmane.org
More majordomo info at http://vger.kernel.org/majordomo-info.html
next prev parent reply other threads:[~2013-04-24 15:05 UTC|newest]
Thread overview: 41+ messages / expand[flat|nested] mbox.gz Atom feed top
2013-04-17 14:36 NFS over RDMA benchmark Yan Burman
[not found] ` <0EE9A1CDC8D6434DB00095CD7DB873462CF96C65-fViJhHBwANKuSA5JZHE7gA@public.gmane.org>
2013-04-17 17:15 ` Wendy Cheng
[not found] ` <CABgxfbF7c9ktSoMSPV21JU76V5J4iwbJQ257S91Y3z36WJbJVA-JsoAwUIsXosN+BqQ9rBEUg@public.gmane.org>
2013-04-17 17:32 ` Atchley, Scott
[not found] ` <62745258-4F3B-4C05-BFFD-03EA604576E4-1Heg1YXhbW8@public.gmane.org>
2013-04-17 18:06 ` Wendy Cheng
[not found] ` <CABgxfbGxhnKj2n0Z-w87rZ6fwCssO31G009gwej957gv1p8PQQ-JsoAwUIsXosN+BqQ9rBEUg@public.gmane.org>
2013-04-18 12:47 ` Yan Burman
[not found] ` <0EE9A1CDC8D6434DB00095CD7DB873462CF9715B-fViJhHBwANKuSA5JZHE7gA@public.gmane.org>
2013-04-18 16:16 ` Wendy Cheng
2013-04-23 21:06 ` J. Bruce Fields
[not found] ` <20130423210607.GJ3676-uC3wQj2KruNg9hUCZPvPmw@public.gmane.org>
2013-04-24 12:35 ` Yan Burman
[not found] ` <0EE9A1CDC8D6434DB00095CD7DB873462CF988C9-fViJhHBwANKuSA5JZHE7gA@public.gmane.org>
2013-04-24 15:05 ` J. Bruce Fields [this message]
[not found] ` <20130424150540.GB20275-uC3wQj2KruNg9hUCZPvPmw@public.gmane.org>
2013-04-24 15:26 ` J. Bruce Fields
[not found] ` <20130424152631.GC20275-uC3wQj2KruNg9hUCZPvPmw@public.gmane.org>
2013-04-24 16:27 ` Wendy Cheng
[not found] ` <CABgxfbHShU7aEttJ35vdAjXduPFFj8+E4=5LZqOgh4e=5bax5Q-JsoAwUIsXosN+BqQ9rBEUg@public.gmane.org>
2013-04-24 18:04 ` Wendy Cheng
[not found] ` <CABgxfbHpNgQyEjd2OVNMgJoLpt_VyLiOL5hMCLwotMd5kincwg-JsoAwUIsXosN+BqQ9rBEUg@public.gmane.org>
2013-04-24 18:26 ` Tom Talpey
[not found] ` <517823E0.4000402-CLs1Zie5N5HQT0dZR+AlfA@public.gmane.org>
2013-04-25 17:18 ` Wendy Cheng
[not found] ` <CABgxfbHePAyq6AH9TFKZKUmwEHOupuYUnfc1W99HAuDkYddUqQ-JsoAwUIsXosN+BqQ9rBEUg@public.gmane.org>
2013-04-25 19:01 ` Phil Pishioneri
[not found] ` <51797D8D.1080302-8DAjSxpRXgY@public.gmane.org>
2013-04-25 20:14 ` Tom Talpey
2013-04-25 20:04 ` Tom Talpey
[not found] ` <51798C51.50209-CLs1Zie5N5HQT0dZR+AlfA@public.gmane.org>
2013-04-25 21:17 ` Tom Tucker
[not found] ` <51799D52.1040903-7bPotxP6k4+P2YhJcF5u+vpXobYPEAuW@public.gmane.org>
2013-04-25 21:58 ` Wendy Cheng
[not found] ` <CABgxfbHnqQyucEpbGhsQ8-pA69peHUza7L7WHKD1K1n9Zv0WXQ-JsoAwUIsXosN+BqQ9rBEUg@public.gmane.org>
2013-04-25 22:26 ` Wendy Cheng
2013-04-28 6:28 ` Yan Burman
[not found] ` <0EE9A1CDC8D6434DB00095CD7DB873462CF9A820-fViJhHBwANKuSA5JZHE7gA@public.gmane.org>
2013-04-28 14:42 ` J. Bruce Fields
[not found] ` <20130428144248.GA2037-uC3wQj2KruNg9hUCZPvPmw@public.gmane.org>
2013-04-29 5:34 ` Wendy Cheng
[not found] ` <CABgxfbF9MepShtOP8EoTjfMXzU4LLWC7brTmMfa3rtoWBiOweg-JsoAwUIsXosN+BqQ9rBEUg@public.gmane.org>
2013-04-29 12:16 ` Yan Burman
[not found] ` <0EE9A1CDC8D6434DB00095CD7DB873462CF9B3E7-fViJhHBwANKuSA5JZHE7gA@public.gmane.org>
2013-04-29 13:05 ` Tom Tucker
[not found] ` <517E701F.1010807-7bPotxP6k4+P2YhJcF5u+vpXobYPEAuW@public.gmane.org>
2013-04-29 13:07 ` Tom Tucker
2013-04-30 5:09 ` Yan Burman
[not found] ` <0EE9A1CDC8D6434DB00095CD7DB873462CF9C90C-fViJhHBwANKuSA5JZHE7gA@public.gmane.org>
2013-04-30 13:05 ` Tom Talpey
[not found] ` <517FC182.3030703-CLs1Zie5N5HQT0dZR+AlfA@public.gmane.org>
2013-04-30 14:23 ` Yan Burman
[not found] ` <0EE9A1CDC8D6434DB00095CD7DB873462CF9CBA7-fViJhHBwANKuSA5JZHE7gA@public.gmane.org>
2013-04-30 14:44 ` Tom Talpey
2013-04-30 14:20 ` Tom Talpey
[not found] ` <517FD327.3060901-CLs1Zie5N5HQT0dZR+AlfA@public.gmane.org>
2013-04-30 14:38 ` Yan Burman
[not found] ` <0EE9A1CDC8D6434DB00095CD7DB873462CF9CBD0-fViJhHBwANKuSA5JZHE7gA@public.gmane.org>
2013-04-30 18:58 ` Tom Tucker
[not found] ` <CALsNU1MsjH5=p4Wtj2a J5+odC7y7-5oTGhrzOL-=15pXaYYUZw@mail.gmail.com>
[not found] ` <CABgxfbFhZTBO81WC5BcRRfQB_YBjE4N=sfS+G9eAzaFHYC_dWw@mail.gmail.com>
[not found] ` <CABgxfbFhZTBO81WC5BcRRfQB_YBjE4N=sfS+G9eAzaFHYC_dWw-JsoAwUIsXosN+BqQ9rBEUg@public.gmane.org>
2013-06-20 14:56 ` Or Gerlitz
2013-04-30 16:24 ` Wendy Cheng
2013-04-30 13:38 ` J. Bruce Fields
2013-04-19 2:27 ` Peng Tao
[not found] ` <CA+a=Yy7zruyGbjLyYXtPsYs12xs1uCwXo9BJtU1Fg6OMoC2z6g-JsoAwUIsXosN+BqQ9rBEUg@public.gmane.org>
2013-04-22 11:07 ` Yan Burman
[not found] <51703280.03e9440a.06a6.3f9f@mx.google.com>
[not found] ` <51703280.03e9440a.06a6.3f9f-ATjtLOhZ0NVl57MIdRCFDg@public.gmane.org>
2013-04-18 19:15 ` Wendy Cheng
[not found] ` <CABgxfbF2HSYPF=rAjrKjoMAKqMOzUOUxJbNFspKaZ4ykMyaCbw-JsoAwUIsXosN+BqQ9rBEUg@public.gmane.org>
2013-04-19 1:03 ` Atchley, Scott
[not found] ` <47A54DE8-EBCF-4CE2-80AC-58415B985FD7-1Heg1YXhbW8@public.gmane.org>
2013-04-19 3:35 ` Spencer
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=20130424150540.GB20275@fieldses.org \
--to=bfields-uc3wqj2krung9huczpvpmw@public.gmane.org \
--cc=atchleyes-1Heg1YXhbW8@public.gmane.org \
--cc=linux-nfs-u79uwXL29TY76Z2rM5mHXA@public.gmane.org \
--cc=linux-rdma-u79uwXL29TY76Z2rM5mHXA@public.gmane.org \
--cc=ogerlitz-VPRAkNaXOzVWk0Htik3J/w@public.gmane.org \
--cc=s.wendy.cheng-Re5JQEeQqe8AvxtiuMwx3w@public.gmane.org \
--cc=tom-7bPotxP6k4+P2YhJcF5u+vpXobYPEAuW@public.gmane.org \
--cc=yanb-VPRAkNaXOzVWk0Htik3J/w@public.gmane.org \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox