* Slow ceph fs performance
From: Bryan K. Wright @ 2012-09-26 14:50 UTC (permalink / raw)
  To: ceph-devel

Hi folks,

	I'm seeing reasonable performance when I run rados
benchmarks, but really slow I/O when reading from or writing
to a mounted ceph filesystem.  The rados benchmarks
show about 150 MB/s for both read and write, but when I
go to a client machine with a mounted ceph filesystem
and try to rsync a large (60 GB) directory tree onto
the ceph fs, I'm getting rates of only 2-5 MB/s.
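
	To put those rates in perspective, here is a rough sketch of
the arithmetic.  It assumes decimal units (1 GB = 1000 MB) and a
constant rate for the whole transfer; the function name is only
illustrative.

```python
def transfer_hours(size_gb, rate_mb_s, mb_per_gb=1000):
    """Hours needed to move size_gb at a steady rate_mb_s.

    Assumption: decimal units (1 GB = 1000 MB) and a constant rate.
    """
    return size_gb * mb_per_gb / rate_mb_s / 3600.0


if __name__ == "__main__":
    # 60 GB tree at the observed rsync rates and at the rados bench rate.
    for rate in (2, 5, 150):
        print(f"{rate:>4} MB/s: {transfer_hours(60, rate):.2f} hours")
```

At 2-5 MB/s the 60 GB tree takes roughly 3.3 to 8.3 hours, versus
under 7 minutes at the 150 MB/s the rados benchmarks report.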

	The OSDs and MDSs are all running 64-bit CentOS 6.3
with the stock CentOS 2.6.32 kernel.  The client is also
64-bit CentOS 6.3, but it's running the "elrepo" 3.5.4 kernel.
There are four OSDs, each with a hardware RAID 5 array
and an SSD for the OSD journal.  The primary network
is a gigabit network, and the OSD, MDS and MON 
machines have a dedicated backend gigabit network on a 
second network interface.

	Locally on the OSD, "hdparm -t -T" reports read rates 
of ~350 MB/s, and bonnie++ shows:

Version  1.96       ------Sequential Output------ --Sequential Input- --Random-
Concurrency   1     -Per Chr- --Block-- -Rewrite- -Per Chr- --Block-- --Seeks--
Machine        Size K/sec %CP K/sec %CP K/sec %CP K/sec %CP K/sec %CP  /sec %CP
osd-local    23800M  1037  99 316048  92 131023  19  2272  98 312781  21 521.0  24
Latency             13103us     183ms     123ms   15316us     100ms   75899us
Version  1.96       ------Sequential Create------ --------Random Create--------
osd-local           -Create-- --Read--- -Delete-- -Create-- --Read--- -Delete--
              files  /sec %CP  /sec %CP  /sec %CP  /sec %CP  /sec %CP  /sec %CP
                 16 16817  55 +++++ +++ 28786  77 23890  78 +++++ +++ 27128  75
Latency             21549us     105us     134us     902us      12us     104us


	While rsyncing the files, the ceph logs show lots
of warnings of the form:

[WRN] : slow request 91.848407 seconds old, received at 2012-09-26 09:30:52.252449: osd_op(client.5310.1:56400 1000026eda0.00001ec8 [write 2093056~4096] 0.aa047db8 snapc 1=[]) currently waiting for sub ops
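
	To see which stage the slow requests are stuck in, the warnings
can be tallied with a short script.  This is a minimal sketch that
assumes every warning follows the exact layout of the line quoted
above; the regular expression and function names are my own, not
part of any ceph tool.

```python
import re

# Assumption: warnings look like the "[WRN] : slow request ..." line above.
SLOW_RE = re.compile(
    r"slow request (?P<age>\d+(?:\.\d+)?) seconds old, "
    r"received at (?P<received>\S+ \S+): "
    r"(?P<op>osd_op\(.*\)) currently (?P<state>.+)$"
)


def parse_slow_request(line):
    """Return a dict describing one slow-request warning, or None."""
    m = SLOW_RE.search(line.strip())
    if m is None:
        return None
    return {
        "age": float(m.group("age")),      # seconds the request has waited
        "received": m.group("received"),   # timestamp as logged
        "op": m.group("op"),               # the osd_op(...) description
        "state": m.group("state"),         # e.g. "waiting for sub ops"
    }


def summarize(lines):
    """Count warnings per state and track the oldest request seen."""
    counts = {}
    max_age = 0.0
    for line in lines:
        rec = parse_slow_request(line)
        if rec is None:
            continue  # not a slow-request warning
        counts[rec["state"]] = counts.get(rec["state"], 0) + 1
        max_age = max(max_age, rec["age"])
    return counts, max_age
```

Feeding the log lines to summarize() gives a count of warnings per
state (such as "waiting for sub ops") and the age of the oldest
request.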

	Snooping on traffic with wireshark shows bursts of 
activity separated by long periods (30-60 sec) of idle time.

	My first thought was that I was seeing a kind of 
"bufferbloat". The SSDs are 120 GB, so they could easily contain 
enough data to take a long time to dump.  I switched to a
journal file limited to 1 GB, but I still see the same slow
behavior.

	Any advice about how to go about debugging this would
be appreciated.

					Thanks,
					Bryan

-- 
========================================================================
Bryan Wright              |"If you take cranberries and stew them like 
Physics Department        | applesauce, they taste much more like prunes
University of Virginia    | than rhubarb does."  --  Groucho 
Charlottesville, VA  22901|			
(434) 924-7218            |         bryan@virginia.edu
========================================================================



