* OSD's slow down to a crawl
@ 2012-12-20 15:16 Matthew Anderson
2012-12-20 15:28 ` Mark Nelson
0 siblings, 1 reply; 13+ messages in thread
From: Matthew Anderson @ 2012-12-20 15:16 UTC (permalink / raw)
To: ceph-devel@vger.kernel.org
Hi All,
I've run into an issue where OSDs slow down to the point that they no longer appear to process write IO, and everything grinds to a halt. Once they've stopped performing IO, restarting them brings them back to life; they carry on for a while, then slow down again until everything stops once more. I'm at a total loss to explain the issue, because everything appears to be working correctly yet something is clearly very wrong. Running 'ceph -w' reports everything as active+clean and no slow requests are ever logged.
The problem appears to be more pronounced on the newer 3.7 kernel, but it happened on previous versions in the same fashion. Odd behaviour, such as OSDs never completing recovery until they were restarted, also occurred occasionally. The cluster would be stuck at a certain percentage degraded, and restarting the OSD 3-4 times would eventually see it heal itself, with the degraded percentage dropping a little each time. I'm testing with both the kernel CephFS client and QEMU 1.3.0 accessing an RBD; both slow down to a stop in the same manner.
Kernel 3.7.0 mainline (also occurs on 3.7.1 and 3.6.8)
Ceph version 0.55.1-300-g08c6424 (08c64249eb8cd7922de5c398a9426538918db77c) (also occurs on 0.55 and 0.55.1)
There are 3 hosts in total. Two of them have 4 OSDs, each on an SSD. The third has 24x 2TB SATA drives. Each host is running a monitor. I have set up two pools in the CRUSH map, one for SATA and one for SSD. Both pools have the same problem.
OSD config:
[osd.24]
host = KVM04
osd data = /srv/ceph/osd/$name
btrfs devs = /dev/sda2
btrfs options = rw,noatime,ssd
osd journal = /dev/sda1
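A minimal sketch of how the section above resolves, assuming Ceph's documented behaviour that `$name` expands to the daemon's type and id (here `osd.24`). The helper names `expand` and `base_device` are hypothetical and exist only for this illustration; the partition-stripping rule is a naive assumption that fits `/dev/sdaN` style names only.

```python
import configparser

# The [osd.24] section exactly as posted above.
CONF = """
[osd.24]
host = KVM04
osd data = /srv/ceph/osd/$name
btrfs devs = /dev/sda2
btrfs options = rw,noatime,ssd
osd journal = /dev/sda1
"""

def expand(section, value):
    """Expand the $name metavariable to the section name (daemon type.id)."""
    return value.replace("$name", section)

def base_device(path):
    """Naively strip a trailing partition number: /dev/sda2 -> /dev/sda."""
    return path.rstrip("0123456789")

# interpolation=None keeps configparser from treating any character specially.
parser = configparser.ConfigParser(interpolation=None)
parser.read_string(CONF)

section = "osd.24"
data_dir = expand(section, parser[section]["osd data"])
same_disk = (
    base_device(parser[section]["btrfs devs"])
    == base_device(parser[section]["osd journal"])
)

print(data_dir)   # resolved data directory for this OSD
print(same_disk)  # whether journal and data partitions share one device
```

The resolved path matches the `filestore(/srv/ceph/osd/osd.24)` prefix seen in the log, and the check shows that the journal and the btrfs data partition sit on the same device.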
The log below was captured after the OSD had stopped processing IO and runs up to the point where it was shut down cleanly. The QEMU guest was partially responsive, but the benchmark had frozen waiting for IO to complete.
If I can do any more debugging to help diagnose the problem, please let me know.
Thanks
-Matt
-------------------------------------
2012-12-20 22:18:06.809863 7fb9e4ff9700 20 journal write_thread_entry aio throttle: aio num 0 bytes 0 ... exp 0 min_new 1 ... pending 5748
2012-12-20 22:18:06.809871 7fb9e4ff9700 10 journal room 960479231 max_size 960495616 pos 543215616 header.start 543203328 top 4096
2012-12-20 22:18:06.809877 7fb9e4ff9700 10 journal check_for_full at 543215616 : 8192 < 960479231
2012-12-20 22:18:06.809880 7fb9e4ff9700 15 journal prepare_single_write 1 will write 543215616 : seq 2087065 len 5748 -> 8192 (head 40 pre_pad 0 ebl 5748 post_pad 2364 tail 40) (ebl alignment -1)
2012-12-20 22:18:06.809891 7fb9d57fa700 10 journal op_apply_start 2087065 open_ops 0 -> 1
2012-12-20 22:18:06.809897 7fb9d57fa700 5 filestore(/srv/ceph/osd/osd.24) _do_op 0x7fb9b4376570 seq 2087065 osr(6.1fe3 0x21a4f50)/0x21a4f50 start
2012-12-20 22:18:06.809901 7fb9d57fa700 10 filestore(/srv/ceph/osd/osd.24) _do_transaction on 0x7fb9b4f5db40
2012-12-20 22:18:06.809915 7fb9d57fa700 15 filestore(/srv/ceph/osd/osd.24) write meta/a4c1c2a3/pglog_6.1fe3/0//-1 147735~147
2012-12-20 22:18:06.809947 7fb9e4ff9700 20 journal prepare_multi_write queue_pos now 543223808
2012-12-20 22:18:06.809953 7fb9e4ff9700 15 journal do_aio_write writing 543215616~8192
2012-12-20 22:18:06.809975 7fb9e4ff9700 20 journal write_aio_bl 543215616~8192 seq 2087065
2012-12-20 22:18:06.809981 7fb9e4ff9700 20 journal write_aio_bl .. 543215616~8192 in 1
2012-12-20 22:18:06.810051 7fb9d57fa700 10 filestore(/srv/ceph/osd/osd.24) write meta/a4c1c2a3/pglog_6.1fe3/0//-1 147735~147 = 147
2012-12-20 22:18:06.810065 7fb9d57fa700 10 filestore(/srv/ceph/osd/osd.24) collection_setattr /srv/ceph/osd/osd.24/current/6.1fe3_head 'ondisklog' len 34
2012-12-20 22:18:06.810068 7fb9e4ff9700 5 journal put_throttle finished 1 ops and 5748 bytes, now 0 ops and 0 bytes
2012-12-20 22:18:06.810077 7fb9e4ff9700 20 journal write_thread_entry going to sleep
2012-12-20 22:18:06.810103 7fb9d7fff700 20 journal write_finish_thread_entry waiting for aio(s)
2012-12-20 22:18:06.810149 7fb9d57fa700 10 filestore(/srv/ceph/osd/osd.24) collection_setattr /srv/ceph/osd/osd.24/current/6.1fe3_head 'ondisklog' len 34 = 34
2012-12-20 22:18:06.810159 7fb9d57fa700 10 filestore(/srv/ceph/osd/osd.24) collection_setattr /srv/ceph/osd/osd.24/current/6.1fe3_head 'info' len 5
2012-12-20 22:18:06.810191 7fb9d57fa700 10 filestore(/srv/ceph/osd/osd.24) collection_setattr /srv/ceph/osd/osd.24/current/6.1fe3_head 'info' len 5 = 5
2012-12-20 22:18:06.810198 7fb9d57fa700 15 filestore(/srv/ceph/osd/osd.24) truncate meta/8dfefeef/pginfo_6.1fe3/0//-1 size 0
2012-12-20 22:18:06.810307 7fb9d57fa700 10 filestore(/srv/ceph/osd/osd.24) truncate meta/8dfefeef/pginfo_6.1fe3/0//-1 size 0 = 0
2012-12-20 22:18:06.810317 7fb9d57fa700 15 filestore(/srv/ceph/osd/osd.24) write meta/8dfefeef/pginfo_6.1fe3/0//-1 0~504
2012-12-20 22:18:06.810386 7fb9d57fa700 10 filestore(/srv/ceph/osd/osd.24) write meta/8dfefeef/pginfo_6.1fe3/0//-1 0~504 = 504
2012-12-20 22:18:06.810394 7fb9d57fa700 10 filestore(/srv/ceph/osd/osd.24) _do_transaction on 0x7fb9b4f5dab8
2012-12-20 22:18:06.810399 7fb9d57fa700 15 filestore(/srv/ceph/osd/osd.24) write 6.1fe3_head/42463fe3/rb.0.122c.6b8b4567.000000000357/head//6 3522560~4096
2012-12-20 22:18:06.810451 7fb9d57fa700 10 filestore(/srv/ceph/osd/osd.24) write 6.1fe3_head/42463fe3/rb.0.122c.6b8b4567.000000000357/head//6 3522560~4096 = 4096
2012-12-20 22:18:06.810497 7fb9d57fa700 15 filestore(/srv/ceph/osd/osd.24) setattrs 6.1fe3_head/42463fe3/rb.0.122c.6b8b4567.000000000357/head//6
2012-12-20 22:18:06.810576 7fb9d7fff700 10 journal write_finish_thread_entry aio 543215616~8192 done
2012-12-20 22:18:06.810588 7fb9d7fff700 20 journal check_aio_completion
2012-12-20 22:18:06.810590 7fb9d7fff700 20 journal check_aio_completion completed seq 2087065 543215616~8192
2012-12-20 22:18:06.810594 7fb9d7fff700 20 journal check_aio_completion queueing finishers through seq 2087065
2012-12-20 22:18:06.810596 7fb9d7fff700 10 journal queue_completions_thru seq 2087065 queueing seq 2087065 0x7fb9b4f70ac0 lat 0.000761
2012-12-20 22:18:06.810631 7fb9d7fff700 20 journal write_finish_thread_entry sleeping
2012-12-20 22:18:06.810695 7fb9d5ffb700 10 osd.24 pg_epoch: 3473 pg[6.1fe3( v 3473'23689 (3247'22688,3473'23689] local-les=3472 n=12 ec=656 les/c 3472/3473 3465/3465/3465) [24,31] r=0 lpr=3465 luod=3473'23688 lcod 3473'23688 mlcod 3473'23687 active+clean] op_commit repgather(0x7fb9b47d1050 applying 3473'23689 rep_tid=72 wfack=24,31 wfdisk=24,31 op=osd_op(client.7636.0:100075 rb.0.122c.6b8b4567.000000000357 [write 3522560~4096] 6.42463fe3) v4)
2012-12-20 22:18:06.810720 7fb9d5ffb700 10 osd.24 pg_epoch: 3473 pg[6.1fe3( v 3473'23689 (3247'22688,3473'23689] local-les=3472 n=12 ec=656 les/c 3472/3473 3465/3465/3465) [24,31] r=0 lpr=3465 mlcod 3473'23687 active+clean] eval_repop repgather(0x7fb9b47d1050 applying 3473'23689 rep_tid=72 wfack=31 wfdisk=31 op=osd_op(client.7636.0:100075 rb.0.122c.6b8b4567.000000000357 [write 3522560~4096] 6.42463fe3) v4) wants=ad
2012-12-20 22:18:06.810992 7fb9d57fa700 10 filestore(/srv/ceph/osd/osd.24) setattrs 6.1fe3_head/42463fe3/rb.0.122c.6b8b4567.000000000357/head//6 = 230
2012-12-20 22:18:06.811053 7fb9d57fa700 15 filestore(/srv/ceph/osd/osd.24) setattrs 6.1fe3_head/42463fe3/rb.0.122c.6b8b4567.000000000357/head//6
2012-12-20 22:18:06.811083 7fb9d57fa700 10 filestore(/srv/ceph/osd/osd.24) setattrs 6.1fe3_head/42463fe3/rb.0.122c.6b8b4567.000000000357/head//6 = 31
2012-12-20 22:18:06.811093 7fb9d57fa700 10 journal op_apply_finish 2087065 open_ops 1 -> 0, max_applied_seq 2087064 -> 2087065
2012-12-20 22:18:06.811096 7fb9d57fa700 10 filestore(/srv/ceph/osd/osd.24) _do_op 0x7fb9b4376570 seq 2087065 r = 0, finisher 0x7fb9b5022bb0 0x7fb9b483ce10
2012-12-20 22:18:06.811100 7fb9d57fa700 10 filestore(/srv/ceph/osd/osd.24) _finish_op 0x7fb9b4376570 seq 2087065 osr(6.1fe3 0x21a4f50)/0x21a4f50
2012-12-20 22:18:06.811136 7fb9c37fe700 10 osd.24 pg_epoch: 3473 pg[6.1fe3( v 3473'23689 (3247'22688,3473'23689] local-les=3472 n=12 ec=656 les/c 3472/3473 3465/3465/3465) [24,31] r=0 lpr=3465 mlcod 3473'23687 active+clean] op_applied repgather(0x7fb9b47d1050 applying 3473'23689 rep_tid=72 wfack=31 wfdisk=31 op=osd_op(client.7636.0:100075 rb.0.122c.6b8b4567.000000000357 [write 3522560~4096] 6.42463fe3) v4)
2012-12-20 22:18:06.811205 7fb9c37fe700 10 osd.24 pg_epoch: 3473 pg[6.1fe3( v 3473'23689 (3247'22688,3473'23689] local-les=3472 n=12 ec=656 les/c 3472/3473 3465/3465/3465) [24,31] r=0 lpr=3465 mlcod 3473'23687 active+clean] op_applied mode was rmw(wr=1)
2012-12-20 22:18:06.811218 7fb9c37fe700 10 osd.24 pg_epoch: 3473 pg[6.1fe3( v 3473'23689 (3247'22688,3473'23689] local-les=3472 n=12 ec=656 les/c 3472/3473 3465/3465/3465) [24,31] r=0 lpr=3465 mlcod 3473'23687 active+clean] op_applied mode now idle(wr=0 WAKE) (finish_write)
2012-12-20 22:18:06.811229 7fb9c37fe700 10 osd.24 pg_epoch: 3473 pg[6.1fe3( v 3473'23689 (3247'22688,3473'23689] local-les=3472 n=12 ec=656 les/c 3472/3473 3465/3465/3465) [24,31] r=0 lpr=3465 mlcod 3473'23687 active+clean] put_object_context 0x7fb9b5248020 42463fe3/rb.0.122c.6b8b4567.000000000357/head//6 1 -> 0
2012-12-20 22:18:06.811241 7fb9c37fe700 15 osd.24 pg_epoch: 3473 pg[6.1fe3( v 3473'23689 (3247'22688,3473'23689] local-les=3472 n=12 ec=656 les/c 3472/3473 3465/3465/3465) [24,31] r=0 lpr=3465 mlcod 3473'23687 active+clean] requeue_ops
2012-12-20 22:18:06.811251 7fb9c37fe700 10 osd.24 pg_epoch: 3473 pg[6.1fe3( v 3473'23689 (3247'22688,3473'23689] local-les=3472 n=12 ec=656 les/c 3472/3473 3465/3465/3465) [24,31] r=0 lpr=3465 mlcod 3473'23687 active+clean] put_snapset_context rb.0.122c.6b8b4567.000000000357 1 -> 0
2012-12-20 22:18:06.811267 7fb9c37fe700 10 osd.24 pg_epoch: 3473 pg[6.1fe3( v 3473'23689 (3247'22688,3473'23689] local-les=3472 n=12 ec=656 les/c 3472/3473 3465/3465/3465) [24,31] r=0 lpr=3465 mlcod 3473'23687 active+clean] eval_repop repgather(0x7fb9b47d1050 applied 3473'23689 rep_tid=72 wfack=31 wfdisk=31 op=osd_op(client.7636.0:100075 rb.0.122c.6b8b4567.000000000357 [write 3522560~4096] 6.42463fe3) v4) wants=ad
2012-12-20 22:18:06.813738 7fb9c0ff9700 20 osd.24 3473 _dispatch 0x7fb9600008f0 osd_sub_op_reply(client.7636.0:100075 6.1fe3 42463fe3/rb.0.122c.6b8b4567.000000000357/head//6 [] ondisk, result = 0) v1
2012-12-20 22:18:06.813850 7fb9c0ff9700 15 osd.24 3473 require_same_or_newer_map 3473 (i am 3473) 0x7fb9600008f0
2012-12-20 22:18:06.813856 7fb9c0ff9700 20 osd.24 3473 _share_map_incoming osd.31 172.16.0.12:6820/2871 3473
2012-12-20 22:18:06.813865 7fb9c0ff9700 15 osd.24 3473 enqueue_op 0x7fb968055650 osd_sub_op_reply(client.7636.0:100075 6.1fe3 42463fe3/rb.0.122c.6b8b4567.000000000357/head//6 [] ondisk, result = 0) v1
2012-12-20 22:18:06.813934 7fb9bd7f2700 10 osd.24 3473 dequeue_op 0x7fb968055650 osd_sub_op_reply(client.7636.0:100075 6.1fe3 42463fe3/rb.0.122c.6b8b4567.000000000357/head//6 [] ondisk, result = 0) v1 pg pg[6.1fe3( v 3473'23689 (3247'22688,3473'23689] local-les=3472 n=12 ec=656 les/c 3472/3473 3465/3465/3465) [24,31] r=0 lpr=3465 mlcod 3473'23687 active+clean]
2012-12-20 22:18:06.814005 7fb9bd7f2700 7 osd.24 pg_epoch: 3473 pg[6.1fe3( v 3473'23689 (3247'22688,3473'23689] local-les=3472 n=12 ec=656 les/c 3472/3473 3465/3465/3465) [24,31] r=0 lpr=3465 mlcod 3473'23687 active+clean] repop_ack rep_tid 72 op osd_op(client.7636.0:100075 rb.0.122c.6b8b4567.000000000357 [write 3522560~4096] 6.42463fe3) v4 result 0 ack_type 4 from osd.31
2012-12-20 22:18:06.814037 7fb9bd7f2700 10 osd.24 pg_epoch: 3473 pg[6.1fe3( v 3473'23689 (3247'22688,3473'23689] local-les=3472 n=12 ec=656 les/c 3472/3473 3465/3465/3465) [24,31] r=0 lpr=3465 mlcod 3473'23687 active+clean] eval_repop repgather(0x7fb9b47d1050 applied 3473'23689 rep_tid=72 wfack= wfdisk= op=osd_op(client.7636.0:100075 rb.0.122c.6b8b4567.000000000357 [write 3522560~4096] 6.42463fe3) v4) wants=ad
2012-12-20 22:18:06.814057 7fb9bd7f2700 15 osd.24 pg_epoch: 3473 pg[6.1fe3( v 3473'23689 (3247'22688,3473'23689] local-les=3472 n=12 ec=656 les/c 3472/3473 3465/3465/3465) [24,31] r=0 lpr=3465 mlcod 3473'23687 active+clean] log_op_stats osd_op(client.7636.0:100075 rb.0.122c.6b8b4567.000000000357 [write 3522560~4096] 6.42463fe3) v4 inb 4239 outb 0 rlat 0.000000 lat 0.005594
2012-12-20 22:18:06.814075 7fb9bd7f2700 15 osd.24 pg_epoch: 3473 pg[6.1fe3( v 3473'23689 (3247'22688,3473'23689] local-les=3472 n=12 ec=656 les/c 3472/3473 3465/3465/3465) [24,31] r=0 lpr=3465 mlcod 3473'23687 active+clean] update_stats 3465'46631
2012-12-20 22:18:06.814088 7fb9bd7f2700 10 osd.24 pg_epoch: 3473 pg[6.1fe3( v 3473'23689 (3247'22688,3473'23689] local-les=3472 n=12 ec=656 les/c 3472/3473 3465/3465/3465) [24,31] r=0 lpr=3465 mlcod 3473'23687 active+clean] sending commit on repgather(0x7fb9b47d1050 applied 3473'23689 rep_tid=72 wfack= wfdisk= op=osd_op(client.7636.0:100075 rb.0.122c.6b8b4567.000000000357 [write 3522560~4096] 6.42463fe3) v4) 0x7fb9b4605310
2012-12-20 22:18:06.814122 7fb9bd7f2700 10 osd.24 pg_epoch: 3473 pg[6.1fe3( v 3473'23689 (3247'22688,3473'23689] local-les=3472 n=12 ec=656 les/c 3472/3473 3465/3465/3465) [24,31] r=0 lpr=3465 mlcod 3473'23688 active+clean] removing repgather(0x7fb9b47d1050 applied 3473'23689 rep_tid=72 wfack= wfdisk= op=osd_op(client.7636.0:100075 rb.0.122c.6b8b4567.000000000357 [write 3522560~4096] 6.42463fe3) v4)
2012-12-20 22:18:06.814147 7fb9bd7f2700 20 osd.24 pg_epoch: 3473 pg[6.1fe3( v 3473'23689 (3247'22688,3473'23689] local-les=3472 n=12 ec=656 les/c 3472/3473 3465/3465/3465) [24,31] r=0 lpr=3465 mlcod 3473'23688 active+clean] q front is repgather(0x7fb9b47d1050 applied 3473'23689 rep_tid=72 wfack= wfdisk= op=osd_op(client.7636.0:100075 rb.0.122c.6b8b4567.000000000357 [write 3522560~4096] 6.42463fe3) v4)
2012-12-20 22:18:06.814204 7fb9bd7f2700 10 osd.24 3473 dequeue_op 0x7fb968055650 finish
2012-12-20 22:18:07.030914 7fb9be7f4700 10 monclient: tick
2012-12-20 22:18:07.030929 7fb9be7f4700 10 monclient: _check_auth_rotating renewing rotating keys (they expired before 2012-12-20 22:17:37.030927)
2012-12-20 22:18:07.030952 7fb9be7f4700 10 monclient: renew subs? (now: 2012-12-20 22:18:07.030952; renew after: 2012-12-20 22:18:33.322998) -- no
2012-12-20 22:18:07.172960 7fb9bafed700 20 osd.24 3473 update_osd_stat osd_stat(21111 MB used, 393 GB avail, 431 GB total, peers [28,29,30,31]/[])
2012-12-20 22:18:07.172979 7fb9bafed700 5 osd.24 3473 heartbeat: osd_stat(21111 MB used, 393 GB avail, 431 GB total, peers [28,29,30,31]/[])
2012-12-20 22:18:07.173647 7fb9bfff7700 20 osd.24 3473 _share_map_outgoing 0x7fb9b400d440 already has epoch 3473
2012-12-20 22:18:07.173675 7fb9bfff7700 20 osd.24 3473 _share_map_outgoing 0x7fb9a002f010 already has epoch 3473
2012-12-20 22:18:07.173785 7fb9bfff7700 20 osd.24 3473 _share_map_outgoing 0x7fb9a00334d0 already has epoch 3473
2012-12-20 22:18:07.173875 7fb9bfff7700 20 osd.24 3473 _share_map_outgoing 0x7fb9a0033fc0 already has epoch 3473
2012-12-20 22:18:07.479259 7fb9e6ffd700 5 osd.24 3473 tick
2012-12-20 22:18:07.479311 7fb9e6ffd700 20 osd.24 3473 scrub_should_schedule loadavg 6.36 >= max 0.5 = no, load too high
2012-12-20 22:18:07.479326 7fb9e6ffd700 7 osd.24 3473 do_mon_report
2012-12-20 22:18:07.479329 7fb9e6ffd700 10 osd.24 3473 send_alive up_thru currently 3465 want 3465
2012-12-20 22:18:07.479331 7fb9e6ffd700 20 osd.24 3473 send_pg_stats
2012-12-20 22:18:07.479334 7fb9e6ffd700 10 osd.24 3473 send_pg_stats - 2 pgs updated
2012-12-20 22:18:07.479343 7fb9e6ffd700 10 monclient: _send_mon_message to mon.KVM03 at 172.16.0.12:6789/0
2012-12-20 22:18:07.774794 7fb9bf7f6700 20 osd.24 3473 _share_map_outgoing 0x7fb9a002f010 already has epoch 3473
2012-12-20 22:18:07.809251 7fb9e57fa700 20 filestore(/srv/ceph/osd/osd.24) sync_entry woke after 5.000117
2012-12-20 22:18:07.809274 7fb9e57fa700 10 journal commit_start max_applied_seq 2087065, open_ops 0
2012-12-20 22:18:07.809278 7fb9e57fa700 10 journal commit_start blocked, all open_ops have completed
2012-12-20 22:18:07.809279 7fb9e57fa700 10 journal commit_start committing 2087065, still blocked
2012-12-20 22:18:07.809282 7fb9e57fa700 10 journal commit_start
2012-12-20 22:18:07.809291 7fb9e57fa700 15 filestore(/srv/ceph/osd/osd.24) sync_entry committing 2087065 sync_epoch 19
2012-12-20 22:18:07.809339 7fb9e57fa700 10 filestore(/srv/ceph/osd/osd.24) taking async snap 'snap_2087065'
2012-12-20 22:18:07.817045 7fb9e57fa700 20 filestore(/srv/ceph/osd/osd.24) async snap create 'snap_2087065' transid 96549
2012-12-20 22:18:07.817060 7fb9e57fa700 10 journal commit_started committing 2087065, unblocking
2012-12-20 22:18:07.817069 7fb9e57fa700 20 filestore(/srv/ceph/osd/osd.24) waiting for transid 96549 to complete
2012-12-20 22:18:07.836752 7fb9e57fa700 20 filestore(/srv/ceph/osd/osd.24) done waiting for transid 96549 to complete
2012-12-20 22:18:07.836767 7fb9e57fa700 10 filestore(/srv/ceph/osd/osd.24) sync_entry commit took 0.027482, interval was 5.027633
2012-12-20 22:18:07.836775 7fb9e57fa700 10 journal commit_finish thru 2087065
2012-12-20 22:18:07.836778 7fb9e57fa700 5 journal committed_thru 2087065 (last_committed_seq 2087063)
2012-12-20 22:18:07.836781 7fb9e57fa700 10 journal header: block_size 4096 alignment 4096 max_size 960495616
2012-12-20 22:18:07.836783 7fb9e57fa700 10 journal header: start 543223808
2012-12-20 22:18:07.836784 7fb9e57fa700 10 journal write_pos 543223808
2012-12-20 22:18:07.836788 7fb9e57fa700 10 journal committed_thru done
2012-12-20 22:18:07.836793 7fb9e57fa700 10 filestore(/srv/ceph/osd/osd.24) removing snap 'snap_2087061'
2012-12-20 22:18:07.836958 7fb9e57fa700 15 filestore(/srv/ceph/osd/osd.24) sync_entry committed to op_seq 2087065
2012-12-20 22:18:07.836972 7fb9e57fa700 20 filestore(/srv/ceph/osd/osd.24) sync_entry waiting for max_interval 5.000000
2012-12-20 22:18:07.855890 7fb9bf7f6700 20 osd.24 3473 _share_map_outgoing 0x7fb9a0033fc0 already has epoch 3473
2012-12-20 22:18:08.221094 7fb9c1ffb700 20 osd.24 3473 _dispatch 0x7fb990002de0 pg_stats_ack(2 pgs tid 18) v1
2012-12-20 22:18:08.221105 7fb9c1ffb700 10 osd.24 3473 handle_pg_stats_ack
2012-12-20 22:18:08.479450 7fb9e6ffd700 5 osd.24 3473 tick
2012-12-20 22:18:08.479499 7fb9e6ffd700 20 osd.24 3473 scrub_should_schedule loadavg 6.36 >= max 0.5 = no, load too high
2012-12-20 22:18:09.479581 7fb9e6ffd700 5 osd.24 3473 tick
2012-12-20 22:18:09.479633 7fb9e6ffd700 20 osd.24 3473 scrub_should_schedule loadavg 6.36 >= max 0.5 = no, load too high
2012-12-20 22:18:10.479743 7fb9e6ffd700 5 osd.24 3473 tick
2012-12-20 22:18:10.479808 7fb9e6ffd700 20 osd.24 3473 scrub_should_schedule loadavg 6.36 >= max 0.5 = no, load too high
2012-12-20 22:18:11.273304 7fb9bafed700 20 osd.24 3473 update_osd_stat osd_stat(21111 MB used, 393 GB avail, 431 GB total, peers [28,29,30,31]/[])
2012-12-20 22:18:11.273324 7fb9bafed700 5 osd.24 3473 heartbeat: osd_stat(21111 MB used, 393 GB avail, 431 GB total, peers [28,29,30,31]/[])
2012-12-20 22:18:11.274018 7fb9bfff7700 20 osd.24 3473 _share_map_outgoing 0x7fb9b400d440 already has epoch 3473
2012-12-20 22:18:11.274081 7fb9bfff7700 20 osd.24 3473 _share_map_outgoing 0x7fb9a002f010 already has epoch 3473
2012-12-20 22:18:11.274101 7fb9bfff7700 20 osd.24 3473 _share_map_outgoing 0x7fb9a00334d0 already has epoch 3473
2012-12-20 22:18:11.274249 7fb9bfff7700 20 osd.24 3473 _share_map_outgoing 0x7fb9a0033fc0 already has epoch 3473
2012-12-20 22:18:11.275112 7fb9bf7f6700 20 osd.24 3473 _share_map_outgoing 0x7fb9a002f010 already has epoch 3473
2012-12-20 22:18:11.356141 7fb9bf7f6700 20 osd.24 3473 _share_map_outgoing 0x7fb9a0033fc0 already has epoch 3473
2012-12-20 22:18:11.479927 7fb9e6ffd700 5 osd.24 3473 tick
2012-12-20 22:18:11.479976 7fb9e6ffd700 20 osd.24 3473 scrub_should_schedule loadavg 6.36 >= max 0.5 = no, load too high
2012-12-20 22:18:12.182095 7fb9bf7f6700 20 osd.24 3473 _share_map_outgoing 0x7fb9a00334d0 already has epoch 3473
2012-12-20 22:18:12.373677 7fb9bafed700 20 osd.24 3473 update_osd_stat osd_stat(21111 MB used, 393 GB avail, 431 GB total, peers [28,29,30,31]/[])
2012-12-20 22:18:12.373694 7fb9bafed700 5 osd.24 3473 heartbeat: osd_stat(21111 MB used, 393 GB avail, 431 GB total, peers [28,29,30,31]/[])
2012-12-20 22:18:12.374496 7fb9bfff7700 20 osd.24 3473 _share_map_outgoing 0x7fb9b400d440 already has epoch 3473
2012-12-20 22:18:12.374523 7fb9bfff7700 20 osd.24 3473 _share_map_outgoing 0x7fb9a002f010 already has epoch 3473
2012-12-20 22:18:12.374541 7fb9bfff7700 20 osd.24 3473 _share_map_outgoing 0x7fb9a00334d0 already has epoch 3473
2012-12-20 22:18:12.374564 7fb9bfff7700 20 osd.24 3473 _share_map_outgoing 0x7fb9a0033fc0 already has epoch 3473
2012-12-20 22:18:12.480076 7fb9e6ffd700 5 osd.24 3473 tick
2012-12-20 22:18:12.480125 7fb9e6ffd700 20 osd.24 3473 scrub_should_schedule loadavg 6.09 >= max 0.5 = no, load too high
2012-12-20 22:18:12.480140 7fb9e6ffd700 7 osd.24 3473 do_mon_report
2012-12-20 22:18:12.480142 7fb9e6ffd700 10 osd.24 3473 send_alive up_thru currently 3465 want 3465
2012-12-20 22:18:12.480146 7fb9e6ffd700 20 osd.24 3473 send_pg_stats
2012-12-20 22:18:12.701202 7fb9bf7f6700 20 osd.24 3473 _share_map_outgoing 0x7fb9b400d440 already has epoch 3473
2012-12-20 22:18:12.837067 7fb9e57fa700 20 filestore(/srv/ceph/osd/osd.24) sync_entry woke after 5.000094
2012-12-20 22:18:12.837089 7fb9e57fa700 10 journal commit_start max_applied_seq 2087065, open_ops 0
2012-12-20 22:18:12.837093 7fb9e57fa700 10 journal commit_start blocked, all open_ops have completed
2012-12-20 22:18:12.837095 7fb9e57fa700 10 journal commit_start nothing to do
2012-12-20 22:18:12.837100 7fb9e57fa700 10 journal commit_start
2012-12-20 22:18:12.837117 7fb9e57fa700 20 filestore(/srv/ceph/osd/osd.24) sync_entry waiting for max_interval 5.000000
2012-12-20 22:18:13.480239 7fb9e6ffd700 5 osd.24 3473 tick
2012-12-20 22:18:13.480289 7fb9e6ffd700 20 osd.24 3473 scrub_should_schedule loadavg 6.09 >= max 0.5 = no, load too high
2012-12-20 22:18:14.073917 7fb9bafed700 20 osd.24 3473 update_osd_stat osd_stat(21111 MB used, 393 GB avail, 431 GB total, peers [28,29,30,31]/[])
2012-12-20 22:18:14.073935 7fb9bafed700 5 osd.24 3473 heartbeat: osd_stat(21111 MB used, 393 GB avail, 431 GB total, peers [28,29,30,31]/[])
2012-12-20 22:18:14.074840 7fb9bfff7700 20 osd.24 3473 _share_map_outgoing 0x7fb9a0033fc0 already has epoch 3473
2012-12-20 22:18:14.074868 7fb9bfff7700 20 osd.24 3473 _share_map_outgoing 0x7fb9a00334d0 already has epoch 3473
2012-12-20 22:18:14.074886 7fb9bfff7700 20 osd.24 3473 _share_map_outgoing 0x7fb9b400d440 already has epoch 3473
2012-12-20 22:18:14.074903 7fb9bfff7700 20 osd.24 3473 _share_map_outgoing 0x7fb9a002f010 already has epoch 3473
2012-12-20 22:18:14.480386 7fb9e6ffd700 5 osd.24 3473 tick
2012-12-20 22:18:14.480443 7fb9e6ffd700 20 osd.24 3473 scrub_should_schedule loadavg 6.09 >= max 0.5 = no, load too high
2012-12-20 22:18:15.480571 7fb9e6ffd700 5 osd.24 3473 tick
2012-12-20 22:18:15.480632 7fb9e6ffd700 20 osd.24 3473 scrub_should_schedule loadavg 6.09 >= max 0.5 = no, load too high
2012-12-20 22:18:16.282625 7fb9bf7f6700 20 osd.24 3473 _share_map_outgoing 0x7fb9a00334d0 already has epoch 3473
2012-12-20 22:18:16.480714 7fb9e6ffd700 5 osd.24 3473 tick
2012-12-20 22:18:16.480763 7fb9e6ffd700 20 osd.24 3473 scrub_should_schedule loadavg 6.09 >= max 0.5 = no, load too high
2012-12-20 22:18:16.801391 7fb9bf7f6700 20 osd.24 3473 _share_map_outgoing 0x7fb9b400d440 already has epoch 3473
2012-12-20 22:18:17.031056 7fb9be7f4700 10 monclient: tick
2012-12-20 22:18:17.031082 7fb9be7f4700 10 monclient: _check_auth_rotating renewing rotating keys (they expired before 2012-12-20 22:17:47.031080)
2012-12-20 22:18:17.031124 7fb9be7f4700 10 monclient: renew subs? (now: 2012-12-20 22:18:17.031124; renew after: 2012-12-20 22:18:33.322998) -- no
2012-12-20 22:18:17.175266 7fb9bf7f6700 20 osd.24 3473 _share_map_outgoing 0x7fb9a002f010 already has epoch 3473
2012-12-20 22:18:17.256256 7fb9bf7f6700 20 osd.24 3473 _share_map_outgoing 0x7fb9a0033fc0 already has epoch 3473
2012-12-20 22:18:17.480898 7fb9e6ffd700 5 osd.24 3473 tick
2012-12-20 22:18:17.480952 7fb9e6ffd700 20 osd.24 3473 scrub_should_schedule loadavg 5.84 >= max 0.5 = no, load too high
2012-12-20 22:18:17.480969 7fb9e6ffd700 7 osd.24 3473 do_mon_report
2012-12-20 22:18:17.480972 7fb9e6ffd700 10 osd.24 3473 send_alive up_thru currently 3465 want 3465
2012-12-20 22:18:17.480975 7fb9e6ffd700 20 osd.24 3473 send_pg_stats
2012-12-20 22:18:17.837232 7fb9e57fa700 20 filestore(/srv/ceph/osd/osd.24) sync_entry woke after 5.000116
2012-12-20 22:18:17.837254 7fb9e57fa700 10 journal commit_start max_applied_seq 2087065, open_ops 0
2012-12-20 22:18:17.837258 7fb9e57fa700 10 journal commit_start blocked, all open_ops have completed
2012-12-20 22:18:17.837260 7fb9e57fa700 10 journal commit_start nothing to do
2012-12-20 22:18:17.837262 7fb9e57fa700 10 journal commit_start
2012-12-20 22:18:17.837271 7fb9e57fa700 20 filestore(/srv/ceph/osd/osd.24) sync_entry waiting for max_interval 5.000000
2012-12-20 22:18:18.481073 7fb9e6ffd700 5 osd.24 3473 tick
2012-12-20 22:18:18.481133 7fb9e6ffd700 20 osd.24 3473 scrub_should_schedule loadavg 5.84 >= max 0.5 = no, load too high
2012-12-20 22:18:19.481247 7fb9e6ffd700 5 osd.24 3473 tick
2012-12-20 22:18:19.481308 7fb9e6ffd700 20 osd.24 3473 scrub_should_schedule loadavg 5.84 >= max 0.5 = no, load too high
2012-12-20 22:18:19.974219 7fb9bafed700 20 osd.24 3473 update_osd_stat osd_stat(21111 MB used, 393 GB avail, 431 GB total, peers [28,29,30,31]/[])
2012-12-20 22:18:19.974238 7fb9bafed700 5 osd.24 3473 heartbeat: osd_stat(21111 MB used, 393 GB avail, 431 GB total, peers [28,29,30,31]/[])
2012-12-20 22:18:19.974899 7fb9bfff7700 20 osd.24 3473 _share_map_outgoing 0x7fb9a002f010 already has epoch 3473
2012-12-20 22:18:19.975070 7fb9bfff7700 20 osd.24 3473 _share_map_outgoing 0x7fb9a00334d0 already has epoch 3473
2012-12-20 22:18:19.975116 7fb9bfff7700 20 osd.24 3473 _share_map_outgoing 0x7fb9a0033fc0 already has epoch 3473
2012-12-20 22:18:19.975207 7fb9bfff7700 20 osd.24 3473 _share_map_outgoing 0x7fb9b400d440 already has epoch 3473
2012-12-20 22:18:20.481425 7fb9e6ffd700 5 osd.24 3473 tick
2012-12-20 22:18:20.481487 7fb9e6ffd700 20 osd.24 3473 scrub_should_schedule loadavg 5.84 >= max 0.5 = no, load too high
2012-12-20 22:18:21.016361 7fb9e77fe700 -1 osd.24 3473 *** Got signal Terminated ***
2012-12-20 22:18:21.016458 7fb9e77fe700 -1 osd.24 3473 pausing thread pools
2012-12-20 22:18:21.016462 7fb9e77fe700 -1 osd.24 3473 flushing io
2012-12-20 22:18:21.016464 7fb9e77fe700 10 filestore(/srv/ceph/osd/osd.24) sync_and_flush
2012-12-20 22:18:21.016467 7fb9e77fe700 10 filestore(/srv/ceph/osd/osd.24) _flush_op_queue draining op tp
2012-12-20 22:18:21.016471 7fb9e77fe700 10 filestore(/srv/ceph/osd/osd.24) _flush_op_queue waiting for apply finisher
2012-12-20 22:18:21.016481 7fb9e77fe700 10 filestore(/srv/ceph/osd/osd.24) start_sync
2012-12-20 22:18:21.016483 7fb9e77fe700 10 filestore(/srv/ceph/osd/osd.24) sync waiting
2012-12-20 22:18:21.016523 7fb9e57fa700 20 filestore(/srv/ceph/osd/osd.24) sync_entry woke after 3.179252
2012-12-20 22:18:21.016540 7fb9e57fa700 10 journal commit_start max_applied_seq 2087065, open_ops 0
2012-12-20 22:18:21.016543 7fb9e57fa700 10 journal commit_start blocked, all open_ops have completed
2012-12-20 22:18:21.016545 7fb9e57fa700 10 journal commit_start nothing to do
2012-12-20 22:18:21.016548 7fb9e57fa700 10 journal commit_start
2012-12-20 22:18:21.016563 7fb9e57fa700 20 filestore(/srv/ceph/osd/osd.24) sync_entry waiting for max_interval 5.000000
2012-12-20 22:18:21.016563 7fb9e77fe700 10 filestore(/srv/ceph/osd/osd.24) sync done
2012-12-20 22:18:21.016570 7fb9e77fe700 10 filestore(/srv/ceph/osd/osd.24) sync_and_flush done
2012-12-20 22:18:21.016572 7fb9e77fe700 -1 osd.24 3473 removing pid file
2012-12-20 22:18:21.016665 7fb9e77fe700 -1 osd.24 3473 exit
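One way to quantify how long the OSD sat idle in a log like the one above is to parse each line and measure the gap between the last applied journal op and the termination signal. The following is a rough sketch, not a Ceph tool: the line layout (`date time thread-id level message`) is inferred from this excerpt only, and `parse_line` and `idle_before_shutdown` are hypothetical helper names.

```python
import re
from datetime import datetime

# Layout assumed from the excerpt: "<date> <time> <thread-id> <level> <message>".
LINE_RE = re.compile(
    r"^(\d{4}-\d{2}-\d{2} \d{2}:\d{2}:\d{2}\.\d{6}) ([0-9a-f]+) +(-?\d+) (.*)$"
)
SEQ_RE = re.compile(r"journal op_apply_finish (\d+)")

def parse_line(line):
    """Return (timestamp, thread_id, level, message), or None if no match."""
    m = LINE_RE.match(line.strip())
    if m is None:
        return None
    ts = datetime.strptime(m.group(1), "%Y-%m-%d %H:%M:%S.%f")
    return ts, m.group(2), int(m.group(3)), m.group(4)

def idle_before_shutdown(lines):
    """Return (last applied seq, seconds from that apply to the SIGTERM line)."""
    last_apply = None
    last_seq = None
    for line in lines:
        rec = parse_line(line)
        if rec is None:
            continue
        ts, _thread, _level, msg = rec
        m = SEQ_RE.search(msg)
        if m:
            last_apply, last_seq = ts, int(m.group(1))
        elif "Got signal Terminated" in msg and last_apply is not None:
            return last_seq, (ts - last_apply).total_seconds()
    return last_seq, None

# Three lines copied from the log above.
sample = [
    "2012-12-20 22:18:06.811093 7fb9d57fa700 10 journal op_apply_finish 2087065 open_ops 1 -> 0, max_applied_seq 2087064 -> 2087065",
    "2012-12-20 22:18:07.479259 7fb9e6ffd700 5 osd.24 3473 tick",
    "2012-12-20 22:18:21.016361 7fb9e77fe700 -1 osd.24 3473 *** Got signal Terminated ***",
]
seq, idle = idle_before_shutdown(sample)
print(seq, idle)
```

On these lines the last applied op is seq 2087065, and roughly 14.2 seconds pass before the signal arrives. During that window the log shows only ticks, heartbeats and empty `commit_start nothing to do` cycles.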
^ permalink raw reply [flat|nested] 13+ messages in thread
* Re: OSD's slow down to a crawl
2012-12-20 15:16 OSD's slow down to a crawl Matthew Anderson
@ 2012-12-20 15:28 ` Mark Nelson
2012-12-20 15:51 ` Matthew Anderson
0 siblings, 1 reply; 13+ messages in thread
From: Mark Nelson @ 2012-12-20 15:28 UTC (permalink / raw)
To: Matthew Anderson; +Cc: ceph-devel@vger.kernel.org
On 12/20/2012 09:16 AM, Matthew Anderson wrote:
> Hi All,
>
> I've run into an issue where OSD's slow right down to the point that they no longer appear to be processing write IO and everything grinds to a halt. Once they've stopped performing IO they can be brought back to life by restarting them where they continue on for a while before they slow down again and eventually everything stops again. I'm at a total loss to explain the issue because everything appears to be working correctly yet something is clearly very wrong. Running 'ceph -w' reports everything as clean+active and there are never any slow requests logged.
>
> The problem appears to be more pronounced on the newer 3.7 kernel but happened on previous versions in the same fashion. Weird things like OSD's never completing recovery until they are restarted was also happening occasionally. The cluster would be stuck on a certain percentage degraded and restarting the OSD 3-4 times would eventually see it heal itself with the degraded percentage dropping a little each time. I'm testing with both the kernel CephFS client and QEMU 1.3.0 accessing an RBD, both slow down to a stop in the same manner.
>
> Kernel 3.7.0 mainline (also occurs on 3.7.1 and 3.6.8)
> Ceph version 0.55.1-300-g08c6424 (08c64249eb8cd7922de5c398a9426538918db77c) (also occurs on 0.55 and 0.55.1)
> Total of 3 hosts. Two of them with 4 OSD's on an SSD each. The other with 24x 2TB SATA drives. Each host is running a monitor. I have setup two pools in the CRUSHMAP, one for SATA and one for SSD. Both pools have the same problem.
>
> OSD config:
> [osd.24]
> host = KVM04
> osd data = /srv/ceph/osd/$name
> btrfs devs = /dev/sda2
> btrfs options = rw,noatime,ssd
> osd journal = /dev/sda1
>
> The log below is when the OSD has stopped processing IO and was then shutdown cleanly. The QEMU guest was partially responsive but the benchmark had frozen waiting for IO to complete.
>
> If I can do anymore debugging to help diagnose the problem please let me know.

Hi Matt,

Yeah, your log is looking pretty sparse! I imagine Sam will have a lot of questions for you. Out of curiosity have you looked at any io/cpu/memory stats when things slow down? sysprof may come in handy if it looks like it's CPU related. Also, check out the OSD admin socket. Sebastien Han made a nice little blog entry describing it:

http://www.sebastien-han.fr/blog/2012/08/14/ceph-admin-socket/

You can get a lot of information about the current state of things.

Mark

>
> Thanks
> -Matt
> -------------------------------------
>
> 2012-12-20 22:18:06.809863 7fb9e4ff9700 20 journal write_thread_entry aio throttle: aio num 0 bytes 0 ... exp 0 min_new 1 ... pending 5748
> 2012-12-20 22:18:06.809871 7fb9e4ff9700 10 journal room 960479231 max_size 960495616 pos 543215616 header.start 543203328 top 4096
> 2012-12-20 22:18:06.809877 7fb9e4ff9700 10 journal check_for_full at 543215616 : 8192 < 960479231
> 2012-12-20 22:18:06.809880 7fb9e4ff9700 15 journal prepare_single_write 1 will write 543215616 : seq 2087065 len 5748 -> 8192 (head 40 pre_pad 0 ebl 5748 post_pad 2364 tail 40) (ebl alignment -1)
> 2012-12-20 22:18:06.809891 7fb9d57fa700 10 journal op_apply_start 2087065 open_ops 0 -> 1
> 2012-12-20 22:18:06.809897 7fb9d57fa700 5 filestore(/srv/ceph/osd/osd.24) _do_op 0x7fb9b4376570 seq 2087065 osr(6.1fe3 0x21a4f50)/0x21a4f50 start
> 2012-12-20 22:18:06.809901 7fb9d57fa700 10 filestore(/srv/ceph/osd/osd.24) _do_transaction on 0x7fb9b4f5db40
> 2012-12-20 22:18:06.809915 7fb9d57fa700 15 filestore(/srv/ceph/osd/osd.24) write meta/a4c1c2a3/pglog_6.1fe3/0//-1 147735~147
> 2012-12-20 22:18:06.809947 7fb9e4ff9700 20 journal prepare_multi_write queue_pos now 543223808
> 2012-12-20 22:18:06.809953 7fb9e4ff9700 15 journal do_aio_write writing 543215616~8192
> 2012-12-20 22:18:06.809975 7fb9e4ff9700 20 journal write_aio_bl 543215616~8192 seq 2087065
> 2012-12-20 22:18:06.809981 7fb9e4ff9700 20 journal write_aio_bl .. 543215616~8192 in 1
> 2012-12-20 22:18:06.810051 7fb9d57fa700 10 filestore(/srv/ceph/osd/osd.24) write meta/a4c1c2a3/pglog_6.1fe3/0//-1 147735~147 = 147
> 2012-12-20 22:18:06.810065 7fb9d57fa700 10 filestore(/srv/ceph/osd/osd.24) collection_setattr /srv/ceph/osd/osd.24/current/6.1fe3_head 'ondisklog' len 34
> 2012-12-20 22:18:06.810068 7fb9e4ff9700 5 journal put_throttle finished 1 ops and 5748 bytes, now 0 ops and 0 bytes
> 2012-12-20 22:18:06.810077 7fb9e4ff9700 20 journal write_thread_entry going to sleep
> 2012-12-20 22:18:06.810103 7fb9d7fff700 20 journal write_finish_thread_entry waiting for aio(s)
> 2012-12-20 22:18:06.810149 7fb9d57fa700 10 filestore(/srv/ceph/osd/osd.24) collection_setattr /srv/ceph/osd/osd.24/current/6.1fe3_head 'ondisklog' len 34 = 34
> 2012-12-20 22:18:06.810159 7fb9d57fa700 10 filestore(/srv/ceph/osd/osd.24) collection_setattr /srv/ceph/osd/osd.24/current/6.1fe3_head 'info' len 5
> 2012-12-20 22:18:06.810191 7fb9d57fa700 10 filestore(/srv/ceph/osd/osd.24) collection_setattr /srv/ceph/osd/osd.24/current/6.1fe3_head 'info' len 5 = 5
> 2012-12-20 22:18:06.810198 7fb9d57fa700 15 filestore(/srv/ceph/osd/osd.24) truncate meta/8dfefeef/pginfo_6.1fe3/0//-1 size 0
> 2012-12-20 22:18:06.810307 7fb9d57fa700 10 filestore(/srv/ceph/osd/osd.24) truncate meta/8dfefeef/pginfo_6.1fe3/0//-1 size 0 = 0
> 2012-12-20 22:18:06.810317 7fb9d57fa700 15 filestore(/srv/ceph/osd/osd.24) write meta/8dfefeef/pginfo_6.1fe3/0//-1 0~504
> 2012-12-20 22:18:06.810386 7fb9d57fa700 10 filestore(/srv/ceph/osd/osd.24) write meta/8dfefeef/pginfo_6.1fe3/0//-1 0~504 = 504
> 2012-12-20 22:18:06.810394 7fb9d57fa700 10 filestore(/srv/ceph/osd/osd.24) _do_transaction on 0x7fb9b4f5dab8
> 2012-12-20 22:18:06.810399 7fb9d57fa700 15 filestore(/srv/ceph/osd/osd.24) write 6.1fe3_head/42463fe3/rb.0.122c.6b8b4567.000000000357/head//6 3522560~4096
> 2012-12-20 22:18:06.810451 7fb9d57fa700 10 filestore(/srv/ceph/osd/osd.24) write 6.1fe3_head/42463fe3/rb.0.122c.6b8b4567.000000000357/head//6 
3522560~4096 = 4096 > 2012-12-20 22:18:06.810497 7fb9d57fa700 15 filestore(/srv/ceph/osd/osd.24) setattrs 6.1fe3_head/42463fe3/rb.0.122c.6b8b4567.000000000357/head//6 > 2012-12-20 22:18:06.810576 7fb9d7fff700 10 journal write_finish_thread_entry aio 543215616~8192 done > 2012-12-20 22:18:06.810588 7fb9d7fff700 20 journal check_aio_completion > 2012-12-20 22:18:06.810590 7fb9d7fff700 20 journal check_aio_completion completed seq 2087065 543215616~8192 > 2012-12-20 22:18:06.810594 7fb9d7fff700 20 journal check_aio_completion queueing finishers through seq 2087065 > 2012-12-20 22:18:06.810596 7fb9d7fff700 10 journal queue_completions_thru seq 2087065 queueing seq 2087065 0x7fb9b4f70ac0 lat 0.000761 > 2012-12-20 22:18:06.810631 7fb9d7fff700 20 journal write_finish_thread_entry sleeping > 2012-12-20 22:18:06.810695 7fb9d5ffb700 10 osd.24 pg_epoch: 3473 pg[6.1fe3( v 3473'23689 (3247'22688,3473'23689] local-les=3472 n=12 ec=656 les/c 3472/3473 3465/3465/3465) [24,31] r=0 lpr=3465 luod=3473'23688 lcod 3473'23688 mlcod 3473'23687 active+clean] op_commit repgather(0x7fb9b47d1050 applying 3473'23689 rep_tid=72 wfack=24,31 wfdisk=24,31 op=osd_op(client.7636.0:100075 rb.0.122c.6b8b4567.000000000357 [write 3522560~4096] 6.42463fe3) v4) > 2012-12-20 22:18:06.810720 7fb9d5ffb700 10 osd.24 pg_epoch: 3473 pg[6.1fe3( v 3473'23689 (3247'22688,3473'23689] local-les=3472 n=12 ec=656 les/c 3472/3473 3465/3465/3465) [24,31] r=0 lpr=3465 mlcod 3473'23687 active+clean] eval_repop repgather(0x7fb9b47d1050 applying 3473'23689 rep_tid=72 wfack=31 wfdisk=31 op=osd_op(client.7636.0:100075 rb.0.122c.6b8b4567.000000000357 [write 3522560~4096] 6.42463fe3) v4) wants=ad > 2012-12-20 22:18:06.810992 7fb9d57fa700 10 filestore(/srv/ceph/osd/osd.24) setattrs 6.1fe3_head/42463fe3/rb.0.122c.6b8b4567.000000000357/head//6 = 230 > 2012-12-20 22:18:06.811053 7fb9d57fa700 15 filestore(/srv/ceph/osd/osd.24) setattrs 6.1fe3_head/42463fe3/rb.0.122c.6b8b4567.000000000357/head//6 > 2012-12-20 22:18:06.811083 
7fb9d57fa700 10 filestore(/srv/ceph/osd/osd.24) setattrs 6.1fe3_head/42463fe3/rb.0.122c.6b8b4567.000000000357/head//6 = 31 > 2012-12-20 22:18:06.811093 7fb9d57fa700 10 journal op_apply_finish 2087065 open_ops 1 -> 0, max_applied_seq 2087064 -> 2087065 > 2012-12-20 22:18:06.811096 7fb9d57fa700 10 filestore(/srv/ceph/osd/osd.24) _do_op 0x7fb9b4376570 seq 2087065 r = 0, finisher 0x7fb9b5022bb0 0x7fb9b483ce10 > 2012-12-20 22:18:06.811100 7fb9d57fa700 10 filestore(/srv/ceph/osd/osd.24) _finish_op 0x7fb9b4376570 seq 2087065 osr(6.1fe3 0x21a4f50)/0x21a4f50 > 2012-12-20 22:18:06.811136 7fb9c37fe700 10 osd.24 pg_epoch: 3473 pg[6.1fe3( v 3473'23689 (3247'22688,3473'23689] local-les=3472 n=12 ec=656 les/c 3472/3473 3465/3465/3465) [24,31] r=0 lpr=3465 mlcod 3473'23687 active+clean] op_applied repgather(0x7fb9b47d1050 applying 3473'23689 rep_tid=72 wfack=31 wfdisk=31 op=osd_op(client.7636.0:100075 rb.0.122c.6b8b4567.000000000357 [write 3522560~4096] 6.42463fe3) v4) > 2012-12-20 22:18:06.811205 7fb9c37fe700 10 osd.24 pg_epoch: 3473 pg[6.1fe3( v 3473'23689 (3247'22688,3473'23689] local-les=3472 n=12 ec=656 les/c 3472/3473 3465/3465/3465) [24,31] r=0 lpr=3465 mlcod 3473'23687 active+clean] op_applied mode was rmw(wr=1) > 2012-12-20 22:18:06.811218 7fb9c37fe700 10 osd.24 pg_epoch: 3473 pg[6.1fe3( v 3473'23689 (3247'22688,3473'23689] local-les=3472 n=12 ec=656 les/c 3472/3473 3465/3465/3465) [24,31] r=0 lpr=3465 mlcod 3473'23687 active+clean] op_applied mode now idle(wr=0 WAKE) (finish_write) > 2012-12-20 22:18:06.811229 7fb9c37fe700 10 osd.24 pg_epoch: 3473 pg[6.1fe3( v 3473'23689 (3247'22688,3473'23689] local-les=3472 n=12 ec=656 les/c 3472/3473 3465/3465/3465) [24,31] r=0 lpr=3465 mlcod 3473'23687 active+clean] put_object_context 0x7fb9b5248020 42463fe3/rb.0.122c.6b8b4567.000000000357/head//6 1 -> 0 > 2012-12-20 22:18:06.811241 7fb9c37fe700 15 osd.24 pg_epoch: 3473 pg[6.1fe3( v 3473'23689 (3247'22688,3473'23689] local-les=3472 n=12 ec=656 les/c 3472/3473 3465/3465/3465) [24,31] 
r=0 lpr=3465 mlcod 3473'23687 active+clean] requeue_ops > 2012-12-20 22:18:06.811251 7fb9c37fe700 10 osd.24 pg_epoch: 3473 pg[6.1fe3( v 3473'23689 (3247'22688,3473'23689] local-les=3472 n=12 ec=656 les/c 3472/3473 3465/3465/3465) [24,31] r=0 lpr=3465 mlcod 3473'23687 active+clean] put_snapset_context rb.0.122c.6b8b4567.000000000357 1 -> 0 > 2012-12-20 22:18:06.811267 7fb9c37fe700 10 osd.24 pg_epoch: 3473 pg[6.1fe3( v 3473'23689 (3247'22688,3473'23689] local-les=3472 n=12 ec=656 les/c 3472/3473 3465/3465/3465) [24,31] r=0 lpr=3465 mlcod 3473'23687 active+clean] eval_repop repgather(0x7fb9b47d1050 applied 3473'23689 rep_tid=72 wfack=31 wfdisk=31 op=osd_op(client.7636.0:100075 rb.0.122c.6b8b4567.000000000357 [write 3522560~4096] 6.42463fe3) v4) wants=ad > 2012-12-20 22:18:06.813738 7fb9c0ff9700 20 osd.24 3473 _dispatch 0x7fb9600008f0 osd_sub_op_reply(client.7636.0:100075 6.1fe3 42463fe3/rb.0.122c.6b8b4567.000000000357/head//6 [] ondisk, result = 0) v1 > 2012-12-20 22:18:06.813850 7fb9c0ff9700 15 osd.24 3473 require_same_or_newer_map 3473 (i am 3473) 0x7fb9600008f0 > 2012-12-20 22:18:06.813856 7fb9c0ff9700 20 osd.24 3473 _share_map_incoming osd.31 172.16.0.12:6820/2871 3473 > 2012-12-20 22:18:06.813865 7fb9c0ff9700 15 osd.24 3473 enqueue_op 0x7fb968055650 osd_sub_op_reply(client.7636.0:100075 6.1fe3 42463fe3/rb.0.122c.6b8b4567.000000000357/head//6 [] ondisk, > result= 0) v1 > 2012-12-20 22:18:06.813934 7fb9bd7f2700 10 osd.24 3473 dequeue_op 0x7fb968055650 osd_sub_op_reply(client.7636.0:100075 6.1fe3 42463fe3/rb.0.122c.6b8b4567.000000000357/head//6 [] ondisk, result = 0) v1 pg pg[6.1fe3( v 3473'23689 (3247'22688,3473'23689] local-les=3472 n=12 ec=656 les/c 3472/3473 3465/3465/3465) [24,31] r=0 lpr=3465 mlcod 3473'23687 active+clean] > 2012-12-20 22:18:06.814005 7fb9bd7f2700 7 osd.24 pg_epoch: 3473 pg[6.1fe3( v 3473'23689 (3247'22688,3473'23689] local-les=3472 n=12 ec=656 les/c 3472/3473 3465/3465/3465) [24,31] r=0 lpr=3465 mlcod 3473'23687 active+clean] repop_ack 
rep_tid 72 op osd_op(client.7636.0:100075 rb.0.122c.6b8b4567.000000000357 [write 3522560~4096] 6.42463fe3) v4 result 0 ack_type 4 from osd.31 > 2012-12-20 22:18:06.814037 7fb9bd7f2700 10 osd.24 pg_epoch: 3473 pg[6.1fe3( v 3473'23689 (3247'22688,3473'23689] local-les=3472 n=12 ec=656 les/c 3472/3473 3465/3465/3465) [24,31] r=0 lpr=3465 mlcod 3473'23687 active+clean] eval_repop repgather(0x7fb9b47d1050 applied 3473'23689 rep_tid=72 wfack= wfdisk= op=osd_op(client.7636.0:100075 rb.0.122c.6b8b4567.000000000357 [write 3522560~4096] 6.42463fe3) v4) wants=ad > 2012-12-20 22:18:06.814057 7fb9bd7f2700 15 osd.24 pg_epoch: 3473 pg[6.1fe3( v 3473'23689 (3247'22688,3473'23689] local-les=3472 n=12 ec=656 les/c 3472/3473 3465/3465/3465) [24,31] r=0 lpr=3465 mlcod 3473'23687 active+clean] log_op_stats osd_op(client.7636.0:100075 rb.0.122c.6b8b4567.000000000357 [write 3522560~4096] 6.42463fe3) v4 inb 4239 outb 0 rlat 0.000000 lat 0.005594 > 2012-12-20 22:18:06.814075 7fb9bd7f2700 15 osd.24 pg_epoch: 3473 pg[6.1fe3( v 3473'23689 (3247'22688,3473'23689] local-les=3472 n=12 ec=656 les/c 3472/3473 3465/3465/3465) [24,31] r=0 lpr=3465 mlcod 3473'23687 active+clean] update_stats 3465'46631 > 2012-12-20 22:18:06.814088 7fb9bd7f2700 10 osd.24 pg_epoch: 3473 pg[6.1fe3( v 3473'23689 (3247'22688,3473'23689] local-les=3472 n=12 ec=656 les/c 3472/3473 3465/3465/3465) [24,31] r=0 lpr=3465 mlcod 3473'23687 active+clean] sending commit on repgather(0x7fb9b47d1050 applied 3473'23689 rep_tid=72 wfack= wfdisk= op=osd_op(client.7636.0:100075 rb.0.122c.6b8b4567.000000000357 [write 3522560~4096] 6.42463fe3) v4) 0x7fb9b4605310 > 2012-12-20 22:18:06.814122 7fb9bd7f2700 10 osd.24 pg_epoch: 3473 pg[6.1fe3( v 3473'23689 (3247'22688,3473'23689] local-les=3472 n=12 ec=656 les/c 3472/3473 3465/3465/3465) [24,31] r=0 lpr=3465 mlcod 3473'23688 active+clean] removing repgather(0x7fb9b47d1050 applied 3473'23689 rep_tid=72 wfack= wfdisk= op=osd_op(client.7636.0:100075 rb.0.122c.6b8b4567.000000000357 [write 
3522560~4096] 6.42463fe3) v4) > 2012-12-20 22:18:06.814147 7fb9bd7f2700 20 osd.24 pg_epoch: 3473 pg[6.1fe3( v 3473'23689 (3247'22688,3473'23689] local-les=3472 n=12 ec=656 les/c 3472/3473 3465/3465/3465) [24,31] r=0 lpr=3465 mlcod 3473'23688 active+clean] q front is repgather(0x7fb9b47d1050 applied 3473'23689 rep_tid=72 wfack= wfdisk= op=osd_op(client.7636.0:100075 rb.0.122c.6b8b4567.000000000357 [write 3522560~4096] 6.42463fe3) v4) > 2012-12-20 22:18:06.814204 7fb9bd7f2700 10 osd.24 3473 dequeue_op 0x7fb968055650 finish > 2012-12-20 22:18:07.030914 7fb9be7f4700 10 monclient: tick > 2012-12-20 22:18:07.030929 7fb9be7f4700 10 monclient: _check_auth_rotating renewing rotating keys (they expired before 2012-12-20 22:17:37.030927) > 2012-12-20 22:18:07.030952 7fb9be7f4700 10 monclient: renew subs? (now: 2012-12-20 22:18:07.030952; renew after: 2012-12-20 22:18:33.322998) -- no > 2012-12-20 22:18:07.172960 7fb9bafed700 20 osd.24 3473 update_osd_stat osd_stat(21111 MB used, 393 GB avail, 431 GB total, peers [28,29,30,31]/[]) > 2012-12-20 22:18:07.172979 7fb9bafed700 5 osd.24 3473 heartbeat: osd_stat(21111 MB used, 393 GB avail, 431 GB total, peers [28,29,30,31]/[]) > 2012-12-20 22:18:07.173647 7fb9bfff7700 20 osd.24 3473 _share_map_outgoing 0x7fb9b400d440 already has epoch 3473 > 2012-12-20 22:18:07.173675 7fb9bfff7700 20 osd.24 3473 _share_map_outgoing 0x7fb9a002f010 already has epoch 3473 > 2012-12-20 22:18:07.173785 7fb9bfff7700 20 osd.24 3473 _share_map_outgoing 0x7fb9a00334d0 already has epoch 3473 > 2012-12-20 22:18:07.173875 7fb9bfff7700 20 osd.24 3473 _share_map_outgoing 0x7fb9a0033fc0 already has epoch 3473 > 2012-12-20 22:18:07.479259 7fb9e6ffd700 5 osd.24 3473 tick > 2012-12-20 22:18:07.479311 7fb9e6ffd700 20 osd.24 3473 scrub_should_schedule loadavg 6.36 >= max 0.5 = no, load too high > 2012-12-20 22:18:07.479326 7fb9e6ffd700 7 osd.24 3473 do_mon_report > 2012-12-20 22:18:07.479329 7fb9e6ffd700 10 osd.24 3473 send_alive up_thru currently 3465 want 3465 > 
2012-12-20 22:18:07.479331 7fb9e6ffd700 20 osd.24 3473 send_pg_stats > 2012-12-20 22:18:07.479334 7fb9e6ffd700 10 osd.24 3473 send_pg_stats - 2 pgs updated > 2012-12-20 22:18:07.479343 7fb9e6ffd700 10 monclient: _send_mon_message to mon.KVM03 at 172.16.0.12:6789/0 > 2012-12-20 22:18:07.774794 7fb9bf7f6700 20 osd.24 3473 _share_map_outgoing 0x7fb9a002f010 already has epoch 3473 > 2012-12-20 22:18:07.809251 7fb9e57fa700 20 filestore(/srv/ceph/osd/osd.24) sync_entry woke after 5.000117 > 2012-12-20 22:18:07.809274 7fb9e57fa700 10 journal commit_start max_applied_seq 2087065, open_ops 0 > 2012-12-20 22:18:07.809278 7fb9e57fa700 10 journal commit_start blocked, all open_ops have completed > 2012-12-20 22:18:07.809279 7fb9e57fa700 10 journal commit_start committing 2087065, still blocked > 2012-12-20 22:18:07.809282 7fb9e57fa700 10 journal commit_start > 2012-12-20 22:18:07.809291 7fb9e57fa700 15 filestore(/srv/ceph/osd/osd.24) sync_entry committing 2087065 sync_epoch 19 > 2012-12-20 22:18:07.809339 7fb9e57fa700 10 filestore(/srv/ceph/osd/osd.24) taking async snap 'snap_2087065' > 2012-12-20 22:18:07.817045 7fb9e57fa700 20 filestore(/srv/ceph/osd/osd.24) async snap create 'snap_2087065' transid 96549 > 2012-12-20 22:18:07.817060 7fb9e57fa700 10 journal commit_started committing 2087065, unblocking > 2012-12-20 22:18:07.817069 7fb9e57fa700 20 filestore(/srv/ceph/osd/osd.24) waiting for transid 96549 to complete > 2012-12-20 22:18:07.836752 7fb9e57fa700 20 filestore(/srv/ceph/osd/osd.24) done waiting for transid 96549 to complete > 2012-12-20 22:18:07.836767 7fb9e57fa700 10 filestore(/srv/ceph/osd/osd.24) sync_entry commit took 0.027482, interval was 5.027633 > 2012-12-20 22:18:07.836775 7fb9e57fa700 10 journal commit_finish thru 2087065 > 2012-12-20 22:18:07.836778 7fb9e57fa700 5 journal committed_thru 2087065 (last_committed_seq 2087063) > 2012-12-20 22:18:07.836781 7fb9e57fa700 10 journal header: block_size 4096 alignment 4096 max_size 960495616 > 2012-12-20 
22:18:07.836783 7fb9e57fa700 10 journal header: start 543223808 > 2012-12-20 22:18:07.836784 7fb9e57fa700 10 journal write_pos 543223808 > 2012-12-20 22:18:07.836788 7fb9e57fa700 10 journal committed_thru done > 2012-12-20 22:18:07.836793 7fb9e57fa700 10 filestore(/srv/ceph/osd/osd.24) removing snap 'snap_2087061' > 2012-12-20 22:18:07.836958 7fb9e57fa700 15 filestore(/srv/ceph/osd/osd.24) sync_entry committed to op_seq 2087065 > 2012-12-20 22:18:07.836972 7fb9e57fa700 20 filestore(/srv/ceph/osd/osd.24) sync_entry waiting for max_interval 5.000000 > 2012-12-20 22:18:07.855890 7fb9bf7f6700 20 osd.24 3473 _share_map_outgoing 0x7fb9a0033fc0 already has epoch 3473 > 2012-12-20 22:18:08.221094 7fb9c1ffb700 20 osd.24 3473 _dispatch 0x7fb990002de0 pg_stats_ack(2 pgs tid 18) v1 > 2012-12-20 22:18:08.221105 7fb9c1ffb700 10 osd.24 3473 handle_pg_stats_ack > 2012-12-20 22:18:08.479450 7fb9e6ffd700 5 osd.24 3473 tick > 2012-12-20 22:18:08.479499 7fb9e6ffd700 20 osd.24 3473 scrub_should_schedule loadavg 6.36 >= max 0.5 = no, load too high > 2012-12-20 22:18:09.479581 7fb9e6ffd700 5 osd.24 3473 tick > 2012-12-20 22:18:09.479633 7fb9e6ffd700 20 osd.24 3473 scrub_should_schedule loadavg 6.36 >= max 0.5 = no, load too high > 2012-12-20 22:18:10.479743 7fb9e6ffd700 5 osd.24 3473 tick > 2012-12-20 22:18:10.479808 7fb9e6ffd700 20 osd.24 3473 scrub_should_schedule loadavg 6.36 >= max 0.5 = no, load too high > 2012-12-20 22:18:11.273304 7fb9bafed700 20 osd.24 3473 update_osd_stat osd_stat(21111 MB used, 393 GB avail, 431 GB total, peers [28,29,30,31]/[]) > 2012-12-20 22:18:11.273324 7fb9bafed700 5 osd.24 3473 heartbeat: osd_stat(21111 MB used, 393 GB avail, 431 GB total, peers [28,29,30,31]/[]) > 2012-12-20 22:18:11.274018 7fb9bfff7700 20 osd.24 3473 _share_map_outgoing 0x7fb9b400d440 already has epoch 3473 > 2012-12-20 22:18:11.274081 7fb9bfff7700 20 osd.24 3473 _share_map_outgoing 0x7fb9a002f010 already has epoch 3473 > 2012-12-20 22:18:11.274101 7fb9bfff7700 20 osd.24 3473 
_share_map_outgoing 0x7fb9a00334d0 already has epoch 3473 > 2012-12-20 22:18:11.274249 7fb9bfff7700 20 osd.24 3473 _share_map_outgoing 0x7fb9a0033fc0 already has epoch 3473 > 2012-12-20 22:18:11.275112 7fb9bf7f6700 20 osd.24 3473 _share_map_outgoing 0x7fb9a002f010 already has epoch 3473 > 2012-12-20 22:18:11.356141 7fb9bf7f6700 20 osd.24 3473 _share_map_outgoing 0x7fb9a0033fc0 already has epoch 3473 > 2012-12-20 22:18:11.479927 7fb9e6ffd700 5 osd.24 3473 tick > 2012-12-20 22:18:11.479976 7fb9e6ffd700 20 osd.24 3473 scrub_should_schedule loadavg 6.36 >= max 0.5 = no, load too high > 2012-12-20 22:18:12.182095 7fb9bf7f6700 20 osd.24 3473 _share_map_outgoing 0x7fb9a00334d0 already has epoch 3473 > 2012-12-20 22:18:12.373677 7fb9bafed700 20 osd.24 3473 update_osd_stat osd_stat(21111 MB used, 393 GB avail, 431 GB total, peers [28,29,30,31]/[]) > 2012-12-20 22:18:12.373694 7fb9bafed700 5 osd.24 3473 heartbeat: osd_stat(21111 MB used, 393 GB avail, 431 GB total, peers [28,29,30,31]/[]) > 2012-12-20 22:18:12.374496 7fb9bfff7700 20 osd.24 3473 _share_map_outgoing 0x7fb9b400d440 already has epoch 3473 > 2012-12-20 22:18:12.374523 7fb9bfff7700 20 osd.24 3473 _share_map_outgoing 0x7fb9a002f010 already has epoch 3473 > 2012-12-20 22:18:12.374541 7fb9bfff7700 20 osd.24 3473 _share_map_outgoing 0x7fb9a00334d0 already has epoch 3473 > 2012-12-20 22:18:12.374564 7fb9bfff7700 20 osd.24 3473 _share_map_outgoing 0x7fb9a0033fc0 already has epoch 3473 > 2012-12-20 22:18:12.480076 7fb9e6ffd700 5 osd.24 3473 tick > 2012-12-20 22:18:12.480125 7fb9e6ffd700 20 osd.24 3473 scrub_should_schedule loadavg 6.09 >= max 0.5 = no, load too high > 2012-12-20 22:18:12.480140 7fb9e6ffd700 7 osd.24 3473 do_mon_report > 2012-12-20 22:18:12.480142 7fb9e6ffd700 10 osd.24 3473 send_alive up_thru currently 3465 want 3465 > 2012-12-20 22:18:12.480146 7fb9e6ffd700 20 osd.24 3473 send_pg_stats > 2012-12-20 22:18:12.701202 7fb9bf7f6700 20 osd.24 3473 _share_map_outgoing 0x7fb9b400d440 already has epoch 3473 > 
2012-12-20 22:18:12.837067 7fb9e57fa700 20 filestore(/srv/ceph/osd/osd.24) sync_entry woke after 5.000094 > 2012-12-20 22:18:12.837089 7fb9e57fa700 10 journal commit_start max_applied_seq 2087065, open_ops 0 > 2012-12-20 22:18:12.837093 7fb9e57fa700 10 journal commit_start blocked, all open_ops have completed > 2012-12-20 22:18:12.837095 7fb9e57fa700 10 journal commit_start nothing to do > 2012-12-20 22:18:12.837100 7fb9e57fa700 10 journal commit_start > 2012-12-20 22:18:12.837117 7fb9e57fa700 20 filestore(/srv/ceph/osd/osd.24) sync_entry waiting for max_interval 5.000000 > 2012-12-20 22:18:13.480239 7fb9e6ffd700 5 osd.24 3473 tick > 2012-12-20 22:18:13.480289 7fb9e6ffd700 20 osd.24 3473 scrub_should_schedule loadavg 6.09 >= max 0.5 = no, load too high > 2012-12-20 22:18:14.073917 7fb9bafed700 20 osd.24 3473 update_osd_stat osd_stat(21111 MB used, 393 GB avail, 431 GB total, peers [28,29,30,31]/[]) > 2012-12-20 22:18:14.073935 7fb9bafed700 5 osd.24 3473 heartbeat: osd_stat(21111 MB used, 393 GB avail, 431 GB total, peers [28,29,30,31]/[]) > 2012-12-20 22:18:14.074840 7fb9bfff7700 20 osd.24 3473 _share_map_outgoing 0x7fb9a0033fc0 already has epoch 3473 > 2012-12-20 22:18:14.074868 7fb9bfff7700 20 osd.24 3473 _share_map_outgoing 0x7fb9a00334d0 already has epoch 3473 > 2012-12-20 22:18:14.074886 7fb9bfff7700 20 osd.24 3473 _share_map_outgoing 0x7fb9b400d440 already has epoch 3473 > 2012-12-20 22:18:14.074903 7fb9bfff7700 20 osd.24 3473 _share_map_outgoing 0x7fb9a002f010 already has epoch 3473 > 2012-12-20 22:18:14.480386 7fb9e6ffd700 5 osd.24 3473 tick > 2012-12-20 22:18:14.480443 7fb9e6ffd700 20 osd.24 3473 scrub_should_schedule loadavg 6.09 >= max 0.5 = no, load too high > 2012-12-20 22:18:15.480571 7fb9e6ffd700 5 osd.24 3473 tick > 2012-12-20 22:18:15.480632 7fb9e6ffd700 20 osd.24 3473 scrub_should_schedule loadavg 6.09 >= max 0.5 = no, load too high > 2012-12-20 22:18:16.282625 7fb9bf7f6700 20 osd.24 3473 _share_map_outgoing 0x7fb9a00334d0 already has epoch 3473 > 
2012-12-20 22:18:16.480714 7fb9e6ffd700 5 osd.24 3473 tick > 2012-12-20 22:18:16.480763 7fb9e6ffd700 20 osd.24 3473 scrub_should_schedule loadavg 6.09 >= max 0.5 = no, load too high > 2012-12-20 22:18:16.801391 7fb9bf7f6700 20 osd.24 3473 _share_map_outgoing 0x7fb9b400d440 already has epoch 3473 > 2012-12-20 22:18:17.031056 7fb9be7f4700 10 monclient: tick > 2012-12-20 22:18:17.031082 7fb9be7f4700 10 monclient: _check_auth_rotating renewing rotating keys (they expired before 2012-12-20 22:17:47.031080) > 2012-12-20 22:18:17.031124 7fb9be7f4700 10 monclient: renew subs? (now: 2012-12-20 22:18:17.031124; renew after: 2012-12-20 22:18:33.322998) -- no > 2012-12-20 22:18:17.175266 7fb9bf7f6700 20 osd.24 3473 _share_map_outgoing 0x7fb9a002f010 already has epoch 3473 > 2012-12-20 22:18:17.256256 7fb9bf7f6700 20 osd.24 3473 _share_map_outgoing 0x7fb9a0033fc0 already has epoch 3473 > 2012-12-20 22:18:17.480898 7fb9e6ffd700 5 osd.24 3473 tick > 2012-12-20 22:18:17.480952 7fb9e6ffd700 20 osd.24 3473 scrub_should_schedule loadavg 5.84 >= max 0.5 = no, load too high > 2012-12-20 22:18:17.480969 7fb9e6ffd700 7 osd.24 3473 do_mon_report > 2012-12-20 22:18:17.480972 7fb9e6ffd700 10 osd.24 3473 send_alive up_thru currently 3465 want 3465 > 2012-12-20 22:18:17.480975 7fb9e6ffd700 20 osd.24 3473 send_pg_stats > 2012-12-20 22:18:17.837232 7fb9e57fa700 20 filestore(/srv/ceph/osd/osd.24) sync_entry woke after 5.000116 > 2012-12-20 22:18:17.837254 7fb9e57fa700 10 journal commit_start max_applied_seq 2087065, open_ops 0 > 2012-12-20 22:18:17.837258 7fb9e57fa700 10 journal commit_start blocked, all open_ops have completed > 2012-12-20 22:18:17.837260 7fb9e57fa700 10 journal commit_start nothing to do > 2012-12-20 22:18:17.837262 7fb9e57fa700 10 journal commit_start > 2012-12-20 22:18:17.837271 7fb9e57fa700 20 filestore(/srv/ceph/osd/osd.24) sync_entry waiting for max_interval 5.000000 > 2012-12-20 22:18:18.481073 7fb9e6ffd700 5 osd.24 3473 tick > 2012-12-20 22:18:18.481133 7fb9e6ffd700 20 
osd.24 3473 scrub_should_schedule loadavg 5.84 >= max 0.5 = no, load too high > 2012-12-20 22:18:19.481247 7fb9e6ffd700 5 osd.24 3473 tick > 2012-12-20 22:18:19.481308 7fb9e6ffd700 20 osd.24 3473 scrub_should_schedule loadavg 5.84 >= max 0.5 = no, load too high > 2012-12-20 22:18:19.974219 7fb9bafed700 20 osd.24 3473 update_osd_stat osd_stat(21111 MB used, 393 GB avail, 431 GB total, peers [28,29,30,31]/[]) > 2012-12-20 22:18:19.974238 7fb9bafed700 5 osd.24 3473 heartbeat: osd_stat(21111 MB used, 393 GB avail, 431 GB total, peers [28,29,30,31]/[]) > 2012-12-20 22:18:19.974899 7fb9bfff7700 20 osd.24 3473 _share_map_outgoing 0x7fb9a002f010 already has epoch 3473 > 2012-12-20 22:18:19.975070 7fb9bfff7700 20 osd.24 3473 _share_map_outgoing 0x7fb9a00334d0 already has epoch 3473 > 2012-12-20 22:18:19.975116 7fb9bfff7700 20 osd.24 3473 _share_map_outgoing 0x7fb9a0033fc0 already has epoch 3473 > 2012-12-20 22:18:19.975207 7fb9bfff7700 20 osd.24 3473 _share_map_outgoing 0x7fb9b400d440 already has epoch 3473 > 2012-12-20 22:18:20.481425 7fb9e6ffd700 5 osd.24 3473 tick > 2012-12-20 22:18:20.481487 7fb9e6ffd700 20 osd.24 3473 scrub_should_schedule loadavg 5.84 >= max 0.5 = no, load too high > 2012-12-20 22:18:21.016361 7fb9e77fe700 -1 osd.24 3473 *** Got signal Terminated *** > 2012-12-20 22:18:21.016458 7fb9e77fe700 -1 osd.24 3473 pausing thread pools > 2012-12-20 22:18:21.016462 7fb9e77fe700 -1 osd.24 3473 flushing io > 2012-12-20 22:18:21.016464 7fb9e77fe700 10 filestore(/srv/ceph/osd/osd.24) sync_and_flush > 2012-12-20 22:18:21.016467 7fb9e77fe700 10 filestore(/srv/ceph/osd/osd.24) _flush_op_queue draining op tp > 2012-12-20 22:18:21.016471 7fb9e77fe700 10 filestore(/srv/ceph/osd/osd.24) _flush_op_queue waiting for apply finisher > 2012-12-20 22:18:21.016481 7fb9e77fe700 10 filestore(/srv/ceph/osd/osd.24) start_sync > 2012-12-20 22:18:21.016483 7fb9e77fe700 10 filestore(/srv/ceph/osd/osd.24) sync waiting > 2012-12-20 22:18:21.016523 7fb9e57fa700 20 
filestore(/srv/ceph/osd/osd.24) sync_entry woke after 3.179252 > 2012-12-20 22:18:21.016540 7fb9e57fa700 10 journal commit_start max_applied_seq 2087065, open_ops 0 > 2012-12-20 22:18:21.016543 7fb9e57fa700 10 journal commit_start blocked, all open_ops have completed > 2012-12-20 22:18:21.016545 7fb9e57fa700 10 journal commit_start nothing to do > 2012-12-20 22:18:21.016548 7fb9e57fa700 10 journal commit_start > 2012-12-20 22:18:21.016563 7fb9e57fa700 20 filestore(/srv/ceph/osd/osd.24) sync_entry waiting for max_interval 5.000000 > 2012-12-20 22:18:21.016563 7fb9e77fe700 10 filestore(/srv/ceph/osd/osd.24) sync done > 2012-12-20 22:18:21.016570 7fb9e77fe700 10 filestore(/srv/ceph/osd/osd.24) sync_and_flush done > 2012-12-20 22:18:21.016572 7fb9e77fe700 -1 osd.24 3473 removing pid file > 2012-12-20 22:18:21.016665 7fb9e77fe700 -1 osd.24 3473 exit
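For anyone wanting to check where client IO stops in a log like the one quoted above, here is a rough Python sketch. It assumes one log entry per line in the layout seen in the quoted log (date, time with microseconds, thread id, debug level, message) and treats any message containing "osd_op(" as client activity. The function names are made up for illustration and are not part of Ceph.

```python
import re
from datetime import datetime

# Assumed entry layout, taken from the quoted log:
#   <date> <time.usec> <thread id> <debug level> <message>
ENTRY = re.compile(
    r"^(\d{4}-\d{2}-\d{2} \d{2}:\d{2}:\d{2}\.\d{6}) ([0-9a-f]+)\s+(-?\d+) (.*)$"
)


def parse_entries(lines):
    """Yield (timestamp, thread, level, message) for lines that match ENTRY."""
    for line in lines:
        match = ENTRY.match(line.strip())
        if match:
            stamp = datetime.strptime(match.group(1), "%Y-%m-%d %H:%M:%S.%f")
            yield stamp, match.group(2), int(match.group(3)), match.group(4)


def last_client_op(entries, marker="osd_op("):
    """Return the timestamp of the last entry mentioning a client op, or None."""
    last = None
    for stamp, _thread, _level, message in entries:
        if marker in message:
            last = stamp
    return last


def idle_seconds_before_end(lines):
    """Seconds between the last client op entry and the final log entry."""
    entries = list(parse_entries(lines))
    if not entries:
        return None
    last_op = last_client_op(entries)
    if last_op is None:
        return None
    return (entries[-1][0] - last_op).total_seconds()
```

Calling idle_seconds_before_end(open(path)) on a full OSD log (path being wherever the log lives) gives a quick number for how long the daemon sat without touching a client op before the log ends.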
* RE: OSD's slow down to a crawl
2012-12-20 15:28 ` Mark Nelson
@ 2012-12-20 15:51 ` Matthew Anderson
2012-12-20 15:59 ` Mark Nelson
0 siblings, 1 reply; 13+ messages in thread
From: Matthew Anderson @ 2012-12-20 15:51 UTC (permalink / raw)
To: 'Mark Nelson'; +Cc: ceph-devel@vger.kernel.org
Hi Mark,
Thanks for the quick reply. The log was 380MB in total, so I just posted the last 200 lines hoping that a trained eye might be able to spot something. I'm happy to put the file up on a web server if anyone wants to review it in full.
It doesn't appear to be memory or CPU related. In its hung state (both reads and writes are not responding at the moment; I have a VM hung trying to shut down and an rbd stuck exporting) all of the OSDs sit at around 0.3% CPU usage and ~300MB of RAM with no IO wait. The server has roughly 40GB of RAM free and is easily 90% idle. iotop shows no IO coming from any of the ceph-osd processes.
The admin socket showed no ops in flight, but the performance dump, historic ops and config are below. I did notice that the age of some of the ops was quite high considering things were actively trying to write to the cluster.
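Many counters in the perf dump that follows are reported as avgcount/sum pairs, so a per-operation mean is just sum divided by avgcount. A minimal Python sketch for pulling those means out, assuming the dump has been saved as JSON; the function name is hypothetical, and the sample values are a handful of counters copied from the osd.24 dump. The latency sums are assumed to be in seconds.

```python
import json


def mean_of_pairs(node, prefix=""):
    """Walk a parsed perf dump and return {counter path: sum / avgcount}.

    Pairs with avgcount == 0 are skipped to avoid dividing by zero.
    """
    means = {}
    if isinstance(node, dict):
        if set(node) == {"avgcount", "sum"}:
            if node["avgcount"]:
                means[prefix] = node["sum"] / node["avgcount"]
            return means
        for key, child in node.items():
            path = f"{prefix}.{key}" if prefix else key
            means.update(mean_of_pairs(child, path))
    return means


# A few counters copied from the perf dump for osd.24.
sample = json.loads("""
{"filestore": {"journal_latency": {"avgcount": 20868, "sum": 185.268022000},
               "apply_latency": {"avgcount": 20868, "sum": 508.896737000}},
 "osd": {"op_w_latency": {"avgcount": 6808, "sum": 65.349684000},
         "op_rw_latency": {"avgcount": 0, "sum": 0.000000000}}}
""")

for name, mean in sorted(mean_of_pairs(sample).items()):
    print(f"{name}: {mean:.6f}")
```

Not every pair is a latency (journal_wr_bytes and commitcycle_interval use the same shape), so the units of each mean depend on the counter it came from.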
# ceph --admin-daemon ./ceph-osd.24.asok perf dump {"filestore":{"journal_queue_max_ops":500,"journal_queue_ops":0,"journal_ops":20868,"journal_queue_max_bytes":104857600,"journal_queue_bytes":0,"journal_bytes":1710397238,"journal_latency":{"avgcount":20868,"sum":185.268022000},"journal_wr":13354,"journal_wr_bytes":{"avgcount":13354,"sum":1770647552},"op_queue_max_ops":500,"op_queue_ops":0,"ops":20868,"op_queue_max_bytes":104857600,"op_queue_bytes":0,"bytes":1710232583,"apply_latency":{"avgcount":20868,"sum":508.896737000},"committing":0,"commitcycle":339,"commitcycle_interval":{"avgcount":339,"sum":1708.284725000},"commitcycle_latency":{"avgcount":339,"sum":13.241850000},"journal_full":0},"mutex-FileJournal::completions_lock":{"wait":{"avgcount":0,"sum":0.000000000}},"mutex-FileJournal::finisher_lock":{"wait":{"avgcount":0,"sum":0.000000000}},"mutex-FileJournal::write_lock":{"wait":{"avgcount":0,"sum":0.000000000}},"mutex-FileJournal::writeq_lock":{"wait":{"avgcount":0,"sum":0.000000000}},"mutex-JOS::ApplyManager::apply_lock":{"wait":{"avgcount":0,"sum":0.000000000}},"mutex-JOS::ApplyManager::com_lock":{"wait":{"avgcount":0,"sum":0.000000000}},"mutex-JOS::SubmitManager::lock":{"wait":{"avgcount":0,"sum":0.000000000}},"osd":{"opq":0,"op_wip":0,"op":31753,"op_in_bytes":797806224,"op_out_bytes":1728414048,"op_latency":{"avgcount":31753,"sum":82.183373000},"op_r":24945,"op_r_out_bytes":1728414048,"op_r_latency":{"avgcount":24945,"sum":16.833689000},"op_w":6808,"op_w_in_bytes":797806224,"op_w_rlat":{"avgcount":6808,"sum":38.223006000},"op_w_latency":{"avgcount":6808,"sum":65.349684000},"op_rw":0,"op_rw_in_bytes":0,"op_rw_out_bytes":0,"op_rw_rlat":{"avgcount":0,"sum":0.000000000},"op_rw_latency":{"avgcount":0,"sum":0.000000000},"subop":5700,"subop_in_bytes":813224156,"subop_latency":{"avgcount":5700,"sum":26.877085000},"subop_w":0,"subop_w_in_bytes":813224156,"subop_w_latency":{"avgcount":5700,"sum":26.877085000},"subop_pull":0,"subop_pull_latency":{"avgcount":0,"sum":0.
000000000},"subop_push":0,"subop_push_in_bytes":0,"subop_push_latency":{"avgcount":0,"sum":0.000000000},"pull":0,"push":0,"push_out_bytes":0,"push_in":0,"push_in_bytes":0,"recovery_ops":0,"loadavg":359,"buffer_bytes":0,"numpg":2486,"numpg_primary":1214,"numpg_replica":1272,"numpg_stray":0,"heartbeat_to_peers":4,"heartbeat_from_peers":0,"map_messages":44,"map_message_epochs":184,"map_message_epoch_dups":174},"throttle-filestore_bytes":{"val":0,"max":104857600,"get":0,"get_sum":0,"get_or_fail_fail":0,"get_or_fail_success":0,"take":20868,"take_sum":1710397238,"put":13354,"put_sum":1710397238,"wait":{"avgcount":0,"sum":0.000000000}},"throttle-filestore_ops":{"val":0,"max":500,"get":0,"get_sum":0,"get_or_fail_fail":0,"get_or_fail_success":0,"take":20868,"take_sum":20868,"put":13354,"put_sum":20868,"wait":{"avgcount":0,"sum":0.000000000}},"throttle-msgr_dispatch_throttler-client":{"val":0,"max":104857600,"get":31657,"get_sum":795546573,"get_or_fail_fail":0,"get_or_fail_success":0,"take":0,"take_sum":0,"put":31657,"put_sum":795546573,"wait":{"avgcount":0,"sum":0.000000000}},"throttle-msgr_dispatch_throttler-cluster":{"val":0,"max":104857600,"get":18913,"get_sum":828011293,"get_or_fail_fail":0,"get_or_fail_success":0,"take":0,"take_sum":0,"put":18913,"put_sum":828011293,"wait":{"avgcount":0,"sum":0.000000000}},"throttle-msgr_dispatch_throttler-hbclient":{"val":0,"max":104857600,"get":4260,"get_sum":200220,"get_or_fail_fail":0,"get_or_fail_success":0,"take":0,"take_sum":0,"put":4260,"put_sum":200220,"wait":{"avgcount":0,"sum":0.000000000}},"throttle-msgr_dispatch_throttler-hbserver":{"val":0,"max":104857600,"get":4236,"get_sum":199092,"get_or_fail_fail":0,"get_or_fail_success":0,"take":0,"take_sum":0,"put":4236,"put_sum":199092,"wait":{"avgcount":0,"sum":0.000000000}},"throttle-osd_client_bytes":{"val":0,"max":524288000,"get":31308,"get_sum":795353720,"get_or_fail_fail":0,"get_or_fail_success":0,"take":0,"take_sum":0,"put":37671,"put_sum":795353720,"wait":{"avgcount":0,"sum"
:0.000000000}}} # ceph --admin-daemon ./ceph-osd.24.asok dump_historic_ops { "description": "osd_op(client.7636.0:101498 rb.0.122c.6b8b4567.000000000357 [write 3600384~4096] 6.42463fe3)", "received_at": "2012-12-20 23:34:42.012131", "age": "75.428051", "duration": "0.009988", "flag_point": "waiting for sub ops", "client_info": { "client": "client.7636", "tid": 101498}, "events": [ { "time": "2012-12-20 23:34:42.012317", "event": "waiting_for_osdmap"}, { "time": "2012-12-20 23:34:42.012425", "event": "reached_pg"}, { "time": "2012-12-20 23:34:42.012678", "event": "started"}, { "time": "2012-12-20 23:34:42.012720", "event": "started"}, { "time": "2012-12-20 23:34:42.012815", "event": "sub_op_sent"}, { "time": "2012-12-20 23:34:42.012958", "event": "commit_queued_for_journal_write"}, { "time": "2012-12-20 23:34:42.013033", "event": "write_thread_in_journal_buffer"}, { "time": "2012-12-20 23:34:42.013745", "event": "journaled_completion_queued"}, { "time": "2012-12-20 23:34:42.013803", "event": "op_commit"}, { "time": "2012-12-20 23:34:42.017414", "event": "sub_op_commit_rec"}, { "time": "2012-12-20 23:34:42.022066", "event": "op_applied"}, { "time": "2012-12-20 23:34:42.022119", "event": "done"}]}, { "description": "osd_sub_op_reply(client.7636.0:101498 6.1fe3 42463fe3\/rb.0.122c.6b8b4567.000000000357\/head\/\/6 [] ondisk, result = 0)", "received_at": "2012-12-20 23:34:42.017048", "age": "75.423134", "duration": "0.000411", "flag_point": "started", "events": [ { "time": "2012-12-20 23:34:42.017218", "event": "waiting_for_osdmap"}, { "time": "2012-12-20 23:34:42.017363", "event": "reached_pg"}, { "time": "2012-12-20 23:34:42.017401", "event": "started"}, { "time": "2012-12-20 23:34:42.017459", "event": "done"}]}, { "description": "osd_op(client.7636.0:101502 rb.0.122c.6b8b4567.000000000064 [write 1512448~61440] 6.a241e34d)", "received_at": "2012-12-20 23:34:49.851021", "age": "67.589161", "duration": "0.007114", "flag_point": "waiting for sub ops", "client_info": { 
"client": "client.7636", "tid": 101502}, "events": [ { "time": "2012-12-20 23:34:49.851276", "event": "waiting_for_osdmap"}, { "time": "2012-12-20 23:34:49.851429", "event": "reached_pg"}, { "time": "2012-12-20 23:34:49.851616", "event": "started"}, { "time": "2012-12-20 23:34:49.851648", "event": "started"}, { "time": "2012-12-20 23:34:49.851712", "event": "sub_op_sent"}, { "time": "2012-12-20 23:34:49.851770", "event": "commit_queued_for_journal_write"}, { "time": "2012-12-20 23:34:49.851915", "event": "write_thread_in_journal_buffer"}, { "time": "2012-12-20 23:34:49.853956", "event": "journaled_completion_queued"}, { "time": "2012-12-20 23:34:49.853992", "event": "op_commit"}, { "time": "2012-12-20 23:34:49.857016", "event": "sub_op_commit_rec"}, { "time": "2012-12-20 23:34:49.858077", "event": "op_applied"}, { "time": "2012-12-20 23:34:49.858135", "event": "done"}]}, { "description": "osd_sub_op_reply(client.7636.0:101502 6.234d a241e34d\/rb.0.122c.6b8b4567.000000000064\/head\/\/6 [] ondisk, result = 0)", "received_at": "2012-12-20 23:34:49.856762", "age": "67.583420", "duration": "0.000308", "flag_point": "started", "events": [ { "time": "2012-12-20 23:34:49.856869", "event": "waiting_for_osdmap"}, { "time": "2012-12-20 23:34:49.856969", "event": "reached_pg"}, { "time": "2012-12-20 23:34:49.857001", "event": "started"}, { "time": "2012-12-20 23:34:49.857070", "event": "done"}]}, { "description": "osd_op(client.7636.0:101504 rb.0.122c.6b8b4567.00000000012c [write 1658880~2048] 6.92d2c38c)", "received_at": "2012-12-20 23:35:34.351518", "age": "23.088664", "duration": "0.009430", "flag_point": "waiting for sub ops", "client_info": { "client": "client.7636", "tid": 101504}, "events": [ { "time": "2012-12-20 23:35:34.351714", "event": "waiting_for_osdmap"}, { "time": "2012-12-20 23:35:34.351811", "event": "reached_pg"}, { "time": "2012-12-20 23:35:34.352030", "event": "started"}, { "time": "2012-12-20 23:35:34.352071", "event": "started"}, { "time": "2012-12-20 
23:35:34.352146", "event": "sub_op_sent"}, { "time": "2012-12-20 23:35:34.352213", "event": "commit_queued_for_journal_write"}, { "time": "2012-12-20 23:35:34.352287", "event": "write_thread_in_journal_buffer"}, { "time": "2012-12-20 23:35:34.355390", "event": "journaled_completion_queued"}, { "time": "2012-12-20 23:35:34.355475", "event": "op_commit"}, { "time": "2012-12-20 23:35:34.356661", "event": "sub_op_commit_rec"}, { "time": "2012-12-20 23:35:34.360894", "event": "op_applied"}, { "time": "2012-12-20 23:35:34.360948", "event": "done"}]}, { "description": "osd_sub_op_reply(client.7636.0:101504 6.38c 92d2c38c\/rb.0.122c.6b8b4567.00000000012c\/head\/\/6 [] ondisk, result = 0)", "received_at": "2012-12-20 23:35:34.356286", "age": "23.083896", "duration": "0.000421", "flag_point": "started", "events": [ { "time": "2012-12-20 23:35:34.356489", "event": "waiting_for_osdmap"}, { "time": "2012-12-20 23:35:34.356598", "event": "reached_pg"}, { "time": "2012-12-20 23:35:34.356640", "event": "started"}, { "time": "2012-12-20 23:35:34.356707", "event": "done"}]}, { "description": "osd_op(client.7636.0:101512 rb.0.122c.6b8b4567.000000000357 [write 3731456~4096] 6.42463fe3)", "received_at": "2012-12-20 23:35:42.133103", "age": "15.307079", "duration": "0.008357", "flag_point": "waiting for sub ops", "client_info": { "client": "client.7636", "tid": 101512}, "events": [ { "time": "2012-12-20 23:35:42.133315", "event": "waiting_for_osdmap"}, { "time": "2012-12-20 23:35:42.133427", "event": "reached_pg"}, { "time": "2012-12-20 23:35:42.133684", "event": "started"}, { "time": "2012-12-20 23:35:42.133723", "event": "started"}, { "time": "2012-12-20 23:35:42.133796", "event": "sub_op_sent"}, { "time": "2012-12-20 23:35:42.133908", "event": "commit_queued_for_journal_write"}, { "time": "2012-12-20 23:35:42.133995", "event": "write_thread_in_journal_buffer"}, { "time": "2012-12-20 23:35:42.136965", "event": "journaled_completion_queued"}, { "time": "2012-12-20 23:35:42.137025", 
"event": "op_commit"}, { "time": "2012-12-20 23:35:42.138456", "event": "sub_op_commit_rec"}, { "time": "2012-12-20 23:35:42.141408", "event": "op_applied"}, { "time": "2012-12-20 23:35:42.141460", "event": "done"}]}, { "description": "osd_sub_op_reply(client.7636.0:101512 6.1fe3 42463fe3\/rb.0.122c.6b8b4567.000000000357\/head\/\/6 [] ondisk, result = 0)", "received_at": "2012-12-20 23:35:42.138087", "age": "15.302095", "duration": "0.000410", "flag_point": "started", "events": [ { "time": "2012-12-20 23:35:42.138248", "event": "waiting_for_osdmap"}, { "time": "2012-12-20 23:35:42.138363", "event": "reached_pg"}, { "time": "2012-12-20 23:35:42.138443", "event": "started"}, { "time": "2012-12-20 23:35:42.138497", "event": "done"}]}]} # ceph --admin-daemon ./ceph-osd.24.asok config show { "name": "osd.24", "cluster": "ceph", "none": "0\/5", "lockdep": "0\/1", "context": "0\/1", "crush": "1\/1", "mds": "1\/5", "mds_balancer": "1\/5", "mds_locker": "1\/5", "mds_log": "1\/5", "mds_log_expire": "1\/5", "mds_migrator": "1\/5", "buffer": "0\/1", "timer": "0\/1", "filer": "0\/1", "striper": "0\/1", "objecter": "0\/1", "rados": "0\/5", "rbd": "0\/5", "journaler": "0\/5", "objectcacher": "0\/5", "client": "0\/5", "osd": "0\/5", "optracker": "0\/5", "objclass": "0\/5", "filestore": "1\/3", "journal": "1\/3", "ms": "0\/5", "mon": "1\/5", "monc": "0\/10", "paxos": "0\/5", "tp": "0\/5", "auth": "1\/5", "crypto": "1\/5", "finisher": "1\/1", "heartbeatmap": "1\/5", "perfcounter": "1\/5", "rgw": "1\/5", "hadoop": "1\/5", "javaclient": "1\/5", "asok": "1\/5", "throttle": "1\/1", "host": "KVM04", "fsid": "00000000-0000-0000-0000-000000000000", "public_addr": ":\/0", "cluster_addr": ":\/0", "public_network": "", "cluster_network": "", "num_client": "1", "monmap": "", "mon_host": "", "lockdep": "false", "admin_socket": "\/var\/run\/ceph\/ceph-osd.24.asok", "daemonize": "true", "pid_file": "\/var\/run\/ceph\/osd.24.pid", "chdir": "\/", "max_open_files": "0", "fatal_signal_handlers": 
"true", "log_file": "\/var\/log\/ceph\/osd.24.log", "log_max_new": "1000", "log_max_recent": "100000", "log_to_stderr": "false", "err_to_stderr": "true", "log_to_syslog": "false", "err_to_syslog": "false", "log_flush_on_exit": "true", "clog_to_monitors": "true", "clog_to_syslog": "false", "mon_cluster_log_to_syslog": "false", "mon_cluster_log_file": "\/var\/log\/ceph\/ceph.log", "key": "", "keyfile": "", "keyring": "\/srv\/ceph\/osd\/osd.24\/keyring", "heartbeat_interval": "5", "heartbeat_file": "", "perf": "true", "ms_tcp_nodelay": "true", "ms_initial_backoff": "0.2", "ms_max_backoff": "15", "ms_nocrc": "false", "ms_die_on_bad_msg": "false", "ms_dispatch_throttle_bytes": "104857600", "ms_bind_ipv6": "false", "ms_bind_port_min": "6800", "ms_bind_port_max": "7100", "ms_rwthread_stack_bytes": "1048576", "ms_tcp_read_timeout": "900", "ms_inject_socket_failures": "0", "ms_inject_delay_type": "", "ms_inject_delay_max": "1", "ms_inject_delay_probability": "0", "mon_data": "\/var\/lib\/ceph\/mon\/ceph-24", "mon_initial_members": "", "mon_sync_fs_threshold": "5", "mon_tick_interval": "5", "mon_subscribe_interval": "300", "mon_osd_laggy_halflife": "3600", "mon_osd_laggy_weight": "0.3", "mon_osd_adjust_heartbeat_grace": "true", "mon_osd_adjust_down_out_interval": "true", "mon_osd_auto_mark_in": "false", "mon_osd_auto_mark_auto_out_in": "true", "mon_osd_auto_mark_new_in": "true", "mon_osd_down_out_interval": "300", "mon_osd_min_up_ratio": "0.3", "mon_osd_min_in_ratio": "0.3", "mon_lease": "5", "mon_lease_renew_interval": "3", "mon_lease_ack_timeout": "10", "mon_clock_drift_allowed": "0.05", "mon_clock_drift_warn_backoff": "5", "mon_accept_timeout": "10", "mon_pg_create_interval": "30", "mon_pg_stuck_threshold": "300", "mon_osd_full_ratio": "0.95", "mon_osd_nearfull_ratio": "0.85", "mon_globalid_prealloc": "100", "mon_osd_report_timeout": "900", "mon_force_standby_active": "true", "mon_min_osdmap_epochs": "500", "mon_max_pgmap_epochs": "500", "mon_max_log_epochs": "500", 
"mon_max_osd": "10000", "mon_probe_timeout": "2", "mon_slurp_timeout": "10", "mon_slurp_bytes": "262144", "mon_client_bytes": "104857600", "mon_daemon_bytes": "419430400", "mon_max_log_entries_per_event": "4096", "paxos_max_join_drift": "10", "paxos_propose_interval": "1", "paxos_min_wait": "0.05", "clock_offset": "0", "auth_cluster_required": "none", "auth_service_required": "none", "auth_client_required": "none", "auth_supported": "none", "cephx_require_signatures": "false", "cephx_sign_messages": "true", "auth_mon_ticket_ttl": "43200", "auth_service_ticket_ttl": "3600", "mon_client_hunt_interval": "3", "mon_client_ping_interval": "10", "mon_client_max_log_entries_per_message": "1000", "mon_max_pool_pg_num": "65536", "client_cache_size": "16384", "client_cache_mid": "0.75", "client_use_random_mds": "false", "client_mount_timeout": "30", "client_tick_interval": "1", "client_trace": "", "client_readahead_min": "131072", "client_readahead_max_bytes": "0", "client_readahead_max_periods": "4", "client_snapdir": ".snap", "client_mountpoint": "\/", "client_notify_timeout": "10", "client_caps_release_delay": "5", "client_oc": "true", "client_oc_size": "209715200", "client_oc_max_dirty": "104857600", "client_oc_target_dirty": "8388608", "client_oc_max_dirty_age": "5", "client_oc_max_objects": "1000", "client_debug_force_sync_read": "false", "client_debug_inject_tick_delay": "0", "fuse_use_invalidate_cb": "false", "fuse_big_writes": "true", "fuse_debug": "false", "objecter_tick_interval": "5", "objecter_timeout": "10", "objecter_inflight_op_bytes": "104857600", "objecter_inflight_ops": "1024", "journaler_allow_split_entries": "true", "journaler_write_head_interval": "15", "journaler_prefetch_periods": "10", "journaler_prezero_periods": "5", "journaler_batch_interval": "0.001", "journaler_batch_max": "0", "mds_data": "\/var\/lib\/ceph\/mds\/ceph-24", "mds_max_file_size": "1099511627776", "mds_cache_size": "100000", "mds_cache_mid": "0.7", "mds_mem_max": "1048576", 
"mds_dir_commit_ratio": "0.5", "mds_dir_max_commit_size": "90", "mds_decay_halflife": "5", "mds_beacon_interval": "4", "mds_beacon_grace": "15", "mds_blacklist_interval": "1440", "mds_session_timeout": "60", "mds_session_autoclose": "300", "mds_reconnect_timeout": "45", "mds_tick_interval": "5", "mds_dirstat_min_interval": "1", "mds_scatter_nudge_interval": "5", "mds_client_prealloc_inos": "1000", "mds_early_reply": "true", "mds_use_tmap": "true", "mds_default_dir_hash": "2", "mds_log": "true", "mds_log_skip_corrupt_events": "false", "mds_log_max_events": "-1", "mds_log_max_segments": "30", "mds_log_max_expiring": "20", "mds_bal_sample_interval": "3", "mds_bal_replicate_threshold": "8000", "mds_bal_unreplicate_threshold": "0", "mds_bal_frag": "false", "mds_bal_split_size": "10000", "mds_bal_split_rd": "25000", "mds_bal_split_wr": "10000", "mds_bal_split_bits": "3", "mds_bal_merge_size": "50", "mds_bal_merge_rd": "1000", "mds_bal_merge_wr": "1000", "mds_bal_interval": "10", "mds_bal_fragment_interval": "5", "mds_bal_idle_threshold": "0", "mds_bal_max": "-1", "mds_bal_max_until": "-1", "mds_bal_mode": "0", "mds_bal_min_rebalance": "0.1", "mds_bal_min_start": "0.2", "mds_bal_need_min": "0.8", "mds_bal_need_max": "1.2", "mds_bal_midchunk": "0.3", "mds_bal_minchunk": "0.001", "mds_bal_target_removal_min": "5", "mds_bal_target_removal_max": "10", "mds_replay_interval": "1", "mds_shutdown_check": "0", "mds_thrash_exports": "0", "mds_thrash_fragments": "0", "mds_dump_cache_on_map": "false", "mds_dump_cache_after_rejoin": "false", "mds_verify_scatter": "false", "mds_debug_scatterstat": "false", "mds_debug_frag": "false", "mds_debug_auth_pins": "false", "mds_debug_subtrees": "false", "mds_kill_mdstable_at": "0", "mds_kill_export_at": "0", "mds_kill_import_at": "0", "mds_kill_link_at": "0", "mds_kill_rename_at": "0", "mds_wipe_sessions": "false", "mds_wipe_ino_prealloc": "false", "mds_skip_ino": "0", "max_mds": "1", "mds_standby_for_name": "", "mds_standby_for_rank": "-1", 
"mds_standby_replay": "false", "osd_auto_upgrade_tmap": "true", "osd_tmapput_sets_uses_tmap": "false", "osd_max_backfills": "10", "osd_backfill_full_ratio": "0.85", "osd_backfill_retry_interval": "10", "osd_uuid": "00000000-0000-0000-0000-000000000000", "osd_data": "\/srv\/ceph\/osd\/osd.24", "osd_journal": "\/dev\/sda3", "osd_journal_size": "5120", "osd_max_write_size": "90", "osd_max_pgls": "1024", "osd_client_message_size_cap": "524288000", "osd_pg_bits": "6", "osd_pgp_bits": "6", "osd_min_rep": "1", "osd_max_rep": "10", "osd_pool_default_crush_rule": "0", "osd_pool_default_size": "2", "osd_pool_default_min_size": "0", "osd_pool_default_pg_num": "8", "osd_pool_default_pgp_num": "8", "osd_map_dedup": "true", "osd_map_cache_size": "500", "osd_map_message_max": "100", "osd_op_threads": "2", "osd_disk_threads": "1", "osd_recovery_threads": "1", "osd_recover_clone_overlap": "true", "osd_backfill_scan_min": "64", "osd_backfill_scan_max": "512", "osd_op_thread_timeout": "30", "osd_recovery_thread_timeout": "30", "osd_snap_trim_thread_timeout": "3600", "osd_scrub_thread_timeout": "60", "osd_scrub_finalize_thread_timeout": "600", "osd_remove_thread_timeout": "3600", "osd_command_thread_timeout": "600", "osd_age": "0.8", "osd_age_time": "0", "osd_heartbeat_addr": ":\/0", "osd_heartbeat_interval": "6", "osd_heartbeat_grace": "20", "osd_mon_heartbeat_interval": "30", "osd_mon_report_interval_max": "120", "osd_mon_report_interval_min": "5", "osd_mon_ack_timeout": "30", "osd_min_down_reporters": "1", "osd_min_down_reports": "3", "osd_default_data_pool_replay_window": "45", "osd_preserve_trimmed_log": "false", "osd_auto_mark_unfound_lost": "false", "osd_recovery_delay_start": "0", "osd_recovery_max_active": "50", "osd_recovery_max_chunk": "8388608", "osd_recovery_forget_lost_objects": "false", "osd_max_scrubs": "1", "osd_scrub_load_threshold": "0.5", "osd_scrub_min_interval": "300", "osd_scrub_max_interval": "86400", "osd_deep_scrub_interval": "604800", 
"osd_deep_scrub_stride": "524288", "osd_auto_weight": "false", "osd_class_dir": "\/usr\/local\/lib\/rados-classes", "osd_check_for_log_corruption": "false", "osd_use_stale_snap": "false", "osd_rollback_to_cluster_snap": "", "osd_default_notify_timeout": "30", "osd_kill_backfill_at": "0", "osd_min_pg_log_entries": "1000", "osd_op_complaint_time": "30", "osd_command_max_records": "256", "osd_op_log_threshold": "5", "osd_verify_sparse_read_holes": "false", "osd_debug_drop_ping_probability": "0", "osd_debug_drop_ping_duration": "0", "osd_debug_drop_pg_create_probability": "0", "osd_debug_drop_pg_create_duration": "1", "osd_debug_drop_op_probability": "0", "osd_op_history_size": "20", "osd_op_history_duration": "600", "osd_target_transaction_size": "300", "osd_client_op_priority": "63", "osd_recovery_op_priority": "10", "filestore": "false", "filestore_index_retry_probability": "0", "filestore_debug_omap_check": "false", "filestore_xattr_use_omap": "false", "filestore_max_inline_xattr_size": "512", "filestore_max_inline_xattrs": "2", "filestore_max_sync_interval": "5", "filestore_min_sync_interval": "0.01", "filestore_btrfs_snap": "true", "filestore_btrfs_clone_range": "true", "filestore_fsync_flushes_journal_data": "false", "filestore_fiemap": "false", "filestore_flusher": "true", "filestore_flusher_max_fds": "512", "filestore_flush_min": "65536", "filestore_sync_flush": "false", "filestore_journal_parallel": "false", "filestore_journal_writeahead": "false", "filestore_journal_trailing": "false", "filestore_queue_max_ops": "500", "filestore_queue_max_bytes": "104857600", "filestore_queue_committing_max_ops": "500", "filestore_queue_committing_max_bytes": "104857600", "filestore_op_threads": "2", "filestore_op_thread_timeout": "60", "filestore_op_thread_suicide_timeout": "180", "filestore_commit_timeout": "600", "filestore_fiemap_threshold": "4096", "filestore_merge_threshold": "10", "filestore_split_multiple": "2", "filestore_update_to": "1000", "filestore_blackhole": 
"false", "filestore_dump_file": "", "filestore_kill_at": "0", "filestore_fail_eio": "true", "journal_dio": "true", "journal_aio": "false", "journal_block_align": "true", "journal_max_write_bytes": "10485760", "journal_max_write_entries": "100", "journal_queue_max_ops": "500", "journal_queue_max_bytes": "104857600", "journal_align_min_size": "65536", "journal_replay_from": "0", "journal_zero_on_create": "false", "rbd_cache": "false", "rbd_cache_size": "33554432", "rbd_cache_max_dirty": "25165824", "rbd_cache_target_dirty": "16777216", "rbd_cache_max_dirty_age": "1", "nss_db_path": "", "rgw_data": "\/var\/lib\/ceph\/radosgw\/ceph-24", "rgw_enable_apis": "s3, swift, swift_auth, admin", "rgw_cache_enabled": "true", "rgw_cache_lru_size": "10000", "rgw_socket_path": "", "rgw_dns_name": "", "rgw_swift_url": "", "rgw_swift_url_prefix": "swift", "rgw_swift_auth_url": "", "rgw_swift_auth_entry": "auth", "rgw_keystone_url": "", "rgw_keystone_admin_token": "", "rgw_keystone_accepted_roles": "Member, admin", "rgw_keystone_token_cache_size": "10000", "rgw_keystone_revocation_interval": "900", "rgw_admin_entry": "admin", "rgw_enforce_swift_acls": "true", "rgw_swift_token_expiration": "86400", "rgw_print_continue": "true", "rgw_remote_addr_param": "REMOTE_ADDR", "rgw_op_thread_timeout": "600", "rgw_op_thread_suicide_timeout": "0", "rgw_thread_pool_size": "100", "rgw_num_control_oids": "8", "rgw_cluster_root_pool": ".rgw.root", "rgw_log_nonexistent_bucket": "false", "rgw_log_object_name": "%Y-%m-%d-%H-%i-%n", "rgw_log_object_name_utc": "false", "rgw_usage_max_shards": "32", "rgw_usage_max_user_shards": "1", "rgw_enable_ops_log": "true", "rgw_enable_usage_log": "true", "rgw_ops_log_rados": "true", "rgw_ops_log_socket_path": "", "rgw_ops_log_data_backlog": "5242880", "rgw_usage_log_flush_threshold": "1024", "rgw_usage_log_tick_interval": "30", "rgw_intent_log_object_name": "%Y-%m-%d-%i-%n", "rgw_intent_log_object_name_utc": "false", "rgw_init_timeout": "30", "rgw_mime_types_file": 
"\/etc\/mime.types", "rgw_gc_max_objs": "32", "rgw_gc_obj_min_wait": "7200", "rgw_gc_processor_max_time": "3600", "rgw_gc_processor_period": "3600", "rgw_s3_success_create_obj_status": "0", "rgw_resolve_cname": "false", "rgw_obj_stripe_size": "4194304", "rgw_extended_http_attrs": "", "mutex_perf_counter": "false", "internal_safe_to_start_threads": "true"}
-----Original Message-----
From: Mark Nelson [mailto:mark.nelson@inktank.com]
Sent: Thursday, 20 December 2012 11:28 PM
To: Matthew Anderson
Cc: ceph-devel@vger.kernel.org
Subject: Re: OSD's slow down to a crawl
Hi Matt,
Yeah, your log is looking pretty sparse! I imagine Sam will have a lot of questions for you. Out of curiosity have you looked at any io/cpu/memory stats when things slow down? sysprof may come in handy if it looks like it's CPU related. Also, check out the OSD admin socket. Sebastien Han made a nice little blog entry describing it:
http://www.sebastien-han.fr/blog/2012/08/14/ceph-admin-socket/
You can get a lot of information about the current state of things.
Mark
--
To unsubscribe from this list: send the line "unsubscribe ceph-devel" in the body of a message to majordomo@vger.kernel.org
More majordomo info at http://vger.kernel.org/majordomo-info.html
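The admin socket output in this message is easier to read once the raw counters are reduced to a few numbers. The following is only a minimal Python sketch, not part of the original mails. It assumes that each {"avgcount", "sum"} pair in the perf dump is an event count plus a running total in seconds, so that sum / avgcount is a mean latency. The function names average_latencies and arrival_gaps are made up for illustration. The values are copied from the osd.24 dumps above, trimmed to three counters and the four client write ops.

```python
import json
from datetime import datetime

# Three counters copied from the osd.24 "perf dump" output above.
PERF_DUMP = json.loads("""
{"filestore": {"journal_latency": {"avgcount": 20868, "sum": 185.268022},
               "apply_latency":   {"avgcount": 20868, "sum": 508.896737}},
 "osd":       {"op_w_latency":    {"avgcount": 6808,  "sum": 65.349684}}}
""")

# (received_at, duration in seconds) of the four client osd_op writes
# listed by dump_historic_ops above.
HISTORIC_OPS = [
    ("2012-12-20 23:34:42.012131", 0.009988),
    ("2012-12-20 23:34:49.851021", 0.007114),
    ("2012-12-20 23:35:34.351518", 0.009430),
    ("2012-12-20 23:35:42.133103", 0.008357),
]


def average_latencies(perf):
    """Map "section.counter" to sum/avgcount for every non-empty pair.

    Assumption: "sum" is total seconds and "avgcount" the number of samples.
    """
    result = {}
    for section, counters in perf.items():
        for name, value in counters.items():
            if isinstance(value, dict) and value.get("avgcount"):
                result[f"{section}.{name}"] = value["sum"] / value["avgcount"]
    return result


def arrival_gaps(ops):
    """Seconds between consecutive received_at timestamps."""
    fmt = "%Y-%m-%d %H:%M:%S.%f"
    times = [datetime.strptime(stamp, fmt) for stamp, _ in ops]
    return [(later - earlier).total_seconds()
            for earlier, later in zip(times, times[1:])]


if __name__ == "__main__":
    for counter, seconds in average_latencies(PERF_DUMP).items():
        print(f"{counter}: {seconds * 1000:.2f} ms on average")
    for gap, (_, duration) in zip(arrival_gaps(HISTORIC_OPS), HISTORIC_OPS[1:]):
        print(f"op arrived {gap:.1f} s after the previous one, took {duration * 1000:.1f} ms")
```

Under that assumption the journal and apply latencies average roughly 9 ms and 24 ms, and each recorded write finishes in about 10 ms while consecutive writes reach the OSD between about 8 and 44 seconds apart. That would be consistent with the OSD servicing what it receives quickly and the stall sitting somewhere before the op arrives, but the sketch only summarises the numbers and does not prove where the delay is.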
* Re: OSD's slow down to a crawl
2012-12-20 15:51 ` Matthew Anderson
@ 2012-12-20 15:59 ` Mark Nelson
2012-12-20 16:08 ` Matthew Anderson
0 siblings, 1 reply; 13+ messages in thread
From: Mark Nelson @ 2012-12-20 15:59 UTC (permalink / raw)
To: Matthew Anderson; +Cc: ceph-devel@vger.kernel.org
Out of curiosity, if you fire up a rados bench instance on one of the nodes with, say, 256 concurrent writes, do any of the writes complete?
Mark
On 12/20/2012 09:51 AM, Matthew Anderson wrote:
> Hi Mark,
>
> Thanks for the quick reply. The log was 380MB in total so I just posted the last 200 lines hoping that a trained eye might be able to spot something. I'm happy to put the file up on a web server if anyone wants to review it in full.
>
> It doesn't appear to be memory or CPU related. In its hung state (both read and write are not responding at the moment, I have a VM hung trying to shut down and an rbd stuck exporting) all of the OSDs sit around 0.3% CPU usage, ~300MB of RAM and no IO wait. The server has roughly 40GB RAM free and is easily 90% idle. Iotop shows no IO coming from any of the ceph-osd processes.
>
> The admin socket showed no ops in flight but the performance dump, historic ops and config are below. I did notice that the age of some of the ops was quite high considering things were actively trying to write to the cluster.
>
> # ceph --admin-daemon ./ceph-osd.24.asok perf dump
>
> {"filestore":{"journal_queue_max_ops":500,"journal_queue_ops":0,"journal_ops":20868,"journal_queue_max_bytes":104857600,"journal_queue_bytes":0,"journal_bytes":1710397238,"journal_latency":{"avgcount":20868,"sum":185.268022000},"journal_wr":13354,"journal_wr_bytes":{"avgcount":13354,"sum":1770647552},"op_queue_max_ops":500,"op_queue_ops":0,"ops":20868,"op_queue_max_bytes":104857600,"op_queue_bytes":0,"bytes":1710232583,"apply_latency":{"avgcount":20868,"sum":508.896737000},"committing":0,"commitcycle":339,"commitcycle_interval":{"avgcount":339,"sum":1708.284725000},"commitcycle_latency":{"avgcount":339,"sum":13.241850000},"journal_full":0},"mutex-FileJournal::completions_lock":{"wait":{"avgcount":0,"sum":0.000000000}},"mutex-FileJournal::finisher_lock":{"wait":{"avgcount":0,"sum":0.000000000}},"mutex-FileJournal::write_lock":{"wait":{"avgcount":0,"sum":0.000000000}},"mutex-FileJournal::writeq_lock":{"wait":{"avgcount":0,"sum":0.000000000}},"mutex-JOS::ApplyManager::apply_lock":{"wait":{"avgcount":0,"sum":0.000000000}},"mutex-JOS::ApplyManager::com_lock":{"wait":{"avgcount":0,"sum":0.000000000}},"mutex-JOS::SubmitManager::lock":{"wait":{"avgcount":0,"sum":0.000000000}},"osd":{"opq":0,"op_wip":0,"op":31753,"op_in_bytes":797806224,"op_out_bytes":1728414048,"op_latency":{"avgcount":31753,"sum":82.183373000},"op_r":24945,"op_r_out_bytes":1728414048,"op_r_latency":{"avgcount":24945,"sum":16.833689000},"op_w":6808,"op_w_in_bytes":797806224,"op_w_rlat":{"avgcount":6808,"sum":38.223006000},"op_w_latency":{"avgcount":6808,"sum":65.349684000},"op_rw":0,"op_rw_in_bytes":0,"op_rw_out_bytes":0,"op_rw_rlat":{"avgcount":0,"sum":0.000000000},"op_rw_latency":{"avgcount":0,"sum":0.000000000},"subop":5700,"subop_in_bytes":813224156,"subop_latency":{"avgcount":5700,"sum":26.877085000},"subop_w":0,"subop_w_in_bytes":813224156,"subop_w_latency":{"avgcount":5700,"sum":26.877085000},"subop_pull":0,"subop_pull_latency":{"avgcount":0,"sum":0.000000000},"subop_push":0,"subop_push_in_bytes":0,"subop_push_latency":{"avgcount":0,"sum":0.000000000},"pull":0,"push":0,"push_out_bytes":0,"push_in":0,"push_in_bytes":0,"recovery_ops":0,"loadavg":359,"buffer_bytes":0,"numpg":2486,"numpg_primary":1214,"numpg_replica":1272,"numpg_stray":0,"heartbeat_to_peers":4,"heartbeat_from_peers":0,"map_messages":44,"map_message_epochs":184,"map_message_epoch_dups":174},"throttle-filestore_bytes":{"val":0,"max":104857600,"get":0,"get_sum":0,"get_or_fail_fail":0,"get_or_fail_success":0,"take":20868,"take_sum":1710397238,"put":13354,"put_sum":1710397238,"wait":{"avgcount":0,"sum":0.000000000}},"throttle-filestore_ops":{"val":0,"max":500,"get":0,"get_sum":0,"get_or_fail_fail":0,"get_or_fail_success":0,"take":20868,"take_sum":20868,"put":13354,"put_sum":20868,"wait":{"avgcount":0,"sum":0.000000000}},"throttle-msgr_dispatch_throttler-client":{"val":0,"max":104857600,"get":31657,"get_sum":795546573,"get_or_fail_fail":0,"get_or_fail_success":0,"take":0,"take_sum":0,"put":31657,"put_sum":795546573,"wait":{"avgcount":0,"sum":0.000000000}},"throttle-msgr_dispatch_throttler-cluster":{"val":0,"max":104857600,"get":18913,"get_sum":828011293,"get_or_fail_fail":0,"get_or_fail_success":0,"take":0,"take_sum":0,"put":18913,"put_sum":828011293,"wait":{"avgcount":0,"sum":0.000000000}},"throttle-msgr_dispatch_throttler-hbclient":{"val":0,"max":104857600,"get":4260,"get_sum":200220,"get_or_fail_fail":0,"get_or_fail_success":0,"take":0,"take_sum":0,"put":4260,"put_sum":200220,"wait":{"avgcount":0,"sum":0.000000000}},"throttle-msgr_dispatch_throttler-hbserver":{"val":0,"max":104857600,"get":4236,"get_sum":199092,"get_or_fail_fail":0,"get_or_fail_success":0,"take":0,"take_sum":0,"put":4236,"put_sum":199092,"wait":{"avgcount":0,"sum":0.000000000}},"throttle-osd_client_bytes":{"val":0,"max":524288000,"get":31308,"get_sum":795353720,"get_or_fail_fail":0,"get_or_fail_success":0,"take":0,"take_sum":0,"put":37671,"put_sum":795353720,"wait":{"avgcount":0,"sum":0.000000000}}}
>
> # ceph --admin-daemon ./ceph-osd.24.asok dump_historic_ops
>
> { "description": "osd_op(client.7636.0:101498 rb.0.122c.6b8b4567.000000000357 [write 3600384~4096] 6.42463fe3)", > "received_at": "2012-12-20 23:34:42.012131", > "age": "75.428051", > "duration": "0.009988", > "flag_point": "waiting for sub ops", > "client_info": { "client": "client.7636", > "tid": 101498}, > "events": [ > { "time": "2012-12-20 23:34:42.012317", > "event": "waiting_for_osdmap"}, > { "time": "2012-12-20 23:34:42.012425", > "event": "reached_pg"}, > { "time": "2012-12-20 23:34:42.012678", > "event": "started"}, > { "time": "2012-12-20 23:34:42.012720", > "event": "started"}, > { "time": "2012-12-20 23:34:42.012815", > "event": "sub_op_sent"}, > { "time": "2012-12-20 23:34:42.012958", > "event": "commit_queued_for_journal_write"}, > { "time": "2012-12-20 23:34:42.013033", > "event": "write_thread_in_journal_buffer"}, > { "time": "2012-12-20 23:34:42.013745", > "event": "journaled_completion_queued"}, > { "time": "2012-12-20
23:34:42.013803", > "event": "op_commit"}, > { "time": "2012-12-20 23:34:42.017414", > "event": "sub_op_commit_rec"}, > { "time": "2012-12-20 23:34:42.022066", > "event": "op_applied"}, > { "time": "2012-12-20 23:34:42.022119", > "event": "done"}]}, > { "description": "osd_sub_op_reply(client.7636.0:101498 6.1fe3 42463fe3\/rb.0.122c.6b8b4567.000000000357\/head\/\/6 [] ondisk, result = 0)", > "received_at": "2012-12-20 23:34:42.017048", > "age": "75.423134", > "duration": "0.000411", > "flag_point": "started", > "events": [ > { "time": "2012-12-20 23:34:42.017218", > "event": "waiting_for_osdmap"}, > { "time": "2012-12-20 23:34:42.017363", > "event": "reached_pg"}, > { "time": "2012-12-20 23:34:42.017401", > "event": "started"}, > { "time": "2012-12-20 23:34:42.017459", > "event": "done"}]}, > { "description": "osd_op(client.7636.0:101502 rb.0.122c.6b8b4567.000000000064 [write 1512448~61440] 6.a241e34d)", > "received_at": "2012-12-20 23:34:49.851021", > "age": "67.589161", > "duration": "0.007114", > "flag_point": "waiting for sub ops", > "client_info": { "client": "client.7636", > "tid": 101502}, > "events": [ > { "time": "2012-12-20 23:34:49.851276", > "event": "waiting_for_osdmap"}, > { "time": "2012-12-20 23:34:49.851429", > "event": "reached_pg"}, > { "time": "2012-12-20 23:34:49.851616", > "event": "started"}, > { "time": "2012-12-20 23:34:49.851648", > "event": "started"}, > { "time": "2012-12-20 23:34:49.851712", > "event": "sub_op_sent"}, > { "time": "2012-12-20 23:34:49.851770", > "event": "commit_queued_for_journal_write"}, > { "time": "2012-12-20 23:34:49.851915", > "event": "write_thread_in_journal_buffer"}, > { "time": "2012-12-20 23:34:49.853956", > "event": "journaled_completion_queued"}, > { "time": "2012-12-20 23:34:49.853992", > "event": "op_commit"}, > { "time": "2012-12-20 23:34:49.857016", > "event": "sub_op_commit_rec"}, > { "time": "2012-12-20 23:34:49.858077", > "event": "op_applied"}, > { "time": "2012-12-20 23:34:49.858135", > "event": 
"done"}]}, > { "description": "osd_sub_op_reply(client.7636.0:101502 6.234d a241e34d\/rb.0.122c.6b8b4567.000000000064\/head\/\/6 [] ondisk, result = 0)", > "received_at": "2012-12-20 23:34:49.856762", > "age": "67.583420", > "duration": "0.000308", > "flag_point": "started", > "events": [ > { "time": "2012-12-20 23:34:49.856869", > "event": "waiting_for_osdmap"}, > { "time": "2012-12-20 23:34:49.856969", > "event": "reached_pg"}, > { "time": "2012-12-20 23:34:49.857001", > "event": "started"}, > { "time": "2012-12-20 23:34:49.857070", > "event": "done"}]}, > { "description": "osd_op(client.7636.0:101504 rb.0.122c.6b8b4567.00000000012c [write 1658880~2048] 6.92d2c38c)", > "received_at": "2012-12-20 23:35:34.351518", > "age": "23.088664", > "duration": "0.009430", > "flag_point": "waiting for sub ops", > "client_info": { "client": "client.7636", > "tid": 101504}, > "events": [ > { "time": "2012-12-20 23:35:34.351714", > "event": "waiting_for_osdmap"}, > { "time": "2012-12-20 23:35:34.351811", > "event": "reached_pg"}, > { "time": "2012-12-20 23:35:34.352030", > "event": "started"}, > { "time": "2012-12-20 23:35:34.352071", > "event": "started"}, > { "time": "2012-12-20 23:35:34.352146", > "event": "sub_op_sent"}, > { "time": "2012-12-20 23:35:34.352213", > "event": "commit_queued_for_journal_write"}, > { "time": "2012-12-20 23:35:34.352287", > "event": "write_thread_in_journal_buffer"}, > { "time": "2012-12-20 23:35:34.355390", > "event": "journaled_completion_queued"}, > { "time": "2012-12-20 23:35:34.355475", > "event": "op_commit"}, > { "time": "2012-12-20 23:35:34.356661", > "event": "sub_op_commit_rec"}, > { "time": "2012-12-20 23:35:34.360894", > "event": "op_applied"}, > { "time": "2012-12-20 23:35:34.360948", > "event": "done"}]}, > { "description": "osd_sub_op_reply(client.7636.0:101504 6.38c 92d2c38c\/rb.0.122c.6b8b4567.00000000012c\/head\/\/6 [] ondisk, result = 0)", > "received_at": "2012-12-20 23:35:34.356286", > "age": "23.083896", > "duration": 
"0.000421", > "flag_point": "started", > "events": [ > { "time": "2012-12-20 23:35:34.356489", > "event": "waiting_for_osdmap"}, > { "time": "2012-12-20 23:35:34.356598", > "event": "reached_pg"}, > { "time": "2012-12-20 23:35:34.356640", > "event": "started"}, > { "time": "2012-12-20 23:35:34.356707", > "event": "done"}]}, > { "description": "osd_op(client.7636.0:101512 rb.0.122c.6b8b4567.000000000357 [write 3731456~4096] 6.42463fe3)", > "received_at": "2012-12-20 23:35:42.133103", > "age": "15.307079", > "duration": "0.008357", > "flag_point": "waiting for sub ops", > "client_info": { "client": "client.7636", > "tid": 101512}, > "events": [ > { "time": "2012-12-20 23:35:42.133315", > "event": "waiting_for_osdmap"}, > { "time": "2012-12-20 23:35:42.133427", > "event": "reached_pg"}, > { "time": "2012-12-20 23:35:42.133684", > "event": "started"}, > { "time": "2012-12-20 23:35:42.133723", > "event": "started"}, > { "time": "2012-12-20 23:35:42.133796", > "event": "sub_op_sent"}, > { "time": "2012-12-20 23:35:42.133908", > "event": "commit_queued_for_journal_write"}, > { "time": "2012-12-20 23:35:42.133995", > "event": "write_thread_in_journal_buffer"}, > { "time": "2012-12-20 23:35:42.136965", > "event": "journaled_completion_queued"}, > { "time": "2012-12-20 23:35:42.137025", > "event": "op_commit"}, > { "time": "2012-12-20 23:35:42.138456", > "event": "sub_op_commit_rec"}, > { "time": "2012-12-20 23:35:42.141408", > "event": "op_applied"}, > { "time": "2012-12-20 23:35:42.141460", > "event": "done"}]}, > { "description": "osd_sub_op_reply(client.7636.0:101512 6.1fe3 42463fe3\/rb.0.122c.6b8b4567.000000000357\/head\/\/6 [] ondisk, result = 0)", > "received_at": "2012-12-20 23:35:42.138087", > "age": "15.302095", > "duration": "0.000410", > "flag_point": "started", > "events": [ > { "time": "2012-12-20 23:35:42.138248", > "event": "waiting_for_osdmap"}, > { "time": "2012-12-20 23:35:42.138363", > "event": "reached_pg"}, > { "time": "2012-12-20 23:35:42.138443", > 
"event": "started"}, > { "time": "2012-12-20 23:35:42.138497", > "event": "done"}]}]} > > # ceph --admin-daemon ./ceph-osd.24.asok config show > { "name": "osd.24", > "cluster": "ceph", > "none": "0\/5", > "lockdep": "0\/1", > "context": "0\/1", > "crush": "1\/1", > "mds": "1\/5", > "mds_balancer": "1\/5", > "mds_locker": "1\/5", > "mds_log": "1\/5", > "mds_log_expire": "1\/5", > "mds_migrator": "1\/5", > "buffer": "0\/1", > "timer": "0\/1", > "filer": "0\/1", > "striper": "0\/1", > "objecter": "0\/1", > "rados": "0\/5", > "rbd": "0\/5", > "journaler": "0\/5", > "objectcacher": "0\/5", > "client": "0\/5", > "osd": "0\/5", > "optracker": "0\/5", > "objclass": "0\/5", > "filestore": "1\/3", > "journal": "1\/3", > "ms": "0\/5", > "mon": "1\/5", > "monc": "0\/10", > "paxos": "0\/5", > "tp": "0\/5", > "auth": "1\/5", > "crypto": "1\/5", > "finisher": "1\/1", > "heartbeatmap": "1\/5", > "perfcounter": "1\/5", > "rgw": "1\/5", > "hadoop": "1\/5", > "javaclient": "1\/5", > "asok": "1\/5", > "throttle": "1\/1", > "host": "KVM04", > "fsid": "00000000-0000-0000-0000-000000000000", > "public_addr": ":\/0", > "cluster_addr": ":\/0", > "public_network": "", > "cluster_network": "", > "num_client": "1", > "monmap": "", > "mon_host": "", > "lockdep": "false", > "admin_socket": "\/var\/run\/ceph\/ceph-osd.24.asok", > "daemonize": "true", > "pid_file": "\/var\/run\/ceph\/osd.24.pid", > "chdir": "\/", > "max_open_files": "0", > "fatal_signal_handlers": "true", > "log_file": "\/var\/log\/ceph\/osd.24.log", > "log_max_new": "1000", > "log_max_recent": "100000", > "log_to_stderr": "false", > "err_to_stderr": "true", > "log_to_syslog": "false", > "err_to_syslog": "false", > "log_flush_on_exit": "true", > "clog_to_monitors": "true", > "clog_to_syslog": "false", > "mon_cluster_log_to_syslog": "false", > "mon_cluster_log_file": "\/var\/log\/ceph\/ceph.log", > "key": "", > "keyfile": "", > "keyring": "\/srv\/ceph\/osd\/osd.24\/keyring", > "heartbeat_interval": "5", > "heartbeat_file": "", > 
"perf": "true", > "ms_tcp_nodelay": "true", > "ms_initial_backoff": "0.2", > "ms_max_backoff": "15", > "ms_nocrc": "false", > "ms_die_on_bad_msg": "false", > "ms_dispatch_throttle_bytes": "104857600", > "ms_bind_ipv6": "false", > "ms_bind_port_min": "6800", > "ms_bind_port_max": "7100", > "ms_rwthread_stack_bytes": "1048576", > "ms_tcp_read_timeout": "900", > "ms_inject_socket_failures": "0", > "ms_inject_delay_type": "", > "ms_inject_delay_max": "1", > "ms_inject_delay_probability": "0", > "mon_data": "\/var\/lib\/ceph\/mon\/ceph-24", > "mon_initial_members": "", > "mon_sync_fs_threshold": "5", > "mon_tick_interval": "5", > "mon_subscribe_interval": "300", > "mon_osd_laggy_halflife": "3600", > "mon_osd_laggy_weight": "0.3", > "mon_osd_adjust_heartbeat_grace": "true", > "mon_osd_adjust_down_out_interval": "true", > "mon_osd_auto_mark_in": "false", > "mon_osd_auto_mark_auto_out_in": "true", > "mon_osd_auto_mark_new_in": "true", > "mon_osd_down_out_interval": "300", > "mon_osd_min_up_ratio": "0.3", > "mon_osd_min_in_ratio": "0.3", > "mon_lease": "5", > "mon_lease_renew_interval": "3", > "mon_lease_ack_timeout": "10", > "mon_clock_drift_allowed": "0.05", > "mon_clock_drift_warn_backoff": "5", > "mon_accept_timeout": "10", > "mon_pg_create_interval": "30", > "mon_pg_stuck_threshold": "300", > "mon_osd_full_ratio": "0.95", > "mon_osd_nearfull_ratio": "0.85", > "mon_globalid_prealloc": "100", > "mon_osd_report_timeout": "900", > "mon_force_standby_active": "true", > "mon_min_osdmap_epochs": "500", > "mon_max_pgmap_epochs": "500", > "mon_max_log_epochs": "500", > "mon_max_osd": "10000", > "mon_probe_timeout": "2", > "mon_slurp_timeout": "10", > "mon_slurp_bytes": "262144", > "mon_client_bytes": "104857600", > "mon_daemon_bytes": "419430400", > "mon_max_log_entries_per_event": "4096", > "paxos_max_join_drift": "10", > "paxos_propose_interval": "1", > "paxos_min_wait": "0.05", > "clock_offset": "0", > "auth_cluster_required": "none", > "auth_service_required": "none", > 
"auth_client_required": "none", > "auth_supported": "none", > "cephx_require_signatures": "false", > "cephx_sign_messages": "true", > "auth_mon_ticket_ttl": "43200", > "auth_service_ticket_ttl": "3600", > "mon_client_hunt_interval": "3", > "mon_client_ping_interval": "10", > "mon_client_max_log_entries_per_message": "1000", > "mon_max_pool_pg_num": "65536", > "client_cache_size": "16384", > "client_cache_mid": "0.75", > "client_use_random_mds": "false", > "client_mount_timeout": "30", > "client_tick_interval": "1", > "client_trace": "", > "client_readahead_min": "131072", > "client_readahead_max_bytes": "0", > "client_readahead_max_periods": "4", > "client_snapdir": ".snap", > "client_mountpoint": "\/", > "client_notify_timeout": "10", > "client_caps_release_delay": "5", > "client_oc": "true", > "client_oc_size": "209715200", > "client_oc_max_dirty": "104857600", > "client_oc_target_dirty": "8388608", > "client_oc_max_dirty_age": "5", > "client_oc_max_objects": "1000", > "client_debug_force_sync_read": "false", > "client_debug_inject_tick_delay": "0", > "fuse_use_invalidate_cb": "false", > "fuse_big_writes": "true", > "fuse_debug": "false", > "objecter_tick_interval": "5", > "objecter_timeout": "10", > "objecter_inflight_op_bytes": "104857600", > "objecter_inflight_ops": "1024", > "journaler_allow_split_entries": "true", > "journaler_write_head_interval": "15", > "journaler_prefetch_periods": "10", > "journaler_prezero_periods": "5", > "journaler_batch_interval": "0.001", > "journaler_batch_max": "0", > "mds_data": "\/var\/lib\/ceph\/mds\/ceph-24", > "mds_max_file_size": "1099511627776", > "mds_cache_size": "100000", > "mds_cache_mid": "0.7", > "mds_mem_max": "1048576", > "mds_dir_commit_ratio": "0.5", > "mds_dir_max_commit_size": "90", > "mds_decay_halflife": "5", > "mds_beacon_interval": "4", > "mds_beacon_grace": "15", > "mds_blacklist_interval": "1440", > "mds_session_timeout": "60", > "mds_session_autoclose": "300", > "mds_reconnect_timeout": "45", > 
"mds_tick_interval": "5", > "mds_dirstat_min_interval": "1", > "mds_scatter_nudge_interval": "5", > "mds_client_prealloc_inos": "1000", > "mds_early_reply": "true", > "mds_use_tmap": "true", > "mds_default_dir_hash": "2", > "mds_log": "true", > "mds_log_skip_corrupt_events": "false", > "mds_log_max_events": "-1", > "mds_log_max_segments": "30", > "mds_log_max_expiring": "20", > "mds_bal_sample_interval": "3", > "mds_bal_replicate_threshold": "8000", > "mds_bal_unreplicate_threshold": "0", > "mds_bal_frag": "false", > "mds_bal_split_size": "10000", > "mds_bal_split_rd": "25000", > "mds_bal_split_wr": "10000", > "mds_bal_split_bits": "3", > "mds_bal_merge_size": "50", > "mds_bal_merge_rd": "1000", > "mds_bal_merge_wr": "1000", > "mds_bal_interval": "10", > "mds_bal_fragment_interval": "5", > "mds_bal_idle_threshold": "0", > "mds_bal_max": "-1", > "mds_bal_max_until": "-1", > "mds_bal_mode": "0", > "mds_bal_min_rebalance": "0.1", > "mds_bal_min_start": "0.2", > "mds_bal_need_min": "0.8", > "mds_bal_need_max": "1.2", > "mds_bal_midchunk": "0.3", > "mds_bal_minchunk": "0.001", > "mds_bal_target_removal_min": "5", > "mds_bal_target_removal_max": "10", > "mds_replay_interval": "1", > "mds_shutdown_check": "0", > "mds_thrash_exports": "0", > "mds_thrash_fragments": "0", > "mds_dump_cache_on_map": "false", > "mds_dump_cache_after_rejoin": "false", > "mds_verify_scatter": "false", > "mds_debug_scatterstat": "false", > "mds_debug_frag": "false", > "mds_debug_auth_pins": "false", > "mds_debug_subtrees": "false", > "mds_kill_mdstable_at": "0", > "mds_kill_export_at": "0", > "mds_kill_import_at": "0", > "mds_kill_link_at": "0", > "mds_kill_rename_at": "0", > "mds_wipe_sessions": "false", > "mds_wipe_ino_prealloc": "false", > "mds_skip_ino": "0", > "max_mds": "1", > "mds_standby_for_name": "", > "mds_standby_for_rank": "-1", > "mds_standby_replay": "false", > "osd_auto_upgrade_tmap": "true", > "osd_tmapput_sets_uses_tmap": "false", > "osd_max_backfills": "10", > 
"osd_backfill_full_ratio": "0.85", > "osd_backfill_retry_interval": "10", > "osd_uuid": "00000000-0000-0000-0000-000000000000", > "osd_data": "\/srv\/ceph\/osd\/osd.24", > "osd_journal": "\/dev\/sda3", > "osd_journal_size": "5120", > "osd_max_write_size": "90", > "osd_max_pgls": "1024", > "osd_client_message_size_cap": "524288000", > "osd_pg_bits": "6", > "osd_pgp_bits": "6", > "osd_min_rep": "1", > "osd_max_rep": "10", > "osd_pool_default_crush_rule": "0", > "osd_pool_default_size": "2", > "osd_pool_default_min_size": "0", > "osd_pool_default_pg_num": "8", > "osd_pool_default_pgp_num": "8", > "osd_map_dedup": "true", > "osd_map_cache_size": "500", > "osd_map_message_max": "100", > "osd_op_threads": "2", > "osd_disk_threads": "1", > "osd_recovery_threads": "1", > "osd_recover_clone_overlap": "true", > "osd_backfill_scan_min": "64", > "osd_backfill_scan_max": "512", > "osd_op_thread_timeout": "30", > "osd_recovery_thread_timeout": "30", > "osd_snap_trim_thread_timeout": "3600", > "osd_scrub_thread_timeout": "60", > "osd_scrub_finalize_thread_timeout": "600", > "osd_remove_thread_timeout": "3600", > "osd_command_thread_timeout": "600", > "osd_age": "0.8", > "osd_age_time": "0", > "osd_heartbeat_addr": ":\/0", > "osd_heartbeat_interval": "6", > "osd_heartbeat_grace": "20", > "osd_mon_heartbeat_interval": "30", > "osd_mon_report_interval_max": "120", > "osd_mon_report_interval_min": "5", > "osd_mon_ack_timeout": "30", > "osd_min_down_reporters": "1", > "osd_min_down_reports": "3", > "osd_default_data_pool_replay_window": "45", > "osd_preserve_trimmed_log": "false", > "osd_auto_mark_unfound_lost": "false", > "osd_recovery_delay_start": "0", > "osd_recovery_max_active": "50", > "osd_recovery_max_chunk": "8388608", > "osd_recovery_forget_lost_objects": "false", > "osd_max_scrubs": "1", > "osd_scrub_load_threshold": "0.5", > "osd_scrub_min_interval": "300", > "osd_scrub_max_interval": "86400", > "osd_deep_scrub_interval": "604800", > "osd_deep_scrub_stride": "524288", > 
"osd_auto_weight": "false", > "osd_class_dir": "\/usr\/local\/lib\/rados-classes", > "osd_check_for_log_corruption": "false", > "osd_use_stale_snap": "false", > "osd_rollback_to_cluster_snap": "", > "osd_default_notify_timeout": "30", > "osd_kill_backfill_at": "0", > "osd_min_pg_log_entries": "1000", > "osd_op_complaint_time": "30", > "osd_command_max_records": "256", > "osd_op_log_threshold": "5", > "osd_verify_sparse_read_holes": "false", > "osd_debug_drop_ping_probability": "0", > "osd_debug_drop_ping_duration": "0", > "osd_debug_drop_pg_create_probability": "0", > "osd_debug_drop_pg_create_duration": "1", > "osd_debug_drop_op_probability": "0", > "osd_op_history_size": "20", > "osd_op_history_duration": "600", > "osd_target_transaction_size": "300", > "osd_client_op_priority": "63", > "osd_recovery_op_priority": "10", > "filestore": "false", > "filestore_index_retry_probability": "0", > "filestore_debug_omap_check": "false", > "filestore_xattr_use_omap": "false", > "filestore_max_inline_xattr_size": "512", > "filestore_max_inline_xattrs": "2", > "filestore_max_sync_interval": "5", > "filestore_min_sync_interval": "0.01", > "filestore_btrfs_snap": "true", > "filestore_btrfs_clone_range": "true", > "filestore_fsync_flushes_journal_data": "false", > "filestore_fiemap": "false", > "filestore_flusher": "true", > "filestore_flusher_max_fds": "512", > "filestore_flush_min": "65536", > "filestore_sync_flush": "false", > "filestore_journal_parallel": "false", > "filestore_journal_writeahead": "false", > "filestore_journal_trailing": "false", > "filestore_queue_max_ops": "500", > "filestore_queue_max_bytes": "104857600", > "filestore_queue_committing_max_ops": "500", > "filestore_queue_committing_max_bytes": "104857600", > "filestore_op_threads": "2", > "filestore_op_thread_timeout": "60", > "filestore_op_thread_suicide_timeout": "180", > "filestore_commit_timeout": "600", > "filestore_fiemap_threshold": "4096", > "filestore_merge_threshold": "10", > 
"filestore_split_multiple": "2", > "filestore_update_to": "1000", > "filestore_blackhole": "false", > "filestore_dump_file": "", > "filestore_kill_at": "0", > "filestore_fail_eio": "true", > "journal_dio": "true", > "journal_aio": "false", > "journal_block_align": "true", > "journal_max_write_bytes": "10485760", > "journal_max_write_entries": "100", > "journal_queue_max_ops": "500", > "journal_queue_max_bytes": "104857600", > "journal_align_min_size": "65536", > "journal_replay_from": "0", > "journal_zero_on_create": "false", > "rbd_cache": "false", > "rbd_cache_size": "33554432", > "rbd_cache_max_dirty": "25165824", > "rbd_cache_target_dirty": "16777216", > "rbd_cache_max_dirty_age": "1", > "nss_db_path": "", > "rgw_data": "\/var\/lib\/ceph\/radosgw\/ceph-24", > "rgw_enable_apis": "s3, swift, swift_auth, admin", > "rgw_cache_enabled": "true", > "rgw_cache_lru_size": "10000", > "rgw_socket_path": "", > "rgw_dns_name": "", > "rgw_swift_url": "", > "rgw_swift_url_prefix": "swift", > "rgw_swift_auth_url": "", > "rgw_swift_auth_entry": "auth", > "rgw_keystone_url": "", > "rgw_keystone_admin_token": "", > "rgw_keystone_accepted_roles": "Member, admin", > "rgw_keystone_token_cache_size": "10000", > "rgw_keystone_revocation_interval": "900", > "rgw_admin_entry": "admin", > "rgw_enforce_swift_acls": "true", > "rgw_swift_token_expiration": "86400", > "rgw_print_continue": "true", > "rgw_remote_addr_param": "REMOTE_ADDR", > "rgw_op_thread_timeout": "600", > "rgw_op_thread_suicide_timeout": "0", > "rgw_thread_pool_size": "100", > "rgw_num_control_oids": "8", > "rgw_cluster_root_pool": ".rgw.root", > "rgw_log_nonexistent_bucket": "false", > "rgw_log_object_name": "%Y-%m-%d-%H-%i-%n", > "rgw_log_object_name_utc": "false", > "rgw_usage_max_shards": "32", > "rgw_usage_max_user_shards": "1", > "rgw_enable_ops_log": "true", > "rgw_enable_usage_log": "true", > "rgw_ops_log_rados": "true", > "rgw_ops_log_socket_path": "", > "rgw_ops_log_data_backlog": "5242880", > 
"rgw_usage_log_flush_threshold": "1024", > "rgw_usage_log_tick_interval": "30", > "rgw_intent_log_object_name": "%Y-%m-%d-%i-%n", > "rgw_intent_log_object_name_utc": "false", > "rgw_init_timeout": "30", > "rgw_mime_types_file": "\/etc\/mime.types", > "rgw_gc_max_objs": "32", > "rgw_gc_obj_min_wait": "7200", > "rgw_gc_processor_max_time": "3600", > "rgw_gc_processor_period": "3600", > "rgw_s3_success_create_obj_status": "0", > "rgw_resolve_cname": "false", > "rgw_obj_stripe_size": "4194304", > "rgw_extended_http_attrs": "", > "mutex_perf_counter": "false", > "internal_safe_to_start_threads": "true"} > > > -----Original Message----- > From: Mark Nelson [mailto:mark.nelson@inktank.com] > Sent: Thursday, 20 December 2012 11:28 PM > To: Matthew Anderson > Cc: ceph-devel@vger.kernel.org > Subject: Re: OSD's slow down to a crawl > > Hi Matt, > > Yeah, your log is looking pretty sparse! I imagine Sam will have a lot of questions for you. Out of curiosity have you looked at any io/cpu/memory stats when things slow down? sysprof may come in handy if it looks like it's CPU related. Also, check out the OSD admin socket. > Sebastien Han made a nice little blog entry describing it: > > http://www.sebastien-han.fr/blog/2012/08/14/ceph-admin-socket/ > > You can get a lot of information about the current state of things. > > Mark > ^ permalink raw reply [flat|nested] 13+ messages in thread
* RE: OSD's slow down to a crawl
2012-12-20 15:59 ` Mark Nelson
@ 2012-12-20 16:08 ` Matthew Anderson
2012-12-20 16:22 ` Mark Nelson
2012-12-20 16:30 ` Sage Weil
0 siblings, 2 replies; 13+ messages in thread
From: Matthew Anderson @ 2012-12-20 16:08 UTC (permalink / raw)
To: 'Mark Nelson'; +Cc: ceph-devel@vger.kernel.org

# rados bench 60 write -t 256 -p ssd
Maintaining 256 concurrent writes of 4194304 bytes for at least 60 seconds.
Object prefix: benchmark_data_KVM03_12985
   sec Cur ops   started  finished  avg MB/s  cur MB/s  last lat   avg lat
     0       0         0         0         0         0         -         0
     1      45        45         0         0         0         -         0
     2      71        71         0         0         0         -         0
     3      90        90         0         0         0         -         0
     4     125       125         0         0         0         -         0
     5     136       136         0         0         0         -         0
     6     144       144         0         0         0         -         0
     7     175       175         0         0         0         -         0
     8     201       201         0         0         0         -         0
     9     217       217         0         0         0         -         0
    10     228       228         0         0         0         -         0
    11     251       251         0         0         0         -         0
    12     255       266        11   3.66581   3.66667   11.7027   11.6013
    13     255       271        16   4.92194        20   12.9795   11.9979
    14     255       291        36   10.2833        80   13.1795   12.6478
    15     255       301        46   12.2639        40   13.9424   12.9123
    16     255       303        48   11.9973         8   14.0277   12.9579
    17     255       321        66    15.526        72   15.1408   13.4616
    18     255       345        90   19.9956        96   15.5673   13.9937
    19     255       349        94   19.7851        16    15.707   14.0635
2012-12-21 00:06:37.198982min lat: 11.4904 max lat: 16.4053 avg lat: 14.3121
   sec Cur ops   started  finished  avg MB/s  cur MB/s  last lat   avg lat
    20     255       361       106   21.1953        48   16.4053   14.3121
    21     255       367       112   21.3285        24   17.4263   14.4795
    22     255       370       115   20.9043        12   17.3579   14.5544
    23     255       387       132   22.9513        68   18.3185   15.0867
    24     255       394       139   23.1614        28   18.2868   15.2504
    25     255       404       149   23.8346        40    18.741   15.5048
    26     255       410       155   23.8407        24    19.032   15.6358
    27     255       410       155   22.9577         0         -   15.6358
    28     255       410       155   22.1378         0         -   15.6358
    29     255       411       156   21.5124   1.33333   22.6927   15.6811
    30     255       431       176   23.4614        80   22.6577   16.4822
    31     255       447       192   24.7687        64   23.2345   17.0563
    32     255       464       209   26.1192        68   23.3788   17.5859
    33     255       485       230   27.8726        84   22.8924   18.1122
    34     255       497       242   28.4642        48    23.563   18.3693
    35     255       508       253   28.9078        44   23.0516   18.5935
    36     255       514       259   28.7713        24   24.4415   18.7278
    37     255       530       275    29.723        64    23.847   19.0424
    38     255       541       286   30.0985        44   24.5757   19.2427
    39     255       544       289   29.6344        12   24.6802   19.2984
2012-12-21 00:06:57.203473min lat: 11.4904 max lat: 24.6802 avg lat: 19.2984
   sec Cur ops   started  finished  avg MB/s  cur MB/s  last lat   avg lat
    40     255       544       289   28.8935         0         -   19.2984
    41     255       544       289   28.1888         0         -   19.2984
    42     255       544       289   27.5176         0         -   19.2984
    43     255       544       289   26.8776         0         -   19.2984
    44     255       544       289   26.2668         0         -   19.2984
    45     255       544       289   25.6831         0         -   19.2984
    46     255       544       289   25.1248         0         -   19.2984
    47     255       544       289   24.5902         0         -   19.2984
    48     255       544       289   24.0779         0         -   19.2984
    49     255       544       289   23.5865         0         -   19.2984
    50     255       544       289   23.1148         0         -   19.2984
    51     255       544       289   22.6616         0         -   19.2984
    52     255       544       289   22.2258         0         -   19.2984
    53     255       544       289   21.8065         0         -   19.2984
    54     255       544       289   21.4026         0         -   19.2984
    55     255       544       289   21.0135         0         -   19.2984
    56     255       544       289   20.6383         0         -   19.2984
    57     255       544       289   20.2762         0         -   19.2984
    58     255       544       289   19.9266         0         -   19.2984
    59     255       544       289   19.5889         0         -   19.2984
2012-12-21 00:07:17.207782min lat: 11.4904 max lat: 24.6802 avg lat: 19.2984
   sec Cur ops   started  finished  avg MB/s  cur MB/s  last lat   avg lat
    60     255       544       289   19.2624         0         -   19.2984
    61     255       544       289   18.9466         0         -   19.2984
    62     255       544       289    18.641         0         -   19.2984
    63     255       544       289   18.3452         0         -   19.2984
    64     255       544       289   18.0585         0         -   19.2984
    65     255       544       289   17.7807         0         -   19.2984
    66     255       544       289   17.5113         0         -   19.2984
    67     133       545       412   24.5916   17.5714   37.3574   27.4834
Total time run: 68.000740
Total writes made: 545
Write size: 4194304
Bandwidth (MB/sec): 32.058
Stddev Bandwidth: 26.8837
Max bandwidth (MB/sec): 96
Min bandwidth (MB/sec): 0
Average Latency: 29.2801
Stddev Latency: 11.9768
Max latency: 52.6277
Min latency: 11.4904

-----Original Message-----
From: Mark Nelson [mailto:mark.nelson@inktank.com]
Sent: Thursday, 20 December 2012 11:59 PM
To: Matthew Anderson
Cc: ceph-devel@vger.kernel.org
Subject: Re: OSD's slow down to a crawl

Out of curiosity, if you fire up a rados bench instance on one of the nodes with say, 256 concurrent writes, do any of the writes complete?
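Whether any writes complete can be read off the per-second rows of the `rados bench` output earlier in this message: a row with a non-zero "Cur ops" and a "cur MB/s" of 0 is a second in which requests were in flight but none finished. The following is only a rough sketch of that check. It assumes the eight-column row layout seen in this thread, and `stalled_seconds` is a hypothetical helper name, not part of Ceph.

```python
def stalled_seconds(bench_output):
    """Return the `sec` values of rows where writes were in flight
    (Cur ops > 0) but nothing completed that second (cur MB/s == 0)."""
    stalled = []
    for line in bench_output.splitlines():
        fields = line.split()
        # Data rows have exactly 8 columns and start with the second counter;
        # header, timestamp and summary lines are skipped by this test.
        if len(fields) != 8 or not fields[0].isdigit():
            continue
        sec = int(fields[0])
        cur_ops = int(fields[1])
        cur_mbs = float(fields[5])
        if cur_ops > 0 and cur_mbs == 0:
            stalled.append(sec)
    return stalled


# A few rows copied from the run in this thread.
sample = """\
   sec Cur ops   started  finished  avg MB/s  cur MB/s  last lat   avg lat
    26     255       410       155   23.8407        24    19.032   15.6358
    27     255       410       155   22.9577         0         -   15.6358
    28     255       410       155   22.1378         0         -   15.6358
    29     255       411       156   21.5124   1.33333   22.6927   15.6811
"""

print(stalled_seconds(sample))
```

Fed the full output, the same function would flag seconds 1 to 11 and the long run from 40 to 66, which matches the rows above where "finished" does not move.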
Mark

On 12/20/2012 09:51 AM, Matthew Anderson wrote:
> Hi Mark,
>
> Thanks for the quick reply. The log was 380MB in total so I just posted the last 200 lines hoping that a trained eye might be able to spot something. I'm happy to put the file up on a web server if anyone wants to review it in full.
>
> It doesn't appear to be memory or CPU related. In its hung state (both read and write are not responding at the moment, I have a VM hung trying to shut down and an rbd stuck exporting) all of the OSD's sit around 0.3% CPU usage, ~300MB of RAM and no IO wait. The server has roughly 40GB RAM free and is easily 90% idle. Iotop shows no IO coming from any of the Ceph-OSD's.
>
> The admin socket showed no ops in flight but the performance dump, historic ops and config are below. I did notice that the age of some of the ops was quite high considering things were actively trying to write to the cluster.
> # ceph --admin-daemon ./ceph-osd.24.asok perf dump
>
> {"filestore":{"journal_queue_max_ops":500,"journal_queue_ops":0,"journal_ops":20868,"journal_queue_max_bytes":104857600,"journal_queue_bytes":0,"journal_bytes":1710397238,"journal_latency":{"avgcount":20868,"sum":185.268022000},"journal_wr":13354,"journal_wr_bytes":{"avgcount":13354,"sum":1770647552},"op_queue_max_ops":500,"op_queue_ops":0,"ops":20868,"op_queue_max_bytes":104857600,"op_queue_bytes":0,"bytes":1710232583,"apply_latency":{"avgcount":20868,"sum":508.896737000},"committing":0,"commitcycle":339,"commitcycle_interval":{"avgcount":339,"sum":1708.284725000},"commitcycle_latency":{"avgcount":339,"sum":13.241850000},"journal_full":0},
> "mutex-FileJournal::completions_lock":{"wait":{"avgcount":0,"sum":0.000000000}},
> "mutex-FileJournal::finisher_lock":{"wait":{"avgcount":0,"sum":0.000000000}},
> "mutex-FileJournal::write_lock":{"wait":{"avgcount":0,"sum":0.000000000}},
> "mutex-FileJournal::writeq_lock":{"wait":{"avgcount":0,"sum":0.000000000}},
> "mutex-JOS::ApplyManager::apply_lock":{"wait":{"avgcount":0,"sum":0.000000000}},
> "mutex-JOS::ApplyManager::com_lock":{"wait":{"avgcount":0,"sum":0.000000000}},
> "mutex-JOS::SubmitManager::lock":{"wait":{"avgcount":0,"sum":0.000000000}},
> "osd":{"opq":0,"op_wip":0,"op":31753,"op_in_bytes":797806224,"op_out_bytes":1728414048,"op_latency":{"avgcount":31753,"sum":82.183373000},"op_r":24945,"op_r_out_bytes":1728414048,"op_r_latency":{"avgcount":24945,"sum":16.833689000},"op_w":6808,"op_w_in_bytes":797806224,"op_w_rlat":{"avgcount":6808,"sum":38.223006000},"op_w_latency":{"avgcount":6808,"sum":65.349684000},"op_rw":0,"op_rw_in_bytes":0,"op_rw_out_bytes":0,"op_rw_rlat":{"avgcount":0,"sum":0.000000000},"op_rw_latency":{"avgcount":0,"sum":0.000000000},"subop":5700,"subop_in_bytes":813224156,"subop_latency":{"avgcount":5700,"sum":26.877085000},"subop_w":0,"subop_w_in_bytes":813224156,"subop_w_latency":{"avgcount":5700,"sum":26.877085000},"subop_pull":0,"subop_pull_latency":{"avgcount":0,"sum":0.000000000},"subop_push":0,"subop_push_in_bytes":0,"subop_push_latency":{"avgcount":0,"sum":0.000000000},"pull":0,"push":0,"push_out_bytes":0,"push_in":0,"push_in_bytes":0,"recovery_ops":0,"loadavg":359,"buffer_bytes":0,"numpg":2486,"numpg_primary":1214,"numpg_replica":1272,"numpg_stray":0,"heartbeat_to_peers":4,"heartbeat_from_peers":0,"map_messages":44,"map_message_epochs":184,"map_message_epoch_dups":174},
> "throttle-filestore_bytes":{"val":0,"max":104857600,"get":0,"get_sum":0,"get_or_fail_fail":0,"get_or_fail_success":0,"take":20868,"take_sum":1710397238,"put":13354,"put_sum":1710397238,"wait":{"avgcount":0,"sum":0.000000000}},
> "throttle-filestore_ops":{"val":0,"max":500,"get":0,"get_sum":0,"get_or_fail_fail":0,"get_or_fail_success":0,"take":20868,"take_sum":20868,"put":13354,"put_sum":20868,"wait":{"avgcount":0,"sum":0.000000000}},
> "throttle-msgr_dispatch_throttler-client":{"val":0,"max":104857600,"get":31657,"get_sum":795546573,"get_or_fail_fail":0,"get_or_fail_success":0,"take":0,"take_sum":0,"put":31657,"put_sum":795546573,"wait":{"avgcount":0,"sum":0.000000000}},
> "throttle-msgr_dispatch_throttler-cluster":{"val":0,"max":104857600,"get":18913,"get_sum":828011293,"get_or_fail_fail":0,"get_or_fail_success":0,"take":0,"take_sum":0,"put":18913,"put_sum":828011293,"wait":{"avgcount":0,"sum":0.000000000}},
> "throttle-msgr_dispatch_throttler-hbclient":{"val":0,"max":104857600,"get":4260,"get_sum":200220,"get_or_fail_fail":0,"get_or_fail_success":0,"take":0,"take_sum":0,"put":4260,"put_sum":200220,"wait":{"avgcount":0,"sum":0.000000000}},
> "throttle-msgr_dispatch_throttler-hbserver":{"val":0,"max":104857600,"get":4236,"get_sum":199092,"get_or_fail_fail":0,"get_or_fail_success":0,"take":0,"take_sum":0,"put":4236,"put_sum":199092,"wait":{"avgcount":0,"sum":0.000000000}},
> "throttle-osd_client_bytes":{"val":0,"max":524288000,"get":31308,"get_sum":795353720,"get_or_fail_fail":0,"get_or_fail_success":0,"take":0,"take_sum":0,"put":37671,"put_sum":795353720,"wait":{"avgcount":0,"sum":0.000000000}}}
>
> # ceph --admin-daemon
./ceph-osd.24.asok dump_historic_ops > > { "description": "osd_op(client.7636.0:101498 rb.0.122c.6b8b4567.000000000357 [write 3600384~4096] 6.42463fe3)", > "received_at": "2012-12-20 23:34:42.012131", > "age": "75.428051", > "duration": "0.009988", > "flag_point": "waiting for sub ops", > "client_info": { "client": "client.7636", > "tid": 101498}, > "events": [ > { "time": "2012-12-20 23:34:42.012317", > "event": "waiting_for_osdmap"}, > { "time": "2012-12-20 23:34:42.012425", > "event": "reached_pg"}, > { "time": "2012-12-20 23:34:42.012678", > "event": "started"}, > { "time": "2012-12-20 23:34:42.012720", > "event": "started"}, > { "time": "2012-12-20 23:34:42.012815", > "event": "sub_op_sent"}, > { "time": "2012-12-20 23:34:42.012958", > "event": "commit_queued_for_journal_write"}, > { "time": "2012-12-20 23:34:42.013033", > "event": "write_thread_in_journal_buffer"}, > { "time": "2012-12-20 23:34:42.013745", > "event": "journaled_completion_queued"}, > { "time": "2012-12-20 23:34:42.013803", > "event": "op_commit"}, > { "time": "2012-12-20 23:34:42.017414", > "event": "sub_op_commit_rec"}, > { "time": "2012-12-20 23:34:42.022066", > "event": "op_applied"}, > { "time": "2012-12-20 23:34:42.022119", > "event": "done"}]}, > { "description": "osd_sub_op_reply(client.7636.0:101498 6.1fe3 42463fe3\/rb.0.122c.6b8b4567.000000000357\/head\/\/6 [] ondisk, result = 0)", > "received_at": "2012-12-20 23:34:42.017048", > "age": "75.423134", > "duration": "0.000411", > "flag_point": "started", > "events": [ > { "time": "2012-12-20 23:34:42.017218", > "event": "waiting_for_osdmap"}, > { "time": "2012-12-20 23:34:42.017363", > "event": "reached_pg"}, > { "time": "2012-12-20 23:34:42.017401", > "event": "started"}, > { "time": "2012-12-20 23:34:42.017459", > "event": "done"}]}, > { "description": "osd_op(client.7636.0:101502 rb.0.122c.6b8b4567.000000000064 [write 1512448~61440] 6.a241e34d)", > "received_at": "2012-12-20 23:34:49.851021", > "age": "67.589161", > "duration": 
"0.007114", > "flag_point": "waiting for sub ops", > "client_info": { "client": "client.7636", > "tid": 101502}, > "events": [ > { "time": "2012-12-20 23:34:49.851276", > "event": "waiting_for_osdmap"}, > { "time": "2012-12-20 23:34:49.851429", > "event": "reached_pg"}, > { "time": "2012-12-20 23:34:49.851616", > "event": "started"}, > { "time": "2012-12-20 23:34:49.851648", > "event": "started"}, > { "time": "2012-12-20 23:34:49.851712", > "event": "sub_op_sent"}, > { "time": "2012-12-20 23:34:49.851770", > "event": "commit_queued_for_journal_write"}, > { "time": "2012-12-20 23:34:49.851915", > "event": "write_thread_in_journal_buffer"}, > { "time": "2012-12-20 23:34:49.853956", > "event": "journaled_completion_queued"}, > { "time": "2012-12-20 23:34:49.853992", > "event": "op_commit"}, > { "time": "2012-12-20 23:34:49.857016", > "event": "sub_op_commit_rec"}, > { "time": "2012-12-20 23:34:49.858077", > "event": "op_applied"}, > { "time": "2012-12-20 23:34:49.858135", > "event": "done"}]}, > { "description": "osd_sub_op_reply(client.7636.0:101502 6.234d a241e34d\/rb.0.122c.6b8b4567.000000000064\/head\/\/6 [] ondisk, result = 0)", > "received_at": "2012-12-20 23:34:49.856762", > "age": "67.583420", > "duration": "0.000308", > "flag_point": "started", > "events": [ > { "time": "2012-12-20 23:34:49.856869", > "event": "waiting_for_osdmap"}, > { "time": "2012-12-20 23:34:49.856969", > "event": "reached_pg"}, > { "time": "2012-12-20 23:34:49.857001", > "event": "started"}, > { "time": "2012-12-20 23:34:49.857070", > "event": "done"}]}, > { "description": "osd_op(client.7636.0:101504 rb.0.122c.6b8b4567.00000000012c [write 1658880~2048] 6.92d2c38c)", > "received_at": "2012-12-20 23:35:34.351518", > "age": "23.088664", > "duration": "0.009430", > "flag_point": "waiting for sub ops", > "client_info": { "client": "client.7636", > "tid": 101504}, > "events": [ > { "time": "2012-12-20 23:35:34.351714", > "event": "waiting_for_osdmap"}, > { "time": "2012-12-20 
23:35:34.351811", > "event": "reached_pg"}, > { "time": "2012-12-20 23:35:34.352030", > "event": "started"}, > { "time": "2012-12-20 23:35:34.352071", > "event": "started"}, > { "time": "2012-12-20 23:35:34.352146", > "event": "sub_op_sent"}, > { "time": "2012-12-20 23:35:34.352213", > "event": "commit_queued_for_journal_write"}, > { "time": "2012-12-20 23:35:34.352287", > "event": "write_thread_in_journal_buffer"}, > { "time": "2012-12-20 23:35:34.355390", > "event": "journaled_completion_queued"}, > { "time": "2012-12-20 23:35:34.355475", > "event": "op_commit"}, > { "time": "2012-12-20 23:35:34.356661", > "event": "sub_op_commit_rec"}, > { "time": "2012-12-20 23:35:34.360894", > "event": "op_applied"}, > { "time": "2012-12-20 23:35:34.360948", > "event": "done"}]}, > { "description": "osd_sub_op_reply(client.7636.0:101504 6.38c 92d2c38c\/rb.0.122c.6b8b4567.00000000012c\/head\/\/6 [] ondisk, result = 0)", > "received_at": "2012-12-20 23:35:34.356286", > "age": "23.083896", > "duration": "0.000421", > "flag_point": "started", > "events": [ > { "time": "2012-12-20 23:35:34.356489", > "event": "waiting_for_osdmap"}, > { "time": "2012-12-20 23:35:34.356598", > "event": "reached_pg"}, > { "time": "2012-12-20 23:35:34.356640", > "event": "started"}, > { "time": "2012-12-20 23:35:34.356707", > "event": "done"}]}, > { "description": "osd_op(client.7636.0:101512 rb.0.122c.6b8b4567.000000000357 [write 3731456~4096] 6.42463fe3)", > "received_at": "2012-12-20 23:35:42.133103", > "age": "15.307079", > "duration": "0.008357", > "flag_point": "waiting for sub ops", > "client_info": { "client": "client.7636", > "tid": 101512}, > "events": [ > { "time": "2012-12-20 23:35:42.133315", > "event": "waiting_for_osdmap"}, > { "time": "2012-12-20 23:35:42.133427", > "event": "reached_pg"}, > { "time": "2012-12-20 23:35:42.133684", > "event": "started"}, > { "time": "2012-12-20 23:35:42.133723", > "event": "started"}, > { "time": "2012-12-20 23:35:42.133796", > "event": "sub_op_sent"}, > 
{ "time": "2012-12-20 23:35:42.133908", > "event": "commit_queued_for_journal_write"}, > { "time": "2012-12-20 23:35:42.133995", > "event": "write_thread_in_journal_buffer"}, > { "time": "2012-12-20 23:35:42.136965", > "event": "journaled_completion_queued"}, > { "time": "2012-12-20 23:35:42.137025", > "event": "op_commit"}, > { "time": "2012-12-20 23:35:42.138456", > "event": "sub_op_commit_rec"}, > { "time": "2012-12-20 23:35:42.141408", > "event": "op_applied"}, > { "time": "2012-12-20 23:35:42.141460", > "event": "done"}]}, > { "description": "osd_sub_op_reply(client.7636.0:101512 6.1fe3 42463fe3\/rb.0.122c.6b8b4567.000000000357\/head\/\/6 [] ondisk, result = 0)", > "received_at": "2012-12-20 23:35:42.138087", > "age": "15.302095", > "duration": "0.000410", > "flag_point": "started", > "events": [ > { "time": "2012-12-20 23:35:42.138248", > "event": "waiting_for_osdmap"}, > { "time": "2012-12-20 23:35:42.138363", > "event": "reached_pg"}, > { "time": "2012-12-20 23:35:42.138443", > "event": "started"}, > { "time": "2012-12-20 23:35:42.138497", > "event": "done"}]}]} > > # ceph --admin-daemon ./ceph-osd.24.asok config show { "name": > "osd.24", > "cluster": "ceph", > "none": "0\/5", > "lockdep": "0\/1", > "context": "0\/1", > "crush": "1\/1", > "mds": "1\/5", > "mds_balancer": "1\/5", > "mds_locker": "1\/5", > "mds_log": "1\/5", > "mds_log_expire": "1\/5", > "mds_migrator": "1\/5", > "buffer": "0\/1", > "timer": "0\/1", > "filer": "0\/1", > "striper": "0\/1", > "objecter": "0\/1", > "rados": "0\/5", > "rbd": "0\/5", > "journaler": "0\/5", > "objectcacher": "0\/5", > "client": "0\/5", > "osd": "0\/5", > "optracker": "0\/5", > "objclass": "0\/5", > "filestore": "1\/3", > "journal": "1\/3", > "ms": "0\/5", > "mon": "1\/5", > "monc": "0\/10", > "paxos": "0\/5", > "tp": "0\/5", > "auth": "1\/5", > "crypto": "1\/5", > "finisher": "1\/1", > "heartbeatmap": "1\/5", > "perfcounter": "1\/5", > "rgw": "1\/5", > "hadoop": "1\/5", > "javaclient": "1\/5", > "asok": "1\/5", > 
"throttle": "1\/1", > "host": "KVM04", > "fsid": "00000000-0000-0000-0000-000000000000", > "public_addr": ":\/0", > "cluster_addr": ":\/0", > "public_network": "", > "cluster_network": "", > "num_client": "1", > "monmap": "", > "mon_host": "", > "lockdep": "false", > "admin_socket": "\/var\/run\/ceph\/ceph-osd.24.asok", > "daemonize": "true", > "pid_file": "\/var\/run\/ceph\/osd.24.pid", > "chdir": "\/", > "max_open_files": "0", > "fatal_signal_handlers": "true", > "log_file": "\/var\/log\/ceph\/osd.24.log", > "log_max_new": "1000", > "log_max_recent": "100000", > "log_to_stderr": "false", > "err_to_stderr": "true", > "log_to_syslog": "false", > "err_to_syslog": "false", > "log_flush_on_exit": "true", > "clog_to_monitors": "true", > "clog_to_syslog": "false", > "mon_cluster_log_to_syslog": "false", > "mon_cluster_log_file": "\/var\/log\/ceph\/ceph.log", > "key": "", > "keyfile": "", > "keyring": "\/srv\/ceph\/osd\/osd.24\/keyring", > "heartbeat_interval": "5", > "heartbeat_file": "", > "perf": "true", > "ms_tcp_nodelay": "true", > "ms_initial_backoff": "0.2", > "ms_max_backoff": "15", > "ms_nocrc": "false", > "ms_die_on_bad_msg": "false", > "ms_dispatch_throttle_bytes": "104857600", > "ms_bind_ipv6": "false", > "ms_bind_port_min": "6800", > "ms_bind_port_max": "7100", > "ms_rwthread_stack_bytes": "1048576", > "ms_tcp_read_timeout": "900", > "ms_inject_socket_failures": "0", > "ms_inject_delay_type": "", > "ms_inject_delay_max": "1", > "ms_inject_delay_probability": "0", > "mon_data": "\/var\/lib\/ceph\/mon\/ceph-24", > "mon_initial_members": "", > "mon_sync_fs_threshold": "5", > "mon_tick_interval": "5", > "mon_subscribe_interval": "300", > "mon_osd_laggy_halflife": "3600", > "mon_osd_laggy_weight": "0.3", > "mon_osd_adjust_heartbeat_grace": "true", > "mon_osd_adjust_down_out_interval": "true", > "mon_osd_auto_mark_in": "false", > "mon_osd_auto_mark_auto_out_in": "true", > "mon_osd_auto_mark_new_in": "true", > "mon_osd_down_out_interval": "300", > 
"mon_osd_min_up_ratio": "0.3", > "mon_osd_min_in_ratio": "0.3", > "mon_lease": "5", > "mon_lease_renew_interval": "3", > "mon_lease_ack_timeout": "10", > "mon_clock_drift_allowed": "0.05", > "mon_clock_drift_warn_backoff": "5", > "mon_accept_timeout": "10", > "mon_pg_create_interval": "30", > "mon_pg_stuck_threshold": "300", > "mon_osd_full_ratio": "0.95", > "mon_osd_nearfull_ratio": "0.85", > "mon_globalid_prealloc": "100", > "mon_osd_report_timeout": "900", > "mon_force_standby_active": "true", > "mon_min_osdmap_epochs": "500", > "mon_max_pgmap_epochs": "500", > "mon_max_log_epochs": "500", > "mon_max_osd": "10000", > "mon_probe_timeout": "2", > "mon_slurp_timeout": "10", > "mon_slurp_bytes": "262144", > "mon_client_bytes": "104857600", > "mon_daemon_bytes": "419430400", > "mon_max_log_entries_per_event": "4096", > "paxos_max_join_drift": "10", > "paxos_propose_interval": "1", > "paxos_min_wait": "0.05", > "clock_offset": "0", > "auth_cluster_required": "none", > "auth_service_required": "none", > "auth_client_required": "none", > "auth_supported": "none", > "cephx_require_signatures": "false", > "cephx_sign_messages": "true", > "auth_mon_ticket_ttl": "43200", > "auth_service_ticket_ttl": "3600", > "mon_client_hunt_interval": "3", > "mon_client_ping_interval": "10", > "mon_client_max_log_entries_per_message": "1000", > "mon_max_pool_pg_num": "65536", > "client_cache_size": "16384", > "client_cache_mid": "0.75", > "client_use_random_mds": "false", > "client_mount_timeout": "30", > "client_tick_interval": "1", > "client_trace": "", > "client_readahead_min": "131072", > "client_readahead_max_bytes": "0", > "client_readahead_max_periods": "4", > "client_snapdir": ".snap", > "client_mountpoint": "\/", > "client_notify_timeout": "10", > "client_caps_release_delay": "5", > "client_oc": "true", > "client_oc_size": "209715200", > "client_oc_max_dirty": "104857600", > "client_oc_target_dirty": "8388608", > "client_oc_max_dirty_age": "5", > "client_oc_max_objects": "1000", 
> "client_debug_force_sync_read": "false", > "client_debug_inject_tick_delay": "0", > "fuse_use_invalidate_cb": "false", > "fuse_big_writes": "true", > "fuse_debug": "false", > "objecter_tick_interval": "5", > "objecter_timeout": "10", > "objecter_inflight_op_bytes": "104857600", > "objecter_inflight_ops": "1024", > "journaler_allow_split_entries": "true", > "journaler_write_head_interval": "15", > "journaler_prefetch_periods": "10", > "journaler_prezero_periods": "5", > "journaler_batch_interval": "0.001", > "journaler_batch_max": "0", > "mds_data": "\/var\/lib\/ceph\/mds\/ceph-24", > "mds_max_file_size": "1099511627776", > "mds_cache_size": "100000", > "mds_cache_mid": "0.7", > "mds_mem_max": "1048576", > "mds_dir_commit_ratio": "0.5", > "mds_dir_max_commit_size": "90", > "mds_decay_halflife": "5", > "mds_beacon_interval": "4", > "mds_beacon_grace": "15", > "mds_blacklist_interval": "1440", > "mds_session_timeout": "60", > "mds_session_autoclose": "300", > "mds_reconnect_timeout": "45", > "mds_tick_interval": "5", > "mds_dirstat_min_interval": "1", > "mds_scatter_nudge_interval": "5", > "mds_client_prealloc_inos": "1000", > "mds_early_reply": "true", > "mds_use_tmap": "true", > "mds_default_dir_hash": "2", > "mds_log": "true", > "mds_log_skip_corrupt_events": "false", > "mds_log_max_events": "-1", > "mds_log_max_segments": "30", > "mds_log_max_expiring": "20", > "mds_bal_sample_interval": "3", > "mds_bal_replicate_threshold": "8000", > "mds_bal_unreplicate_threshold": "0", > "mds_bal_frag": "false", > "mds_bal_split_size": "10000", > "mds_bal_split_rd": "25000", > "mds_bal_split_wr": "10000", > "mds_bal_split_bits": "3", > "mds_bal_merge_size": "50", > "mds_bal_merge_rd": "1000", > "mds_bal_merge_wr": "1000", > "mds_bal_interval": "10", > "mds_bal_fragment_interval": "5", > "mds_bal_idle_threshold": "0", > "mds_bal_max": "-1", > "mds_bal_max_until": "-1", > "mds_bal_mode": "0", > "mds_bal_min_rebalance": "0.1", > "mds_bal_min_start": "0.2", > "mds_bal_need_min": 
"0.8", > "mds_bal_need_max": "1.2", > "mds_bal_midchunk": "0.3", > "mds_bal_minchunk": "0.001", > "mds_bal_target_removal_min": "5", > "mds_bal_target_removal_max": "10", > "mds_replay_interval": "1", > "mds_shutdown_check": "0", > "mds_thrash_exports": "0", > "mds_thrash_fragments": "0", > "mds_dump_cache_on_map": "false", > "mds_dump_cache_after_rejoin": "false", > "mds_verify_scatter": "false", > "mds_debug_scatterstat": "false", > "mds_debug_frag": "false", > "mds_debug_auth_pins": "false", > "mds_debug_subtrees": "false", > "mds_kill_mdstable_at": "0", > "mds_kill_export_at": "0", > "mds_kill_import_at": "0", > "mds_kill_link_at": "0", > "mds_kill_rename_at": "0", > "mds_wipe_sessions": "false", > "mds_wipe_ino_prealloc": "false", > "mds_skip_ino": "0", > "max_mds": "1", > "mds_standby_for_name": "", > "mds_standby_for_rank": "-1", > "mds_standby_replay": "false", > "osd_auto_upgrade_tmap": "true", > "osd_tmapput_sets_uses_tmap": "false", > "osd_max_backfills": "10", > "osd_backfill_full_ratio": "0.85", > "osd_backfill_retry_interval": "10", > "osd_uuid": "00000000-0000-0000-0000-000000000000", > "osd_data": "\/srv\/ceph\/osd\/osd.24", > "osd_journal": "\/dev\/sda3", > "osd_journal_size": "5120", > "osd_max_write_size": "90", > "osd_max_pgls": "1024", > "osd_client_message_size_cap": "524288000", > "osd_pg_bits": "6", > "osd_pgp_bits": "6", > "osd_min_rep": "1", > "osd_max_rep": "10", > "osd_pool_default_crush_rule": "0", > "osd_pool_default_size": "2", > "osd_pool_default_min_size": "0", > "osd_pool_default_pg_num": "8", > "osd_pool_default_pgp_num": "8", > "osd_map_dedup": "true", > "osd_map_cache_size": "500", > "osd_map_message_max": "100", > "osd_op_threads": "2", > "osd_disk_threads": "1", > "osd_recovery_threads": "1", > "osd_recover_clone_overlap": "true", > "osd_backfill_scan_min": "64", > "osd_backfill_scan_max": "512", > "osd_op_thread_timeout": "30", > "osd_recovery_thread_timeout": "30", > "osd_snap_trim_thread_timeout": "3600", > 
"osd_scrub_thread_timeout": "60", > "osd_scrub_finalize_thread_timeout": "600", > "osd_remove_thread_timeout": "3600", > "osd_command_thread_timeout": "600", > "osd_age": "0.8", > "osd_age_time": "0", > "osd_heartbeat_addr": ":\/0", > "osd_heartbeat_interval": "6", > "osd_heartbeat_grace": "20", > "osd_mon_heartbeat_interval": "30", > "osd_mon_report_interval_max": "120", > "osd_mon_report_interval_min": "5", > "osd_mon_ack_timeout": "30", > "osd_min_down_reporters": "1", > "osd_min_down_reports": "3", > "osd_default_data_pool_replay_window": "45", > "osd_preserve_trimmed_log": "false", > "osd_auto_mark_unfound_lost": "false", > "osd_recovery_delay_start": "0", > "osd_recovery_max_active": "50", > "osd_recovery_max_chunk": "8388608", > "osd_recovery_forget_lost_objects": "false", > "osd_max_scrubs": "1", > "osd_scrub_load_threshold": "0.5", > "osd_scrub_min_interval": "300", > "osd_scrub_max_interval": "86400", > "osd_deep_scrub_interval": "604800", > "osd_deep_scrub_stride": "524288", > "osd_auto_weight": "false", > "osd_class_dir": "\/usr\/local\/lib\/rados-classes", > "osd_check_for_log_corruption": "false", > "osd_use_stale_snap": "false", > "osd_rollback_to_cluster_snap": "", > "osd_default_notify_timeout": "30", > "osd_kill_backfill_at": "0", > "osd_min_pg_log_entries": "1000", > "osd_op_complaint_time": "30", > "osd_command_max_records": "256", > "osd_op_log_threshold": "5", > "osd_verify_sparse_read_holes": "false", > "osd_debug_drop_ping_probability": "0", > "osd_debug_drop_ping_duration": "0", > "osd_debug_drop_pg_create_probability": "0", > "osd_debug_drop_pg_create_duration": "1", > "osd_debug_drop_op_probability": "0", > "osd_op_history_size": "20", > "osd_op_history_duration": "600", > "osd_target_transaction_size": "300", > "osd_client_op_priority": "63", > "osd_recovery_op_priority": "10", > "filestore": "false", > "filestore_index_retry_probability": "0", > "filestore_debug_omap_check": "false", > "filestore_xattr_use_omap": "false", > 
"filestore_max_inline_xattr_size": "512", > "filestore_max_inline_xattrs": "2", > "filestore_max_sync_interval": "5", > "filestore_min_sync_interval": "0.01", > "filestore_btrfs_snap": "true", > "filestore_btrfs_clone_range": "true", > "filestore_fsync_flushes_journal_data": "false", > "filestore_fiemap": "false", > "filestore_flusher": "true", > "filestore_flusher_max_fds": "512", > "filestore_flush_min": "65536", > "filestore_sync_flush": "false", > "filestore_journal_parallel": "false", > "filestore_journal_writeahead": "false", > "filestore_journal_trailing": "false", > "filestore_queue_max_ops": "500", > "filestore_queue_max_bytes": "104857600", > "filestore_queue_committing_max_ops": "500", > "filestore_queue_committing_max_bytes": "104857600", > "filestore_op_threads": "2", > "filestore_op_thread_timeout": "60", > "filestore_op_thread_suicide_timeout": "180", > "filestore_commit_timeout": "600", > "filestore_fiemap_threshold": "4096", > "filestore_merge_threshold": "10", > "filestore_split_multiple": "2", > "filestore_update_to": "1000", > "filestore_blackhole": "false", > "filestore_dump_file": "", > "filestore_kill_at": "0", > "filestore_fail_eio": "true", > "journal_dio": "true", > "journal_aio": "false", > "journal_block_align": "true", > "journal_max_write_bytes": "10485760", > "journal_max_write_entries": "100", > "journal_queue_max_ops": "500", > "journal_queue_max_bytes": "104857600", > "journal_align_min_size": "65536", > "journal_replay_from": "0", > "journal_zero_on_create": "false", > "rbd_cache": "false", > "rbd_cache_size": "33554432", > "rbd_cache_max_dirty": "25165824", > "rbd_cache_target_dirty": "16777216", > "rbd_cache_max_dirty_age": "1", > "nss_db_path": "", > "rgw_data": "\/var\/lib\/ceph\/radosgw\/ceph-24", > "rgw_enable_apis": "s3, swift, swift_auth, admin", > "rgw_cache_enabled": "true", > "rgw_cache_lru_size": "10000", > "rgw_socket_path": "", > "rgw_dns_name": "", > "rgw_swift_url": "", > "rgw_swift_url_prefix": "swift", > 
"rgw_swift_auth_url": "", > "rgw_swift_auth_entry": "auth", > "rgw_keystone_url": "", > "rgw_keystone_admin_token": "", > "rgw_keystone_accepted_roles": "Member, admin", > "rgw_keystone_token_cache_size": "10000", > "rgw_keystone_revocation_interval": "900", > "rgw_admin_entry": "admin", > "rgw_enforce_swift_acls": "true", > "rgw_swift_token_expiration": "86400", > "rgw_print_continue": "true", > "rgw_remote_addr_param": "REMOTE_ADDR", > "rgw_op_thread_timeout": "600", > "rgw_op_thread_suicide_timeout": "0", > "rgw_thread_pool_size": "100", > "rgw_num_control_oids": "8", > "rgw_cluster_root_pool": ".rgw.root", > "rgw_log_nonexistent_bucket": "false", > "rgw_log_object_name": "%Y-%m-%d-%H-%i-%n", > "rgw_log_object_name_utc": "false", > "rgw_usage_max_shards": "32", > "rgw_usage_max_user_shards": "1", > "rgw_enable_ops_log": "true", > "rgw_enable_usage_log": "true", > "rgw_ops_log_rados": "true", > "rgw_ops_log_socket_path": "", > "rgw_ops_log_data_backlog": "5242880", > "rgw_usage_log_flush_threshold": "1024", > "rgw_usage_log_tick_interval": "30", > "rgw_intent_log_object_name": "%Y-%m-%d-%i-%n", > "rgw_intent_log_object_name_utc": "false", > "rgw_init_timeout": "30", > "rgw_mime_types_file": "\/etc\/mime.types", > "rgw_gc_max_objs": "32", > "rgw_gc_obj_min_wait": "7200", > "rgw_gc_processor_max_time": "3600", > "rgw_gc_processor_period": "3600", > "rgw_s3_success_create_obj_status": "0", > "rgw_resolve_cname": "false", > "rgw_obj_stripe_size": "4194304", > "rgw_extended_http_attrs": "", > "mutex_perf_counter": "false", > "internal_safe_to_start_threads": "true"} > > > -----Original Message----- > From: Mark Nelson [mailto:mark.nelson@inktank.com] > Sent: Thursday, 20 December 2012 11:28 PM > To: Matthew Anderson > Cc: ceph-devel@vger.kernel.org > Subject: Re: OSD's slow down to a crawl > > Hi Matt, > > Yeah, your log is looking pretty sparse! I imagine Sam will have a lot of questions for you. 
Out of curiosity have you looked at any io/cpu/memory stats when things slow down? sysprof may come in handy if it looks like it's CPU related. Also, check out the OSD admin socket. > Sebastien Han made a nice little blog entry describing it: > > http://www.sebastien-han.fr/blog/2012/08/14/ceph-admin-socket/ > > You can get a lot of information about the current state of things. > > Mark
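One way to read the `dump_historic_ops` records quoted in this thread is to turn the event timestamps into per-stage gaps. The sketch below is only an illustration: the helper name `event_gaps` is made up here, the event list is shortened to four of the entries quoted for op 101498, and it assumes `duration` is measured from `received_at` to the final `done` event.

```python
from datetime import datetime

FMT = "%Y-%m-%d %H:%M:%S.%f"  # timestamp layout used in the quoted dumps


def event_gaps(op):
    """Yield (event, seconds since the previous timestamp) for one op record."""
    previous = datetime.strptime(op["received_at"], FMT)
    for entry in op["events"]:
        now = datetime.strptime(entry["time"], FMT)
        yield entry["event"], (now - previous).total_seconds()
        previous = now


# Op 101498 as quoted in this thread (event list abbreviated to four entries).
op = {
    "received_at": "2012-12-20 23:34:42.012131",
    "age": "75.428051",
    "duration": "0.009988",
    "events": [
        {"time": "2012-12-20 23:34:42.012317", "event": "waiting_for_osdmap"},
        {"time": "2012-12-20 23:34:42.013803", "event": "op_commit"},
        {"time": "2012-12-20 23:34:42.017414", "event": "sub_op_commit_rec"},
        {"time": "2012-12-20 23:34:42.022119", "event": "done"},
    ],
}

gaps = list(event_gaps(op))
total = sum(seconds for _, seconds in gaps)
for event, seconds in gaps:
    print(f"{event:<22} +{seconds * 1000:.3f} ms")
print(f"total {total:.6f} s, reported duration {op['duration']} s")
```

Read this way, each quoted op finished in about 10 ms, while its `age` is tens of seconds. One possible interpretation, not a confirmed diagnosis, is that the OSD handled what it received promptly and that new ops were slow to arrive.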
* Re: OSD's slow down to a crawl
2012-12-20 16:08 ` Matthew Anderson
@ 2012-12-20 16:22 ` Mark Nelson
2012-12-20 16:34 ` Matthew Anderson
2012-12-20 16:30 ` Sage Weil
1 sibling, 1 reply; 13+ messages in thread
From: Mark Nelson @ 2012-12-20 16:22 UTC (permalink / raw)
To: Matthew Anderson; +Cc: ceph-devel@vger.kernel.org
Ok,
So the interesting news here is that the OSDs don't appear to be stalled exactly, but they do seem to be going really slow. With that new of a kernel and 0.55 you should have syncfs support so I don't think that's the problem. What kind of networking? Any possible problems there?
Mark
On 12/20/2012 10:08 AM, Matthew Anderson wrote:
> # rados bench 60 write -t 256 -p ssd
> Maintaining 256 concurrent writes of 4194304 bytes for at least 60 seconds.
> Object prefix: benchmark_data_KVM03_12985
>   sec Cur ops   started  finished  avg MB/s  cur MB/s  last lat   avg lat
>     0       0         0         0         0         0         -         0
>     1      45        45         0         0         0         -         0
>     2      71        71         0         0         0         -         0
>     3      90        90         0         0         0         -         0
>     4     125       125         0         0         0         -         0
>     5     136       136         0         0         0         -         0
>     6     144       144         0         0         0         -         0
>     7     175       175         0         0         0         -         0
>     8     201       201         0         0         0         -         0
>     9     217       217         0         0         0         -         0
>    10     228       228         0         0         0         -         0
>    11     251       251         0         0         0         -         0
>    12     255       266        11   3.66581   3.66667   11.7027   11.6013
>    13     255       271        16   4.92194        20   12.9795   11.9979
>    14     255       291        36   10.2833        80   13.1795   12.6478
>    15     255       301        46   12.2639        40   13.9424   12.9123
>    16     255       303        48   11.9973         8   14.0277   12.9579
>    17     255       321        66    15.526        72   15.1408   13.4616
>    18     255       345        90   19.9956        96   15.5673   13.9937
>    19     255       349        94   19.7851        16    15.707   14.0635
> 2012-12-21 00:06:37.198982min lat: 11.4904 max lat: 16.4053 avg lat: 14.3121
>   sec Cur ops   started  finished  avg MB/s  cur MB/s  last lat   avg lat
>    20     255       361       106   21.1953        48   16.4053   14.3121
>    21     255       367       112   21.3285        24   17.4263   14.4795
>    22     255       370       115   20.9043        12   17.3579   14.5544
>    23     255       387       132   22.9513        68   18.3185   15.0867
>    24     255       394       139   23.1614        28   18.2868   15.2504
>    25     255       404       149   23.8346        40    18.741   15.5048
>    26     255       410       155   23.8407        24    19.032   15.6358
>    27     255       410       155   22.9577         0         -   15.6358
>    28     255       410       155   22.1378         0         -   15.6358
>    29     255       411       156   21.5124   1.33333   22.6927   15.6811
>    30     255       431       176   23.4614        80   22.6577   16.4822
>    31     255       447       192   24.7687        64   23.2345   17.0563
>    32     255       464       209   26.1192        68   23.3788   17.5859
>    33     255       485       230   27.8726        84   22.8924   18.1122
>    34     255       497       242   28.4642        48    23.563   18.3693
>    35     255       508       253   28.9078        44   23.0516   18.5935
>    36     255       514       259   28.7713        24   24.4415   18.7278
>    37     255       530       275    29.723        64    23.847   19.0424
>    38     255       541       286   30.0985        44   24.5757   19.2427
>    39     255       544       289   29.6344        12   24.6802   19.2984
> 2012-12-21 00:06:57.203473min lat: 11.4904 max lat: 24.6802 avg lat: 19.2984
>   sec Cur ops   started  finished  avg MB/s  cur MB/s  last lat   avg lat
>    40     255       544       289   28.8935         0         -   19.2984
>    41     255       544       289   28.1888         0         -   19.2984
>    42     255       544       289   27.5176         0         -   19.2984
>    43     255       544       289   26.8776         0         -   19.2984
>    44     255       544       289   26.2668         0         -   19.2984
>    45     255       544       289   25.6831         0         -   19.2984
>    46     255       544       289   25.1248         0         -   19.2984
>    47     255       544       289   24.5902         0         -   19.2984
>    48     255       544       289   24.0779         0         -   19.2984
>    49     255       544       289   23.5865         0         -   19.2984
>    50     255       544       289   23.1148         0         -   19.2984
>    51     255       544       289   22.6616         0         -   19.2984
>    52     255       544       289   22.2258         0         -   19.2984
>    53     255       544       289   21.8065         0         -   19.2984
>    54     255       544       289   21.4026         0         -   19.2984
>    55     255       544       289   21.0135         0         -   19.2984
>    56     255       544       289   20.6383         0         -   19.2984
>    57     255       544       289   20.2762         0         -   19.2984
>    58     255       544       289   19.9266         0         -   19.2984
>    59     255       544       289   19.5889         0         -   19.2984
> 2012-12-21 00:07:17.207782min lat: 11.4904 max lat: 24.6802 avg lat: 19.2984
>   sec Cur ops   started  finished  avg MB/s  cur MB/s  last lat   avg lat
>    60     255       544       289   19.2624         0         -   19.2984
>    61     255       544       289   18.9466         0         -   19.2984
>    62     255       544       289    18.641         0         -   19.2984
>    63     255       544       289   18.3452         0         -   19.2984
>    64     255       544       289   18.0585         0         -   19.2984
>    65     255       544       289   17.7807         0         -   19.2984
>    66     255       544       289   17.5113         0         -   19.2984
>    67     133       545       412   24.5916   17.5714   37.3574   27.4834
> Total time run: 68.000740
> Total writes made: 545
> Write size: 4194304
> Bandwidth (MB/sec): 32.058
>
> Stddev Bandwidth: 26.8837
> Max bandwidth (MB/sec): 96
> Min bandwidth (MB/sec): 0
> Average Latency: 29.2801
> Stddev Latency: 11.9768
> Max latency: 52.6277
> Min latency: 11.4904
>
> ----Original Message-----
> From: Mark Nelson [mailto:mark.nelson@inktank.com]
> Sent: Thursday, 20 December 2012 11:59 PM
> To: Matthew Anderson
> Cc: ceph-devel@vger.kernel.org
> Subject: Re: OSD's slow down to a crawl
>
> Out of curiosity, if you fire up a rados bench instance on one of the nodes with say, 256 concurrent writes, do any of the writes complete?
>
> Mark
>
> On 12/20/2012 09:51 AM, Matthew Anderson wrote:
>> Hi Mark,
>>
>> Thanks for the quick reply. The log was 380MB in total so I just posted last 200 lines hoping that a trained eye might be able to spot something. I'm happy to put the file up on a web server if anyone wants to review it in full.
>>
>> It doesn't appear to be memory or CPU related. In it's hung state (both read and write are not responding at the moment, I have a VM hung trying to shutdown and an rbd stuck exporting ) all of the OSD's sit around 0.3% CPU usage, ~300MB of ram and no IO wait. The server has roughly 40GB RAM free and is easily 90% idle. Iotop shows no IO coming from any of the Ceph-OSD's.
>>
>> The admin socket showed no ops in flight but the performance dump, perhistoric ops and config are below. I did notice that the age of some of the ops was quite high considering things were actively trying to write to the cluster.
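The latency counters in a perf dump like the one quoted in this thread come as `avgcount`/`sum` pairs, so a mean per counter is `sum / avgcount`. Here is a minimal sketch of that arithmetic, assuming the sums for latency counters are in seconds; the helper name `counter_means` is hypothetical and the sample holds only a handful of counters copied from the osd.24 dump.

```python
import json


def counter_means(perf):
    """Return {"section.counter": sum / avgcount} for every avgcount/sum pair."""
    means = {}
    for section, counters in perf.items():
        for name, value in counters.items():
            if isinstance(value, dict) and "avgcount" in value and "sum" in value:
                count = value["avgcount"]
                if count:  # skip counters that never fired
                    means[f"{section}.{name}"] = value["sum"] / count
    return means


# A few counters copied from the osd.24 perf dump in this thread.
sample = json.loads("""
{"filestore": {"journal_latency": {"avgcount": 20868, "sum": 185.268022000},
               "apply_latency": {"avgcount": 20868, "sum": 508.896737000},
               "journal_full": 0},
 "osd": {"op_w_latency": {"avgcount": 6808, "sum": 65.349684000},
         "op_rw_latency": {"avgcount": 0, "sum": 0.000000000}}}
""")

for key, mean in sorted(counter_means(sample).items()):
    print(f"{key}: {mean * 1000:.2f} ms")
```

These are cumulative averages since the daemon started, so on their own they would not show a recent stall; comparing two dumps taken some time apart is likely more informative.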
>> # ceph --admin-daemon ./ceph-osd.24.asok perf dump
>>
>> {"filestore":{"journal_queue_max_ops":500,"journal_queue_ops":0,"journal_ops":20868,"journal_queue_max_bytes":104857600,"journal_queue_bytes":0,"journal_bytes":1710397238,"journal_latency":{"avgcount":20868,"sum":185.268022000},"journal_wr":13354,"journal_wr_bytes":{"avgcount":13354,"sum":1770647552},"op_queue_max_ops":500,"op_queue_ops":0,"ops":20868,"op_queue_max_bytes":104857600,"op_queue_bytes":0,"bytes":1710232583,"apply_latency":{"avgcount":20868,"sum":508.896737000},"committing":0,"commitcycle":339,"commitcycle_interval":{"avgcount":339,"sum":1708.284725000},"commitcycle_latency":{"avgcount":339,"sum":13.241850000},"journal_full":0},
>> "mutex-FileJournal::completions_lock":{"wait":{"avgcount":0,"sum":0.000000000}},
>> "mutex-FileJournal::finisher_lock":{"wait":{"avgcount":0,"sum":0.000000000}},
>> "mutex-FileJournal::write_lock":{"wait":{"avgcount":0,"sum":0.000000000}},
>> "mutex-FileJournal::writeq_lock":{"wait":{"avgcount":0,"sum":0.000000000}},
>> "mutex-JOS::ApplyManager::apply_lock":{"wait":{"avgcount":0,"sum":0.000000000}},
>> "mutex-JOS::ApplyManager::com_lock":{"wait":{"avgcount":0,"sum":0.000000000}},
>> "mutex-JOS::SubmitManager::lock":{"wait":{"avgcount":0,"sum":0.000000000}},
>> "osd":{"opq":0,"op_wip":0,"op":31753,"op_in_bytes":797806224,"op_out_bytes":1728414048,"op_latency":{"avgcount":31753,"sum":82.183373000},"op_r":24945,"op_r_out_bytes":1728414048,"op_r_latency":{"avgcount":24945,"sum":16.833689000},"op_w":6808,"op_w_in_bytes":797806224,"op_w_rlat":{"avgcount":6808,"sum":38.223006000},"op_w_latency":{"avgcount":6808,"sum":65.349684000},"op_rw":0,"op_rw_in_bytes":0,"op_rw_out_bytes":0,"op_rw_rlat":{"avgcount":0,"sum":0.000000000},"op_rw_latency":{"avgcount":0,"sum":0.000000000},"subop":5700,"subop_in_bytes":813224156,"subop_latency":{"avgcount":5700,"sum":26.877085000},"subop_w":0,"subop_w_in_bytes":813224156,"subop_w_latency":{"avgcount":5700,"sum":26.877085000},"subop_pull":0,"subop_pull_latency":{"avgcount":0,"sum":0.000000000},"subop_push":0,"subop_push_in_bytes":0,"subop_push_latency":{"avgcount":0,"sum":0.000000000},"pull":0,"push":0,"push_out_bytes":0,"push_in":0,"push_in_bytes":0,"recovery_ops":0,"loadavg":359,"buffer_bytes":0,"numpg":2486,"numpg_primary":1214,"numpg_replica":1272,"numpg_stray":0,"heartbeat_to_peers":4,"heartbeat_from_peers":0,"map_messages":44,"map_message_epochs":184,"map_message_epoch_dups":174},
>> "throttle-filestore_bytes":{"val":0,"max":104857600,"get":0,"get_sum":0,"get_or_fail_fail":0,"get_or_fail_success":0,"take":20868,"take_sum":1710397238,"put":13354,"put_sum":1710397238,"wait":{"avgcount":0,"sum":0.000000000}},
>> "throttle-filestore_ops":{"val":0,"max":500,"get":0,"get_sum":0,"get_or_fail_fail":0,"get_or_fail_success":0,"take":20868,"take_sum":20868,"put":13354,"put_sum":20868,"wait":{"avgcount":0,"sum":0.000000000}},
>> "throttle-msgr_dispatch_throttler-client":{"val":0,"max":104857600,"get":31657,"get_sum":795546573,"get_or_fail_fail":0,"get_or_fail_success":0,"take":0,"take_sum":0,"put":31657,"put_sum":795546573,"wait":{"avgcount":0,"sum":0.000000000}},
>> "throttle-msgr_dispatch_throttler-cluster":{"val":0,"max":104857600,"get":18913,"get_sum":828011293,"get_or_fail_fail":0,"get_or_fail_success":0,"take":0,"take_sum":0,"put":18913,"put_sum":828011293,"wait":{"avgcount":0,"sum":0.000000000}},
>> "throttle-msgr_dispatch_throttler-hbclient":{"val":0,"max":104857600,"get":4260,"get_sum":200220,"get_or_fail_fail":0,"get_or_fail_success":0,"take":0,"take_sum":0,"put":4260,"put_sum":200220,"wait":{"avgcount":0,"sum":0.000000000}},
>> "throttle-msgr_dispatch_throttler-hbserver":{"val":0,"max":104857600,"get":4236,"get_sum":199092,"get_or_fail_fail":0,"get_or_fail_success":0,"take":0,"take_sum":0,"put":4236,"put_sum":199092,"wait":{"avgcount":0,"sum":0.000000000}},
>> "throttle-osd_client_bytes":{"val":0,"max":524288000,"get":31308,"get_sum":795353720,"get_or_fail_fail":0,"get_or_fail_success":0,"take":0,"take_sum":0,"put":37671,"put_sum":795353720,"wait":{"avgcount":0,"sum":0.000000000}}}
>>
>> # ceph --admin-daemon ./ceph-osd.24.asok dump_historic_ops
>>
>> { "description": "osd_op(client.7636.0:101498 rb.0.122c.6b8b4567.000000000357 [write 3600384~4096] 6.42463fe3)",
>> "received_at": "2012-12-20 23:34:42.012131",
>> "age": "75.428051",
>> "duration": "0.009988",
>> "flag_point": "waiting for sub ops",
>> "client_info": { "client": "client.7636",
>> "tid": 101498},
>> "events": [
>> { "time": "2012-12-20 23:34:42.012317",
>> "event": "waiting_for_osdmap"},
>> { "time": "2012-12-20 23:34:42.012425",
>> "event": "reached_pg"},
>> { "time": "2012-12-20 23:34:42.012678",
>> "event": "started"},
>> { "time": "2012-12-20 23:34:42.012720",
>> "event": "started"},
>> { "time": "2012-12-20 23:34:42.012815",
>> "event": "sub_op_sent"},
>> { "time": "2012-12-20 23:34:42.012958",
>> "event": "commit_queued_for_journal_write"},
>> { "time": "2012-12-20 23:34:42.013033",
>> "event": "write_thread_in_journal_buffer"},
>> { "time": "2012-12-20 23:34:42.013745",
>> "event": "journaled_completion_queued"},
>> {
"time": "2012-12-20 23:34:42.013803", >> "event": "op_commit"}, >> { "time": "2012-12-20 23:34:42.017414", >> "event": "sub_op_commit_rec"}, >> { "time": "2012-12-20 23:34:42.022066", >> "event": "op_applied"}, >> { "time": "2012-12-20 23:34:42.022119", >> "event": "done"}]}, >> { "description": "osd_sub_op_reply(client.7636.0:101498 6.1fe3 42463fe3\/rb.0.122c.6b8b4567.000000000357\/head\/\/6 [] ondisk, result = 0)", >> "received_at": "2012-12-20 23:34:42.017048", >> "age": "75.423134", >> "duration": "0.000411", >> "flag_point": "started", >> "events": [ >> { "time": "2012-12-20 23:34:42.017218", >> "event": "waiting_for_osdmap"}, >> { "time": "2012-12-20 23:34:42.017363", >> "event": "reached_pg"}, >> { "time": "2012-12-20 23:34:42.017401", >> "event": "started"}, >> { "time": "2012-12-20 23:34:42.017459", >> "event": "done"}]}, >> { "description": "osd_op(client.7636.0:101502 rb.0.122c.6b8b4567.000000000064 [write 1512448~61440] 6.a241e34d)", >> "received_at": "2012-12-20 23:34:49.851021", >> "age": "67.589161", >> "duration": "0.007114", >> "flag_point": "waiting for sub ops", >> "client_info": { "client": "client.7636", >> "tid": 101502}, >> "events": [ >> { "time": "2012-12-20 23:34:49.851276", >> "event": "waiting_for_osdmap"}, >> { "time": "2012-12-20 23:34:49.851429", >> "event": "reached_pg"}, >> { "time": "2012-12-20 23:34:49.851616", >> "event": "started"}, >> { "time": "2012-12-20 23:34:49.851648", >> "event": "started"}, >> { "time": "2012-12-20 23:34:49.851712", >> "event": "sub_op_sent"}, >> { "time": "2012-12-20 23:34:49.851770", >> "event": "commit_queued_for_journal_write"}, >> { "time": "2012-12-20 23:34:49.851915", >> "event": "write_thread_in_journal_buffer"}, >> { "time": "2012-12-20 23:34:49.853956", >> "event": "journaled_completion_queued"}, >> { "time": "2012-12-20 23:34:49.853992", >> "event": "op_commit"}, >> { "time": "2012-12-20 23:34:49.857016", >> "event": "sub_op_commit_rec"}, >> { "time": "2012-12-20 23:34:49.858077", >> "event": 
"op_applied"}, >> { "time": "2012-12-20 23:34:49.858135", >> "event": "done"}]}, >> { "description": "osd_sub_op_reply(client.7636.0:101502 6.234d a241e34d\/rb.0.122c.6b8b4567.000000000064\/head\/\/6 [] ondisk, result = 0)", >> "received_at": "2012-12-20 23:34:49.856762", >> "age": "67.583420", >> "duration": "0.000308", >> "flag_point": "started", >> "events": [ >> { "time": "2012-12-20 23:34:49.856869", >> "event": "waiting_for_osdmap"}, >> { "time": "2012-12-20 23:34:49.856969", >> "event": "reached_pg"}, >> { "time": "2012-12-20 23:34:49.857001", >> "event": "started"}, >> { "time": "2012-12-20 23:34:49.857070", >> "event": "done"}]}, >> { "description": "osd_op(client.7636.0:101504 rb.0.122c.6b8b4567.00000000012c [write 1658880~2048] 6.92d2c38c)", >> "received_at": "2012-12-20 23:35:34.351518", >> "age": "23.088664", >> "duration": "0.009430", >> "flag_point": "waiting for sub ops", >> "client_info": { "client": "client.7636", >> "tid": 101504}, >> "events": [ >> { "time": "2012-12-20 23:35:34.351714", >> "event": "waiting_for_osdmap"}, >> { "time": "2012-12-20 23:35:34.351811", >> "event": "reached_pg"}, >> { "time": "2012-12-20 23:35:34.352030", >> "event": "started"}, >> { "time": "2012-12-20 23:35:34.352071", >> "event": "started"}, >> { "time": "2012-12-20 23:35:34.352146", >> "event": "sub_op_sent"}, >> { "time": "2012-12-20 23:35:34.352213", >> "event": "commit_queued_for_journal_write"}, >> { "time": "2012-12-20 23:35:34.352287", >> "event": "write_thread_in_journal_buffer"}, >> { "time": "2012-12-20 23:35:34.355390", >> "event": "journaled_completion_queued"}, >> { "time": "2012-12-20 23:35:34.355475", >> "event": "op_commit"}, >> { "time": "2012-12-20 23:35:34.356661", >> "event": "sub_op_commit_rec"}, >> { "time": "2012-12-20 23:35:34.360894", >> "event": "op_applied"}, >> { "time": "2012-12-20 23:35:34.360948", >> "event": "done"}]}, >> { "description": "osd_sub_op_reply(client.7636.0:101504 6.38c 
92d2c38c\/rb.0.122c.6b8b4567.00000000012c\/head\/\/6 [] ondisk, result = 0)", >> "received_at": "2012-12-20 23:35:34.356286", >> "age": "23.083896", >> "duration": "0.000421", >> "flag_point": "started", >> "events": [ >> { "time": "2012-12-20 23:35:34.356489", >> "event": "waiting_for_osdmap"}, >> { "time": "2012-12-20 23:35:34.356598", >> "event": "reached_pg"}, >> { "time": "2012-12-20 23:35:34.356640", >> "event": "started"}, >> { "time": "2012-12-20 23:35:34.356707", >> "event": "done"}]}, >> { "description": "osd_op(client.7636.0:101512 rb.0.122c.6b8b4567.000000000357 [write 3731456~4096] 6.42463fe3)", >> "received_at": "2012-12-20 23:35:42.133103", >> "age": "15.307079", >> "duration": "0.008357", >> "flag_point": "waiting for sub ops", >> "client_info": { "client": "client.7636", >> "tid": 101512}, >> "events": [ >> { "time": "2012-12-20 23:35:42.133315", >> "event": "waiting_for_osdmap"}, >> { "time": "2012-12-20 23:35:42.133427", >> "event": "reached_pg"}, >> { "time": "2012-12-20 23:35:42.133684", >> "event": "started"}, >> { "time": "2012-12-20 23:35:42.133723", >> "event": "started"}, >> { "time": "2012-12-20 23:35:42.133796", >> "event": "sub_op_sent"}, >> { "time": "2012-12-20 23:35:42.133908", >> "event": "commit_queued_for_journal_write"}, >> { "time": "2012-12-20 23:35:42.133995", >> "event": "write_thread_in_journal_buffer"}, >> { "time": "2012-12-20 23:35:42.136965", >> "event": "journaled_completion_queued"}, >> { "time": "2012-12-20 23:35:42.137025", >> "event": "op_commit"}, >> { "time": "2012-12-20 23:35:42.138456", >> "event": "sub_op_commit_rec"}, >> { "time": "2012-12-20 23:35:42.141408", >> "event": "op_applied"}, >> { "time": "2012-12-20 23:35:42.141460", >> "event": "done"}]}, >> { "description": "osd_sub_op_reply(client.7636.0:101512 6.1fe3 42463fe3\/rb.0.122c.6b8b4567.000000000357\/head\/\/6 [] ondisk, result = 0)", >> "received_at": "2012-12-20 23:35:42.138087", >> "age": "15.302095", >> "duration": "0.000410", >> "flag_point": 
"started", >> "events": [ >> { "time": "2012-12-20 23:35:42.138248", >> "event": "waiting_for_osdmap"}, >> { "time": "2012-12-20 23:35:42.138363", >> "event": "reached_pg"}, >> { "time": "2012-12-20 23:35:42.138443", >> "event": "started"}, >> { "time": "2012-12-20 23:35:42.138497", >> "event": "done"}]}]} >> >> # ceph --admin-daemon ./ceph-osd.24.asok config show { "name": >> "osd.24", >> "cluster": "ceph", >> "none": "0\/5", >> "lockdep": "0\/1", >> "context": "0\/1", >> "crush": "1\/1", >> "mds": "1\/5", >> "mds_balancer": "1\/5", >> "mds_locker": "1\/5", >> "mds_log": "1\/5", >> "mds_log_expire": "1\/5", >> "mds_migrator": "1\/5", >> "buffer": "0\/1", >> "timer": "0\/1", >> "filer": "0\/1", >> "striper": "0\/1", >> "objecter": "0\/1", >> "rados": "0\/5", >> "rbd": "0\/5", >> "journaler": "0\/5", >> "objectcacher": "0\/5", >> "client": "0\/5", >> "osd": "0\/5", >> "optracker": "0\/5", >> "objclass": "0\/5", >> "filestore": "1\/3", >> "journal": "1\/3", >> "ms": "0\/5", >> "mon": "1\/5", >> "monc": "0\/10", >> "paxos": "0\/5", >> "tp": "0\/5", >> "auth": "1\/5", >> "crypto": "1\/5", >> "finisher": "1\/1", >> "heartbeatmap": "1\/5", >> "perfcounter": "1\/5", >> "rgw": "1\/5", >> "hadoop": "1\/5", >> "javaclient": "1\/5", >> "asok": "1\/5", >> "throttle": "1\/1", >> "host": "KVM04", >> "fsid": "00000000-0000-0000-0000-000000000000", >> "public_addr": ":\/0", >> "cluster_addr": ":\/0", >> "public_network": "", >> "cluster_network": "", >> "num_client": "1", >> "monmap": "", >> "mon_host": "", >> "lockdep": "false", >> "admin_socket": "\/var\/run\/ceph\/ceph-osd.24.asok", >> "daemonize": "true", >> "pid_file": "\/var\/run\/ceph\/osd.24.pid", >> "chdir": "\/", >> "max_open_files": "0", >> "fatal_signal_handlers": "true", >> "log_file": "\/var\/log\/ceph\/osd.24.log", >> "log_max_new": "1000", >> "log_max_recent": "100000", >> "log_to_stderr": "false", >> "err_to_stderr": "true", >> "log_to_syslog": "false", >> "err_to_syslog": "false", >> "log_flush_on_exit": "true", 
>> "clog_to_monitors": "true", >> "clog_to_syslog": "false", >> "mon_cluster_log_to_syslog": "false", >> "mon_cluster_log_file": "\/var\/log\/ceph\/ceph.log", >> "key": "", >> "keyfile": "", >> "keyring": "\/srv\/ceph\/osd\/osd.24\/keyring", >> "heartbeat_interval": "5", >> "heartbeat_file": "", >> "perf": "true", >> "ms_tcp_nodelay": "true", >> "ms_initial_backoff": "0.2", >> "ms_max_backoff": "15", >> "ms_nocrc": "false", >> "ms_die_on_bad_msg": "false", >> "ms_dispatch_throttle_bytes": "104857600", >> "ms_bind_ipv6": "false", >> "ms_bind_port_min": "6800", >> "ms_bind_port_max": "7100", >> "ms_rwthread_stack_bytes": "1048576", >> "ms_tcp_read_timeout": "900", >> "ms_inject_socket_failures": "0", >> "ms_inject_delay_type": "", >> "ms_inject_delay_max": "1", >> "ms_inject_delay_probability": "0", >> "mon_data": "\/var\/lib\/ceph\/mon\/ceph-24", >> "mon_initial_members": "", >> "mon_sync_fs_threshold": "5", >> "mon_tick_interval": "5", >> "mon_subscribe_interval": "300", >> "mon_osd_laggy_halflife": "3600", >> "mon_osd_laggy_weight": "0.3", >> "mon_osd_adjust_heartbeat_grace": "true", >> "mon_osd_adjust_down_out_interval": "true", >> "mon_osd_auto_mark_in": "false", >> "mon_osd_auto_mark_auto_out_in": "true", >> "mon_osd_auto_mark_new_in": "true", >> "mon_osd_down_out_interval": "300", >> "mon_osd_min_up_ratio": "0.3", >> "mon_osd_min_in_ratio": "0.3", >> "mon_lease": "5", >> "mon_lease_renew_interval": "3", >> "mon_lease_ack_timeout": "10", >> "mon_clock_drift_allowed": "0.05", >> "mon_clock_drift_warn_backoff": "5", >> "mon_accept_timeout": "10", >> "mon_pg_create_interval": "30", >> "mon_pg_stuck_threshold": "300", >> "mon_osd_full_ratio": "0.95", >> "mon_osd_nearfull_ratio": "0.85", >> "mon_globalid_prealloc": "100", >> "mon_osd_report_timeout": "900", >> "mon_force_standby_active": "true", >> "mon_min_osdmap_epochs": "500", >> "mon_max_pgmap_epochs": "500", >> "mon_max_log_epochs": "500", >> "mon_max_osd": "10000", >> "mon_probe_timeout": "2", >> 
"mon_slurp_timeout": "10", >> "mon_slurp_bytes": "262144", >> "mon_client_bytes": "104857600", >> "mon_daemon_bytes": "419430400", >> "mon_max_log_entries_per_event": "4096", >> "paxos_max_join_drift": "10", >> "paxos_propose_interval": "1", >> "paxos_min_wait": "0.05", >> "clock_offset": "0", >> "auth_cluster_required": "none", >> "auth_service_required": "none", >> "auth_client_required": "none", >> "auth_supported": "none", >> "cephx_require_signatures": "false", >> "cephx_sign_messages": "true", >> "auth_mon_ticket_ttl": "43200", >> "auth_service_ticket_ttl": "3600", >> "mon_client_hunt_interval": "3", >> "mon_client_ping_interval": "10", >> "mon_client_max_log_entries_per_message": "1000", >> "mon_max_pool_pg_num": "65536", >> "client_cache_size": "16384", >> "client_cache_mid": "0.75", >> "client_use_random_mds": "false", >> "client_mount_timeout": "30", >> "client_tick_interval": "1", >> "client_trace": "", >> "client_readahead_min": "131072", >> "client_readahead_max_bytes": "0", >> "client_readahead_max_periods": "4", >> "client_snapdir": ".snap", >> "client_mountpoint": "\/", >> "client_notify_timeout": "10", >> "client_caps_release_delay": "5", >> "client_oc": "true", >> "client_oc_size": "209715200", >> "client_oc_max_dirty": "104857600", >> "client_oc_target_dirty": "8388608", >> "client_oc_max_dirty_age": "5", >> "client_oc_max_objects": "1000", >> "client_debug_force_sync_read": "false", >> "client_debug_inject_tick_delay": "0", >> "fuse_use_invalidate_cb": "false", >> "fuse_big_writes": "true", >> "fuse_debug": "false", >> "objecter_tick_interval": "5", >> "objecter_timeout": "10", >> "objecter_inflight_op_bytes": "104857600", >> "objecter_inflight_ops": "1024", >> "journaler_allow_split_entries": "true", >> "journaler_write_head_interval": "15", >> "journaler_prefetch_periods": "10", >> "journaler_prezero_periods": "5", >> "journaler_batch_interval": "0.001", >> "journaler_batch_max": "0", >> "mds_data": "\/var\/lib\/ceph\/mds\/ceph-24", >> 
"mds_max_file_size": "1099511627776", >> "mds_cache_size": "100000", >> "mds_cache_mid": "0.7", >> "mds_mem_max": "1048576", >> "mds_dir_commit_ratio": "0.5", >> "mds_dir_max_commit_size": "90", >> "mds_decay_halflife": "5", >> "mds_beacon_interval": "4", >> "mds_beacon_grace": "15", >> "mds_blacklist_interval": "1440", >> "mds_session_timeout": "60", >> "mds_session_autoclose": "300", >> "mds_reconnect_timeout": "45", >> "mds_tick_interval": "5", >> "mds_dirstat_min_interval": "1", >> "mds_scatter_nudge_interval": "5", >> "mds_client_prealloc_inos": "1000", >> "mds_early_reply": "true", >> "mds_use_tmap": "true", >> "mds_default_dir_hash": "2", >> "mds_log": "true", >> "mds_log_skip_corrupt_events": "false", >> "mds_log_max_events": "-1", >> "mds_log_max_segments": "30", >> "mds_log_max_expiring": "20", >> "mds_bal_sample_interval": "3", >> "mds_bal_replicate_threshold": "8000", >> "mds_bal_unreplicate_threshold": "0", >> "mds_bal_frag": "false", >> "mds_bal_split_size": "10000", >> "mds_bal_split_rd": "25000", >> "mds_bal_split_wr": "10000", >> "mds_bal_split_bits": "3", >> "mds_bal_merge_size": "50", >> "mds_bal_merge_rd": "1000", >> "mds_bal_merge_wr": "1000", >> "mds_bal_interval": "10", >> "mds_bal_fragment_interval": "5", >> "mds_bal_idle_threshold": "0", >> "mds_bal_max": "-1", >> "mds_bal_max_until": "-1", >> "mds_bal_mode": "0", >> "mds_bal_min_rebalance": "0.1", >> "mds_bal_min_start": "0.2", >> "mds_bal_need_min": "0.8", >> "mds_bal_need_max": "1.2", >> "mds_bal_midchunk": "0.3", >> "mds_bal_minchunk": "0.001", >> "mds_bal_target_removal_min": "5", >> "mds_bal_target_removal_max": "10", >> "mds_replay_interval": "1", >> "mds_shutdown_check": "0", >> "mds_thrash_exports": "0", >> "mds_thrash_fragments": "0", >> "mds_dump_cache_on_map": "false", >> "mds_dump_cache_after_rejoin": "false", >> "mds_verify_scatter": "false", >> "mds_debug_scatterstat": "false", >> "mds_debug_frag": "false", >> "mds_debug_auth_pins": "false", >> "mds_debug_subtrees": "false", 
>> "mds_kill_mdstable_at": "0", >> "mds_kill_export_at": "0", >> "mds_kill_import_at": "0", >> "mds_kill_link_at": "0", >> "mds_kill_rename_at": "0", >> "mds_wipe_sessions": "false", >> "mds_wipe_ino_prealloc": "false", >> "mds_skip_ino": "0", >> "max_mds": "1", >> "mds_standby_for_name": "", >> "mds_standby_for_rank": "-1", >> "mds_standby_replay": "false", >> "osd_auto_upgrade_tmap": "true", >> "osd_tmapput_sets_uses_tmap": "false", >> "osd_max_backfills": "10", >> "osd_backfill_full_ratio": "0.85", >> "osd_backfill_retry_interval": "10", >> "osd_uuid": "00000000-0000-0000-0000-000000000000", >> "osd_data": "\/srv\/ceph\/osd\/osd.24", >> "osd_journal": "\/dev\/sda3", >> "osd_journal_size": "5120", >> "osd_max_write_size": "90", >> "osd_max_pgls": "1024", >> "osd_client_message_size_cap": "524288000", >> "osd_pg_bits": "6", >> "osd_pgp_bits": "6", >> "osd_min_rep": "1", >> "osd_max_rep": "10", >> "osd_pool_default_crush_rule": "0", >> "osd_pool_default_size": "2", >> "osd_pool_default_min_size": "0", >> "osd_pool_default_pg_num": "8", >> "osd_pool_default_pgp_num": "8", >> "osd_map_dedup": "true", >> "osd_map_cache_size": "500", >> "osd_map_message_max": "100", >> "osd_op_threads": "2", >> "osd_disk_threads": "1", >> "osd_recovery_threads": "1", >> "osd_recover_clone_overlap": "true", >> "osd_backfill_scan_min": "64", >> "osd_backfill_scan_max": "512", >> "osd_op_thread_timeout": "30", >> "osd_recovery_thread_timeout": "30", >> "osd_snap_trim_thread_timeout": "3600", >> "osd_scrub_thread_timeout": "60", >> "osd_scrub_finalize_thread_timeout": "600", >> "osd_remove_thread_timeout": "3600", >> "osd_command_thread_timeout": "600", >> "osd_age": "0.8", >> "osd_age_time": "0", >> "osd_heartbeat_addr": ":\/0", >> "osd_heartbeat_interval": "6", >> "osd_heartbeat_grace": "20", >> "osd_mon_heartbeat_interval": "30", >> "osd_mon_report_interval_max": "120", >> "osd_mon_report_interval_min": "5", >> "osd_mon_ack_timeout": "30", >> "osd_min_down_reporters": "1", >> 
"osd_min_down_reports": "3", >> "osd_default_data_pool_replay_window": "45", >> "osd_preserve_trimmed_log": "false", >> "osd_auto_mark_unfound_lost": "false", >> "osd_recovery_delay_start": "0", >> "osd_recovery_max_active": "50", >> "osd_recovery_max_chunk": "8388608", >> "osd_recovery_forget_lost_objects": "false", >> "osd_max_scrubs": "1", >> "osd_scrub_load_threshold": "0.5", >> "osd_scrub_min_interval": "300", >> "osd_scrub_max_interval": "86400", >> "osd_deep_scrub_interval": "604800", >> "osd_deep_scrub_stride": "524288", >> "osd_auto_weight": "false", >> "osd_class_dir": "\/usr\/local\/lib\/rados-classes", >> "osd_check_for_log_corruption": "false", >> "osd_use_stale_snap": "false", >> "osd_rollback_to_cluster_snap": "", >> "osd_default_notify_timeout": "30", >> "osd_kill_backfill_at": "0", >> "osd_min_pg_log_entries": "1000", >> "osd_op_complaint_time": "30", >> "osd_command_max_records": "256", >> "osd_op_log_threshold": "5", >> "osd_verify_sparse_read_holes": "false", >> "osd_debug_drop_ping_probability": "0", >> "osd_debug_drop_ping_duration": "0", >> "osd_debug_drop_pg_create_probability": "0", >> "osd_debug_drop_pg_create_duration": "1", >> "osd_debug_drop_op_probability": "0", >> "osd_op_history_size": "20", >> "osd_op_history_duration": "600", >> "osd_target_transaction_size": "300", >> "osd_client_op_priority": "63", >> "osd_recovery_op_priority": "10", >> "filestore": "false", >> "filestore_index_retry_probability": "0", >> "filestore_debug_omap_check": "false", >> "filestore_xattr_use_omap": "false", >> "filestore_max_inline_xattr_size": "512", >> "filestore_max_inline_xattrs": "2", >> "filestore_max_sync_interval": "5", >> "filestore_min_sync_interval": "0.01", >> "filestore_btrfs_snap": "true", >> "filestore_btrfs_clone_range": "true", >> "filestore_fsync_flushes_journal_data": "false", >> "filestore_fiemap": "false", >> "filestore_flusher": "true", >> "filestore_flusher_max_fds": "512", >> "filestore_flush_min": "65536", >> 
"filestore_sync_flush": "false", >> "filestore_journal_parallel": "false", >> "filestore_journal_writeahead": "false", >> "filestore_journal_trailing": "false", >> "filestore_queue_max_ops": "500", >> "filestore_queue_max_bytes": "104857600", >> "filestore_queue_committing_max_ops": "500", >> "filestore_queue_committing_max_bytes": "104857600", >> "filestore_op_threads": "2", >> "filestore_op_thread_timeout": "60", >> "filestore_op_thread_suicide_timeout": "180", >> "filestore_commit_timeout": "600", >> "filestore_fiemap_threshold": "4096", >> "filestore_merge_threshold": "10", >> "filestore_split_multiple": "2", >> "filestore_update_to": "1000", >> "filestore_blackhole": "false", >> "filestore_dump_file": "", >> "filestore_kill_at": "0", >> "filestore_fail_eio": "true", >> "journal_dio": "true", >> "journal_aio": "false", >> "journal_block_align": "true", >> "journal_max_write_bytes": "10485760", >> "journal_max_write_entries": "100", >> "journal_queue_max_ops": "500", >> "journal_queue_max_bytes": "104857600", >> "journal_align_min_size": "65536", >> "journal_replay_from": "0", >> "journal_zero_on_create": "false", >> "rbd_cache": "false", >> "rbd_cache_size": "33554432", >> "rbd_cache_max_dirty": "25165824", >> "rbd_cache_target_dirty": "16777216", >> "rbd_cache_max_dirty_age": "1", >> "nss_db_path": "", >> "rgw_data": "\/var\/lib\/ceph\/radosgw\/ceph-24", >> "rgw_enable_apis": "s3, swift, swift_auth, admin", >> "rgw_cache_enabled": "true", >> "rgw_cache_lru_size": "10000", >> "rgw_socket_path": "", >> "rgw_dns_name": "", >> "rgw_swift_url": "", >> "rgw_swift_url_prefix": "swift", >> "rgw_swift_auth_url": "", >> "rgw_swift_auth_entry": "auth", >> "rgw_keystone_url": "", >> "rgw_keystone_admin_token": "", >> "rgw_keystone_accepted_roles": "Member, admin", >> "rgw_keystone_token_cache_size": "10000", >> "rgw_keystone_revocation_interval": "900", >> "rgw_admin_entry": "admin", >> "rgw_enforce_swift_acls": "true", >> "rgw_swift_token_expiration": "86400", >> 
"rgw_print_continue": "true", >> "rgw_remote_addr_param": "REMOTE_ADDR", >> "rgw_op_thread_timeout": "600", >> "rgw_op_thread_suicide_timeout": "0", >> "rgw_thread_pool_size": "100", >> "rgw_num_control_oids": "8", >> "rgw_cluster_root_pool": ".rgw.root", >> "rgw_log_nonexistent_bucket": "false", >> "rgw_log_object_name": "%Y-%m-%d-%H-%i-%n", >> "rgw_log_object_name_utc": "false", >> "rgw_usage_max_shards": "32", >> "rgw_usage_max_user_shards": "1", >> "rgw_enable_ops_log": "true", >> "rgw_enable_usage_log": "true", >> "rgw_ops_log_rados": "true", >> "rgw_ops_log_socket_path": "", >> "rgw_ops_log_data_backlog": "5242880", >> "rgw_usage_log_flush_threshold": "1024", >> "rgw_usage_log_tick_interval": "30", >> "rgw_intent_log_object_name": "%Y-%m-%d-%i-%n", >> "rgw_intent_log_object_name_utc": "false", >> "rgw_init_timeout": "30", >> "rgw_mime_types_file": "\/etc\/mime.types", >> "rgw_gc_max_objs": "32", >> "rgw_gc_obj_min_wait": "7200", >> "rgw_gc_processor_max_time": "3600", >> "rgw_gc_processor_period": "3600", >> "rgw_s3_success_create_obj_status": "0", >> "rgw_resolve_cname": "false", >> "rgw_obj_stripe_size": "4194304", >> "rgw_extended_http_attrs": "", >> "mutex_perf_counter": "false", >> "internal_safe_to_start_threads": "true"} >> >> >> -----Original Message----- >> From: Mark Nelson [mailto:mark.nelson@inktank.com] >> Sent: Thursday, 20 December 2012 11:28 PM >> To: Matthew Anderson >> Cc: ceph-devel@vger.kernel.org >> Subject: Re: OSD's slow down to a crawl >> >> Hi Matt, >> >> Yeah, your log is looking pretty sparse! I imagine Sam will have a lot of questions for you. Out of curiosity have you looked at any io/cpu/memory stats when things slow down? sysprof may come in handy if it looks like it's CPU related. Also, check out the OSD admin socket. >> Sebastien Han made a nice little blog entry describing it: >> >> http://www.sebastien-han.fr/blog/2012/08/14/ceph-admin-socket/ >> >> You can get a lot of information about the current state of things. 
>> >> Mark
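The perf dump output quoted in this thread reports most timings as avgcount/sum pairs, which are hard to read by eye. A minimal sketch for turning those pairs into mean values follows. It assumes the dump has been saved as valid JSON and that "sum" is a total in seconds; the function name average_latencies and the inlined sample are illustrative only, with values copied from the dump quoted in the thread.

```python
import json


def average_latencies(perf):
    """Return {"section.counter": mean} for every avgcount/sum pair.

    Counters with avgcount == 0 are skipped to avoid dividing by zero.
    Assumption: "sum" holds a total in seconds, so the mean is seconds per op.
    """
    averages = {}
    for section, counters in perf.items():
        if not isinstance(counters, dict):
            continue
        for name, value in counters.items():
            if isinstance(value, dict) and value.get("avgcount"):
                averages[f"{section}.{name}"] = value["sum"] / value["avgcount"]
    return averages


# A few counters copied from the perf dump quoted in this thread.
sample = json.loads("""
{"filestore": {"journal_latency": {"avgcount": 20868, "sum": 185.268022},
               "apply_latency": {"avgcount": 20868, "sum": 508.896737},
               "journal_full": 0},
 "osd": {"op_w_latency": {"avgcount": 6808, "sum": 65.349684},
         "op_rw_latency": {"avgcount": 0, "sum": 0.0}}}
""")

averages = average_latencies(sample)
for key in sorted(averages):
    print(f"{key}: {averages[key] * 1000:.2f} ms")
```

To run it against a live OSD, the output of "ceph --admin-daemon ./ceph-osd.24.asok perf dump" could be piped into json.load(sys.stdin) in place of the inlined sample.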
* RE: OSD's slow down to a crawl
2012-12-20 16:22 ` Mark Nelson
@ 2012-12-20 16:34 ` Matthew Anderson
0 siblings, 0 replies; 13+ messages in thread
From: Matthew Anderson @ 2012-12-20 16:34 UTC (permalink / raw)
To: 'Mark Nelson'; +Cc: ceph-devel@vger.kernel.org
In regards to syncfs, I'm running Scientific Linux 6.3, which has glibc 2.12 installed. The docs do mention that 0.55 and later don't require glibc syncfs support, only kernel support, but I thought I would mention it in case it is worth recompiling against a newer version of glibc.
Networking is IPoIB using the supplied kernel modules and Mellanox ConnectX-2 QDR cards. I've tested in both datagram and connected modes and there was no difference between them. Iperf shows roughly 13 Gbit/s between hosts, and there haven't been any packet drops or errors reported by ifconfig or the kernel stats for the IB card.
-----Original Message-----
From: ceph-devel-owner@vger.kernel.org [mailto:ceph-devel-owner@vger.kernel.org] On Behalf Of Mark Nelson
Sent: Friday, 21 December 2012 12:22 AM
To: Matthew Anderson
Cc: ceph-devel@vger.kernel.org
Subject: Re: OSD's slow down to a crawl
Ok,
So the interesting news here is that the OSDs don't appear to be stalled exactly, but they do seem to be going really slow. With that new of a kernel and 0.55 you should have syncfs support so I don't think that's the problem. What kind of networking? Any possible problems there?
Mark
On 12/20/2012 10:08 AM, Matthew Anderson wrote:
> # rados bench 60 write -t 256 -p ssd
> Maintaining 256 concurrent writes of 4194304 bytes for at least 60 seconds.
> Object prefix: benchmark_data_KVM03_12985
>   sec   Cur ops   started  finished  avg MB/s  cur MB/s  last lat   avg lat
>     0         0         0         0         0         0         -         0
>     1        45        45         0         0         0         -         0
>     2        71        71         0         0         0         -         0
>     3        90        90         0         0         0         -         0
>     4       125       125         0         0         0         -         0
>     5       136       136         0         0         0         -         0
>     6       144       144         0         0         0         -         0
>     7       175       175         0         0         0         -         0
>     8       201       201         0         0         0         -         0
>     9       217       217         0         0         0         -         0
>    10       228       228         0         0         0         -         0
>    11       251       251         0         0         0         -         0
>    12       255       266        11   3.66581   3.66667   11.7027   11.6013
>    13       255       271        16   4.92194        20   12.9795   11.9979
>    14       255       291        36   10.2833        80   13.1795   12.6478
>    15       255       301        46   12.2639        40   13.9424   12.9123
>    16       255       303        48   11.9973         8   14.0277   12.9579
>    17       255       321        66    15.526        72   15.1408   13.4616
>    18       255       345        90   19.9956        96   15.5673   13.9937
>    19       255       349        94   19.7851        16    15.707   14.0635
> 2012-12-21 00:06:37.198982min lat: 11.4904 max lat: 16.4053 avg lat: 14.3121
>   sec   Cur ops   started  finished  avg MB/s  cur MB/s  last lat   avg lat
>    20       255       361       106   21.1953        48   16.4053   14.3121
>    21       255       367       112   21.3285        24   17.4263   14.4795
>    22       255       370       115   20.9043        12   17.3579   14.5544
>    23       255       387       132   22.9513        68   18.3185   15.0867
>    24       255       394       139   23.1614        28   18.2868   15.2504
>    25       255       404       149   23.8346        40    18.741   15.5048
>    26       255       410       155   23.8407        24    19.032   15.6358
>    27       255       410       155   22.9577         0         -   15.6358
>    28       255       410       155   22.1378         0         -   15.6358
>    29       255       411       156   21.5124   1.33333   22.6927   15.6811
>    30       255       431       176   23.4614        80   22.6577   16.4822
>    31       255       447       192   24.7687        64   23.2345   17.0563
>    32       255       464       209   26.1192        68   23.3788   17.5859
>    33       255       485       230   27.8726        84   22.8924   18.1122
>    34       255       497       242   28.4642        48    23.563   18.3693
>    35       255       508       253   28.9078        44   23.0516   18.5935
>    36       255       514       259   28.7713        24   24.4415   18.7278
>    37       255       530       275    29.723        64    23.847   19.0424
>    38       255       541       286   30.0985        44   24.5757   19.2427
>    39       255       544       289   29.6344        12   24.6802   19.2984
> 2012-12-21 00:06:57.203473min lat: 11.4904 max lat: 24.6802 avg lat: 19.2984
>   sec   Cur ops   started  finished  avg MB/s  cur MB/s  last lat   avg lat
>    40       255       544       289   28.8935         0         -   19.2984
>    41       255       544       289   28.1888         0         -   19.2984
>    42       255       544       289   27.5176         0         -   19.2984
>    43       255       544       289   26.8776         0         -   19.2984
>    44       255       544       289   26.2668         0         -   19.2984
>    45       255       544       289   25.6831         0         -   19.2984
>    46       255       544       289   25.1248         0         -   19.2984
>    47       255       544       289   24.5902         0         -   19.2984
>    48       255       544       289   24.0779         0         -   19.2984
>    49       255       544       289   23.5865         0         -   19.2984
>    50       255       544       289   23.1148         0         -   19.2984
>    51       255       544       289   22.6616         0         -   19.2984
>    52       255       544       289   22.2258         0         -   19.2984
>    53       255       544       289   21.8065         0         -   19.2984
>    54       255       544       289   21.4026         0         -   19.2984
>    55       255       544       289   21.0135         0         -   19.2984
>    56       255       544       289   20.6383         0         -   19.2984
>    57       255       544       289   20.2762         0         -   19.2984
>    58       255       544       289   19.9266         0         -   19.2984
>    59       255       544       289   19.5889         0         -   19.2984
> 2012-12-21 00:07:17.207782min lat: 11.4904 max lat: 24.6802 avg lat: 19.2984
>   sec   Cur ops   started  finished  avg MB/s  cur MB/s  last lat   avg lat
>    60       255       544       289   19.2624         0         -   19.2984
>    61       255       544       289   18.9466         0         -   19.2984
>    62       255       544       289    18.641         0         -   19.2984
>    63       255       544       289   18.3452         0         -   19.2984
>    64       255       544       289   18.0585         0         -   19.2984
>    65       255       544       289   17.7807         0         -   19.2984
>    66       255       544       289   17.5113         0         -   19.2984
>    67       133       545       412   24.5916   17.5714   37.3574   27.4834
> Total time run: 68.000740
> Total writes made: 545
> Write size: 4194304
> Bandwidth (MB/sec): 32.058
>
> Stddev Bandwidth: 26.8837
> Max bandwidth (MB/sec): 96
> Min bandwidth (MB/sec): 0
> Average Latency: 29.2801
> Stddev Latency: 11.9768
> Max latency: 52.6277
> Min latency: 11.4904
>
> ----Original Message-----
> From: Mark Nelson [mailto:mark.nelson@inktank.com]
> Sent: Thursday, 20 December 2012 11:59 PM
> To: Matthew Anderson
> Cc: ceph-devel@vger.kernel.org
> Subject: Re: OSD's slow down to a crawl
>
> Out of curiosity, if you fire up a rados bench instance on one of the nodes with say, 256 concurrent writes, do any of the writes complete?
>
> Mark
>
> On 12/20/2012 09:51 AM, Matthew Anderson wrote:
>> Hi Mark,
>>
>> Thanks for the quick reply. The log was 380MB in total so I just posted the last 200 lines hoping that a trained eye might be able to spot something.
>> I'm happy to put the file up on a web server if anyone wants to review it in full.
>>
>> It doesn't appear to be memory or CPU related. In its hung state (both reads and writes are not responding at the moment; I have a VM hung trying to shut down and an RBD stuck exporting), all of the OSDs sit at around 0.3% CPU usage and ~300MB of RAM, with no IO wait. The server has roughly 40GB of RAM free and is easily 90% idle. Iotop shows no IO coming from any of the Ceph OSDs.
>>
>> The admin socket showed no ops in flight, but the perf dump, historic ops and config are below. I did notice that the age of some of the ops was quite high considering things were actively trying to write to the cluster.
>>
>> # ceph --admin-daemon ./ceph-osd.24.asok perf dump
>>
>> {"filestore":{"journal_queue_max_ops":500,"journal_queue_ops":0,"journal_ops":20868,"journal_queue_max_bytes":104857600,"journal_queue_bytes":0,"journal_bytes":1710397238,"journal_latency":{"avgcount":20868,"sum":185.268022000},"journal_wr":13354,"journal_wr_bytes":{"avgcount":13354,"sum":1770647552},"op_queue_max_ops":500,"op_queue_ops":0,"ops":20868,"op_queue_max_bytes":104857600,"op_queue_bytes":0,"bytes":1710232583,"apply_latency":{"avgcount":20868,"sum":508.896737000},"committing":0,"commitcycle":339,"commitcycle_interval":{"avgcount":339,"sum":1708.284725000},"commitcycle_latency":{"avgcount":339,"sum":13.241850000},"journal_full":0},"mutex-FileJournal::completions_lock":{"wait":{"avgcount":0,"sum":0.000000000}},"mutex-FileJournal::finisher_lock":{"wait":{"avgcount":0,"sum":0.000000000}},"mutex-FileJournal::write_lock":{"wait":{"avgcount":0,"sum":0.000000000}},"mutex-FileJournal::writeq_lock":{"wait":{"avgcount":0,"sum":0.000000000}},"mutex-JOS::ApplyManager::apply_lock":{"wait":{"avgcount":0,"sum":0.000000000}},"mutex-JOS::ApplyManager::com_lock":{"wait":{"avgcount":0,"sum":0.000000000}},"mutex-JOS::SubmitManager::lock":{"wait":{"avgcount":0,"sum":0.000000000}},"osd":{"opq":0,"op_wip":0,"op":31753,"op_in_bytes":797806224,"op_out_bytes":1728414048,"op_latency":{"avgcount":31753,"sum":82.183373000},"op_r":24945,"op_r_out_bytes":1728414048,"op_r_latency":{"avgcount":24945,"sum":16.833689000},"op_w":6808,"op_w_in_bytes":797806224,"op_w_rlat":{"avgcount":6808,"sum":38.223006000},"op_w_latency":{"avgcount":6808,"sum":65.349684000},"op_rw":0,"op_rw_in_bytes":0,"op_rw_out_bytes":0,"op_rw_rlat":{"avgcount":0,"sum":0.000000000},"op_rw_latency":{"avgcount":0,"sum":0.000000000},"subop":5700,"subop_in_bytes":813224156,"subop_latency":{"avgcount":5700,"sum":26.877085000},"subop_w":0,"subop_w_in_bytes":813224156,"subop_w_latency":{"avgcount":5700,"sum":26.877085000},"subop_pull":0,"subop_pull_latency":{"avgcount":0,"sum":0.000000000},"subop_push":0,"subop_push_in_bytes":0,"subop_push_latency":{"avgcount":0,"sum":0.000000000},"pull":0,"push":0,"push_out_bytes":0,"push_in":0,"push_in_bytes":0,"recovery_ops":0,"loadavg":359,"buffer_bytes":0,"numpg":2486,"numpg_primary":1214,"numpg_replica":1272,"numpg_stray":0,"heartbeat_to_peers":4,"heartbeat_from_peers":0,"map_messages":44,"map_message_epochs":184,"map_message_epoch_dups":174},"throttle-filestore_bytes":{"val":0,"max":104857600,"get":0,"get_sum":0,"get_or_fail_fail":0,"get_or_fail_success":0,"take":20868,"take_sum":1710397238,"put":13354,"put_sum":1710397238,"wait":{"avgcount":0,"sum":0.000000000}},"throttle-filestore_ops":{"val":0,"max":500,"get":0,"get_sum":0,"get_or_fail_fail":0,"get_or_fail_success":0,"take":20868,"take_sum":20868,"put":13354,"put_sum":20868,"wait":{"avgcount":0,"sum":0.000000000}},"throttle-msgr_dispatch_throttler-client":{"val":0,"max":104857600,"get":31657,"get_sum":795546573,"get_or_fail_fail":0,"get_or_fail_success":0,"take":0,"take_sum":0,"put":31657,"put_sum":795546573,"wait":{"avgcount":0,"sum":0.000000000}},"throttle-msgr_dispatch_throttler-cluster":{"val":0,"max":104857600,"get":18913,"get_sum":828011293,"get_or_fail_fail":0,"get_or_fail_success":0,"take":0,"take_sum":0,"put":18913,"put_sum":828011293,"wait":{"avgcount":0,"sum":0.000000000}},"throttle-msgr_dispatch_throttler-hbclient":{"val":0,"max":104857600,"get":4260,"get_sum":200220,"get_or_fail_fail":0,"get_or_fail_success":0,"take":0,"take_sum":0,"put":4260,"put_sum":200220,"wait":{"avgcount":0,"sum":0.000000000}},"throttle-msgr_dispatch_throttler-hbserver":{"val":0,"max":104857600,"get":4236,"get_sum":199092,"get_or_fail_fail":0,"get_or_fail_success":0,"take":0,"take_sum":0,"put":4236,"put_sum":199092,"wait":{"avgcount":0,"sum":0.000000000}},"throttle-osd_client_bytes":{"val":0,"max":524288000,"get":31308,"get_sum":795353720,"get_or_fail_fail":0,"get_or_fail_success":0,"take":0,"take_sum":0,"put":37671,"put_sum":795353720,"wait":{"avgcount":0,"sum":0.000000000}}}
>>
>> # ceph --admin-daemon ./ceph-osd.24.asok dump_historic_ops
>>
>> { "description": "osd_op(client.7636.0:101498 rb.0.122c.6b8b4567.000000000357 [write 3600384~4096] 6.42463fe3)",
>> "received_at": "2012-12-20 23:34:42.012131",
>> "age": "75.428051",
>> "duration": "0.009988",
>> "flag_point": "waiting for sub ops",
>> "client_info": { "client": "client.7636",
>> "tid": 101498},
>> "events": [
>> { "time": "2012-12-20 23:34:42.012317",
>> "event": "waiting_for_osdmap"},
>> { "time": "2012-12-20 23:34:42.012425",
>> "event": "reached_pg"},
>> { "time": "2012-12-20 23:34:42.012678",
>> "event": "started"},
>> { "time": "2012-12-20 23:34:42.012720",
>> "event": "started"},
>> { "time": "2012-12-20 23:34:42.012815",
>> "event": "sub_op_sent"},
>> { "time": "2012-12-20 23:34:42.012958",
>> "event": "commit_queued_for_journal_write"},
>> { "time": "2012-12-20 23:34:42.013033",
>> "event": "write_thread_in_journal_buffer"},
>> { "time": "2012-12-20 
23:34:42.013745", >> "event": "journaled_completion_queued"}, >> { "time": "2012-12-20 23:34:42.013803", >> "event": "op_commit"}, >> { "time": "2012-12-20 23:34:42.017414", >> "event": "sub_op_commit_rec"}, >> { "time": "2012-12-20 23:34:42.022066", >> "event": "op_applied"}, >> { "time": "2012-12-20 23:34:42.022119", >> "event": "done"}]}, >> { "description": "osd_sub_op_reply(client.7636.0:101498 6.1fe3 42463fe3\/rb.0.122c.6b8b4567.000000000357\/head\/\/6 [] ondisk, result = 0)", >> "received_at": "2012-12-20 23:34:42.017048", >> "age": "75.423134", >> "duration": "0.000411", >> "flag_point": "started", >> "events": [ >> { "time": "2012-12-20 23:34:42.017218", >> "event": "waiting_for_osdmap"}, >> { "time": "2012-12-20 23:34:42.017363", >> "event": "reached_pg"}, >> { "time": "2012-12-20 23:34:42.017401", >> "event": "started"}, >> { "time": "2012-12-20 23:34:42.017459", >> "event": "done"}]}, >> { "description": "osd_op(client.7636.0:101502 rb.0.122c.6b8b4567.000000000064 [write 1512448~61440] 6.a241e34d)", >> "received_at": "2012-12-20 23:34:49.851021", >> "age": "67.589161", >> "duration": "0.007114", >> "flag_point": "waiting for sub ops", >> "client_info": { "client": "client.7636", >> "tid": 101502}, >> "events": [ >> { "time": "2012-12-20 23:34:49.851276", >> "event": "waiting_for_osdmap"}, >> { "time": "2012-12-20 23:34:49.851429", >> "event": "reached_pg"}, >> { "time": "2012-12-20 23:34:49.851616", >> "event": "started"}, >> { "time": "2012-12-20 23:34:49.851648", >> "event": "started"}, >> { "time": "2012-12-20 23:34:49.851712", >> "event": "sub_op_sent"}, >> { "time": "2012-12-20 23:34:49.851770", >> "event": "commit_queued_for_journal_write"}, >> { "time": "2012-12-20 23:34:49.851915", >> "event": "write_thread_in_journal_buffer"}, >> { "time": "2012-12-20 23:34:49.853956", >> "event": "journaled_completion_queued"}, >> { "time": "2012-12-20 23:34:49.853992", >> "event": "op_commit"}, >> { "time": "2012-12-20 23:34:49.857016", >> "event": 
"sub_op_commit_rec"}, >> { "time": "2012-12-20 23:34:49.858077", >> "event": "op_applied"}, >> { "time": "2012-12-20 23:34:49.858135", >> "event": "done"}]}, >> { "description": "osd_sub_op_reply(client.7636.0:101502 6.234d a241e34d\/rb.0.122c.6b8b4567.000000000064\/head\/\/6 [] ondisk, result = 0)", >> "received_at": "2012-12-20 23:34:49.856762", >> "age": "67.583420", >> "duration": "0.000308", >> "flag_point": "started", >> "events": [ >> { "time": "2012-12-20 23:34:49.856869", >> "event": "waiting_for_osdmap"}, >> { "time": "2012-12-20 23:34:49.856969", >> "event": "reached_pg"}, >> { "time": "2012-12-20 23:34:49.857001", >> "event": "started"}, >> { "time": "2012-12-20 23:34:49.857070", >> "event": "done"}]}, >> { "description": "osd_op(client.7636.0:101504 rb.0.122c.6b8b4567.00000000012c [write 1658880~2048] 6.92d2c38c)", >> "received_at": "2012-12-20 23:35:34.351518", >> "age": "23.088664", >> "duration": "0.009430", >> "flag_point": "waiting for sub ops", >> "client_info": { "client": "client.7636", >> "tid": 101504}, >> "events": [ >> { "time": "2012-12-20 23:35:34.351714", >> "event": "waiting_for_osdmap"}, >> { "time": "2012-12-20 23:35:34.351811", >> "event": "reached_pg"}, >> { "time": "2012-12-20 23:35:34.352030", >> "event": "started"}, >> { "time": "2012-12-20 23:35:34.352071", >> "event": "started"}, >> { "time": "2012-12-20 23:35:34.352146", >> "event": "sub_op_sent"}, >> { "time": "2012-12-20 23:35:34.352213", >> "event": "commit_queued_for_journal_write"}, >> { "time": "2012-12-20 23:35:34.352287", >> "event": "write_thread_in_journal_buffer"}, >> { "time": "2012-12-20 23:35:34.355390", >> "event": "journaled_completion_queued"}, >> { "time": "2012-12-20 23:35:34.355475", >> "event": "op_commit"}, >> { "time": "2012-12-20 23:35:34.356661", >> "event": "sub_op_commit_rec"}, >> { "time": "2012-12-20 23:35:34.360894", >> "event": "op_applied"}, >> { "time": "2012-12-20 23:35:34.360948", >> "event": "done"}]}, >> { "description": 
"osd_sub_op_reply(client.7636.0:101504 6.38c 92d2c38c\/rb.0.122c.6b8b4567.00000000012c\/head\/\/6 [] ondisk, result = 0)", >> "received_at": "2012-12-20 23:35:34.356286", >> "age": "23.083896", >> "duration": "0.000421", >> "flag_point": "started", >> "events": [ >> { "time": "2012-12-20 23:35:34.356489", >> "event": "waiting_for_osdmap"}, >> { "time": "2012-12-20 23:35:34.356598", >> "event": "reached_pg"}, >> { "time": "2012-12-20 23:35:34.356640", >> "event": "started"}, >> { "time": "2012-12-20 23:35:34.356707", >> "event": "done"}]}, >> { "description": "osd_op(client.7636.0:101512 rb.0.122c.6b8b4567.000000000357 [write 3731456~4096] 6.42463fe3)", >> "received_at": "2012-12-20 23:35:42.133103", >> "age": "15.307079", >> "duration": "0.008357", >> "flag_point": "waiting for sub ops", >> "client_info": { "client": "client.7636", >> "tid": 101512}, >> "events": [ >> { "time": "2012-12-20 23:35:42.133315", >> "event": "waiting_for_osdmap"}, >> { "time": "2012-12-20 23:35:42.133427", >> "event": "reached_pg"}, >> { "time": "2012-12-20 23:35:42.133684", >> "event": "started"}, >> { "time": "2012-12-20 23:35:42.133723", >> "event": "started"}, >> { "time": "2012-12-20 23:35:42.133796", >> "event": "sub_op_sent"}, >> { "time": "2012-12-20 23:35:42.133908", >> "event": "commit_queued_for_journal_write"}, >> { "time": "2012-12-20 23:35:42.133995", >> "event": "write_thread_in_journal_buffer"}, >> { "time": "2012-12-20 23:35:42.136965", >> "event": "journaled_completion_queued"}, >> { "time": "2012-12-20 23:35:42.137025", >> "event": "op_commit"}, >> { "time": "2012-12-20 23:35:42.138456", >> "event": "sub_op_commit_rec"}, >> { "time": "2012-12-20 23:35:42.141408", >> "event": "op_applied"}, >> { "time": "2012-12-20 23:35:42.141460", >> "event": "done"}]}, >> { "description": "osd_sub_op_reply(client.7636.0:101512 6.1fe3 42463fe3\/rb.0.122c.6b8b4567.000000000357\/head\/\/6 [] ondisk, result = 0)", >> "received_at": "2012-12-20 23:35:42.138087", >> "age": "15.302095", >> 
"duration": "0.000410", >> "flag_point": "started", >> "events": [ >> { "time": "2012-12-20 23:35:42.138248", >> "event": "waiting_for_osdmap"}, >> { "time": "2012-12-20 23:35:42.138363", >> "event": "reached_pg"}, >> { "time": "2012-12-20 23:35:42.138443", >> "event": "started"}, >> { "time": "2012-12-20 23:35:42.138497", >> "event": "done"}]}]} >> >> # ceph --admin-daemon ./ceph-osd.24.asok config show { "name": >> "osd.24", >> "cluster": "ceph", >> "none": "0\/5", >> "lockdep": "0\/1", >> "context": "0\/1", >> "crush": "1\/1", >> "mds": "1\/5", >> "mds_balancer": "1\/5", >> "mds_locker": "1\/5", >> "mds_log": "1\/5", >> "mds_log_expire": "1\/5", >> "mds_migrator": "1\/5", >> "buffer": "0\/1", >> "timer": "0\/1", >> "filer": "0\/1", >> "striper": "0\/1", >> "objecter": "0\/1", >> "rados": "0\/5", >> "rbd": "0\/5", >> "journaler": "0\/5", >> "objectcacher": "0\/5", >> "client": "0\/5", >> "osd": "0\/5", >> "optracker": "0\/5", >> "objclass": "0\/5", >> "filestore": "1\/3", >> "journal": "1\/3", >> "ms": "0\/5", >> "mon": "1\/5", >> "monc": "0\/10", >> "paxos": "0\/5", >> "tp": "0\/5", >> "auth": "1\/5", >> "crypto": "1\/5", >> "finisher": "1\/1", >> "heartbeatmap": "1\/5", >> "perfcounter": "1\/5", >> "rgw": "1\/5", >> "hadoop": "1\/5", >> "javaclient": "1\/5", >> "asok": "1\/5", >> "throttle": "1\/1", >> "host": "KVM04", >> "fsid": "00000000-0000-0000-0000-000000000000", >> "public_addr": ":\/0", >> "cluster_addr": ":\/0", >> "public_network": "", >> "cluster_network": "", >> "num_client": "1", >> "monmap": "", >> "mon_host": "", >> "lockdep": "false", >> "admin_socket": "\/var\/run\/ceph\/ceph-osd.24.asok", >> "daemonize": "true", >> "pid_file": "\/var\/run\/ceph\/osd.24.pid", >> "chdir": "\/", >> "max_open_files": "0", >> "fatal_signal_handlers": "true", >> "log_file": "\/var\/log\/ceph\/osd.24.log", >> "log_max_new": "1000", >> "log_max_recent": "100000", >> "log_to_stderr": "false", >> "err_to_stderr": "true", >> "log_to_syslog": "false", >> "err_to_syslog": 
"false", >> "log_flush_on_exit": "true", >> "clog_to_monitors": "true", >> "clog_to_syslog": "false", >> "mon_cluster_log_to_syslog": "false", >> "mon_cluster_log_file": "\/var\/log\/ceph\/ceph.log", >> "key": "", >> "keyfile": "", >> "keyring": "\/srv\/ceph\/osd\/osd.24\/keyring", >> "heartbeat_interval": "5", >> "heartbeat_file": "", >> "perf": "true", >> "ms_tcp_nodelay": "true", >> "ms_initial_backoff": "0.2", >> "ms_max_backoff": "15", >> "ms_nocrc": "false", >> "ms_die_on_bad_msg": "false", >> "ms_dispatch_throttle_bytes": "104857600", >> "ms_bind_ipv6": "false", >> "ms_bind_port_min": "6800", >> "ms_bind_port_max": "7100", >> "ms_rwthread_stack_bytes": "1048576", >> "ms_tcp_read_timeout": "900", >> "ms_inject_socket_failures": "0", >> "ms_inject_delay_type": "", >> "ms_inject_delay_max": "1", >> "ms_inject_delay_probability": "0", >> "mon_data": "\/var\/lib\/ceph\/mon\/ceph-24", >> "mon_initial_members": "", >> "mon_sync_fs_threshold": "5", >> "mon_tick_interval": "5", >> "mon_subscribe_interval": "300", >> "mon_osd_laggy_halflife": "3600", >> "mon_osd_laggy_weight": "0.3", >> "mon_osd_adjust_heartbeat_grace": "true", >> "mon_osd_adjust_down_out_interval": "true", >> "mon_osd_auto_mark_in": "false", >> "mon_osd_auto_mark_auto_out_in": "true", >> "mon_osd_auto_mark_new_in": "true", >> "mon_osd_down_out_interval": "300", >> "mon_osd_min_up_ratio": "0.3", >> "mon_osd_min_in_ratio": "0.3", >> "mon_lease": "5", >> "mon_lease_renew_interval": "3", >> "mon_lease_ack_timeout": "10", >> "mon_clock_drift_allowed": "0.05", >> "mon_clock_drift_warn_backoff": "5", >> "mon_accept_timeout": "10", >> "mon_pg_create_interval": "30", >> "mon_pg_stuck_threshold": "300", >> "mon_osd_full_ratio": "0.95", >> "mon_osd_nearfull_ratio": "0.85", >> "mon_globalid_prealloc": "100", >> "mon_osd_report_timeout": "900", >> "mon_force_standby_active": "true", >> "mon_min_osdmap_epochs": "500", >> "mon_max_pgmap_epochs": "500", >> "mon_max_log_epochs": "500", >> "mon_max_osd": "10000", >> 
"mon_probe_timeout": "2", >> "mon_slurp_timeout": "10", >> "mon_slurp_bytes": "262144", >> "mon_client_bytes": "104857600", >> "mon_daemon_bytes": "419430400", >> "mon_max_log_entries_per_event": "4096", >> "paxos_max_join_drift": "10", >> "paxos_propose_interval": "1", >> "paxos_min_wait": "0.05", >> "clock_offset": "0", >> "auth_cluster_required": "none", >> "auth_service_required": "none", >> "auth_client_required": "none", >> "auth_supported": "none", >> "cephx_require_signatures": "false", >> "cephx_sign_messages": "true", >> "auth_mon_ticket_ttl": "43200", >> "auth_service_ticket_ttl": "3600", >> "mon_client_hunt_interval": "3", >> "mon_client_ping_interval": "10", >> "mon_client_max_log_entries_per_message": "1000", >> "mon_max_pool_pg_num": "65536", >> "client_cache_size": "16384", >> "client_cache_mid": "0.75", >> "client_use_random_mds": "false", >> "client_mount_timeout": "30", >> "client_tick_interval": "1", >> "client_trace": "", >> "client_readahead_min": "131072", >> "client_readahead_max_bytes": "0", >> "client_readahead_max_periods": "4", >> "client_snapdir": ".snap", >> "client_mountpoint": "\/", >> "client_notify_timeout": "10", >> "client_caps_release_delay": "5", >> "client_oc": "true", >> "client_oc_size": "209715200", >> "client_oc_max_dirty": "104857600", >> "client_oc_target_dirty": "8388608", >> "client_oc_max_dirty_age": "5", >> "client_oc_max_objects": "1000", >> "client_debug_force_sync_read": "false", >> "client_debug_inject_tick_delay": "0", >> "fuse_use_invalidate_cb": "false", >> "fuse_big_writes": "true", >> "fuse_debug": "false", >> "objecter_tick_interval": "5", >> "objecter_timeout": "10", >> "objecter_inflight_op_bytes": "104857600", >> "objecter_inflight_ops": "1024", >> "journaler_allow_split_entries": "true", >> "journaler_write_head_interval": "15", >> "journaler_prefetch_periods": "10", >> "journaler_prezero_periods": "5", >> "journaler_batch_interval": "0.001", >> "journaler_batch_max": "0", >> "mds_data": 
"\/var\/lib\/ceph\/mds\/ceph-24", >> "mds_max_file_size": "1099511627776", >> "mds_cache_size": "100000", >> "mds_cache_mid": "0.7", >> "mds_mem_max": "1048576", >> "mds_dir_commit_ratio": "0.5", >> "mds_dir_max_commit_size": "90", >> "mds_decay_halflife": "5", >> "mds_beacon_interval": "4", >> "mds_beacon_grace": "15", >> "mds_blacklist_interval": "1440", >> "mds_session_timeout": "60", >> "mds_session_autoclose": "300", >> "mds_reconnect_timeout": "45", >> "mds_tick_interval": "5", >> "mds_dirstat_min_interval": "1", >> "mds_scatter_nudge_interval": "5", >> "mds_client_prealloc_inos": "1000", >> "mds_early_reply": "true", >> "mds_use_tmap": "true", >> "mds_default_dir_hash": "2", >> "mds_log": "true", >> "mds_log_skip_corrupt_events": "false", >> "mds_log_max_events": "-1", >> "mds_log_max_segments": "30", >> "mds_log_max_expiring": "20", >> "mds_bal_sample_interval": "3", >> "mds_bal_replicate_threshold": "8000", >> "mds_bal_unreplicate_threshold": "0", >> "mds_bal_frag": "false", >> "mds_bal_split_size": "10000", >> "mds_bal_split_rd": "25000", >> "mds_bal_split_wr": "10000", >> "mds_bal_split_bits": "3", >> "mds_bal_merge_size": "50", >> "mds_bal_merge_rd": "1000", >> "mds_bal_merge_wr": "1000", >> "mds_bal_interval": "10", >> "mds_bal_fragment_interval": "5", >> "mds_bal_idle_threshold": "0", >> "mds_bal_max": "-1", >> "mds_bal_max_until": "-1", >> "mds_bal_mode": "0", >> "mds_bal_min_rebalance": "0.1", >> "mds_bal_min_start": "0.2", >> "mds_bal_need_min": "0.8", >> "mds_bal_need_max": "1.2", >> "mds_bal_midchunk": "0.3", >> "mds_bal_minchunk": "0.001", >> "mds_bal_target_removal_min": "5", >> "mds_bal_target_removal_max": "10", >> "mds_replay_interval": "1", >> "mds_shutdown_check": "0", >> "mds_thrash_exports": "0", >> "mds_thrash_fragments": "0", >> "mds_dump_cache_on_map": "false", >> "mds_dump_cache_after_rejoin": "false", >> "mds_verify_scatter": "false", >> "mds_debug_scatterstat": "false", >> "mds_debug_frag": "false", >> "mds_debug_auth_pins": 
"false", >> "mds_debug_subtrees": "false", >> "mds_kill_mdstable_at": "0", >> "mds_kill_export_at": "0", >> "mds_kill_import_at": "0", >> "mds_kill_link_at": "0", >> "mds_kill_rename_at": "0", >> "mds_wipe_sessions": "false", >> "mds_wipe_ino_prealloc": "false", >> "mds_skip_ino": "0", >> "max_mds": "1", >> "mds_standby_for_name": "", >> "mds_standby_for_rank": "-1", >> "mds_standby_replay": "false", >> "osd_auto_upgrade_tmap": "true", >> "osd_tmapput_sets_uses_tmap": "false", >> "osd_max_backfills": "10", >> "osd_backfill_full_ratio": "0.85", >> "osd_backfill_retry_interval": "10", >> "osd_uuid": "00000000-0000-0000-0000-000000000000", >> "osd_data": "\/srv\/ceph\/osd\/osd.24", >> "osd_journal": "\/dev\/sda3", >> "osd_journal_size": "5120", >> "osd_max_write_size": "90", >> "osd_max_pgls": "1024", >> "osd_client_message_size_cap": "524288000", >> "osd_pg_bits": "6", >> "osd_pgp_bits": "6", >> "osd_min_rep": "1", >> "osd_max_rep": "10", >> "osd_pool_default_crush_rule": "0", >> "osd_pool_default_size": "2", >> "osd_pool_default_min_size": "0", >> "osd_pool_default_pg_num": "8", >> "osd_pool_default_pgp_num": "8", >> "osd_map_dedup": "true", >> "osd_map_cache_size": "500", >> "osd_map_message_max": "100", >> "osd_op_threads": "2", >> "osd_disk_threads": "1", >> "osd_recovery_threads": "1", >> "osd_recover_clone_overlap": "true", >> "osd_backfill_scan_min": "64", >> "osd_backfill_scan_max": "512", >> "osd_op_thread_timeout": "30", >> "osd_recovery_thread_timeout": "30", >> "osd_snap_trim_thread_timeout": "3600", >> "osd_scrub_thread_timeout": "60", >> "osd_scrub_finalize_thread_timeout": "600", >> "osd_remove_thread_timeout": "3600", >> "osd_command_thread_timeout": "600", >> "osd_age": "0.8", >> "osd_age_time": "0", >> "osd_heartbeat_addr": ":\/0", >> "osd_heartbeat_interval": "6", >> "osd_heartbeat_grace": "20", >> "osd_mon_heartbeat_interval": "30", >> "osd_mon_report_interval_max": "120", >> "osd_mon_report_interval_min": "5", >> "osd_mon_ack_timeout": "30", >> 
"osd_min_down_reporters": "1", >> "osd_min_down_reports": "3", >> "osd_default_data_pool_replay_window": "45", >> "osd_preserve_trimmed_log": "false", >> "osd_auto_mark_unfound_lost": "false", >> "osd_recovery_delay_start": "0", >> "osd_recovery_max_active": "50", >> "osd_recovery_max_chunk": "8388608", >> "osd_recovery_forget_lost_objects": "false", >> "osd_max_scrubs": "1", >> "osd_scrub_load_threshold": "0.5", >> "osd_scrub_min_interval": "300", >> "osd_scrub_max_interval": "86400", >> "osd_deep_scrub_interval": "604800", >> "osd_deep_scrub_stride": "524288", >> "osd_auto_weight": "false", >> "osd_class_dir": "\/usr\/local\/lib\/rados-classes", >> "osd_check_for_log_corruption": "false", >> "osd_use_stale_snap": "false", >> "osd_rollback_to_cluster_snap": "", >> "osd_default_notify_timeout": "30", >> "osd_kill_backfill_at": "0", >> "osd_min_pg_log_entries": "1000", >> "osd_op_complaint_time": "30", >> "osd_command_max_records": "256", >> "osd_op_log_threshold": "5", >> "osd_verify_sparse_read_holes": "false", >> "osd_debug_drop_ping_probability": "0", >> "osd_debug_drop_ping_duration": "0", >> "osd_debug_drop_pg_create_probability": "0", >> "osd_debug_drop_pg_create_duration": "1", >> "osd_debug_drop_op_probability": "0", >> "osd_op_history_size": "20", >> "osd_op_history_duration": "600", >> "osd_target_transaction_size": "300", >> "osd_client_op_priority": "63", >> "osd_recovery_op_priority": "10", >> "filestore": "false", >> "filestore_index_retry_probability": "0", >> "filestore_debug_omap_check": "false", >> "filestore_xattr_use_omap": "false", >> "filestore_max_inline_xattr_size": "512", >> "filestore_max_inline_xattrs": "2", >> "filestore_max_sync_interval": "5", >> "filestore_min_sync_interval": "0.01", >> "filestore_btrfs_snap": "true", >> "filestore_btrfs_clone_range": "true", >> "filestore_fsync_flushes_journal_data": "false", >> "filestore_fiemap": "false", >> "filestore_flusher": "true", >> "filestore_flusher_max_fds": "512", >> 
"filestore_flush_min": "65536", >> "filestore_sync_flush": "false", >> "filestore_journal_parallel": "false", >> "filestore_journal_writeahead": "false", >> "filestore_journal_trailing": "false", >> "filestore_queue_max_ops": "500", >> "filestore_queue_max_bytes": "104857600", >> "filestore_queue_committing_max_ops": "500", >> "filestore_queue_committing_max_bytes": "104857600", >> "filestore_op_threads": "2", >> "filestore_op_thread_timeout": "60", >> "filestore_op_thread_suicide_timeout": "180", >> "filestore_commit_timeout": "600", >> "filestore_fiemap_threshold": "4096", >> "filestore_merge_threshold": "10", >> "filestore_split_multiple": "2", >> "filestore_update_to": "1000", >> "filestore_blackhole": "false", >> "filestore_dump_file": "", >> "filestore_kill_at": "0", >> "filestore_fail_eio": "true", >> "journal_dio": "true", >> "journal_aio": "false", >> "journal_block_align": "true", >> "journal_max_write_bytes": "10485760", >> "journal_max_write_entries": "100", >> "journal_queue_max_ops": "500", >> "journal_queue_max_bytes": "104857600", >> "journal_align_min_size": "65536", >> "journal_replay_from": "0", >> "journal_zero_on_create": "false", >> "rbd_cache": "false", >> "rbd_cache_size": "33554432", >> "rbd_cache_max_dirty": "25165824", >> "rbd_cache_target_dirty": "16777216", >> "rbd_cache_max_dirty_age": "1", >> "nss_db_path": "", >> "rgw_data": "\/var\/lib\/ceph\/radosgw\/ceph-24", >> "rgw_enable_apis": "s3, swift, swift_auth, admin", >> "rgw_cache_enabled": "true", >> "rgw_cache_lru_size": "10000", >> "rgw_socket_path": "", >> "rgw_dns_name": "", >> "rgw_swift_url": "", >> "rgw_swift_url_prefix": "swift", >> "rgw_swift_auth_url": "", >> "rgw_swift_auth_entry": "auth", >> "rgw_keystone_url": "", >> "rgw_keystone_admin_token": "", >> "rgw_keystone_accepted_roles": "Member, admin", >> "rgw_keystone_token_cache_size": "10000", >> "rgw_keystone_revocation_interval": "900", >> "rgw_admin_entry": "admin", >> "rgw_enforce_swift_acls": "true", >> 
"rgw_swift_token_expiration": "86400", >> "rgw_print_continue": "true", >> "rgw_remote_addr_param": "REMOTE_ADDR", >> "rgw_op_thread_timeout": "600", >> "rgw_op_thread_suicide_timeout": "0", >> "rgw_thread_pool_size": "100", >> "rgw_num_control_oids": "8", >> "rgw_cluster_root_pool": ".rgw.root", >> "rgw_log_nonexistent_bucket": "false", >> "rgw_log_object_name": "%Y-%m-%d-%H-%i-%n", >> "rgw_log_object_name_utc": "false", >> "rgw_usage_max_shards": "32", >> "rgw_usage_max_user_shards": "1", >> "rgw_enable_ops_log": "true", >> "rgw_enable_usage_log": "true", >> "rgw_ops_log_rados": "true", >> "rgw_ops_log_socket_path": "", >> "rgw_ops_log_data_backlog": "5242880", >> "rgw_usage_log_flush_threshold": "1024", >> "rgw_usage_log_tick_interval": "30", >> "rgw_intent_log_object_name": "%Y-%m-%d-%i-%n", >> "rgw_intent_log_object_name_utc": "false", >> "rgw_init_timeout": "30", >> "rgw_mime_types_file": "\/etc\/mime.types", >> "rgw_gc_max_objs": "32", >> "rgw_gc_obj_min_wait": "7200", >> "rgw_gc_processor_max_time": "3600", >> "rgw_gc_processor_period": "3600", >> "rgw_s3_success_create_obj_status": "0", >> "rgw_resolve_cname": "false", >> "rgw_obj_stripe_size": "4194304", >> "rgw_extended_http_attrs": "", >> "mutex_perf_counter": "false", >> "internal_safe_to_start_threads": "true"} >> >> >> -----Original Message----- >> From: Mark Nelson [mailto:mark.nelson@inktank.com] >> Sent: Thursday, 20 December 2012 11:28 PM >> To: Matthew Anderson >> Cc: ceph-devel@vger.kernel.org >> Subject: Re: OSD's slow down to a crawl >> >> Hi Matt, >> >> Yeah, your log is looking pretty sparse! I imagine Sam will have a lot of questions for you. Out of curiosity have you looked at any io/cpu/memory stats when things slow down? sysprof may come in handy if it looks like it's CPU related. Also, check out the OSD admin socket. 
>> Sebastien Han made a nice little blog entry describing it:
>>
>> http://www.sebastien-han.fr/blog/2012/08/14/ceph-admin-socket/
>>
>> You can get a lot of information about the current state of things.
>>
>> Mark
>>
>
> --
> To unsubscribe from this list: send the line "unsubscribe ceph-devel"
> in the body of a message to majordomo@vger.kernel.org More majordomo
> info at http://vger.kernel.org/majordomo-info.html
^ permalink raw reply [flat|nested] 13+ messages in thread
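The `perf dump` output quoted in this thread reports most latencies as an `avgcount`/`sum` pair rather than as a ready-made average. A minimal sketch, assuming `sum` is the accumulated time in seconds and `avgcount` the number of samples, of how the mean per-operation latency could be pulled out of such a dump. The helper name `average_latencies` is hypothetical, and the sample values are copied from the osd.24 dump quoted in the thread:

```python
import json


def average_latencies(perf):
    """Return {"section.counter": mean_seconds} for every avgcount/sum pair.

    Assumption: "sum" is total seconds and "avgcount" is the sample count.
    Counters with no samples (avgcount == 0) are skipped.
    """
    means = {}
    for section, counters in perf.items():
        if not isinstance(counters, dict):
            continue
        for name, value in counters.items():
            if isinstance(value, dict) and "avgcount" in value and "sum" in value:
                if value["avgcount"]:
                    means[f"{section}.{name}"] = value["sum"] / value["avgcount"]
    return means


# Subset of the osd.24 perf dump quoted in this thread.
sample = json.loads("""
{"filestore": {"journal_latency": {"avgcount": 20868, "sum": 185.268022},
               "apply_latency": {"avgcount": 20868, "sum": 508.896737},
               "commitcycle_latency": {"avgcount": 339, "sum": 13.24185},
               "journal_full": 0},
 "mutex-FileJournal::write_lock": {"wait": {"avgcount": 0, "sum": 0.0}},
 "osd": {"op_w_latency": {"avgcount": 6808, "sum": 65.349684}}}
""")

for key, mean in sorted(average_latencies(sample).items()):
    print(f"{key}: {mean * 1000:.2f} ms")
```

To run it against a live daemon, the JSON printed by `ceph --admin-daemon ./ceph-osd.24.asok perf dump` could be saved to a file and loaded with `json.load` in place of the inline sample.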
* RE: OSD's slow down to a crawl
2012-12-20 16:08 ` Matthew Anderson
2012-12-20 16:22 ` Mark Nelson
@ 2012-12-20 16:30 ` Sage Weil
[not found] ` <38A500831D3DE24B90BD200D6C8701351BB15433@Exchange2010-2.corit.local>
1 sibling, 1 reply; 13+ messages in thread
From: Sage Weil @ 2012-12-20 16:30 UTC (permalink / raw)
To: Matthew Anderson; +Cc: 'Mark Nelson', ceph-devel@vger.kernel.org
[-- Attachment #1: Type: TEXT/PLAIN, Size: 41033 bytes --]
Can you do a similar test, but with full logging on?

ceph tell osd.0 injectargs '--debug-ms 1 --debug-filestore 20 --debug-osd 20 --debug-journal 20'
rados -p ssd bench 30 write -b 4096 -t 1 --log-file /tmp/foo --debug-ms 1

That will be a single IO in flight at a time and very easy to trace through the logs. If you can post the resulting log file (/tmp/foo and from osd.0), that would be awesome.

Thanks!
sage

On Thu, 20 Dec 2012, Matthew Anderson wrote:
> # rados bench 60 write -t 256 -p ssd
> Maintaining 256 concurrent writes of 4194304 bytes for at least 60 seconds.
> Object prefix: benchmark_data_KVM03_12985
>   sec   Cur ops   started  finished  avg MB/s  cur MB/s  last lat   avg lat
>     0         0         0         0         0         0         -         0
>     1        45        45         0         0         0         -         0
>     2        71        71         0         0         0         -         0
>     3        90        90         0         0         0         -         0
>     4       125       125         0         0         0         -         0
>     5       136       136         0         0         0         -         0
>     6       144       144         0         0         0         -         0
>     7       175       175         0         0         0         -         0
>     8       201       201         0         0         0         -         0
>     9       217       217         0         0         0         -         0
>    10       228       228         0         0         0         -         0
>    11       251       251         0         0         0         -         0
>    12       255       266        11   3.66581   3.66667   11.7027   11.6013
>    13       255       271        16   4.92194        20   12.9795   11.9979
>    14       255       291        36   10.2833        80   13.1795   12.6478
>    15       255       301        46   12.2639        40   13.9424   12.9123
>    16       255       303        48   11.9973         8   14.0277   12.9579
>    17       255       321        66    15.526        72   15.1408   13.4616
>    18       255       345        90   19.9956        96   15.5673   13.9937
>    19       255       349        94   19.7851        16    15.707   14.0635
> 2012-12-21 00:06:37.198982min lat: 11.4904 max lat: 16.4053 avg lat: 14.3121
>   sec   Cur ops   started  finished  avg MB/s  cur MB/s  last lat   avg lat
>    20       255       361       106   21.1953        48   16.4053   14.3121
>    21       255       367       112   21.3285        24   17.4263   14.4795
>    22       255       370       115   20.9043        12   17.3579   14.5544
>    23       255       387       132   22.9513        68   18.3185   15.0867
>    24       255       394       139   23.1614        28   18.2868   15.2504
>    25       255       404       149   23.8346        40    18.741   15.5048
>    26       255       410       155   23.8407        24    19.032   15.6358
>    27       255       410       155   22.9577         0         -   15.6358
>    28       255       410       155   22.1378         0         -   15.6358
>    29       255       411       156   21.5124   1.33333   22.6927   15.6811
>    30       255       431       176   23.4614        80   22.6577   16.4822
>    31       255       447       192   24.7687        64   23.2345   17.0563
>    32       255       464       209   26.1192        68   23.3788   17.5859
>    33       255       485       230   27.8726        84   22.8924   18.1122
>    34       255       497       242   28.4642        48    23.563   18.3693
>    35       255       508       253   28.9078        44   23.0516   18.5935
>    36       255       514       259   28.7713        24   24.4415   18.7278
>    37       255       530       275    29.723        64    23.847   19.0424
>    38       255       541       286   30.0985        44   24.5757   19.2427
>    39       255       544       289   29.6344        12   24.6802   19.2984
> 2012-12-21 00:06:57.203473min lat: 11.4904 max lat: 24.6802 avg lat: 19.2984
>   sec   Cur ops   started  finished  avg MB/s  cur MB/s  last lat   avg lat
>    40       255       544       289   28.8935         0         -   19.2984
>    41       255       544       289   28.1888         0         -   19.2984
>    42       255       544       289   27.5176         0         -   19.2984
>    43       255       544       289   26.8776         0         -   19.2984
>    44       255       544       289   26.2668         0         -   19.2984
>    45       255       544       289   25.6831         0         -   19.2984
>    46       255       544       289   25.1248         0         -   19.2984
>    47       255       544       289   24.5902         0         -   19.2984
>    48       255       544       289   24.0779         0         -   19.2984
>    49       255       544       289   23.5865         0         -   19.2984
>    50       255       544       289   23.1148         0         -   19.2984
>    51       255       544       289   22.6616         0         -   19.2984
>    52       255       544       289   22.2258         0         -   19.2984
>    53       255       544       289   21.8065         0         -   19.2984
>    54       255       544       289   21.4026         0         -   19.2984
>    55       255       544       289   21.0135         0         -   19.2984
>    56       255       544       289   20.6383         0         -   19.2984
>    57       255       544       289   20.2762         0         -   19.2984
>    58       255       544       289   19.9266         0         -   19.2984
>    59       255       544       289   19.5889         0         -   19.2984
> 2012-12-21 00:07:17.207782min lat: 11.4904 max lat: 24.6802 avg lat: 19.2984
>   sec   Cur ops   started  finished  avg MB/s  cur MB/s  last lat   avg lat
>    60       255       544       289   19.2624         0         -   19.2984
>    61       255       544       289   18.9466         0         -   19.2984
>    62       255       544       289    18.641         0         -   19.2984
>    63       255       544       289   18.3452         0         -   19.2984
>    64       255       544       289   18.0585         0         -   19.2984
>    65       255       544       289   17.7807         0         -   19.2984
>    66       255       544       289   17.5113         0         -   19.2984
>    67       133       545       412   24.5916   17.5714   37.3574   27.4834
> Total time run: 68.000740
> Total writes made: 545
> Write size: 4194304
> Bandwidth (MB/sec): 32.058
>
> Stddev Bandwidth: 26.8837
> Max bandwidth (MB/sec): 96
> Min bandwidth (MB/sec): 0
> Average Latency: 29.2801
> Stddev Latency: 11.9768
> Max latency: 52.6277
> Min latency: 11.4904
>
>
> -----Original Message-----
> From: Mark Nelson [mailto:mark.nelson@inktank.com]
> Sent: Thursday, 20 December 2012 11:59 PM
> To: Matthew Anderson
> Cc: ceph-devel@vger.kernel.org
> Subject: Re: OSD's slow down to a crawl
>
> Out of curiosity, if you fire up a rados bench instance on one of the nodes with, say, 256 concurrent writes, do any of the writes complete?
>
> Mark
>
> On 12/20/2012 09:51 AM, Matthew Anderson wrote:
> > Hi Mark,
> >
> > Thanks for the quick reply. The log was 380MB in total so I just posted the last 200 lines hoping that a trained eye might be able to spot something.
I'm happy to put the file up on a web server if anyone wants to review it in full.
> >
> > It doesn't appear to be memory or CPU related. In its hung state (both read and write are not responding at the moment; I have a VM hung trying to shut down and an rbd stuck exporting) all of the OSDs sit around 0.3% CPU usage, ~300MB of RAM and no IO wait. The server has roughly 40GB RAM free and is easily 90% idle. Iotop shows no IO coming from any of the Ceph OSDs.
> >
> > The admin socket showed no ops in flight, but the performance dump, historic ops and config are below. I did notice that the age of some of the ops was quite high considering things were actively trying to write to the cluster.
> > # ceph --admin-daemon ./ceph-osd.24.asok perf dump
> >
> > {"filestore":{"journal_queue_max_ops":500,"journal_queue_ops":0,"journ > > al_ops":20868,"journal_queue_max_bytes":104857600,"journal_queue_bytes > > ":0,"journal_bytes":1710397238,"journal_latency":{"avgcount":20868,"su > > m":185.268022000},"journal_wr":13354,"journal_wr_bytes":{"avgcount":13 > > 354,"sum":1770647552},"op_queue_max_ops":500,"op_queue_ops":0,"ops":20 > > 868,"op_queue_max_bytes":104857600,"op_queue_bytes":0,"bytes":17102325 > > 83,"apply_latency":{"avgcount":20868,"sum":508.896737000},"committing" > > :0,"commitcycle":339,"commitcycle_interval":{"avgcount":339,"sum":1708 > > .284725000},"commitcycle_latency":{"avgcount":339,"sum":13.241850000}, > > "journal_full":0},"mutex-FileJournal::completions_lock":{"wait":{"avgc > > ount":0,"sum":0.000000000}},"mutex-FileJournal::finisher_lock":{"wait" > > :{"avgcount":0,"sum":0.000000000}},"mutex-FileJournal::write_lock":{"w > > ait":{"avgcount":0,"sum":0.000000000}},"mutex-FileJournal::writeq_lock > > ":{"wait":{"avgcount":0,"sum":0.000000000}},"mutex-JOS::ApplyManager:: > > apply_lo > ck":{"wai >
t":{"avgcount":0,"sum":0.000000000}},"mutex-JOS::ApplyManager::com_lock":{"wait":{"avgcount":0,"sum":0.000000000}},"mutex-JOS::SubmitManager::lock":{"wait":{"avgcount":0,"sum":0.000000000}},"osd":{"opq":0,"op_wip":0,"op":31753,"op_in_bytes":797806224,"op_out_bytes":1728414048,"op_latency":{"avgcount":31753,"sum":82.183373000},"op_r":24945,"op_r_out_bytes":1728414048,"op_r_latency":{"avgcount":24945,"sum":16.833689000},"op_w":6808,"op_w_in_bytes":797806224,"op_w_rlat":{"avgcount":6808,"sum":38.223006000},"op_w_latency":{"avgcount":6808,"sum":65.349684000},"op_rw":0,"op_rw_in_bytes":0,"op_rw_out_bytes":0,"op_rw_rlat":{"avgcount":0,"sum":0.000000000},"op_rw_latency":{"avgcount":0,"sum":0.000000000},"subop":5700,"subop_in_bytes":813224156,"subop_latency":{"avgcount":5700,"sum":26.877085000},"subop_w":0,"subop_w_in_bytes":813224156,"subop_w_latency":{"avgcount":5700,"sum":26.877085000},"subop_pull":0,"subop_pull_latency":{"avgcount":0,"sum":0.000000000},"subop_push":0,"subop_push_ > in_bytes" > :0,"subop_push_latency":{"avgcount":0,"sum":0.000000000},"pull":0,"push":0,"push_out_bytes":0,"push_in":0,"push_in_bytes":0,"recovery_ops":0,"loadavg":359,"buffer_bytes":0,"numpg":2486,"numpg_primary":1214,"numpg_replica":1272,"numpg_stray":0,"heartbeat_to_peers":4,"heartbeat_from_peers":0,"map_messages":44,"map_message_epochs":184,"map_message_epoch_dups":174},"throttle-filestore_bytes":{"val":0,"max":104857600,"get":0,"get_sum":0,"get_or_fail_fail":0,"get_or_fail_success":0,"take":20868,"take_sum":1710397238,"put":13354,"put_sum":1710397238,"wait":{"avgcount":0,"sum":0.000000000}},"throttle-filestore_ops":{"val":0,"max":500,"get":0,"get_sum":0,"get_or_fail_fail":0,"get_or_fail_success":0,"take":20868,"take_sum":20868,"put":13354,"put_sum":20868,"wait":{"avgcount":0,"sum":0.000000000}},"throttle-msgr_dispatch_throttler-client":{"val":0,"max":104857600,"get":31657,"get_sum":795546573,"get_or_fail_fail":0,"get_or_fail_success":0,"take":0,"take_sum":0,"put":31657,"put_sum":7955 > 
46573,"wa > it":{"avgcount":0,"sum":0.000000000}},"throttle-msgr_dispatch_throttler-cluster":{"val":0,"max":104857600,"get":18913,"get_sum":828011293,"get_or_fail_fail":0,"get_or_fail_success":0,"take":0,"take_sum":0,"put":18913,"put_sum":828011293,"wait":{"avgcount":0,"sum":0.000000000}},"throttle-msgr_dispatch_throttler-hbclient":{"val":0,"max":104857600,"get":4260,"get_sum":200220,"get_or_fail_fail":0,"get_or_fail_success":0,"take":0,"take_sum":0,"put":4260,"put_sum":200220,"wait":{"avgcount":0,"sum":0.000000000}},"throttle-msgr_dispatch_throttler-hbserver":{"val":0,"max":104857600,"get":4236,"get_sum":199092,"get_or_fail_fail":0,"get_or_fail_success":0,"take":0,"take_sum":0,"put":4236,"put_sum":199092,"wait":{"avgcount":0,"sum":0.000000000}},"throttle-osd_client_bytes":{"val":0,"max":524288000,"get":31308,"get_sum":795353720,"get_or_fail_fail":0,"get_or_fail_success":0,"take":0,"take_sum":0,"put":37671,"put_sum":795353720,"wait":{"avgcount":0,"sum":0.000000000}}} > > > > # ceph --admin-daemon ./ceph-osd.24.asok dump_historic_ops > > > > { "description": "osd_op(client.7636.0:101498 rb.0.122c.6b8b4567.000000000357 [write 3600384~4096] 6.42463fe3)", > > "received_at": "2012-12-20 23:34:42.012131", > > "age": "75.428051", > > "duration": "0.009988", > > "flag_point": "waiting for sub ops", > > "client_info": { "client": "client.7636", > > "tid": 101498}, > > "events": [ > > { "time": "2012-12-20 23:34:42.012317", > > "event": "waiting_for_osdmap"}, > > { "time": "2012-12-20 23:34:42.012425", > > "event": "reached_pg"}, > > { "time": "2012-12-20 23:34:42.012678", > > "event": "started"}, > > { "time": "2012-12-20 23:34:42.012720", > > "event": "started"}, > > { "time": "2012-12-20 23:34:42.012815", > > "event": "sub_op_sent"}, > > { "time": "2012-12-20 23:34:42.012958", > > "event": "commit_queued_for_journal_write"}, > > { "time": "2012-12-20 23:34:42.013033", > > "event": "write_thread_in_journal_buffer"}, > > { "time": "2012-12-20 23:34:42.013745", > > "event": 
"journaled_completion_queued"}, > > { "time": "2012-12-20 23:34:42.013803", > > "event": "op_commit"}, > > { "time": "2012-12-20 23:34:42.017414", > > "event": "sub_op_commit_rec"}, > > { "time": "2012-12-20 23:34:42.022066", > > "event": "op_applied"}, > > { "time": "2012-12-20 23:34:42.022119", > > "event": "done"}]}, > > { "description": "osd_sub_op_reply(client.7636.0:101498 6.1fe3 42463fe3\/rb.0.122c.6b8b4567.000000000357\/head\/\/6 [] ondisk, result = 0)", > > "received_at": "2012-12-20 23:34:42.017048", > > "age": "75.423134", > > "duration": "0.000411", > > "flag_point": "started", > > "events": [ > > { "time": "2012-12-20 23:34:42.017218", > > "event": "waiting_for_osdmap"}, > > { "time": "2012-12-20 23:34:42.017363", > > "event": "reached_pg"}, > > { "time": "2012-12-20 23:34:42.017401", > > "event": "started"}, > > { "time": "2012-12-20 23:34:42.017459", > > "event": "done"}]}, > > { "description": "osd_op(client.7636.0:101502 rb.0.122c.6b8b4567.000000000064 [write 1512448~61440] 6.a241e34d)", > > "received_at": "2012-12-20 23:34:49.851021", > > "age": "67.589161", > > "duration": "0.007114", > > "flag_point": "waiting for sub ops", > > "client_info": { "client": "client.7636", > > "tid": 101502}, > > "events": [ > > { "time": "2012-12-20 23:34:49.851276", > > "event": "waiting_for_osdmap"}, > > { "time": "2012-12-20 23:34:49.851429", > > "event": "reached_pg"}, > > { "time": "2012-12-20 23:34:49.851616", > > "event": "started"}, > > { "time": "2012-12-20 23:34:49.851648", > > "event": "started"}, > > { "time": "2012-12-20 23:34:49.851712", > > "event": "sub_op_sent"}, > > { "time": "2012-12-20 23:34:49.851770", > > "event": "commit_queued_for_journal_write"}, > > { "time": "2012-12-20 23:34:49.851915", > > "event": "write_thread_in_journal_buffer"}, > > { "time": "2012-12-20 23:34:49.853956", > > "event": "journaled_completion_queued"}, > > { "time": "2012-12-20 23:34:49.853992", > > "event": "op_commit"}, > > { "time": "2012-12-20 23:34:49.857016", > > 
"event": "sub_op_commit_rec"}, > > { "time": "2012-12-20 23:34:49.858077", > > "event": "op_applied"}, > > { "time": "2012-12-20 23:34:49.858135", > > "event": "done"}]}, > > { "description": "osd_sub_op_reply(client.7636.0:101502 6.234d a241e34d\/rb.0.122c.6b8b4567.000000000064\/head\/\/6 [] ondisk, result = 0)", > > "received_at": "2012-12-20 23:34:49.856762", > > "age": "67.583420", > > "duration": "0.000308", > > "flag_point": "started", > > "events": [ > > { "time": "2012-12-20 23:34:49.856869", > > "event": "waiting_for_osdmap"}, > > { "time": "2012-12-20 23:34:49.856969", > > "event": "reached_pg"}, > > { "time": "2012-12-20 23:34:49.857001", > > "event": "started"}, > > { "time": "2012-12-20 23:34:49.857070", > > "event": "done"}]}, > > { "description": "osd_op(client.7636.0:101504 rb.0.122c.6b8b4567.00000000012c [write 1658880~2048] 6.92d2c38c)", > > "received_at": "2012-12-20 23:35:34.351518", > > "age": "23.088664", > > "duration": "0.009430", > > "flag_point": "waiting for sub ops", > > "client_info": { "client": "client.7636", > > "tid": 101504}, > > "events": [ > > { "time": "2012-12-20 23:35:34.351714", > > "event": "waiting_for_osdmap"}, > > { "time": "2012-12-20 23:35:34.351811", > > "event": "reached_pg"}, > > { "time": "2012-12-20 23:35:34.352030", > > "event": "started"}, > > { "time": "2012-12-20 23:35:34.352071", > > "event": "started"}, > > { "time": "2012-12-20 23:35:34.352146", > > "event": "sub_op_sent"}, > > { "time": "2012-12-20 23:35:34.352213", > > "event": "commit_queued_for_journal_write"}, > > { "time": "2012-12-20 23:35:34.352287", > > "event": "write_thread_in_journal_buffer"}, > > { "time": "2012-12-20 23:35:34.355390", > > "event": "journaled_completion_queued"}, > > { "time": "2012-12-20 23:35:34.355475", > > "event": "op_commit"}, > > { "time": "2012-12-20 23:35:34.356661", > > "event": "sub_op_commit_rec"}, > > { "time": "2012-12-20 23:35:34.360894", > > "event": "op_applied"}, > > { "time": "2012-12-20 23:35:34.360948", > > 
"event": "done"}]}, > > { "description": "osd_sub_op_reply(client.7636.0:101504 6.38c 92d2c38c\/rb.0.122c.6b8b4567.00000000012c\/head\/\/6 [] ondisk, result = 0)", > > "received_at": "2012-12-20 23:35:34.356286", > > "age": "23.083896", > > "duration": "0.000421", > > "flag_point": "started", > > "events": [ > > { "time": "2012-12-20 23:35:34.356489", > > "event": "waiting_for_osdmap"}, > > { "time": "2012-12-20 23:35:34.356598", > > "event": "reached_pg"}, > > { "time": "2012-12-20 23:35:34.356640", > > "event": "started"}, > > { "time": "2012-12-20 23:35:34.356707", > > "event": "done"}]}, > > { "description": "osd_op(client.7636.0:101512 rb.0.122c.6b8b4567.000000000357 [write 3731456~4096] 6.42463fe3)", > > "received_at": "2012-12-20 23:35:42.133103", > > "age": "15.307079", > > "duration": "0.008357", > > "flag_point": "waiting for sub ops", > > "client_info": { "client": "client.7636", > > "tid": 101512}, > > "events": [ > > { "time": "2012-12-20 23:35:42.133315", > > "event": "waiting_for_osdmap"}, > > { "time": "2012-12-20 23:35:42.133427", > > "event": "reached_pg"}, > > { "time": "2012-12-20 23:35:42.133684", > > "event": "started"}, > > { "time": "2012-12-20 23:35:42.133723", > > "event": "started"}, > > { "time": "2012-12-20 23:35:42.133796", > > "event": "sub_op_sent"}, > > { "time": "2012-12-20 23:35:42.133908", > > "event": "commit_queued_for_journal_write"}, > > { "time": "2012-12-20 23:35:42.133995", > > "event": "write_thread_in_journal_buffer"}, > > { "time": "2012-12-20 23:35:42.136965", > > "event": "journaled_completion_queued"}, > > { "time": "2012-12-20 23:35:42.137025", > > "event": "op_commit"}, > > { "time": "2012-12-20 23:35:42.138456", > > "event": "sub_op_commit_rec"}, > > { "time": "2012-12-20 23:35:42.141408", > > "event": "op_applied"}, > > { "time": "2012-12-20 23:35:42.141460", > > "event": "done"}]}, > > { "description": "osd_sub_op_reply(client.7636.0:101512 6.1fe3 42463fe3\/rb.0.122c.6b8b4567.000000000357\/head\/\/6 [] ondisk, 
result = 0)", > > "received_at": "2012-12-20 23:35:42.138087", > > "age": "15.302095", > > "duration": "0.000410", > > "flag_point": "started", > > "events": [ > > { "time": "2012-12-20 23:35:42.138248", > > "event": "waiting_for_osdmap"}, > > { "time": "2012-12-20 23:35:42.138363", > > "event": "reached_pg"}, > > { "time": "2012-12-20 23:35:42.138443", > > "event": "started"}, > > { "time": "2012-12-20 23:35:42.138497", > > "event": "done"}]}]} > > > > # ceph --admin-daemon ./ceph-osd.24.asok config show { "name": > > "osd.24", > > "cluster": "ceph", > > "none": "0\/5", > > "lockdep": "0\/1", > > "context": "0\/1", > > "crush": "1\/1", > > "mds": "1\/5", > > "mds_balancer": "1\/5", > > "mds_locker": "1\/5", > > "mds_log": "1\/5", > > "mds_log_expire": "1\/5", > > "mds_migrator": "1\/5", > > "buffer": "0\/1", > > "timer": "0\/1", > > "filer": "0\/1", > > "striper": "0\/1", > > "objecter": "0\/1", > > "rados": "0\/5", > > "rbd": "0\/5", > > "journaler": "0\/5", > > "objectcacher": "0\/5", > > "client": "0\/5", > > "osd": "0\/5", > > "optracker": "0\/5", > > "objclass": "0\/5", > > "filestore": "1\/3", > > "journal": "1\/3", > > "ms": "0\/5", > > "mon": "1\/5", > > "monc": "0\/10", > > "paxos": "0\/5", > > "tp": "0\/5", > > "auth": "1\/5", > > "crypto": "1\/5", > > "finisher": "1\/1", > > "heartbeatmap": "1\/5", > > "perfcounter": "1\/5", > > "rgw": "1\/5", > > "hadoop": "1\/5", > > "javaclient": "1\/5", > > "asok": "1\/5", > > "throttle": "1\/1", > > "host": "KVM04", > > "fsid": "00000000-0000-0000-0000-000000000000", > > "public_addr": ":\/0", > > "cluster_addr": ":\/0", > > "public_network": "", > > "cluster_network": "", > > "num_client": "1", > > "monmap": "", > > "mon_host": "", > > "lockdep": "false", > > "admin_socket": "\/var\/run\/ceph\/ceph-osd.24.asok", > > "daemonize": "true", > > "pid_file": "\/var\/run\/ceph\/osd.24.pid", > > "chdir": "\/", > > "max_open_files": "0", > > "fatal_signal_handlers": "true", > > "log_file": "\/var\/log\/ceph\/osd.24.log", > 
> "log_max_new": "1000", > > "log_max_recent": "100000", > > "log_to_stderr": "false", > > "err_to_stderr": "true", > > "log_to_syslog": "false", > > "err_to_syslog": "false", > > "log_flush_on_exit": "true", > > "clog_to_monitors": "true", > > "clog_to_syslog": "false", > > "mon_cluster_log_to_syslog": "false", > > "mon_cluster_log_file": "\/var\/log\/ceph\/ceph.log", > > "key": "", > > "keyfile": "", > > "keyring": "\/srv\/ceph\/osd\/osd.24\/keyring", > > "heartbeat_interval": "5", > > "heartbeat_file": "", > > "perf": "true", > > "ms_tcp_nodelay": "true", > > "ms_initial_backoff": "0.2", > > "ms_max_backoff": "15", > > "ms_nocrc": "false", > > "ms_die_on_bad_msg": "false", > > "ms_dispatch_throttle_bytes": "104857600", > > "ms_bind_ipv6": "false", > > "ms_bind_port_min": "6800", > > "ms_bind_port_max": "7100", > > "ms_rwthread_stack_bytes": "1048576", > > "ms_tcp_read_timeout": "900", > > "ms_inject_socket_failures": "0", > > "ms_inject_delay_type": "", > > "ms_inject_delay_max": "1", > > "ms_inject_delay_probability": "0", > > "mon_data": "\/var\/lib\/ceph\/mon\/ceph-24", > > "mon_initial_members": "", > > "mon_sync_fs_threshold": "5", > > "mon_tick_interval": "5", > > "mon_subscribe_interval": "300", > > "mon_osd_laggy_halflife": "3600", > > "mon_osd_laggy_weight": "0.3", > > "mon_osd_adjust_heartbeat_grace": "true", > > "mon_osd_adjust_down_out_interval": "true", > > "mon_osd_auto_mark_in": "false", > > "mon_osd_auto_mark_auto_out_in": "true", > > "mon_osd_auto_mark_new_in": "true", > > "mon_osd_down_out_interval": "300", > > "mon_osd_min_up_ratio": "0.3", > > "mon_osd_min_in_ratio": "0.3", > > "mon_lease": "5", > > "mon_lease_renew_interval": "3", > > "mon_lease_ack_timeout": "10", > > "mon_clock_drift_allowed": "0.05", > > "mon_clock_drift_warn_backoff": "5", > > "mon_accept_timeout": "10", > > "mon_pg_create_interval": "30", > > "mon_pg_stuck_threshold": "300", > > "mon_osd_full_ratio": "0.95", > > "mon_osd_nearfull_ratio": "0.85", > > 
"mon_globalid_prealloc": "100", > > "mon_osd_report_timeout": "900", > > "mon_force_standby_active": "true", > > "mon_min_osdmap_epochs": "500", > > "mon_max_pgmap_epochs": "500", > > "mon_max_log_epochs": "500", > > "mon_max_osd": "10000", > > "mon_probe_timeout": "2", > > "mon_slurp_timeout": "10", > > "mon_slurp_bytes": "262144", > > "mon_client_bytes": "104857600", > > "mon_daemon_bytes": "419430400", > > "mon_max_log_entries_per_event": "4096", > > "paxos_max_join_drift": "10", > > "paxos_propose_interval": "1", > > "paxos_min_wait": "0.05", > > "clock_offset": "0", > > "auth_cluster_required": "none", > > "auth_service_required": "none", > > "auth_client_required": "none", > > "auth_supported": "none", > > "cephx_require_signatures": "false", > > "cephx_sign_messages": "true", > > "auth_mon_ticket_ttl": "43200", > > "auth_service_ticket_ttl": "3600", > > "mon_client_hunt_interval": "3", > > "mon_client_ping_interval": "10", > > "mon_client_max_log_entries_per_message": "1000", > > "mon_max_pool_pg_num": "65536", > > "client_cache_size": "16384", > > "client_cache_mid": "0.75", > > "client_use_random_mds": "false", > > "client_mount_timeout": "30", > > "client_tick_interval": "1", > > "client_trace": "", > > "client_readahead_min": "131072", > > "client_readahead_max_bytes": "0", > > "client_readahead_max_periods": "4", > > "client_snapdir": ".snap", > > "client_mountpoint": "\/", > > "client_notify_timeout": "10", > > "client_caps_release_delay": "5", > > "client_oc": "true", > > "client_oc_size": "209715200", > > "client_oc_max_dirty": "104857600", > > "client_oc_target_dirty": "8388608", > > "client_oc_max_dirty_age": "5", > > "client_oc_max_objects": "1000", > > "client_debug_force_sync_read": "false", > > "client_debug_inject_tick_delay": "0", > > "fuse_use_invalidate_cb": "false", > > "fuse_big_writes": "true", > > "fuse_debug": "false", > > "objecter_tick_interval": "5", > > "objecter_timeout": "10", > > "objecter_inflight_op_bytes": "104857600", > > 
"objecter_inflight_ops": "1024", > > "journaler_allow_split_entries": "true", > > "journaler_write_head_interval": "15", > > "journaler_prefetch_periods": "10", > > "journaler_prezero_periods": "5", > > "journaler_batch_interval": "0.001", > > "journaler_batch_max": "0", > > "mds_data": "\/var\/lib\/ceph\/mds\/ceph-24", > > "mds_max_file_size": "1099511627776", > > "mds_cache_size": "100000", > > "mds_cache_mid": "0.7", > > "mds_mem_max": "1048576", > > "mds_dir_commit_ratio": "0.5", > > "mds_dir_max_commit_size": "90", > > "mds_decay_halflife": "5", > > "mds_beacon_interval": "4", > > "mds_beacon_grace": "15", > > "mds_blacklist_interval": "1440", > > "mds_session_timeout": "60", > > "mds_session_autoclose": "300", > > "mds_reconnect_timeout": "45", > > "mds_tick_interval": "5", > > "mds_dirstat_min_interval": "1", > > "mds_scatter_nudge_interval": "5", > > "mds_client_prealloc_inos": "1000", > > "mds_early_reply": "true", > > "mds_use_tmap": "true", > > "mds_default_dir_hash": "2", > > "mds_log": "true", > > "mds_log_skip_corrupt_events": "false", > > "mds_log_max_events": "-1", > > "mds_log_max_segments": "30", > > "mds_log_max_expiring": "20", > > "mds_bal_sample_interval": "3", > > "mds_bal_replicate_threshold": "8000", > > "mds_bal_unreplicate_threshold": "0", > > "mds_bal_frag": "false", > > "mds_bal_split_size": "10000", > > "mds_bal_split_rd": "25000", > > "mds_bal_split_wr": "10000", > > "mds_bal_split_bits": "3", > > "mds_bal_merge_size": "50", > > "mds_bal_merge_rd": "1000", > > "mds_bal_merge_wr": "1000", > > "mds_bal_interval": "10", > > "mds_bal_fragment_interval": "5", > > "mds_bal_idle_threshold": "0", > > "mds_bal_max": "-1", > > "mds_bal_max_until": "-1", > > "mds_bal_mode": "0", > > "mds_bal_min_rebalance": "0.1", > > "mds_bal_min_start": "0.2", > > "mds_bal_need_min": "0.8", > > "mds_bal_need_max": "1.2", > > "mds_bal_midchunk": "0.3", > > "mds_bal_minchunk": "0.001", > > "mds_bal_target_removal_min": "5", > > "mds_bal_target_removal_max": 
"10", > > "mds_replay_interval": "1", > > "mds_shutdown_check": "0", > > "mds_thrash_exports": "0", > > "mds_thrash_fragments": "0", > > "mds_dump_cache_on_map": "false", > > "mds_dump_cache_after_rejoin": "false", > > "mds_verify_scatter": "false", > > "mds_debug_scatterstat": "false", > > "mds_debug_frag": "false", > > "mds_debug_auth_pins": "false", > > "mds_debug_subtrees": "false", > > "mds_kill_mdstable_at": "0", > > "mds_kill_export_at": "0", > > "mds_kill_import_at": "0", > > "mds_kill_link_at": "0", > > "mds_kill_rename_at": "0", > > "mds_wipe_sessions": "false", > > "mds_wipe_ino_prealloc": "false", > > "mds_skip_ino": "0", > > "max_mds": "1", > > "mds_standby_for_name": "", > > "mds_standby_for_rank": "-1", > > "mds_standby_replay": "false", > > "osd_auto_upgrade_tmap": "true", > > "osd_tmapput_sets_uses_tmap": "false", > > "osd_max_backfills": "10", > > "osd_backfill_full_ratio": "0.85", > > "osd_backfill_retry_interval": "10", > > "osd_uuid": "00000000-0000-0000-0000-000000000000", > > "osd_data": "\/srv\/ceph\/osd\/osd.24", > > "osd_journal": "\/dev\/sda3", > > "osd_journal_size": "5120", > > "osd_max_write_size": "90", > > "osd_max_pgls": "1024", > > "osd_client_message_size_cap": "524288000", > > "osd_pg_bits": "6", > > "osd_pgp_bits": "6", > > "osd_min_rep": "1", > > "osd_max_rep": "10", > > "osd_pool_default_crush_rule": "0", > > "osd_pool_default_size": "2", > > "osd_pool_default_min_size": "0", > > "osd_pool_default_pg_num": "8", > > "osd_pool_default_pgp_num": "8", > > "osd_map_dedup": "true", > > "osd_map_cache_size": "500", > > "osd_map_message_max": "100", > > "osd_op_threads": "2", > > "osd_disk_threads": "1", > > "osd_recovery_threads": "1", > > "osd_recover_clone_overlap": "true", > > "osd_backfill_scan_min": "64", > > "osd_backfill_scan_max": "512", > > "osd_op_thread_timeout": "30", > > "osd_recovery_thread_timeout": "30", > > "osd_snap_trim_thread_timeout": "3600", > > "osd_scrub_thread_timeout": "60", > > 
"osd_scrub_finalize_thread_timeout": "600", > > "osd_remove_thread_timeout": "3600", > > "osd_command_thread_timeout": "600", > > "osd_age": "0.8", > > "osd_age_time": "0", > > "osd_heartbeat_addr": ":\/0", > > "osd_heartbeat_interval": "6", > > "osd_heartbeat_grace": "20", > > "osd_mon_heartbeat_interval": "30", > > "osd_mon_report_interval_max": "120", > > "osd_mon_report_interval_min": "5", > > "osd_mon_ack_timeout": "30", > > "osd_min_down_reporters": "1", > > "osd_min_down_reports": "3", > > "osd_default_data_pool_replay_window": "45", > > "osd_preserve_trimmed_log": "false", > > "osd_auto_mark_unfound_lost": "false", > > "osd_recovery_delay_start": "0", > > "osd_recovery_max_active": "50", > > "osd_recovery_max_chunk": "8388608", > > "osd_recovery_forget_lost_objects": "false", > > "osd_max_scrubs": "1", > > "osd_scrub_load_threshold": "0.5", > > "osd_scrub_min_interval": "300", > > "osd_scrub_max_interval": "86400", > > "osd_deep_scrub_interval": "604800", > > "osd_deep_scrub_stride": "524288", > > "osd_auto_weight": "false", > > "osd_class_dir": "\/usr\/local\/lib\/rados-classes", > > "osd_check_for_log_corruption": "false", > > "osd_use_stale_snap": "false", > > "osd_rollback_to_cluster_snap": "", > > "osd_default_notify_timeout": "30", > > "osd_kill_backfill_at": "0", > > "osd_min_pg_log_entries": "1000", > > "osd_op_complaint_time": "30", > > "osd_command_max_records": "256", > > "osd_op_log_threshold": "5", > > "osd_verify_sparse_read_holes": "false", > > "osd_debug_drop_ping_probability": "0", > > "osd_debug_drop_ping_duration": "0", > > "osd_debug_drop_pg_create_probability": "0", > > "osd_debug_drop_pg_create_duration": "1", > > "osd_debug_drop_op_probability": "0", > > "osd_op_history_size": "20", > > "osd_op_history_duration": "600", > > "osd_target_transaction_size": "300", > > "osd_client_op_priority": "63", > > "osd_recovery_op_priority": "10", > > "filestore": "false", > > "filestore_index_retry_probability": "0", > > 
"filestore_debug_omap_check": "false", > > "filestore_xattr_use_omap": "false", > > "filestore_max_inline_xattr_size": "512", > > "filestore_max_inline_xattrs": "2", > > "filestore_max_sync_interval": "5", > > "filestore_min_sync_interval": "0.01", > > "filestore_btrfs_snap": "true", > > "filestore_btrfs_clone_range": "true", > > "filestore_fsync_flushes_journal_data": "false", > > "filestore_fiemap": "false", > > "filestore_flusher": "true", > > "filestore_flusher_max_fds": "512", > > "filestore_flush_min": "65536", > > "filestore_sync_flush": "false", > > "filestore_journal_parallel": "false", > > "filestore_journal_writeahead": "false", > > "filestore_journal_trailing": "false", > > "filestore_queue_max_ops": "500", > > "filestore_queue_max_bytes": "104857600", > > "filestore_queue_committing_max_ops": "500", > > "filestore_queue_committing_max_bytes": "104857600", > > "filestore_op_threads": "2", > > "filestore_op_thread_timeout": "60", > > "filestore_op_thread_suicide_timeout": "180", > > "filestore_commit_timeout": "600", > > "filestore_fiemap_threshold": "4096", > > "filestore_merge_threshold": "10", > > "filestore_split_multiple": "2", > > "filestore_update_to": "1000", > > "filestore_blackhole": "false", > > "filestore_dump_file": "", > > "filestore_kill_at": "0", > > "filestore_fail_eio": "true", > > "journal_dio": "true", > > "journal_aio": "false", > > "journal_block_align": "true", > > "journal_max_write_bytes": "10485760", > > "journal_max_write_entries": "100", > > "journal_queue_max_ops": "500", > > "journal_queue_max_bytes": "104857600", > > "journal_align_min_size": "65536", > > "journal_replay_from": "0", > > "journal_zero_on_create": "false", > > "rbd_cache": "false", > > "rbd_cache_size": "33554432", > > "rbd_cache_max_dirty": "25165824", > > "rbd_cache_target_dirty": "16777216", > > "rbd_cache_max_dirty_age": "1", > > "nss_db_path": "", > > "rgw_data": "\/var\/lib\/ceph\/radosgw\/ceph-24", > > "rgw_enable_apis": "s3, swift, swift_auth, admin", 
> > "rgw_cache_enabled": "true", > > "rgw_cache_lru_size": "10000", > > "rgw_socket_path": "", > > "rgw_dns_name": "", > > "rgw_swift_url": "", > > "rgw_swift_url_prefix": "swift", > > "rgw_swift_auth_url": "", > > "rgw_swift_auth_entry": "auth", > > "rgw_keystone_url": "", > > "rgw_keystone_admin_token": "", > > "rgw_keystone_accepted_roles": "Member, admin", > > "rgw_keystone_token_cache_size": "10000", > > "rgw_keystone_revocation_interval": "900", > > "rgw_admin_entry": "admin", > > "rgw_enforce_swift_acls": "true", > > "rgw_swift_token_expiration": "86400", > > "rgw_print_continue": "true", > > "rgw_remote_addr_param": "REMOTE_ADDR", > > "rgw_op_thread_timeout": "600", > > "rgw_op_thread_suicide_timeout": "0", > > "rgw_thread_pool_size": "100", > > "rgw_num_control_oids": "8", > > "rgw_cluster_root_pool": ".rgw.root", > > "rgw_log_nonexistent_bucket": "false", > > "rgw_log_object_name": "%Y-%m-%d-%H-%i-%n", > > "rgw_log_object_name_utc": "false", > > "rgw_usage_max_shards": "32", > > "rgw_usage_max_user_shards": "1", > > "rgw_enable_ops_log": "true", > > "rgw_enable_usage_log": "true", > > "rgw_ops_log_rados": "true", > > "rgw_ops_log_socket_path": "", > > "rgw_ops_log_data_backlog": "5242880", > > "rgw_usage_log_flush_threshold": "1024", > > "rgw_usage_log_tick_interval": "30", > > "rgw_intent_log_object_name": "%Y-%m-%d-%i-%n", > > "rgw_intent_log_object_name_utc": "false", > > "rgw_init_timeout": "30", > > "rgw_mime_types_file": "\/etc\/mime.types", > > "rgw_gc_max_objs": "32", > > "rgw_gc_obj_min_wait": "7200", > > "rgw_gc_processor_max_time": "3600", > > "rgw_gc_processor_period": "3600", > > "rgw_s3_success_create_obj_status": "0", > > "rgw_resolve_cname": "false", > > "rgw_obj_stripe_size": "4194304", > > "rgw_extended_http_attrs": "", > > "mutex_perf_counter": "false", > > "internal_safe_to_start_threads": "true"} > > > > > > -----Original Message----- > > From: Mark Nelson [mailto:mark.nelson@inktank.com] > > Sent: Thursday, 20 December 2012 11:28 PM 
> > To: Matthew Anderson
> > Cc: ceph-devel@vger.kernel.org
> > Subject: Re: OSD's slow down to a crawl
> >
> > Hi Matt,
> >
> > Yeah, your log is looking pretty sparse! I imagine Sam will have a lot of questions for you. Out of curiosity have you looked at any io/cpu/memory stats when things slow down? sysprof may come in handy if it looks like it's CPU related. Also, check out the OSD admin socket.
> > Sebastien Han made a nice little blog entry describing it:
> >
> > http://www.sebastien-han.fr/blog/2012/08/14/ceph-admin-socket/
> >
> > You can get a lot of information about the current state of things.
> >
> > Mark
> >
> > --
> To unsubscribe from this list: send the line "unsubscribe ceph-devel" in
> the body of a message to majordomo@vger.kernel.org
> More majordomo info at http://vger.kernel.org/majordomo-info.html
>
>
^ permalink raw reply [flat|nested] 13+ messages in thread
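The dump_historic_ops output quoted in this thread reports, for each op, an "age", a "duration" and a list of timestamped events. The following is a minimal Python sketch, not part of the original messages, for checking whether any single op stalled between two events. It assumes the ops have already been loaded as a list of dicts using the field names visible in the quoted output; the function names and the SAMPLE_OP constant are hypothetical, and the sample values are copied from the last op in the dump.

```python
from datetime import datetime

# Timestamp layout used in the quoted dump, e.g. "2012-12-20 23:35:42.138248".
TIME_FORMAT = "%Y-%m-%d %H:%M:%S.%f"

# Last op from the quoted dump_historic_ops output (JSON escapes decoded).
SAMPLE_OP = {
    "description": "osd_sub_op_reply(client.7636.0:101512 6.1fe3 "
                   "42463fe3/rb.0.122c.6b8b4567.000000000357/head//6 [] "
                   "ondisk, result = 0)",
    "received_at": "2012-12-20 23:35:42.138087",
    "age": "15.302095",
    "duration": "0.000410",
    "flag_point": "started",
    "events": [
        {"time": "2012-12-20 23:35:42.138248", "event": "waiting_for_osdmap"},
        {"time": "2012-12-20 23:35:42.138363", "event": "reached_pg"},
        {"time": "2012-12-20 23:35:42.138443", "event": "started"},
        {"time": "2012-12-20 23:35:42.138497", "event": "done"},
    ],
}


def largest_event_gap(op):
    """Return (seconds, earlier_event, later_event) for the widest gap
    between two consecutive events of one op."""
    events = op.get("events", [])
    times = [datetime.strptime(e["time"], TIME_FORMAT) for e in events]
    best = (0.0, None, None)
    for i in range(1, len(events)):
        gap = (times[i] - times[i - 1]).total_seconds()
        if gap > best[0]:
            best = (gap, events[i - 1]["event"], events[i]["event"])
    return best


def summarize_ops(ops):
    """Build one summary row per op: age, duration and widest event gap."""
    rows = []
    for op in ops:
        gap, before, after = largest_event_gap(op)
        rows.append({
            "description": op["description"],
            "age": float(op["age"]),            # seconds since the op was received
            "duration": float(op["duration"]),  # seconds the op took to process
            "largest_gap": gap,
            "gap_between": (before, after),
        })
    return rows


for row in summarize_ops([SAMPLE_OP]):
    print("%.6fs old, took %.6fs, widest gap %.6fs between %s and %s" % (
        row["age"], row["duration"], row["largest_gap"],
        row["gap_between"][0], row["gap_between"][1]))
```

For the sample op the widest gap is about 115 microseconds (waiting_for_osdmap to reached_pg) while its age is about 15 seconds, which is consistent with the observation in the thread that the recorded ops completed quickly and were simply old by the time the dump was taken.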
[parent not found: <38A500831D3DE24B90BD200D6C8701351BB15433@Exchange2010-2.corit.local>]
* RE: OSD's slow down to a crawl
[not found] ` <38A500831D3DE24B90BD200D6C8701351BB15433@Exchange2010-2.corit.local>
@ 2012-12-20 17:13 ` Sage Weil
2012-12-21 4:51 ` Matthew Anderson
0 siblings, 1 reply; 13+ messages in thread
From: Sage Weil @ 2012-12-20 17:13 UTC (permalink / raw)
To: Matthew Anderson; +Cc: 'Mark Nelson', ceph-devel@vger.kernel.org
[-- Attachment #1: Type: TEXT/PLAIN, Size: 44581 bytes --]
On Thu, 20 Dec 2012, Matthew Anderson wrote:
> Hi Sage,
>
> Logs are attached. I took the osd logs from osd.24 as this is the first
> osd in my SSD pool I've been testing with previously.
>
> The 4MB bench I was able to reproduce the fault by restarting my rbd
> export which stalled after a few percent complete. When I ran the 4MB
> bench it stalled early on and never received a response back from the
> OSD and I terminated it after 60 seconds or so. I wasn't able to
> reproduce the fault using the 4kb io size. The 4kb log should show rados
> bench completing normally at a respectable speed of about 1MB/s.

Let's drill into the hang.. up until that point things look okay.

2012-12-21 00:51:26.033622 7f6f3c042760 1 -- 172.16.0.13:0/1023886 --> 172.16.0.13:6813/22233 -- osd_op(client.9503.0:185 benchmark_data_KVM04_23886_object184 [write 0~4194304] 6.3ca4346e) v4 -- ?+0 0x171ea50 con 0x171a7e0

Do you have a log for that OSD so we can see what happened there?

It may also be that the replicated write is hung. If you do

 ceph pg map 6.3ca4346e

you can see all OSDs storing that PG. And/or you can grep for client.9503.0:185 in 172.16.0.13:6813/22233's log and see whether the sub_op was sent.

Thanks!
sage

>
> Thanks
> -Matt
>
> -----Original Message-----
> From: ceph-devel-owner@vger.kernel.org [mailto:ceph-devel-owner@vger.kernel.org] On Behalf Of Sage Weil
> Sent: Friday, 21 December 2012 12:30 AM
> To: Matthew Anderson
> Cc: 'Mark Nelson'; ceph-devel@vger.kernel.org
> Subject: RE: OSD's slow down to a crawl
>
> Can you do a similar test, but with full logging on?
>
> ceph tell osd.0 injectargs '--debug-ms 1 --debug-filestore 20 --debug-osd 20 --debug-journal 20'
> rados -p ssd bench 30 write -b 4096 -t 1 --log-file /tmp/foo --debug-ms 1
>
> That will be a single IO in flight at a time and very easy to trace through the logs. If you can post the resulting log file (/tmp/foo and from osd.0), that would be awesome.
>
> Thanks!
> sage
>
>
> On Thu, 20 Dec 2012, Matthew Anderson wrote:
>
> > # rados bench 60 write -t 256 -p ssd
> > Maintaining 256 concurrent writes of 4194304 bytes for at least 60 seconds.
> > Object prefix: benchmark_data_KVM03_12985
> >   sec  Cur ops  started finished avg MB/s cur MB/s last lat  avg lat
> >     0        0        0        0        0        0        -        0
> >     1       45       45        0        0        0        -        0
> >     2       71       71        0        0        0        -        0
> >     3       90       90        0        0        0        -        0
> >     4      125      125        0        0        0        -        0
> >     5      136      136        0        0        0        -        0
> >     6      144      144        0        0        0        -        0
> >     7      175      175        0        0        0        -        0
> >     8      201      201        0        0        0        -        0
> >     9      217      217        0        0        0        -        0
> >    10      228      228        0        0        0        -        0
> >    11      251      251        0        0        0        -        0
> >    12      255      266       11  3.66581  3.66667  11.7027  11.6013
> >    13      255      271       16  4.92194       20  12.9795  11.9979
> >    14      255      291       36  10.2833       80  13.1795  12.6478
> >    15      255      301       46  12.2639       40  13.9424  12.9123
> >    16      255      303       48  11.9973        8  14.0277  12.9579
> >    17      255      321       66   15.526       72  15.1408  13.4616
> >    18      255      345       90  19.9956       96  15.5673  13.9937
> >    19      255      349       94  19.7851       16   15.707  14.0635
> > 2012-12-21 00:06:37.198982min lat: 11.4904 max lat: 16.4053 avg lat: 14.3121
> >   sec  Cur ops  started finished avg MB/s cur MB/s last lat  avg lat
> >    20      255      361      106  21.1953       48  16.4053  14.3121
> >    21      255      367      112  21.3285       24  17.4263  14.4795
> >    22      255      370      115  20.9043       12  17.3579  14.5544
> >    23      255      387      132  22.9513       68  18.3185  15.0867
> >    24      255      394      139  23.1614       28  18.2868  15.2504
> >    25      255      404      149  23.8346       40   18.741  15.5048
> >    26      255      410      155  23.8407       24   19.032  15.6358
> >    27      255      410      155  22.9577        0        -  15.6358
> >    28      255      410      155  22.1378        0        -  15.6358
> >    29      255      411      156  21.5124  1.33333  22.6927  15.6811
> >    30      255      431      176  23.4614       80  22.6577  16.4822
> >    31      255      447      192  24.7687       64  23.2345  17.0563
> >    32      255      464      209  26.1192       68  23.3788  17.5859
> >    33      255      485      230  27.8726       84  22.8924  18.1122
> >    34      255      497      242  28.4642       48   23.563  18.3693
> >    35      255      508      253  28.9078       44  23.0516  18.5935
> >    36      255      514      259  28.7713       24  24.4415  18.7278
> >    37      255      530      275   29.723       64   23.847  19.0424
> >    38      255      541      286  30.0985       44  24.5757  19.2427
> >    39      255      544      289  29.6344       12  24.6802  19.2984
> > 2012-12-21 00:06:57.203473min lat: 11.4904 max lat: 24.6802 avg lat: 19.2984
> >   sec  Cur ops  started finished avg MB/s cur MB/s last lat  avg lat
> >    40      255      544      289  28.8935        0        -  19.2984
> >    41      255      544      289  28.1888        0        -  19.2984
> >    42      255      544      289  27.5176        0        -  19.2984
> >    43      255      544      289  26.8776        0        -  19.2984
> >    44      255      544      289  26.2668        0        -  19.2984
> >    45      255      544      289  25.6831        0        -  19.2984
> >    46      255      544      289  25.1248        0        -  19.2984
> >    47      255      544      289  24.5902        0        -  19.2984
> >    48      255      544      289  24.0779        0        -  19.2984
> >    49      255      544      289  23.5865        0        -  19.2984
> >    50      255      544      289  23.1148        0        -  19.2984
> >    51      255      544      289  22.6616        0        -  19.2984
> >    52      255      544      289  22.2258        0        -  19.2984
> >    53      255      544      289  21.8065        0        -  19.2984
> >    54      255      544      289  21.4026        0        -  19.2984
> >    55      255      544      289  21.0135        0        -  19.2984
> >    56      255      544      289  20.6383        0        -  19.2984
> >    57      255      544      289  20.2762        0        -  19.2984
> >    58      255      544      289  19.9266        0        -  19.2984
> >    59      255      544      289  19.5889        0        -  19.2984
> > 2012-12-21 00:07:17.207782min lat: 11.4904 max lat: 24.6802 avg lat: 19.2984
> >   sec  Cur ops  started finished avg MB/s cur MB/s last lat  avg lat
> >    60      255      544      289  19.2624        0        -  19.2984
> >    61      255      544      289  18.9466        0        -  19.2984
> >    62      255      544      289   18.641        0        -  19.2984
> >    63      255      544      289  18.3452        0        -  19.2984
> >    64      255      544      289  18.0585        0        -  19.2984
> >    65      255      544      289  17.7807        0        -  19.2984
> >    66      255      544      289  17.5113        0        -  19.2984
> >    67      133      545      412  24.5916  17.5714  37.3574  27.4834
> > Total time run: 68.000740
> > Total writes made: 545
> > Write size: 4194304
> > Bandwidth (MB/sec): 32.058
> >
> > Stddev Bandwidth: 26.8837
> > Max bandwidth (MB/sec): 96
> > Min bandwidth (MB/sec): 0
> > Average Latency: 29.2801
> > Stddev Latency: 11.9768
> > Max latency: 52.6277
> > Min latency: 11.4904
> >
> >
> > ----Original Message-----
> > From: Mark Nelson [mailto:mark.nelson@inktank.com]
> > Sent: Thursday, 20 December 2012 11:59 PM
> > To: Matthew Anderson
> > Cc: ceph-devel@vger.kernel.org
> > Subject: Re: OSD's slow down to a crawl
> >
> > Out of curiosity, if you fire up a rados bench instance on one of the nodes with say, 256 concurrent writes, do any of the writes complete?
> >
> > Mark
> >
> > On 12/20/2012 09:51 AM, Matthew Anderson wrote:
> > > Hi Mark,
> > >
> > > Thanks for the quick reply. The log was 380MB in total so I just posted the last 200 lines hoping that a trained eye might be able to spot something. I'm happy to put the file up on a web server if anyone wants to review it in full.
> > >
> > > It doesn't appear to be memory or CPU related. In its hung state (both read and write are not responding at the moment, I have a VM hung trying to shut down and an rbd stuck exporting) all of the OSD's sit around 0.3% CPU usage, ~300MB of ram and no IO wait. The server has roughly 40GB RAM free and is easily 90% idle. Iotop shows no IO coming from any of the Ceph-OSD's.
> > >
> > > The admin socket showed no ops in flight but the performance dump, historic ops and config are below. I did notice that the age of some of the ops was quite high considering things were actively trying to write to the cluster.
> > > # ceph --admin-daemon ./ceph-osd.24.asok perf dump > > > > > > {"filestore":{"journal_queue_max_ops":500,"journal_queue_ops":0,"jou > > > rn > > > al_ops":20868,"journal_queue_max_bytes":104857600,"journal_queue_byt > > > es > > > ":0,"journal_bytes":1710397238,"journal_latency":{"avgcount":20868," > > > su > > > m":185.268022000},"journal_wr":13354,"journal_wr_bytes":{"avgcount": > > > 13 > > > 354,"sum":1770647552},"op_queue_max_ops":500,"op_queue_ops":0,"ops": > > > 20 > > > 868,"op_queue_max_bytes":104857600,"op_queue_bytes":0,"bytes":171023 > > > 25 > > > 83,"apply_latency":{"avgcount":20868,"sum":508.896737000},"committing" > > > :0,"commitcycle":339,"commitcycle_interval":{"avgcount":339,"sum":17 > > > 08 > > > .284725000},"commitcycle_latency":{"avgcount":339,"sum":13.241850000 > > > }, > > > "journal_full":0},"mutex-FileJournal::completions_lock":{"wait":{"av > > > gc > > > ount":0,"sum":0.000000000}},"mutex-FileJournal::finisher_lock":{"wait" > > > :{"avgcount":0,"sum":0.000000000}},"mutex-FileJournal::write_lock":{ > > > "w > > > ait":{"avgcount":0,"sum":0.000000000}},"mutex-FileJournal::writeq_lo > > > ck > > > ":{"wait":{"avgcount":0,"sum":0.000000000}},"mutex-JOS::ApplyManager:: > > > apply_lo > > ck":{"wai > > t":{"avgcount":0,"sum":0.000000000}},"mutex-JOS::ApplyManager::com_loc > > k":{"wait":{"avgcount":0,"sum":0.000000000}},"mutex-JOS::SubmitManager > > ::lock":{"wait":{"avgcount":0,"sum":0.000000000}},"osd":{"opq":0,"op_w > > ip":0,"op":31753,"op_in_bytes":797806224,"op_out_bytes":1728414048,"op > > _latency":{"avgcount":31753,"sum":82.183373000},"op_r":24945,"op_r_out > > _bytes":1728414048,"op_r_latency":{"avgcount":24945,"sum":16.833689000 > > },"op_w":6808,"op_w_in_bytes":797806224,"op_w_rlat":{"avgcount":6808," > > sum":38.223006000},"op_w_latency":{"avgcount":6808,"sum":65.349684000} > > ,"op_rw":0,"op_rw_in_bytes":0,"op_rw_out_bytes":0,"op_rw_rlat":{"avgco > > unt":0,"sum":0.000000000},"op_rw_latency":{"avgcount":0,"sum":0.000000 > 
> 000},"subop":5700,"subop_in_bytes":813224156,"subop_latency":{"avgcoun > > t":5700,"sum":26.877085000},"subop_w":0,"subop_w_in_bytes":813224156," > > subop_w_latency":{"avgcount":5700,"sum":26.877085000},"subop_pull":0," > > subop_pull_latency":{"avgcount":0,"sum":0.000000000},"subop_push":0,"s > > ubop_push_ > > in_bytes" > > :0,"subop_push_latency":{"avgcount":0,"sum":0.000000000},"pull":0,"pus > > h":0,"push_out_bytes":0,"push_in":0,"push_in_bytes":0,"recovery_ops":0 > > ,"loadavg":359,"buffer_bytes":0,"numpg":2486,"numpg_primary":1214,"num > > pg_replica":1272,"numpg_stray":0,"heartbeat_to_peers":4,"heartbeat_fro > > m_peers":0,"map_messages":44,"map_message_epochs":184,"map_message_epo > > ch_dups":174},"throttle-filestore_bytes":{"val":0,"max":104857600,"get > > ":0,"get_sum":0,"get_or_fail_fail":0,"get_or_fail_success":0,"take":20 > > 868,"take_sum":1710397238,"put":13354,"put_sum":1710397238,"wait":{"av > > gcount":0,"sum":0.000000000}},"throttle-filestore_ops":{"val":0,"max": > > 500,"get":0,"get_sum":0,"get_or_fail_fail":0,"get_or_fail_success":0," > > take":20868,"take_sum":20868,"put":13354,"put_sum":20868,"wait":{"avgc > > ount":0,"sum":0.000000000}},"throttle-msgr_dispatch_throttler-client": > > {"val":0,"max":104857600,"get":31657,"get_sum":795546573,"get_or_fail_ > > fail":0,"get_or_fail_success":0,"take":0,"take_sum":0,"put":31657,"put > > _sum":7955 > > 46573,"wa > > it":{"avgcount":0,"sum":0.000000000}},"throttle-msgr_dispatch_throttle > > r-cluster":{"val":0,"max":104857600,"get":18913,"get_sum":828011293,"g > > et_or_fail_fail":0,"get_or_fail_success":0,"take":0,"take_sum":0,"put" > > :18913,"put_sum":828011293,"wait":{"avgcount":0,"sum":0.000000000}},"t > > hrottle-msgr_dispatch_throttler-hbclient":{"val":0,"max":104857600,"ge > > t":4260,"get_sum":200220,"get_or_fail_fail":0,"get_or_fail_success":0, > > "take":0,"take_sum":0,"put":4260,"put_sum":200220,"wait":{"avgcount":0 > > 
,"sum":0.000000000}},"throttle-msgr_dispatch_throttler-hbserver":{"val > > ":0,"max":104857600,"get":4236,"get_sum":199092,"get_or_fail_fail":0," > > get_or_fail_success":0,"take":0,"take_sum":0,"put":4236,"put_sum":1990 > > 92,"wait":{"avgcount":0,"sum":0.000000000}},"throttle-osd_client_bytes > > ":{"val":0,"max":524288000,"get":31308,"get_sum":795353720,"get_or_fai > > l_fail":0,"get_or_fail_success":0,"take":0,"take_sum":0,"put":37671,"p > > ut_sum":795353720,"wait":{"avgcount":0,"sum":0.000000000}}} > > > > > > # ceph --admin-daemon ./ceph-osd.24.asok dump_historic_ops > > > > > > { "description": "osd_op(client.7636.0:101498 rb.0.122c.6b8b4567.000000000357 [write 3600384~4096] 6.42463fe3)", > > > "received_at": "2012-12-20 23:34:42.012131", > > > "age": "75.428051", > > > "duration": "0.009988", > > > "flag_point": "waiting for sub ops", > > > "client_info": { "client": "client.7636", > > > "tid": 101498}, > > > "events": [ > > > { "time": "2012-12-20 23:34:42.012317", > > > "event": "waiting_for_osdmap"}, > > > { "time": "2012-12-20 23:34:42.012425", > > > "event": "reached_pg"}, > > > { "time": "2012-12-20 23:34:42.012678", > > > "event": "started"}, > > > { "time": "2012-12-20 23:34:42.012720", > > > "event": "started"}, > > > { "time": "2012-12-20 23:34:42.012815", > > > "event": "sub_op_sent"}, > > > { "time": "2012-12-20 23:34:42.012958", > > > "event": "commit_queued_for_journal_write"}, > > > { "time": "2012-12-20 23:34:42.013033", > > > "event": "write_thread_in_journal_buffer"}, > > > { "time": "2012-12-20 23:34:42.013745", > > > "event": "journaled_completion_queued"}, > > > { "time": "2012-12-20 23:34:42.013803", > > > "event": "op_commit"}, > > > { "time": "2012-12-20 23:34:42.017414", > > > "event": "sub_op_commit_rec"}, > > > { "time": "2012-12-20 23:34:42.022066", > > > "event": "op_applied"}, > > > { "time": "2012-12-20 23:34:42.022119", > > > "event": "done"}]}, > > > { "description": "osd_sub_op_reply(client.7636.0:101498 6.1fe3 
42463fe3\/rb.0.122c.6b8b4567.000000000357\/head\/\/6 [] ondisk, result = 0)", > > > "received_at": "2012-12-20 23:34:42.017048", > > > "age": "75.423134", > > > "duration": "0.000411", > > > "flag_point": "started", > > > "events": [ > > > { "time": "2012-12-20 23:34:42.017218", > > > "event": "waiting_for_osdmap"}, > > > { "time": "2012-12-20 23:34:42.017363", > > > "event": "reached_pg"}, > > > { "time": "2012-12-20 23:34:42.017401", > > > "event": "started"}, > > > { "time": "2012-12-20 23:34:42.017459", > > > "event": "done"}]}, > > > { "description": "osd_op(client.7636.0:101502 rb.0.122c.6b8b4567.000000000064 [write 1512448~61440] 6.a241e34d)", > > > "received_at": "2012-12-20 23:34:49.851021", > > > "age": "67.589161", > > > "duration": "0.007114", > > > "flag_point": "waiting for sub ops", > > > "client_info": { "client": "client.7636", > > > "tid": 101502}, > > > "events": [ > > > { "time": "2012-12-20 23:34:49.851276", > > > "event": "waiting_for_osdmap"}, > > > { "time": "2012-12-20 23:34:49.851429", > > > "event": "reached_pg"}, > > > { "time": "2012-12-20 23:34:49.851616", > > > "event": "started"}, > > > { "time": "2012-12-20 23:34:49.851648", > > > "event": "started"}, > > > { "time": "2012-12-20 23:34:49.851712", > > > "event": "sub_op_sent"}, > > > { "time": "2012-12-20 23:34:49.851770", > > > "event": "commit_queued_for_journal_write"}, > > > { "time": "2012-12-20 23:34:49.851915", > > > "event": "write_thread_in_journal_buffer"}, > > > { "time": "2012-12-20 23:34:49.853956", > > > "event": "journaled_completion_queued"}, > > > { "time": "2012-12-20 23:34:49.853992", > > > "event": "op_commit"}, > > > { "time": "2012-12-20 23:34:49.857016", > > > "event": "sub_op_commit_rec"}, > > > { "time": "2012-12-20 23:34:49.858077", > > > "event": "op_applied"}, > > > { "time": "2012-12-20 23:34:49.858135", > > > "event": "done"}]}, > > > { "description": "osd_sub_op_reply(client.7636.0:101502 6.234d a241e34d\/rb.0.122c.6b8b4567.000000000064\/head\/\/6 [] 
ondisk, result = 0)", > > > "received_at": "2012-12-20 23:34:49.856762", > > > "age": "67.583420", > > > "duration": "0.000308", > > > "flag_point": "started", > > > "events": [ > > > { "time": "2012-12-20 23:34:49.856869", > > > "event": "waiting_for_osdmap"}, > > > { "time": "2012-12-20 23:34:49.856969", > > > "event": "reached_pg"}, > > > { "time": "2012-12-20 23:34:49.857001", > > > "event": "started"}, > > > { "time": "2012-12-20 23:34:49.857070", > > > "event": "done"}]}, > > > { "description": "osd_op(client.7636.0:101504 rb.0.122c.6b8b4567.00000000012c [write 1658880~2048] 6.92d2c38c)", > > > "received_at": "2012-12-20 23:35:34.351518", > > > "age": "23.088664", > > > "duration": "0.009430", > > > "flag_point": "waiting for sub ops", > > > "client_info": { "client": "client.7636", > > > "tid": 101504}, > > > "events": [ > > > { "time": "2012-12-20 23:35:34.351714", > > > "event": "waiting_for_osdmap"}, > > > { "time": "2012-12-20 23:35:34.351811", > > > "event": "reached_pg"}, > > > { "time": "2012-12-20 23:35:34.352030", > > > "event": "started"}, > > > { "time": "2012-12-20 23:35:34.352071", > > > "event": "started"}, > > > { "time": "2012-12-20 23:35:34.352146", > > > "event": "sub_op_sent"}, > > > { "time": "2012-12-20 23:35:34.352213", > > > "event": "commit_queued_for_journal_write"}, > > > { "time": "2012-12-20 23:35:34.352287", > > > "event": "write_thread_in_journal_buffer"}, > > > { "time": "2012-12-20 23:35:34.355390", > > > "event": "journaled_completion_queued"}, > > > { "time": "2012-12-20 23:35:34.355475", > > > "event": "op_commit"}, > > > { "time": "2012-12-20 23:35:34.356661", > > > "event": "sub_op_commit_rec"}, > > > { "time": "2012-12-20 23:35:34.360894", > > > "event": "op_applied"}, > > > { "time": "2012-12-20 23:35:34.360948", > > > "event": "done"}]}, > > > { "description": "osd_sub_op_reply(client.7636.0:101504 6.38c 92d2c38c\/rb.0.122c.6b8b4567.00000000012c\/head\/\/6 [] ondisk, result = 0)", > > > "received_at": "2012-12-20 
23:35:34.356286", > > > "age": "23.083896", > > > "duration": "0.000421", > > > "flag_point": "started", > > > "events": [ > > > { "time": "2012-12-20 23:35:34.356489", > > > "event": "waiting_for_osdmap"}, > > > { "time": "2012-12-20 23:35:34.356598", > > > "event": "reached_pg"}, > > > { "time": "2012-12-20 23:35:34.356640", > > > "event": "started"}, > > > { "time": "2012-12-20 23:35:34.356707", > > > "event": "done"}]}, > > > { "description": "osd_op(client.7636.0:101512 rb.0.122c.6b8b4567.000000000357 [write 3731456~4096] 6.42463fe3)", > > > "received_at": "2012-12-20 23:35:42.133103", > > > "age": "15.307079", > > > "duration": "0.008357", > > > "flag_point": "waiting for sub ops", > > > "client_info": { "client": "client.7636", > > > "tid": 101512}, > > > "events": [ > > > { "time": "2012-12-20 23:35:42.133315", > > > "event": "waiting_for_osdmap"}, > > > { "time": "2012-12-20 23:35:42.133427", > > > "event": "reached_pg"}, > > > { "time": "2012-12-20 23:35:42.133684", > > > "event": "started"}, > > > { "time": "2012-12-20 23:35:42.133723", > > > "event": "started"}, > > > { "time": "2012-12-20 23:35:42.133796", > > > "event": "sub_op_sent"}, > > > { "time": "2012-12-20 23:35:42.133908", > > > "event": "commit_queued_for_journal_write"}, > > > { "time": "2012-12-20 23:35:42.133995", > > > "event": "write_thread_in_journal_buffer"}, > > > { "time": "2012-12-20 23:35:42.136965", > > > "event": "journaled_completion_queued"}, > > > { "time": "2012-12-20 23:35:42.137025", > > > "event": "op_commit"}, > > > { "time": "2012-12-20 23:35:42.138456", > > > "event": "sub_op_commit_rec"}, > > > { "time": "2012-12-20 23:35:42.141408", > > > "event": "op_applied"}, > > > { "time": "2012-12-20 23:35:42.141460", > > > "event": "done"}]}, > > > { "description": "osd_sub_op_reply(client.7636.0:101512 6.1fe3 42463fe3\/rb.0.122c.6b8b4567.000000000357\/head\/\/6 [] ondisk, result = 0)", > > > "received_at": "2012-12-20 23:35:42.138087", > > > "age": "15.302095", > > > 
"duration": "0.000410", > > > "flag_point": "started", > > > "events": [ > > > { "time": "2012-12-20 23:35:42.138248", > > > "event": "waiting_for_osdmap"}, > > > { "time": "2012-12-20 23:35:42.138363", > > > "event": "reached_pg"}, > > > { "time": "2012-12-20 23:35:42.138443", > > > "event": "started"}, > > > { "time": "2012-12-20 23:35:42.138497", > > > "event": "done"}]}]} > > > > > > # ceph --admin-daemon ./ceph-osd.24.asok config show { "name": > > > "osd.24", > > > "cluster": "ceph", > > > "none": "0\/5", > > > "lockdep": "0\/1", > > > "context": "0\/1", > > > "crush": "1\/1", > > > "mds": "1\/5", > > > "mds_balancer": "1\/5", > > > "mds_locker": "1\/5", > > > "mds_log": "1\/5", > > > "mds_log_expire": "1\/5", > > > "mds_migrator": "1\/5", > > > "buffer": "0\/1", > > > "timer": "0\/1", > > > "filer": "0\/1", > > > "striper": "0\/1", > > > "objecter": "0\/1", > > > "rados": "0\/5", > > > "rbd": "0\/5", > > > "journaler": "0\/5", > > > "objectcacher": "0\/5", > > > "client": "0\/5", > > > "osd": "0\/5", > > > "optracker": "0\/5", > > > "objclass": "0\/5", > > > "filestore": "1\/3", > > > "journal": "1\/3", > > > "ms": "0\/5", > > > "mon": "1\/5", > > > "monc": "0\/10", > > > "paxos": "0\/5", > > > "tp": "0\/5", > > > "auth": "1\/5", > > > "crypto": "1\/5", > > > "finisher": "1\/1", > > > "heartbeatmap": "1\/5", > > > "perfcounter": "1\/5", > > > "rgw": "1\/5", > > > "hadoop": "1\/5", > > > "javaclient": "1\/5", > > > "asok": "1\/5", > > > "throttle": "1\/1", > > > "host": "KVM04", > > > "fsid": "00000000-0000-0000-0000-000000000000", > > > "public_addr": ":\/0", > > > "cluster_addr": ":\/0", > > > "public_network": "", > > > "cluster_network": "", > > > "num_client": "1", > > > "monmap": "", > > > "mon_host": "", > > > "lockdep": "false", > > > "admin_socket": "\/var\/run\/ceph\/ceph-osd.24.asok", > > > "daemonize": "true", > > > "pid_file": "\/var\/run\/ceph\/osd.24.pid", > > > "chdir": "\/", > > > "max_open_files": "0", > > > "fatal_signal_handlers": "true", 
> > > "log_file": "\/var\/log\/ceph\/osd.24.log", > > > "log_max_new": "1000", > > > "log_max_recent": "100000", > > > "log_to_stderr": "false", > > > "err_to_stderr": "true", > > > "log_to_syslog": "false", > > > "err_to_syslog": "false", > > > "log_flush_on_exit": "true", > > > "clog_to_monitors": "true", > > > "clog_to_syslog": "false", > > > "mon_cluster_log_to_syslog": "false", > > > "mon_cluster_log_file": "\/var\/log\/ceph\/ceph.log", > > > "key": "", > > > "keyfile": "", > > > "keyring": "\/srv\/ceph\/osd\/osd.24\/keyring", > > > "heartbeat_interval": "5", > > > "heartbeat_file": "", > > > "perf": "true", > > > "ms_tcp_nodelay": "true", > > > "ms_initial_backoff": "0.2", > > > "ms_max_backoff": "15", > > > "ms_nocrc": "false", > > > "ms_die_on_bad_msg": "false", > > > "ms_dispatch_throttle_bytes": "104857600", > > > "ms_bind_ipv6": "false", > > > "ms_bind_port_min": "6800", > > > "ms_bind_port_max": "7100", > > > "ms_rwthread_stack_bytes": "1048576", > > > "ms_tcp_read_timeout": "900", > > > "ms_inject_socket_failures": "0", > > > "ms_inject_delay_type": "", > > > "ms_inject_delay_max": "1", > > > "ms_inject_delay_probability": "0", > > > "mon_data": "\/var\/lib\/ceph\/mon\/ceph-24", > > > "mon_initial_members": "", > > > "mon_sync_fs_threshold": "5", > > > "mon_tick_interval": "5", > > > "mon_subscribe_interval": "300", > > > "mon_osd_laggy_halflife": "3600", > > > "mon_osd_laggy_weight": "0.3", > > > "mon_osd_adjust_heartbeat_grace": "true", > > > "mon_osd_adjust_down_out_interval": "true", > > > "mon_osd_auto_mark_in": "false", > > > "mon_osd_auto_mark_auto_out_in": "true", > > > "mon_osd_auto_mark_new_in": "true", > > > "mon_osd_down_out_interval": "300", > > > "mon_osd_min_up_ratio": "0.3", > > > "mon_osd_min_in_ratio": "0.3", > > > "mon_lease": "5", > > > "mon_lease_renew_interval": "3", > > > "mon_lease_ack_timeout": "10", > > > "mon_clock_drift_allowed": "0.05", > > > "mon_clock_drift_warn_backoff": "5", > > > "mon_accept_timeout": "10", > > > 
"mon_pg_create_interval": "30", > > > "mon_pg_stuck_threshold": "300", > > > "mon_osd_full_ratio": "0.95", > > > "mon_osd_nearfull_ratio": "0.85", > > > "mon_globalid_prealloc": "100", > > > "mon_osd_report_timeout": "900", > > > "mon_force_standby_active": "true", > > > "mon_min_osdmap_epochs": "500", > > > "mon_max_pgmap_epochs": "500", > > > "mon_max_log_epochs": "500", > > > "mon_max_osd": "10000", > > > "mon_probe_timeout": "2", > > > "mon_slurp_timeout": "10", > > > "mon_slurp_bytes": "262144", > > > "mon_client_bytes": "104857600", > > > "mon_daemon_bytes": "419430400", > > > "mon_max_log_entries_per_event": "4096", > > > "paxos_max_join_drift": "10", > > > "paxos_propose_interval": "1", > > > "paxos_min_wait": "0.05", > > > "clock_offset": "0", > > > "auth_cluster_required": "none", > > > "auth_service_required": "none", > > > "auth_client_required": "none", > > > "auth_supported": "none", > > > "cephx_require_signatures": "false", > > > "cephx_sign_messages": "true", > > > "auth_mon_ticket_ttl": "43200", > > > "auth_service_ticket_ttl": "3600", > > > "mon_client_hunt_interval": "3", > > > "mon_client_ping_interval": "10", > > > "mon_client_max_log_entries_per_message": "1000", > > > "mon_max_pool_pg_num": "65536", > > > "client_cache_size": "16384", > > > "client_cache_mid": "0.75", > > > "client_use_random_mds": "false", > > > "client_mount_timeout": "30", > > > "client_tick_interval": "1", > > > "client_trace": "", > > > "client_readahead_min": "131072", > > > "client_readahead_max_bytes": "0", > > > "client_readahead_max_periods": "4", > > > "client_snapdir": ".snap", > > > "client_mountpoint": "\/", > > > "client_notify_timeout": "10", > > > "client_caps_release_delay": "5", > > > "client_oc": "true", > > > "client_oc_size": "209715200", > > > "client_oc_max_dirty": "104857600", > > > "client_oc_target_dirty": "8388608", > > > "client_oc_max_dirty_age": "5", > > > "client_oc_max_objects": "1000", > > > "client_debug_force_sync_read": "false", > > > 
"client_debug_inject_tick_delay": "0", > > > "fuse_use_invalidate_cb": "false", > > > "fuse_big_writes": "true", > > > "fuse_debug": "false", > > > "objecter_tick_interval": "5", > > > "objecter_timeout": "10", > > > "objecter_inflight_op_bytes": "104857600", > > > "objecter_inflight_ops": "1024", > > > "journaler_allow_split_entries": "true", > > > "journaler_write_head_interval": "15", > > > "journaler_prefetch_periods": "10", > > > "journaler_prezero_periods": "5", > > > "journaler_batch_interval": "0.001", > > > "journaler_batch_max": "0", > > > "mds_data": "\/var\/lib\/ceph\/mds\/ceph-24", > > > "mds_max_file_size": "1099511627776", > > > "mds_cache_size": "100000", > > > "mds_cache_mid": "0.7", > > > "mds_mem_max": "1048576", > > > "mds_dir_commit_ratio": "0.5", > > > "mds_dir_max_commit_size": "90", > > > "mds_decay_halflife": "5", > > > "mds_beacon_interval": "4", > > > "mds_beacon_grace": "15", > > > "mds_blacklist_interval": "1440", > > > "mds_session_timeout": "60", > > > "mds_session_autoclose": "300", > > > "mds_reconnect_timeout": "45", > > > "mds_tick_interval": "5", > > > "mds_dirstat_min_interval": "1", > > > "mds_scatter_nudge_interval": "5", > > > "mds_client_prealloc_inos": "1000", > > > "mds_early_reply": "true", > > > "mds_use_tmap": "true", > > > "mds_default_dir_hash": "2", > > > "mds_log": "true", > > > "mds_log_skip_corrupt_events": "false", > > > "mds_log_max_events": "-1", > > > "mds_log_max_segments": "30", > > > "mds_log_max_expiring": "20", > > > "mds_bal_sample_interval": "3", > > > "mds_bal_replicate_threshold": "8000", > > > "mds_bal_unreplicate_threshold": "0", > > > "mds_bal_frag": "false", > > > "mds_bal_split_size": "10000", > > > "mds_bal_split_rd": "25000", > > > "mds_bal_split_wr": "10000", > > > "mds_bal_split_bits": "3", > > > "mds_bal_merge_size": "50", > > > "mds_bal_merge_rd": "1000", > > > "mds_bal_merge_wr": "1000", > > > "mds_bal_interval": "10", > > > "mds_bal_fragment_interval": "5", > > > "mds_bal_idle_threshold": 
"0", > > > "mds_bal_max": "-1", > > > "mds_bal_max_until": "-1", > > > "mds_bal_mode": "0", > > > "mds_bal_min_rebalance": "0.1", > > > "mds_bal_min_start": "0.2", > > > "mds_bal_need_min": "0.8", > > > "mds_bal_need_max": "1.2", > > > "mds_bal_midchunk": "0.3", > > > "mds_bal_minchunk": "0.001", > > > "mds_bal_target_removal_min": "5", > > > "mds_bal_target_removal_max": "10", > > > "mds_replay_interval": "1", > > > "mds_shutdown_check": "0", > > > "mds_thrash_exports": "0", > > > "mds_thrash_fragments": "0", > > > "mds_dump_cache_on_map": "false", > > > "mds_dump_cache_after_rejoin": "false", > > > "mds_verify_scatter": "false", > > > "mds_debug_scatterstat": "false", > > > "mds_debug_frag": "false", > > > "mds_debug_auth_pins": "false", > > > "mds_debug_subtrees": "false", > > > "mds_kill_mdstable_at": "0", > > > "mds_kill_export_at": "0", > > > "mds_kill_import_at": "0", > > > "mds_kill_link_at": "0", > > > "mds_kill_rename_at": "0", > > > "mds_wipe_sessions": "false", > > > "mds_wipe_ino_prealloc": "false", > > > "mds_skip_ino": "0", > > > "max_mds": "1", > > > "mds_standby_for_name": "", > > > "mds_standby_for_rank": "-1", > > > "mds_standby_replay": "false", > > > "osd_auto_upgrade_tmap": "true", > > > "osd_tmapput_sets_uses_tmap": "false", > > > "osd_max_backfills": "10", > > > "osd_backfill_full_ratio": "0.85", > > > "osd_backfill_retry_interval": "10", > > > "osd_uuid": "00000000-0000-0000-0000-000000000000", > > > "osd_data": "\/srv\/ceph\/osd\/osd.24", > > > "osd_journal": "\/dev\/sda3", > > > "osd_journal_size": "5120", > > > "osd_max_write_size": "90", > > > "osd_max_pgls": "1024", > > > "osd_client_message_size_cap": "524288000", > > > "osd_pg_bits": "6", > > > "osd_pgp_bits": "6", > > > "osd_min_rep": "1", > > > "osd_max_rep": "10", > > > "osd_pool_default_crush_rule": "0", > > > "osd_pool_default_size": "2", > > > "osd_pool_default_min_size": "0", > > > "osd_pool_default_pg_num": "8", > > > "osd_pool_default_pgp_num": "8", > > > "osd_map_dedup": 
"true", > > > "osd_map_cache_size": "500", > > > "osd_map_message_max": "100", > > > "osd_op_threads": "2", > > > "osd_disk_threads": "1", > > > "osd_recovery_threads": "1", > > > "osd_recover_clone_overlap": "true", > > > "osd_backfill_scan_min": "64", > > > "osd_backfill_scan_max": "512", > > > "osd_op_thread_timeout": "30", > > > "osd_recovery_thread_timeout": "30", > > > "osd_snap_trim_thread_timeout": "3600", > > > "osd_scrub_thread_timeout": "60", > > > "osd_scrub_finalize_thread_timeout": "600", > > > "osd_remove_thread_timeout": "3600", > > > "osd_command_thread_timeout": "600", > > > "osd_age": "0.8", > > > "osd_age_time": "0", > > > "osd_heartbeat_addr": ":\/0", > > > "osd_heartbeat_interval": "6", > > > "osd_heartbeat_grace": "20", > > > "osd_mon_heartbeat_interval": "30", > > > "osd_mon_report_interval_max": "120", > > > "osd_mon_report_interval_min": "5", > > > "osd_mon_ack_timeout": "30", > > > "osd_min_down_reporters": "1", > > > "osd_min_down_reports": "3", > > > "osd_default_data_pool_replay_window": "45", > > > "osd_preserve_trimmed_log": "false", > > > "osd_auto_mark_unfound_lost": "false", > > > "osd_recovery_delay_start": "0", > > > "osd_recovery_max_active": "50", > > > "osd_recovery_max_chunk": "8388608", > > > "osd_recovery_forget_lost_objects": "false", > > > "osd_max_scrubs": "1", > > > "osd_scrub_load_threshold": "0.5", > > > "osd_scrub_min_interval": "300", > > > "osd_scrub_max_interval": "86400", > > > "osd_deep_scrub_interval": "604800", > > > "osd_deep_scrub_stride": "524288", > > > "osd_auto_weight": "false", > > > "osd_class_dir": "\/usr\/local\/lib\/rados-classes", > > > "osd_check_for_log_corruption": "false", > > > "osd_use_stale_snap": "false", > > > "osd_rollback_to_cluster_snap": "", > > > "osd_default_notify_timeout": "30", > > > "osd_kill_backfill_at": "0", > > > "osd_min_pg_log_entries": "1000", > > > "osd_op_complaint_time": "30", > > > "osd_command_max_records": "256", > > > "osd_op_log_threshold": "5", > > > 
"osd_verify_sparse_read_holes": "false", > > > "osd_debug_drop_ping_probability": "0", > > > "osd_debug_drop_ping_duration": "0", > > > "osd_debug_drop_pg_create_probability": "0", > > > "osd_debug_drop_pg_create_duration": "1", > > > "osd_debug_drop_op_probability": "0", > > > "osd_op_history_size": "20", > > > "osd_op_history_duration": "600", > > > "osd_target_transaction_size": "300", > > > "osd_client_op_priority": "63", > > > "osd_recovery_op_priority": "10", > > > "filestore": "false", > > > "filestore_index_retry_probability": "0", > > > "filestore_debug_omap_check": "false", > > > "filestore_xattr_use_omap": "false", > > > "filestore_max_inline_xattr_size": "512", > > > "filestore_max_inline_xattrs": "2", > > > "filestore_max_sync_interval": "5", > > > "filestore_min_sync_interval": "0.01", > > > "filestore_btrfs_snap": "true", > > > "filestore_btrfs_clone_range": "true", > > > "filestore_fsync_flushes_journal_data": "false", > > > "filestore_fiemap": "false", > > > "filestore_flusher": "true", > > > "filestore_flusher_max_fds": "512", > > > "filestore_flush_min": "65536", > > > "filestore_sync_flush": "false", > > > "filestore_journal_parallel": "false", > > > "filestore_journal_writeahead": "false", > > > "filestore_journal_trailing": "false", > > > "filestore_queue_max_ops": "500", > > > "filestore_queue_max_bytes": "104857600", > > > "filestore_queue_committing_max_ops": "500", > > > "filestore_queue_committing_max_bytes": "104857600", > > > "filestore_op_threads": "2", > > > "filestore_op_thread_timeout": "60", > > > "filestore_op_thread_suicide_timeout": "180", > > > "filestore_commit_timeout": "600", > > > "filestore_fiemap_threshold": "4096", > > > "filestore_merge_threshold": "10", > > > "filestore_split_multiple": "2", > > > "filestore_update_to": "1000", > > > "filestore_blackhole": "false", > > > "filestore_dump_file": "", > > > "filestore_kill_at": "0", > > > "filestore_fail_eio": "true", > > > "journal_dio": "true", > > > "journal_aio": 
"false", > > > "journal_block_align": "true", > > > "journal_max_write_bytes": "10485760", > > > "journal_max_write_entries": "100", > > > "journal_queue_max_ops": "500", > > > "journal_queue_max_bytes": "104857600", > > > "journal_align_min_size": "65536", > > > "journal_replay_from": "0", > > > "journal_zero_on_create": "false", > > > "rbd_cache": "false", > > > "rbd_cache_size": "33554432", > > > "rbd_cache_max_dirty": "25165824", > > > "rbd_cache_target_dirty": "16777216", > > > "rbd_cache_max_dirty_age": "1", > > > "nss_db_path": "", > > > "rgw_data": "\/var\/lib\/ceph\/radosgw\/ceph-24", > > > "rgw_enable_apis": "s3, swift, swift_auth, admin", > > > "rgw_cache_enabled": "true", > > > "rgw_cache_lru_size": "10000", > > > "rgw_socket_path": "", > > > "rgw_dns_name": "", > > > "rgw_swift_url": "", > > > "rgw_swift_url_prefix": "swift", > > > "rgw_swift_auth_url": "", > > > "rgw_swift_auth_entry": "auth", > > > "rgw_keystone_url": "", > > > "rgw_keystone_admin_token": "", > > > "rgw_keystone_accepted_roles": "Member, admin", > > > "rgw_keystone_token_cache_size": "10000", > > > "rgw_keystone_revocation_interval": "900", > > > "rgw_admin_entry": "admin", > > > "rgw_enforce_swift_acls": "true", > > > "rgw_swift_token_expiration": "86400", > > > "rgw_print_continue": "true", > > > "rgw_remote_addr_param": "REMOTE_ADDR", > > > "rgw_op_thread_timeout": "600", > > > "rgw_op_thread_suicide_timeout": "0", > > > "rgw_thread_pool_size": "100", > > > "rgw_num_control_oids": "8", > > > "rgw_cluster_root_pool": ".rgw.root", > > > "rgw_log_nonexistent_bucket": "false", > > > "rgw_log_object_name": "%Y-%m-%d-%H-%i-%n", > > > "rgw_log_object_name_utc": "false", > > > "rgw_usage_max_shards": "32", > > > "rgw_usage_max_user_shards": "1", > > > "rgw_enable_ops_log": "true", > > > "rgw_enable_usage_log": "true", > > > "rgw_ops_log_rados": "true", > > > "rgw_ops_log_socket_path": "", > > > "rgw_ops_log_data_backlog": "5242880", > > > "rgw_usage_log_flush_threshold": "1024", > > > 
"rgw_usage_log_tick_interval": "30", > > > "rgw_intent_log_object_name": "%Y-%m-%d-%i-%n", > > > "rgw_intent_log_object_name_utc": "false", > > > "rgw_init_timeout": "30", > > > "rgw_mime_types_file": "\/etc\/mime.types", > > > "rgw_gc_max_objs": "32", > > > "rgw_gc_obj_min_wait": "7200", > > > "rgw_gc_processor_max_time": "3600", > > > "rgw_gc_processor_period": "3600", > > > "rgw_s3_success_create_obj_status": "0", > > > "rgw_resolve_cname": "false", > > > "rgw_obj_stripe_size": "4194304", > > > "rgw_extended_http_attrs": "", > > > "mutex_perf_counter": "false", > > > "internal_safe_to_start_threads": "true"} > > > > > > > > > -----Original Message----- > > > From: Mark Nelson [mailto:mark.nelson@inktank.com] > > > Sent: Thursday, 20 December 2012 11:28 PM > > > To: Matthew Anderson > > > Cc: ceph-devel@vger.kernel.org > > > Subject: Re: OSD's slow down to a crawl > > > > > > Hi Matt, > > > > > > Yeah, your log is looking pretty sparse! I imagine Sam will have a lot of questions for you. Out of curiosity have you looked at any io/cpu/memory stats when things slow down? sysprof may come in handy if it looks like it's CPU related. Also, check out the OSD admin socket. > > > Sebastien Han made a nice little blog entry describing it: > > > > > > http://www.sebastien-han.fr/blog/2012/08/14/ceph-admin-socket/ > > > > > > You can get a lot of information about the current state of things. > > > > > > Mark > > > > > > > -- > > To unsubscribe from this list: send the line "unsubscribe ceph-devel" > > in the body of a message to majordomo@vger.kernel.org More majordomo > > info at http://vger.kernel.org/majordomo-info.html > > > > > ^ permalink raw reply [flat|nested] 13+ messages in thread
* RE: OSD's slow down to a crawl
2012-12-20 17:13 ` Sage Weil
@ 2012-12-21 4:51 ` Matthew Anderson
2012-12-21 16:02 ` Sage Weil
0 siblings, 1 reply; 13+ messages in thread
From: Matthew Anderson @ 2012-12-21 4:51 UTC (permalink / raw)
To: 'Sage Weil'; +Cc: 'Mark Nelson', ceph-devel@vger.kernel.org
[-- Attachment #1: Type: text/plain, Size: 45581 bytes --]
Hi Sage,

I've tried to reproduce the error again with logging on every OSD and got the above. RADOS bench had stalled on a write request like the last time and the attached log is the grep'd OSD log (# cat osd.25.log | grep client.9501.0:744 > freeze.log). The OSD that stalled was 25, pg map is below -

# ceph pg map 6.5d83495b
osdmap e3775 pg 6.5d83495b (6.95b) -> up [25,31] acting [25,31]

I hope that's what you were after, if not just let me know

Thanks again
-Matt

-----Original Message-----
From: Sage Weil [mailto:sage@inktank.com]
Sent: Friday, 21 December 2012 1:14 AM
To: Matthew Anderson
Cc: 'Mark Nelson'; ceph-devel@vger.kernel.org
Subject: RE: OSD's slow down to a crawl

On Thu, 20 Dec 2012, Matthew Anderson wrote:
> Hi Sage,
>
> Logs are attached. I took the osd logs from osd.24 as this is the
> first osd in my SSD pool I've been testing with previously.
>
> The 4MB bench I was able to reproduce the fault by restarting my rbd
> export which stalled after a few percent complete. When I ran the 4MB
> bench it stalled early on and never received a response back from the
> OSD and I terminated it after 60 seconds or so. I wasn't able to
> reproduce the fault using the 4kb io size. The 4kb log should show
> rados bench completing normally at a respectable speed of about 1MB/s.

Let's drill into the hang.. up until that point things look okay.
2012-12-21 00:51:26.033622 7f6f3c042760 1 -- 172.16.0.13:0/1023886 --> 172.16.0.13:6813/22233 -- osd_op(client.9503.0:185 benchmark_data_KVM04_23886_object184 [write 0~4194304] 6.3ca4346e) v4 -- ?+0 0x171ea50 con 0x171a7e0

Do you have a log for that OSD so we can see what happened there? It may also be that the replicated write is hung. If you do

ceph pg map 6.3ca4346e

you can see all OSDs storing that PG. And/or you can grep for client.9503.0:185 in 172.16.0.13:6813/22233's log and see whether the sub_op was sent.

Thanks!
sage

>
> Thanks
> -Matt
>
> -----Original Message-----
> From: ceph-devel-owner@vger.kernel.org
> [mailto:ceph-devel-owner@vger.kernel.org] On Behalf Of Sage Weil
> Sent: Friday, 21 December 2012 12:30 AM
> To: Matthew Anderson
> Cc: 'Mark Nelson'; ceph-devel@vger.kernel.org
> Subject: RE: OSD's slow down to a crawl
>
> Can you do a similar test, but with full logging on?
>
> ceph tell osd.0 injectargs '--debug-ms 1 --debug-filestore 20 --debug-osd 20 --debug-journal 20'
> rados -p ssd bench 30 write -b 4096 -t 1 --log-file /tmp/foo --debug-ms 1
>
> That will be a single IO in flight at a time and very easy to trace through the logs. If you can post the resulting log file (/tmp/foo and from osd.0), that would be awesome.
>
> Thanks!
> sage
>
>
>
> On Thu, 20 Dec 2012, Matthew Anderson wrote:
>
> > # rados bench 60 write -t 256 -p ssd
> > Maintaining 256 concurrent writes of 4194304 bytes for at least 60 seconds.
> > Object prefix: benchmark_data_KVM03_12985
> >   sec Cur ops   started  finished  avg MB/s  cur MB/s  last lat   avg lat
> >     0       0         0         0         0         0         -         0
> >     1      45        45         0         0         0         -         0
> >     2      71        71         0         0         0         -         0
> >     3      90        90         0         0         0         -         0
> >     4     125       125         0         0         0         -         0
> >     5     136       136         0         0         0         -         0
> >     6     144       144         0         0         0         -         0
> >     7     175       175         0         0         0         -         0
> >     8     201       201         0         0         0         -         0
> >     9     217       217         0         0         0         -         0
> >    10     228       228         0         0         0         -         0
> >    11     251       251         0         0         0         -         0
> >    12     255       266        11   3.66581   3.66667   11.7027   11.6013
> >    13     255       271        16   4.92194        20   12.9795   11.9979
> >    14     255       291        36   10.2833        80   13.1795   12.6478
> >    15     255       301        46   12.2639        40   13.9424   12.9123
> >    16     255       303        48   11.9973         8   14.0277   12.9579
> >    17     255       321        66    15.526        72   15.1408   13.4616
> >    18     255       345        90   19.9956        96   15.5673   13.9937
> >    19     255       349        94   19.7851        16    15.707   14.0635
> > 2012-12-21 00:06:37.198982min lat: 11.4904 max lat: 16.4053 avg lat: 14.3121
> >   sec Cur ops   started  finished  avg MB/s  cur MB/s  last lat   avg lat
> >    20     255       361       106   21.1953        48   16.4053   14.3121
> >    21     255       367       112   21.3285        24   17.4263   14.4795
> >    22     255       370       115   20.9043        12   17.3579   14.5544
> >    23     255       387       132   22.9513        68   18.3185   15.0867
> >    24     255       394       139   23.1614        28   18.2868   15.2504
> >    25     255       404       149   23.8346        40    18.741   15.5048
> >    26     255       410       155   23.8407        24    19.032   15.6358
> >    27     255       410       155   22.9577         0         -   15.6358
> >    28     255       410       155   22.1378         0         -   15.6358
> >    29     255       411       156   21.5124   1.33333   22.6927   15.6811
> >    30     255       431       176   23.4614        80   22.6577   16.4822
> >    31     255       447       192   24.7687        64   23.2345   17.0563
> >    32     255       464       209   26.1192        68   23.3788   17.5859
> >    33     255       485       230   27.8726        84   22.8924   18.1122
> >    34     255       497       242   28.4642        48    23.563   18.3693
> >    35     255       508       253   28.9078        44   23.0516   18.5935
> >    36     255       514       259   28.7713        24   24.4415   18.7278
> >    37     255       530       275    29.723        64    23.847   19.0424
> >    38     255       541       286   30.0985        44   24.5757   19.2427
> >    39     255       544       289   29.6344        12   24.6802   19.2984
> > 2012-12-21 00:06:57.203473min lat: 11.4904 max lat: 24.6802 avg lat: 19.2984
> >   sec Cur ops   started  finished  avg MB/s  cur MB/s  last lat   avg lat
> >    40     255       544       289   28.8935         0         -   19.2984
> >    41     255       544       289   28.1888         0         -   19.2984
> >    42     255       544       289   27.5176         0         -   19.2984
> >    43     255       544       289   26.8776         0         -   19.2984
> >    44     255       544       289   26.2668         0         -   19.2984
> >    45     255       544       289   25.6831         0         -   19.2984
> >    46     255       544       289   25.1248         0         -   19.2984
> >    47     255       544       289   24.5902         0         -   19.2984
> >    48     255       544       289   24.0779         0         -   19.2984
> >    49     255       544       289   23.5865         0         -   19.2984
> >    50     255       544       289   23.1148         0         -   19.2984
> >    51     255       544       289   22.6616         0         -   19.2984
> >    52     255       544       289   22.2258         0         -   19.2984
> >    53     255       544       289   21.8065         0         -   19.2984
> >    54     255       544       289   21.4026         0         -   19.2984
> >    55     255       544       289   21.0135         0         -   19.2984
> >    56     255       544       289   20.6383         0         -   19.2984
> >    57     255       544       289   20.2762         0         -   19.2984
> >    58     255       544       289   19.9266         0         -   19.2984
> >    59     255       544       289   19.5889         0         -   19.2984
> > 2012-12-21 00:07:17.207782min lat: 11.4904 max lat: 24.6802 avg lat: 19.2984
> >   sec Cur ops   started  finished  avg MB/s  cur MB/s  last lat   avg lat
> >    60     255       544       289   19.2624         0         -   19.2984
> >    61     255       544       289   18.9466         0         -   19.2984
> >    62     255       544       289    18.641         0         -   19.2984
> >    63     255       544       289   18.3452         0         -   19.2984
> >    64     255       544       289   18.0585         0         -   19.2984
> >    65     255       544       289   17.7807         0         -   19.2984
> >    66     255       544       289   17.5113         0         -   19.2984
> >    67     133       545       412   24.5916   17.5714   37.3574   27.4834
> > Total time run: 68.000740
> > Total writes made: 545
> > Write size: 4194304
> > Bandwidth (MB/sec): 32.058
> >
> > Stddev Bandwidth: 26.8837
> > Max bandwidth (MB/sec): 96
> > Min bandwidth (MB/sec): 0
> > Average Latency: 29.2801
> > Stddev Latency: 11.9768
> > Max latency: 52.6277
> > Min latency: 11.4904
> >
> >
> >
> >
> >
> > ----Original Message-----
> > From: Mark Nelson [mailto:mark.nelson@inktank.com]
> > Sent: Thursday, 20 December 2012 11:59 PM
> > To: Matthew Anderson
> > Cc: ceph-devel@vger.kernel.org
> > Subject: Re: OSD's slow down to a crawl
> >
> > Out of curiosity, if you fire up a rados bench instance on one of the nodes with say, 256 concurrent writes, do any of the writes complete?
> > > > Mark > > > > On 12/20/2012 09:51 AM, Matthew Anderson wrote: > > > Hi Mark, > > > > > > Thanks for the quick reply. The log was 380MB in total so I just posted last 200 lines hoping that a trained eye might be able to spot something. I'm happy to put the file up on a web server if anyone wants to review it in full. > > > > > > It doesn't appear to be memory or CPU related. In it's hung state (both read and write are not responding at the moment, I have a VM hung trying to shutdown and an rbd stuck exporting ) all of the OSD's sit around 0.3% CPU usage, ~300MB of ram and no IO wait. The server has roughly 40GB RAM free and is easily 90% idle. Iotop shows no IO coming from any of the Ceph-OSD's. > > > > > > The admin socket showed no ops in flight but the performance dump, perhistoric ops and config are below. I did notice that the age of some of the ops was quite high considering things were actively trying to write to the cluster. > > > # ceph --admin-daemon ./ceph-osd.24.asok perf dump > > > > > > {"filestore":{"journal_queue_max_ops":500,"journal_queue_ops":0,"j > > > ou > > > rn > > > al_ops":20868,"journal_queue_max_bytes":104857600,"journal_queue_b > > > yt > > > es > > > ":0,"journal_bytes":1710397238,"journal_latency":{"avgcount":20868," > > > su > > > m":185.268022000},"journal_wr":13354,"journal_wr_bytes":{"avgcount": > > > 13 > > > 354,"sum":1770647552},"op_queue_max_ops":500,"op_queue_ops":0,"ops": > > > 20 > > > 868,"op_queue_max_bytes":104857600,"op_queue_bytes":0,"bytes":1710 > > > 23 > > > 25 > > > 83,"apply_latency":{"avgcount":20868,"sum":508.896737000},"committing" > > > :0,"commitcycle":339,"commitcycle_interval":{"avgcount":339,"sum": > > > 17 > > > 08 > > > .284725000},"commitcycle_latency":{"avgcount":339,"sum":13.2418500 > > > 00 > > > }, > > > "journal_full":0},"mutex-FileJournal::completions_lock":{"wait":{" > > > av > > > gc > > > ount":0,"sum":0.000000000}},"mutex-FileJournal::finisher_lock":{"wait" > > > 
:{"avgcount":0,"sum":0.000000000}},"mutex-FileJournal::write_lock" > > > :{ > > > "w > > > ait":{"avgcount":0,"sum":0.000000000}},"mutex-FileJournal::writeq_ > > > lo > > > ck > > > ":{"wait":{"avgcount":0,"sum":0.000000000}},"mutex-JOS::ApplyManager:: > > > apply_lo > > ck":{"wai > > t":{"avgcount":0,"sum":0.000000000}},"mutex-JOS::ApplyManager::com_l > > oc > > k":{"wait":{"avgcount":0,"sum":0.000000000}},"mutex-JOS::SubmitManag > > er > > ::lock":{"wait":{"avgcount":0,"sum":0.000000000}},"osd":{"opq":0,"op > > _w > > ip":0,"op":31753,"op_in_bytes":797806224,"op_out_bytes":1728414048," > > op > > _latency":{"avgcount":31753,"sum":82.183373000},"op_r":24945,"op_r_o > > ut > > _bytes":1728414048,"op_r_latency":{"avgcount":24945,"sum":16.8336890 > > 00 > > },"op_w":6808,"op_w_in_bytes":797806224,"op_w_rlat":{"avgcount":6808," > > sum":38.223006000},"op_w_latency":{"avgcount":6808,"sum":65.34968400 > > 0} > > ,"op_rw":0,"op_rw_in_bytes":0,"op_rw_out_bytes":0,"op_rw_rlat":{"avg > > co > > unt":0,"sum":0.000000000},"op_rw_latency":{"avgcount":0,"sum":0.0000 > > 00 > > 000},"subop":5700,"subop_in_bytes":813224156,"subop_latency":{"avgco > > un > > t":5700,"sum":26.877085000},"subop_w":0,"subop_w_in_bytes":813224156," > > subop_w_latency":{"avgcount":5700,"sum":26.877085000},"subop_pull":0," > > subop_pull_latency":{"avgcount":0,"sum":0.000000000},"subop_push":0, > > "s > > ubop_push_ > > in_bytes" > > :0,"subop_push_latency":{"avgcount":0,"sum":0.000000000},"pull":0,"p > > us > > h":0,"push_out_bytes":0,"push_in":0,"push_in_bytes":0,"recovery_ops" > > :0 > > ,"loadavg":359,"buffer_bytes":0,"numpg":2486,"numpg_primary":1214,"n > > um > > pg_replica":1272,"numpg_stray":0,"heartbeat_to_peers":4,"heartbeat_f > > ro > > m_peers":0,"map_messages":44,"map_message_epochs":184,"map_message_e > > po > > ch_dups":174},"throttle-filestore_bytes":{"val":0,"max":104857600,"g > > et > > ":0,"get_sum":0,"get_or_fail_fail":0,"get_or_fail_success":0,"take": > > 20 > > 
868,"take_sum":1710397238,"put":13354,"put_sum":1710397238,"wait":{" > > av > > gcount":0,"sum":0.000000000}},"throttle-filestore_ops":{"val":0,"max": > > 500,"get":0,"get_sum":0,"get_or_fail_fail":0,"get_or_fail_success":0," > > take":20868,"take_sum":20868,"put":13354,"put_sum":20868,"wait":{"av > > gc > > ount":0,"sum":0.000000000}},"throttle-msgr_dispatch_throttler-client": > > {"val":0,"max":104857600,"get":31657,"get_sum":795546573,"get_or_fai > > l_ > > fail":0,"get_or_fail_success":0,"take":0,"take_sum":0,"put":31657,"p > > ut > > _sum":7955 > > 46573,"wa > > it":{"avgcount":0,"sum":0.000000000}},"throttle-msgr_dispatch_thrott > > le > > r-cluster":{"val":0,"max":104857600,"get":18913,"get_sum":828011293, > > "g > > et_or_fail_fail":0,"get_or_fail_success":0,"take":0,"take_sum":0,"put" > > :18913,"put_sum":828011293,"wait":{"avgcount":0,"sum":0.000000000}}, > > "t > > hrottle-msgr_dispatch_throttler-hbclient":{"val":0,"max":104857600," > > ge > > t":4260,"get_sum":200220,"get_or_fail_fail":0,"get_or_fail_success": > > 0, > > "take":0,"take_sum":0,"put":4260,"put_sum":200220,"wait":{"avgcount" > > :0 > > ,"sum":0.000000000}},"throttle-msgr_dispatch_throttler-hbserver":{"v > > al > > ":0,"max":104857600,"get":4236,"get_sum":199092,"get_or_fail_fail":0," > > get_or_fail_success":0,"take":0,"take_sum":0,"put":4236,"put_sum":19 > > 90 > > 92,"wait":{"avgcount":0,"sum":0.000000000}},"throttle-osd_client_byt > > es > > ":{"val":0,"max":524288000,"get":31308,"get_sum":795353720,"get_or_f > > ai > > l_fail":0,"get_or_fail_success":0,"take":0,"take_sum":0,"put":37671, > > "p ut_sum":795353720,"wait":{"avgcount":0,"sum":0.000000000}}} > > > > > > # ceph --admin-daemon ./ceph-osd.24.asok dump_historic_ops > > > > > > { "description": "osd_op(client.7636.0:101498 rb.0.122c.6b8b4567.000000000357 [write 3600384~4096] 6.42463fe3)", > > > "received_at": "2012-12-20 23:34:42.012131", > > > "age": "75.428051", > > > "duration": "0.009988", > > > "flag_point": "waiting for sub 
ops", > > > "client_info": { "client": "client.7636", > > > "tid": 101498}, > > > "events": [ > > > { "time": "2012-12-20 23:34:42.012317", > > > "event": "waiting_for_osdmap"}, > > > { "time": "2012-12-20 23:34:42.012425", > > > "event": "reached_pg"}, > > > { "time": "2012-12-20 23:34:42.012678", > > > "event": "started"}, > > > { "time": "2012-12-20 23:34:42.012720", > > > "event": "started"}, > > > { "time": "2012-12-20 23:34:42.012815", > > > "event": "sub_op_sent"}, > > > { "time": "2012-12-20 23:34:42.012958", > > > "event": "commit_queued_for_journal_write"}, > > > { "time": "2012-12-20 23:34:42.013033", > > > "event": "write_thread_in_journal_buffer"}, > > > { "time": "2012-12-20 23:34:42.013745", > > > "event": "journaled_completion_queued"}, > > > { "time": "2012-12-20 23:34:42.013803", > > > "event": "op_commit"}, > > > { "time": "2012-12-20 23:34:42.017414", > > > "event": "sub_op_commit_rec"}, > > > { "time": "2012-12-20 23:34:42.022066", > > > "event": "op_applied"}, > > > { "time": "2012-12-20 23:34:42.022119", > > > "event": "done"}]}, > > > { "description": "osd_sub_op_reply(client.7636.0:101498 6.1fe3 42463fe3\/rb.0.122c.6b8b4567.000000000357\/head\/\/6 [] ondisk, result = 0)", > > > "received_at": "2012-12-20 23:34:42.017048", > > > "age": "75.423134", > > > "duration": "0.000411", > > > "flag_point": "started", > > > "events": [ > > > { "time": "2012-12-20 23:34:42.017218", > > > "event": "waiting_for_osdmap"}, > > > { "time": "2012-12-20 23:34:42.017363", > > > "event": "reached_pg"}, > > > { "time": "2012-12-20 23:34:42.017401", > > > "event": "started"}, > > > { "time": "2012-12-20 23:34:42.017459", > > > "event": "done"}]}, > > > { "description": "osd_op(client.7636.0:101502 rb.0.122c.6b8b4567.000000000064 [write 1512448~61440] 6.a241e34d)", > > > "received_at": "2012-12-20 23:34:49.851021", > > > "age": "67.589161", > > > "duration": "0.007114", > > > "flag_point": "waiting for sub ops", > > > "client_info": { "client": "client.7636", > > 
> "tid": 101502}, > > > "events": [ > > > { "time": "2012-12-20 23:34:49.851276", > > > "event": "waiting_for_osdmap"}, > > > { "time": "2012-12-20 23:34:49.851429", > > > "event": "reached_pg"}, > > > { "time": "2012-12-20 23:34:49.851616", > > > "event": "started"}, > > > { "time": "2012-12-20 23:34:49.851648", > > > "event": "started"}, > > > { "time": "2012-12-20 23:34:49.851712", > > > "event": "sub_op_sent"}, > > > { "time": "2012-12-20 23:34:49.851770", > > > "event": "commit_queued_for_journal_write"}, > > > { "time": "2012-12-20 23:34:49.851915", > > > "event": "write_thread_in_journal_buffer"}, > > > { "time": "2012-12-20 23:34:49.853956", > > > "event": "journaled_completion_queued"}, > > > { "time": "2012-12-20 23:34:49.853992", > > > "event": "op_commit"}, > > > { "time": "2012-12-20 23:34:49.857016", > > > "event": "sub_op_commit_rec"}, > > > { "time": "2012-12-20 23:34:49.858077", > > > "event": "op_applied"}, > > > { "time": "2012-12-20 23:34:49.858135", > > > "event": "done"}]}, > > > { "description": "osd_sub_op_reply(client.7636.0:101502 6.234d a241e34d\/rb.0.122c.6b8b4567.000000000064\/head\/\/6 [] ondisk, result = 0)", > > > "received_at": "2012-12-20 23:34:49.856762", > > > "age": "67.583420", > > > "duration": "0.000308", > > > "flag_point": "started", > > > "events": [ > > > { "time": "2012-12-20 23:34:49.856869", > > > "event": "waiting_for_osdmap"}, > > > { "time": "2012-12-20 23:34:49.856969", > > > "event": "reached_pg"}, > > > { "time": "2012-12-20 23:34:49.857001", > > > "event": "started"}, > > > { "time": "2012-12-20 23:34:49.857070", > > > "event": "done"}]}, > > > { "description": "osd_op(client.7636.0:101504 rb.0.122c.6b8b4567.00000000012c [write 1658880~2048] 6.92d2c38c)", > > > "received_at": "2012-12-20 23:35:34.351518", > > > "age": "23.088664", > > > "duration": "0.009430", > > > "flag_point": "waiting for sub ops", > > > "client_info": { "client": "client.7636", > > > "tid": 101504}, > > > "events": [ > > > { "time": 
"2012-12-20 23:35:34.351714", > > > "event": "waiting_for_osdmap"}, > > > { "time": "2012-12-20 23:35:34.351811", > > > "event": "reached_pg"}, > > > { "time": "2012-12-20 23:35:34.352030", > > > "event": "started"}, > > > { "time": "2012-12-20 23:35:34.352071", > > > "event": "started"}, > > > { "time": "2012-12-20 23:35:34.352146", > > > "event": "sub_op_sent"}, > > > { "time": "2012-12-20 23:35:34.352213", > > > "event": "commit_queued_for_journal_write"}, > > > { "time": "2012-12-20 23:35:34.352287", > > > "event": "write_thread_in_journal_buffer"}, > > > { "time": "2012-12-20 23:35:34.355390", > > > "event": "journaled_completion_queued"}, > > > { "time": "2012-12-20 23:35:34.355475", > > > "event": "op_commit"}, > > > { "time": "2012-12-20 23:35:34.356661", > > > "event": "sub_op_commit_rec"}, > > > { "time": "2012-12-20 23:35:34.360894", > > > "event": "op_applied"}, > > > { "time": "2012-12-20 23:35:34.360948", > > > "event": "done"}]}, > > > { "description": "osd_sub_op_reply(client.7636.0:101504 6.38c 92d2c38c\/rb.0.122c.6b8b4567.00000000012c\/head\/\/6 [] ondisk, result = 0)", > > > "received_at": "2012-12-20 23:35:34.356286", > > > "age": "23.083896", > > > "duration": "0.000421", > > > "flag_point": "started", > > > "events": [ > > > { "time": "2012-12-20 23:35:34.356489", > > > "event": "waiting_for_osdmap"}, > > > { "time": "2012-12-20 23:35:34.356598", > > > "event": "reached_pg"}, > > > { "time": "2012-12-20 23:35:34.356640", > > > "event": "started"}, > > > { "time": "2012-12-20 23:35:34.356707", > > > "event": "done"}]}, > > > { "description": "osd_op(client.7636.0:101512 rb.0.122c.6b8b4567.000000000357 [write 3731456~4096] 6.42463fe3)", > > > "received_at": "2012-12-20 23:35:42.133103", > > > "age": "15.307079", > > > "duration": "0.008357", > > > "flag_point": "waiting for sub ops", > > > "client_info": { "client": "client.7636", > > > "tid": 101512}, > > > "events": [ > > > { "time": "2012-12-20 23:35:42.133315", > > > "event": 
"waiting_for_osdmap"}, > > > { "time": "2012-12-20 23:35:42.133427", > > > "event": "reached_pg"}, > > > { "time": "2012-12-20 23:35:42.133684", > > > "event": "started"}, > > > { "time": "2012-12-20 23:35:42.133723", > > > "event": "started"}, > > > { "time": "2012-12-20 23:35:42.133796", > > > "event": "sub_op_sent"}, > > > { "time": "2012-12-20 23:35:42.133908", > > > "event": "commit_queued_for_journal_write"}, > > > { "time": "2012-12-20 23:35:42.133995", > > > "event": "write_thread_in_journal_buffer"}, > > > { "time": "2012-12-20 23:35:42.136965", > > > "event": "journaled_completion_queued"}, > > > { "time": "2012-12-20 23:35:42.137025", > > > "event": "op_commit"}, > > > { "time": "2012-12-20 23:35:42.138456", > > > "event": "sub_op_commit_rec"}, > > > { "time": "2012-12-20 23:35:42.141408", > > > "event": "op_applied"}, > > > { "time": "2012-12-20 23:35:42.141460", > > > "event": "done"}]}, > > > { "description": "osd_sub_op_reply(client.7636.0:101512 6.1fe3 42463fe3\/rb.0.122c.6b8b4567.000000000357\/head\/\/6 [] ondisk, result = 0)", > > > "received_at": "2012-12-20 23:35:42.138087", > > > "age": "15.302095", > > > "duration": "0.000410", > > > "flag_point": "started", > > > "events": [ > > > { "time": "2012-12-20 23:35:42.138248", > > > "event": "waiting_for_osdmap"}, > > > { "time": "2012-12-20 23:35:42.138363", > > > "event": "reached_pg"}, > > > { "time": "2012-12-20 23:35:42.138443", > > > "event": "started"}, > > > { "time": "2012-12-20 23:35:42.138497", > > > "event": "done"}]}]} > > > > > > # ceph --admin-daemon ./ceph-osd.24.asok config show { "name": > > > "osd.24", > > > "cluster": "ceph", > > > "none": "0\/5", > > > "lockdep": "0\/1", > > > "context": "0\/1", > > > "crush": "1\/1", > > > "mds": "1\/5", > > > "mds_balancer": "1\/5", > > > "mds_locker": "1\/5", > > > "mds_log": "1\/5", > > > "mds_log_expire": "1\/5", > > > "mds_migrator": "1\/5", > > > "buffer": "0\/1", > > > "timer": "0\/1", > > > "filer": "0\/1", > > > "striper": "0\/1", > > 
> "objecter": "0\/1", > > > "rados": "0\/5", > > > "rbd": "0\/5", > > > "journaler": "0\/5", > > > "objectcacher": "0\/5", > > > "client": "0\/5", > > > "osd": "0\/5", > > > "optracker": "0\/5", > > > "objclass": "0\/5", > > > "filestore": "1\/3", > > > "journal": "1\/3", > > > "ms": "0\/5", > > > "mon": "1\/5", > > > "monc": "0\/10", > > > "paxos": "0\/5", > > > "tp": "0\/5", > > > "auth": "1\/5", > > > "crypto": "1\/5", > > > "finisher": "1\/1", > > > "heartbeatmap": "1\/5", > > > "perfcounter": "1\/5", > > > "rgw": "1\/5", > > > "hadoop": "1\/5", > > > "javaclient": "1\/5", > > > "asok": "1\/5", > > > "throttle": "1\/1", > > > "host": "KVM04", > > > "fsid": "00000000-0000-0000-0000-000000000000", > > > "public_addr": ":\/0", > > > "cluster_addr": ":\/0", > > > "public_network": "", > > > "cluster_network": "", > > > "num_client": "1", > > > "monmap": "", > > > "mon_host": "", > > > "lockdep": "false", > > > "admin_socket": "\/var\/run\/ceph\/ceph-osd.24.asok", > > > "daemonize": "true", > > > "pid_file": "\/var\/run\/ceph\/osd.24.pid", > > > "chdir": "\/", > > > "max_open_files": "0", > > > "fatal_signal_handlers": "true", > > > "log_file": "\/var\/log\/ceph\/osd.24.log", > > > "log_max_new": "1000", > > > "log_max_recent": "100000", > > > "log_to_stderr": "false", > > > "err_to_stderr": "true", > > > "log_to_syslog": "false", > > > "err_to_syslog": "false", > > > "log_flush_on_exit": "true", > > > "clog_to_monitors": "true", > > > "clog_to_syslog": "false", > > > "mon_cluster_log_to_syslog": "false", > > > "mon_cluster_log_file": "\/var\/log\/ceph\/ceph.log", > > > "key": "", > > > "keyfile": "", > > > "keyring": "\/srv\/ceph\/osd\/osd.24\/keyring", > > > "heartbeat_interval": "5", > > > "heartbeat_file": "", > > > "perf": "true", > > > "ms_tcp_nodelay": "true", > > > "ms_initial_backoff": "0.2", > > > "ms_max_backoff": "15", > > > "ms_nocrc": "false", > > > "ms_die_on_bad_msg": "false", > > > "ms_dispatch_throttle_bytes": "104857600", > > > "ms_bind_ipv6": 
"false", > > > "ms_bind_port_min": "6800", > > > "ms_bind_port_max": "7100", > > > "ms_rwthread_stack_bytes": "1048576", > > > "ms_tcp_read_timeout": "900", > > > "ms_inject_socket_failures": "0", > > > "ms_inject_delay_type": "", > > > "ms_inject_delay_max": "1", > > > "ms_inject_delay_probability": "0", > > > "mon_data": "\/var\/lib\/ceph\/mon\/ceph-24", > > > "mon_initial_members": "", > > > "mon_sync_fs_threshold": "5", > > > "mon_tick_interval": "5", > > > "mon_subscribe_interval": "300", > > > "mon_osd_laggy_halflife": "3600", > > > "mon_osd_laggy_weight": "0.3", > > > "mon_osd_adjust_heartbeat_grace": "true", > > > "mon_osd_adjust_down_out_interval": "true", > > > "mon_osd_auto_mark_in": "false", > > > "mon_osd_auto_mark_auto_out_in": "true", > > > "mon_osd_auto_mark_new_in": "true", > > > "mon_osd_down_out_interval": "300", > > > "mon_osd_min_up_ratio": "0.3", > > > "mon_osd_min_in_ratio": "0.3", > > > "mon_lease": "5", > > > "mon_lease_renew_interval": "3", > > > "mon_lease_ack_timeout": "10", > > > "mon_clock_drift_allowed": "0.05", > > > "mon_clock_drift_warn_backoff": "5", > > > "mon_accept_timeout": "10", > > > "mon_pg_create_interval": "30", > > > "mon_pg_stuck_threshold": "300", > > > "mon_osd_full_ratio": "0.95", > > > "mon_osd_nearfull_ratio": "0.85", > > > "mon_globalid_prealloc": "100", > > > "mon_osd_report_timeout": "900", > > > "mon_force_standby_active": "true", > > > "mon_min_osdmap_epochs": "500", > > > "mon_max_pgmap_epochs": "500", > > > "mon_max_log_epochs": "500", > > > "mon_max_osd": "10000", > > > "mon_probe_timeout": "2", > > > "mon_slurp_timeout": "10", > > > "mon_slurp_bytes": "262144", > > > "mon_client_bytes": "104857600", > > > "mon_daemon_bytes": "419430400", > > > "mon_max_log_entries_per_event": "4096", > > > "paxos_max_join_drift": "10", > > > "paxos_propose_interval": "1", > > > "paxos_min_wait": "0.05", > > > "clock_offset": "0", > > > "auth_cluster_required": "none", > > > "auth_service_required": "none", > > > 
"auth_client_required": "none", > > > "auth_supported": "none", > > > "cephx_require_signatures": "false", > > > "cephx_sign_messages": "true", > > > "auth_mon_ticket_ttl": "43200", > > > "auth_service_ticket_ttl": "3600", > > > "mon_client_hunt_interval": "3", > > > "mon_client_ping_interval": "10", > > > "mon_client_max_log_entries_per_message": "1000", > > > "mon_max_pool_pg_num": "65536", > > > "client_cache_size": "16384", > > > "client_cache_mid": "0.75", > > > "client_use_random_mds": "false", > > > "client_mount_timeout": "30", > > > "client_tick_interval": "1", > > > "client_trace": "", > > > "client_readahead_min": "131072", > > > "client_readahead_max_bytes": "0", > > > "client_readahead_max_periods": "4", > > > "client_snapdir": ".snap", > > > "client_mountpoint": "\/", > > > "client_notify_timeout": "10", > > > "client_caps_release_delay": "5", > > > "client_oc": "true", > > > "client_oc_size": "209715200", > > > "client_oc_max_dirty": "104857600", > > > "client_oc_target_dirty": "8388608", > > > "client_oc_max_dirty_age": "5", > > > "client_oc_max_objects": "1000", > > > "client_debug_force_sync_read": "false", > > > "client_debug_inject_tick_delay": "0", > > > "fuse_use_invalidate_cb": "false", > > > "fuse_big_writes": "true", > > > "fuse_debug": "false", > > > "objecter_tick_interval": "5", > > > "objecter_timeout": "10", > > > "objecter_inflight_op_bytes": "104857600", > > > "objecter_inflight_ops": "1024", > > > "journaler_allow_split_entries": "true", > > > "journaler_write_head_interval": "15", > > > "journaler_prefetch_periods": "10", > > > "journaler_prezero_periods": "5", > > > "journaler_batch_interval": "0.001", > > > "journaler_batch_max": "0", > > > "mds_data": "\/var\/lib\/ceph\/mds\/ceph-24", > > > "mds_max_file_size": "1099511627776", > > > "mds_cache_size": "100000", > > > "mds_cache_mid": "0.7", > > > "mds_mem_max": "1048576", > > > "mds_dir_commit_ratio": "0.5", > > > "mds_dir_max_commit_size": "90", > > > "mds_decay_halflife": "5", 
> > > "mds_beacon_interval": "4", > > > "mds_beacon_grace": "15", > > > "mds_blacklist_interval": "1440", > > > "mds_session_timeout": "60", > > > "mds_session_autoclose": "300", > > > "mds_reconnect_timeout": "45", > > > "mds_tick_interval": "5", > > > "mds_dirstat_min_interval": "1", > > > "mds_scatter_nudge_interval": "5", > > > "mds_client_prealloc_inos": "1000", > > > "mds_early_reply": "true", > > > "mds_use_tmap": "true", > > > "mds_default_dir_hash": "2", > > > "mds_log": "true", > > > "mds_log_skip_corrupt_events": "false", > > > "mds_log_max_events": "-1", > > > "mds_log_max_segments": "30", > > > "mds_log_max_expiring": "20", > > > "mds_bal_sample_interval": "3", > > > "mds_bal_replicate_threshold": "8000", > > > "mds_bal_unreplicate_threshold": "0", > > > "mds_bal_frag": "false", > > > "mds_bal_split_size": "10000", > > > "mds_bal_split_rd": "25000", > > > "mds_bal_split_wr": "10000", > > > "mds_bal_split_bits": "3", > > > "mds_bal_merge_size": "50", > > > "mds_bal_merge_rd": "1000", > > > "mds_bal_merge_wr": "1000", > > > "mds_bal_interval": "10", > > > "mds_bal_fragment_interval": "5", > > > "mds_bal_idle_threshold": "0", > > > "mds_bal_max": "-1", > > > "mds_bal_max_until": "-1", > > > "mds_bal_mode": "0", > > > "mds_bal_min_rebalance": "0.1", > > > "mds_bal_min_start": "0.2", > > > "mds_bal_need_min": "0.8", > > > "mds_bal_need_max": "1.2", > > > "mds_bal_midchunk": "0.3", > > > "mds_bal_minchunk": "0.001", > > > "mds_bal_target_removal_min": "5", > > > "mds_bal_target_removal_max": "10", > > > "mds_replay_interval": "1", > > > "mds_shutdown_check": "0", > > > "mds_thrash_exports": "0", > > > "mds_thrash_fragments": "0", > > > "mds_dump_cache_on_map": "false", > > > "mds_dump_cache_after_rejoin": "false", > > > "mds_verify_scatter": "false", > > > "mds_debug_scatterstat": "false", > > > "mds_debug_frag": "false", > > > "mds_debug_auth_pins": "false", > > > "mds_debug_subtrees": "false", > > > "mds_kill_mdstable_at": "0", > > > "mds_kill_export_at": 
"0", > > > "mds_kill_import_at": "0", > > > "mds_kill_link_at": "0", > > > "mds_kill_rename_at": "0", > > > "mds_wipe_sessions": "false", > > > "mds_wipe_ino_prealloc": "false", > > > "mds_skip_ino": "0", > > > "max_mds": "1", > > > "mds_standby_for_name": "", > > > "mds_standby_for_rank": "-1", > > > "mds_standby_replay": "false", > > > "osd_auto_upgrade_tmap": "true", > > > "osd_tmapput_sets_uses_tmap": "false", > > > "osd_max_backfills": "10", > > > "osd_backfill_full_ratio": "0.85", > > > "osd_backfill_retry_interval": "10", > > > "osd_uuid": "00000000-0000-0000-0000-000000000000", > > > "osd_data": "\/srv\/ceph\/osd\/osd.24", > > > "osd_journal": "\/dev\/sda3", > > > "osd_journal_size": "5120", > > > "osd_max_write_size": "90", > > > "osd_max_pgls": "1024", > > > "osd_client_message_size_cap": "524288000", > > > "osd_pg_bits": "6", > > > "osd_pgp_bits": "6", > > > "osd_min_rep": "1", > > > "osd_max_rep": "10", > > > "osd_pool_default_crush_rule": "0", > > > "osd_pool_default_size": "2", > > > "osd_pool_default_min_size": "0", > > > "osd_pool_default_pg_num": "8", > > > "osd_pool_default_pgp_num": "8", > > > "osd_map_dedup": "true", > > > "osd_map_cache_size": "500", > > > "osd_map_message_max": "100", > > > "osd_op_threads": "2", > > > "osd_disk_threads": "1", > > > "osd_recovery_threads": "1", > > > "osd_recover_clone_overlap": "true", > > > "osd_backfill_scan_min": "64", > > > "osd_backfill_scan_max": "512", > > > "osd_op_thread_timeout": "30", > > > "osd_recovery_thread_timeout": "30", > > > "osd_snap_trim_thread_timeout": "3600", > > > "osd_scrub_thread_timeout": "60", > > > "osd_scrub_finalize_thread_timeout": "600", > > > "osd_remove_thread_timeout": "3600", > > > "osd_command_thread_timeout": "600", > > > "osd_age": "0.8", > > > "osd_age_time": "0", > > > "osd_heartbeat_addr": ":\/0", > > > "osd_heartbeat_interval": "6", > > > "osd_heartbeat_grace": "20", > > > "osd_mon_heartbeat_interval": "30", > > > "osd_mon_report_interval_max": "120", > > > 
"osd_mon_report_interval_min": "5", > > > "osd_mon_ack_timeout": "30", > > > "osd_min_down_reporters": "1", > > > "osd_min_down_reports": "3", > > > "osd_default_data_pool_replay_window": "45", > > > "osd_preserve_trimmed_log": "false", > > > "osd_auto_mark_unfound_lost": "false", > > > "osd_recovery_delay_start": "0", > > > "osd_recovery_max_active": "50", > > > "osd_recovery_max_chunk": "8388608", > > > "osd_recovery_forget_lost_objects": "false", > > > "osd_max_scrubs": "1", > > > "osd_scrub_load_threshold": "0.5", > > > "osd_scrub_min_interval": "300", > > > "osd_scrub_max_interval": "86400", > > > "osd_deep_scrub_interval": "604800", > > > "osd_deep_scrub_stride": "524288", > > > "osd_auto_weight": "false", > > > "osd_class_dir": "\/usr\/local\/lib\/rados-classes", > > > "osd_check_for_log_corruption": "false", > > > "osd_use_stale_snap": "false", > > > "osd_rollback_to_cluster_snap": "", > > > "osd_default_notify_timeout": "30", > > > "osd_kill_backfill_at": "0", > > > "osd_min_pg_log_entries": "1000", > > > "osd_op_complaint_time": "30", > > > "osd_command_max_records": "256", > > > "osd_op_log_threshold": "5", > > > "osd_verify_sparse_read_holes": "false", > > > "osd_debug_drop_ping_probability": "0", > > > "osd_debug_drop_ping_duration": "0", > > > "osd_debug_drop_pg_create_probability": "0", > > > "osd_debug_drop_pg_create_duration": "1", > > > "osd_debug_drop_op_probability": "0", > > > "osd_op_history_size": "20", > > > "osd_op_history_duration": "600", > > > "osd_target_transaction_size": "300", > > > "osd_client_op_priority": "63", > > > "osd_recovery_op_priority": "10", > > > "filestore": "false", > > > "filestore_index_retry_probability": "0", > > > "filestore_debug_omap_check": "false", > > > "filestore_xattr_use_omap": "false", > > > "filestore_max_inline_xattr_size": "512", > > > "filestore_max_inline_xattrs": "2", > > > "filestore_max_sync_interval": "5", > > > "filestore_min_sync_interval": "0.01", > > > "filestore_btrfs_snap": "true", > > > 
"filestore_btrfs_clone_range": "true", > > > "filestore_fsync_flushes_journal_data": "false", > > > "filestore_fiemap": "false", > > > "filestore_flusher": "true", > > > "filestore_flusher_max_fds": "512", > > > "filestore_flush_min": "65536", > > > "filestore_sync_flush": "false", > > > "filestore_journal_parallel": "false", > > > "filestore_journal_writeahead": "false", > > > "filestore_journal_trailing": "false", > > > "filestore_queue_max_ops": "500", > > > "filestore_queue_max_bytes": "104857600", > > > "filestore_queue_committing_max_ops": "500", > > > "filestore_queue_committing_max_bytes": "104857600", > > > "filestore_op_threads": "2", > > > "filestore_op_thread_timeout": "60", > > > "filestore_op_thread_suicide_timeout": "180", > > > "filestore_commit_timeout": "600", > > > "filestore_fiemap_threshold": "4096", > > > "filestore_merge_threshold": "10", > > > "filestore_split_multiple": "2", > > > "filestore_update_to": "1000", > > > "filestore_blackhole": "false", > > > "filestore_dump_file": "", > > > "filestore_kill_at": "0", > > > "filestore_fail_eio": "true", > > > "journal_dio": "true", > > > "journal_aio": "false", > > > "journal_block_align": "true", > > > "journal_max_write_bytes": "10485760", > > > "journal_max_write_entries": "100", > > > "journal_queue_max_ops": "500", > > > "journal_queue_max_bytes": "104857600", > > > "journal_align_min_size": "65536", > > > "journal_replay_from": "0", > > > "journal_zero_on_create": "false", > > > "rbd_cache": "false", > > > "rbd_cache_size": "33554432", > > > "rbd_cache_max_dirty": "25165824", > > > "rbd_cache_target_dirty": "16777216", > > > "rbd_cache_max_dirty_age": "1", > > > "nss_db_path": "", > > > "rgw_data": "\/var\/lib\/ceph\/radosgw\/ceph-24", > > > "rgw_enable_apis": "s3, swift, swift_auth, admin", > > > "rgw_cache_enabled": "true", > > > "rgw_cache_lru_size": "10000", > > > "rgw_socket_path": "", > > > "rgw_dns_name": "", > > > "rgw_swift_url": "", > > > "rgw_swift_url_prefix": "swift", > > > 
"rgw_swift_auth_url": "", > > > "rgw_swift_auth_entry": "auth", > > > "rgw_keystone_url": "", > > > "rgw_keystone_admin_token": "", > > > "rgw_keystone_accepted_roles": "Member, admin", > > > "rgw_keystone_token_cache_size": "10000", > > > "rgw_keystone_revocation_interval": "900", > > > "rgw_admin_entry": "admin", > > > "rgw_enforce_swift_acls": "true", > > > "rgw_swift_token_expiration": "86400", > > > "rgw_print_continue": "true", > > > "rgw_remote_addr_param": "REMOTE_ADDR", > > > "rgw_op_thread_timeout": "600", > > > "rgw_op_thread_suicide_timeout": "0", > > > "rgw_thread_pool_size": "100", > > > "rgw_num_control_oids": "8", > > > "rgw_cluster_root_pool": ".rgw.root", > > > "rgw_log_nonexistent_bucket": "false", > > > "rgw_log_object_name": "%Y-%m-%d-%H-%i-%n", > > > "rgw_log_object_name_utc": "false", > > > "rgw_usage_max_shards": "32", > > > "rgw_usage_max_user_shards": "1", > > > "rgw_enable_ops_log": "true", > > > "rgw_enable_usage_log": "true", > > > "rgw_ops_log_rados": "true", > > > "rgw_ops_log_socket_path": "", > > > "rgw_ops_log_data_backlog": "5242880", > > > "rgw_usage_log_flush_threshold": "1024", > > > "rgw_usage_log_tick_interval": "30", > > > "rgw_intent_log_object_name": "%Y-%m-%d-%i-%n", > > > "rgw_intent_log_object_name_utc": "false", > > > "rgw_init_timeout": "30", > > > "rgw_mime_types_file": "\/etc\/mime.types", > > > "rgw_gc_max_objs": "32", > > > "rgw_gc_obj_min_wait": "7200", > > > "rgw_gc_processor_max_time": "3600", > > > "rgw_gc_processor_period": "3600", > > > "rgw_s3_success_create_obj_status": "0", > > > "rgw_resolve_cname": "false", > > > "rgw_obj_stripe_size": "4194304", > > > "rgw_extended_http_attrs": "", > > > "mutex_perf_counter": "false", > > > "internal_safe_to_start_threads": "true"} > > > > > > > > > -----Original Message----- > > > From: Mark Nelson [mailto:mark.nelson@inktank.com] > > > Sent: Thursday, 20 December 2012 11:28 PM > > > To: Matthew Anderson > > > Cc: ceph-devel@vger.kernel.org > > > Subject: Re: OSD's 
slow down to a crawl > > > > > > Hi Matt, > > > > > > Yeah, your log is looking pretty sparse! I imagine Sam will have a lot of questions for you. Out of curiosity have you looked at any io/cpu/memory stats when things slow down? sysprof may come in handy if it looks like it's CPU related. Also, check out the OSD admin socket. > > > Sebastien Han made a nice little blog entry describing it: > > > > > > http://www.sebastien-han.fr/blog/2012/08/14/ceph-admin-socket/ > > > > > > You can get a lot of information about the current state of things. > > > > > > Mark > > > > > > > -- > > To unsubscribe from this list: send the line "unsubscribe ceph-devel" > > in the body of a message to majordomo@vger.kernel.org More majordomo > > info at http://vger.kernel.org/majordomo-info.html > > > > > [-- Attachment #2: freeze.log --] [-- Type: application/octet-stream, Size: 11078 bytes --] 2012-12-21 12:40:00.770821 7f4d90ff9700 1 -- 172.16.0.13:6814/15146 <== client.9501 172.16.0.13:0/1016365 118 ==== osd_op(client.9501.0:744 benchmark_data_KVM04_16365_object743 [write 0~4194304] 6.5d83495b) v4 ==== 175+0+4194304 (3920014038 0 2072446764) 0x7f4ca6c0d750 con 0x7f4cf40015c0 2012-12-21 12:40:00.770876 7f4d90ff9700 20 osd.25 3775 _dispatch 0x7f4ca6c0d750 osd_op(client.9501.0:744 benchmark_data_KVM04_16365_object743 [write 0~4194304] 6.5d83495b) v4 2012-12-21 12:40:00.770980 7f4d90ff9700 15 osd.25 3775 enqueue_op 0x7f4d5c1f6720 osd_op(client.9501.0:744 benchmark_data_KVM04_16365_object743 [write 0~4194304] 6.5d83495b) v4 2012-12-21 12:40:00.771094 7f4d87ff7700 10 osd.25 3775 dequeue_op 0x7f4d5c1f6720 osd_op(client.9501.0:744 benchmark_data_KVM04_16365_object743 [write 0~4194304] 6.5d83495b) v4 pg pg[6.95b( v 3775'134 (0'0,3775'134] local-les=3774 n=14 ec=656 les/c 3774/3775 3773/3773/3773) [25,31] r=0 lpr=3773 mlcod 3775'133 active+clean] 2012-12-21 12:40:00.771173 7f4d87ff7700 10 osd.25 pg_epoch: 3775 pg[6.95b( v 3775'134 (0'0,3775'134] local-les=3774 n=14 ec=656 les/c 3774/3775 
3773/3773/3773) [25,31] r=0 lpr=3773 mlcod 3775'133 active+clean] do_op osd_op(client.9501.0:744 benchmark_data_KVM04_16365_object743 [write 0~4194304] 6.5d83495b) v4 may_write 2012-12-21 12:40:00.771874 7f4d87ff7700 10 osd.25 pg_epoch: 3775 pg[6.95b( v 3775'134 (0'0,3775'134] local-les=3774 n=15 ec=656 les/c 3774/3775 3773/3773/3773) [25,31] r=0 lpr=3773 mlcod 3775'133 active+clean] append_log log(0'0,3775'134] [3775'135 (0'0) modify 5d83495b/benchmark_data_KVM04_16365_object743/head//6 by client.9501.0:744 2012-12-21 12:40:00.759292] 2012-12-21 12:40:00.771908 7f4d87ff7700 10 osd.25 pg_epoch: 3775 pg[6.95b( v 3775'135 (0'0,3775'135] local-les=3774 n=15 ec=656 les/c 3774/3775 3773/3773/3773) [25,31] r=0 lpr=3773 luod=3775'134 lcod 3775'134 mlcod 3775'133 active+clean] add_log_entry 3775'135 (0'0) modify 5d83495b/benchmark_data_KVM04_16365_object743/head//6 by client.9501.0:744 2012-12-21 12:40:00.759292 2012-12-21 12:40:00.772002 7f4d87ff7700 10 osd.25 pg_epoch: 3775 pg[6.95b( v 3775'135 (0'0,3775'135] local-les=3774 n=15 ec=656 les/c 3774/3775 3773/3773/3773) [25,31] r=0 lpr=3773 luod=3775'134 lcod 3775'134 mlcod 3775'133 active+clean] new_repop rep_tid 13299 on osd_op(client.9501.0:744 benchmark_data_KVM04_16365_object743 [write 0~4194304] 6.5d83495b) v4 2012-12-21 12:40:00.772100 7f4d87ff7700 1 -- 172.16.0.13:6816/15146 --> osd.31 172.16.0.12:6822/17200 -- osd_sub_op(client.9501.0:744 6.95b 5d83495b/benchmark_data_KVM04_16365_object743/head//6 [] v 3775'135 snapset=0=[]:[] snapc=0=[]) v7 -- ?+4194934 0x7f4d78f01010 2012-12-21 12:40:00.772123 7f4d87ff7700 10 osd.25 pg_epoch: 3775 pg[6.95b( v 3775'135 (0'0,3775'135] local-les=3774 n=15 ec=656 les/c 3774/3775 3773/3773/3773) [25,31] r=0 lpr=3773 luod=3775'134 lcod 3775'134 mlcod 3775'133 active+clean] eval_repop repgather(0x7f4d7863bba0 3775'135 rep_tid=13299 wfack=25,31 wfdisk=25,31 op=osd_op(client.9501.0:744 benchmark_data_KVM04_16365_object743 [write 0~4194304] 6.5d83495b) v4) wants=ad 2012-12-21 
12:40:00.772146 7f4d87ff7700 10 osd.25 pg_epoch: 3775 pg[6.95b( v 3775'135 (0'0,3775'135] local-les=3774 n=15 ec=656 les/c 3774/3775 3773/3773/3773) [25,31] r=0 lpr=3773 luod=3775'134 lcod 3775'134 mlcod 3775'133 active+clean] apply_repop applying update on repgather(0x7f4d7863bba0 3775'135 rep_tid=13299 wfack=25,31 wfdisk=25,31 op=osd_op(client.9501.0:744 benchmark_data_KVM04_16365_object743 [write 0~4194304] 6.5d83495b) v4) 2012-12-21 12:40:00.775536 7f4d927fc700 10 osd.25 pg_epoch: 3775 pg[6.95b( v 3775'135 (0'0,3775'135] local-les=3774 n=15 ec=656 les/c 3774/3775 3773/3773/3773) [25,31] r=0 lpr=3773 luod=3775'134 lcod 3775'134 mlcod 3775'133 active+clean] op_applied repgather(0x7f4d7863bba0 applying 3775'135 rep_tid=13299 wfack=25,31 wfdisk=25,31 op=osd_op(client.9501.0:744 benchmark_data_KVM04_16365_object743 [write 0~4194304] 6.5d83495b) v4) 2012-12-21 12:40:00.775704 7f4d927fc700 10 osd.25 pg_epoch: 3775 pg[6.95b( v 3775'135 (0'0,3775'135] local-les=3774 n=15 ec=656 les/c 3774/3775 3773/3773/3773) [25,31] r=0 lpr=3773 luod=3775'134 lcod 3775'134 mlcod 3775'133 active+clean] eval_repop repgather(0x7f4d7863bba0 applied 3775'135 rep_tid=13299 wfack=31 wfdisk=25,31 op=osd_op(client.9501.0:744 benchmark_data_KVM04_16365_object743 [write 0~4194304] 6.5d83495b) v4) wants=ad 2012-12-21 12:40:00.807245 7f4d9cff9700 10 osd.25 pg_epoch: 3775 pg[6.95b( v 3775'135 (0'0,3775'135] local-les=3774 n=15 ec=656 les/c 3774/3775 3773/3773/3773) [25,31] r=0 lpr=3773 luod=3775'134 lcod 3775'134 mlcod 3775'133 active+clean] op_commit repgather(0x7f4d7863bba0 applied 3775'135 rep_tid=13299 wfack=31 wfdisk=25,31 op=osd_op(client.9501.0:744 benchmark_data_KVM04_16365_object743 [write 0~4194304] 6.5d83495b) v4) 2012-12-21 12:40:00.807271 7f4d9cff9700 10 osd.25 pg_epoch: 3775 pg[6.95b( v 3775'135 (0'0,3775'135] local-les=3774 n=15 ec=656 les/c 3774/3775 3773/3773/3773) [25,31] r=0 lpr=3773 mlcod 3775'133 active+clean] eval_repop repgather(0x7f4d7863bba0 applied 3775'135 rep_tid=13299 
wfack=31 wfdisk=31 op=osd_op(client.9501.0:744 benchmark_data_KVM04_16365_object743 [write 0~4194304] 6.5d83495b) v4) wants=ad 2012-12-21 12:40:31.728982 7f4da5c69700 0 log [WRN] : slow request 30.965405 seconds old, received at 2012-12-21 12:40:00.763528: osd_op(client.9501.0:744 benchmark_data_KVM04_16365_object743 [write 0~4194304] 6.5d83495b) v4 currently waiting for sub ops 2012-12-21 12:40:32.522252 7f4d8b7fe700 1 -- 172.16.0.13:6816/15146 <== osd.31 172.16.0.12:6822/17200 6133 ==== osd_sub_op_reply(client.9501.0:744 6.95b 5d83495b/benchmark_data_KVM04_16365_object743/head//6 [] ack, result = 0) v1 ==== 162+0+0 (2474129029 0 0) 0x7f4d500008c0 con 0x7f4d4c000b10 2012-12-21 12:40:32.522284 7f4d8b7fe700 20 osd.25 3775 _dispatch 0x7f4d500008c0 osd_sub_op_reply(client.9501.0:744 6.95b 5d83495b/benchmark_data_KVM04_16365_object743/head//6 [] ack, result = 0) v1 2012-12-21 12:40:32.522390 7f4d8b7fe700 15 osd.25 3775 enqueue_op 0x7f4d442f69e0 osd_sub_op_reply(client.9501.0:744 6.95b 5d83495b/benchmark_data_KVM04_16365_object743/head//6 [] ack, result = 0) v1 2012-12-21 12:40:32.522450 7f4d87ff7700 10 osd.25 3775 dequeue_op 0x7f4d442f69e0 osd_sub_op_reply(client.9501.0:744 6.95b 5d83495b/benchmark_data_KVM04_16365_object743/head//6 [] ack, result = 0) v1 pg pg[6.95b( v 3775'135 (0'0,3775'135] local-les=3774 n=15 ec=656 les/c 3774/3775 3773/3773/3773) [25,31] r=0 lpr=3773 mlcod 3775'133 active+clean] 2012-12-21 12:40:32.522533 7f4d87ff7700 7 osd.25 pg_epoch: 3775 pg[6.95b( v 3775'135 (0'0,3775'135] local-les=3774 n=15 ec=656 les/c 3774/3775 3773/3773/3773) [25,31] r=0 lpr=3773 mlcod 3775'133 active+clean] repop_ack rep_tid 13299 op osd_op(client.9501.0:744 benchmark_data_KVM04_16365_object743 [write 0~4194304] 6.5d83495b) v4 result 0 ack_type 1 from osd.31 2012-12-21 12:40:32.522565 7f4d87ff7700 10 osd.25 pg_epoch: 3775 pg[6.95b( v 3775'135 (0'0,3775'135] local-les=3774 n=15 ec=656 les/c 3774/3775 3773/3773/3773) [25,31] r=0 lpr=3773 mlcod 3775'133 active+clean] 
eval_repop repgather(0x7f4d7863bba0 applied 3775'135 rep_tid=13299 wfack= wfdisk=31 op=osd_op(client.9501.0:744 benchmark_data_KVM04_16365_object743 [write 0~4194304] 6.5d83495b) v4) wants=ad
2012-12-21 12:40:32.522582 7f4d87ff7700 10 osd.25 pg_epoch: 3775 pg[6.95b( v 3775'135 (0'0,3775'135] local-les=3774 n=15 ec=656 les/c 3774/3775 3773/3773/3773) [25,31] r=0 lpr=3773 mlcod 3775'133 active+clean] sending ack on repgather(0x7f4d7863bba0 applied 3775'135 rep_tid=13299 wfack= wfdisk=31 op=osd_op(client.9501.0:744 benchmark_data_KVM04_16365_object743 [write 0~4194304] 6.5d83495b) v4) 0x7f4d791ab010
2012-12-21 12:40:32.616813 7f4d8b7fe700 1 -- 172.16.0.13:6816/15146 <== osd.31 172.16.0.12:6822/17200 6138 ==== osd_sub_op_reply(client.9501.0:744 6.95b 5d83495b/benchmark_data_KVM04_16365_object743/head//6 [] ondisk, result = 0) v1 ==== 162+0+0 (2032311727 0 0) 0x7f4d50001850 con 0x7f4d4c000b10
2012-12-21 12:40:32.616826 7f4d8b7fe700 20 osd.25 3775 _dispatch 0x7f4d50001850 osd_sub_op_reply(client.9501.0:744 6.95b 5d83495b/benchmark_data_KVM04_16365_object743/head//6 [] ondisk, result = 0) v1
2012-12-21 12:40:32.616888 7f4d8b7fe700 15 osd.25 3775 enqueue_op 0x7f4d44d047a0 osd_sub_op_reply(client.9501.0:744 6.95b 5d83495b/benchmark_data_KVM04_16365_object743/head//6 [] ondisk, result = 0) v1
2012-12-21 12:40:32.616918 7f4d877f6700 10 osd.25 3775 dequeue_op 0x7f4d44d047a0 osd_sub_op_reply(client.9501.0:744 6.95b 5d83495b/benchmark_data_KVM04_16365_object743/head//6 [] ondisk, result = 0) v1 pg pg[6.95b( v 3775'135 (0'0,3775'135] local-les=3774 n=15 ec=656 les/c 3774/3775 3773/3773/3773) [25,31] r=0 lpr=3773 mlcod 3775'133 active+clean]
2012-12-21 12:40:32.616995 7f4d877f6700 7 osd.25 pg_epoch: 3775 pg[6.95b( v 3775'135 (0'0,3775'135] local-les=3774 n=15 ec=656 les/c 3774/3775 3773/3773/3773) [25,31] r=0 lpr=3773 mlcod 3775'133 active+clean] repop_ack rep_tid 13299 op osd_op(client.9501.0:744 benchmark_data_KVM04_16365_object743 [write 0~4194304] 6.5d83495b) v4 result 0 ack_type 4 from osd.31
2012-12-21 12:40:32.617032 7f4d877f6700 10 osd.25 pg_epoch: 3775 pg[6.95b( v 3775'135 (0'0,3775'135] local-les=3774 n=15 ec=656 les/c 3774/3775 3773/3773/3773) [25,31] r=0 lpr=3773 mlcod 3775'133 active+clean] eval_repop repgather(0x7f4d7863bba0 applied 3775'135 rep_tid=13299 wfack= wfdisk= op=osd_op(client.9501.0:744 benchmark_data_KVM04_16365_object743 [write 0~4194304] 6.5d83495b) v4) wants=ad
2012-12-21 12:40:32.617052 7f4d877f6700 15 osd.25 pg_epoch: 3775 pg[6.95b( v 3775'135 (0'0,3775'135] local-les=3774 n=15 ec=656 les/c 3774/3775 3773/3773/3773) [25,31] r=0 lpr=3773 mlcod 3775'133 active+clean] log_op_stats osd_op(client.9501.0:744 benchmark_data_KVM04_16365_object743 [write 0~4194304] 6.5d83495b) v4 inb 4194451 outb 0 rlat 31.759090 lat 31.853523
2012-12-21 12:40:32.617085 7f4d877f6700 10 osd.25 pg_epoch: 3775 pg[6.95b( v 3775'135 (0'0,3775'135] local-les=3774 n=15 ec=656 les/c 3774/3775 3773/3773/3773) [25,31] r=0 lpr=3773 mlcod 3775'133 active+clean] sending commit on repgather(0x7f4d7863bba0 applied 3775'135 rep_tid=13299 wfack= wfdisk= op=osd_op(client.9501.0:744 benchmark_data_KVM04_16365_object743 [write 0~4194304] 6.5d83495b) v4) 0x7f4d816be810
2012-12-21 12:40:32.617121 7f4d877f6700 10 osd.25 pg_epoch: 3775 pg[6.95b( v 3775'135 (0'0,3775'135] local-les=3774 n=15 ec=656 les/c 3774/3775 3773/3773/3773) [25,31] r=0 lpr=3773 mlcod 3775'134 active+clean] removing repgather(0x7f4d7863bba0 applied 3775'135 rep_tid=13299 wfack= wfdisk= op=osd_op(client.9501.0:744 benchmark_data_KVM04_16365_object743 [write 0~4194304] 6.5d83495b) v4)
2012-12-21 12:40:32.617146 7f4d877f6700 20 osd.25 pg_epoch: 3775 pg[6.95b( v 3775'135 (0'0,3775'135] local-les=3774 n=15 ec=656 les/c 3774/3775 3773/3773/3773) [25,31] r=0 lpr=3773 mlcod 3775'134 active+clean] q front is repgather(0x7f4d7863bba0 applied 3775'135 rep_tid=13299 wfack= wfdisk= op=osd_op(client.9501.0:744 benchmark_data_KVM04_16365_object743 [write 0~4194304] 6.5d83495b) v4)
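The log_op_stats entry in the log above reports the request latency directly (rlat 31.759090, lat 31.853523 for client.9501.0:744). As a rough aid for scanning a full OSD debug log for similarly slow ops, a minimal Python sketch follows. The function name slow_ops, the 1-second threshold and the regular expression are illustrative assumptions, not part of Ceph; the pattern is modelled only on the single log_op_stats line shown in this thread and may need adjusting for other versions or debug levels.

```python
import re

# Pattern modelled on the one log_op_stats line quoted in this thread
# (an assumption; other Ceph versions may format this line differently).
LOG_OP_STATS = re.compile(
    r"log_op_stats osd_op\((?P<op>\S+) .*?"
    r"inb (?P<inb>\d+) outb (?P<outb>\d+) "
    r"rlat (?P<rlat>[\d.]+) lat (?P<lat>[\d.]+)"
)


def slow_ops(lines, threshold=1.0):
    """Yield (op id, rlat, lat) for log_op_stats entries with lat >= threshold seconds.

    `slow_ops` and `threshold` are hypothetical names chosen for this sketch.
    """
    for line in lines:
        match = LOG_OP_STATS.search(line)
        if match and float(match.group("lat")) >= threshold:
            yield (
                match.group("op"),
                float(match.group("rlat")),
                float(match.group("lat")),
            )


if __name__ == "__main__":
    # Tail of the log_op_stats entry from the osd.25 log above.
    sample = (
        "log_op_stats osd_op(client.9501.0:744 "
        "benchmark_data_KVM04_16365_object743 [write 0~4194304] 6.5d83495b) v4 "
        "inb 4194451 outb 0 rlat 31.759090 lat 31.853523"
    )
    for op, rlat, lat in slow_ops([sample]):
        print(op, rlat, lat)
```

Feeding it an open log file (for example `slow_ops(open("osd.25.log"))`) would list every op whose logged latency exceeds the threshold, which may help narrow down when the stall begins.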
* RE: OSD's slow down to a crawl
2012-12-21 4:51 ` Matthew Anderson
@ 2012-12-21 16:02 ` Sage Weil
2013-01-09 8:52 ` Matthew Anderson
0 siblings, 1 reply; 13+ messages in thread
From: Sage Weil @ 2012-12-21 16:02 UTC (permalink / raw)
To: Matthew Anderson; +Cc: 'Mark Nelson', ceph-devel@vger.kernel.org
[-- Attachment #1: Type: TEXT/PLAIN, Size: 47800 bytes --]
On Fri, 21 Dec 2012, Matthew Anderson wrote:
> Hi Sage,
>
> I've tried to reproduce the error again with logging on every OSD and got the above. RADOS bench had stalled on a write request like the last time and the attached log is the grep'd OSD log (# cat osd.25.log | grep client.9501.0:744 > freeze.log) . The OSD that stalled was 25, pg map is below -
>
> # ceph pg map 6.5d83495b
> osdmap e3775 pg 6.5d83495b (6.95b) -> up [25,31] acting [25,31]
>
> I hope that's what you were after, if not just let me know

We're getting closer. The osd tried to send the reply. Can you reproduce with 'debug ms = 20' on the osds too, and on the client side do something like

rados --debug-ms 20 --debug-objecter 20 --log-file /tmp/foo ...

Thanks!
sage

>
> Thanks again
> -Matt
>
>
> -----Original Message-----
> From: Sage Weil [mailto:sage@inktank.com]
> Sent: Friday, 21 December 2012 1:14 AM
> To: Matthew Anderson
> Cc: 'Mark Nelson'; ceph-devel@vger.kernel.org
> Subject: RE: OSD's slow down to a crawl
>
> On Thu, 20 Dec 2012, Matthew Anderson wrote:
> > Hi Sage,
> >
> > Logs are attached. I took the osd logs from osd.24 as this is the
> > first osd in my SSD pool I've been testing with previously.
> >
> > The 4MB bench I was able to reproduce the fault by restarting my rbd
> > export which stalled after a few percent complete. When I ran the 4MB
> > bench it stalled early on and never received a response back from the
> > OSD and I terminated it after 60 seconds or so. I wasn't able to
> > reproduce the fault using the 4kb io size. The 4kb log should show
> > rados bench completing normally at a respectable speed of about 1MB/s.
>
> Let's drill into the hang.. up until that point things look okay.
>
> 2012-12-21 00:51:26.033622 7f6f3c042760 1 -- 172.16.0.13:0/1023886 --> 172.16.0.13:6813/22233 -- osd_op(client.9503.0:185 benchmark_data_KVM04_23886_object184 [write 0~4194304] 6.3ca4346e) v4 -- ?+0 0x171ea50 con 0x171a7e0
>
> Do you have a log for that OSD so we can see what happened there? It may also be that the replicated write is hung. If you do
>
> ceph pg map 6.3ca4346e
>
> you can see all OSDs storing that PG. And/or you can grep for
> client.9503.0:185 in 172.16.0.13:6813/22233's log and see whether the sub_op was sent.
>
> Thanks!
> sage
>
>
> >
> > Thanks
> > -Matt
> >
> > -----Original Message-----
> > From: ceph-devel-owner@vger.kernel.org
> > [mailto:ceph-devel-owner@vger.kernel.org] On Behalf Of Sage Weil
> > Sent: Friday, 21 December 2012 12:30 AM
> > To: Matthew Anderson
> > Cc: 'Mark Nelson'; ceph-devel@vger.kernel.org
> > Subject: RE: OSD's slow down to a crawl
> >
> > Can you do a similar test, but with full logging on?
> >
> > ceph tell osd.0 injectargs '--debug-ms 1 --debug-filestore 20 --debug-osd 20 --debug-journal 20'
> > rados -p ssd bench 30 write -b 4096 -t 1 --log-file /tmp/foo --debug-ms 1
> >
> > That will be a single IO in flight at a time and very easy to trace through the logs. If you can post the resulting log file (/tmp/foo and from osd.0), that would be awesome.
> >
> > Thanks!
> > sage
> >
> >
> >
> > On Thu, 20 Dec 2012, Matthew Anderson wrote:
> >
> > > # rados bench 60 write -t 256 -p ssd
> > > Maintaining 256 concurrent writes of 4194304 bytes for at least 60 seconds.
> > > Object prefix: benchmark_data_KVM03_12985
> > >   sec Cur ops   started  finished  avg MB/s  cur MB/s  last lat   avg lat
> > >     0       0         0         0         0         0         -         0
> > >     1      45        45         0         0         0         -         0
> > >     2      71        71         0         0         0         -         0
> > >     3      90        90         0         0         0         -         0
> > >     4     125       125         0         0         0         -         0
> > >     5     136       136         0         0         0         -         0
> > >     6     144       144         0         0         0         -         0
> > >     7     175       175         0         0         0         -         0
> > >     8     201       201         0         0         0         -         0
> > >     9     217       217         0         0         0         -         0
> > >    10     228       228         0         0         0         -         0
> > >    11     251       251         0         0         0         -         0
> > >    12     255       266        11   3.66581   3.66667   11.7027   11.6013
> > >    13     255       271        16   4.92194        20   12.9795   11.9979
> > >    14     255       291        36   10.2833        80   13.1795   12.6478
> > >    15     255       301        46   12.2639        40   13.9424   12.9123
> > >    16     255       303        48   11.9973         8   14.0277   12.9579
> > >    17     255       321        66    15.526        72   15.1408   13.4616
> > >    18     255       345        90   19.9956        96   15.5673   13.9937
> > >    19     255       349        94   19.7851        16    15.707   14.0635
> > > 2012-12-21 00:06:37.198982min lat: 11.4904 max lat: 16.4053 avg lat: 14.3121
> > >   sec Cur ops   started  finished  avg MB/s  cur MB/s  last lat   avg lat
> > >    20     255       361       106   21.1953        48   16.4053   14.3121
> > >    21     255       367       112   21.3285        24   17.4263   14.4795
> > >    22     255       370       115   20.9043        12   17.3579   14.5544
> > >    23     255       387       132   22.9513        68   18.3185   15.0867
> > >    24     255       394       139   23.1614        28   18.2868   15.2504
> > >    25     255       404       149   23.8346        40    18.741   15.5048
> > >    26     255       410       155   23.8407        24    19.032   15.6358
> > >    27     255       410       155   22.9577         0         -   15.6358
> > >    28     255       410       155   22.1378         0         -   15.6358
> > >    29     255       411       156   21.5124   1.33333   22.6927   15.6811
> > >    30     255       431       176   23.4614        80   22.6577   16.4822
> > >    31     255       447       192   24.7687        64   23.2345   17.0563
> > >    32     255       464       209   26.1192        68   23.3788   17.5859
> > >    33     255       485       230   27.8726        84   22.8924   18.1122
> > >    34     255       497       242   28.4642        48    23.563   18.3693
> > >    35     255       508       253   28.9078        44   23.0516   18.5935
> > >    36     255       514       259   28.7713        24   24.4415   18.7278
> > >    37     255       530       275    29.723        64    23.847   19.0424
> > >    38     255       541       286   30.0985        44   24.5757   19.2427
> > >    39     255       544       289   29.6344        12   24.6802   19.2984
> > > 2012-12-21 00:06:57.203473min lat: 11.4904 max lat: 24.6802 avg lat: 19.2984
> > >   sec Cur ops   started  finished  avg MB/s  cur MB/s  last lat   avg lat
> > >    40     255       544       289   28.8935         0         -   19.2984
> > >    41     255       544       289   28.1888         0         -   19.2984
> > >    42     255       544       289   27.5176         0         -   19.2984
> > >    43     255       544       289   26.8776         0         -   19.2984
> > >    44     255       544       289   26.2668         0         -   19.2984
> > >    45     255       544       289   25.6831         0         -   19.2984
> > >    46     255       544       289   25.1248         0         -   19.2984
> > >    47     255       544       289   24.5902         0         -   19.2984
> > >    48     255       544       289   24.0779         0         -   19.2984
> > >    49     255       544       289   23.5865         0         -   19.2984
> > >    50     255       544       289   23.1148         0         -   19.2984
> > >    51     255       544       289   22.6616         0         -   19.2984
> > >    52     255       544       289   22.2258         0         -   19.2984
> > >    53     255       544       289   21.8065         0         -   19.2984
> > >    54     255       544       289   21.4026         0         -   19.2984
> > >    55     255       544       289   21.0135         0         -   19.2984
> > >    56     255       544       289   20.6383         0         -   19.2984
> > >    57     255       544       289   20.2762         0         -   19.2984
> > >    58     255       544       289   19.9266         0         -   19.2984
> > >    59     255       544       289   19.5889         0         -   19.2984
> > > 2012-12-21 00:07:17.207782min lat: 11.4904 max lat: 24.6802 avg lat: 19.2984
> > >   sec Cur ops   started  finished  avg MB/s  cur MB/s  last lat   avg lat
> > >    60     255       544       289   19.2624         0         -   19.2984
> > >    61     255       544       289   18.9466         0         -   19.2984
> > >    62     255       544       289    18.641         0         -   19.2984
> > >    63     255       544       289   18.3452         0         -   19.2984
> > >    64     255       544       289   18.0585         0         -   19.2984
> > >    65     255       544       289   17.7807         0         -   19.2984
> > >    66     255       544       289   17.5113         0         -   19.2984
> > >    67     133       545       412   24.5916   17.5714   37.3574   27.4834
> > > Total time run: 68.000740
> > > Total writes made: 545
> > > Write size: 4194304
> > > Bandwidth (MB/sec): 32.058
> > >
> > > Stddev Bandwidth: 26.8837
> > > Max bandwidth (MB/sec): 96
> > > Min bandwidth (MB/sec): 0
> > > Average Latency: 29.2801
> > > Stddev Latency: 11.9768
> > > Max latency: 52.6277
> > > Min latency: 11.4904
> > >
> > >
> > > -----Original Message-----
> > > From: Mark Nelson [mailto:mark.nelson@inktank.com]
> > > Sent: Thursday, 20 December 2012 11:59 PM
> > > To: Matthew Anderson
> > > Cc: ceph-devel@vger.kernel.org
> > > Subject: Re: OSD's slow down to a crawl
> > >
> > > Out of curiosity, if you fire up a rados bench instance on one of the nodes with say, 256 concurrent writes, do any of the writes complete?
> > >
> > > Mark
> > >
> > > On 12/20/2012 09:51 AM, Matthew Anderson wrote:
> > > > Hi Mark,
> > > >
> > > > Thanks for the quick reply. The log was 380MB in total so I just posted the last 200 lines hoping that a trained eye might be able to spot something. I'm happy to put the file up on a web server if anyone wants to review it in full.
> > > >
> > > > It doesn't appear to be memory or CPU related. In its hung state (both read and write are not responding at the moment, I have a VM hung trying to shutdown and an rbd stuck exporting) all of the OSD's sit around 0.3% CPU usage, ~300MB of ram and no IO wait. The server has roughly 40GB RAM free and is easily 90% idle. Iotop shows no IO coming from any of the Ceph-OSD's.
> > > >
> > > > The admin socket showed no ops in flight but the performance dump, historic ops and config are below. I did notice that the age of some of the ops was quite high considering things were actively trying to write to the cluster.
> > > > # ceph --admin-daemon ./ceph-osd.24.asok perf dump > > > > > > > > {"filestore":{"journal_queue_max_ops":500,"journal_queue_ops":0,"j > > > > ou > > > > rn > > > > al_ops":20868,"journal_queue_max_bytes":104857600,"journal_queue_b > > > > yt > > > > es > > > > ":0,"journal_bytes":1710397238,"journal_latency":{"avgcount":20868," > > > > su > > > > m":185.268022000},"journal_wr":13354,"journal_wr_bytes":{"avgcount": > > > > 13 > > > > 354,"sum":1770647552},"op_queue_max_ops":500,"op_queue_ops":0,"ops": > > > > 20 > > > > 868,"op_queue_max_bytes":104857600,"op_queue_bytes":0,"bytes":1710 > > > > 23 > > > > 25 > > > > 83,"apply_latency":{"avgcount":20868,"sum":508.896737000},"committing" > > > > :0,"commitcycle":339,"commitcycle_interval":{"avgcount":339,"sum": > > > > 17 > > > > 08 > > > > .284725000},"commitcycle_latency":{"avgcount":339,"sum":13.2418500 > > > > 00 > > > > }, > > > > "journal_full":0},"mutex-FileJournal::completions_lock":{"wait":{" > > > > av > > > > gc > > > > ount":0,"sum":0.000000000}},"mutex-FileJournal::finisher_lock":{"wait" > > > > :{"avgcount":0,"sum":0.000000000}},"mutex-FileJournal::write_lock" > > > > :{ > > > > "w > > > > ait":{"avgcount":0,"sum":0.000000000}},"mutex-FileJournal::writeq_ > > > > lo > > > > ck > > > > ":{"wait":{"avgcount":0,"sum":0.000000000}},"mutex-JOS::ApplyManager:: > > > > apply_lo > > > ck":{"wai > > > t":{"avgcount":0,"sum":0.000000000}},"mutex-JOS::ApplyManager::com_l > > > oc > > > k":{"wait":{"avgcount":0,"sum":0.000000000}},"mutex-JOS::SubmitManag > > > er > > > ::lock":{"wait":{"avgcount":0,"sum":0.000000000}},"osd":{"opq":0,"op > > > _w > > > ip":0,"op":31753,"op_in_bytes":797806224,"op_out_bytes":1728414048," > > > op > > > _latency":{"avgcount":31753,"sum":82.183373000},"op_r":24945,"op_r_o > > > ut > > > _bytes":1728414048,"op_r_latency":{"avgcount":24945,"sum":16.8336890 > > > 00 > > > },"op_w":6808,"op_w_in_bytes":797806224,"op_w_rlat":{"avgcount":6808," > > > 
sum":38.223006000},"op_w_latency":{"avgcount":6808,"sum":65.34968400 > > > 0} > > > ,"op_rw":0,"op_rw_in_bytes":0,"op_rw_out_bytes":0,"op_rw_rlat":{"avg > > > co > > > unt":0,"sum":0.000000000},"op_rw_latency":{"avgcount":0,"sum":0.0000 > > > 00 > > > 000},"subop":5700,"subop_in_bytes":813224156,"subop_latency":{"avgco > > > un > > > t":5700,"sum":26.877085000},"subop_w":0,"subop_w_in_bytes":813224156," > > > subop_w_latency":{"avgcount":5700,"sum":26.877085000},"subop_pull":0," > > > subop_pull_latency":{"avgcount":0,"sum":0.000000000},"subop_push":0, > > > "s > > > ubop_push_ > > > in_bytes" > > > :0,"subop_push_latency":{"avgcount":0,"sum":0.000000000},"pull":0,"p > > > us > > > h":0,"push_out_bytes":0,"push_in":0,"push_in_bytes":0,"recovery_ops" > > > :0 > > > ,"loadavg":359,"buffer_bytes":0,"numpg":2486,"numpg_primary":1214,"n > > > um > > > pg_replica":1272,"numpg_stray":0,"heartbeat_to_peers":4,"heartbeat_f > > > ro > > > m_peers":0,"map_messages":44,"map_message_epochs":184,"map_message_e > > > po > > > ch_dups":174},"throttle-filestore_bytes":{"val":0,"max":104857600,"g > > > et > > > ":0,"get_sum":0,"get_or_fail_fail":0,"get_or_fail_success":0,"take": > > > 20 > > > 868,"take_sum":1710397238,"put":13354,"put_sum":1710397238,"wait":{" > > > av > > > gcount":0,"sum":0.000000000}},"throttle-filestore_ops":{"val":0,"max": > > > 500,"get":0,"get_sum":0,"get_or_fail_fail":0,"get_or_fail_success":0," > > > take":20868,"take_sum":20868,"put":13354,"put_sum":20868,"wait":{"av > > > gc > > > ount":0,"sum":0.000000000}},"throttle-msgr_dispatch_throttler-client": > > > {"val":0,"max":104857600,"get":31657,"get_sum":795546573,"get_or_fai > > > l_ > > > fail":0,"get_or_fail_success":0,"take":0,"take_sum":0,"put":31657,"p > > > ut > > > _sum":7955 > > > 46573,"wa > > > it":{"avgcount":0,"sum":0.000000000}},"throttle-msgr_dispatch_thrott > > > le > > > r-cluster":{"val":0,"max":104857600,"get":18913,"get_sum":828011293, > > > "g > > > 
et_or_fail_fail":0,"get_or_fail_success":0,"take":0,"take_sum":0,"put" > > > :18913,"put_sum":828011293,"wait":{"avgcount":0,"sum":0.000000000}}, > > > "t > > > hrottle-msgr_dispatch_throttler-hbclient":{"val":0,"max":104857600," > > > ge > > > t":4260,"get_sum":200220,"get_or_fail_fail":0,"get_or_fail_success": > > > 0, > > > "take":0,"take_sum":0,"put":4260,"put_sum":200220,"wait":{"avgcount" > > > :0 > > > ,"sum":0.000000000}},"throttle-msgr_dispatch_throttler-hbserver":{"v > > > al > > > ":0,"max":104857600,"get":4236,"get_sum":199092,"get_or_fail_fail":0," > > > get_or_fail_success":0,"take":0,"take_sum":0,"put":4236,"put_sum":19 > > > 90 > > > 92,"wait":{"avgcount":0,"sum":0.000000000}},"throttle-osd_client_byt > > > es > > > ":{"val":0,"max":524288000,"get":31308,"get_sum":795353720,"get_or_f > > > ai > > > l_fail":0,"get_or_fail_success":0,"take":0,"take_sum":0,"put":37671, > > > "p ut_sum":795353720,"wait":{"avgcount":0,"sum":0.000000000}}} > > > > > > > > # ceph --admin-daemon ./ceph-osd.24.asok dump_historic_ops > > > > > > > > { "description": "osd_op(client.7636.0:101498 rb.0.122c.6b8b4567.000000000357 [write 3600384~4096] 6.42463fe3)", > > > > "received_at": "2012-12-20 23:34:42.012131", > > > > "age": "75.428051", > > > > "duration": "0.009988", > > > > "flag_point": "waiting for sub ops", > > > > "client_info": { "client": "client.7636", > > > > "tid": 101498}, > > > > "events": [ > > > > { "time": "2012-12-20 23:34:42.012317", > > > > "event": "waiting_for_osdmap"}, > > > > { "time": "2012-12-20 23:34:42.012425", > > > > "event": "reached_pg"}, > > > > { "time": "2012-12-20 23:34:42.012678", > > > > "event": "started"}, > > > > { "time": "2012-12-20 23:34:42.012720", > > > > "event": "started"}, > > > > { "time": "2012-12-20 23:34:42.012815", > > > > "event": "sub_op_sent"}, > > > > { "time": "2012-12-20 23:34:42.012958", > > > > "event": "commit_queued_for_journal_write"}, > > > > { "time": "2012-12-20 23:34:42.013033", > > > > "event": 
"write_thread_in_journal_buffer"}, > > > > { "time": "2012-12-20 23:34:42.013745", > > > > "event": "journaled_completion_queued"}, > > > > { "time": "2012-12-20 23:34:42.013803", > > > > "event": "op_commit"}, > > > > { "time": "2012-12-20 23:34:42.017414", > > > > "event": "sub_op_commit_rec"}, > > > > { "time": "2012-12-20 23:34:42.022066", > > > > "event": "op_applied"}, > > > > { "time": "2012-12-20 23:34:42.022119", > > > > "event": "done"}]}, > > > > { "description": "osd_sub_op_reply(client.7636.0:101498 6.1fe3 42463fe3\/rb.0.122c.6b8b4567.000000000357\/head\/\/6 [] ondisk, result = 0)", > > > > "received_at": "2012-12-20 23:34:42.017048", > > > > "age": "75.423134", > > > > "duration": "0.000411", > > > > "flag_point": "started", > > > > "events": [ > > > > { "time": "2012-12-20 23:34:42.017218", > > > > "event": "waiting_for_osdmap"}, > > > > { "time": "2012-12-20 23:34:42.017363", > > > > "event": "reached_pg"}, > > > > { "time": "2012-12-20 23:34:42.017401", > > > > "event": "started"}, > > > > { "time": "2012-12-20 23:34:42.017459", > > > > "event": "done"}]}, > > > > { "description": "osd_op(client.7636.0:101502 rb.0.122c.6b8b4567.000000000064 [write 1512448~61440] 6.a241e34d)", > > > > "received_at": "2012-12-20 23:34:49.851021", > > > > "age": "67.589161", > > > > "duration": "0.007114", > > > > "flag_point": "waiting for sub ops", > > > > "client_info": { "client": "client.7636", > > > > "tid": 101502}, > > > > "events": [ > > > > { "time": "2012-12-20 23:34:49.851276", > > > > "event": "waiting_for_osdmap"}, > > > > { "time": "2012-12-20 23:34:49.851429", > > > > "event": "reached_pg"}, > > > > { "time": "2012-12-20 23:34:49.851616", > > > > "event": "started"}, > > > > { "time": "2012-12-20 23:34:49.851648", > > > > "event": "started"}, > > > > { "time": "2012-12-20 23:34:49.851712", > > > > "event": "sub_op_sent"}, > > > > { "time": "2012-12-20 23:34:49.851770", > > > > "event": "commit_queued_for_journal_write"}, > > > > { "time": "2012-12-20 
23:34:49.851915", > > > > "event": "write_thread_in_journal_buffer"}, > > > > { "time": "2012-12-20 23:34:49.853956", > > > > "event": "journaled_completion_queued"}, > > > > { "time": "2012-12-20 23:34:49.853992", > > > > "event": "op_commit"}, > > > > { "time": "2012-12-20 23:34:49.857016", > > > > "event": "sub_op_commit_rec"}, > > > > { "time": "2012-12-20 23:34:49.858077", > > > > "event": "op_applied"}, > > > > { "time": "2012-12-20 23:34:49.858135", > > > > "event": "done"}]}, > > > > { "description": "osd_sub_op_reply(client.7636.0:101502 6.234d a241e34d\/rb.0.122c.6b8b4567.000000000064\/head\/\/6 [] ondisk, result = 0)", > > > > "received_at": "2012-12-20 23:34:49.856762", > > > > "age": "67.583420", > > > > "duration": "0.000308", > > > > "flag_point": "started", > > > > "events": [ > > > > { "time": "2012-12-20 23:34:49.856869", > > > > "event": "waiting_for_osdmap"}, > > > > { "time": "2012-12-20 23:34:49.856969", > > > > "event": "reached_pg"}, > > > > { "time": "2012-12-20 23:34:49.857001", > > > > "event": "started"}, > > > > { "time": "2012-12-20 23:34:49.857070", > > > > "event": "done"}]}, > > > > { "description": "osd_op(client.7636.0:101504 rb.0.122c.6b8b4567.00000000012c [write 1658880~2048] 6.92d2c38c)", > > > > "received_at": "2012-12-20 23:35:34.351518", > > > > "age": "23.088664", > > > > "duration": "0.009430", > > > > "flag_point": "waiting for sub ops", > > > > "client_info": { "client": "client.7636", > > > > "tid": 101504}, > > > > "events": [ > > > > { "time": "2012-12-20 23:35:34.351714", > > > > "event": "waiting_for_osdmap"}, > > > > { "time": "2012-12-20 23:35:34.351811", > > > > "event": "reached_pg"}, > > > > { "time": "2012-12-20 23:35:34.352030", > > > > "event": "started"}, > > > > { "time": "2012-12-20 23:35:34.352071", > > > > "event": "started"}, > > > > { "time": "2012-12-20 23:35:34.352146", > > > > "event": "sub_op_sent"}, > > > > { "time": "2012-12-20 23:35:34.352213", > > > > "event": 
"commit_queued_for_journal_write"}, > > > > { "time": "2012-12-20 23:35:34.352287", > > > > "event": "write_thread_in_journal_buffer"}, > > > > { "time": "2012-12-20 23:35:34.355390", > > > > "event": "journaled_completion_queued"}, > > > > { "time": "2012-12-20 23:35:34.355475", > > > > "event": "op_commit"}, > > > > { "time": "2012-12-20 23:35:34.356661", > > > > "event": "sub_op_commit_rec"}, > > > > { "time": "2012-12-20 23:35:34.360894", > > > > "event": "op_applied"}, > > > > { "time": "2012-12-20 23:35:34.360948", > > > > "event": "done"}]}, > > > > { "description": "osd_sub_op_reply(client.7636.0:101504 6.38c 92d2c38c\/rb.0.122c.6b8b4567.00000000012c\/head\/\/6 [] ondisk, result = 0)", > > > > "received_at": "2012-12-20 23:35:34.356286", > > > > "age": "23.083896", > > > > "duration": "0.000421", > > > > "flag_point": "started", > > > > "events": [ > > > > { "time": "2012-12-20 23:35:34.356489", > > > > "event": "waiting_for_osdmap"}, > > > > { "time": "2012-12-20 23:35:34.356598", > > > > "event": "reached_pg"}, > > > > { "time": "2012-12-20 23:35:34.356640", > > > > "event": "started"}, > > > > { "time": "2012-12-20 23:35:34.356707", > > > > "event": "done"}]}, > > > > { "description": "osd_op(client.7636.0:101512 rb.0.122c.6b8b4567.000000000357 [write 3731456~4096] 6.42463fe3)", > > > > "received_at": "2012-12-20 23:35:42.133103", > > > > "age": "15.307079", > > > > "duration": "0.008357", > > > > "flag_point": "waiting for sub ops", > > > > "client_info": { "client": "client.7636", > > > > "tid": 101512}, > > > > "events": [ > > > > { "time": "2012-12-20 23:35:42.133315", > > > > "event": "waiting_for_osdmap"}, > > > > { "time": "2012-12-20 23:35:42.133427", > > > > "event": "reached_pg"}, > > > > { "time": "2012-12-20 23:35:42.133684", > > > > "event": "started"}, > > > > { "time": "2012-12-20 23:35:42.133723", > > > > "event": "started"}, > > > > { "time": "2012-12-20 23:35:42.133796", > > > > "event": "sub_op_sent"}, > > > > { "time": "2012-12-20 
23:35:42.133908", > > > > "event": "commit_queued_for_journal_write"}, > > > > { "time": "2012-12-20 23:35:42.133995", > > > > "event": "write_thread_in_journal_buffer"}, > > > > { "time": "2012-12-20 23:35:42.136965", > > > > "event": "journaled_completion_queued"}, > > > > { "time": "2012-12-20 23:35:42.137025", > > > > "event": "op_commit"}, > > > > { "time": "2012-12-20 23:35:42.138456", > > > > "event": "sub_op_commit_rec"}, > > > > { "time": "2012-12-20 23:35:42.141408", > > > > "event": "op_applied"}, > > > > { "time": "2012-12-20 23:35:42.141460", > > > > "event": "done"}]}, > > > > { "description": "osd_sub_op_reply(client.7636.0:101512 6.1fe3 42463fe3\/rb.0.122c.6b8b4567.000000000357\/head\/\/6 [] ondisk, result = 0)", > > > > "received_at": "2012-12-20 23:35:42.138087", > > > > "age": "15.302095", > > > > "duration": "0.000410", > > > > "flag_point": "started", > > > > "events": [ > > > > { "time": "2012-12-20 23:35:42.138248", > > > > "event": "waiting_for_osdmap"}, > > > > { "time": "2012-12-20 23:35:42.138363", > > > > "event": "reached_pg"}, > > > > { "time": "2012-12-20 23:35:42.138443", > > > > "event": "started"}, > > > > { "time": "2012-12-20 23:35:42.138497", > > > > "event": "done"}]}]} > > > > > > > > # ceph --admin-daemon ./ceph-osd.24.asok config show { "name": > > > > "osd.24", > > > > "cluster": "ceph", > > > > "none": "0\/5", > > > > "lockdep": "0\/1", > > > > "context": "0\/1", > > > > "crush": "1\/1", > > > > "mds": "1\/5", > > > > "mds_balancer": "1\/5", > > > > "mds_locker": "1\/5", > > > > "mds_log": "1\/5", > > > > "mds_log_expire": "1\/5", > > > > "mds_migrator": "1\/5", > > > > "buffer": "0\/1", > > > > "timer": "0\/1", > > > > "filer": "0\/1", > > > > "striper": "0\/1", > > > > "objecter": "0\/1", > > > > "rados": "0\/5", > > > > "rbd": "0\/5", > > > > "journaler": "0\/5", > > > > "objectcacher": "0\/5", > > > > "client": "0\/5", > > > > "osd": "0\/5", > > > > "optracker": "0\/5", > > > > "objclass": "0\/5", > > > > "filestore": 
"1\/3", > > > > "journal": "1\/3", > > > > "ms": "0\/5", > > > > "mon": "1\/5", > > > > "monc": "0\/10", > > > > "paxos": "0\/5", > > > > "tp": "0\/5", > > > > "auth": "1\/5", > > > > "crypto": "1\/5", > > > > "finisher": "1\/1", > > > > "heartbeatmap": "1\/5", > > > > "perfcounter": "1\/5", > > > > "rgw": "1\/5", > > > > "hadoop": "1\/5", > > > > "javaclient": "1\/5", > > > > "asok": "1\/5", > > > > "throttle": "1\/1", > > > > "host": "KVM04", > > > > "fsid": "00000000-0000-0000-0000-000000000000", > > > > "public_addr": ":\/0", > > > > "cluster_addr": ":\/0", > > > > "public_network": "", > > > > "cluster_network": "", > > > > "num_client": "1", > > > > "monmap": "", > > > > "mon_host": "", > > > > "lockdep": "false", > > > > "admin_socket": "\/var\/run\/ceph\/ceph-osd.24.asok", > > > > "daemonize": "true", > > > > "pid_file": "\/var\/run\/ceph\/osd.24.pid", > > > > "chdir": "\/", > > > > "max_open_files": "0", > > > > "fatal_signal_handlers": "true", > > > > "log_file": "\/var\/log\/ceph\/osd.24.log", > > > > "log_max_new": "1000", > > > > "log_max_recent": "100000", > > > > "log_to_stderr": "false", > > > > "err_to_stderr": "true", > > > > "log_to_syslog": "false", > > > > "err_to_syslog": "false", > > > > "log_flush_on_exit": "true", > > > > "clog_to_monitors": "true", > > > > "clog_to_syslog": "false", > > > > "mon_cluster_log_to_syslog": "false", > > > > "mon_cluster_log_file": "\/var\/log\/ceph\/ceph.log", > > > > "key": "", > > > > "keyfile": "", > > > > "keyring": "\/srv\/ceph\/osd\/osd.24\/keyring", > > > > "heartbeat_interval": "5", > > > > "heartbeat_file": "", > > > > "perf": "true", > > > > "ms_tcp_nodelay": "true", > > > > "ms_initial_backoff": "0.2", > > > > "ms_max_backoff": "15", > > > > "ms_nocrc": "false", > > > > "ms_die_on_bad_msg": "false", > > > > "ms_dispatch_throttle_bytes": "104857600", > > > > "ms_bind_ipv6": "false", > > > > "ms_bind_port_min": "6800", > > > > "ms_bind_port_max": "7100", > > > > "ms_rwthread_stack_bytes": "1048576", > 
> > > "ms_tcp_read_timeout": "900", > > > > "ms_inject_socket_failures": "0", > > > > "ms_inject_delay_type": "", > > > > "ms_inject_delay_max": "1", > > > > "ms_inject_delay_probability": "0", > > > > "mon_data": "\/var\/lib\/ceph\/mon\/ceph-24", > > > > "mon_initial_members": "", > > > > "mon_sync_fs_threshold": "5", > > > > "mon_tick_interval": "5", > > > > "mon_subscribe_interval": "300", > > > > "mon_osd_laggy_halflife": "3600", > > > > "mon_osd_laggy_weight": "0.3", > > > > "mon_osd_adjust_heartbeat_grace": "true", > > > > "mon_osd_adjust_down_out_interval": "true", > > > > "mon_osd_auto_mark_in": "false", > > > > "mon_osd_auto_mark_auto_out_in": "true", > > > > "mon_osd_auto_mark_new_in": "true", > > > > "mon_osd_down_out_interval": "300", > > > > "mon_osd_min_up_ratio": "0.3", > > > > "mon_osd_min_in_ratio": "0.3", > > > > "mon_lease": "5", > > > > "mon_lease_renew_interval": "3", > > > > "mon_lease_ack_timeout": "10", > > > > "mon_clock_drift_allowed": "0.05", > > > > "mon_clock_drift_warn_backoff": "5", > > > > "mon_accept_timeout": "10", > > > > "mon_pg_create_interval": "30", > > > > "mon_pg_stuck_threshold": "300", > > > > "mon_osd_full_ratio": "0.95", > > > > "mon_osd_nearfull_ratio": "0.85", > > > > "mon_globalid_prealloc": "100", > > > > "mon_osd_report_timeout": "900", > > > > "mon_force_standby_active": "true", > > > > "mon_min_osdmap_epochs": "500", > > > > "mon_max_pgmap_epochs": "500", > > > > "mon_max_log_epochs": "500", > > > > "mon_max_osd": "10000", > > > > "mon_probe_timeout": "2", > > > > "mon_slurp_timeout": "10", > > > > "mon_slurp_bytes": "262144", > > > > "mon_client_bytes": "104857600", > > > > "mon_daemon_bytes": "419430400", > > > > "mon_max_log_entries_per_event": "4096", > > > > "paxos_max_join_drift": "10", > > > > "paxos_propose_interval": "1", > > > > "paxos_min_wait": "0.05", > > > > "clock_offset": "0", > > > > "auth_cluster_required": "none", > > > > "auth_service_required": "none", > > > > "auth_client_required": "none", > 
> > > "auth_supported": "none", > > > > "cephx_require_signatures": "false", > > > > "cephx_sign_messages": "true", > > > > "auth_mon_ticket_ttl": "43200", > > > > "auth_service_ticket_ttl": "3600", > > > > "mon_client_hunt_interval": "3", > > > > "mon_client_ping_interval": "10", > > > > "mon_client_max_log_entries_per_message": "1000", > > > > "mon_max_pool_pg_num": "65536", > > > > "client_cache_size": "16384", > > > > "client_cache_mid": "0.75", > > > > "client_use_random_mds": "false", > > > > "client_mount_timeout": "30", > > > > "client_tick_interval": "1", > > > > "client_trace": "", > > > > "client_readahead_min": "131072", > > > > "client_readahead_max_bytes": "0", > > > > "client_readahead_max_periods": "4", > > > > "client_snapdir": ".snap", > > > > "client_mountpoint": "\/", > > > > "client_notify_timeout": "10", > > > > "client_caps_release_delay": "5", > > > > "client_oc": "true", > > > > "client_oc_size": "209715200", > > > > "client_oc_max_dirty": "104857600", > > > > "client_oc_target_dirty": "8388608", > > > > "client_oc_max_dirty_age": "5", > > > > "client_oc_max_objects": "1000", > > > > "client_debug_force_sync_read": "false", > > > > "client_debug_inject_tick_delay": "0", > > > > "fuse_use_invalidate_cb": "false", > > > > "fuse_big_writes": "true", > > > > "fuse_debug": "false", > > > > "objecter_tick_interval": "5", > > > > "objecter_timeout": "10", > > > > "objecter_inflight_op_bytes": "104857600", > > > > "objecter_inflight_ops": "1024", > > > > "journaler_allow_split_entries": "true", > > > > "journaler_write_head_interval": "15", > > > > "journaler_prefetch_periods": "10", > > > > "journaler_prezero_periods": "5", > > > > "journaler_batch_interval": "0.001", > > > > "journaler_batch_max": "0", > > > > "mds_data": "\/var\/lib\/ceph\/mds\/ceph-24", > > > > "mds_max_file_size": "1099511627776", > > > > "mds_cache_size": "100000", > > > > "mds_cache_mid": "0.7", > > > > "mds_mem_max": "1048576", > > > > "mds_dir_commit_ratio": "0.5", > > > > 
"mds_dir_max_commit_size": "90", > > > > "mds_decay_halflife": "5", > > > > "mds_beacon_interval": "4", > > > > "mds_beacon_grace": "15", > > > > "mds_blacklist_interval": "1440", > > > > "mds_session_timeout": "60", > > > > "mds_session_autoclose": "300", > > > > "mds_reconnect_timeout": "45", > > > > "mds_tick_interval": "5", > > > > "mds_dirstat_min_interval": "1", > > > > "mds_scatter_nudge_interval": "5", > > > > "mds_client_prealloc_inos": "1000", > > > > "mds_early_reply": "true", > > > > "mds_use_tmap": "true", > > > > "mds_default_dir_hash": "2", > > > > "mds_log": "true", > > > > "mds_log_skip_corrupt_events": "false", > > > > "mds_log_max_events": "-1", > > > > "mds_log_max_segments": "30", > > > > "mds_log_max_expiring": "20", > > > > "mds_bal_sample_interval": "3", > > > > "mds_bal_replicate_threshold": "8000", > > > > "mds_bal_unreplicate_threshold": "0", > > > > "mds_bal_frag": "false", > > > > "mds_bal_split_size": "10000", > > > > "mds_bal_split_rd": "25000", > > > > "mds_bal_split_wr": "10000", > > > > "mds_bal_split_bits": "3", > > > > "mds_bal_merge_size": "50", > > > > "mds_bal_merge_rd": "1000", > > > > "mds_bal_merge_wr": "1000", > > > > "mds_bal_interval": "10", > > > > "mds_bal_fragment_interval": "5", > > > > "mds_bal_idle_threshold": "0", > > > > "mds_bal_max": "-1", > > > > "mds_bal_max_until": "-1", > > > > "mds_bal_mode": "0", > > > > "mds_bal_min_rebalance": "0.1", > > > > "mds_bal_min_start": "0.2", > > > > "mds_bal_need_min": "0.8", > > > > "mds_bal_need_max": "1.2", > > > > "mds_bal_midchunk": "0.3", > > > > "mds_bal_minchunk": "0.001", > > > > "mds_bal_target_removal_min": "5", > > > > "mds_bal_target_removal_max": "10", > > > > "mds_replay_interval": "1", > > > > "mds_shutdown_check": "0", > > > > "mds_thrash_exports": "0", > > > > "mds_thrash_fragments": "0", > > > > "mds_dump_cache_on_map": "false", > > > > "mds_dump_cache_after_rejoin": "false", > > > > "mds_verify_scatter": "false", > > > > "mds_debug_scatterstat": "false", > 
> > > "mds_debug_frag": "false", > > > > "mds_debug_auth_pins": "false", > > > > "mds_debug_subtrees": "false", > > > > "mds_kill_mdstable_at": "0", > > > > "mds_kill_export_at": "0", > > > > "mds_kill_import_at": "0", > > > > "mds_kill_link_at": "0", > > > > "mds_kill_rename_at": "0", > > > > "mds_wipe_sessions": "false", > > > > "mds_wipe_ino_prealloc": "false", > > > > "mds_skip_ino": "0", > > > > "max_mds": "1", > > > > "mds_standby_for_name": "", > > > > "mds_standby_for_rank": "-1", > > > > "mds_standby_replay": "false", > > > > "osd_auto_upgrade_tmap": "true", > > > > "osd_tmapput_sets_uses_tmap": "false", > > > > "osd_max_backfills": "10", > > > > "osd_backfill_full_ratio": "0.85", > > > > "osd_backfill_retry_interval": "10", > > > > "osd_uuid": "00000000-0000-0000-0000-000000000000", > > > > "osd_data": "\/srv\/ceph\/osd\/osd.24", > > > > "osd_journal": "\/dev\/sda3", > > > > "osd_journal_size": "5120", > > > > "osd_max_write_size": "90", > > > > "osd_max_pgls": "1024", > > > > "osd_client_message_size_cap": "524288000", > > > > "osd_pg_bits": "6", > > > > "osd_pgp_bits": "6", > > > > "osd_min_rep": "1", > > > > "osd_max_rep": "10", > > > > "osd_pool_default_crush_rule": "0", > > > > "osd_pool_default_size": "2", > > > > "osd_pool_default_min_size": "0", > > > > "osd_pool_default_pg_num": "8", > > > > "osd_pool_default_pgp_num": "8", > > > > "osd_map_dedup": "true", > > > > "osd_map_cache_size": "500", > > > > "osd_map_message_max": "100", > > > > "osd_op_threads": "2", > > > > "osd_disk_threads": "1", > > > > "osd_recovery_threads": "1", > > > > "osd_recover_clone_overlap": "true", > > > > "osd_backfill_scan_min": "64", > > > > "osd_backfill_scan_max": "512", > > > > "osd_op_thread_timeout": "30", > > > > "osd_recovery_thread_timeout": "30", > > > > "osd_snap_trim_thread_timeout": "3600", > > > > "osd_scrub_thread_timeout": "60", > > > > "osd_scrub_finalize_thread_timeout": "600", > > > > "osd_remove_thread_timeout": "3600", > > > > 
"osd_command_thread_timeout": "600", > > > > "osd_age": "0.8", > > > > "osd_age_time": "0", > > > > "osd_heartbeat_addr": ":\/0", > > > > "osd_heartbeat_interval": "6", > > > > "osd_heartbeat_grace": "20", > > > > "osd_mon_heartbeat_interval": "30", > > > > "osd_mon_report_interval_max": "120", > > > > "osd_mon_report_interval_min": "5", > > > > "osd_mon_ack_timeout": "30", > > > > "osd_min_down_reporters": "1", > > > > "osd_min_down_reports": "3", > > > > "osd_default_data_pool_replay_window": "45", > > > > "osd_preserve_trimmed_log": "false", > > > > "osd_auto_mark_unfound_lost": "false", > > > > "osd_recovery_delay_start": "0", > > > > "osd_recovery_max_active": "50", > > > > "osd_recovery_max_chunk": "8388608", > > > > "osd_recovery_forget_lost_objects": "false", > > > > "osd_max_scrubs": "1", > > > > "osd_scrub_load_threshold": "0.5", > > > > "osd_scrub_min_interval": "300", > > > > "osd_scrub_max_interval": "86400", > > > > "osd_deep_scrub_interval": "604800", > > > > "osd_deep_scrub_stride": "524288", > > > > "osd_auto_weight": "false", > > > > "osd_class_dir": "\/usr\/local\/lib\/rados-classes", > > > > "osd_check_for_log_corruption": "false", > > > > "osd_use_stale_snap": "false", > > > > "osd_rollback_to_cluster_snap": "", > > > > "osd_default_notify_timeout": "30", > > > > "osd_kill_backfill_at": "0", > > > > "osd_min_pg_log_entries": "1000", > > > > "osd_op_complaint_time": "30", > > > > "osd_command_max_records": "256", > > > > "osd_op_log_threshold": "5", > > > > "osd_verify_sparse_read_holes": "false", > > > > "osd_debug_drop_ping_probability": "0", > > > > "osd_debug_drop_ping_duration": "0", > > > > "osd_debug_drop_pg_create_probability": "0", > > > > "osd_debug_drop_pg_create_duration": "1", > > > > "osd_debug_drop_op_probability": "0", > > > > "osd_op_history_size": "20", > > > > "osd_op_history_duration": "600", > > > > "osd_target_transaction_size": "300", > > > > "osd_client_op_priority": "63", > > > > "osd_recovery_op_priority": "10", > > > > 
"filestore": "false", > > > > "filestore_index_retry_probability": "0", > > > > "filestore_debug_omap_check": "false", > > > > "filestore_xattr_use_omap": "false", > > > > "filestore_max_inline_xattr_size": "512", > > > > "filestore_max_inline_xattrs": "2", > > > > "filestore_max_sync_interval": "5", > > > > "filestore_min_sync_interval": "0.01", > > > > "filestore_btrfs_snap": "true", > > > > "filestore_btrfs_clone_range": "true", > > > > "filestore_fsync_flushes_journal_data": "false", > > > > "filestore_fiemap": "false", > > > > "filestore_flusher": "true", > > > > "filestore_flusher_max_fds": "512", > > > > "filestore_flush_min": "65536", > > > > "filestore_sync_flush": "false", > > > > "filestore_journal_parallel": "false", > > > > "filestore_journal_writeahead": "false", > > > > "filestore_journal_trailing": "false", > > > > "filestore_queue_max_ops": "500", > > > > "filestore_queue_max_bytes": "104857600", > > > > "filestore_queue_committing_max_ops": "500", > > > > "filestore_queue_committing_max_bytes": "104857600", > > > > "filestore_op_threads": "2", > > > > "filestore_op_thread_timeout": "60", > > > > "filestore_op_thread_suicide_timeout": "180", > > > > "filestore_commit_timeout": "600", > > > > "filestore_fiemap_threshold": "4096", > > > > "filestore_merge_threshold": "10", > > > > "filestore_split_multiple": "2", > > > > "filestore_update_to": "1000", > > > > "filestore_blackhole": "false", > > > > "filestore_dump_file": "", > > > > "filestore_kill_at": "0", > > > > "filestore_fail_eio": "true", > > > > "journal_dio": "true", > > > > "journal_aio": "false", > > > > "journal_block_align": "true", > > > > "journal_max_write_bytes": "10485760", > > > > "journal_max_write_entries": "100", > > > > "journal_queue_max_ops": "500", > > > > "journal_queue_max_bytes": "104857600", > > > > "journal_align_min_size": "65536", > > > > "journal_replay_from": "0", > > > > "journal_zero_on_create": "false", > > > > "rbd_cache": "false", > > > > "rbd_cache_size": 
"33554432", > > > > "rbd_cache_max_dirty": "25165824", > > > > "rbd_cache_target_dirty": "16777216", > > > > "rbd_cache_max_dirty_age": "1", > > > > "nss_db_path": "", > > > > "rgw_data": "\/var\/lib\/ceph\/radosgw\/ceph-24", > > > > "rgw_enable_apis": "s3, swift, swift_auth, admin", > > > > "rgw_cache_enabled": "true", > > > > "rgw_cache_lru_size": "10000", > > > > "rgw_socket_path": "", > > > > "rgw_dns_name": "", > > > > "rgw_swift_url": "", > > > > "rgw_swift_url_prefix": "swift", > > > > "rgw_swift_auth_url": "", > > > > "rgw_swift_auth_entry": "auth", > > > > "rgw_keystone_url": "", > > > > "rgw_keystone_admin_token": "", > > > > "rgw_keystone_accepted_roles": "Member, admin", > > > > "rgw_keystone_token_cache_size": "10000", > > > > "rgw_keystone_revocation_interval": "900", > > > > "rgw_admin_entry": "admin", > > > > "rgw_enforce_swift_acls": "true", > > > > "rgw_swift_token_expiration": "86400", > > > > "rgw_print_continue": "true", > > > > "rgw_remote_addr_param": "REMOTE_ADDR", > > > > "rgw_op_thread_timeout": "600", > > > > "rgw_op_thread_suicide_timeout": "0", > > > > "rgw_thread_pool_size": "100", > > > > "rgw_num_control_oids": "8", > > > > "rgw_cluster_root_pool": ".rgw.root", > > > > "rgw_log_nonexistent_bucket": "false", > > > > "rgw_log_object_name": "%Y-%m-%d-%H-%i-%n", > > > > "rgw_log_object_name_utc": "false", > > > > "rgw_usage_max_shards": "32", > > > > "rgw_usage_max_user_shards": "1", > > > > "rgw_enable_ops_log": "true", > > > > "rgw_enable_usage_log": "true", > > > > "rgw_ops_log_rados": "true", > > > > "rgw_ops_log_socket_path": "", > > > > "rgw_ops_log_data_backlog": "5242880", > > > > "rgw_usage_log_flush_threshold": "1024", > > > > "rgw_usage_log_tick_interval": "30", > > > > "rgw_intent_log_object_name": "%Y-%m-%d-%i-%n", > > > > "rgw_intent_log_object_name_utc": "false", > > > > "rgw_init_timeout": "30", > > > > "rgw_mime_types_file": "\/etc\/mime.types", > > > > "rgw_gc_max_objs": "32", > > > > "rgw_gc_obj_min_wait": "7200", > > 
> > > > "rgw_gc_processor_max_time": "3600",
> > > > "rgw_gc_processor_period": "3600",
> > > > "rgw_s3_success_create_obj_status": "0",
> > > > "rgw_resolve_cname": "false",
> > > > "rgw_obj_stripe_size": "4194304",
> > > > "rgw_extended_http_attrs": "",
> > > > "mutex_perf_counter": "false",
> > > > "internal_safe_to_start_threads": "true"}
> > > >
> > > >
> > > > -----Original Message-----
> > > > From: Mark Nelson [mailto:mark.nelson@inktank.com]
> > > > Sent: Thursday, 20 December 2012 11:28 PM
> > > > To: Matthew Anderson
> > > > Cc: ceph-devel@vger.kernel.org
> > > > Subject: Re: OSD's slow down to a crawl
> > > >
> > > > Hi Matt,
> > > >
> > > > Yeah, your log is looking pretty sparse! I imagine Sam will have a lot of questions for you. Out of curiosity have you looked at any io/cpu/memory stats when things slow down? sysprof may come in handy if it looks like it's CPU related. Also, check out the OSD admin socket.
> > > > Sebastien Han made a nice little blog entry describing it:
> > > >
> > > > http://www.sebastien-han.fr/blog/2012/08/14/ceph-admin-socket/
> > > >
> > > > You can get a lot of information about the current state of things.
> > > >
> > > > Mark
> > > >
> > >
> > > --
> > > To unsubscribe from this list: send the line "unsubscribe ceph-devel"
> > > in the body of a message to majordomo@vger.kernel.org More majordomo
> > > info at http://vger.kernel.org/majordomo-info.html
> > >
> > >
> > >
^ permalink raw reply [flat|nested] 13+ messages in thread
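Editorial aside on the admin-socket output discussed in this thread: the latency counters in `perf dump` are reported as `avgcount`/`sum` pairs rather than as averages. The sketch below divides one by the other to get a mean per counter. It assumes `sum` is in seconds (not stated in the thread); `mean_latencies` and `SAMPLE` are hypothetical names, and the sample values are copied from the osd.24 dump quoted in the thread, trimmed to a few counters.

```python
import json


def mean_latencies(perf, section):
    """Return {counter: sum / avgcount} for each avgcount/sum pair in one section.

    Counters that are plain integers are skipped; a counter with
    avgcount == 0 is reported as 0.0 instead of dividing by zero.
    """
    out = {}
    for name, value in perf.get(section, {}).items():
        if isinstance(value, dict) and "avgcount" in value and "sum" in value:
            count = value["avgcount"]
            out[name] = value["sum"] / count if count else 0.0
    return out


# Excerpt of the perf dump quoted in this thread (osd.24), trimmed for brevity.
SAMPLE = json.loads("""
{"filestore": {"journal_latency": {"avgcount": 20868, "sum": 185.268022000},
               "apply_latency": {"avgcount": 20868, "sum": 508.896737000},
               "journal_full": 0},
 "osd": {"op_w_latency": {"avgcount": 6808, "sum": 65.349684000},
         "op_rw_latency": {"avgcount": 0, "sum": 0.000000000}}}
""")

if __name__ == "__main__":
    for section in SAMPLE:
        for name, mean in sorted(mean_latencies(SAMPLE, section).items()):
            # Assumption: "sum" is in seconds, so multiply by 1000 for ms.
            print(f"{section}.{name}: {mean * 1000:.2f} ms")
```

Under that assumption the quoted counters work out to roughly 9 ms per journal write and 24 ms per apply, which fits the observation in the thread that the operations that did complete were fast.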
* RE: OSD's slow down to a crawl
2012-12-21 16:02 ` Sage Weil
@ 2013-01-09 8:52 ` Matthew Anderson
2013-01-09 14:09 ` Mark Nelson
0 siblings, 1 reply; 13+ messages in thread
From: Matthew Anderson @ 2013-01-09 8:52 UTC (permalink / raw)
To: 'Sage Weil'; +Cc: 'Mark Nelson', ceph-devel@vger.kernel.org
Hi Sage,

Sorry for the late follow-up, I've been on a bit of a testing rampage and managed to somewhat sort the problem.

Most of the problems appear to be from the 3.7.1 kernel. It seems to have a fairly big issue with its networking stack that was causing Ceph's network operations to hang. Moving back to a 3.6.8 kernel fixed this up. I don't know the full extent of the problem but it was reported on Phoronix briefly here - http://www.phoronix.com/scan.php?page=news_item&px=MTI2Nzc

The second issue was BTRFS on both the 3.7 and 3.6.8 kernels. After running a long rados bench (10 minutes) on a fresh cluster it would often slow down significantly by going from 250MB/s down to a 50MB/s average. Latency also increased dramatically. Restarting the OSD's fixes the issue but after a while it slows right down again.

In the end I re-formatted the cluster using XFS (and also EXT4 for benchmarks) and there wasn't a single issue. I had rados bench running for over 30 minutes from another machine and there wasn't a single issue.

At this stage I need to start moving into production with XFS. My test cluster arrives in a few weeks so I should be able to come back to the BTRFS issue later on as it would be very handy to have compression working.

Thanks again for your help
-Matt

-----Original Message-----
From: Sage Weil [mailto:sage@inktank.com]
Sent: Saturday, 22 December 2012 12:02 AM
To: Matthew Anderson
Cc: 'Mark Nelson'; ceph-devel@vger.kernel.org
Subject: RE: OSD's slow down to a crawl

On Fri, 21 Dec 2012, Matthew Anderson wrote:
> Hi Sage,
>
> I've tried to reproduce the error again with logging on every OSD and
> got the above.
> RADOS bench had stalled on a write request like the
> last time and the attached log is the grep'd OSD log (# cat osd.25.log
> | grep client.9501.0:744 > freeze.log). The OSD that stalled was 25,
> pg map is below -
>
> # ceph pg map 6.5d83495b
> osdmap e3775 pg 6.5d83495b (6.95b) -> up [25,31] acting [25,31]
>
> I hope that's what you were after, if not just let me know

We're getting closer. The osd tried to send the reply. Can you reproduce with 'debug ms = 20' on the osds too, and on the client side do something like

rados --debug-ms 20 --debug-objecter 20 --log-file /tmp/foo ...

Thanks!
sage

>
> Thanks again
> -Matt
>
>
> -----Original Message-----
> From: Sage Weil [mailto:sage@inktank.com]
> Sent: Friday, 21 December 2012 1:14 AM
> To: Matthew Anderson
> Cc: 'Mark Nelson'; ceph-devel@vger.kernel.org
> Subject: RE: OSD's slow down to a crawl
>
> On Thu, 20 Dec 2012, Matthew Anderson wrote:
> > Hi Sage,
> >
> > Logs are attached. I took the osd logs from osd.24 as this is the
> > first osd in my SSD pool I've been testing with previously.
> >
> > The 4MB bench I was able to reproduce the fault by restarting my rbd
> > export which stalled after a few percent complete. When I ran the
> > 4MB bench it stalled early on and never received a response back
> > from the OSD and I terminated it after 60 seconds or so. I wasn't
> > able to reproduce the fault using the 4kb io size. The 4kb log
> > should show rados bench completing normally at a respectable speed of about 1MB/s.
>
> Let's drill into the hang.. up until that point things look okay.
>
> 2012-12-21 00:51:26.033622 7f6f3c042760 1 -- 172.16.0.13:0/1023886
> --> 172.16.0.13:6813/22233 -- osd_op(client.9503.0:185
> benchmark_data_KVM04_23886_object184 [write 0~4194304] 6.3ca4346e) v4
> -- ?+0 0x171ea50 con 0x171a7e0
>
> Do you have a log for that OSD so we can see what happened there? It
> may also be that the replicated write is hung. If you do
>
> ceph pg map 6.3ca4346e
>
> you can see all OSDs storing that PG. And/or you can grep for
> client.9503.0:185 in 172.16.0.13:6813/22233's log and see whether the sub_op was sent.
>
> Thanks!
> sage
>
>
> >
> > Thanks
> > -Matt
> >
> > -----Original Message-----
> > From: ceph-devel-owner@vger.kernel.org
> > [mailto:ceph-devel-owner@vger.kernel.org] On Behalf Of Sage Weil
> > Sent: Friday, 21 December 2012 12:30 AM
> > To: Matthew Anderson
> > Cc: 'Mark Nelson'; ceph-devel@vger.kernel.org
> > Subject: RE: OSD's slow down to a crawl
> >
> > Can you do a similar test, but with full logging on?
> >
> > ceph tell osd.0 injectargs '--debug-ms 1 --debug-filestore 20 --debug-osd 20 --debug-journal 20'
> > rados -p ssd bench 30 write -b 4096 -t 1 --log-file /tmp/foo --debug-ms 1
> >
> > That will be a single IO in flight at a time and very easy to trace through the logs. If you can post the resulting log file (/tmp/foo and from osd.0), that would be awesome.
> >
> > Thanks!
> > sage
> >
> >
> >
> > On Thu, 20 Dec 2012, Matthew Anderson wrote:
> >
> > > # rados bench 60 write -t 256 -p ssd
> > > Maintaining 256 concurrent writes of 4194304 bytes for at least 60 seconds.
> > > Object prefix: benchmark_data_KVM03_12985
> > > sec  Cur ops  started  finished  avg MB/s  cur MB/s  last lat  avg lat
> > >   0        0        0         0         0         0         -        0
> > >   1       45       45         0         0         0         -        0
> > >   2       71       71         0         0         0         -        0
> > >   3       90       90         0         0         0         -        0
> > >   4      125      125         0         0         0         -        0
> > >   5      136      136         0         0         0         -        0
> > >   6      144      144         0         0         0         -        0
> > >   7      175      175         0         0         0         -        0
> > >   8      201      201         0         0         0         -        0
> > >   9      217      217         0         0         0         -        0
> > >  10      228      228         0         0         0         -        0
> > >  11      251      251         0         0         0         -        0
> > >  12      255      266        11   3.66581   3.66667   11.7027  11.6013
> > >  13      255      271        16   4.92194        20   12.9795  11.9979
> > >  14      255      291        36   10.2833        80   13.1795  12.6478
> > >  15      255      301        46   12.2639        40   13.9424  12.9123
> > >  16      255      303        48   11.9973         8   14.0277  12.9579
> > >  17      255      321        66    15.526        72   15.1408  13.4616
> > >  18      255      345        90   19.9956        96   15.5673  13.9937
> > >  19      255      349        94   19.7851        16    15.707  14.0635
> > > 2012-12-21 00:06:37.198982min lat: 11.4904 max lat: 16.4053 avg lat: 14.3121
> > > sec  Cur ops  started  finished  avg MB/s  cur MB/s  last lat  avg lat
> > >  20      255      361       106   21.1953        48   16.4053  14.3121
> > >  21      255      367       112   21.3285        24   17.4263  14.4795
> > >  22      255      370       115   20.9043        12   17.3579  14.5544
> > >  23      255      387       132   22.9513        68   18.3185  15.0867
> > >  24      255      394       139   23.1614        28   18.2868  15.2504
> > >  25      255      404       149   23.8346        40    18.741  15.5048
> > >  26      255      410       155   23.8407        24    19.032  15.6358
> > >  27      255      410       155   22.9577         0         -  15.6358
> > >  28      255      410       155   22.1378         0         -  15.6358
> > >  29      255      411       156   21.5124   1.33333   22.6927  15.6811
> > >  30      255      431       176   23.4614        80   22.6577  16.4822
> > >  31      255      447       192   24.7687        64   23.2345  17.0563
> > >  32      255      464       209   26.1192        68   23.3788  17.5859
> > >  33      255      485       230   27.8726        84   22.8924  18.1122
> > >  34      255      497       242   28.4642        48    23.563  18.3693
> > >  35      255      508       253   28.9078        44   23.0516  18.5935
> > >  36      255      514       259   28.7713        24   24.4415  18.7278
> > >  37      255      530       275    29.723        64    23.847  19.0424
> > >  38      255      541       286   30.0985        44   24.5757  19.2427
> > >  39      255      544       289   29.6344        12   24.6802  19.2984
> > > 2012-12-21 00:06:57.203473min lat: 11.4904 max lat: 24.6802 avg lat: 19.2984
> > > sec  Cur ops  started  finished  avg MB/s  cur MB/s  last lat  avg lat
> > >  40      255      544       289   28.8935         0         -  19.2984
> > >  41      255      544       289   28.1888         0         -  19.2984
> > >  42      255      544       289   27.5176         0         -  19.2984
> > >  43      255      544       289   26.8776         0         -  19.2984
> > >  44      255      544       289   26.2668         0         -  19.2984
> > >  45      255      544       289   25.6831         0         -  19.2984
> > >  46      255      544       289   25.1248         0         -  19.2984
> > >  47      255      544       289   24.5902         0         -  19.2984
> > >  48      255      544       289   24.0779         0         -  19.2984
> > >  49      255      544       289   23.5865         0         -  19.2984
> > >  50      255      544       289   23.1148         0         -  19.2984
> > >  51      255      544       289   22.6616         0         -  19.2984
> > >  52      255      544       289   22.2258         0         -  19.2984
> > >  53      255      544       289   21.8065         0         -  19.2984
> > >  54      255      544       289   21.4026         0         -  19.2984
> > >  55      255      544       289   21.0135         0         -  19.2984
> > >  56      255      544       289   20.6383         0         -  19.2984
> > >  57      255      544       289   20.2762         0         -  19.2984
> > >  58      255      544       289   19.9266         0         -  19.2984
> > >  59      255      544       289   19.5889         0         -  19.2984
> > > 2012-12-21 00:07:17.207782min lat: 11.4904 max lat: 24.6802 avg lat: 19.2984
> > > sec  Cur ops  started  finished  avg MB/s  cur MB/s  last lat  avg lat
> > >  60      255      544       289   19.2624         0         -  19.2984
> > >  61      255      544       289   18.9466         0         -  19.2984
> > >  62      255      544       289    18.641         0         -  19.2984
> > >  63      255      544       289   18.3452         0         -  19.2984
> > >  64      255      544       289   18.0585         0         -  19.2984
> > >  65      255      544       289   17.7807         0         -  19.2984
> > >  66      255      544       289   17.5113         0         -  19.2984
> > >  67      133      545       412   24.5916   17.5714   37.3574  27.4834
> > > Total time run: 68.000740
> > > Total writes made: 545
> > > Write size: 4194304
> > > Bandwidth (MB/sec): 32.058
> > >
> > > Stddev Bandwidth: 26.8837
> > > Max bandwidth (MB/sec): 96
> > > Min bandwidth (MB/sec): 0
> > > Average Latency: 29.2801
> > > Stddev Latency: 11.9768
> > > Max latency: 52.6277
> > > Min latency: 11.4904
> > >
> > >
> > >
> > >
> > >
> > > -----Original Message-----
> > > From: Mark Nelson [mailto:mark.nelson@inktank.com]
> > > Sent: Thursday, 20 December 2012 11:59 PM
> > > To: Matthew Anderson
> > > Cc: ceph-devel@vger.kernel.org
> > > Subject: Re: OSD's slow down to a crawl
> > >
> > > Out of curiosity, if you fire up a rados bench instance on one of the nodes with say, 256 concurrent writes, do any of the writes complete?
> > >
> > > Mark
> > >
> > > On 12/20/2012 09:51 AM, Matthew Anderson wrote:
> > > > Hi Mark,
> > > >
> > > > Thanks for the quick reply. The log was 380MB in total so I just posted the last 200 lines hoping that a trained eye might be able to spot something. I'm happy to put the file up on a web server if anyone wants to review it in full.
> > > >
> > > > It doesn't appear to be memory or CPU related. In its hung state (both read and write are not responding at the moment, I have a VM hung trying to shutdown and an rbd stuck exporting) all of the OSD's sit around 0.3% CPU usage, ~300MB of RAM and no IO wait. The server has roughly 40GB RAM free and is easily 90% idle. Iotop shows no IO coming from any of the Ceph-OSD's.
> > > >
> > > > The admin socket showed no ops in flight but the performance dump, historic ops and config are below. I did notice that the age of some of the ops was quite high considering things were actively trying to write to the cluster.
> > > > # ceph --admin-daemon ./ceph-osd.24.asok perf dump
> > > >
> > > > {"filestore":{"journal_queue_max_ops":500,"journal_queue_ops":0,"journal_ops":20868,"journal_queue_max_bytes":104857600,"journal_queue_bytes":0,"journal_bytes":1710397238,"journal_latency":{"avgcount":20868,"sum":185.268022000},"journal_wr":13354,"journal_wr_bytes":{"avgcount":13354,"sum":1770647552},"op_queue_max_ops":500,"op_queue_ops":0,"ops":20868,"op_queue_max_bytes":104857600,"op_queue_bytes":0,"bytes":1710232583,"apply_latency":{"avgcount":20868,"sum":508.896737000},"committing":0,"commitcycle":339,"commitcycle_interval":{"avgcount":339,"sum":1708.284725000},"commitcycle_latency":{"avgcount":339,"sum":13.241850000},"journal_full":0},"mutex-FileJournal::completions_lock":{"wait":{"avgcount":0,"sum":0.000000000}},"mutex-FileJournal::finisher_lock":{"wait":{"avgcount":0,"sum":0.000000000}},"mutex-FileJournal::write_lock":{"wait":{"avgcount":0,"sum":0.000000000}},"mutex-FileJournal::writeq_lock":{"wait":{"avgcount":0,"sum":0.000000000}},"mutex-JOS::ApplyManager::apply_lock":{"wait":{"avgcount":0,"sum":0.000000000}},"mutex-JOS::ApplyManager::com_lock":{"wait":{"avgcount":0,"sum":0.000000000}},"mutex-JOS::SubmitManager::lock":{"wait":{"avgcount":0,"sum":0.000000000}},"osd":{"opq":0,"op_wip":0,"op":31753,"op_in_bytes":797806224,"op_out_bytes":1728414048,"op_latency":{"avgcount":31753,"sum":82.183373000},"op_r":24945,"op_r_out_bytes":1728414048,"op_r_latency":{"avgcount":24945,"sum":16.833689000},"op_w":6808,"op_w_in_bytes":797806224,"op_w_rlat":{"avgcount":6808,"sum":38.223006000},"op_w_latency":{"avgcount":6808,"sum":65.349684000},"op_rw":0,"op_rw_in_bytes":0,"op_rw_out_bytes":0,"op_rw_rlat":{"avgcount":0,"sum":0.000000000},"op_rw_latency":{"avgcount":0,"sum":0.000000000},"subop":5700,"subop_in_bytes":813224156,"subop_latency":{"avgcount":5700,"sum":26.877085000},"subop_w":0,"subop_w_in_bytes":813224156,"subop_w_latency":{"avgcount":5700,"sum":26.877085000},"subop_pull":0,"subop_pull_latency":{"avgcount":0,"sum":0.000000000},"subop_push":0,"subop_push_in_bytes":0,"subop_push_latency":{"avgcount":0,"sum":0.000000000},"pull":0,"push":0,"push_out_bytes":0,"push_in":0,"push_in_bytes":0,"recovery_ops":0,"loadavg":359,"buffer_bytes":0,"numpg":2486,"numpg_primary":1214,"numpg_replica":1272,"numpg_stray":0,"heartbeat_to_peers":4,"heartbeat_from_peers":0,"map_messages":44,"map_message_epochs":184,"map_message_epoch_dups":174},"throttle-filestore_bytes":{"val":0,"max":104857600,"get":0,"get_sum":0,"get_or_fail_fail":0,"get_or_fail_success":0,"take":20868,"take_sum":1710397238,"put":13354,"put_sum":1710397238,"wait":{"avgcount":0,"sum":0.000000000}},"throttle-filestore_ops":{"val":0,"max":500,"get":0,"get_sum":0,"get_or_fail_fail":0,"get_or_fail_success":0,"take":20868,"take_sum":20868,"put":13354,"put_sum":20868,"wait":{"avgcount":0,"sum":0.000000000}},"throttle-msgr_dispatch_throttler-client":{"val":0,"max":104857600,"get":31657,"get_sum":795546573,"get_or_fail_fail":0,"get_or_fail_success":0,"take":0,"take_sum":0,"put":31657,"put_sum":795546573,"wait":{"avgcount":0,"sum":0.000000000}},"throttle-msgr_dispatch_throttler-cluster":{"val":0,"max":104857600,"get":18913,"get_sum":828011293,"get_or_fail_fail":0,"get_or_fail_success":0,"take":0,"take_sum":0,"put":18913,"put_sum":828011293,"wait":{"avgcount":0,"sum":0.000000000}},"throttle-msgr_dispatch_throttler-hbclient":{"val":0,"max":104857600,"get":4260,"get_sum":200220,"get_or_fail_fail":0,"get_or_fail_success":0,"take":0,"take_sum":0,"put":4260,"put_sum":200220,"wait":{"avgcount":0,"sum":0.000000000}},"throttle-msgr_dispatch_throttler-hbserver":{"val":0,"max":104857600,"get":4236,"get_sum":199092,"get_or_fail_fail":0,"get_or_fail_success":0,"take":0,"take_sum":0,"put":4236,"put_sum":199092,"wait":{"avgcount":0,"sum":0.000000000}},"throttle-osd_client_bytes":{"val":0,"max":524288000,"get":31308,"get_sum":795353720,"get_or_fail_fail":0,"get_or_fail_success":0,"take":0,"take_sum":0,"put":37671,"put_sum":795353720,"wait":{"avgcount":0,"sum":0.000000000}}}
> > > >
> > > > # ceph --admin-daemon ./ceph-osd.24.asok dump_historic_ops
> > > >
> > > > { "description": "osd_op(client.7636.0:101498 rb.0.122c.6b8b4567.000000000357 [write 3600384~4096] 6.42463fe3)",
> > > > "received_at": "2012-12-20 23:34:42.012131",
> > > > "age": "75.428051",
> > > > "duration": "0.009988",
> > > > "flag_point": "waiting for sub ops",
> > > > "client_info": { "client": "client.7636",
> > > > "tid": 101498},
> > > > "events": [
> > > > { "time": "2012-12-20 23:34:42.012317",
> > > > "event": "waiting_for_osdmap"},
> > > > { "time": "2012-12-20 23:34:42.012425",
> > > > "event": "reached_pg"},
> > > > { "time": "2012-12-20 23:34:42.012678",
> > > > "event": "started"},
> > > > { "time": "2012-12-20 23:34:42.012720",
> > > > "event": "started"},
> > > > { "time": "2012-12-20
23:34:42.012815", > > > > "event": "sub_op_sent"}, > > > > { "time": "2012-12-20 23:34:42.012958", > > > > "event": "commit_queued_for_journal_write"}, > > > > { "time": "2012-12-20 23:34:42.013033", > > > > "event": "write_thread_in_journal_buffer"}, > > > > { "time": "2012-12-20 23:34:42.013745", > > > > "event": "journaled_completion_queued"}, > > > > { "time": "2012-12-20 23:34:42.013803", > > > > "event": "op_commit"}, > > > > { "time": "2012-12-20 23:34:42.017414", > > > > "event": "sub_op_commit_rec"}, > > > > { "time": "2012-12-20 23:34:42.022066", > > > > "event": "op_applied"}, > > > > { "time": "2012-12-20 23:34:42.022119", > > > > "event": "done"}]}, > > > > { "description": "osd_sub_op_reply(client.7636.0:101498 6.1fe3 42463fe3\/rb.0.122c.6b8b4567.000000000357\/head\/\/6 [] ondisk, result = 0)", > > > > "received_at": "2012-12-20 23:34:42.017048", > > > > "age": "75.423134", > > > > "duration": "0.000411", > > > > "flag_point": "started", > > > > "events": [ > > > > { "time": "2012-12-20 23:34:42.017218", > > > > "event": "waiting_for_osdmap"}, > > > > { "time": "2012-12-20 23:34:42.017363", > > > > "event": "reached_pg"}, > > > > { "time": "2012-12-20 23:34:42.017401", > > > > "event": "started"}, > > > > { "time": "2012-12-20 23:34:42.017459", > > > > "event": "done"}]}, > > > > { "description": "osd_op(client.7636.0:101502 rb.0.122c.6b8b4567.000000000064 [write 1512448~61440] 6.a241e34d)", > > > > "received_at": "2012-12-20 23:34:49.851021", > > > > "age": "67.589161", > > > > "duration": "0.007114", > > > > "flag_point": "waiting for sub ops", > > > > "client_info": { "client": "client.7636", > > > > "tid": 101502}, > > > > "events": [ > > > > { "time": "2012-12-20 23:34:49.851276", > > > > "event": "waiting_for_osdmap"}, > > > > { "time": "2012-12-20 23:34:49.851429", > > > > "event": "reached_pg"}, > > > > { "time": "2012-12-20 23:34:49.851616", > > > > "event": "started"}, > > > > { "time": "2012-12-20 23:34:49.851648", > > > > "event": 
"started"}, > > > > { "time": "2012-12-20 23:34:49.851712", > > > > "event": "sub_op_sent"}, > > > > { "time": "2012-12-20 23:34:49.851770", > > > > "event": "commit_queued_for_journal_write"}, > > > > { "time": "2012-12-20 23:34:49.851915", > > > > "event": "write_thread_in_journal_buffer"}, > > > > { "time": "2012-12-20 23:34:49.853956", > > > > "event": "journaled_completion_queued"}, > > > > { "time": "2012-12-20 23:34:49.853992", > > > > "event": "op_commit"}, > > > > { "time": "2012-12-20 23:34:49.857016", > > > > "event": "sub_op_commit_rec"}, > > > > { "time": "2012-12-20 23:34:49.858077", > > > > "event": "op_applied"}, > > > > { "time": "2012-12-20 23:34:49.858135", > > > > "event": "done"}]}, > > > > { "description": "osd_sub_op_reply(client.7636.0:101502 6.234d a241e34d\/rb.0.122c.6b8b4567.000000000064\/head\/\/6 [] ondisk, result = 0)", > > > > "received_at": "2012-12-20 23:34:49.856762", > > > > "age": "67.583420", > > > > "duration": "0.000308", > > > > "flag_point": "started", > > > > "events": [ > > > > { "time": "2012-12-20 23:34:49.856869", > > > > "event": "waiting_for_osdmap"}, > > > > { "time": "2012-12-20 23:34:49.856969", > > > > "event": "reached_pg"}, > > > > { "time": "2012-12-20 23:34:49.857001", > > > > "event": "started"}, > > > > { "time": "2012-12-20 23:34:49.857070", > > > > "event": "done"}]}, > > > > { "description": "osd_op(client.7636.0:101504 rb.0.122c.6b8b4567.00000000012c [write 1658880~2048] 6.92d2c38c)", > > > > "received_at": "2012-12-20 23:35:34.351518", > > > > "age": "23.088664", > > > > "duration": "0.009430", > > > > "flag_point": "waiting for sub ops", > > > > "client_info": { "client": "client.7636", > > > > "tid": 101504}, > > > > "events": [ > > > > { "time": "2012-12-20 23:35:34.351714", > > > > "event": "waiting_for_osdmap"}, > > > > { "time": "2012-12-20 23:35:34.351811", > > > > "event": "reached_pg"}, > > > > { "time": "2012-12-20 23:35:34.352030", > > > > "event": "started"}, > > > > { "time": "2012-12-20 
23:35:34.352071", > > > > "event": "started"}, > > > > { "time": "2012-12-20 23:35:34.352146", > > > > "event": "sub_op_sent"}, > > > > { "time": "2012-12-20 23:35:34.352213", > > > > "event": "commit_queued_for_journal_write"}, > > > > { "time": "2012-12-20 23:35:34.352287", > > > > "event": "write_thread_in_journal_buffer"}, > > > > { "time": "2012-12-20 23:35:34.355390", > > > > "event": "journaled_completion_queued"}, > > > > { "time": "2012-12-20 23:35:34.355475", > > > > "event": "op_commit"}, > > > > { "time": "2012-12-20 23:35:34.356661", > > > > "event": "sub_op_commit_rec"}, > > > > { "time": "2012-12-20 23:35:34.360894", > > > > "event": "op_applied"}, > > > > { "time": "2012-12-20 23:35:34.360948", > > > > "event": "done"}]}, > > > > { "description": "osd_sub_op_reply(client.7636.0:101504 6.38c 92d2c38c\/rb.0.122c.6b8b4567.00000000012c\/head\/\/6 [] ondisk, result = 0)", > > > > "received_at": "2012-12-20 23:35:34.356286", > > > > "age": "23.083896", > > > > "duration": "0.000421", > > > > "flag_point": "started", > > > > "events": [ > > > > { "time": "2012-12-20 23:35:34.356489", > > > > "event": "waiting_for_osdmap"}, > > > > { "time": "2012-12-20 23:35:34.356598", > > > > "event": "reached_pg"}, > > > > { "time": "2012-12-20 23:35:34.356640", > > > > "event": "started"}, > > > > { "time": "2012-12-20 23:35:34.356707", > > > > "event": "done"}]}, > > > > { "description": "osd_op(client.7636.0:101512 rb.0.122c.6b8b4567.000000000357 [write 3731456~4096] 6.42463fe3)", > > > > "received_at": "2012-12-20 23:35:42.133103", > > > > "age": "15.307079", > > > > "duration": "0.008357", > > > > "flag_point": "waiting for sub ops", > > > > "client_info": { "client": "client.7636", > > > > "tid": 101512}, > > > > "events": [ > > > > { "time": "2012-12-20 23:35:42.133315", > > > > "event": "waiting_for_osdmap"}, > > > > { "time": "2012-12-20 23:35:42.133427", > > > > "event": "reached_pg"}, > > > > { "time": "2012-12-20 23:35:42.133684", > > > > "event": 
"started"}, > > > > { "time": "2012-12-20 23:35:42.133723", > > > > "event": "started"}, > > > > { "time": "2012-12-20 23:35:42.133796", > > > > "event": "sub_op_sent"}, > > > > { "time": "2012-12-20 23:35:42.133908", > > > > "event": "commit_queued_for_journal_write"}, > > > > { "time": "2012-12-20 23:35:42.133995", > > > > "event": "write_thread_in_journal_buffer"}, > > > > { "time": "2012-12-20 23:35:42.136965", > > > > "event": "journaled_completion_queued"}, > > > > { "time": "2012-12-20 23:35:42.137025", > > > > "event": "op_commit"}, > > > > { "time": "2012-12-20 23:35:42.138456", > > > > "event": "sub_op_commit_rec"}, > > > > { "time": "2012-12-20 23:35:42.141408", > > > > "event": "op_applied"}, > > > > { "time": "2012-12-20 23:35:42.141460", > > > > "event": "done"}]}, > > > > { "description": "osd_sub_op_reply(client.7636.0:101512 6.1fe3 42463fe3\/rb.0.122c.6b8b4567.000000000357\/head\/\/6 [] ondisk, result = 0)", > > > > "received_at": "2012-12-20 23:35:42.138087", > > > > "age": "15.302095", > > > > "duration": "0.000410", > > > > "flag_point": "started", > > > > "events": [ > > > > { "time": "2012-12-20 23:35:42.138248", > > > > "event": "waiting_for_osdmap"}, > > > > { "time": "2012-12-20 23:35:42.138363", > > > > "event": "reached_pg"}, > > > > { "time": "2012-12-20 23:35:42.138443", > > > > "event": "started"}, > > > > { "time": "2012-12-20 23:35:42.138497", > > > > "event": "done"}]}]} > > > > > > > > # ceph --admin-daemon ./ceph-osd.24.asok config show { "name": > > > > "osd.24", > > > > "cluster": "ceph", > > > > "none": "0\/5", > > > > "lockdep": "0\/1", > > > > "context": "0\/1", > > > > "crush": "1\/1", > > > > "mds": "1\/5", > > > > "mds_balancer": "1\/5", > > > > "mds_locker": "1\/5", > > > > "mds_log": "1\/5", > > > > "mds_log_expire": "1\/5", > > > > "mds_migrator": "1\/5", > > > > "buffer": "0\/1", > > > > "timer": "0\/1", > > > > "filer": "0\/1", > > > > "striper": "0\/1", > > > > "objecter": "0\/1", > > > > "rados": "0\/5", > > > > 
"rbd": "0\/5", > > > > "journaler": "0\/5", > > > > "objectcacher": "0\/5", > > > > "client": "0\/5", > > > > "osd": "0\/5", > > > > "optracker": "0\/5", > > > > "objclass": "0\/5", > > > > "filestore": "1\/3", > > > > "journal": "1\/3", > > > > "ms": "0\/5", > > > > "mon": "1\/5", > > > > "monc": "0\/10", > > > > "paxos": "0\/5", > > > > "tp": "0\/5", > > > > "auth": "1\/5", > > > > "crypto": "1\/5", > > > > "finisher": "1\/1", > > > > "heartbeatmap": "1\/5", > > > > "perfcounter": "1\/5", > > > > "rgw": "1\/5", > > > > "hadoop": "1\/5", > > > > "javaclient": "1\/5", > > > > "asok": "1\/5", > > > > "throttle": "1\/1", > > > > "host": "KVM04", > > > > "fsid": "00000000-0000-0000-0000-000000000000", > > > > "public_addr": ":\/0", > > > > "cluster_addr": ":\/0", > > > > "public_network": "", > > > > "cluster_network": "", > > > > "num_client": "1", > > > > "monmap": "", > > > > "mon_host": "", > > > > "lockdep": "false", > > > > "admin_socket": "\/var\/run\/ceph\/ceph-osd.24.asok", > > > > "daemonize": "true", > > > > "pid_file": "\/var\/run\/ceph\/osd.24.pid", > > > > "chdir": "\/", > > > > "max_open_files": "0", > > > > "fatal_signal_handlers": "true", > > > > "log_file": "\/var\/log\/ceph\/osd.24.log", > > > > "log_max_new": "1000", > > > > "log_max_recent": "100000", > > > > "log_to_stderr": "false", > > > > "err_to_stderr": "true", > > > > "log_to_syslog": "false", > > > > "err_to_syslog": "false", > > > > "log_flush_on_exit": "true", > > > > "clog_to_monitors": "true", > > > > "clog_to_syslog": "false", > > > > "mon_cluster_log_to_syslog": "false", > > > > "mon_cluster_log_file": "\/var\/log\/ceph\/ceph.log", > > > > "key": "", > > > > "keyfile": "", > > > > "keyring": "\/srv\/ceph\/osd\/osd.24\/keyring", > > > > "heartbeat_interval": "5", > > > > "heartbeat_file": "", > > > > "perf": "true", > > > > "ms_tcp_nodelay": "true", > > > > "ms_initial_backoff": "0.2", > > > > "ms_max_backoff": "15", > > > > "ms_nocrc": "false", > > > > "ms_die_on_bad_msg": "false", > 
> > > "ms_dispatch_throttle_bytes": "104857600", > > > > "ms_bind_ipv6": "false", > > > > "ms_bind_port_min": "6800", > > > > "ms_bind_port_max": "7100", > > > > "ms_rwthread_stack_bytes": "1048576", > > > > "ms_tcp_read_timeout": "900", > > > > "ms_inject_socket_failures": "0", > > > > "ms_inject_delay_type": "", > > > > "ms_inject_delay_max": "1", > > > > "ms_inject_delay_probability": "0", > > > > "mon_data": "\/var\/lib\/ceph\/mon\/ceph-24", > > > > "mon_initial_members": "", > > > > "mon_sync_fs_threshold": "5", > > > > "mon_tick_interval": "5", > > > > "mon_subscribe_interval": "300", > > > > "mon_osd_laggy_halflife": "3600", > > > > "mon_osd_laggy_weight": "0.3", > > > > "mon_osd_adjust_heartbeat_grace": "true", > > > > "mon_osd_adjust_down_out_interval": "true", > > > > "mon_osd_auto_mark_in": "false", > > > > "mon_osd_auto_mark_auto_out_in": "true", > > > > "mon_osd_auto_mark_new_in": "true", > > > > "mon_osd_down_out_interval": "300", > > > > "mon_osd_min_up_ratio": "0.3", > > > > "mon_osd_min_in_ratio": "0.3", > > > > "mon_lease": "5", > > > > "mon_lease_renew_interval": "3", > > > > "mon_lease_ack_timeout": "10", > > > > "mon_clock_drift_allowed": "0.05", > > > > "mon_clock_drift_warn_backoff": "5", > > > > "mon_accept_timeout": "10", > > > > "mon_pg_create_interval": "30", > > > > "mon_pg_stuck_threshold": "300", > > > > "mon_osd_full_ratio": "0.95", > > > > "mon_osd_nearfull_ratio": "0.85", > > > > "mon_globalid_prealloc": "100", > > > > "mon_osd_report_timeout": "900", > > > > "mon_force_standby_active": "true", > > > > "mon_min_osdmap_epochs": "500", > > > > "mon_max_pgmap_epochs": "500", > > > > "mon_max_log_epochs": "500", > > > > "mon_max_osd": "10000", > > > > "mon_probe_timeout": "2", > > > > "mon_slurp_timeout": "10", > > > > "mon_slurp_bytes": "262144", > > > > "mon_client_bytes": "104857600", > > > > "mon_daemon_bytes": "419430400", > > > > "mon_max_log_entries_per_event": "4096", > > > > "paxos_max_join_drift": "10", > > > > 
"paxos_propose_interval": "1", > > > > "paxos_min_wait": "0.05", > > > > "clock_offset": "0", > > > > "auth_cluster_required": "none", > > > > "auth_service_required": "none", > > > > "auth_client_required": "none", > > > > "auth_supported": "none", > > > > "cephx_require_signatures": "false", > > > > "cephx_sign_messages": "true", > > > > "auth_mon_ticket_ttl": "43200", > > > > "auth_service_ticket_ttl": "3600", > > > > "mon_client_hunt_interval": "3", > > > > "mon_client_ping_interval": "10", > > > > "mon_client_max_log_entries_per_message": "1000", > > > > "mon_max_pool_pg_num": "65536", > > > > "client_cache_size": "16384", > > > > "client_cache_mid": "0.75", > > > > "client_use_random_mds": "false", > > > > "client_mount_timeout": "30", > > > > "client_tick_interval": "1", > > > > "client_trace": "", > > > > "client_readahead_min": "131072", > > > > "client_readahead_max_bytes": "0", > > > > "client_readahead_max_periods": "4", > > > > "client_snapdir": ".snap", > > > > "client_mountpoint": "\/", > > > > "client_notify_timeout": "10", > > > > "client_caps_release_delay": "5", > > > > "client_oc": "true", > > > > "client_oc_size": "209715200", > > > > "client_oc_max_dirty": "104857600", > > > > "client_oc_target_dirty": "8388608", > > > > "client_oc_max_dirty_age": "5", > > > > "client_oc_max_objects": "1000", > > > > "client_debug_force_sync_read": "false", > > > > "client_debug_inject_tick_delay": "0", > > > > "fuse_use_invalidate_cb": "false", > > > > "fuse_big_writes": "true", > > > > "fuse_debug": "false", > > > > "objecter_tick_interval": "5", > > > > "objecter_timeout": "10", > > > > "objecter_inflight_op_bytes": "104857600", > > > > "objecter_inflight_ops": "1024", > > > > "journaler_allow_split_entries": "true", > > > > "journaler_write_head_interval": "15", > > > > "journaler_prefetch_periods": "10", > > > > "journaler_prezero_periods": "5", > > > > "journaler_batch_interval": "0.001", > > > > "journaler_batch_max": "0", > > > > "mds_data": 
"\/var\/lib\/ceph\/mds\/ceph-24", > > > > "mds_max_file_size": "1099511627776", > > > > "mds_cache_size": "100000", > > > > "mds_cache_mid": "0.7", > > > > "mds_mem_max": "1048576", > > > > "mds_dir_commit_ratio": "0.5", > > > > "mds_dir_max_commit_size": "90", > > > > "mds_decay_halflife": "5", > > > > "mds_beacon_interval": "4", > > > > "mds_beacon_grace": "15", > > > > "mds_blacklist_interval": "1440", > > > > "mds_session_timeout": "60", > > > > "mds_session_autoclose": "300", > > > > "mds_reconnect_timeout": "45", > > > > "mds_tick_interval": "5", > > > > "mds_dirstat_min_interval": "1", > > > > "mds_scatter_nudge_interval": "5", > > > > "mds_client_prealloc_inos": "1000", > > > > "mds_early_reply": "true", > > > > "mds_use_tmap": "true", > > > > "mds_default_dir_hash": "2", > > > > "mds_log": "true", > > > > "mds_log_skip_corrupt_events": "false", > > > > "mds_log_max_events": "-1", > > > > "mds_log_max_segments": "30", > > > > "mds_log_max_expiring": "20", > > > > "mds_bal_sample_interval": "3", > > > > "mds_bal_replicate_threshold": "8000", > > > > "mds_bal_unreplicate_threshold": "0", > > > > "mds_bal_frag": "false", > > > > "mds_bal_split_size": "10000", > > > > "mds_bal_split_rd": "25000", > > > > "mds_bal_split_wr": "10000", > > > > "mds_bal_split_bits": "3", > > > > "mds_bal_merge_size": "50", > > > > "mds_bal_merge_rd": "1000", > > > > "mds_bal_merge_wr": "1000", > > > > "mds_bal_interval": "10", > > > > "mds_bal_fragment_interval": "5", > > > > "mds_bal_idle_threshold": "0", > > > > "mds_bal_max": "-1", > > > > "mds_bal_max_until": "-1", > > > > "mds_bal_mode": "0", > > > > "mds_bal_min_rebalance": "0.1", > > > > "mds_bal_min_start": "0.2", > > > > "mds_bal_need_min": "0.8", > > > > "mds_bal_need_max": "1.2", > > > > "mds_bal_midchunk": "0.3", > > > > "mds_bal_minchunk": "0.001", > > > > "mds_bal_target_removal_min": "5", > > > > "mds_bal_target_removal_max": "10", > > > > "mds_replay_interval": "1", > > > > "mds_shutdown_check": "0", > > > > 
"mds_thrash_exports": "0", > > > > "mds_thrash_fragments": "0", > > > > "mds_dump_cache_on_map": "false", > > > > "mds_dump_cache_after_rejoin": "false", > > > > "mds_verify_scatter": "false", > > > > "mds_debug_scatterstat": "false", > > > > "mds_debug_frag": "false", > > > > "mds_debug_auth_pins": "false", > > > > "mds_debug_subtrees": "false", > > > > "mds_kill_mdstable_at": "0", > > > > "mds_kill_export_at": "0", > > > > "mds_kill_import_at": "0", > > > > "mds_kill_link_at": "0", > > > > "mds_kill_rename_at": "0", > > > > "mds_wipe_sessions": "false", > > > > "mds_wipe_ino_prealloc": "false", > > > > "mds_skip_ino": "0", > > > > "max_mds": "1", > > > > "mds_standby_for_name": "", > > > > "mds_standby_for_rank": "-1", > > > > "mds_standby_replay": "false", > > > > "osd_auto_upgrade_tmap": "true", > > > > "osd_tmapput_sets_uses_tmap": "false", > > > > "osd_max_backfills": "10", > > > > "osd_backfill_full_ratio": "0.85", > > > > "osd_backfill_retry_interval": "10", > > > > "osd_uuid": "00000000-0000-0000-0000-000000000000", > > > > "osd_data": "\/srv\/ceph\/osd\/osd.24", > > > > "osd_journal": "\/dev\/sda3", > > > > "osd_journal_size": "5120", > > > > "osd_max_write_size": "90", > > > > "osd_max_pgls": "1024", > > > > "osd_client_message_size_cap": "524288000", > > > > "osd_pg_bits": "6", > > > > "osd_pgp_bits": "6", > > > > "osd_min_rep": "1", > > > > "osd_max_rep": "10", > > > > "osd_pool_default_crush_rule": "0", > > > > "osd_pool_default_size": "2", > > > > "osd_pool_default_min_size": "0", > > > > "osd_pool_default_pg_num": "8", > > > > "osd_pool_default_pgp_num": "8", > > > > "osd_map_dedup": "true", > > > > "osd_map_cache_size": "500", > > > > "osd_map_message_max": "100", > > > > "osd_op_threads": "2", > > > > "osd_disk_threads": "1", > > > > "osd_recovery_threads": "1", > > > > "osd_recover_clone_overlap": "true", > > > > "osd_backfill_scan_min": "64", > > > > "osd_backfill_scan_max": "512", > > > > "osd_op_thread_timeout": "30", > > > > 
"osd_recovery_thread_timeout": "30", > > > > "osd_snap_trim_thread_timeout": "3600", > > > > "osd_scrub_thread_timeout": "60", > > > > "osd_scrub_finalize_thread_timeout": "600", > > > > "osd_remove_thread_timeout": "3600", > > > > "osd_command_thread_timeout": "600", > > > > "osd_age": "0.8", > > > > "osd_age_time": "0", > > > > "osd_heartbeat_addr": ":\/0", > > > > "osd_heartbeat_interval": "6", > > > > "osd_heartbeat_grace": "20", > > > > "osd_mon_heartbeat_interval": "30", > > > > "osd_mon_report_interval_max": "120", > > > > "osd_mon_report_interval_min": "5", > > > > "osd_mon_ack_timeout": "30", > > > > "osd_min_down_reporters": "1", > > > > "osd_min_down_reports": "3", > > > > "osd_default_data_pool_replay_window": "45", > > > > "osd_preserve_trimmed_log": "false", > > > > "osd_auto_mark_unfound_lost": "false", > > > > "osd_recovery_delay_start": "0", > > > > "osd_recovery_max_active": "50", > > > > "osd_recovery_max_chunk": "8388608", > > > > "osd_recovery_forget_lost_objects": "false", > > > > "osd_max_scrubs": "1", > > > > "osd_scrub_load_threshold": "0.5", > > > > "osd_scrub_min_interval": "300", > > > > "osd_scrub_max_interval": "86400", > > > > "osd_deep_scrub_interval": "604800", > > > > "osd_deep_scrub_stride": "524288", > > > > "osd_auto_weight": "false", > > > > "osd_class_dir": "\/usr\/local\/lib\/rados-classes", > > > > "osd_check_for_log_corruption": "false", > > > > "osd_use_stale_snap": "false", > > > > "osd_rollback_to_cluster_snap": "", > > > > "osd_default_notify_timeout": "30", > > > > "osd_kill_backfill_at": "0", > > > > "osd_min_pg_log_entries": "1000", > > > > "osd_op_complaint_time": "30", > > > > "osd_command_max_records": "256", > > > > "osd_op_log_threshold": "5", > > > > "osd_verify_sparse_read_holes": "false", > > > > "osd_debug_drop_ping_probability": "0", > > > > "osd_debug_drop_ping_duration": "0", > > > > "osd_debug_drop_pg_create_probability": "0", > > > > "osd_debug_drop_pg_create_duration": "1", > > > > 
"osd_debug_drop_op_probability": "0", > > > > "osd_op_history_size": "20", > > > > "osd_op_history_duration": "600", > > > > "osd_target_transaction_size": "300", > > > > "osd_client_op_priority": "63", > > > > "osd_recovery_op_priority": "10", > > > > "filestore": "false", > > > > "filestore_index_retry_probability": "0", > > > > "filestore_debug_omap_check": "false", > > > > "filestore_xattr_use_omap": "false", > > > > "filestore_max_inline_xattr_size": "512", > > > > "filestore_max_inline_xattrs": "2", > > > > "filestore_max_sync_interval": "5", > > > > "filestore_min_sync_interval": "0.01", > > > > "filestore_btrfs_snap": "true", > > > > "filestore_btrfs_clone_range": "true", > > > > "filestore_fsync_flushes_journal_data": "false", > > > > "filestore_fiemap": "false", > > > > "filestore_flusher": "true", > > > > "filestore_flusher_max_fds": "512", > > > > "filestore_flush_min": "65536", > > > > "filestore_sync_flush": "false", > > > > "filestore_journal_parallel": "false", > > > > "filestore_journal_writeahead": "false", > > > > "filestore_journal_trailing": "false", > > > > "filestore_queue_max_ops": "500", > > > > "filestore_queue_max_bytes": "104857600", > > > > "filestore_queue_committing_max_ops": "500", > > > > "filestore_queue_committing_max_bytes": "104857600", > > > > "filestore_op_threads": "2", > > > > "filestore_op_thread_timeout": "60", > > > > "filestore_op_thread_suicide_timeout": "180", > > > > "filestore_commit_timeout": "600", > > > > "filestore_fiemap_threshold": "4096", > > > > "filestore_merge_threshold": "10", > > > > "filestore_split_multiple": "2", > > > > "filestore_update_to": "1000", > > > > "filestore_blackhole": "false", > > > > "filestore_dump_file": "", > > > > "filestore_kill_at": "0", > > > > "filestore_fail_eio": "true", > > > > "journal_dio": "true", > > > > "journal_aio": "false", > > > > "journal_block_align": "true", > > > > "journal_max_write_bytes": "10485760", > > > > "journal_max_write_entries": "100", > > > > 
"journal_queue_max_ops": "500", > > > > "journal_queue_max_bytes": "104857600", > > > > "journal_align_min_size": "65536", > > > > "journal_replay_from": "0", > > > > "journal_zero_on_create": "false", > > > > "rbd_cache": "false", > > > > "rbd_cache_size": "33554432", > > > > "rbd_cache_max_dirty": "25165824", > > > > "rbd_cache_target_dirty": "16777216", > > > > "rbd_cache_max_dirty_age": "1", > > > > "nss_db_path": "", > > > > "rgw_data": "\/var\/lib\/ceph\/radosgw\/ceph-24", > > > > "rgw_enable_apis": "s3, swift, swift_auth, admin", > > > > "rgw_cache_enabled": "true", > > > > "rgw_cache_lru_size": "10000", > > > > "rgw_socket_path": "", > > > > "rgw_dns_name": "", > > > > "rgw_swift_url": "", > > > > "rgw_swift_url_prefix": "swift", > > > > "rgw_swift_auth_url": "", > > > > "rgw_swift_auth_entry": "auth", > > > > "rgw_keystone_url": "", > > > > "rgw_keystone_admin_token": "", > > > > "rgw_keystone_accepted_roles": "Member, admin", > > > > "rgw_keystone_token_cache_size": "10000", > > > > "rgw_keystone_revocation_interval": "900", > > > > "rgw_admin_entry": "admin", > > > > "rgw_enforce_swift_acls": "true", > > > > "rgw_swift_token_expiration": "86400", > > > > "rgw_print_continue": "true", > > > > "rgw_remote_addr_param": "REMOTE_ADDR", > > > > "rgw_op_thread_timeout": "600", > > > > "rgw_op_thread_suicide_timeout": "0", > > > > "rgw_thread_pool_size": "100", > > > > "rgw_num_control_oids": "8", > > > > "rgw_cluster_root_pool": ".rgw.root", > > > > "rgw_log_nonexistent_bucket": "false", > > > > "rgw_log_object_name": "%Y-%m-%d-%H-%i-%n", > > > > "rgw_log_object_name_utc": "false", > > > > "rgw_usage_max_shards": "32", > > > > "rgw_usage_max_user_shards": "1", > > > > "rgw_enable_ops_log": "true", > > > > "rgw_enable_usage_log": "true", > > > > "rgw_ops_log_rados": "true", > > > > "rgw_ops_log_socket_path": "", > > > > "rgw_ops_log_data_backlog": "5242880", > > > > "rgw_usage_log_flush_threshold": "1024", > > > > "rgw_usage_log_tick_interval": "30", > > > > 
"rgw_intent_log_object_name": "%Y-%m-%d-%i-%n", > > > > "rgw_intent_log_object_name_utc": "false", > > > > "rgw_init_timeout": "30", > > > > "rgw_mime_types_file": "\/etc\/mime.types", > > > > "rgw_gc_max_objs": "32", > > > > "rgw_gc_obj_min_wait": "7200", > > > > "rgw_gc_processor_max_time": "3600", > > > > "rgw_gc_processor_period": "3600", > > > > "rgw_s3_success_create_obj_status": "0", > > > > "rgw_resolve_cname": "false", > > > > "rgw_obj_stripe_size": "4194304", > > > > "rgw_extended_http_attrs": "", > > > > "mutex_perf_counter": "false", > > > > "internal_safe_to_start_threads": "true"} > > > > > > > > > > > > -----Original Message----- > > > > From: Mark Nelson [mailto:mark.nelson@inktank.com] > > > > Sent: Thursday, 20 December 2012 11:28 PM > > > > To: Matthew Anderson > > > > Cc: ceph-devel@vger.kernel.org > > > > Subject: Re: OSD's slow down to a crawl > > > > > > > > Hi Matt, > > > > > > > > Yeah, your log is looking pretty sparse! I imagine Sam will have a lot of questions for you. Out of curiosity have you looked at any io/cpu/memory stats when things slow down? sysprof may come in handy if it looks like it's CPU related. Also, check out the OSD admin socket. > > > > Sebastien Han made a nice little blog entry describing it: > > > > > > > > http://www.sebastien-han.fr/blog/2012/08/14/ceph-admin-socket/ > > > > > > > > You can get a lot of information about the current state of things. > > > > > > > > Mark > > > > > > > > > > -- > > > To unsubscribe from this list: send the line "unsubscribe ceph-devel" > > > in the body of a message to majordomo@vger.kernel.org More > > > majordomo info at http://vger.kernel.org/majordomo-info.html > > > > > > > > > -- To unsubscribe from this list: send the line "unsubscribe ceph-devel" in the body of a message to majordomo@vger.kernel.org More majordomo info at http://vger.kernel.org/majordomo-info.html ^ permalink raw reply [flat|nested] 13+ messages in thread
* Re: OSD's slow down to a crawl
2013-01-09 8:52 ` Matthew Anderson
@ 2013-01-09 14:09 ` Mark Nelson
0 siblings, 0 replies; 13+ messages in thread
From: Mark Nelson @ 2013-01-09 14:09 UTC (permalink / raw)
To: Matthew Anderson; +Cc: 'Sage Weil', ceph-devel@vger.kernel.org
On 01/09/2013 02:52 AM, Matthew Anderson wrote:
> Hi Sage,
>
> Sorry for the late follow up, I've been on a bit of a testing rampage and managed to somewhat sort the problem.
>
> Most of the problems appear to be from the 3.7.1 kernel. It seems to have a fairly big issue with its networking stack that was causing Ceph's network operations to hang. Moving back to a 3.6.8 kernel fixed this up. I don't know the full extent of the problem but it was reported on Phoronix briefly here - http://www.phoronix.com/scan.php?page=news_item&px=MTI2Nzc
>
> The second issue was BTRFS on both the 3.7 and 3.6.8 kernels. After running a long rados bench (10 minutes) on a fresh cluster it would often slow down significantly by going from 250MB/s down to a 50MB/s average. Latency also increased dramatically. Restarting the OSD's fixes the issue but after a while it slows right down again. In the end I re-formatted the cluster using XFS (and also EXT4 for benchmarks) and there wasn't a single issue. I had rados bench running for over 30 minutes from another machine and there wasn't a single issue.

Ah, too bad this is still happening. :( It's interesting though that restarting the OSDs fixes it. That's not something I expected. Sounds like I need to run some more tests again and see if I can get to the bottom of it.

>
> At this stage I need to start moving into production with XFS. My test cluster arrives in a few weeks so I should be able to come back to the BTRFS issue later on as it would be very handy to have compression working.
>
> Thanks again for your help
> -Matt
>
>
> -----Original Message-----
> From: Sage Weil [mailto:sage@inktank.com]
> Sent: Saturday, 22 December 2012 12:02 AM
> To: Matthew Anderson
> Cc: 'Mark Nelson'; ceph-devel@vger.kernel.org
> Subject: RE: OSD's slow down to a crawl
>
> On Fri, 21 Dec 2012, Matthew Anderson wrote:
>> Hi Sage,
>>
>> I've tried to reproduce the error again with logging on every OSD and
>> got the above. RADOS bench had stalled on a write request like the
>> last time and the attached log is the grep'd OSD log (# cat osd.25.log
>> | grep client.9501.0:744> freeze.log) . The OSD that stalled was 25,
>> pg map is below -
>>
>> # ceph pg map 6.5d83495b
>> osdmap e3775 pg 6.5d83495b (6.95b) -> up [25,31] acting [25,31]
>>
>> I hope that's what you were after, if not just let me know
>
> We're getting closer. The osd tried to send the reply. Can you reproduce with 'debug ms = 20' on the osds too, and on the client side do something like
>
> rados --debug-ms 20 --debug-objecter 20 --log-file /tmp/foo ...
>
> Thanks!
> sage
>
>
>>
>> Thanks again
>> -Matt
>>
>>
>> -----Original Message-----
>> From: Sage Weil [mailto:sage@inktank.com]
>> Sent: Friday, 21 December 2012 1:14 AM
>> To: Matthew Anderson
>> Cc: 'Mark Nelson'; ceph-devel@vger.kernel.org
>> Subject: RE: OSD's slow down to a crawl
>>
>> On Thu, 20 Dec 2012, Matthew Anderson wrote:
>>> Hi Sage,
>>>
>>> Logs are attached. I took the osd logs from osd.24 as this is the
>>> first osd in my SSD pool I've been testing with previously.
>>>
>>> The 4MB bench I was able to reproduce the fault by restarting my rbd
>>> export which stalled after a few percent complete. When I ran the
>>> 4MB bench it stalled early on and never received a response back
>>> from the OSD and I terminated it after 60 seconds or so. I wasn't
>>> able to reproduce the fault using the 4kb io size. The 4kb log
>>> should show rados bench completing normally at a respectable speed of about 1MB/s.
>>
>> Let's drill into the hang.. up until that point things look okay.
>>
>> 2012-12-21 00:51:26.033622 7f6f3c042760 1 -- 172.16.0.13:0/1023886
>> --> 172.16.0.13:6813/22233 -- osd_op(client.9503.0:185
>> benchmark_data_KVM04_23886_object184 [write 0~4194304] 6.3ca4346e) v4
>> -- ?+0 0x171ea50 con 0x171a7e0
>>
>> Do you have a log for that OSD so we can see what happened there? It
>> may also be that the replicated write is hung. If you do
>>
>> ceph pg map 6.3ca4346e
>>
>> you can see all OSDs storing that PG. And/or you can grep for
>> client.9503.0:185 in 172.16.0.13:6813/22233's log and see whether the sub_op was sent.
>>
>> Thanks!
>> sage
>>
>>
>>>
>>> Thanks
>>> -Matt
>>>
>>> -----Original Message-----
>>> From: ceph-devel-owner@vger.kernel.org
>>> [mailto:ceph-devel-owner@vger.kernel.org] On Behalf Of Sage Weil
>>> Sent: Friday, 21 December 2012 12:30 AM
>>> To: Matthew Anderson
>>> Cc: 'Mark Nelson'; ceph-devel@vger.kernel.org
>>> Subject: RE: OSD's slow down to a crawl
>>>
>>> Can you do a similar test, but with full logging on?
>>>
>>> ceph tell osd.0 injectargs '--debug-ms 1 --debug-filestore 20 --debug-osd 20 --debug-journal 20'
>>> rados -p ssd bench 30 write -b 4096 -t 1 --log-file /tmp/foo --debug-ms 1
>>>
>>> That will be a single IO in flight at a time and very easy to trace through the logs. If you can post the resulting log file (/tmp/foo and from osd.0), that would be awesome.
>>>
>>> Thanks!
>>> sage
>>>
>>>
>>>
>>> On Thu, 20 Dec 2012, Matthew Anderson wrote:
>>>
>>>> # rados bench 60 write -t 256 -p ssd
>>>> Maintaining 256 concurrent writes of 4194304 bytes for at least 60 seconds.
>>>> Object prefix: benchmark_data_KVM03_12985
>>>>   sec Cur ops   started  finished  avg MB/s  cur MB/s  last lat   avg lat
>>>>     0       0         0         0         0         0         -         0
>>>>     1      45        45         0         0         0         -         0
>>>>     2      71        71         0         0         0         -         0
>>>>     3      90        90         0         0         0         -         0
>>>>     4     125       125         0         0         0         -         0
>>>>     5     136       136         0         0         0         -         0
>>>>     6     144       144         0         0         0         -         0
>>>>     7     175       175         0         0         0         -         0
>>>>     8     201       201         0         0         0         -         0
>>>>     9     217       217         0         0         0         -         0
>>>>    10     228       228         0         0         0         -         0
>>>>    11     251       251         0         0         0         -         0
>>>>    12     255       266        11   3.66581   3.66667   11.7027   11.6013
>>>>    13     255       271        16   4.92194        20   12.9795   11.9979
>>>>    14     255       291        36   10.2833        80   13.1795   12.6478
>>>>    15     255       301        46   12.2639        40   13.9424   12.9123
>>>>    16     255       303        48   11.9973         8   14.0277   12.9579
>>>>    17     255       321        66    15.526        72   15.1408   13.4616
>>>>    18     255       345        90   19.9956        96   15.5673   13.9937
>>>>    19     255       349        94   19.7851        16    15.707   14.0635
>>>> 2012-12-21 00:06:37.198982min lat: 11.4904 max lat: 16.4053 avg lat: 14.3121
>>>>   sec Cur ops   started  finished  avg MB/s  cur MB/s  last lat   avg lat
>>>>    20     255       361       106   21.1953        48   16.4053   14.3121
>>>>    21     255       367       112   21.3285        24   17.4263   14.4795
>>>>    22     255       370       115   20.9043        12   17.3579   14.5544
>>>>    23     255       387       132   22.9513        68   18.3185   15.0867
>>>>    24     255       394       139   23.1614        28   18.2868   15.2504
>>>>    25     255       404       149   23.8346        40    18.741   15.5048
>>>>    26     255       410       155   23.8407        24    19.032   15.6358
>>>>    27     255       410       155   22.9577         0         -   15.6358
>>>>    28     255       410       155   22.1378         0         -   15.6358
>>>>    29     255       411       156   21.5124   1.33333   22.6927   15.6811
>>>>    30     255       431       176   23.4614        80   22.6577   16.4822
>>>>    31     255       447       192   24.7687        64   23.2345   17.0563
>>>>    32     255       464       209   26.1192        68   23.3788   17.5859
>>>>    33     255       485       230   27.8726        84   22.8924   18.1122
>>>>    34     255       497       242   28.4642        48    23.563   18.3693
>>>>    35     255       508       253   28.9078        44   23.0516   18.5935
>>>>    36     255       514       259   28.7713        24   24.4415   18.7278
>>>>    37     255       530       275    29.723        64    23.847   19.0424
>>>>    38     255       541       286   30.0985        44   24.5757   19.2427
>>>>    39     255       544       289   29.6344        12   24.6802   19.2984
>>>> 2012-12-21 00:06:57.203473min lat: 11.4904 max lat: 24.6802 avg lat: 19.2984
>>>>   sec Cur ops   started  finished  avg MB/s  cur MB/s  last lat   avg lat
>>>>    40     255       544       289   28.8935         0         -   19.2984
>>>>    41     255       544       289   28.1888         0         -   19.2984
>>>>    42     255       544       289   27.5176         0         -   19.2984
>>>>    43     255       544       289   26.8776         0         -   19.2984
>>>>    44     255       544       289   26.2668         0         -   19.2984
>>>>    45     255       544       289   25.6831         0         -   19.2984
>>>>    46     255       544       289   25.1248         0         -   19.2984
>>>>    47     255       544       289   24.5902         0         -   19.2984
>>>>    48     255       544       289   24.0779         0         -   19.2984
>>>>    49     255       544       289   23.5865         0         -   19.2984
>>>>    50     255       544       289   23.1148         0         -   19.2984
>>>>    51     255       544       289   22.6616         0         -   19.2984
>>>>    52     255       544       289   22.2258         0         -   19.2984
>>>>    53     255       544       289   21.8065         0         -   19.2984
>>>>    54     255       544       289   21.4026         0         -   19.2984
>>>>    55     255       544       289   21.0135         0         -   19.2984
>>>>    56     255       544       289   20.6383         0         -   19.2984
>>>>    57     255       544       289   20.2762         0         -   19.2984
>>>>    58     255       544       289   19.9266         0         -   19.2984
>>>>    59     255       544       289   19.5889         0         -   19.2984
>>>> 2012-12-21 00:07:17.207782min lat: 11.4904 max lat: 24.6802 avg lat: 19.2984
>>>>   sec Cur ops   started  finished  avg MB/s  cur MB/s  last lat   avg lat
>>>>    60     255       544       289   19.2624         0         -   19.2984
>>>>    61     255       544       289   18.9466         0         -   19.2984
>>>>    62     255       544       289    18.641         0         -   19.2984
>>>>    63     255       544       289   18.3452         0         -   19.2984
>>>>    64     255       544       289   18.0585         0         -   19.2984
>>>>    65     255       544       289   17.7807         0         -   19.2984
>>>>    66     255       544       289   17.5113         0         -   19.2984
>>>>    67     133       545       412   24.5916   17.5714   37.3574   27.4834
>>>> Total time run:         68.000740
>>>> Total writes made:      545
>>>> Write size:             4194304
>>>> Bandwidth (MB/sec):     32.058
>>>>
>>>> Stddev Bandwidth:       26.8837
>>>> Max bandwidth (MB/sec): 96
>>>> Min bandwidth (MB/sec): 0
>>>> Average Latency:        29.2801
>>>> Stddev Latency:         11.9768
>>>> Max latency:            52.6277
>>>> Min latency:            11.4904
>>>>
>>>>
>>>>
>>>>
>>>>
>>>> ----Original Message-----
>>>> From: Mark Nelson [mailto:mark.nelson@inktank.com]
>>>> Sent: Thursday, 20 December 2012 11:59 PM
>>>> To: Matthew Anderson
>>>> Cc: ceph-devel@vger.kernel.org
>>>> Subject: Re: OSD's slow down to a crawl
>>>>
>>>> Out of curiosity, if you fire up a rados bench instance on one of the nodes with say, 256 concurrent
writes, do any of the writes complete? >>>> >>>> Mark >>>> >>>> On 12/20/2012 09:51 AM, Matthew Anderson wrote: >>>>> Hi Mark, >>>>> >>>>> Thanks for the quick reply. The log was 380MB in total so I just posted last 200 lines hoping that a trained eye might be able to spot something. I'm happy to put the file up on a web server if anyone wants to review it in full. >>>>> >>>>> It doesn't appear to be memory or CPU related. In it's hung state (both read and write are not responding at the moment, I have a VM hung trying to shutdown and an rbd stuck exporting ) all of the OSD's sit around 0.3% CPU usage, ~300MB of ram and no IO wait. The server has roughly 40GB RAM free and is easily 90% idle. Iotop shows no IO coming from any of the Ceph-OSD's. >>>>> >>>>> The admin socket showed no ops in flight but the performance dump, perhistoric ops and config are below. I did notice that the age of some of the ops was quite high considering things were actively trying to write to the cluster. >>>>> # ceph --admin-daemon ./ceph-osd.24.asok perf dump >>>>> >>>>> {"filestore":{"journal_queue_max_ops":500,"journal_queue_ops":0, >>>>> "j >>>>> ou >>>>> rn >>>>> al_ops":20868,"journal_queue_max_bytes":104857600,"journal_queue >>>>> _b >>>>> yt >>>>> es >>>>> ":0,"journal_bytes":1710397238,"journal_latency":{"avgcount":20868," >>>>> su >>>>> m":185.268022000},"journal_wr":13354,"journal_wr_bytes":{"avgcount": >>>>> 13 >>>>> 354,"sum":1770647552},"op_queue_max_ops":500,"op_queue_ops":0,"ops": >>>>> 20 >>>>> 868,"op_queue_max_bytes":104857600,"op_queue_bytes":0,"bytes":17 >>>>> 10 >>>>> 23 >>>>> 25 >>>>> 83,"apply_latency":{"avgcount":20868,"sum":508.896737000},"committing" >>>>> :0,"commitcycle":339,"commitcycle_interval":{"avgcount":339,"sum": >>>>> 17 >>>>> 08 >>>>> .284725000},"commitcycle_latency":{"avgcount":339,"sum":13.24185 >>>>> 00 >>>>> 00 >>>>> }, >>>>> "journal_full":0},"mutex-FileJournal::completions_lock":{"wait":{" >>>>> av >>>>> gc >>>>> 
ount":0,"sum":0.000000000}},"mutex-FileJournal::finisher_lock":{"wait" >>>>> :{"avgcount":0,"sum":0.000000000}},"mutex-FileJournal::write_lock" >>>>> :{ >>>>> "w >>>>> ait":{"avgcount":0,"sum":0.000000000}},"mutex-FileJournal::write >>>>> q_ >>>>> lo >>>>> ck >>>>> ":{"wait":{"avgcount":0,"sum":0.000000000}},"mutex-JOS::ApplyManager:: >>>>> apply_lo >>>> ck":{"wai >>>> t":{"avgcount":0,"sum":0.000000000}},"mutex-JOS::ApplyManager::com >>>> _l >>>> oc >>>> k":{"wait":{"avgcount":0,"sum":0.000000000}},"mutex-JOS::SubmitMan >>>> ag >>>> er >>>> ::lock":{"wait":{"avgcount":0,"sum":0.000000000}},"osd":{"opq":0," >>>> op >>>> _w >>>> ip":0,"op":31753,"op_in_bytes":797806224,"op_out_bytes":1728414048," >>>> op >>>> _latency":{"avgcount":31753,"sum":82.183373000},"op_r":24945,"op_r >>>> _o >>>> ut >>>> _bytes":1728414048,"op_r_latency":{"avgcount":24945,"sum":16.83368 >>>> 90 >>>> 00 >>>> },"op_w":6808,"op_w_in_bytes":797806224,"op_w_rlat":{"avgcount":6808," >>>> sum":38.223006000},"op_w_latency":{"avgcount":6808,"sum":65.349684 >>>> 00 >>>> 0} >>>> ,"op_rw":0,"op_rw_in_bytes":0,"op_rw_out_bytes":0,"op_rw_rlat":{"a >>>> vg >>>> co >>>> unt":0,"sum":0.000000000},"op_rw_latency":{"avgcount":0,"sum":0.00 >>>> 00 >>>> 00 >>>> 000},"subop":5700,"subop_in_bytes":813224156,"subop_latency":{"avg >>>> co >>>> un >>>> t":5700,"sum":26.877085000},"subop_w":0,"subop_w_in_bytes":813224156," >>>> subop_w_latency":{"avgcount":5700,"sum":26.877085000},"subop_pull":0," >>>> subop_pull_latency":{"avgcount":0,"sum":0.000000000},"subop_push": >>>> 0, >>>> "s >>>> ubop_push_ >>>> in_bytes" >>>> :0,"subop_push_latency":{"avgcount":0,"sum":0.000000000},"pull":0, >>>> "p >>>> us >>>> h":0,"push_out_bytes":0,"push_in":0,"push_in_bytes":0,"recovery_ops" >>>> :0 >>>> ,"loadavg":359,"buffer_bytes":0,"numpg":2486,"numpg_primary":1214, >>>> "n >>>> um >>>> pg_replica":1272,"numpg_stray":0,"heartbeat_to_peers":4,"heartbeat >>>> _f >>>> ro >>>> 
m_peers":0,"map_messages":44,"map_message_epochs":184,"map_message >>>> _e >>>> po >>>> ch_dups":174},"throttle-filestore_bytes":{"val":0,"max":104857600, >>>> "g >>>> et >>>> ":0,"get_sum":0,"get_or_fail_fail":0,"get_or_fail_success":0,"take": >>>> 20 >>>> 868,"take_sum":1710397238,"put":13354,"put_sum":1710397238,"wait":{" >>>> av >>>> gcount":0,"sum":0.000000000}},"throttle-filestore_ops":{"val":0,"max": >>>> 500,"get":0,"get_sum":0,"get_or_fail_fail":0,"get_or_fail_success":0," >>>> take":20868,"take_sum":20868,"put":13354,"put_sum":20868,"wait":{" >>>> av >>>> gc >>>> ount":0,"sum":0.000000000}},"throttle-msgr_dispatch_throttler-client": >>>> {"val":0,"max":104857600,"get":31657,"get_sum":795546573,"get_or_f >>>> ai >>>> l_ >>>> fail":0,"get_or_fail_success":0,"take":0,"take_sum":0,"put":31657, >>>> "p >>>> ut >>>> _sum":7955 >>>> 46573,"wa >>>> it":{"avgcount":0,"sum":0.000000000}},"throttle-msgr_dispatch_thro >>>> tt >>>> le >>>> r-cluster":{"val":0,"max":104857600,"get":18913,"get_sum":82801129 >>>> 3, >>>> "g >>>> et_or_fail_fail":0,"get_or_fail_success":0,"take":0,"take_sum":0,"put" >>>> :18913,"put_sum":828011293,"wait":{"avgcount":0,"sum":0.000000000} >>>> }, >>>> "t >>>> hrottle-msgr_dispatch_throttler-hbclient":{"val":0,"max":104857600," >>>> ge >>>> t":4260,"get_sum":200220,"get_or_fail_fail":0,"get_or_fail_success": >>>> 0, >>>> "take":0,"take_sum":0,"put":4260,"put_sum":200220,"wait":{"avgcount" >>>> :0 >>>> ,"sum":0.000000000}},"throttle-msgr_dispatch_throttler-hbserver":{ >>>> "v >>>> al >>>> ":0,"max":104857600,"get":4236,"get_sum":199092,"get_or_fail_fail":0," >>>> get_or_fail_success":0,"take":0,"take_sum":0,"put":4236,"put_sum": >>>> 19 >>>> 90 >>>> 92,"wait":{"avgcount":0,"sum":0.000000000}},"throttle-osd_client_b >>>> yt >>>> es >>>> ":{"val":0,"max":524288000,"get":31308,"get_sum":795353720,"get_or >>>> _f >>>> ai >>>> l_fail":0,"get_or_fail_success":0,"take":0,"take_sum":0,"put":3767 >>>> 1, "p 
ut_sum":795353720,"wait":{"avgcount":0,"sum":0.000000000}}} >>>>> >>>>> # ceph --admin-daemon ./ceph-osd.24.asok dump_historic_ops >>>>> >>>>> { "description": "osd_op(client.7636.0:101498 rb.0.122c.6b8b4567.000000000357 [write 3600384~4096] 6.42463fe3)", >>>>> "received_at": "2012-12-20 23:34:42.012131", >>>>> "age": "75.428051", >>>>> "duration": "0.009988", >>>>> "flag_point": "waiting for sub ops", >>>>> "client_info": { "client": "client.7636", >>>>> "tid": 101498}, >>>>> "events": [ >>>>> { "time": "2012-12-20 23:34:42.012317", >>>>> "event": "waiting_for_osdmap"}, >>>>> { "time": "2012-12-20 23:34:42.012425", >>>>> "event": "reached_pg"}, >>>>> { "time": "2012-12-20 23:34:42.012678", >>>>> "event": "started"}, >>>>> { "time": "2012-12-20 23:34:42.012720", >>>>> "event": "started"}, >>>>> { "time": "2012-12-20 23:34:42.012815", >>>>> "event": "sub_op_sent"}, >>>>> { "time": "2012-12-20 23:34:42.012958", >>>>> "event": "commit_queued_for_journal_write"}, >>>>> { "time": "2012-12-20 23:34:42.013033", >>>>> "event": "write_thread_in_journal_buffer"}, >>>>> { "time": "2012-12-20 23:34:42.013745", >>>>> "event": "journaled_completion_queued"}, >>>>> { "time": "2012-12-20 23:34:42.013803", >>>>> "event": "op_commit"}, >>>>> { "time": "2012-12-20 23:34:42.017414", >>>>> "event": "sub_op_commit_rec"}, >>>>> { "time": "2012-12-20 23:34:42.022066", >>>>> "event": "op_applied"}, >>>>> { "time": "2012-12-20 23:34:42.022119", >>>>> "event": "done"}]}, >>>>> { "description": "osd_sub_op_reply(client.7636.0:101498 6.1fe3 42463fe3\/rb.0.122c.6b8b4567.000000000357\/head\/\/6 [] ondisk, result = 0)", >>>>> "received_at": "2012-12-20 23:34:42.017048", >>>>> "age": "75.423134", >>>>> "duration": "0.000411", >>>>> "flag_point": "started", >>>>> "events": [ >>>>> { "time": "2012-12-20 23:34:42.017218", >>>>> "event": "waiting_for_osdmap"}, >>>>> { "time": "2012-12-20 23:34:42.017363", >>>>> "event": "reached_pg"}, >>>>> { "time": "2012-12-20 23:34:42.017401", >>>>> "event": 
"started"}, >>>>> { "time": "2012-12-20 23:34:42.017459", >>>>> "event": "done"}]}, >>>>> { "description": "osd_op(client.7636.0:101502 rb.0.122c.6b8b4567.000000000064 [write 1512448~61440] 6.a241e34d)", >>>>> "received_at": "2012-12-20 23:34:49.851021", >>>>> "age": "67.589161", >>>>> "duration": "0.007114", >>>>> "flag_point": "waiting for sub ops", >>>>> "client_info": { "client": "client.7636", >>>>> "tid": 101502}, >>>>> "events": [ >>>>> { "time": "2012-12-20 23:34:49.851276", >>>>> "event": "waiting_for_osdmap"}, >>>>> { "time": "2012-12-20 23:34:49.851429", >>>>> "event": "reached_pg"}, >>>>> { "time": "2012-12-20 23:34:49.851616", >>>>> "event": "started"}, >>>>> { "time": "2012-12-20 23:34:49.851648", >>>>> "event": "started"}, >>>>> { "time": "2012-12-20 23:34:49.851712", >>>>> "event": "sub_op_sent"}, >>>>> { "time": "2012-12-20 23:34:49.851770", >>>>> "event": "commit_queued_for_journal_write"}, >>>>> { "time": "2012-12-20 23:34:49.851915", >>>>> "event": "write_thread_in_journal_buffer"}, >>>>> { "time": "2012-12-20 23:34:49.853956", >>>>> "event": "journaled_completion_queued"}, >>>>> { "time": "2012-12-20 23:34:49.853992", >>>>> "event": "op_commit"}, >>>>> { "time": "2012-12-20 23:34:49.857016", >>>>> "event": "sub_op_commit_rec"}, >>>>> { "time": "2012-12-20 23:34:49.858077", >>>>> "event": "op_applied"}, >>>>> { "time": "2012-12-20 23:34:49.858135", >>>>> "event": "done"}]}, >>>>> { "description": "osd_sub_op_reply(client.7636.0:101502 6.234d a241e34d\/rb.0.122c.6b8b4567.000000000064\/head\/\/6 [] ondisk, result = 0)", >>>>> "received_at": "2012-12-20 23:34:49.856762", >>>>> "age": "67.583420", >>>>> "duration": "0.000308", >>>>> "flag_point": "started", >>>>> "events": [ >>>>> { "time": "2012-12-20 23:34:49.856869", >>>>> "event": "waiting_for_osdmap"}, >>>>> { "time": "2012-12-20 23:34:49.856969", >>>>> "event": "reached_pg"}, >>>>> { "time": "2012-12-20 23:34:49.857001", >>>>> "event": "started"}, >>>>> { "time": "2012-12-20 23:34:49.857070", 
>>>>> "event": "done"}]}, >>>>> { "description": "osd_op(client.7636.0:101504 rb.0.122c.6b8b4567.00000000012c [write 1658880~2048] 6.92d2c38c)", >>>>> "received_at": "2012-12-20 23:35:34.351518", >>>>> "age": "23.088664", >>>>> "duration": "0.009430", >>>>> "flag_point": "waiting for sub ops", >>>>> "client_info": { "client": "client.7636", >>>>> "tid": 101504}, >>>>> "events": [ >>>>> { "time": "2012-12-20 23:35:34.351714", >>>>> "event": "waiting_for_osdmap"}, >>>>> { "time": "2012-12-20 23:35:34.351811", >>>>> "event": "reached_pg"}, >>>>> { "time": "2012-12-20 23:35:34.352030", >>>>> "event": "started"}, >>>>> { "time": "2012-12-20 23:35:34.352071", >>>>> "event": "started"}, >>>>> { "time": "2012-12-20 23:35:34.352146", >>>>> "event": "sub_op_sent"}, >>>>> { "time": "2012-12-20 23:35:34.352213", >>>>> "event": "commit_queued_for_journal_write"}, >>>>> { "time": "2012-12-20 23:35:34.352287", >>>>> "event": "write_thread_in_journal_buffer"}, >>>>> { "time": "2012-12-20 23:35:34.355390", >>>>> "event": "journaled_completion_queued"}, >>>>> { "time": "2012-12-20 23:35:34.355475", >>>>> "event": "op_commit"}, >>>>> { "time": "2012-12-20 23:35:34.356661", >>>>> "event": "sub_op_commit_rec"}, >>>>> { "time": "2012-12-20 23:35:34.360894", >>>>> "event": "op_applied"}, >>>>> { "time": "2012-12-20 23:35:34.360948", >>>>> "event": "done"}]}, >>>>> { "description": "osd_sub_op_reply(client.7636.0:101504 6.38c 92d2c38c\/rb.0.122c.6b8b4567.00000000012c\/head\/\/6 [] ondisk, result = 0)", >>>>> "received_at": "2012-12-20 23:35:34.356286", >>>>> "age": "23.083896", >>>>> "duration": "0.000421", >>>>> "flag_point": "started", >>>>> "events": [ >>>>> { "time": "2012-12-20 23:35:34.356489", >>>>> "event": "waiting_for_osdmap"}, >>>>> { "time": "2012-12-20 23:35:34.356598", >>>>> "event": "reached_pg"}, >>>>> { "time": "2012-12-20 23:35:34.356640", >>>>> "event": "started"}, >>>>> { "time": "2012-12-20 23:35:34.356707", >>>>> "event": "done"}]}, >>>>> { "description": 
"osd_op(client.7636.0:101512 rb.0.122c.6b8b4567.000000000357 [write 3731456~4096] 6.42463fe3)", >>>>> "received_at": "2012-12-20 23:35:42.133103", >>>>> "age": "15.307079", >>>>> "duration": "0.008357", >>>>> "flag_point": "waiting for sub ops", >>>>> "client_info": { "client": "client.7636", >>>>> "tid": 101512}, >>>>> "events": [ >>>>> { "time": "2012-12-20 23:35:42.133315", >>>>> "event": "waiting_for_osdmap"}, >>>>> { "time": "2012-12-20 23:35:42.133427", >>>>> "event": "reached_pg"}, >>>>> { "time": "2012-12-20 23:35:42.133684", >>>>> "event": "started"}, >>>>> { "time": "2012-12-20 23:35:42.133723", >>>>> "event": "started"}, >>>>> { "time": "2012-12-20 23:35:42.133796", >>>>> "event": "sub_op_sent"}, >>>>> { "time": "2012-12-20 23:35:42.133908", >>>>> "event": "commit_queued_for_journal_write"}, >>>>> { "time": "2012-12-20 23:35:42.133995", >>>>> "event": "write_thread_in_journal_buffer"}, >>>>> { "time": "2012-12-20 23:35:42.136965", >>>>> "event": "journaled_completion_queued"}, >>>>> { "time": "2012-12-20 23:35:42.137025", >>>>> "event": "op_commit"}, >>>>> { "time": "2012-12-20 23:35:42.138456", >>>>> "event": "sub_op_commit_rec"}, >>>>> { "time": "2012-12-20 23:35:42.141408", >>>>> "event": "op_applied"}, >>>>> { "time": "2012-12-20 23:35:42.141460", >>>>> "event": "done"}]}, >>>>> { "description": "osd_sub_op_reply(client.7636.0:101512 6.1fe3 42463fe3\/rb.0.122c.6b8b4567.000000000357\/head\/\/6 [] ondisk, result = 0)", >>>>> "received_at": "2012-12-20 23:35:42.138087", >>>>> "age": "15.302095", >>>>> "duration": "0.000410", >>>>> "flag_point": "started", >>>>> "events": [ >>>>> { "time": "2012-12-20 23:35:42.138248", >>>>> "event": "waiting_for_osdmap"}, >>>>> { "time": "2012-12-20 23:35:42.138363", >>>>> "event": "reached_pg"}, >>>>> { "time": "2012-12-20 23:35:42.138443", >>>>> "event": "started"}, >>>>> { "time": "2012-12-20 23:35:42.138497", >>>>> "event": "done"}]}]} >>>>> >>>>> # ceph --admin-daemon ./ceph-osd.24.asok config show { "name": >>>>> 
"osd.24", >>>>> "cluster": "ceph", >>>>> "none": "0\/5", >>>>> "lockdep": "0\/1", >>>>> "context": "0\/1", >>>>> "crush": "1\/1", >>>>> "mds": "1\/5", >>>>> "mds_balancer": "1\/5", >>>>> "mds_locker": "1\/5", >>>>> "mds_log": "1\/5", >>>>> "mds_log_expire": "1\/5", >>>>> "mds_migrator": "1\/5", >>>>> "buffer": "0\/1", >>>>> "timer": "0\/1", >>>>> "filer": "0\/1", >>>>> "striper": "0\/1", >>>>> "objecter": "0\/1", >>>>> "rados": "0\/5", >>>>> "rbd": "0\/5", >>>>> "journaler": "0\/5", >>>>> "objectcacher": "0\/5", >>>>> "client": "0\/5", >>>>> "osd": "0\/5", >>>>> "optracker": "0\/5", >>>>> "objclass": "0\/5", >>>>> "filestore": "1\/3", >>>>> "journal": "1\/3", >>>>> "ms": "0\/5", >>>>> "mon": "1\/5", >>>>> "monc": "0\/10", >>>>> "paxos": "0\/5", >>>>> "tp": "0\/5", >>>>> "auth": "1\/5", >>>>> "crypto": "1\/5", >>>>> "finisher": "1\/1", >>>>> "heartbeatmap": "1\/5", >>>>> "perfcounter": "1\/5", >>>>> "rgw": "1\/5", >>>>> "hadoop": "1\/5", >>>>> "javaclient": "1\/5", >>>>> "asok": "1\/5", >>>>> "throttle": "1\/1", >>>>> "host": "KVM04", >>>>> "fsid": "00000000-0000-0000-0000-000000000000", >>>>> "public_addr": ":\/0", >>>>> "cluster_addr": ":\/0", >>>>> "public_network": "", >>>>> "cluster_network": "", >>>>> "num_client": "1", >>>>> "monmap": "", >>>>> "mon_host": "", >>>>> "lockdep": "false", >>>>> "admin_socket": "\/var\/run\/ceph\/ceph-osd.24.asok", >>>>> "daemonize": "true", >>>>> "pid_file": "\/var\/run\/ceph\/osd.24.pid", >>>>> "chdir": "\/", >>>>> "max_open_files": "0", >>>>> "fatal_signal_handlers": "true", >>>>> "log_file": "\/var\/log\/ceph\/osd.24.log", >>>>> "log_max_new": "1000", >>>>> "log_max_recent": "100000", >>>>> "log_to_stderr": "false", >>>>> "err_to_stderr": "true", >>>>> "log_to_syslog": "false", >>>>> "err_to_syslog": "false", >>>>> "log_flush_on_exit": "true", >>>>> "clog_to_monitors": "true", >>>>> "clog_to_syslog": "false", >>>>> "mon_cluster_log_to_syslog": "false", >>>>> "mon_cluster_log_file": "\/var\/log\/ceph\/ceph.log", >>>>> "key": 
"", >>>>> "keyfile": "", >>>>> "keyring": "\/srv\/ceph\/osd\/osd.24\/keyring", >>>>> "heartbeat_interval": "5", >>>>> "heartbeat_file": "", >>>>> "perf": "true", >>>>> "ms_tcp_nodelay": "true", >>>>> "ms_initial_backoff": "0.2", >>>>> "ms_max_backoff": "15", >>>>> "ms_nocrc": "false", >>>>> "ms_die_on_bad_msg": "false", >>>>> "ms_dispatch_throttle_bytes": "104857600", >>>>> "ms_bind_ipv6": "false", >>>>> "ms_bind_port_min": "6800", >>>>> "ms_bind_port_max": "7100", >>>>> "ms_rwthread_stack_bytes": "1048576", >>>>> "ms_tcp_read_timeout": "900", >>>>> "ms_inject_socket_failures": "0", >>>>> "ms_inject_delay_type": "", >>>>> "ms_inject_delay_max": "1", >>>>> "ms_inject_delay_probability": "0", >>>>> "mon_data": "\/var\/lib\/ceph\/mon\/ceph-24", >>>>> "mon_initial_members": "", >>>>> "mon_sync_fs_threshold": "5", >>>>> "mon_tick_interval": "5", >>>>> "mon_subscribe_interval": "300", >>>>> "mon_osd_laggy_halflife": "3600", >>>>> "mon_osd_laggy_weight": "0.3", >>>>> "mon_osd_adjust_heartbeat_grace": "true", >>>>> "mon_osd_adjust_down_out_interval": "true", >>>>> "mon_osd_auto_mark_in": "false", >>>>> "mon_osd_auto_mark_auto_out_in": "true", >>>>> "mon_osd_auto_mark_new_in": "true", >>>>> "mon_osd_down_out_interval": "300", >>>>> "mon_osd_min_up_ratio": "0.3", >>>>> "mon_osd_min_in_ratio": "0.3", >>>>> "mon_lease": "5", >>>>> "mon_lease_renew_interval": "3", >>>>> "mon_lease_ack_timeout": "10", >>>>> "mon_clock_drift_allowed": "0.05", >>>>> "mon_clock_drift_warn_backoff": "5", >>>>> "mon_accept_timeout": "10", >>>>> "mon_pg_create_interval": "30", >>>>> "mon_pg_stuck_threshold": "300", >>>>> "mon_osd_full_ratio": "0.95", >>>>> "mon_osd_nearfull_ratio": "0.85", >>>>> "mon_globalid_prealloc": "100", >>>>> "mon_osd_report_timeout": "900", >>>>> "mon_force_standby_active": "true", >>>>> "mon_min_osdmap_epochs": "500", >>>>> "mon_max_pgmap_epochs": "500", >>>>> "mon_max_log_epochs": "500", >>>>> "mon_max_osd": "10000", >>>>> "mon_probe_timeout": "2", >>>>> "mon_slurp_timeout": 
"10", >>>>> "mon_slurp_bytes": "262144", >>>>> "mon_client_bytes": "104857600", >>>>> "mon_daemon_bytes": "419430400", >>>>> "mon_max_log_entries_per_event": "4096", >>>>> "paxos_max_join_drift": "10", >>>>> "paxos_propose_interval": "1", >>>>> "paxos_min_wait": "0.05", >>>>> "clock_offset": "0", >>>>> "auth_cluster_required": "none", >>>>> "auth_service_required": "none", >>>>> "auth_client_required": "none", >>>>> "auth_supported": "none", >>>>> "cephx_require_signatures": "false", >>>>> "cephx_sign_messages": "true", >>>>> "auth_mon_ticket_ttl": "43200", >>>>> "auth_service_ticket_ttl": "3600", >>>>> "mon_client_hunt_interval": "3", >>>>> "mon_client_ping_interval": "10", >>>>> "mon_client_max_log_entries_per_message": "1000", >>>>> "mon_max_pool_pg_num": "65536", >>>>> "client_cache_size": "16384", >>>>> "client_cache_mid": "0.75", >>>>> "client_use_random_mds": "false", >>>>> "client_mount_timeout": "30", >>>>> "client_tick_interval": "1", >>>>> "client_trace": "", >>>>> "client_readahead_min": "131072", >>>>> "client_readahead_max_bytes": "0", >>>>> "client_readahead_max_periods": "4", >>>>> "client_snapdir": ".snap", >>>>> "client_mountpoint": "\/", >>>>> "client_notify_timeout": "10", >>>>> "client_caps_release_delay": "5", >>>>> "client_oc": "true", >>>>> "client_oc_size": "209715200", >>>>> "client_oc_max_dirty": "104857600", >>>>> "client_oc_target_dirty": "8388608", >>>>> "client_oc_max_dirty_age": "5", >>>>> "client_oc_max_objects": "1000", >>>>> "client_debug_force_sync_read": "false", >>>>> "client_debug_inject_tick_delay": "0", >>>>> "fuse_use_invalidate_cb": "false", >>>>> "fuse_big_writes": "true", >>>>> "fuse_debug": "false", >>>>> "objecter_tick_interval": "5", >>>>> "objecter_timeout": "10", >>>>> "objecter_inflight_op_bytes": "104857600", >>>>> "objecter_inflight_ops": "1024", >>>>> "journaler_allow_split_entries": "true", >>>>> "journaler_write_head_interval": "15", >>>>> "journaler_prefetch_periods": "10", >>>>> "journaler_prezero_periods": 
"5", >>>>> "journaler_batch_interval": "0.001", >>>>> "journaler_batch_max": "0", >>>>> "mds_data": "\/var\/lib\/ceph\/mds\/ceph-24", >>>>> "mds_max_file_size": "1099511627776", >>>>> "mds_cache_size": "100000", >>>>> "mds_cache_mid": "0.7", >>>>> "mds_mem_max": "1048576", >>>>> "mds_dir_commit_ratio": "0.5", >>>>> "mds_dir_max_commit_size": "90", >>>>> "mds_decay_halflife": "5", >>>>> "mds_beacon_interval": "4", >>>>> "mds_beacon_grace": "15", >>>>> "mds_blacklist_interval": "1440", >>>>> "mds_session_timeout": "60", >>>>> "mds_session_autoclose": "300", >>>>> "mds_reconnect_timeout": "45", >>>>> "mds_tick_interval": "5", >>>>> "mds_dirstat_min_interval": "1", >>>>> "mds_scatter_nudge_interval": "5", >>>>> "mds_client_prealloc_inos": "1000", >>>>> "mds_early_reply": "true", >>>>> "mds_use_tmap": "true", >>>>> "mds_default_dir_hash": "2", >>>>> "mds_log": "true", >>>>> "mds_log_skip_corrupt_events": "false", >>>>> "mds_log_max_events": "-1", >>>>> "mds_log_max_segments": "30", >>>>> "mds_log_max_expiring": "20", >>>>> "mds_bal_sample_interval": "3", >>>>> "mds_bal_replicate_threshold": "8000", >>>>> "mds_bal_unreplicate_threshold": "0", >>>>> "mds_bal_frag": "false", >>>>> "mds_bal_split_size": "10000", >>>>> "mds_bal_split_rd": "25000", >>>>> "mds_bal_split_wr": "10000", >>>>> "mds_bal_split_bits": "3", >>>>> "mds_bal_merge_size": "50", >>>>> "mds_bal_merge_rd": "1000", >>>>> "mds_bal_merge_wr": "1000", >>>>> "mds_bal_interval": "10", >>>>> "mds_bal_fragment_interval": "5", >>>>> "mds_bal_idle_threshold": "0", >>>>> "mds_bal_max": "-1", >>>>> "mds_bal_max_until": "-1", >>>>> "mds_bal_mode": "0", >>>>> "mds_bal_min_rebalance": "0.1", >>>>> "mds_bal_min_start": "0.2", >>>>> "mds_bal_need_min": "0.8", >>>>> "mds_bal_need_max": "1.2", >>>>> "mds_bal_midchunk": "0.3", >>>>> "mds_bal_minchunk": "0.001", >>>>> "mds_bal_target_removal_min": "5", >>>>> "mds_bal_target_removal_max": "10", >>>>> "mds_replay_interval": "1", >>>>> "mds_shutdown_check": "0", >>>>> 
"mds_thrash_exports": "0", >>>>> "mds_thrash_fragments": "0", >>>>> "mds_dump_cache_on_map": "false", >>>>> "mds_dump_cache_after_rejoin": "false", >>>>> "mds_verify_scatter": "false", >>>>> "mds_debug_scatterstat": "false", >>>>> "mds_debug_frag": "false", >>>>> "mds_debug_auth_pins": "false", >>>>> "mds_debug_subtrees": "false", >>>>> "mds_kill_mdstable_at": "0", >>>>> "mds_kill_export_at": "0", >>>>> "mds_kill_import_at": "0", >>>>> "mds_kill_link_at": "0", >>>>> "mds_kill_rename_at": "0", >>>>> "mds_wipe_sessions": "false", >>>>> "mds_wipe_ino_prealloc": "false", >>>>> "mds_skip_ino": "0", >>>>> "max_mds": "1", >>>>> "mds_standby_for_name": "", >>>>> "mds_standby_for_rank": "-1", >>>>> "mds_standby_replay": "false", >>>>> "osd_auto_upgrade_tmap": "true", >>>>> "osd_tmapput_sets_uses_tmap": "false", >>>>> "osd_max_backfills": "10", >>>>> "osd_backfill_full_ratio": "0.85", >>>>> "osd_backfill_retry_interval": "10", >>>>> "osd_uuid": "00000000-0000-0000-0000-000000000000", >>>>> "osd_data": "\/srv\/ceph\/osd\/osd.24", >>>>> "osd_journal": "\/dev\/sda3", >>>>> "osd_journal_size": "5120", >>>>> "osd_max_write_size": "90", >>>>> "osd_max_pgls": "1024", >>>>> "osd_client_message_size_cap": "524288000", >>>>> "osd_pg_bits": "6", >>>>> "osd_pgp_bits": "6", >>>>> "osd_min_rep": "1", >>>>> "osd_max_rep": "10", >>>>> "osd_pool_default_crush_rule": "0", >>>>> "osd_pool_default_size": "2", >>>>> "osd_pool_default_min_size": "0", >>>>> "osd_pool_default_pg_num": "8", >>>>> "osd_pool_default_pgp_num": "8", >>>>> "osd_map_dedup": "true", >>>>> "osd_map_cache_size": "500", >>>>> "osd_map_message_max": "100", >>>>> "osd_op_threads": "2", >>>>> "osd_disk_threads": "1", >>>>> "osd_recovery_threads": "1", >>>>> "osd_recover_clone_overlap": "true", >>>>> "osd_backfill_scan_min": "64", >>>>> "osd_backfill_scan_max": "512", >>>>> "osd_op_thread_timeout": "30", >>>>> "osd_recovery_thread_timeout": "30", >>>>> "osd_snap_trim_thread_timeout": "3600", >>>>> "osd_scrub_thread_timeout": 
"60", >>>>> "osd_scrub_finalize_thread_timeout": "600", >>>>> "osd_remove_thread_timeout": "3600", >>>>> "osd_command_thread_timeout": "600", >>>>> "osd_age": "0.8", >>>>> "osd_age_time": "0", >>>>> "osd_heartbeat_addr": ":\/0", >>>>> "osd_heartbeat_interval": "6", >>>>> "osd_heartbeat_grace": "20", >>>>> "osd_mon_heartbeat_interval": "30", >>>>> "osd_mon_report_interval_max": "120", >>>>> "osd_mon_report_interval_min": "5", >>>>> "osd_mon_ack_timeout": "30", >>>>> "osd_min_down_reporters": "1", >>>>> "osd_min_down_reports": "3", >>>>> "osd_default_data_pool_replay_window": "45", >>>>> "osd_preserve_trimmed_log": "false", >>>>> "osd_auto_mark_unfound_lost": "false", >>>>> "osd_recovery_delay_start": "0", >>>>> "osd_recovery_max_active": "50", >>>>> "osd_recovery_max_chunk": "8388608", >>>>> "osd_recovery_forget_lost_objects": "false", >>>>> "osd_max_scrubs": "1", >>>>> "osd_scrub_load_threshold": "0.5", >>>>> "osd_scrub_min_interval": "300", >>>>> "osd_scrub_max_interval": "86400", >>>>> "osd_deep_scrub_interval": "604800", >>>>> "osd_deep_scrub_stride": "524288", >>>>> "osd_auto_weight": "false", >>>>> "osd_class_dir": "\/usr\/local\/lib\/rados-classes", >>>>> "osd_check_for_log_corruption": "false", >>>>> "osd_use_stale_snap": "false", >>>>> "osd_rollback_to_cluster_snap": "", >>>>> "osd_default_notify_timeout": "30", >>>>> "osd_kill_backfill_at": "0", >>>>> "osd_min_pg_log_entries": "1000", >>>>> "osd_op_complaint_time": "30", >>>>> "osd_command_max_records": "256", >>>>> "osd_op_log_threshold": "5", >>>>> "osd_verify_sparse_read_holes": "false", >>>>> "osd_debug_drop_ping_probability": "0", >>>>> "osd_debug_drop_ping_duration": "0", >>>>> "osd_debug_drop_pg_create_probability": "0", >>>>> "osd_debug_drop_pg_create_duration": "1", >>>>> "osd_debug_drop_op_probability": "0", >>>>> "osd_op_history_size": "20", >>>>> "osd_op_history_duration": "600", >>>>> "osd_target_transaction_size": "300", >>>>> "osd_client_op_priority": "63", >>>>> "osd_recovery_op_priority": 
"10", >>>>> "filestore": "false", >>>>> "filestore_index_retry_probability": "0", >>>>> "filestore_debug_omap_check": "false", >>>>> "filestore_xattr_use_omap": "false", >>>>> "filestore_max_inline_xattr_size": "512", >>>>> "filestore_max_inline_xattrs": "2", >>>>> "filestore_max_sync_interval": "5", >>>>> "filestore_min_sync_interval": "0.01", >>>>> "filestore_btrfs_snap": "true", >>>>> "filestore_btrfs_clone_range": "true", >>>>> "filestore_fsync_flushes_journal_data": "false", >>>>> "filestore_fiemap": "false", >>>>> "filestore_flusher": "true", >>>>> "filestore_flusher_max_fds": "512", >>>>> "filestore_flush_min": "65536", >>>>> "filestore_sync_flush": "false", >>>>> "filestore_journal_parallel": "false", >>>>> "filestore_journal_writeahead": "false", >>>>> "filestore_journal_trailing": "false", >>>>> "filestore_queue_max_ops": "500", >>>>> "filestore_queue_max_bytes": "104857600", >>>>> "filestore_queue_committing_max_ops": "500", >>>>> "filestore_queue_committing_max_bytes": "104857600", >>>>> "filestore_op_threads": "2", >>>>> "filestore_op_thread_timeout": "60", >>>>> "filestore_op_thread_suicide_timeout": "180", >>>>> "filestore_commit_timeout": "600", >>>>> "filestore_fiemap_threshold": "4096", >>>>> "filestore_merge_threshold": "10", >>>>> "filestore_split_multiple": "2", >>>>> "filestore_update_to": "1000", >>>>> "filestore_blackhole": "false", >>>>> "filestore_dump_file": "", >>>>> "filestore_kill_at": "0", >>>>> "filestore_fail_eio": "true", >>>>> "journal_dio": "true", >>>>> "journal_aio": "false", >>>>> "journal_block_align": "true", >>>>> "journal_max_write_bytes": "10485760", >>>>> "journal_max_write_entries": "100", >>>>> "journal_queue_max_ops": "500", >>>>> "journal_queue_max_bytes": "104857600", >>>>> "journal_align_min_size": "65536", >>>>> "journal_replay_from": "0", >>>>> "journal_zero_on_create": "false", >>>>> "rbd_cache": "false", >>>>> "rbd_cache_size": "33554432", >>>>> "rbd_cache_max_dirty": "25165824", >>>>> "rbd_cache_target_dirty": 
"16777216", >>>>> "rbd_cache_max_dirty_age": "1", >>>>> "nss_db_path": "", >>>>> "rgw_data": "\/var\/lib\/ceph\/radosgw\/ceph-24", >>>>> "rgw_enable_apis": "s3, swift, swift_auth, admin", >>>>> "rgw_cache_enabled": "true", >>>>> "rgw_cache_lru_size": "10000", >>>>> "rgw_socket_path": "", >>>>> "rgw_dns_name": "", >>>>> "rgw_swift_url": "", >>>>> "rgw_swift_url_prefix": "swift", >>>>> "rgw_swift_auth_url": "", >>>>> "rgw_swift_auth_entry": "auth", >>>>> "rgw_keystone_url": "", >>>>> "rgw_keystone_admin_token": "", >>>>> "rgw_keystone_accepted_roles": "Member, admin", >>>>> "rgw_keystone_token_cache_size": "10000", >>>>> "rgw_keystone_revocation_interval": "900", >>>>> "rgw_admin_entry": "admin", >>>>> "rgw_enforce_swift_acls": "true", >>>>> "rgw_swift_token_expiration": "86400", >>>>> "rgw_print_continue": "true", >>>>> "rgw_remote_addr_param": "REMOTE_ADDR", >>>>> "rgw_op_thread_timeout": "600", >>>>> "rgw_op_thread_suicide_timeout": "0", >>>>> "rgw_thread_pool_size": "100", >>>>> "rgw_num_control_oids": "8", >>>>> "rgw_cluster_root_pool": ".rgw.root", >>>>> "rgw_log_nonexistent_bucket": "false", >>>>> "rgw_log_object_name": "%Y-%m-%d-%H-%i-%n", >>>>> "rgw_log_object_name_utc": "false", >>>>> "rgw_usage_max_shards": "32", >>>>> "rgw_usage_max_user_shards": "1", >>>>> "rgw_enable_ops_log": "true", >>>>> "rgw_enable_usage_log": "true", >>>>> "rgw_ops_log_rados": "true", >>>>> "rgw_ops_log_socket_path": "", >>>>> "rgw_ops_log_data_backlog": "5242880", >>>>> "rgw_usage_log_flush_threshold": "1024", >>>>> "rgw_usage_log_tick_interval": "30", >>>>> "rgw_intent_log_object_name": "%Y-%m-%d-%i-%n", >>>>> "rgw_intent_log_object_name_utc": "false", >>>>> "rgw_init_timeout": "30", >>>>> "rgw_mime_types_file": "\/etc\/mime.types", >>>>> "rgw_gc_max_objs": "32", >>>>> "rgw_gc_obj_min_wait": "7200", >>>>> "rgw_gc_processor_max_time": "3600", >>>>> "rgw_gc_processor_period": "3600", >>>>> "rgw_s3_success_create_obj_status": "0", >>>>> "rgw_resolve_cname": "false", >>>>> 
"rgw_obj_stripe_size": "4194304", >>>>> "rgw_extended_http_attrs": "", >>>>> "mutex_perf_counter": "false", >>>>> "internal_safe_to_start_threads": "true"} >>>>> >>>>> >>>>> -----Original Message----- >>>>> From: Mark Nelson [mailto:mark.nelson@inktank.com] >>>>> Sent: Thursday, 20 December 2012 11:28 PM >>>>> To: Matthew Anderson >>>>> Cc: ceph-devel@vger.kernel.org >>>>> Subject: Re: OSD's slow down to a crawl >>>>> >>>>> Hi Matt, >>>>> >>>>> Yeah, your log is looking pretty sparse! I imagine Sam will have a lot of questions for you. Out of curiosity have you looked at any io/cpu/memory stats when things slow down? sysprof may come in handy if it looks like it's CPU related. Also, check out the OSD admin socket. >>>>> Sebastien Han made a nice little blog entry describing it: >>>>> >>>>> http://www.sebastien-han.fr/blog/2012/08/14/ceph-admin-socket/ >>>>> >>>>> You can get a lot of information about the current state of things. >>>>> >>>>> Mark >>>>> >>>> >>>> -- >>>> To unsubscribe from this list: send the line "unsubscribe ceph-devel" >>>> in the body of a message to majordomo@vger.kernel.org More >>>> majordomo info at http://vger.kernel.org/majordomo-info.html >>>> >>>> >>> >> -- Mark Nelson Performance Engineer Inktank ^ permalink raw reply [flat|nested] 13+ messages in thread
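For anyone reading the dump_historic_ops output quoted above: each op carries a list of timestamped events, so the time spent between stages can be computed directly. The following is a minimal sketch, assuming the ops have already been parsed from JSON into Python dictionaries shaped like the entries in the dump. The names stage_gaps, slowest_stage and sample_op are illustrative and not part of Ceph; sample_op is a trimmed copy of the first op in the dump.

```python
from datetime import datetime

# Timestamp layout used by the "time" fields in the quoted dump.
TIME_FORMAT = "%Y-%m-%d %H:%M:%S.%f"


def stage_gaps(op):
    """Return [(event_name, seconds_since_previous_event), ...] for one op."""
    events = op["events"]
    times = [datetime.strptime(e["time"], TIME_FORMAT) for e in events]
    gaps = []
    for prev, cur, ev in zip(times, times[1:], events[1:]):
        gaps.append((ev["event"], (cur - prev).total_seconds()))
    return gaps


def slowest_stage(op):
    """Return the (event_name, seconds) pair with the largest gap."""
    return max(stage_gaps(op), key=lambda gap: gap[1])


# Trimmed from the first osd_op entry in the dump (hypothetical variable name).
sample_op = {
    "description": "osd_op(client.7636.0:101498 ...)",
    "events": [
        {"time": "2012-12-20 23:34:42.012958", "event": "commit_queued_for_journal_write"},
        {"time": "2012-12-20 23:34:42.013033", "event": "write_thread_in_journal_buffer"},
        {"time": "2012-12-20 23:34:42.013745", "event": "journaled_completion_queued"},
        {"time": "2012-12-20 23:34:42.013803", "event": "op_commit"},
        {"time": "2012-12-20 23:34:42.017414", "event": "sub_op_commit_rec"},
        {"time": "2012-12-20 23:34:42.022066", "event": "op_applied"},
        {"time": "2012-12-20 23:34:42.022119", "event": "done"},
    ],
}

if __name__ == "__main__":
    for name, seconds in stage_gaps(sample_op):
        print(f"{name:35s} {seconds:.6f}s")
    print("slowest:", slowest_stage(sample_op))
```

In this sample the largest gap is a few milliseconds, which matches the "duration" fields in the dump (all under 0.01 s for the osd_op entries). Read together with the much larger "age" values, this suggests the ops that do reach the OSD complete quickly, though that is only an interpretation of the quoted output.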
Thread overview: 13+ messages
2012-12-20 15:16 OSD's slow down to a crawl Matthew Anderson
2012-12-20 15:28 ` Mark Nelson
2012-12-20 15:51 ` Matthew Anderson
2012-12-20 15:59 ` Mark Nelson
2012-12-20 16:08 ` Matthew Anderson
2012-12-20 16:22 ` Mark Nelson
2012-12-20 16:34 ` Matthew Anderson
2012-12-20 16:30 ` Sage Weil
[not found] ` <38A500831D3DE24B90BD200D6C8701351BB15433@Exchange2010-2.corit.local>
2012-12-20 17:13 ` Sage Weil
2012-12-21 4:51 ` Matthew Anderson
2012-12-21 16:02 ` Sage Weil
2013-01-09 8:52 ` Matthew Anderson
2013-01-09 14:09 ` Mark Nelson