All of lore.kernel.org
 help / color / mirror / Atom feed
* rbd map command hangs for 15 minutes during system start up
@ 2012-11-08 22:10 Mandell Degerness
  2012-11-09  1:43 ` Josh Durgin
  0 siblings, 1 reply; 56+ messages in thread
From: Mandell Degerness @ 2012-11-08 22:10 UTC (permalink / raw)
  To: ceph-devel

[-- Attachment #1: Type: text/plain, Size: 2084 bytes --]

We are seeing a somewhat random, but frequent hang on our systems
during startup.  The hang happens at the point where an "rbd map
<rbdvol>" command is run.

I've attached the ceph logs from the cluster.  The map command happens
at Nov  8 18:41:09 on server 172.18.0.15.  The process which hung can
be seen in the log as 172.18.0.15:0/1143980479.

It appears as if the TCP socket is opened to the OSD, but then times
out 15 minutes later, the process gets data when the socket is closed
on the client server and it retries.

Please help.

We are using ceph version 0.48.2argonaut
(commit:3e02b2fad88c2a95d9c0c86878f10d1beb780bfe).

We are using a 3.5.7 kernel with the following list of patches applied:

1-libceph-encapsulate-out-message-data-setup.patch
2-libceph-dont-mark-footer-complete-before-it-is.patch
3-libceph-move-init-of-bio_iter.patch
4-libceph-dont-use-bio_iter-as-a-flag.patch
5-libceph-resubmit-linger-ops-when-pg-mapping-changes.patch
6-libceph-re-initialize-bio_iter-on-start-of-message-receive.patch
7-ceph-close-old-con-before-reopening-on-mds-reconnect.patch
8-libceph-protect-ceph_con_open-with-mutex.patch
9-libceph-reset-connection-retry-on-successfully-negotiation.patch
10-rbd-only-reset-capacity-when-pointing-to-head.patch
11-rbd-set-image-size-when-header-is-updated.patch
12-libceph-fix-crypto-key-null-deref-memory-leak.patch
13-ceph-tolerate-and-warn-on-extraneous-dentry-from-mds.patch
14-ceph-avoid-divide-by-zero-in-__validate_layout.patch
15-rbd-drop-dev-reference-on-error-in-rbd_open.patch
16-ceph-Fix-oops-when-handling-mdsmap-that-decreases-max_mds.patch
17-libceph-check-for-invalid-mapping.patch
18-ceph-propagate-layout-error-on-osd-request-creation.patch
19-rbd-BUG-on-invalid-layout.patch
20-ceph-return-EIO-on-invalid-layout-on-GET_DATALOC-ioctl.patch
21-ceph-avoid-32-bit-page-index-overflow.patch
23-ceph-fix-dentry-reference-leak-in-encode_fh.patch

Any suggestions?

One thought is that the following patch (which we could not apply) is
what is required:

22-rbd-reset-BACKOFF-if-unable-to-re-queue.patch

Regards,
Mandell Degerness

[-- Attachment #2: hanglog_ceph.log.gz --]
[-- Type: application/x-gzip, Size: 21632 bytes --]

^ permalink raw reply	[flat|nested] 56+ messages in thread

end of thread, other threads:[~2013-01-02 15:56 UTC | newest]

Thread overview: 56+ messages (download: mbox.gz follow: Atom feed
-- links below jump to the message on this page --
2012-11-08 22:10 rbd map command hangs for 15 minutes during system start up Mandell Degerness
2012-11-09  1:43 ` Josh Durgin
2012-11-12 22:19   ` Nick Bartos
2012-11-12 23:16     ` Sage Weil
2012-11-16  0:21       ` Nick Bartos
2012-11-16  0:25         ` Sage Weil
2012-11-16 18:36           ` Nick Bartos
2012-11-16 19:16             ` Sage Weil
2012-11-16 22:01               ` Nick Bartos
2012-11-16 22:13                 ` Sage Weil
2012-11-16 22:16                   ` Nick Bartos
2012-11-16 22:21                     ` Sage Weil
2012-11-19 23:04                       ` Nick Bartos
2012-11-19 23:34                         ` Gregory Farnum
2012-11-20 21:53                           ` Nick Bartos
2012-11-21  1:31                             ` Nick Bartos
2012-11-21 16:50                               ` Sage Weil
2012-11-21 17:02                                 ` Nick Bartos
2012-11-21 17:34                                   ` Nick Bartos
2012-11-21 21:41                                     ` Nick Bartos
2012-11-22  4:47                                       ` Sage Weil
2012-11-22  5:49                                         ` Nick Bartos
2012-11-22 18:04                                           ` Nick Bartos
2012-11-29 20:37                                             ` Alex Elder
2012-11-30 18:49                                               ` Nick Bartos
2012-11-30 19:10                                                 ` Alex Elder
2012-11-30 19:31                                                   ` Sage Weil
2012-11-30 23:22                                               ` Alex Elder
2012-12-02  5:34                                                 ` Nick Bartos
2012-12-03  4:43                                                   ` Alex Elder
2012-12-10 21:57                                                     ` Alex Elder
2012-12-11 17:26                                                       ` Nick Bartos
2012-12-11 18:01                                                         ` Alex Elder
2012-12-11 19:44                                                           ` Alex Elder
2012-12-13  0:57                                                             ` Nick Bartos
2012-12-13 19:00                                                               ` Nick Bartos
2012-12-13 19:07                                                                 ` Alex Elder
2012-12-14 16:46                                                                 ` Alex Elder
2012-12-14 16:53                                                                   ` Nick Bartos
2012-12-14 18:03                                                                     ` Alex Elder
2012-12-17 17:12                                                                       ` Nick Bartos
2012-12-18 16:09                                                                         ` Alex Elder
2012-12-18 18:05                                                                           ` Nick Bartos
2012-12-19 21:25                                                                             ` Alex Elder
2012-12-19 22:42                                                                               ` Alex Elder
2012-12-20 17:48                                                                                 ` Nick Bartos
2012-12-20 21:59                                                                                   ` Alex Elder
2012-12-26 17:45                                                                                     ` Nick Bartos
2012-12-26 17:50                                                                                       ` Alex Elder
2012-12-26 21:36                                                                                       ` Alex Elder
2012-12-27 17:33                                                                                         ` Nick Bartos
2012-12-27 18:43                                                                                           ` Sage Weil
2012-12-27 19:41                                                                                             ` Alex Elder
2012-12-31 18:22                                                                                         ` Alex Elder
2013-01-02 15:56                                                                                           ` Nick Bartos
2012-11-16 22:23                     ` Gregory Farnum

This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.