All of lore.kernel.org
 help / color / mirror / Atom feed
From: Josh Durgin <josh.durgin@inktank.com>
To: Mandell Degerness <mandell@pistoncloud.com>
Cc: ceph-devel@vger.kernel.org
Subject: Re: rbd map command hangs for 15 minutes during system start up
Date: Thu, 08 Nov 2012 17:43:02 -0800	[thread overview]
Message-ID: <509C5FA6.7010600@inktank.com> (raw)
In-Reply-To: <CA+jddaMAx8OKd8-uHaj-eDx8PJwZB=ew7PkkSAy8Y3Uwm5S85g@mail.gmail.com>

On 11/08/2012 02:10 PM, Mandell Degerness wrote:
> We are seeing a somewhat random, but frequent hang on our systems
> during startup.  The hang happens at the point where an "rbd map
> <rbdvol>" command is run.
>
> I've attached the ceph logs from the cluster.  The map command happens
> at Nov  8 18:41:09 on server 172.18.0.15.  The process which hung can
> be seen in the log as 172.18.0.15:0/1143980479.
>
> It appears as if the TCP socket is opened to the OSD, but then times
> out 15 minutes later, the process gets data when the socket is closed
> on the client server and it retries.
>
> Please help.
>
> We are using ceph version 0.48.2argonaut
> (commit:3e02b2fad88c2a95d9c0c86878f10d1beb780bfe).
>
> We are using a 3.5.7 kernel with the following list of patches applied:
>
> 1-libceph-encapsulate-out-message-data-setup.patch
> 2-libceph-dont-mark-footer-complete-before-it-is.patch
> 3-libceph-move-init-of-bio_iter.patch
> 4-libceph-dont-use-bio_iter-as-a-flag.patch
> 5-libceph-resubmit-linger-ops-when-pg-mapping-changes.patch
> 6-libceph-re-initialize-bio_iter-on-start-of-message-receive.patch
> 7-ceph-close-old-con-before-reopening-on-mds-reconnect.patch
> 8-libceph-protect-ceph_con_open-with-mutex.patch
> 9-libceph-reset-connection-retry-on-successfully-negotiation.patch
> 10-rbd-only-reset-capacity-when-pointing-to-head.patch
> 11-rbd-set-image-size-when-header-is-updated.patch
> 12-libceph-fix-crypto-key-null-deref-memory-leak.patch
> 13-ceph-tolerate-and-warn-on-extraneous-dentry-from-mds.patch
> 14-ceph-avoid-divide-by-zero-in-__validate_layout.patch
> 15-rbd-drop-dev-reference-on-error-in-rbd_open.patch
> 16-ceph-Fix-oops-when-handling-mdsmap-that-decreases-max_mds.patch
> 17-libceph-check-for-invalid-mapping.patch
> 18-ceph-propagate-layout-error-on-osd-request-creation.patch
> 19-rbd-BUG-on-invalid-layout.patch
> 20-ceph-return-EIO-on-invalid-layout-on-GET_DATALOC-ioctl.patch
> 21-ceph-avoid-32-bit-page-index-overflow.patch
> 23-ceph-fix-dentry-reference-leak-in-encode_fh.patch
>
> Any suggestions?

The log shows your monitors don't have time sychronized enough among
them to make much progress (including authenticating new connections).
That's probably the real issue. 0.2s is pretty large clock drift.

> One thought is that the following patch (which we could not apply) is
> what is required:
>
> 22-rbd-reset-BACKOFF-if-unable-to-re-queue.patch

This is certainly useful too, but I don't think it's the cause of
the delay in this case.

Josh

  reply	other threads:[~2012-11-09  1:43 UTC|newest]

Thread overview: 56+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2012-11-08 22:10 rbd map command hangs for 15 minutes during system start up Mandell Degerness
2012-11-09  1:43 ` Josh Durgin [this message]
2012-11-12 22:19   ` Nick Bartos
2012-11-12 23:16     ` Sage Weil
2012-11-16  0:21       ` Nick Bartos
2012-11-16  0:25         ` Sage Weil
2012-11-16 18:36           ` Nick Bartos
2012-11-16 19:16             ` Sage Weil
2012-11-16 22:01               ` Nick Bartos
2012-11-16 22:13                 ` Sage Weil
2012-11-16 22:16                   ` Nick Bartos
2012-11-16 22:21                     ` Sage Weil
2012-11-19 23:04                       ` Nick Bartos
2012-11-19 23:34                         ` Gregory Farnum
2012-11-20 21:53                           ` Nick Bartos
2012-11-21  1:31                             ` Nick Bartos
2012-11-21 16:50                               ` Sage Weil
2012-11-21 17:02                                 ` Nick Bartos
2012-11-21 17:34                                   ` Nick Bartos
2012-11-21 21:41                                     ` Nick Bartos
2012-11-22  4:47                                       ` Sage Weil
2012-11-22  5:49                                         ` Nick Bartos
2012-11-22 18:04                                           ` Nick Bartos
2012-11-29 20:37                                             ` Alex Elder
2012-11-30 18:49                                               ` Nick Bartos
2012-11-30 19:10                                                 ` Alex Elder
2012-11-30 19:31                                                   ` Sage Weil
2012-11-30 23:22                                               ` Alex Elder
2012-12-02  5:34                                                 ` Nick Bartos
2012-12-03  4:43                                                   ` Alex Elder
2012-12-10 21:57                                                     ` Alex Elder
2012-12-11 17:26                                                       ` Nick Bartos
2012-12-11 18:01                                                         ` Alex Elder
2012-12-11 19:44                                                           ` Alex Elder
2012-12-13  0:57                                                             ` Nick Bartos
2012-12-13 19:00                                                               ` Nick Bartos
2012-12-13 19:07                                                                 ` Alex Elder
2012-12-14 16:46                                                                 ` Alex Elder
2012-12-14 16:53                                                                   ` Nick Bartos
2012-12-14 18:03                                                                     ` Alex Elder
2012-12-17 17:12                                                                       ` Nick Bartos
2012-12-18 16:09                                                                         ` Alex Elder
2012-12-18 18:05                                                                           ` Nick Bartos
2012-12-19 21:25                                                                             ` Alex Elder
2012-12-19 22:42                                                                               ` Alex Elder
2012-12-20 17:48                                                                                 ` Nick Bartos
2012-12-20 21:59                                                                                   ` Alex Elder
2012-12-26 17:45                                                                                     ` Nick Bartos
2012-12-26 17:50                                                                                       ` Alex Elder
2012-12-26 21:36                                                                                       ` Alex Elder
2012-12-27 17:33                                                                                         ` Nick Bartos
2012-12-27 18:43                                                                                           ` Sage Weil
2012-12-27 19:41                                                                                             ` Alex Elder
2012-12-31 18:22                                                                                         ` Alex Elder
2013-01-02 15:56                                                                                           ` Nick Bartos
2012-11-16 22:23                     ` Gregory Farnum

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=509C5FA6.7010600@inktank.com \
    --to=josh.durgin@inktank.com \
    --cc=ceph-devel@vger.kernel.org \
    --cc=mandell@pistoncloud.com \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.