From: Kevin Decherf <kevin@kdecherf.com>
To: Gregory Farnum <greg@inktank.com>
Cc: Sam Lang <sam.lang@inktank.com>,
"ceph-devel@vger.kernel.org" <ceph-devel@vger.kernel.org>,
support@clever-cloud.com
Subject: Re: Crash and strange things on MDS
Date: Mon, 11 Feb 2013 23:24:49 +0100 [thread overview]
Message-ID: <20130211222449.GA553@kdecherf.com> (raw)
In-Reply-To: <CAPYLRzjZgTH5PaDTXSxJz8fW6arqUpOLdnKBHrcv4vohggZLVQ@mail.gmail.com>
On Mon, Feb 11, 2013 at 12:25:59PM -0800, Gregory Farnum wrote:
> On Mon, Feb 4, 2013 at 10:01 AM, Kevin Decherf <kevin@kdecherf.com> wrote:
> > References:
> > [1] http://www.spinics.net/lists/ceph-devel/msg04903.html
> > [2] ceph version 0.56.1 (e4a541624df62ef353e754391cbbb707f54b16f7)
> > 1: /usr/bin/ceph-mds() [0x817e82]
> > 2: (()+0xf140) [0x7f9091d30140]
> > 3: (MDCache::request_drop_foreign_locks(MDRequest*)+0x21) [0x5b9dc1]
> > 4: (MDCache::request_drop_locks(MDRequest*)+0x19) [0x5baae9]
> > 5: (MDCache::request_cleanup(MDRequest*)+0x60) [0x5bab70]
> > 6: (MDCache::request_kill(MDRequest*)+0x80) [0x5bae90]
> > 7: (Server::journal_close_session(Session*, int)+0x372) [0x549aa2]
> > 8: (Server::kill_session(Session*)+0x137) [0x549c67]
> > 9: (Server::find_idle_sessions()+0x12a6) [0x54b0d6]
> > 10: (MDS::tick()+0x338) [0x4da928]
> > 11: (SafeTimer::timer_thread()+0x1af) [0x78151f]
> > 12: (SafeTimerThread::entry()+0xd) [0x782bad]
> > 13: (()+0x7ddf) [0x7f9091d28ddf]
> > 14: (clone()+0x6d) [0x7f90909cc24d]
>
> This in particular is quite odd. Do you have any logging from when
> that happened? (Oftentimes the log can have a bunch of debugging
> information from shortly before the crash.)
Yes, there is a dump of 100,000 events for this backtrace in the linked
archive (I need 7 hours to upload it).
>
> On Mon, Feb 11, 2013 at 10:54 AM, Kevin Decherf <kevin@kdecherf.com> wrote:
> > Furthermore, I observe another strange thing more or less related to the
> > storms.
> >
> > During a rsync command to write ~20G of data on Ceph and during (and
> > after) the storm, one OSD sends a lot of data to the active MDS
> > (400Mbps peak each 6 seconds). After a quick check, I found that when I
> > stop osd.23, osd.14 stops its peaks.
>
> This is consistent with Sam's suggestion that MDS is thrashing its
> cache, and is grabbing a directory object off of the OSDs. How large
> are the directories you're using? If they're a significant fraction of
> your cache size, it might be worth enabling the (sadly less stable)
> directory fragmentation options, which will split them up into smaller
> fragments that can be independently read and written to disk.
The distribution is heterogeneous: we have a folder of ~17G for 300k
objects, another of ~2G for 150k objects and a lof of smaller directories.
Are you talking about the mds bal frag and mds bal split * settings?
Do you have any advice about the value to use?
--
Kevin Decherf - @Kdecherf
GPG C610 FE73 E706 F968 612B E4B2 108A BD75 A81E 6E2F
http://kdecherf.com
next prev parent reply other threads:[~2013-02-11 22:24 UTC|newest]
Thread overview: 27+ messages / expand[flat|nested] mbox.gz Atom feed top
2013-02-04 18:01 Crash and strange things on MDS Kevin Decherf
2013-02-11 13:05 ` Kevin Decherf
2013-02-11 17:00 ` Sam Lang
2013-02-11 18:54 ` Kevin Decherf
2013-02-11 20:25 ` Gregory Farnum
2013-02-11 22:24 ` Kevin Decherf [this message]
2013-02-11 22:47 ` Gregory Farnum
2013-02-11 23:33 ` Kevin Decherf
2013-02-13 11:47 ` Kevin Decherf
2013-02-13 18:19 ` Gregory Farnum
2013-02-16 1:02 ` Kevin Decherf
2013-02-16 17:36 ` Sam Lang
2013-02-16 18:24 ` Kevin Decherf
2013-02-19 18:15 ` Gregory Farnum
2013-02-20 1:00 ` Kevin Decherf
2013-02-20 1:09 ` Gregory Farnum
2013-02-26 17:57 ` Kevin Decherf
2013-02-26 18:10 ` Gregory Farnum
2013-02-26 19:58 ` Kevin Decherf
2013-02-26 20:26 ` Gregory Farnum
2013-02-26 21:57 ` Kevin Decherf
2013-02-26 21:58 ` Gregory Farnum
2013-02-27 0:03 ` Yan, Zheng
2013-02-27 0:14 ` Sage Weil
[not found] ` <20130227004923.GQ16091@kdecherf.com>
[not found] ` <CAPYLRzhbygkA9=DkVr474Nw8AOC2hAFG-1D6uS4WyfR=kUBXWQ@mail.gmail.com>
[not found] ` <20130308232943.GA2197@kdecherf.com>
[not found] ` <20130308232943.GA2197-fShu9kyPgSlWk0Htik3J/w@public.gmane.org>
2013-03-15 20:32 ` Greg Farnum
[not found] ` <ECAA10260D284057A52D78127F8634A8-4GqslpFJ+cxBDgjK7y7TUQ@public.gmane.org>
2013-03-15 22:40 ` Marc-Antoine Perennou
2013-03-15 22:53 ` Greg Farnum
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=20130211222449.GA553@kdecherf.com \
--to=kevin@kdecherf.com \
--cc=ceph-devel@vger.kernel.org \
--cc=greg@inktank.com \
--cc=sam.lang@inktank.com \
--cc=support@clever-cloud.com \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.