All of lore.kernel.org
 help / color / mirror / Atom feed
From: Ted Ts'o <tytso@mit.edu>
To: Sage Weil <sage@newdream.net>
Cc: Gregory Farnum <gregf@hq.newdream.net>,
	ceph-devel@vger.kernel.org, mrubin@google.com
Subject: Re: OOM's on the Ceph client machine
Date: Thu, 21 Oct 2010 18:28:22 -0400	[thread overview]
Message-ID: <20101021222822.GK3127@thunk.org> (raw)
In-Reply-To: <Pine.LNX.4.64.1010211422200.18946@cobra.newdream.net>

On Thu, Oct 21, 2010 at 02:46:11PM -0700, Sage Weil wrote:
> 
> Unfortunately it's not obvious to me from dmesg where the problem is, 
> other than that it looks like some of the osds aren't responding (but are 
> apparently still up).  There is a known regression in v0.22 that can cause 
> crashes in the osd cluster; we should have a fix pushed later today.  
> That would look a bit different, though (you'd see osd down messages).  
> I'll post an update (and probably v0.22.1) when that's been tested.

I looked earlier in the logs, and I do see some "osd down", "osd up",
and "osd socket closed" messages.  So it looks like the v0.22
regression you mentioned.  I'll wait for the git update and try
rebuilding the server.  Thanks!!

> > Also, It seems that there are issues moving back and forth between
> > 0.21 and 0.22 without reformating the ceph client.  Is that accurate?
> 
> Yeah, that isn't expected to work.  In general, rolling backward isn't 
> supported.  In this case we forgot to add an incompat flag to generate a 
> nice error message to that effect.

Is rolling forward between 0.21 and 0.22 expected to work?  Or should
I just do a mkcephfs just to be safe?  It's not a data preservation
issue, but rather the time it takes to do a mkcephfs.  Random
question: how do you feel about using Python?  Trying to make a
version of mkcephfs that runs in parallel would probably be easier if
we could port the shell script to a python script.  I don't think
there are any Python dependencies in Ceph right now, though.

      	      	     		     	  - Ted

  reply	other threads:[~2010-10-21 22:28 UTC|newest]

Thread overview: 13+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2010-10-13  0:31 OOM's on the Ceph client machine Theodore Ts'o
2010-10-13  2:30 ` Gregory Farnum
2010-10-13  3:34   ` Ted Ts'o
2010-10-13 17:29     ` Sage Weil
2010-10-14  0:03       ` Ted Ts'o
2010-10-14  3:43         ` Sage Weil
2010-10-21 20:36         ` Ted Ts'o
2010-10-21 21:46           ` Sage Weil
2010-10-21 22:28             ` Ted Ts'o [this message]
2010-10-21 22:44               ` Sage Weil
2010-10-13  3:43 ` DongJin Lee
2010-10-13 17:42 ` Sage Weil
2010-10-13 21:25   ` Sage Weil

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=20101021222822.GK3127@thunk.org \
    --to=tytso@mit.edu \
    --cc=ceph-devel@vger.kernel.org \
    --cc=gregf@hq.newdream.net \
    --cc=mrubin@google.com \
    --cc=sage@newdream.net \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.