From mboxrd@z Thu Jan  1 00:00:00 1970
From: Smart Weblications GmbH - Florian Wiessner
	<f.wiessner@smart-weblications.de>
Subject: Re: ceph-fs tests
Date: Wed, 05 Sep 2012 18:42:39 +0200
Message-ID: <504780FF.6020305@smart-weblications.de>
References: <50468E0E.6070807@smart-weblications.de> <CADvuQRGy9gCSwy3jMw+DGAF0MfoeohhYNgR2PJ+vDU1WW9cM+A@mail.gmail.com>
Reply-To: f.wiessner@smart-weblications.de
Mime-Version: 1.0
Content-Type: text/plain; charset=UTF-8
Content-Transfer-Encoding: QUOTED-PRINTABLE
Return-path: <ceph-devel-owner@vger.kernel.org>
Received: from mx03.smart-weblications.de ([188.65.144.38]:46357 "EHLO
	mx03.smart-weblications.de" rhost-flags-OK-OK-OK-OK)
	by vger.kernel.org with ESMTP id S1758828Ab2IEQn3 (ORCPT
	<rfc822;ceph-devel@vger.kernel.org>); Wed, 5 Sep 2012 12:43:29 -0400
In-Reply-To: <CADvuQRGy9gCSwy3jMw+DGAF0MfoeohhYNgR2PJ+vDU1WW9cM+A@mail.gmail.com>
Sender: ceph-devel-owner@vger.kernel.org
List-ID: <ceph-devel.vger.kernel.org>
To: Tommi Virtanen <tv@inktank.com>, ceph-devel <ceph-devel@vger.kernel.org>

Am 05.09.2012 18:22, schrieb Tommi Virtanen:
> On Tue, Sep 4, 2012 at 4:26 PM, Smart Weblications GmbH - Florian
> Wiessner <f.wiessner@smart-weblications.de> wrote:
>> i set up a 3 node ceph cluster 0.48.1argonaut to test ceph-fs.
>>
>> i mount ceph via fuse, then i downloaded kernel tree and decompresse=
d a few
>> times, then stopping one osd (osd.1), afer a while of recovering, su=
ddenly:

>=20
> Please provide English error messages when you share things with the
> list. In this case I can figure out what the message is, but really,
> we're all pattern matching animals and the specific strings in
> /usr/include/asm-generic/errno.h are what we know.
>=20

OK, will change locales.

>> no space left on device, but:
>>
>> 2012-09-04 18:46:38.242840 mon.0 [INF] pgmap v2883: 576 pgs: 512 act=
ive+clean,
>> 64 active+recovering; 1250 MB data, 14391 MB used, 844 MB / 15236 MB=
 avail;
>> 36677/215076 degraded (17.053%)
>>
>> there is space left?
>=20
> Only 844 MB available, with the pseudo-random placement policies,
> means you practically are out of space.
>=20
> It looks like you had only 15GB to begin with, and with typical
> replication, that's <5GB usable space. That is dangerously small for
> any real use; Ceph currently does not cope very well with running out
> of space.
>=20

It is a test-cluster running on my thinkpad, its main purpose is to tes=
t cephfs,
there is no need for real space. I added osd.1 again, then after recove=
ry the
problem went away. I forced this situation to check how cephfs will beh=
ave when
cluster is near-full, osd fails and ceph tries to recover until backfil=
l fills
up other osds so ceph is full.

I observed on the client that no IO was possible anymore so that the cl=
ient was
unusable.

Is there a smarter way to handle this? It is bad that cephfs then stall=
s, it
would be better if it just returns that there is no space left, but sti=
ll allow
read access... can this be tuned somewhere?


--=20

Mit freundlichen Gr=C3=BC=C3=9Fen,

=46lorian Wiessner

Smart Weblications GmbH
Martinsberger Str. 1
D-95119 Naila

fon.: +49 9282 9638 200
fax.: +49 9282 9638 205
24/7: +49 900 144 000 00 - 0,99 EUR/Min*
http://www.smart-weblications.de

--
Sitz der Gesellschaft: Naila
Gesch=C3=A4ftsf=C3=BChrer: Florian Wiessner
HRB-Nr.: HRB 3840 Amtsgericht Hof
*aus dem dt. Festnetz, ggf. abweichende Preise aus dem Mobilfunknetz
--
To unsubscribe from this list: send the line "unsubscribe ceph-devel" i=
n
the body of a message to majordomo@vger.kernel.org
More majordomo info at  http://vger.kernel.org/majordomo-info.html