All of lore.kernel.org
 help / color / mirror / Atom feed
* v0.48.1 argonaut stable update released
@ 2012-08-14 18:18 Sage Weil
  2012-08-14 19:09 ` Oliver Francke
  0 siblings, 1 reply; 4+ messages in thread
From: Sage Weil @ 2012-08-14 18:18 UTC (permalink / raw)
  To: ceph-devel

We've built and pushed the first update to the argonaut stable release.  
This branch has a range of small fixes for stability, compatibility, and 
performance, but no major changes in functionality.  The stability fixes 
are particularly important for large clusters with many OSDs, and for 
network environments where intermittent network failures are more common.

The highlights include:

 * mkcephfs: use default `keyring', `osd data', `osd journal' paths when 
   not specified in conf
 * msgr: various fixes to socket error handling
 * osd: reduce scrub overhead
 * osd: misc peering fixes (past_interval sharing, pgs stuck in `peering' 
   states)
 * osd: fail on EIO in read path (do not silently ignore read errors from 
   failing disks)
 * osd: avoid internal heartbeat errors by breaking some large 
   transactions into pieces
 * osd: fix osdmap catch-up during startup (catch up and then add daemon 
   to osdmap)
 * osd: fix spurious `misdirected op' messages
 * osd: report scrub status via `pg # query'
 * rbd: fix race when watch registrations are resent
 * rbd: fix rbd image id assignment scheme (new image data objects have 
   slightly different names)
 * rbd: fix perf stats for cache hit rate
 * rbd tool: fix off-by-one in key name (crash when empty key specified)
 * rbd: more robust udev rules
 * rados tool: copy object, pool commands
 * radosgw: fix in usage stats trimming
 * radosgw: misc compatibility fixes (date strings, ETag quoting, swift 
   headers, etc.)
 * ceph-fuse: fix locking in read/write paths
 * mon: fix rare race corrupting on-disk data
 * config: fix admin socket `config set' command
 * log: fix in-memory log event gathering
 * debian: remove crush headers, include librados-config
 * rpm: add ceph-disk-{activate, prepare}

The fix for the radosgw usage trimming is incompatible with v0.48 (which 
was effectively broken).  You now need to use the v0.48.1 version of 
radosgw-admin to initiate usage stats trimming.

There are a range of smaller bug fixes as well.  For a complete list of 
what went into this release, please see the release notes and changelog.

You can get this stable update from the usual locations:

 * Git at git://github.com/ceph/ceph.git
 * Tarball at http://ceph.newdream.net/download/ceph-0.48.1.tar.gz
 * For Debian/Ubuntu packages, see http://ceph.newdream.net/docs/master/install/debian


^ permalink raw reply	[flat|nested] 4+ messages in thread

* Re: v0.48.1 argonaut stable update released
  2012-08-14 18:18 v0.48.1 argonaut stable update released Sage Weil
@ 2012-08-14 19:09 ` Oliver Francke
  2012-08-14 19:29   ` Sage Weil
  0 siblings, 1 reply; 4+ messages in thread
From: Oliver Francke @ 2012-08-14 19:09 UTC (permalink / raw)
  To: Sage Weil; +Cc: ceph-devel

Hi Sage,

I just updated to debian-testing/0.50 this afternoon, after some hint:

* osd: better tracking of recent slow operations

and it is hereby confirmed to be better in my testing environment. Before I had requests, which could be there for >480 seconds… not any more.

How's about this fix in 0.48.X?

Thnx in @vance,

Oliver - Thus being too lazy to read all change logs - Francke.

Am 14.08.2012 um 20:18 schrieb Sage Weil <sage@inktank.com>:

> We've built and pushed the first update to the argonaut stable release.  
> This branch has a range of small fixes for stability, compatibility, and 
> performance, but no major changes in functionality.  The stability fixes 
> are particularly important for large clusters with many OSDs, and for 
> network environments where intermittent network failures are more common.
> 
> The highlights include:
> 
> * mkcephfs: use default `keyring', `osd data', `osd journal' paths when 
>   not specified in conf
> * msgr: various fixes to socket error handling
> * osd: reduce scrub overhead
> * osd: misc peering fixes (past_interval sharing, pgs stuck in `peering' 
>   states)
> * osd: fail on EIO in read path (do not silently ignore read errors from 
>   failing disks)
> * osd: avoid internal heartbeat errors by breaking some large 
>   transactions into pieces
> * osd: fix osdmap catch-up during startup (catch up and then add daemon 
>   to osdmap)
> * osd: fix spurious `misdirected op' messages
> * osd: report scrub status via `pg # query'
> * rbd: fix race when watch registrations are resent
> * rbd: fix rbd image id assignment scheme (new image data objects have 
>   slightly different names)
> * rbd: fix perf stats for cache hit rate
> * rbd tool: fix off-by-one in key name (crash when empty key specified)
> * rbd: more robust udev rules
> * rados tool: copy object, pool commands
> * radosgw: fix in usage stats trimming
> * radosgw: misc compatibility fixes (date strings, ETag quoting, swift 
>   headers, etc.)
> * ceph-fuse: fix locking in read/write paths
> * mon: fix rare race corrupting on-disk data
> * config: fix admin socket `config set' command
> * log: fix in-memory log event gathering
> * debian: remove crush headers, include librados-config
> * rpm: add ceph-disk-{activate, prepare}
> 
> The fix for the radosgw usage trimming is incompatible with v0.48 (which 
> was effectively broken).  You now need to use the v0.48.1 version of 
> radosgw-admin to initiate usage stats trimming.
> 
> There are a range of smaller bug fixes as well.  For a complete list of 
> what went into this release, please see the release notes and changelog.
> 
> You can get this stable update from the usual locations:
> 
> * Git at git://github.com/ceph/ceph.git
> * Tarball at http://ceph.newdream.net/download/ceph-0.48.1.tar.gz
> * For Debian/Ubuntu packages, see http://ceph.newdream.net/docs/master/install/debian
> 
> --
> To unsubscribe from this list: send the line "unsubscribe ceph-devel" in
> the body of a message to majordomo@vger.kernel.org
> More majordomo info at  http://vger.kernel.org/majordomo-info.html

--
To unsubscribe from this list: send the line "unsubscribe ceph-devel" in
the body of a message to majordomo@vger.kernel.org
More majordomo info at  http://vger.kernel.org/majordomo-info.html

^ permalink raw reply	[flat|nested] 4+ messages in thread

* Re: v0.48.1 argonaut stable update released
  2012-08-14 19:09 ` Oliver Francke
@ 2012-08-14 19:29   ` Sage Weil
  2012-08-15 14:06     ` Oliver Francke
  0 siblings, 1 reply; 4+ messages in thread
From: Sage Weil @ 2012-08-14 19:29 UTC (permalink / raw)
  To: Oliver Francke; +Cc: ceph-devel

On Tue, 14 Aug 2012, Oliver Francke wrote:
> Hi Sage,
> 
> I just updated to debian-testing/0.50 this afternoon, after some hint:
> 
> * osd: better tracking of recent slow operations

This is actually about the admin socket command to dump operations in 
flight (more useful information is reported for diagnosis/debugging).

> and it is hereby confirmed to be better in my testing environment. 
> Before I had requests, which could be there for >480 seconds? not any 
> more.

That great news!  That is probably Sam's refactor of the OSD threading at 
work.  There were also a few bugs fixed in 0.48.1 that were causing 
somewhat similar symptoms (ops blocked indefinitely) due to peering 
problems, but that doesn't sound like it's the same thing.

> How's about this fix in 0.48.X?

It's a huge set of changes, and definitely won't go into the 0.48 series, 
sorry!  (In fact, the pending change was one motivation for doing 0.48 
when we did.)  It will be in bobtail, though, which is probably about a 
month away from freeze.

Please let us know what your experience is like with 0.50 (and beyond).

Thanks!
sage


> 
> Thnx in @vance,
> 
> Oliver - Thus being too lazy to read all change logs - Francke.
> 
> Am 14.08.2012 um 20:18 schrieb Sage Weil <sage@inktank.com>:
> 
> > We've built and pushed the first update to the argonaut stable release.  
> > This branch has a range of small fixes for stability, compatibility, and 
> > performance, but no major changes in functionality.  The stability fixes 
> > are particularly important for large clusters with many OSDs, and for 
> > network environments where intermittent network failures are more common.
> > 
> > The highlights include:
> > 
> > * mkcephfs: use default `keyring', `osd data', `osd journal' paths when 
> >   not specified in conf
> > * msgr: various fixes to socket error handling
> > * osd: reduce scrub overhead
> > * osd: misc peering fixes (past_interval sharing, pgs stuck in `peering' 
> >   states)
> > * osd: fail on EIO in read path (do not silently ignore read errors from 
> >   failing disks)
> > * osd: avoid internal heartbeat errors by breaking some large 
> >   transactions into pieces
> > * osd: fix osdmap catch-up during startup (catch up and then add daemon 
> >   to osdmap)
> > * osd: fix spurious `misdirected op' messages
> > * osd: report scrub status via `pg # query'
> > * rbd: fix race when watch registrations are resent
> > * rbd: fix rbd image id assignment scheme (new image data objects have 
> >   slightly different names)
> > * rbd: fix perf stats for cache hit rate
> > * rbd tool: fix off-by-one in key name (crash when empty key specified)
> > * rbd: more robust udev rules
> > * rados tool: copy object, pool commands
> > * radosgw: fix in usage stats trimming
> > * radosgw: misc compatibility fixes (date strings, ETag quoting, swift 
> >   headers, etc.)
> > * ceph-fuse: fix locking in read/write paths
> > * mon: fix rare race corrupting on-disk data
> > * config: fix admin socket `config set' command
> > * log: fix in-memory log event gathering
> > * debian: remove crush headers, include librados-config
> > * rpm: add ceph-disk-{activate, prepare}
> > 
> > The fix for the radosgw usage trimming is incompatible with v0.48 (which 
> > was effectively broken).  You now need to use the v0.48.1 version of 
> > radosgw-admin to initiate usage stats trimming.
> > 
> > There are a range of smaller bug fixes as well.  For a complete list of 
> > what went into this release, please see the release notes and changelog.
> > 
> > You can get this stable update from the usual locations:
> > 
> > * Git at git://github.com/ceph/ceph.git
> > * Tarball at http://ceph.newdream.net/download/ceph-0.48.1.tar.gz
> > * For Debian/Ubuntu packages, see http://ceph.newdream.net/docs/master/install/debian
> > 
> > --
> > To unsubscribe from this list: send the line "unsubscribe ceph-devel" in
> > the body of a message to majordomo@vger.kernel.org
> > More majordomo info at  http://vger.kernel.org/majordomo-info.html
> 
> --
> To unsubscribe from this list: send the line "unsubscribe ceph-devel" in
> the body of a message to majordomo@vger.kernel.org
> More majordomo info at  http://vger.kernel.org/majordomo-info.html
> 
> 

^ permalink raw reply	[flat|nested] 4+ messages in thread

* Re: v0.48.1 argonaut stable update released
  2012-08-14 19:29   ` Sage Weil
@ 2012-08-15 14:06     ` Oliver Francke
  0 siblings, 0 replies; 4+ messages in thread
From: Oliver Francke @ 2012-08-15 14:06 UTC (permalink / raw)
  To: Sage Weil; +Cc: ceph-devel

Well,

On 08/14/2012 09:29 PM, Sage Weil wrote:
> On Tue, 14 Aug 2012, Oliver Francke wrote:
>> Hi Sage,
>>
>> I just updated to debian-testing/0.50 this afternoon, after some hint:
>>
>> * osd: better tracking of recent slow operations
> This is actually about the admin socket command to dump operations in
> flight (more useful information is reported for diagnosis/debugging).
>
>> and it is hereby confirmed to be better in my testing environment.
>> Before I had requests, which could be there for >480 seconds? not any
>> more.
> That great news!  That is probably Sam's refactor of the OSD threading at
> work.  There were also a few bugs fixed in 0.48.1 that were causing
> somewhat similar symptoms (ops blocked indefinitely) due to peering
> problems, but that doesn't sound like it's the same thing.
>
>> How's about this fix in 0.48.X?
> It's a huge set of changes, and definitely won't go into the 0.48 series,
> sorry!  (In fact, the pending change was one motivation for doing 0.48
> when we did.)  It will be in bobtail, though, which is probably about a
> month away from freeze.
>
> Please let us know what your experience is like with 0.50 (and beyond).

the more detailed picture is: it works and is stable, so far no problems 
with my torture-tests.
Sporadically I see a line ala:

--- 8-< ---
delete error: image still has watchers
This means the image is still open or the client using it crashed. Try 
again after closing/unmapping it or waiting 30s for the crashed client 
to timeout.
2012-08-15 15:57:22.072729 7f9fe82a2760 -1 librbd: error removing 
header: (16) Device or resource busy
--- 8-< ---

even from long ago stopped VM's.

Regards,

Oliver.

>
> Thanks!
> sage
>
>
>> Thnx in @vance,
>>
>> Oliver - Thus being too lazy to read all change logs - Francke.
>>
>> Am 14.08.2012 um 20:18 schrieb Sage Weil <sage@inktank.com>:
>>
>>> We've built and pushed the first update to the argonaut stable release.
>>> This branch has a range of small fixes for stability, compatibility, and
>>> performance, but no major changes in functionality.  The stability fixes
>>> are particularly important for large clusters with many OSDs, and for
>>> network environments where intermittent network failures are more common.
>>>
>>> The highlights include:
>>>
>>> * mkcephfs: use default `keyring', `osd data', `osd journal' paths when
>>>    not specified in conf
>>> * msgr: various fixes to socket error handling
>>> * osd: reduce scrub overhead
>>> * osd: misc peering fixes (past_interval sharing, pgs stuck in `peering'
>>>    states)
>>> * osd: fail on EIO in read path (do not silently ignore read errors from
>>>    failing disks)
>>> * osd: avoid internal heartbeat errors by breaking some large
>>>    transactions into pieces
>>> * osd: fix osdmap catch-up during startup (catch up and then add daemon
>>>    to osdmap)
>>> * osd: fix spurious `misdirected op' messages
>>> * osd: report scrub status via `pg # query'
>>> * rbd: fix race when watch registrations are resent
>>> * rbd: fix rbd image id assignment scheme (new image data objects have
>>>    slightly different names)
>>> * rbd: fix perf stats for cache hit rate
>>> * rbd tool: fix off-by-one in key name (crash when empty key specified)
>>> * rbd: more robust udev rules
>>> * rados tool: copy object, pool commands
>>> * radosgw: fix in usage stats trimming
>>> * radosgw: misc compatibility fixes (date strings, ETag quoting, swift
>>>    headers, etc.)
>>> * ceph-fuse: fix locking in read/write paths
>>> * mon: fix rare race corrupting on-disk data
>>> * config: fix admin socket `config set' command
>>> * log: fix in-memory log event gathering
>>> * debian: remove crush headers, include librados-config
>>> * rpm: add ceph-disk-{activate, prepare}
>>>
>>> The fix for the radosgw usage trimming is incompatible with v0.48 (which
>>> was effectively broken).  You now need to use the v0.48.1 version of
>>> radosgw-admin to initiate usage stats trimming.
>>>
>>> There are a range of smaller bug fixes as well.  For a complete list of
>>> what went into this release, please see the release notes and changelog.
>>>
>>> You can get this stable update from the usual locations:
>>>
>>> * Git at git://github.com/ceph/ceph.git
>>> * Tarball at http://ceph.newdream.net/download/ceph-0.48.1.tar.gz
>>> * For Debian/Ubuntu packages, see http://ceph.newdream.net/docs/master/install/debian
>>>
>>> --
>>> To unsubscribe from this list: send the line "unsubscribe ceph-devel" in
>>> the body of a message to majordomo@vger.kernel.org
>>> More majordomo info at  http://vger.kernel.org/majordomo-info.html
>> --
>> To unsubscribe from this list: send the line "unsubscribe ceph-devel" in
>> the body of a message to majordomo@vger.kernel.org
>> More majordomo info at  http://vger.kernel.org/majordomo-info.html
>>
>>


-- 

Oliver Francke

filoo GmbH
Moltkestraße 25a
33330 Gütersloh
HRB4355 AG Gütersloh

Geschäftsführer: S.Grewing | J.Rehpöhler | C.Kunz

Folgen Sie uns auf Twitter: http://twitter.com/filoogmbh

--
To unsubscribe from this list: send the line "unsubscribe ceph-devel" in
the body of a message to majordomo@vger.kernel.org
More majordomo info at  http://vger.kernel.org/majordomo-info.html

^ permalink raw reply	[flat|nested] 4+ messages in thread

end of thread, other threads:[~2012-08-15 14:06 UTC | newest]

Thread overview: 4+ messages (download: mbox.gz follow: Atom feed
-- links below jump to the message on this page --
2012-08-14 18:18 v0.48.1 argonaut stable update released Sage Weil
2012-08-14 19:09 ` Oliver Francke
2012-08-14 19:29   ` Sage Weil
2012-08-15 14:06     ` Oliver Francke

This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.