* v0.48.1 argonaut stable update released
From: Sage Weil @ 2012-08-14 18:18 UTC (permalink / raw)
To: ceph-devel
We've built and pushed the first update to the argonaut stable release.
This branch has a range of small fixes for stability, compatibility, and
performance, but no major changes in functionality. The stability fixes
are particularly important for large clusters with many OSDs, and for
network environments where intermittent network failures are more common.
The highlights include:
* mkcephfs: use default `keyring', `osd data', `osd journal' paths when
not specified in conf
* msgr: various fixes to socket error handling
* osd: reduce scrub overhead
* osd: misc peering fixes (past_interval sharing, pgs stuck in `peering'
states)
* osd: fail on EIO in read path (do not silently ignore read errors from
failing disks)
* osd: avoid internal heartbeat errors by breaking some large
transactions into pieces
* osd: fix osdmap catch-up during startup (catch up and then add daemon
to osdmap)
* osd: fix spurious `misdirected op' messages
* osd: report scrub status via `pg # query'
* rbd: fix race when watch registrations are resent
* rbd: fix rbd image id assignment scheme (new image data objects have
slightly different names)
* rbd: fix perf stats for cache hit rate
* rbd tool: fix off-by-one in key name (crash when empty key specified)
* rbd: more robust udev rules
* rados tool: copy object, pool commands
* radosgw: fix in usage stats trimming
* radosgw: misc compatibility fixes (date strings, ETag quoting, swift
headers, etc.)
* ceph-fuse: fix locking in read/write paths
* mon: fix rare race corrupting on-disk data
* config: fix admin socket `config set' command
* log: fix in-memory log event gathering
* debian: remove crush headers, include librados-config
* rpm: add ceph-disk-{activate, prepare}
The fix for the radosgw usage trimming is incompatible with v0.48 (which
was effectively broken). You now need to use the v0.48.1 version of
radosgw-admin to initiate usage stats trimming.
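A minimal sketch of guarding a trim job against an old tool version. It assumes `radosgw-admin --version` prints a line of the form `ceph version 0.48.1argonaut (commit:...)` and that the `usage trim` subcommand takes `--start-date`, `--end-date` and an optional `--uid`; check both against your installed tool. The helper names are hypothetical.

```python
import re

# Assumption: the version output looks like
# "ceph version 0.48.1argonaut (commit:...)"; adjust the pattern if yours differs.
_VERSION_RE = re.compile(r"ceph version (\d+)\.(\d+)(?:\.(\d+))?")

# First release carrying the corrected usage trimming, per the announcement.
FIXED_TRIM_VERSION = (0, 48, 1)


def parse_ceph_version(text):
    """Return (major, minor, patch) parsed from version output."""
    match = _VERSION_RE.search(text)
    if match is None:
        raise ValueError("unrecognized version output: %r" % text)
    major, minor, patch = match.groups()
    return (int(major), int(minor), int(patch or 0))


def usage_trim_argv(version_output, start_date, end_date, uid=None):
    """Build the trim command line, refusing tools older than v0.48.1."""
    if parse_ceph_version(version_output) < FIXED_TRIM_VERSION:
        raise RuntimeError("radosgw-admin is older than v0.48.1; upgrade first")
    argv = ["radosgw-admin", "usage", "trim",
            "--start-date=" + start_date, "--end-date=" + end_date]
    if uid is not None:
        argv.append("--uid=" + uid)
    return argv
```

The function only builds the argument list; pass the result to `subprocess.run` once the version string has been read from the installed tool.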
There is a range of smaller bug fixes as well. For a complete list of
what went into this release, please see the release notes and changelog.
You can get this stable update from the usual locations:
* Git at git://github.com/ceph/ceph.git
* Tarball at http://ceph.newdream.net/download/ceph-0.48.1.tar.gz
* For Debian/Ubuntu packages, see http://ceph.newdream.net/docs/master/install/debian
* Re: v0.48.1 argonaut stable update released
From: Oliver Francke @ 2012-08-14 19:09 UTC (permalink / raw)
To: Sage Weil; +Cc: ceph-devel
Hi Sage,
I just updated to debian-testing/0.50 this afternoon, after some hint:
* osd: better tracking of recent slow operations
and it is hereby confirmed to be better in my testing environment. Previously I had requests that could hang for more than 480 seconds; not any more.
How about getting this fix into 0.48.x?
Thanks in advance,
Oliver - Thus being too lazy to read all change logs - Francke.
--
To unsubscribe from this list: send the line "unsubscribe ceph-devel" in
the body of a message to majordomo@vger.kernel.org
More majordomo info at http://vger.kernel.org/majordomo-info.html
* Re: v0.48.1 argonaut stable update released
From: Sage Weil @ 2012-08-14 19:29 UTC (permalink / raw)
To: Oliver Francke; +Cc: ceph-devel
On Tue, 14 Aug 2012, Oliver Francke wrote:
> Hi Sage,
>
> I just updated to debian-testing/0.50 this afternoon, after some hint:
>
> * osd: better tracking of recent slow operations
This is actually about the admin socket command to dump operations in
flight (more useful information is reported for diagnosis/debugging).
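A rough sketch of how that dump could be scanned for long-running requests. It assumes the admin socket command is named `dump_ops_in_flight` (the name used by later Ceph releases) and that it returns JSON with an `ops` list whose entries carry `age` and `description` fields; the socket path and helper names are hypothetical.

```python
import json


def dump_ops_argv(socket_path):
    """Command line asking one OSD's admin socket for its in-flight ops."""
    # Assumption: the command name is dump_ops_in_flight on your release.
    return ["ceph", "--admin-daemon", socket_path, "dump_ops_in_flight"]


def slow_ops(dump_text, min_age_seconds):
    """Return descriptions of ops that are at least min_age_seconds old."""
    dump = json.loads(dump_text)
    slow = []
    for op in dump.get("ops", []):
        # 'age' may be a number or a numeric string; float() accepts both.
        if float(op.get("age", 0)) >= min_age_seconds:
            slow.append(op.get("description", "<no description>"))
    return slow
```

For example, `slow_ops(output, 480)` would list the requests that have been pending for 480 seconds or longer, given `output` captured from the command built by `dump_ops_argv`.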
> and it is hereby confirmed to be better in my testing environment.
> Before I had requests, which could be there for >480 seconds… not any
> more.
That's great news! That is probably Sam's refactor of the OSD threading at
work. There were also a few bugs fixed in 0.48.1 that were causing
somewhat similar symptoms (ops blocked indefinitely) due to peering
problems, but that doesn't sound like it's the same thing.
> How's about this fix in 0.48.X?
It's a huge set of changes, and definitely won't go into the 0.48 series,
sorry! (In fact, the pending change was one motivation for doing 0.48
when we did.) It will be in bobtail, though, which is probably about a
month away from freeze.
Please let us know what your experience is like with 0.50 (and beyond).
Thanks!
sage
* Re: v0.48.1 argonaut stable update released
From: Oliver Francke @ 2012-08-15 14:06 UTC (permalink / raw)
To: Sage Weil; +Cc: ceph-devel
Well,
On 08/14/2012 09:29 PM, Sage Weil wrote:
> On Tue, 14 Aug 2012, Oliver Francke wrote:
>> Hi Sage,
>>
>> I just updated to debian-testing/0.50 this afternoon, after some hint:
>>
>> * osd: better tracking of recent slow operations
> This is actually about the admin socket command to dump operations in
> flight (more useful information is reported for diagnosis/debugging).
>
>> and it is hereby confirmed to be better in my testing environment.
>> Before I had requests, which could be there for >480 seconds… not any
>> more.
> That's great news! That is probably Sam's refactor of the OSD threading at
> work. There were also a few bugs fixed in 0.48.1 that were causing
> somewhat similar symptoms (ops blocked indefinitely) due to peering
> problems, but that doesn't sound like it's the same thing.
>
>> How's about this fix in 0.48.X?
> It's a huge set of changes, and definitely won't go into the 0.48 series,
> sorry! (In fact, the pending change was one motivation for doing 0.48
> when we did.) It will be in bobtail, though, which is probably about a
> month away from freeze.
>
> Please let us know what your experience is like with 0.50 (and beyond).
The more detailed picture: it works and is stable, with no problems so far
in my torture tests.
Sporadically I see output like:
--- 8-< ---
delete error: image still has watchers
This means the image is still open or the client using it crashed. Try
again after closing/unmapping it or waiting 30s for the crashed client
to timeout.
2012-08-15 15:57:22.072729 7f9fe82a2760 -1 librbd: error removing
header: (16) Device or resource busy
--- 8-< ---
even for VMs that were stopped long ago.
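The retry advice in that message can be sketched as follows. This is only an illustration: it assumes the image is removed with `rbd rm <image>` and that the "image still has watchers" text appears in the tool's output; the function names and defaults are hypothetical. Retrying will not help if a stale watch never times out, which is what the report above suggests may be happening.

```python
import subprocess
import time

WATCHERS_MESSAGE = "image still has watchers"


def run_rbd_rm(image):
    """Run `rbd rm` and return (exit code, combined output)."""
    proc = subprocess.run(["rbd", "rm", image], capture_output=True, text=True)
    return proc.returncode, proc.stdout + proc.stderr


def remove_with_retry(image, attempts=3, wait_seconds=30,
                      run=run_rbd_rm, sleep=time.sleep):
    """Retry removal while the image reports watchers; True on success."""
    for attempt in range(1, attempts + 1):
        returncode, output = run(image)
        if returncode == 0:
            return True
        if WATCHERS_MESSAGE not in output:
            # Some other failure: retrying is unlikely to help.
            raise RuntimeError("rbd rm failed: " + output.strip())
        if attempt < attempts:
            sleep(wait_seconds)  # give a crashed client time to time out
    return False
```

The `run` and `sleep` parameters exist so the loop can be exercised without a cluster or real delays.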
Regards,
Oliver.
--
Oliver Francke
filoo GmbH
Moltkestraße 25a
33330 Gütersloh
HRB4355 AG Gütersloh
Managing directors: S.Grewing | J.Rehpöhler | C.Kunz
Follow us on Twitter: http://twitter.com/filoogmbh