* librbd bug?
@ 2013-03-07 10:16 Wolfgang Hennerbichler
2013-03-07 16:36 ` Sage Weil
2013-03-08 5:14 ` Dan Mick
0 siblings, 2 replies; 4+ messages in thread
From: Wolfgang Hennerbichler @ 2013-03-07 10:16 UTC (permalink / raw)
To: ceph-devel
Hi,
I have a libvirt VM that gets format 2 rbd children 'fed' by the superhost.
It crashed recently with this in the logs:
osdc/ObjectCacher.cc: In function 'void
ObjectCacher::bh_write_commit(int64_t, sobject_t, loff_t, uint64_t,
tid_t, int)' thread 7f0cab5fd700 time 2013-03-01 22:02:37.374410
osdc/ObjectCacher.cc: 834: FAILED assert(ob->last_commit_tid < tid)
ceph version 0.56.3 (6eb7e15a4783b122e9b0c85ea9ba064145958aa5)
1: (ObjectCacher::bh_write_commit(long, sobject_t, long, unsigned long,
unsigned long, int)+0xd68) [0x7f0d087cda28]
2: (ObjectCacher::C_WriteCommit::finish(int)+0x6b) [0x7f0d087d460b]
3: (Context::complete(int)+0xa) [0x7f0d0878c9fa]
4: (librbd::C_Request::finish(int)+0x85) [0x7f0d087bc325]
5: (Context::complete(int)+0xa) [0x7f0d0878c9fa]
6: (librbd::rados_req_cb(void*, void*)+0x47) [0x7f0d087a1387]
7: (librados::C_AioSafe::finish(int)+0x1d) [0x7f0d07b5834d]
8: (Finisher::finisher_thread_entry()+0x1c0) [0x7f0d07bc20d0]
9: (()+0x7e9a) [0x7f0d0546be9a]
10: (clone()+0x6d) [0x7f0d05198cbd]
NOTE: a copy of the executable, or `objdump -rdS <executable>` is
needed to interpret this.
terminate called after throwing an instance of 'ceph::FailedAssertion'
Any clue why that happened?
--
DI (FH) Wolfgang Hennerbichler
Software Development
Unit Advanced Computing Technologies
RISC Software GmbH
A company of the Johannes Kepler University Linz
IT-Center
Softwarepark 35
4232 Hagenberg
Austria
Phone: +43 7236 3343 245
Fax: +43 7236 3343 250
wolfgang.hennerbichler@risc-software.at
http://www.risc-software.at
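Editorial sketch: the NOTE in the log above says the frames need the executable or `objdump -rdS <executable>` to be interpreted. As a first step, the frames can be pulled out of the mail-wrapped log into symbol, offset, and address fields. The helper names below (`join_wrapped`, `parse_frames`) are hypothetical and are not part of Ceph; the frame format is assumed to be exactly the one shown in the trace.

```python
import re

# Matches one frame after wrapped lines have been joined, e.g.
#   "7: (librados::C_AioSafe::finish(int)+0x1d) [0x7f0d07b5834d]"
# The symbol can be just "()" when no name was resolved, as in frame 9.
FRAME_RE = re.compile(
    r"^(?P<index>\d+): \((?P<symbol>.*)\+0x(?P<offset>[0-9a-f]+)\) "
    r"\[0x(?P<address>[0-9a-f]+)\]$"
)


def join_wrapped(lines):
    """Re-join frames that the mail client wrapped across two lines."""
    joined = []
    for line in lines:
        text = line.strip()
        # A continuation follows a frame that has not yet reached its closing "]".
        if joined and re.match(r"^\d+: ", joined[-1]) and not joined[-1].endswith("]"):
            joined[-1] += " " + text
        else:
            joined.append(text)
    return joined


def parse_frames(lines):
    """Return a list of dicts, one per recognised frame; other lines are skipped."""
    frames = []
    for text in join_wrapped(lines):
        m = FRAME_RE.match(text)
        if m:
            frames.append({
                "index": int(m["index"]),
                "symbol": m["symbol"],
                "offset": int(m["offset"], 16),
                "address": int(m["address"], 16),
            })
    return frames


# A few frames copied from the log above, including the wrapped first frame.
TRACE = """\
1: (ObjectCacher::bh_write_commit(long, sobject_t, long, unsigned long,
unsigned long, int)+0xd68) [0x7f0d087cda28]
2: (ObjectCacher::C_WriteCommit::finish(int)+0x6b) [0x7f0d087d460b]
9: (()+0x7e9a) [0x7f0d0546be9a]
10: (clone()+0x6d) [0x7f0d05198cbd]
NOTE: a copy of the executable, or `objdump -rdS <executable>` is
needed to interpret this.
"""

frames = parse_frames(TRACE.splitlines())
for frame in frames:
    print(frame["index"], frame["symbol"], hex(frame["offset"]))
```

The symbol and offset pairs are what one would then look up in the disassembly. The bracketed values are runtime addresses, so they are only meaningful together with the load address of the library in the crashed process.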
* Re: librbd bug?
2013-03-07 10:16 librbd bug? Wolfgang Hennerbichler
@ 2013-03-07 16:36 ` Sage Weil
2013-03-08 5:14 ` Dan Mick
1 sibling, 0 replies; 4+ messages in thread
From: Sage Weil @ 2013-03-07 16:36 UTC (permalink / raw)
To: Wolfgang Hennerbichler; +Cc: ceph-devel
On Thu, 7 Mar 2013, Wolfgang Hennerbichler wrote:
> Hi,
>
> I've a libvirt-VM that gets format 2 rbd-childs 'fed' by the superhost.
> It crashed recently with this in the logs:
>
> osdc/ObjectCacher.cc: In function 'void
> ObjectCacher::bh_write_commit(int64_t, sobject_t, loff_t, uint64_t,
> tid_t, int)' thread 7f0cab5fd700 time 2013-03-01 22:02:37.374410
> osdc/ObjectCacher.cc: 834: FAILED assert(ob->last_commit_tid < tid)
> ceph version 0.56.3 (6eb7e15a4783b122e9b0c85ea9ba064145958aa5)
> 1: (ObjectCacher::bh_write_commit(long, sobject_t, long, unsigned long,
> unsigned long, int)+0xd68) [0x7f0d087cda28]
> 2: (ObjectCacher::C_WriteCommit::finish(int)+0x6b) [0x7f0d087d460b]
> 3: (Context::complete(int)+0xa) [0x7f0d0878c9fa]
> 4: (librbd::C_Request::finish(int)+0x85) [0x7f0d087bc325]
> 5: (Context::complete(int)+0xa) [0x7f0d0878c9fa]
> 6: (librbd::rados_req_cb(void*, void*)+0x47) [0x7f0d087a1387]
> 7: (librados::C_AioSafe::finish(int)+0x1d) [0x7f0d07b5834d]
> 8: (Finisher::finisher_thread_entry()+0x1c0) [0x7f0d07bc20d0]
> 9: (()+0x7e9a) [0x7f0d0546be9a]
> 10: (clone()+0x6d) [0x7f0d05198cbd]
> NOTE: a copy of the executable, or `objdump -rdS <executable>` is
> needed to interpret this.
> terminate called after throwing an instance of 'ceph::FailedAssertion'
>
> Any clue why that happened?
We fixed one bug that triggered this behavior, but I just saw another
occurrence yesterday. I'm working on reproducing it now. Once I have
some confidence it is fully resolved I will backport the fix(es) to the
bobtail branch.
Thanks!
sage
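Editorial sketch: read literally, the failed check `assert(ob->last_commit_tid < tid)` requires every write commit on an object to carry a tid strictly greater than the last one recorded for that object. The toy model below shows only that invariant. `CachedObject` and `replay` are hypothetical names, this is not the Ceph ObjectCacher code, and it does not claim to reproduce the bug discussed in this thread.

```python
class CachedObject:
    """Toy stand-in for a cached object that tracks its last committed tid."""

    def __init__(self):
        self.last_commit_tid = 0

    def write_commit(self, tid):
        # Same shape as the check in the log: assert(ob->last_commit_tid < tid)
        if not self.last_commit_tid < tid:
            raise AssertionError(
                f"FAILED assert(last_commit_tid < tid): "
                f"last_commit_tid={self.last_commit_tid}, tid={tid}"
            )
        self.last_commit_tid = tid


def replay(tids):
    """Apply commit acknowledgements in the given order; return the final tid."""
    ob = CachedObject()
    for tid in tids:
        ob.write_commit(tid)
    return ob.last_commit_tid


# Strictly increasing tids pass; a reordered or repeated tid trips the check.
for sequence in ([1, 2, 5], [1, 3, 2], [4, 4]):
    try:
        print(sequence, "ok, last tid", replay(sequence))
    except AssertionError as err:
        print(sequence, err)
```

In this model, either a commit acknowledged out of order or the same commit acknowledged twice is enough to fail the check, which is why a single assert can be reached through more than one underlying bug.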
* Re: librbd bug?
2013-03-07 10:16 librbd bug? Wolfgang Hennerbichler
2013-03-07 16:36 ` Sage Weil
@ 2013-03-08 5:14 ` Dan Mick
2013-03-08 5:20 ` Sage Weil
1 sibling, 1 reply; 4+ messages in thread
From: Dan Mick @ 2013-03-08 5:14 UTC (permalink / raw)
To: Wolfgang Hennerbichler; +Cc: ceph-devel
On 03/07/2013 02:16 AM, Wolfgang Hennerbichler wrote:
> Hi,
>
> I've a libvirt-VM that gets format 2 rbd-childs 'fed' by the superhost.
> It crashed recently with this in the logs:
>
> osdc/ObjectCacher.cc: In function 'void
> ObjectCacher::bh_write_commit(int64_t, sobject_t, loff_t, uint64_t,
> tid_t, int)' thread 7f0cab5fd700 time 2013-03-01 22:02:37.374410
> osdc/ObjectCacher.cc: 834: FAILED assert(ob->last_commit_tid < tid)
> ceph version 0.56.3 (6eb7e15a4783b122e9b0c85ea9ba064145958aa5)
> 1: (ObjectCacher::bh_write_commit(long, sobject_t, long, unsigned long,
> unsigned long, int)+0xd68) [0x7f0d087cda28]
> 2: (ObjectCacher::C_WriteCommit::finish(int)+0x6b) [0x7f0d087d460b]
> 3: (Context::complete(int)+0xa) [0x7f0d0878c9fa]
> 4: (librbd::C_Request::finish(int)+0x85) [0x7f0d087bc325]
> 5: (Context::complete(int)+0xa) [0x7f0d0878c9fa]
> 6: (librbd::rados_req_cb(void*, void*)+0x47) [0x7f0d087a1387]
> 7: (librados::C_AioSafe::finish(int)+0x1d) [0x7f0d07b5834d]
> 8: (Finisher::finisher_thread_entry()+0x1c0) [0x7f0d07bc20d0]
> 9: (()+0x7e9a) [0x7f0d0546be9a]
> 10: (clone()+0x6d) [0x7f0d05198cbd]
> NOTE: a copy of the executable, or `objdump -rdS <executable>` is
> needed to interpret this.
> terminate called after throwing an instance of 'ceph::FailedAssertion'
>
> Any clue why that happened?
>
This looks like
http://tracker.ceph.com/issues/4271
--
Dan Mick, Filesystem Engineering
Inktank Storage, Inc. http://inktank.com
Ceph docs: http://ceph.com/docs
* Re: librbd bug?
2013-03-08 5:14 ` Dan Mick
@ 2013-03-08 5:20 ` Sage Weil
0 siblings, 0 replies; 4+ messages in thread
From: Sage Weil @ 2013-03-08 5:20 UTC (permalink / raw)
To: Dan Mick; +Cc: Wolfgang Hennerbichler, ceph-devel
On Thu, 7 Mar 2013, Dan Mick wrote:
> On 03/07/2013 02:16 AM, Wolfgang Hennerbichler wrote:
> > Hi,
> >
> > I've a libvirt-VM that gets format 2 rbd-childs 'fed' by the superhost.
> > It crashed recently with this in the logs:
> >
> > osdc/ObjectCacher.cc: In function 'void
> > ObjectCacher::bh_write_commit(int64_t, sobject_t, loff_t, uint64_t,
> > tid_t, int)' thread 7f0cab5fd700 time 2013-03-01 22:02:37.374410
> > osdc/ObjectCacher.cc: 834: FAILED assert(ob->last_commit_tid < tid)
> > ceph version 0.56.3 (6eb7e15a4783b122e9b0c85ea9ba064145958aa5)
> > 1: (ObjectCacher::bh_write_commit(long, sobject_t, long, unsigned long,
> > unsigned long, int)+0xd68) [0x7f0d087cda28]
> > 2: (ObjectCacher::C_WriteCommit::finish(int)+0x6b) [0x7f0d087d460b]
> > 3: (Context::complete(int)+0xa) [0x7f0d0878c9fa]
> > 4: (librbd::C_Request::finish(int)+0x85) [0x7f0d087bc325]
> > 5: (Context::complete(int)+0xa) [0x7f0d0878c9fa]
> > 6: (librbd::rados_req_cb(void*, void*)+0x47) [0x7f0d087a1387]
> > 7: (librados::C_AioSafe::finish(int)+0x1d) [0x7f0d07b5834d]
> > 8: (Finisher::finisher_thread_entry()+0x1c0) [0x7f0d07bc20d0]
> > 9: (()+0x7e9a) [0x7f0d0546be9a]
> > 10: (clone()+0x6d) [0x7f0d05198cbd]
> > NOTE: a copy of the executable, or `objdump -rdS <executable>` is
> > needed to interpret this.
> > terminate called after throwing an instance of 'ceph::FailedAssertion'
> >
> > Any clue why that happened?
> >
>
> This looks like
>
> http://tracker.ceph.com/issues/4271
I am chasing http://tracker.ceph.com/issues/4369, which may indicate a
problem with the fix for #4271. Once this is sorted out, I'll cherry-pick
the fix to bobtail.
sage