All of lore.kernel.org
 help / color / mirror / Atom feed
* A couple more ceph-mon crashes
@ 2011-11-19 15:03 Josh Pieper
  2011-11-22  0:03 ` Sage Weil
  0 siblings, 1 reply; 3+ messages in thread
From: Josh Pieper @ 2011-11-19 15:03 UTC (permalink / raw)
  To: ceph-devel

I observed the following two crashes using the same test setup I have
had for the previous reports.  3x ubuntu 11.04 amd64 nodes with an rbd
load, this time each running cc5b5e17.

Links to the full logs are below each stack trace snip.

mon/OSDMonitor.cc: In function 'MOSDMap* OSDMonitor::build_incremental(epoch_t, epoch_t)', in thread '7fd11a363700'
mon/OSDMonitor.cc: 944: FAILED assert(0)
 ceph version 0.38-202-gcc5b5e1 (commit:cc5b5e17e64c011b68cc8560676182e187bbb503)
 1: (OSDMonitor::build_incremental(unsigned int, unsigned int)+0xc86) [0x497f76]
 2: (OSDMonitor::committed()+0x162) [0x49a1f2]
 3: (PaxosService::_commit()+0x19e) [0x491fee]
 4: (Context::complete(int)+0xa) [0x47c10a]
 5: (finish_contexts(CephContext*, std::list<Context*, std::allocator<Context*> >&, int)+0xca) [0x47da6a]
 6: (Paxos::handle_accept(MMonPaxos*)+0x5b4) [0x48dbc4]
 7: (Paxos::dispatch(PaxosServiceMessage*)+0x23b) [0x48f38b]
 8: (Monitor::_ms_dispatch(Message*)+0xcbf) [0x47b64f]
 9: (Monitor::ms_dispatch(Message*)+0x35) [0x486405]

http://joshp.no-ip.com:8080/20111119-mon-build_incremental.log


mon/PaxosService.cc: In function 'void PaxosService::propose_pending()', in thread '7f4dd6acd700'
mon/PaxosService.cc: 127: FAILED assert(have_pending)
 ceph version 0.38-202-gcc5b5e1 (commit:cc5b5e17e64c011b68cc8560676182e187bbb503)
 1: (PaxosService::propose_pending()+0x335) [0x4924b5]
 2: (PGMonitor::check_osd_map(unsigned int)+0xcb0) [0x4d42e0]
 3: (Context::complete(int)+0xa) [0x47c10a]
 4: (finish_contexts(CephContext*, std::list<Context*, std::allocator<Context*> >&, int)+0xca) [0x47da6a]
 5: (Paxos::handle_accept(MMonPaxos*)+0x5d8) [0x48dbe8]
 6: (Paxos::dispatch(PaxosServiceMessage*)+0x23b) [0x48f38b]
 7: (Monitor::_ms_dispatch(Message*)+0xcbf) [0x47b64f]
 8: (Monitor::ms_dispatch(Message*)+0x35) [0x486405]

http://joshp.no-ip.com:8080/20111119-mon-propose_pending.log

-Josh

^ permalink raw reply	[flat|nested] 3+ messages in thread

* Re: A couple more ceph-mon crashes
  2011-11-19 15:03 A couple more ceph-mon crashes Josh Pieper
@ 2011-11-22  0:03 ` Sage Weil
  2011-12-01 12:50   ` Josh Pieper
  0 siblings, 1 reply; 3+ messages in thread
From: Sage Weil @ 2011-11-22  0:03 UTC (permalink / raw)
  To: Josh Pieper; +Cc: ceph-devel

Hi Josh,

I just pushed a series of mon patches that rework some of this code.  I've 
been trashing on them for a while now without problems, though my test 
still needs to be better about the monitor workload it presents.

In any case, you should have better luck with the current master.

Thanks!
sage


On Sat, 19 Nov 2011, Josh Pieper wrote:

> I observed the following two crashes using the same test setup I have
> had for the previous reports.  3x ubuntu 11.04 amd64 nodes with an rbd
> load, this time each running cc5b5e17.
> 
> Links to the full logs are below each stack trace snip.
> 
> mon/OSDMonitor.cc: In function 'MOSDMap* OSDMonitor::build_incremental(epoch_t, epoch_t)', in thread '7fd11a363700'
> mon/OSDMonitor.cc: 944: FAILED assert(0)
>  ceph version 0.38-202-gcc5b5e1 (commit:cc5b5e17e64c011b68cc8560676182e187bbb503)
>  1: (OSDMonitor::build_incremental(unsigned int, unsigned int)+0xc86) [0x497f76]
>  2: (OSDMonitor::committed()+0x162) [0x49a1f2]
>  3: (PaxosService::_commit()+0x19e) [0x491fee]
>  4: (Context::complete(int)+0xa) [0x47c10a]
>  5: (finish_contexts(CephContext*, std::list<Context*, std::allocator<Context*> >&, int)+0xca) [0x47da6a]
>  6: (Paxos::handle_accept(MMonPaxos*)+0x5b4) [0x48dbc4]
>  7: (Paxos::dispatch(PaxosServiceMessage*)+0x23b) [0x48f38b]
>  8: (Monitor::_ms_dispatch(Message*)+0xcbf) [0x47b64f]
>  9: (Monitor::ms_dispatch(Message*)+0x35) [0x486405]
> 
> http://joshp.no-ip.com:8080/20111119-mon-build_incremental.log
> 
> 
> mon/PaxosService.cc: In function 'void PaxosService::propose_pending()', in thread '7f4dd6acd700'
> mon/PaxosService.cc: 127: FAILED assert(have_pending)
>  ceph version 0.38-202-gcc5b5e1 (commit:cc5b5e17e64c011b68cc8560676182e187bbb503)
>  1: (PaxosService::propose_pending()+0x335) [0x4924b5]
>  2: (PGMonitor::check_osd_map(unsigned int)+0xcb0) [0x4d42e0]
>  3: (Context::complete(int)+0xa) [0x47c10a]
>  4: (finish_contexts(CephContext*, std::list<Context*, std::allocator<Context*> >&, int)+0xca) [0x47da6a]
>  5: (Paxos::handle_accept(MMonPaxos*)+0x5d8) [0x48dbe8]
>  6: (Paxos::dispatch(PaxosServiceMessage*)+0x23b) [0x48f38b]
>  7: (Monitor::_ms_dispatch(Message*)+0xcbf) [0x47b64f]
>  8: (Monitor::ms_dispatch(Message*)+0x35) [0x486405]
> 
> http://joshp.no-ip.com:8080/20111119-mon-propose_pending.log
> 
> -Josh
> --
> To unsubscribe from this list: send the line "unsubscribe ceph-devel" in
> the body of a message to majordomo@vger.kernel.org
> More majordomo info at  http://vger.kernel.org/majordomo-info.html
> 
> 

^ permalink raw reply	[flat|nested] 3+ messages in thread

* Re: A couple more ceph-mon crashes
  2011-11-22  0:03 ` Sage Weil
@ 2011-12-01 12:50   ` Josh Pieper
  0 siblings, 0 replies; 3+ messages in thread
From: Josh Pieper @ 2011-12-01 12:50 UTC (permalink / raw)
  To: Sage Weil; +Cc: ceph-devel

Sage,

I've been banging on a recent commit for a while now and haven't been
able to reproduce any of those problems.  Thanks!

-Josh

Sage Weil wrote:
> Hi Josh,
> 
> I just pushed a series of mon patches that rework some of this code.  I've 
> been trashing on them for a while now without problems, though my test 
> still needs to be better about the monitor workload it presents.
> 
> In any case, you should have better luck with the current master.
> 
> Thanks!
> sage
> 
> 
> On Sat, 19 Nov 2011, Josh Pieper wrote:
> 
> > I observed the following two crashes using the same test setup I have
> > had for the previous reports.  3x ubuntu 11.04 amd64 nodes with an rbd
> > load, this time each running cc5b5e17.
> > 
> > Links to the full logs are below each stack trace snip.
> > 
> > mon/OSDMonitor.cc: In function 'MOSDMap* OSDMonitor::build_incremental(epoch_t, epoch_t)', in thread '7fd11a363700'
> > mon/OSDMonitor.cc: 944: FAILED assert(0)
> >  ceph version 0.38-202-gcc5b5e1 (commit:cc5b5e17e64c011b68cc8560676182e187bbb503)
> >  1: (OSDMonitor::build_incremental(unsigned int, unsigned int)+0xc86) [0x497f76]
> >  2: (OSDMonitor::committed()+0x162) [0x49a1f2]
> >  3: (PaxosService::_commit()+0x19e) [0x491fee]
> >  4: (Context::complete(int)+0xa) [0x47c10a]
> >  5: (finish_contexts(CephContext*, std::list<Context*, std::allocator<Context*> >&, int)+0xca) [0x47da6a]
> >  6: (Paxos::handle_accept(MMonPaxos*)+0x5b4) [0x48dbc4]
> >  7: (Paxos::dispatch(PaxosServiceMessage*)+0x23b) [0x48f38b]
> >  8: (Monitor::_ms_dispatch(Message*)+0xcbf) [0x47b64f]
> >  9: (Monitor::ms_dispatch(Message*)+0x35) [0x486405]
> > 
> > http://joshp.no-ip.com:8080/20111119-mon-build_incremental.log
> > 
> > 
> > mon/PaxosService.cc: In function 'void PaxosService::propose_pending()', in thread '7f4dd6acd700'
> > mon/PaxosService.cc: 127: FAILED assert(have_pending)
> >  ceph version 0.38-202-gcc5b5e1 (commit:cc5b5e17e64c011b68cc8560676182e187bbb503)
> >  1: (PaxosService::propose_pending()+0x335) [0x4924b5]
> >  2: (PGMonitor::check_osd_map(unsigned int)+0xcb0) [0x4d42e0]
> >  3: (Context::complete(int)+0xa) [0x47c10a]
> >  4: (finish_contexts(CephContext*, std::list<Context*, std::allocator<Context*> >&, int)+0xca) [0x47da6a]
> >  5: (Paxos::handle_accept(MMonPaxos*)+0x5d8) [0x48dbe8]
> >  6: (Paxos::dispatch(PaxosServiceMessage*)+0x23b) [0x48f38b]
> >  7: (Monitor::_ms_dispatch(Message*)+0xcbf) [0x47b64f]
> >  8: (Monitor::ms_dispatch(Message*)+0x35) [0x486405]
> > 
> > http://joshp.no-ip.com:8080/20111119-mon-propose_pending.log
> > 
> > -Josh
> > --
> > To unsubscribe from this list: send the line "unsubscribe ceph-devel" in
> > the body of a message to majordomo@vger.kernel.org
> > More majordomo info at  http://vger.kernel.org/majordomo-info.html
> > 
> > 

^ permalink raw reply	[flat|nested] 3+ messages in thread

end of thread, other threads:[~2011-12-01 12:50 UTC | newest]

Thread overview: 3+ messages (download: mbox.gz follow: Atom feed
-- links below jump to the message on this page --
2011-11-19 15:03 A couple more ceph-mon crashes Josh Pieper
2011-11-22  0:03 ` Sage Weil
2011-12-01 12:50   ` Josh Pieper

This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.