From mboxrd@z Thu Jan 1 00:00:00 1970 From: =?ISO-8859-1?Q?Jens_Kristian_S=F8gaard?= Subject: Re: Hit suicide timeout after adding new osd Date: Sat, 19 Jan 2013 18:56:53 +0100 Message-ID: <50FADE65.5050403@mermaidconsulting.dk> References: <50F80C3A.9020007@mermaidconsulting.dk> <50F80EFF.7020803@widodh.nl> <50F80FA0.5010504@profihost.ag> <50F819B8.4070004@widodh.nl> <50F81A9F.2090104@profihost.ag> <50F85FEC.7030305@mermaidconsulting.dk> <50F930EE.9070201@mermaidconsulting.dk> <50F9C051.7070900@mermaidconsulting.dk> <50FA6681.10507@mermaidconsulting.dk> Mime-Version: 1.0 Content-Type: text/plain; charset=ISO-8859-1; format=flowed Content-Transfer-Encoding: QUOTED-PRINTABLE Return-path: Received: from mx06.stofanet.dk ([212.10.10.58]:45428 "EHLO mx06.stofanet.dk" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1752282Ab3ASR47 (ORCPT ); Sat, 19 Jan 2013 12:56:59 -0500 In-Reply-To: Sender: ceph-devel-owner@vger.kernel.org List-ID: To: Sage Weil Cc: Stefan Priebe , Wido den Hollander , "ceph-devel@vger.kernel.org" Hi Sage, > a I dropped the wip branch. I just repushed the patches (on top of=20 > bobtail) for you. Thanks for the new build! I have finished testing it on one osd - sadly the osd crashed again, bu= t=20 now with a new stack trace. 0> 2013-01-19 18:48:44.636938 7fb5597f2700 -1 ./osd/OSDMap.h: In=20 function 'const epoch_t& OSDMap::get_up_thru(int) const' thread=20 7fb5597f2700 time 2013-01-19 18:48:44.499494 =2E/osd/OSDMap.h: 367: FAILED assert(exists(osd)) ceph version 0.56.1-25-g25a6b1b (25a6b1b325db2a2b45963f83623c447ec577= c5ef) 1: /usr/bin/ceph-osd() [0x60db42] 2: /usr/bin/ceph-osd() [0x6e3b35] 3: (pg_interval_t::check_new_interval(std::vector > const&, std::vector >=20 const&, std::vector > const&, std::vector > const&, unsigned int, unsigned int,=20 std::tr1::shared_ptr, std::tr1::shared_ptr,= =20 long, pg_t, std::map, std::allocator > >*,= =20 std::ostream*)+0x250) [0x935590] 4: (PG::start_peering_interval(std::tr1::shared_ptr,=20 std::vector > const&, std::vector > const&)+0x353) [0x7563c3] 5: (PG::RecoveryState::Reset::react(PG::AdvMap const&)+0x21e) [0x7588= 7e] 6: (boost::statechart::detail::reaction_result=20 boost::statechart::simple_state,=20 (boost::statechart::history_mode)0>::local_react_impl_non_empty::local_= react_impl,=20 boost::statechart::custom_reaction,=20 boost::statechart::custom_reaction,=20 boost::statechart::custom_reaction,=20 boost::statechart::transition,=20 &boost::statechart::detail::no_context::= no_function>=20 >, boost::statechart::simple_state,=20 (boost::statechart::history_mode)0>=20 >(boost::statechart::simple_state,=20 (boost::statechart::history_mode)0>&, boost::statechart::event_base=20 const&, void const*)+0x86) [0x78abb6] 7: (boost::statechart::detail::reaction_result=20 boost::statechart::simple_state,=20 (boost::statechart::history_mode)0>::local_react_impl_non_empty::local_= react_impl,=20 boost::statechart::custom_reaction,=20 boost::statechart::custom_reaction,=20 boost::statechart::custom_reaction,=20 boost::statechart::custom_reaction,=20 boost::statechart::transition,=20 &boost::statechart::detail::no_context::= no_function>,=20 mpl_::na, mpl_::na, mpl_::na, mpl_::na, mpl_::na, mpl_::na, mpl_::na,=20 mpl_::na, mpl_::na, mpl_::na, mpl_::na, mpl_::na, mpl_::na, mpl_::na>,=20 boost::statechart::simple_state,=20 (boost::statechart::history_mode)0>=20 >(boost::statechart::simple_state,=20 (boost::statechart::history_mode)0>&, boost::statechart::event_base=20 const&, void const*)+0x53) [0x78ac33] 8:=20 (boost::statechart::state_machine,=20 boost::statechart::null_exception_translator>::send_event(boost::statec= hart::event_base=20 const&)+0x5b) [0x76f58b] 9:=20 (boost::statechart::state_machine,=20 boost::statechart::null_exception_translator>::process_event(boost::sta= techart::event_base=20 const&)+0x19) [0x76f619] 10: (PG::RecoveryState::handle_event(boost::statechart::event_base=20 const&, PG::RecoveryCtx*)+0x4d) [0x76f6cd] 11: (PG::handle_advance_map(std::tr1::shared_ptr,=20 std::tr1::shared_ptr, std::vector=20 >&, std::vector >&, PG::RecoveryCtx*)+0x196)=20 [0x72bf46] 12: (OSD::advance_pg(unsigned int, PG*, PG::RecoveryCtx*,=20 std::set, std::less >= ,=20 std::allocator > >*)+0x48b) [0x6cf14b] 13: (OSD::process_peering_events(std::list >= =20 const&)+0x2a6) [0x6cf7f6] 14: (OSD::PeeringWQ::_process(std::list >=20 const&)+0x17) [0x70a3f7] 15: (ThreadPool::worker(ThreadPool::WorkThread*)+0x95c) [0x8ccccc] 16: (ThreadPool::WorkThread::entry()+0x10) [0x8cdc40] 17: /lib64/libpthread.so.0() [0x360de07d14] 18: (clone()+0x6d) [0x360d6f167d] --=20 Jens Kristian S=F8gaard, Mermaid Consulting ApS, jens@mermaidconsulting.dk, http://www.mermaidconsulting.com/ -- To unsubscribe from this list: send the line "unsubscribe ceph-devel" i= n the body of a message to majordomo@vger.kernel.org More majordomo info at http://vger.kernel.org/majordomo-info.html