From: Ta Ba Tuan <tuantb@vccloud.vn>
To: Samuel Just <sam.just@inktank.com>, Sage Weil <sage@newdream.net>
Cc: "ceph-users@lists.ceph.com" <ceph-users@lists.ceph.com>,
"ceph-devel@vger.kernel.org" <ceph-devel@vger.kernel.org>,
MinhCD <minhcd@vccloud.vn>, Thanhnt <thanhnt@vccloud.vn>
Subject: Re: [ceph-users] Ceph Giant not fixed RepllicatedPG:NotStrimming?
Date: Sat, 01 Nov 2014 09:21:09 +0700 [thread overview]
Message-ID: <54544395.4030504@vccloud.vn> (raw)
In-Reply-To: <CA+4uBUbYPZzHhkkQaB5wS8E2-idnrsnJB21zfjA6akz7NHPrnA@mail.gmail.com>
Hi Samuel and Sage,
I will upgrde to Giant soon, Thank you so much.
--
Tuan
HaNoi-VietNam
On 11/01/2014 01:10 AM, Samuel Just wrote:
> You should start by upgrading to giant, many many bug fixes went in
> between .86 and giant.
> -Sam
>
> On Fri, Oct 31, 2014 at 8:54 AM, Ta Ba Tuan <tuantb@vccloud.vn> wrote:
>> Hi Sage Weil
>>
>> Thank for your repling. Yes, I'm using Ceph v.0.86,
>> I report some related bugs, Hope you help me,
>>
>> 2014-10-31 15:34:52.927965 7f85efb6b700 0 osd.21 104744 do_command r=0
>> 2014-10-31 15:34:53.105533 7f85f036c700 -1 *** Caught signal (Segmentation
>> fault) **
>> in thread 7f85f036c700
>> ceph version 0.86-106-g6f8524e (6f8524ef7673ab4448de2e0ff76638deaf03cae8)
>> 1: /usr/bin/ceph-osd() [0x9b6655]
>> 2: (()+0xfcb0) [0x7f8615726cb0]
>> 3: (ReplicatedPG::trim_object(hobject_t const&)+0x395) [0x811c25]
>> 4: (ReplicatedPG::TrimmingObjects::react(ReplicatedPG::SnapTrim
>> const&)+0x43e) [0x82baae]
>> 5: (boost::statechart::simple_state<ReplicatedPG::TrimmingObjects,
>> ReplicatedPG::SnapTrimmer, boost::mpl::list<mpl_::na, mpl_::na, mpl_::na,
>> mpl_::na, mpl_::na, mpl_::na, mpl_::na, mpl_::na, mpl_::na, mpl_::na,
>> mpl_::na, mpl_::na, mpl_::na, mpl_::na, mpl_::na, mpl_::na, mpl_::na,
>> mpl_::na, mpl_::na, mpl_::na>,
>> (boost::statechart::history_mode)0>::react_impl(boost::statechart::event_base
>> const&, void const*)+0xc0) [0x870c30]
>> 6: (boost::statechart::state_machine<ReplicatedPG::SnapTrimmer,
>> ReplicatedPG::NotTrimming, std::allocator<void>,
>> boost::statechart::null_exception_translator>::process_queued_events()+0xfb)
>> [0x8560db]
>> 7: (boost::statechart::state_machine<ReplicatedPG::SnapTrimmer,
>> ReplicatedPG::NotTrimming, std::allocator<void>,
>> boost::statechart::null_exception_translator>::process_event(boost::statechart::event_base
>> const&)+0x1e) [0x8562ae]
>> 8: (ReplicatedPG::snap_trimmer()+0x4f8) [0x7d5f48]
>> 9: (OSD::SnapTrimWQ::_process(PG*)+0x14) [0x6739b4]
>> 10: (ThreadPool::worker(ThreadPool::WorkThread*)+0x48e) [0xa8fa0e]
>> 11: (ThreadPool::WorkThread::entry()+0x10) [0xa927a0]
>> 12: (()+0x7e9a) [0x7f861571ee9a]
>> 13: (clone()+0x6d) [0x7f86140e931d]
>> NOTE: a copy of the executable, or `objdump -rdS <executable>` is needed to
>> interpret this.
>>
>> -9523> 2014-10-31 15:34:45.571962 7f85e3ee0700 5 -- op tracker -- seq:
>> 6937, time: 2014-10-31 15:34:45.531887, event: header_read, op: MOSDPGPus
>> h(6.749 104744
>> [PushOp(d2106749/rbd_data.a2e6185b9a8ef8.0000000000000803/head//6, version:
>> 104736'7736506, data_included: [0~4194304], data_size:
>> 4194304, omap_header_size: 0, omap_entries_size: 0, attrset_size: 2,
>> recovery_info:
>> ObjectRecoveryInfo(d2106749/rbd_data.a2e6185b9a8ef8.0000000000
>> 000803/head//6@104736'7736506, copy_subset: [0~4194304], clone_subset: {}),
>> after_progress: ObjectRecoveryProgress(!first, data_recovered_to:41943
>> 04, data_complete:true, omap_recovered_to:, omap_complete:true),
>> before_progress: ObjectRecoveryProgress(first, data_recovered_to:0,
>> data_complete
>> :false, omap_recovered_to:,
>> omap_complete:false)),PushOp(60940749/rbd_data.3435875ff78f67.0000000000001408/head//6,
>> version: 104736'7736579, data_
>> included: [0~335360], data_size: 335360, omap_header_size: 0,
>> omap_entries_size: 0, attrset_size: 2, recovery_info:
>> ObjectRecoveryInfo(60940749/rb
>> d_data.3435875ff78f67.0000000000001408/head//6@104736'7736579, copy_subset:
>> [0~335360], clone_subset: {}), after_progress: ObjectRecoveryProgress(
>> !first, data_recovered_to:335360, data_complete:true, omap_recovered_to:,
>> omap_complete:true), before_progress: ObjectRecoveryProgress(first, data
>> _recovered_to:0, data_complete:false, omap_recovered_to:,
>> omap_complete:false)),PushOp(922b1749/rbd_data.1c3dade6cdc10.00000000000014c5/head//6,
>> v
>> ersion: 104736'7736866, data_included: [0~4194304], data_size: 4194304,
>> omap_header_size: 0, omap_entries_size: 0, attrset_size: 2, recovery_info:
>>
>> ObjectRecoveryInfo(922b1749/rbd_data.1c3dade6cdc10.00000000000014c5/head//6@104736'7736866,
>> copy_subset: [0~4194304], clone_subset: {}), after_pr
>> ogress: ObjectRecoveryProgress(!first, data_recovered_to:4194304,
>> data_complete:true, omap_recovered_to:, omap_complete:true),
>> before_progress: Ob
>> jectRecoveryProgress(first, data_recovered_to:0, data_complete:false,
>> omap_recovered_to:, omap_complete:false))])
>>
>> -6933> 2014-10-31 15:34tha7.611229 7f85f737a700 5 osd.21 pg_epoch: 104744
>> pg[6.749( v 104744'7741801 (104665'7732106,104744'7741801] lb
>> 14886749/rbd_data.3955b9640616f2.000000000000f5e2/head//6 local-les=104661
>> n=1780 ec=164 les/c 104742/104735 104740/104741/103210) [74,112,21]/[74,112]
>> r=-1 lpr=104741 pi=64005-104740/278 luod=0'0 crt=104744'7741798
>> active+remapped] enter Started/ReplicaActive/RepNotRecovering
>>
>> I think having some missing objects, I can't start one osd that above
>> objects be pushed to that osd. Ceph'versions are slower 0.86 then appear
>> this bug?
>> Should I upgrade to Giant o resolve this bug?,
>>
>>
>> Thank you,
>> --
>> Tuan
>> HaNoi-VietNam
>>
>>
>> On 10/30/2014 10:02 PM, Sage Weil wrote:
>>
>> On Thu, 30 Oct 2014, Ta Ba Tuan wrote:
>>
>> Hi Everyone,
>>
>> I upgraded Ceph to Giant by installing *tar.gz package, but appeared some
>> errors related Object Trimming or Snap Trimming:
>> I think having some missing objects and be not recovered.
>>
>> Note that this isn't giant, which is 0.87, but something a few weeks
>> older. There were a few bugs fixed in this code, but we can't tell if
>> this was one of them without the log leading up to this message, which
>> should include either a failed assertion message or segmentation fault or
>> similar.
>>
>> Thanks!
>> sage
>>
>>
>> ceph version 0.86-106-g6f8524e (6f8524ef7673ab4448de2e0ff76638deaf03cae8)
>> 1: /usr/bin/ceph-osd() [0x9b6655]
>> 2: (()+0xfcb0) [0x7fa52c471cb0]
>> 3: (ReplicatedPG::trim_object(hobject_t const&)+0x395) [0x811c25]
>> 4: (ReplicatedPG::TrimmingObjects::react(ReplicatedPG::SnapTrim
>> const&)+0x43e) [0x82baae]
>> 5: (boost::statechart::simple_state<ReplicatedPG::TrimmingObjects,
>> ReplicatedPG::SnapTrimmer, boost::mpl::list<mpl_::na, mpl_::na, mpl_::na,
>> mpl
>> _::na, mpl_::na, mpl_::na, mpl_::na, mpl_::na, mpl_::na, mpl_::na, mpl_::na,
>> mpl_::na, mpl_::na, mpl_::na, mpl_::na, mpl_::na, mpl_::na, mpl_::na
>> , mpl_::na,
>> mpl_::na>,(boost::statechart::history_mode)0>::react_impl(boost::statechart::event_ba
>> se const&, void const*)+0xc0) [0x870c30]
>> 6: (boost::statechart::state_machine<ReplicatedPG::SnapTrimmer,
>> ReplicatedPG::NotTrimming, std::allocator<void>,
>> boost::statechart::null_excepti
>> on_translator>::process_queued_events()+0xfb) [0x8560db]
>> 7: (boost::statechart::state_machine<ReplicatedPG::SnapTrimmer,
>> ReplicatedPG::NotTrimming, std::allocator<void>,
>> boost::statechart::null_excepti
>> on_translator>::process_event(boost::statechart::event_base const&)+0x1e)
>> [0x8562ae]
>> 8: (ReplicatedPG::snap_trimmer()+0x4f8) [0x7d5f48]
>> 9: (OSD::SnapTrimWQ::_process(PG*)+0x14) [0x6739b4]
>> 10: (ThreadPool::worker(ThreadPool::WorkThread*)+0x48e) [0xa8fa0e]
>> 11: (ThreadPool::WorkThread::entry()+0x10) [0xa927a0]
>> 12: (()+0x7e9a) [0x7fa52c469e9a]
>> 13: (clone()+0x6d) [0x7fa52ae3431d]
>> NOTE: a copy of the executable, or `objdump -rdS <executable>` is needed to
>> interpret this.
>>
>>
>> -128> 2014-10-29 13:51:23.049357 7fa50ed9d700 5 osd.21 pg_epoch: 104445
>> pg[6.9d8( v 104445'7857889 (103730'7852406,104445'7857889] local-les=104444
>> n=4345 ec=164 les/c 104444/104272 104443/104443/104443) [21,93,49] r=0
>> lpr=104443 pi=103787-104442/16 crt=104442'7857887 mlcod 104445'7857888
>> active snaptrimq=[1907~1,1941~4,1946~1,19ef~2,19f2~1,19f4~3,19fa~5]] exit
>> Started/Primary/Active/Recovered 0.000084 0 0.000000
>> -127> 2014-10-29 13:51:23.049392 7fa50ed9d700 5 osd.21 pg_epoch: 104445
>> pg[6.9d8( v 104445'7857889 (103730'7852406,104445'7857889] local-les=104444
>> n=4345 ec=164 les/c 104444/104272 104443/104443/104443) [21,93,49] r=0
>> lpr=104443 pi=103787-104442/16 crt=104442'7857887 mlcod 104445'7857888
>> active snaptrimq=[1907~1,1941~4,1946~1,19ef~2,19f2~1,19f4~3,19fa~5]] enter
>> Started/Primary/Active/Clean
>> -126> 2014-10-29 13:51:23.049582 7fa50ed9d700 1 -- 172.30.5.2:6838/22980
>> --> 172.30.5.4:6859/8884 -- pg_info(1 pgs e104445:6.9d8) v4 -- ?+0
>> 0x30d41c00 con 0x26c6ac60
>>
>>
>> Thank you!
>> --
>> Tuan
>> HaNoi-VietNam
>>
>>
>>
>>
>> _______________________________________________
>> ceph-users mailing list
>> ceph-users@lists.ceph.com
>> http://lists.ceph.com/listinfo.cgi/ceph-users-ceph.com
>>
next prev parent reply other threads:[~2014-11-01 2:21 UTC|newest]
Thread overview: 10+ messages / expand[flat|nested] mbox.gz Atom feed top
2014-10-30 8:55 Ceph Giant not fixed RepllicatedPG:NotStrimming? Ta Ba Tuan
[not found] ` <5451FD06.5030000-QlevPasa8l681eZEIcUDRw@public.gmane.org>
2014-10-30 15:02 ` Sage Weil
[not found] ` <alpine.DEB.2.00.1410300801090.27022-vIokxiIdD2AQNTJnQDzGJqxOck334EZe@public.gmane.org>
2014-10-31 15:54 ` Ta Ba Tuan
[not found] ` <5453B0B1.1020404-QlevPasa8l681eZEIcUDRw@public.gmane.org>
2014-10-31 18:10 ` Samuel Just
2014-11-01 2:21 ` Ta Ba Tuan [this message]
[not found] ` <54544395.4030504-QlevPasa8l681eZEIcUDRw@public.gmane.org>
2014-11-03 5:10 ` Ta Ba Tuan
[not found] ` <54570E5D.9050009-QlevPasa8l681eZEIcUDRw@public.gmane.org>
2014-11-03 21:54 ` Samuel Just
[not found] ` <CA+4uBUaEuB3OW-Q3Lup+bx7WRYZqvL9P594XR8UHYO-H5A+82g-JsoAwUIsXosN+BqQ9rBEUg@public.gmane.org>
2014-11-04 9:03 ` Ta Ba Tuan
[not found] ` <5458964C.5070901-QlevPasa8l681eZEIcUDRw@public.gmane.org>
2014-11-05 4:36 ` David Zafman
[not found] ` <927B4C33-2889-4D70-9309-BDD1495C2FF1-H+wXaHxf7aLQT0dZR+AlfA@public.gmane.org>
2014-11-05 7:59 ` Ta Ba Tuan
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=54544395.4030504@vccloud.vn \
--to=tuantb@vccloud.vn \
--cc=ceph-devel@vger.kernel.org \
--cc=ceph-users@lists.ceph.com \
--cc=minhcd@vccloud.vn \
--cc=sage@newdream.net \
--cc=sam.just@inktank.com \
--cc=thanhnt@vccloud.vn \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.