All of lore.kernel.org
 help / color / mirror / Atom feed
From: Wido den Hollander <wido@42on.com>
To: Mohd Bazli Ab Karim <bazli.abkarim@mimos.my>,
	"ceph-users@lists.ceph.com" <ceph-users@lists.ceph.com>,
	"ceph-devel@vger.kernel.org" <ceph-devel@vger.kernel.org>
Subject: Re: MDS aborted after recovery and active, FAILED assert (r >=0)
Date: Fri, 16 Jan 2015 08:48:38 +0100	[thread overview]
Message-ID: <54B8C256.3060706@42on.com> (raw)
In-Reply-To: <20DBB415DF55354C8FC7AF12F4AE770024D3233A@MBX02-TPM.mimos.local>

On 01/16/2015 08:37 AM, Mohd Bazli Ab Karim wrote:
> Dear Ceph-Users, Ceph-Devel,
> 
> Apologize me if you get double post of this email.
> 
> I am running a ceph cluster version 0.72.2 and one MDS (in fact, it's 3, 2 down and only 1 up) at the moment.
> Plus I have one CephFS client mounted to it.
> 

In Ceph world 0.72.2 is ancient en pretty old. If you want to play with
CephFS I recommend you upgrade to 0.90 and also use at least kernel 3.18

> Now, the MDS always get aborted after recovery and active for 4 secs.
> Some parts of the log are as below:
> 
>     -3> 2015-01-15 14:10:28.464706 7fbcc8226700  1 -- 10.4.118.21:6800/5390 <== osd.19 10.4.118.32:6821/243161 73 ==== osd_op_re
> ply(3742 1000240c57e.00000000 [create 0~0,setxattr (99)] v56640'1871414 uv1871414 ondisk = 0) v6 ==== 221+0+0 (261801329 0 0) 0x
> 7770bc80 con 0x69c7dc0
>     -2> 2015-01-15 14:10:28.464730 7fbcc8226700  1 -- 10.4.118.21:6800/5390 <== osd.18 10.4.118.32:6818/243072 67 ==== osd_op_re
> ply(3645 1000007941c.00000000 [tmapup 0~0] v56640'1769567 uv1769567 ondisk = 0) v6 ==== 179+0+0 (3759887079 0 0) 0x7757ec80 con
> 0x1c6bb00
>     -1> 2015-01-15 14:10:28.464754 7fbcc8226700  1 -- 10.4.118.21:6800/5390 <== osd.47 10.4.118.35:6809/8290 79 ==== osd_op_repl
> y(3419 mds_anchortable [writefull 0~94394932] v0'0 uv0 ondisk = -90 (Message too long)) v6 ==== 174+0+0 (3942056372 0 0) 0x69f94
> a00 con 0x1c6b9a0
>      0> 2015-01-15 14:10:28.471684 7fbcc8226700 -1 mds/MDSTable.cc: In function 'void MDSTable::save_2(int, version_t)' thread 7
> fbcc8226700 time 2015-01-15 14:10:28.469999
> mds/MDSTable.cc: 83: FAILED assert(r >= 0)
> 
>  ceph version  ()
>  1: (MDSTable::save_2(int, unsigned long)+0x325) [0x769e25]
>  2: (Context::complete(int)+0x9) [0x568d29]
>  3: (Objecter::handle_osd_op_reply(MOSDOpReply*)+0x1097) [0x7c15d7]
>  4: (MDS::handle_core_message(Message*)+0x5a0) [0x588900]
>  5: (MDS::_dispatch(Message*)+0x2f) [0x58908f]
>  6: (MDS::ms_dispatch(Message*)+0x1e3) [0x58ab93]
>  7: (DispatchQueue::entry()+0x549) [0x975739]
>  8: (DispatchQueue::DispatchThread::entry()+0xd) [0x8902dd]
>  9: (()+0x7e9a) [0x7fbcccb0de9a]
>  10: (clone()+0x6d) [0x7fbccb4ba3fd]
>  NOTE: a copy of the executable, or `objdump -rdS <executable>` is needed to interpret this.
> 
> Is there any workaround/patch to fix this issue? Let me know if need to see the log with debug-mds of certain level as well.
> Any helps would be very much appreciated.
> 
> Thanks.
> Bazli
> 
> ________________________________
> DISCLAIMER:
> 
> 
> This e-mail (including any attachments) is for the addressee(s) only and may be confidential, especially as regards personal data. If you are not the intended recipient, please note that any dealing, review, distribution, printing, copying or use of this e-mail is strictly prohibited. If you have received this email in error, please notify the sender immediately and delete the original message (including any attachments).
> 
> 
> MIMOS Berhad is a research and development institution under the purview of the Malaysian Ministry of Science, Technology and Innovation. Opinions, conclusions and other information in this e-mail that do not relate to the official business of MIMOS Berhad and/or its subsidiaries shall be understood as neither given nor endorsed by MIMOS Berhad and/or its subsidiaries and neither MIMOS Berhad nor its subsidiaries accepts responsibility for the same. All liability arising from or in connection with computer viruses and/or corrupted e-mails is excluded to the fullest extent permitted by law.
> --
> To unsubscribe from this list: send the line "unsubscribe ceph-devel" in
> the body of a message to majordomo@vger.kernel.org
> More majordomo info at  http://vger.kernel.org/majordomo-info.html
> 


-- 
Wido den Hollander
42on B.V.
Ceph trainer and consultant

Phone: +31 (0)20 700 9902
Skype: contact42on

  reply	other threads:[~2015-01-16  7:48 UTC|newest]

Thread overview: 10+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2015-01-16  7:37 MDS aborted after recovery and active, FAILED assert (r >=0) Mohd Bazli Ab Karim
2015-01-16  7:48 ` Wido den Hollander [this message]
2015-01-16  8:36   ` Mohd Bazli Ab Karim
2015-01-16  8:52     ` Wido den Hollander
2015-01-16 10:57 ` John Spray
2015-01-16 15:57   ` John Spray
2015-01-19  3:33     ` Mohd Bazli Ab Karim
2015-01-20  1:13       ` Mohd Bazli Ab Karim
  -- strict thread matches above, loose matches on Subject: below --
2015-01-15  9:16 Bazli Karim
2015-01-15  6:35 Mohd Bazli Ab Karim

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=54B8C256.3060706@42on.com \
    --to=wido@42on.com \
    --cc=bazli.abkarim@mimos.my \
    --cc=ceph-devel@vger.kernel.org \
    --cc=ceph-users@lists.ceph.com \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.