* Ceph-mon crashing FAILED assert(addr_name.count(p->second) == 0)
@ 2015-07-10 22:31 kernel neophyte
2015-07-12 8:25 ` Joao Eduardo Luis
0 siblings, 1 reply; 4+ messages in thread
From: kernel neophyte @ 2015-07-10 22:31 UTC (permalink / raw)
To: ceph-devel@vger.kernel.org
Hi,
I am seeing the following error every time I am trying to manually
deploy ceph cluster and do a ceph -s :
mon/MonMap.h: In function 'void MonMap::calc_ranks()' thread
7fb3ccfb6700 time 2015-07-10 15:27:56.004148
mon/MonMap.h: 47: FAILED assert(addr_name.count(p->second) == 0)
ceph version 9.0.1-1445-g4a179ee (4a179eea527f7cbcf45eed4a63ad0fa8f744fc4a)
1: (()+0x12716b) [0x7fb3d53ec16b]
2: (()+0x1c0a6d) [0x7fb3d5485a6d]
3: (()+0x1b7909) [0x7fb3d547c909]
4: (()+0x1b8377) [0x7fb3d547d377]
5: (()+0x2fbbe7) [0x7fb3d55c0be7]
6: (()+0x310f1d) [0x7fb3d55d5f1d]
7: (()+0x3112a0) [0x7fb3d55d62a0]
8: (()+0x8182) [0x7fb3d9e05182]
9: (clone()+0x6d) [0x7fb3d9b3247d]
NOTE: a copy of the executable, or `objdump -rdS <executable>` is
needed to interpret this.
terminate called after throwing an instance of 'ceph::FailedAssertion'
Aborted (core dumped)
Any help greatly appreciated :-)
-Neo
^ permalink raw reply [flat|nested] 4+ messages in thread
* Re: Ceph-mon crashing FAILED assert(addr_name.count(p->second) == 0)
2015-07-10 22:31 Ceph-mon crashing FAILED assert(addr_name.count(p->second) == 0) kernel neophyte
@ 2015-07-12 8:25 ` Joao Eduardo Luis
2015-07-15 17:12 ` kernel neophyte
0 siblings, 1 reply; 4+ messages in thread
From: Joao Eduardo Luis @ 2015-07-12 8:25 UTC (permalink / raw)
To: kernel neophyte, ceph-devel@vger.kernel.org
On 07/10/2015 11:31 PM, kernel neophyte wrote:
> Hi,
>
> I am seeing the following error every time I am trying to manually
> deploy ceph cluster and do a ceph -s :
>
> mon/MonMap.h: In function 'void MonMap::calc_ranks()' thread
> 7fb3ccfb6700 time 2015-07-10 15:27:56.004148
>
> mon/MonMap.h: 47: FAILED assert(addr_name.count(p->second) == 0)
Please send us a copy of your monmap and ceph.conf.
You should also open a ticket on the tracker.
Thanks!
-Joao
>
> ceph version 9.0.1-1445-g4a179ee (4a179eea527f7cbcf45eed4a63ad0fa8f744fc4a)
>
> 1: (()+0x12716b) [0x7fb3d53ec16b]
>
> 2: (()+0x1c0a6d) [0x7fb3d5485a6d]
>
> 3: (()+0x1b7909) [0x7fb3d547c909]
>
> 4: (()+0x1b8377) [0x7fb3d547d377]
>
> 5: (()+0x2fbbe7) [0x7fb3d55c0be7]
>
> 6: (()+0x310f1d) [0x7fb3d55d5f1d]
>
> 7: (()+0x3112a0) [0x7fb3d55d62a0]
>
> 8: (()+0x8182) [0x7fb3d9e05182]
>
> 9: (clone()+0x6d) [0x7fb3d9b3247d]
>
> NOTE: a copy of the executable, or `objdump -rdS <executable>` is
> needed to interpret this.
>
> terminate called after throwing an instance of 'ceph::FailedAssertion'
>
> Aborted (core dumped)
>
> Any help greatly appreciated :-)
>
> -Neo
> --
> To unsubscribe from this list: send the line "unsubscribe ceph-devel" in
> the body of a message to majordomo@vger.kernel.org
> More majordomo info at http://vger.kernel.org/majordomo-info.html
>
^ permalink raw reply [flat|nested] 4+ messages in thread
* Re: Ceph-mon crashing FAILED assert(addr_name.count(p->second) == 0)
2015-07-12 8:25 ` Joao Eduardo Luis
@ 2015-07-15 17:12 ` kernel neophyte
2015-07-15 19:19 ` Vu Pham
0 siblings, 1 reply; 4+ messages in thread
From: kernel neophyte @ 2015-07-15 17:12 UTC (permalink / raw)
To: raju.kurunkad, vu, Sage Weil
Cc: ceph-devel@vger.kernel.org, Joao Eduardo Luis
I was able to dig bit further, I see this happening when using XIO as
messenger (Simple & Async works fine).
My Stack details are:
Linux Distro: Ubuntu
Kernel: 3.13.0-24-generic
OFED: I see it happen in both
MLNX_OFED_LINUX-2.4-1.0.4-ubuntu14.04-x86_64 &
MLNX_OFED_LINUX-3.0-1.0.1-ubuntu14.04-x86_64
Accelio: I see it happen in both https://github.com/accelio/accelio
Branch: master & https://github.com/vuhuong/accelio Branch:
master-v1.3-fix
monmap:
monmaptool -p /tmp/monmap
monmaptool: monmap file /tmp/monmap
epoch 0
fsid 41e024e8-b224-41c7-ab13-9d9b681f3b61
last_changed 2015-07-15 09:48:38.037222
created 2015-07-15 09:48:38.037222
0: 10.13.10.189:6789/0 mon.abc-def-ghij07
Debug/Log:
ceph -s
2015-07-15 09:54:07.763512 7f6df0c12700 -1 WARNING: the following
dangerous and experimental features are enabled: ms-type-xio
2015-07-15 09:54:07.766830 7f6df0c12700 -1 WARNING: the following
dangerous and experimental features are enabled: ms-type-xio
2015-07-15 09:54:07.766864 7f6df0c12700 -1 WARNING: experimental
feature 'ms-type-xio' is enabled
Please be aware that this feature is experimental, untested,
unsupported, and may result in data corruption, data loss,
and/or irreparable damage to your cluster. Do not use
feature with important data.
2015-07-15 09:54:07.767331 7f6df0c12700 2 [debug] xio_mempool.c:500
xio_mempool_create - mempool: using regular allocator
2015-07-15 09:54:07.768458 7f6df0c12700 4 XioMessenger 0x7f6de4018040
get_connection: xio_uri rdma://10.13.10.189:6789
2015-07-15 09:54:07.768466 7f6df0c12700 4 Peer type: mon
throttle_msgs: 1024 throttle_bytes: 536870912
2015-07-15 09:54:07.768483 7f6df0c12700 2 [debug] xio_mempool.c:494
xio_mempool_create - mempool: using huge pages allocator
2015-07-15 09:54:07.768581 7f6df0c12700 20 [trace]
xio_rdma_management.c:2646 xio_rdma_open - xio_rdma_open: [new]
handle:0x7f6de4055d40
2015-07-15 09:54:07.768587 7f6df0c12700 20 [trace] xio_nexus.c:1903
xio_nexus_open - nexus: [new] nexus:0x7f6de4055b20,
transport_hndl:0x7f6de4055d40
2015-07-15 09:54:07.768595 7f6df0c12700 20 [trace] xio_nexus.c:1993
xio_nexus_connect - xio_nexus_connect: nexus:0x7f6de4055b20,
rdma_hndl:0x7f6de4055d40, portal:rdma://10.13.10.189:6789
2015-07-15 09:54:07.768759 7f6df0c12700 2 [debug]
xio_session_client.c:1022 xio_connect - xio_connect:
session:0x7f6de40558f0, connection:0x7f6de4056e20, ctx:0x7f6de40179d0,
nexus:0x7f6de4055b20
2015-07-15 09:54:07.768767 7f6df0c12700 2 new connection xcon:
0x7f6de4055460 up_ready on session 0x7f6de40558f0
2015-07-15 09:54:07.768860 7f6df0c12700 4 _send_message_impl
0x7f6de40578e0 new XioMsg 0x7f6df014f040 req_0 0x7f6df014f160 msg type
17 features: 0 conn 0x7f6de4056e20 sess 0x7f6de40558f0
2015-07-15 09:54:07.768867 7f6df0c12700 10 ex_cnt 0, req_off -1, msg_cnt 1
2015-07-15 09:54:07.768863 7f6de9ac4700 2 [debug]
xio_rdma_management.c:2386 xio_handle_cm_event - cm event:
[RDMA_CM_EVENT_ADDR_RESOLVED], hndl:0x7f6de4055d40, status:0
2015-07-15 09:54:07.769112 7f6de9ac4700 2 [debug]
xio_rdma_management.c:2386 xio_handle_cm_event - cm event:
[RDMA_CM_EVENT_ROUTE_RESOLVED], hndl:0x7f6de4055d40, status:0
2015-07-15 09:54:07.769707 7f6de9ac4700 20 [trace]
xio_rdma_management.c:469 xio_cq_get - comp_vec:17
2015-07-15 09:54:07.770201 7f6de9ac4700 2 [debug]
xio_rdma_management.c:961 xio_qp_create - rdma qp: [new]
handle:0x7f6de4055d40, qp:0x260, max inline:448
2015-07-15 09:54:07.770231 7f6de9ac4700 4 xio_send_msg xio msg: sn: 0
timestamp: 256752506178532
2015-07-15 09:54:07.770232 7f6de9ac4700 4 xio_send_msg ceph header:
front_len: 60 seq: 1 tid: 0 type: 17 prio: 0 name type: 8 name num: -1
version: 1 compat_version: 1 front_len: 60 middle_len: 0 data_len: 0
xio header: msg_cnt: 1
2015-07-15 09:54:07.770235 7f6de9ac4700 4 xio_send_msg ceph footer:
front_crc: 0 middle_crc: 0 data_crc: 0 sig: 0 flags: 3
2015-07-15 09:54:07.778297 7f6de9ac4700 2 [debug]
xio_rdma_management.c:2386 xio_handle_cm_event - cm event:
[RDMA_CM_EVENT_ESTABLISHED], hndl:0x7f6de4055d40, status:0
2015-07-15 09:54:07.778308 7f6de9ac4700 2 [debug] xio_nexus.c:1632
xio_nexus_on_transport_event - nexus: [notification] - transport
established. nexus:0x7f6de4055b20, transport:0x7f6de4055d40
2015-07-15 09:54:07.778330 7f6de9ac4700 20 [trace]
xio_rdma_verbs.c:240 xio_reg_mr_ex_dev - before ibv_reg_mr
2015-07-15 09:54:07.778399 7f6de9ac4700 20 [trace]
xio_rdma_verbs.c:242 xio_reg_mr_ex_dev - after ibv_reg_mr
2015-07-15 09:54:07.778404 7f6de9ac4700 20 [trace]
xio_rdma_verbs.c:240 xio_reg_mr_ex_dev - before ibv_reg_mr
2015-07-15 09:54:07.778462 7f6de9ac4700 20 [trace]
xio_rdma_verbs.c:242 xio_reg_mr_ex_dev - after ibv_reg_mr
2015-07-15 09:54:07.778480 7f6de9ac4700 2 [debug]
xio_rdma_management.c:1274 xio_rdma_initial_pool_post_create -
post_recv conn_setup rx task:0x7f6ddc009400
2015-07-15 09:54:07.778489 7f6de9ac4700 20 [trace] xio_nexus.c:384
xio_nexus_send_setup_req - send setup request
2015-07-15 09:54:07.778492 7f6de9ac4700 20 [trace] xio_nexus.c:426
xio_nexus_send_setup_req - xio_nexus_send_setup_req:
nexus:0x7f6de4055b20, rdma_hndl:0x7f6de4055d40
2015-07-15 09:54:07.778497 7f6de9ac4700 20 [trace]
xio_rdma_datapath.c:4180 xio_rdma_send_setup_req - rdma send setup
request
2015-07-15 09:54:07.782250 7f6de9ac4700 20 [trace]
xio_rdma_datapath.c:4319 xio_rdma_on_setup_msg - setup complete.
send_buf_sz:17408
2015-07-15 09:54:07.782258 7f6de9ac4700 20 [trace] xio_nexus.c:661
xio_nexus_on_recv_setup_rsp - receiving setup response.
nexus:0x7f6de4055b20
2015-07-15 09:54:07.782261 7f6de9ac4700 20 [trace] xio_nexus.c:714
xio_nexus_on_recv_setup_rsp - xio_nexus_on_recv_setup_rsp:
nexus:0x7f6de4055b20, trans_hndl:0x7f6de4055d40
2015-07-15 09:54:07.782866 7f6de9ac4700 20 [trace]
xio_rdma_verbs.c:240 xio_reg_mr_ex_dev - before ibv_reg_mr
2015-07-15 09:54:07.785038 7f6de9ac4700 20 [trace]
xio_rdma_verbs.c:242 xio_reg_mr_ex_dev - after ibv_reg_mr
2015-07-15 09:54:07.785045 7f6de9ac4700 20 [trace]
xio_rdma_verbs.c:240 xio_reg_mr_ex_dev - before ibv_reg_mr
2015-07-15 09:54:07.785247 7f6de9ac4700 20 [trace]
xio_rdma_verbs.c:242 xio_reg_mr_ex_dev - after ibv_reg_mr
2015-07-15 09:54:07.785257 7f6de9ac4700 2 [debug]
xio_rdma_management.c:1572 xio_rdma_primary_pool_slab_pre_create -
pool buf:0x7f6de31e7000, mr:0x7f6ddc0093b0
2015-07-15 09:54:07.785495 7f6de9ac4700 2 [debug]
xio_session_client.c:793 xio_client_on_nexus_event - session:
[notification] - nexus established. session:0x7f6de40558f0,
nexus:0x7f6de4055b20
2015-07-15 09:54:07.785692 7f6de9ac4700 2 [debug]
xio_session_client.c:503 xio_on_setup_rsp_recv - task recycled
2015-07-15 09:54:07.785700 7f6de9ac4700 20 [trace]
xio_session_client.c:517 xio_on_setup_rsp_recv - session state is now
ONLINE. session:0x7f6de40558f0
2015-07-15 09:54:07.785703 7f6de9ac4700 20 [trace]
xio_session_client.c:547 xio_on_setup_rsp_recv - session state is now
ACCEPT. session:0x7f6de40558f0
2015-07-15 09:54:07.785711 7f6de9ac4700 2 [debug]
xio_connection.c:1719 xio_disconnect_initial_connection - send fin
request. session:0x7f6de40558f0, connection:0x7f6ddc015aa0
2015-07-15 09:54:07.785715 7f6de9ac4700 20 [trace]
xio_connection.c:1726 xio_disconnect_initial_connection - connection
0x7f6ddc015aa0 state change: current_state:ONLINE,
next_state:FIN_WAIT_1
2015-07-15 09:54:07.785725 7f6de9ac4700 20 [trace]
xio_rdma_management.c:2646 xio_rdma_open - xio_rdma_open: [new]
handle:0x7f6ddc0162d0
2015-07-15 09:54:07.785727 7f6de9ac4700 20 [trace] xio_nexus.c:1903
xio_nexus_open - nexus: [new] nexus:0x7f6ddc0160b0,
transport_hndl:0x7f6ddc0162d0
2015-07-15 09:54:07.785728 7f6de9ac4700 2 [debug]
xio_session_client.c:196 xio_session_accept_connections - reconnecting
to rdma://10.13.10.189:6800. connection:0x7f6de4056e20,
nexus:0x7f6ddc0160b0
2015-07-15 09:54:07.785734 7f6de9ac4700 20 [trace] xio_nexus.c:1993
xio_nexus_connect - xio_nexus_connect: nexus:0x7f6ddc0160b0,
rdma_hndl:0x7f6ddc0162d0, portal:rdma://10.13.10.189:6800
2015-07-15 09:54:07.785761 7f6de9ac4700 2 [debug]
xio_connection.c:2526 xio_on_fin_req_recv - fin request received.
session:0x7f6de40558f0, connection:0x7f6ddc015aa0
2015-07-15 09:54:07.785764 7f6de9ac4700 2 [debug]
xio_connection.c:1682 xio_send_fin_ack - send fin response.
session:0x7f6de40558f0, connection:0x7f6ddc015aa0
2015-07-15 09:54:07.785767 7f6de9ac4700 2 [debug]
xio_connection.c:2367 xio_on_fin_req_send_comp - got fin request send
completion. session:0x7f6de40558f0, connection:0x7f6ddc015aa0
2015-07-15 09:54:07.785769 7f6de9ac4700 2 [debug]
xio_connection.c:2454 xio_on_fin_ack_recv - got fin ack.
session:0x7f6de40558f0, connection:0x7f6ddc015aa0
2015-07-15 09:54:07.785771 7f6de9ac4700 2 [debug]
xio_connection.c:2495 xio_on_fin_ack_recv - connection 0x7f6ddc015aa0
state change: current_state:FIN_WAIT_1, next_state:FIN_WAIT_2
2015-07-15 09:54:07.785772 7f6de9ac4700 2 [debug]
xio_connection.c:2560 xio_on_fin_ack_send_comp - fin ack send
completion received. session:0x7f6de40558f0, connection:0x7f6ddc015aa0
2015-07-15 09:54:07.785773 7f6de9ac4700 2 [debug]
xio_connection.c:2583 xio_on_fin_ack_send_comp - connection
0x7f6ddc015aa0 state change: current_state:FIN_WAIT_2,
next_state:TIME_WAIT
2015-07-15 09:54:07.785784 7f6de9ac4700 2 [debug]
xio_rdma_management.c:2386 xio_handle_cm_event - cm event:
[RDMA_CM_EVENT_ADDR_RESOLVED], hndl:0x7f6ddc0162d0, status:0
2015-07-15 09:54:07.786008 7f6de9ac4700 2 [debug]
xio_rdma_management.c:2386 xio_handle_cm_event - cm event:
[RDMA_CM_EVENT_ROUTE_RESOLVED], hndl:0x7f6ddc0162d0, status:0
2015-07-15 09:54:07.786468 7f6de9ac4700 2 [debug]
xio_rdma_management.c:961 xio_qp_create - rdma qp: [new]
handle:0x7f6ddc0162d0, qp:0x262, max inline:448
2015-07-15 09:54:07.787793 7f6de9ac4700 2 [debug]
xio_connection.c:2383 xio_close_time_wait - connection 0x7f6ddc015aa0
state change: current_state:TIME_WAIT, next_state:CLOSED
2015-07-15 09:54:07.787804 7f6de9ac4700 2 [debug]
xio_connection.c:2183 xio_connection_destroy - xio_connection_destroy.
session:0x7f6de40558f0, connection:0x7f6ddc015aa0 nexus:0x7f6de4055b20
nr:1, state:CLOSED
2015-07-15 09:54:07.787807 7f6de9ac4700 2 [debug]
xio_connection.c:2108 xio_connection_post_destroy -
xio_connection_post_destroy. session:0x7f6de40558f0,
connection:0x7f6ddc015aa0 conn:0x7f6de4055b20 nr:1
2015-07-15 09:54:07.787810 7f6de9ac4700 20 [trace] xio_nexus.c:2139
xio_nexus_close - nexus: [putref] ptr:0x7f6de4055b20, refcnt:1
2015-07-15 09:54:07.787812 7f6de9ac4700 2 [debug]
xio_session_client.c:808 xio_client_on_nexus_event - session:
[notification] - nexus closed. session:0x7f6de40558f0,
nexus:0x7f6de4055b20
2015-07-15 09:54:07.787814 7f6de9ac4700 20 [trace] xio_session.c:987
xio_on_nexus_closed - session:0x7f6de40558f0 - nexus:0x7f6de4055b20
close complete
2015-07-15 09:54:07.787816 7f6de9ac4700 20 [trace] xio_nexus.c:2111
xio_nexus_delayed_close - xio_nexus_deleyed close.
nexus:0x7f6de4055b20, state:4
2015-07-15 09:54:07.787817 7f6de9ac4700 20 [trace]
xio_connection.c:2122 xio_connection_post_destroy - lead connection is
closed
2015-07-15 09:54:07.790664 7f6de9ac4700 2 [debug]
xio_rdma_management.c:2386 xio_handle_cm_event - cm event:
[RDMA_CM_EVENT_ESTABLISHED], hndl:0x7f6ddc0162d0, status:0
2015-07-15 09:54:07.790672 7f6de9ac4700 2 [debug] xio_nexus.c:1632
xio_nexus_on_transport_event - nexus: [notification] - transport
established. nexus:0x7f6ddc0160b0, transport:0x7f6ddc0162d0
2015-07-15 09:54:07.790675 7f6de9ac4700 2 [debug]
xio_rdma_management.c:1274 xio_rdma_initial_pool_post_create -
post_recv conn_setup rx task:0x7f6ddc009400
2015-07-15 09:54:07.790680 7f6de9ac4700 20 [trace] xio_nexus.c:384
xio_nexus_send_setup_req - send setup request
2015-07-15 09:54:07.790682 7f6de9ac4700 20 [trace] xio_nexus.c:426
xio_nexus_send_setup_req - xio_nexus_send_setup_req:
nexus:0x7f6ddc0160b0, rdma_hndl:0x7f6ddc0162d0
2015-07-15 09:54:07.790683 7f6de9ac4700 20 [trace]
xio_rdma_datapath.c:4180 xio_rdma_send_setup_req - rdma send setup
request
2015-07-15 09:54:07.794359 7f6de9ac4700 20 [trace]
xio_rdma_datapath.c:4319 xio_rdma_on_setup_msg - setup complete.
send_buf_sz:17408
2015-07-15 09:54:07.794367 7f6de9ac4700 20 [trace] xio_nexus.c:661
xio_nexus_on_recv_setup_rsp - receiving setup response.
nexus:0x7f6ddc0160b0
2015-07-15 09:54:07.794369 7f6de9ac4700 20 [trace] xio_nexus.c:714
xio_nexus_on_recv_setup_rsp - xio_nexus_on_recv_setup_rsp:
nexus:0x7f6ddc0160b0, trans_hndl:0x7f6ddc0162d0
2015-07-15 09:54:07.795445 7f6de9ac4700 20 [trace]
xio_rdma_verbs.c:240 xio_reg_mr_ex_dev - before ibv_reg_mr
2015-07-15 09:54:07.798625 7f6de9ac4700 20 [trace]
xio_rdma_verbs.c:242 xio_reg_mr_ex_dev - after ibv_reg_mr
2015-07-15 09:54:07.798633 7f6de9ac4700 20 [trace]
xio_rdma_verbs.c:240 xio_reg_mr_ex_dev - before ibv_reg_mr
2015-07-15 09:54:07.798929 7f6de9ac4700 20 [trace]
xio_rdma_verbs.c:242 xio_reg_mr_ex_dev - after ibv_reg_mr
2015-07-15 09:54:07.798940 7f6de9ac4700 2 [debug]
xio_rdma_management.c:1572 xio_rdma_primary_pool_slab_pre_create -
pool buf:0x7f6de25a1000, mr:0x7f6ddc015d40
2015-07-15 09:54:07.799371 7f6de9ac4700 2 [debug]
xio_session_client.c:793 xio_client_on_nexus_event - session:
[notification] - nexus established. session:0x7f6de40558f0,
nexus:0x7f6ddc0160b0
2015-07-15 09:54:07.799376 7f6de9ac4700 2 [debug]
xio_connection.c:2030 xio_connection_send_hello_req - send hello
request. session:0x7f6de40558f0, connection:0x7f6de4056e20
2015-07-15 09:54:07.799522 7f6de9ac4700 2 [debug]
xio_connection.c:2643 xio_on_connection_hello_rsp_recv - recv hello
response. session:0x7f6de40558f0, connection:0x7f6de4056e20
2015-07-15 09:54:07.799530 7f6de9ac4700 2 [debug]
xio_connection.c:2647 xio_on_connection_hello_rsp_recv - got hello
response. session:0x7f6de40558f0, connection:0x7f6de4056e20
2015-07-15 09:54:07.799533 7f6de9ac4700 4 session event: connection
established. reason: Success
2015-07-15 09:54:07.799538 7f6de9ac4700 2 connection established
0x7f6de4056e20 session 0x7f6de40558f0 xcon 0x7f6de4055460
2015-07-15 09:54:07.799541 7f6de9ac4700 2 learned my addr
10.13.10.189:0/1003368
2015-07-15 09:54:07.799546 7f6de9ac4700 2 client: connected from
10.13.10.189:49134/0 to 10.13.10.189:6800/0
2015-07-15 09:54:07.799576 7f6de9ac4700 11 on_msg_delivered xcon:
0x7f6de4055460 session: 0x7f6de40558f0 msg: 0x7f6df014f160 sn: 0 type:
17 tid: 0 seq: 1
2015-07-15 09:54:07.804553 7f6de9ac4700 10 on_msg_req receive req treq
0x7f6de8162598 msg_cnt 1 iov_base 0x7f6de31f1060 iov_len 216 nents 1
conn 0x7f6de4056e20 sess 0x7f6de40558f0 sn 0
2015-07-15 09:54:07.804564 7f6de9ac4700 4 on_msg_req msg_seq.size()=1
2015-07-15 09:54:07.804590 7f6de9ac4700 10 on_msg_req receive req treq
0x7f6de8160178 msg_cnt 1 iov_base 0x7f6de31e7060 iov_len 216 nents 1
conn 0x7f6de4056e20 sess 0x7f6de40558f0 sn 1
2015-07-15 09:54:07.804594 7f6de9ac4700 4 on_msg_req msg_seq.size()=1
mon/MonMap.h: In function 'void MonMap::calc_ranks()' thread
7f6de92c3700 time 2015-07-15 09:54:07.804654
mon/MonMap.h: 47: FAILED assert(addr_name.count(p->second) == 0)
ceph version 9.0.1-1494-g8fc0496 (8fc049664bc798432e1750da86b1f216f85a842d)
1: (()+0x12637b) [0x7f6df585037b]
2: (()+0x1bf86d) [0x7f6df58e986d]
3: (()+0x1b6709) [0x7f6df58e0709]
4: (()+0x1b7177) [0x7f6df58e1177]
5: (()+0x2faa27) [0x7f6df5a24a27]
6: (()+0x30fd2d) [0x7f6df5a39d2d]
7: (()+0x3100b0) [0x7f6df5a3a0b0]
8: (()+0x8182) [0x7f6dfa785182]
9: (clone()+0x6d) [0x7f6dfa4b247d]
NOTE: a copy of the executable, or `objdump -rdS <executable>` is
needed to interpret this.
terminate called after throwing an instance of 'ceph::FailedAssertion'
Aborted (core dumped) -> No Idea where this core is! looked at PWD, & /var/crash
Any help / pointers greatly appreciated (-:
-Neo
On Sun, Jul 12, 2015 at 1:25 AM, Joao Eduardo Luis <joao@suse.de> wrote:
> On 07/10/2015 11:31 PM, kernel neophyte wrote:
>> Hi,
>>
>> I am seeing the following error every time I am trying to manually
>> deploy ceph cluster and do a ceph -s :
>>
>> mon/MonMap.h: In function 'void MonMap::calc_ranks()' thread
>> 7fb3ccfb6700 time 2015-07-10 15:27:56.004148
>>
>> mon/MonMap.h: 47: FAILED assert(addr_name.count(p->second) == 0)
>
> Please send us a copy of your monmap and ceph.conf.
>
> You should also open a ticket on the tracker.
>
> Thanks!
>
> -Joao
>
>>
>> ceph version 9.0.1-1445-g4a179ee (4a179eea527f7cbcf45eed4a63ad0fa8f744fc4a)
>>
>> 1: (()+0x12716b) [0x7fb3d53ec16b]
>>
>> 2: (()+0x1c0a6d) [0x7fb3d5485a6d]
>>
>> 3: (()+0x1b7909) [0x7fb3d547c909]
>>
>> 4: (()+0x1b8377) [0x7fb3d547d377]
>>
>> 5: (()+0x2fbbe7) [0x7fb3d55c0be7]
>>
>> 6: (()+0x310f1d) [0x7fb3d55d5f1d]
>>
>> 7: (()+0x3112a0) [0x7fb3d55d62a0]
>>
>> 8: (()+0x8182) [0x7fb3d9e05182]
>>
>> 9: (clone()+0x6d) [0x7fb3d9b3247d]
>>
>> NOTE: a copy of the executable, or `objdump -rdS <executable>` is
>> needed to interpret this.
>>
>> terminate called after throwing an instance of 'ceph::FailedAssertion'
>>
>> Aborted (core dumped)
>>
>> Any help greatly appreciated :-)
>>
>> -Neo
>> --
>> To unsubscribe from this list: send the line "unsubscribe ceph-devel" in
>> the body of a message to majordomo@vger.kernel.org
>> More majordomo info at http://vger.kernel.org/majordomo-info.html
>>
>
^ permalink raw reply [flat|nested] 4+ messages in thread
* Re: Ceph-mon crashing FAILED assert(addr_name.count(p->second) == 0)
2015-07-15 17:12 ` kernel neophyte
@ 2015-07-15 19:19 ` Vu Pham
0 siblings, 0 replies; 4+ messages in thread
From: Vu Pham @ 2015-07-15 19:19 UTC (permalink / raw)
To: kernel neophyte, raju.kurunkad@sandisk.com, Sage Weil
Cc: ceph-devel@vger.kernel.org, Joao Eduardo Luis
Looking at debug log, I don't see any unusual thing in accelio & xio
messenger.
I guess the problem happened because you installed MLNX_OFED_LINUX
packages after compiling & installing accelio and ceph
Installing MLNX_OFED package (with xen & kdm supports) will look for and
install (re-install) distributions default librados and librbd. This
will mess up your librados & librbd from ceph v9.0.1 package.
Could you re-compile & re-install accelio & ceph v9.0.1 again and
reproduce?
-vu
On 7/15/2015 10:12:03 AM, "kernel neophyte"
<neophyte.hacker001@gmail.com> wrote:
>I was able to dig bit further, I see this happening when using XIO as
>messenger (Simple & Async works fine).
>
>My Stack details are:
>
>Linux Distro: Ubuntu
>Kernel: 3.13.0-24-generic
>OFED: I see it happen in both
>MLNX_OFED_LINUX-2.4-1.0.4-ubuntu14.04-x86_64 &
>MLNX_OFED_LINUX-3.0-1.0.1-ubuntu14.04-x86_64
>Accelio: I see it happen in both https://github.com/accelio/accelio
>Branch: master & https://github.com/vuhuong/accelio Branch:
>master-v1.3-fix
>
>
>monmap:
>
>monmaptool -p /tmp/monmap
>
>monmaptool: monmap file /tmp/monmap
>
>epoch 0
>
>fsid 41e024e8-b224-41c7-ab13-9d9b681f3b61
>
>last_changed 2015-07-15 09:48:38.037222
>
>created 2015-07-15 09:48:38.037222
>
>0: 10.13.10.189:6789/0 mon.abc-def-ghij07
>
>
>
>Debug/Log:
>
>ceph -s
>
>2015-07-15 09:54:07.763512 7f6df0c12700 -1 WARNING: the following
>dangerous and experimental features are enabled: ms-type-xio
>
>2015-07-15 09:54:07.766830 7f6df0c12700 -1 WARNING: the following
>dangerous and experimental features are enabled: ms-type-xio
>
>2015-07-15 09:54:07.766864 7f6df0c12700 -1 WARNING: experimental
>feature 'ms-type-xio' is enabled
>
>Please be aware that this feature is experimental, untested,
>
>unsupported, and may result in data corruption, data loss,
>
>and/or irreparable damage to your cluster. Do not use
>
>feature with important data.
>
>
>2015-07-15 09:54:07.767331 7f6df0c12700 2 [debug] xio_mempool.c:500
>xio_mempool_create - mempool: using regular allocator
>
>
>2015-07-15 09:54:07.768458 7f6df0c12700 4 XioMessenger 0x7f6de4018040
>get_connection: xio_uri rdma://10.13.10.189:6789
>
>2015-07-15 09:54:07.768466 7f6df0c12700 4 Peer type: mon
>throttle_msgs: 1024 throttle_bytes: 536870912
>
>2015-07-15 09:54:07.768483 7f6df0c12700 2 [debug] xio_mempool.c:494
>xio_mempool_create - mempool: using huge pages allocator
>
>
>2015-07-15 09:54:07.768581 7f6df0c12700 20 [trace]
>xio_rdma_management.c:2646 xio_rdma_open - xio_rdma_open: [new]
>handle:0x7f6de4055d40
>
>
>2015-07-15 09:54:07.768587 7f6df0c12700 20 [trace] xio_nexus.c:1903
>xio_nexus_open - nexus: [new] nexus:0x7f6de4055b20,
>transport_hndl:0x7f6de4055d40
>
>
>2015-07-15 09:54:07.768595 7f6df0c12700 20 [trace] xio_nexus.c:1993
>xio_nexus_connect - xio_nexus_connect: nexus:0x7f6de4055b20,
>rdma_hndl:0x7f6de4055d40, portal:rdma://10.13.10.189:6789
>
>
>2015-07-15 09:54:07.768759 7f6df0c12700 2 [debug]
>xio_session_client.c:1022 xio_connect - xio_connect:
>session:0x7f6de40558f0, connection:0x7f6de4056e20, ctx:0x7f6de40179d0,
>nexus:0x7f6de4055b20
>
>
>2015-07-15 09:54:07.768767 7f6df0c12700 2 new connection xcon:
>0x7f6de4055460 up_ready on session 0x7f6de40558f0
>
>2015-07-15 09:54:07.768860 7f6df0c12700 4 _send_message_impl
>0x7f6de40578e0 new XioMsg 0x7f6df014f040 req_0 0x7f6df014f160 msg type
>17 features: 0 conn 0x7f6de4056e20 sess 0x7f6de40558f0
>
>2015-07-15 09:54:07.768867 7f6df0c12700 10 ex_cnt 0, req_off -1,
>msg_cnt 1
>
>2015-07-15 09:54:07.768863 7f6de9ac4700 2 [debug]
>xio_rdma_management.c:2386 xio_handle_cm_event - cm event:
>[RDMA_CM_EVENT_ADDR_RESOLVED], hndl:0x7f6de4055d40, status:0
>
>
>2015-07-15 09:54:07.769112 7f6de9ac4700 2 [debug]
>xio_rdma_management.c:2386 xio_handle_cm_event - cm event:
>[RDMA_CM_EVENT_ROUTE_RESOLVED], hndl:0x7f6de4055d40, status:0
>
>
>2015-07-15 09:54:07.769707 7f6de9ac4700 20 [trace]
>xio_rdma_management.c:469 xio_cq_get - comp_vec:17
>
>
>2015-07-15 09:54:07.770201 7f6de9ac4700 2 [debug]
>xio_rdma_management.c:961 xio_qp_create - rdma qp: [new]
>handle:0x7f6de4055d40, qp:0x260, max inline:448
>
>
>2015-07-15 09:54:07.770231 7f6de9ac4700 4 xio_send_msg xio msg: sn: 0
>timestamp: 256752506178532
>
>2015-07-15 09:54:07.770232 7f6de9ac4700 4 xio_send_msg ceph header:
>front_len: 60 seq: 1 tid: 0 type: 17 prio: 0 name type: 8 name num: -1
>version: 1 compat_version: 1 front_len: 60 middle_len: 0 data_len: 0
>xio header: msg_cnt: 1
>
>2015-07-15 09:54:07.770235 7f6de9ac4700 4 xio_send_msg ceph footer:
>front_crc: 0 middle_crc: 0 data_crc: 0 sig: 0 flags: 3
>
>2015-07-15 09:54:07.778297 7f6de9ac4700 2 [debug]
>xio_rdma_management.c:2386 xio_handle_cm_event - cm event:
>[RDMA_CM_EVENT_ESTABLISHED], hndl:0x7f6de4055d40, status:0
>
>
>2015-07-15 09:54:07.778308 7f6de9ac4700 2 [debug] xio_nexus.c:1632
>xio_nexus_on_transport_event - nexus: [notification] - transport
>established. nexus:0x7f6de4055b20, transport:0x7f6de4055d40
>
>
>2015-07-15 09:54:07.778330 7f6de9ac4700 20 [trace]
>xio_rdma_verbs.c:240 xio_reg_mr_ex_dev - before ibv_reg_mr
>
>
>2015-07-15 09:54:07.778399 7f6de9ac4700 20 [trace]
>xio_rdma_verbs.c:242 xio_reg_mr_ex_dev - after ibv_reg_mr
>
>
>2015-07-15 09:54:07.778404 7f6de9ac4700 20 [trace]
>xio_rdma_verbs.c:240 xio_reg_mr_ex_dev - before ibv_reg_mr
>
>
>2015-07-15 09:54:07.778462 7f6de9ac4700 20 [trace]
>xio_rdma_verbs.c:242 xio_reg_mr_ex_dev - after ibv_reg_mr
>
>
>2015-07-15 09:54:07.778480 7f6de9ac4700 2 [debug]
>xio_rdma_management.c:1274 xio_rdma_initial_pool_post_create -
>post_recv conn_setup rx task:0x7f6ddc009400
>
>
>2015-07-15 09:54:07.778489 7f6de9ac4700 20 [trace] xio_nexus.c:384
>xio_nexus_send_setup_req - send setup request
>
>
>2015-07-15 09:54:07.778492 7f6de9ac4700 20 [trace] xio_nexus.c:426
>xio_nexus_send_setup_req - xio_nexus_send_setup_req:
>nexus:0x7f6de4055b20, rdma_hndl:0x7f6de4055d40
>
>
>2015-07-15 09:54:07.778497 7f6de9ac4700 20 [trace]
>xio_rdma_datapath.c:4180 xio_rdma_send_setup_req - rdma send setup
>request
>
>
>2015-07-15 09:54:07.782250 7f6de9ac4700 20 [trace]
>xio_rdma_datapath.c:4319 xio_rdma_on_setup_msg - setup complete.
>send_buf_sz:17408
>
>
>2015-07-15 09:54:07.782258 7f6de9ac4700 20 [trace] xio_nexus.c:661
>xio_nexus_on_recv_setup_rsp - receiving setup response.
>nexus:0x7f6de4055b20
>
>
>2015-07-15 09:54:07.782261 7f6de9ac4700 20 [trace] xio_nexus.c:714
>xio_nexus_on_recv_setup_rsp - xio_nexus_on_recv_setup_rsp:
>nexus:0x7f6de4055b20, trans_hndl:0x7f6de4055d40
>
>
>2015-07-15 09:54:07.782866 7f6de9ac4700 20 [trace]
>xio_rdma_verbs.c:240 xio_reg_mr_ex_dev - before ibv_reg_mr
>
>
>2015-07-15 09:54:07.785038 7f6de9ac4700 20 [trace]
>xio_rdma_verbs.c:242 xio_reg_mr_ex_dev - after ibv_reg_mr
>
>
>2015-07-15 09:54:07.785045 7f6de9ac4700 20 [trace]
>xio_rdma_verbs.c:240 xio_reg_mr_ex_dev - before ibv_reg_mr
>
>
>2015-07-15 09:54:07.785247 7f6de9ac4700 20 [trace]
>xio_rdma_verbs.c:242 xio_reg_mr_ex_dev - after ibv_reg_mr
>
>
>2015-07-15 09:54:07.785257 7f6de9ac4700 2 [debug]
>xio_rdma_management.c:1572 xio_rdma_primary_pool_slab_pre_create -
>pool buf:0x7f6de31e7000, mr:0x7f6ddc0093b0
>
>
>2015-07-15 09:54:07.785495 7f6de9ac4700 2 [debug]
>xio_session_client.c:793 xio_client_on_nexus_event - session:
>[notification] - nexus established. session:0x7f6de40558f0,
>nexus:0x7f6de4055b20
>
>
>2015-07-15 09:54:07.785692 7f6de9ac4700 2 [debug]
>xio_session_client.c:503 xio_on_setup_rsp_recv - task recycled
>
>
>2015-07-15 09:54:07.785700 7f6de9ac4700 20 [trace]
>xio_session_client.c:517 xio_on_setup_rsp_recv - session state is now
>ONLINE. session:0x7f6de40558f0
>
>
>2015-07-15 09:54:07.785703 7f6de9ac4700 20 [trace]
>xio_session_client.c:547 xio_on_setup_rsp_recv - session state is now
>ACCEPT. session:0x7f6de40558f0
>
>
>2015-07-15 09:54:07.785711 7f6de9ac4700 2 [debug]
>xio_connection.c:1719 xio_disconnect_initial_connection - send fin
>request. session:0x7f6de40558f0, connection:0x7f6ddc015aa0
>
>
>2015-07-15 09:54:07.785715 7f6de9ac4700 20 [trace]
>xio_connection.c:1726 xio_disconnect_initial_connection - connection
>0x7f6ddc015aa0 state change: current_state:ONLINE,
>next_state:FIN_WAIT_1
>
>
>2015-07-15 09:54:07.785725 7f6de9ac4700 20 [trace]
>xio_rdma_management.c:2646 xio_rdma_open - xio_rdma_open: [new]
>handle:0x7f6ddc0162d0
>
>
>2015-07-15 09:54:07.785727 7f6de9ac4700 20 [trace] xio_nexus.c:1903
>xio_nexus_open - nexus: [new] nexus:0x7f6ddc0160b0,
>transport_hndl:0x7f6ddc0162d0
>
>
>2015-07-15 09:54:07.785728 7f6de9ac4700 2 [debug]
>xio_session_client.c:196 xio_session_accept_connections - reconnecting
>to rdma://10.13.10.189:6800. connection:0x7f6de4056e20,
>nexus:0x7f6ddc0160b0
>
>
>2015-07-15 09:54:07.785734 7f6de9ac4700 20 [trace] xio_nexus.c:1993
>xio_nexus_connect - xio_nexus_connect: nexus:0x7f6ddc0160b0,
>rdma_hndl:0x7f6ddc0162d0, portal:rdma://10.13.10.189:6800
>
>
>2015-07-15 09:54:07.785761 7f6de9ac4700 2 [debug]
>xio_connection.c:2526 xio_on_fin_req_recv - fin request received.
>session:0x7f6de40558f0, connection:0x7f6ddc015aa0
>
>
>2015-07-15 09:54:07.785764 7f6de9ac4700 2 [debug]
>xio_connection.c:1682 xio_send_fin_ack - send fin response.
>session:0x7f6de40558f0, connection:0x7f6ddc015aa0
>
>
>2015-07-15 09:54:07.785767 7f6de9ac4700 2 [debug]
>xio_connection.c:2367 xio_on_fin_req_send_comp - got fin request send
>completion. session:0x7f6de40558f0, connection:0x7f6ddc015aa0
>
>
>2015-07-15 09:54:07.785769 7f6de9ac4700 2 [debug]
>xio_connection.c:2454 xio_on_fin_ack_recv - got fin ack.
>session:0x7f6de40558f0, connection:0x7f6ddc015aa0
>
>
>2015-07-15 09:54:07.785771 7f6de9ac4700 2 [debug]
>xio_connection.c:2495 xio_on_fin_ack_recv - connection 0x7f6ddc015aa0
>state change: current_state:FIN_WAIT_1, next_state:FIN_WAIT_2
>
>
>2015-07-15 09:54:07.785772 7f6de9ac4700 2 [debug]
>xio_connection.c:2560 xio_on_fin_ack_send_comp - fin ack send
>completion received. session:0x7f6de40558f0, connection:0x7f6ddc015aa0
>
>
>2015-07-15 09:54:07.785773 7f6de9ac4700 2 [debug]
>xio_connection.c:2583 xio_on_fin_ack_send_comp - connection
>0x7f6ddc015aa0 state change: current_state:FIN_WAIT_2,
>next_state:TIME_WAIT
>
>
>2015-07-15 09:54:07.785784 7f6de9ac4700 2 [debug]
>xio_rdma_management.c:2386 xio_handle_cm_event - cm event:
>[RDMA_CM_EVENT_ADDR_RESOLVED], hndl:0x7f6ddc0162d0, status:0
>
>
>2015-07-15 09:54:07.786008 7f6de9ac4700 2 [debug]
>xio_rdma_management.c:2386 xio_handle_cm_event - cm event:
>[RDMA_CM_EVENT_ROUTE_RESOLVED], hndl:0x7f6ddc0162d0, status:0
>
>
>2015-07-15 09:54:07.786468 7f6de9ac4700 2 [debug]
>xio_rdma_management.c:961 xio_qp_create - rdma qp: [new]
>handle:0x7f6ddc0162d0, qp:0x262, max inline:448
>
>
>2015-07-15 09:54:07.787793 7f6de9ac4700 2 [debug]
>xio_connection.c:2383 xio_close_time_wait - connection 0x7f6ddc015aa0
>state change: current_state:TIME_WAIT, next_state:CLOSED
>
>
>2015-07-15 09:54:07.787804 7f6de9ac4700 2 [debug]
>xio_connection.c:2183 xio_connection_destroy - xio_connection_destroy.
>session:0x7f6de40558f0, connection:0x7f6ddc015aa0 nexus:0x7f6de4055b20
>nr:1, state:CLOSED
>
>
>2015-07-15 09:54:07.787807 7f6de9ac4700 2 [debug]
>xio_connection.c:2108 xio_connection_post_destroy -
>xio_connection_post_destroy. session:0x7f6de40558f0,
>connection:0x7f6ddc015aa0 conn:0x7f6de4055b20 nr:1
>
>
>2015-07-15 09:54:07.787810 7f6de9ac4700 20 [trace] xio_nexus.c:2139
>xio_nexus_close - nexus: [putref] ptr:0x7f6de4055b20, refcnt:1
>
>
>2015-07-15 09:54:07.787812 7f6de9ac4700 2 [debug]
>xio_session_client.c:808 xio_client_on_nexus_event - session:
>[notification] - nexus closed. session:0x7f6de40558f0,
>nexus:0x7f6de4055b20
>
>
>2015-07-15 09:54:07.787814 7f6de9ac4700 20 [trace] xio_session.c:987
>xio_on_nexus_closed - session:0x7f6de40558f0 - nexus:0x7f6de4055b20
>close complete
>
>
>2015-07-15 09:54:07.787816 7f6de9ac4700 20 [trace] xio_nexus.c:2111
>xio_nexus_delayed_close - xio_nexus_deleyed close.
>nexus:0x7f6de4055b20, state:4
>
>
>2015-07-15 09:54:07.787817 7f6de9ac4700 20 [trace]
>xio_connection.c:2122 xio_connection_post_destroy - lead connection is
>closed
>
>
>2015-07-15 09:54:07.790664 7f6de9ac4700 2 [debug]
>xio_rdma_management.c:2386 xio_handle_cm_event - cm event:
>[RDMA_CM_EVENT_ESTABLISHED], hndl:0x7f6ddc0162d0, status:0
>
>
>2015-07-15 09:54:07.790672 7f6de9ac4700 2 [debug] xio_nexus.c:1632
>xio_nexus_on_transport_event - nexus: [notification] - transport
>established. nexus:0x7f6ddc0160b0, transport:0x7f6ddc0162d0
>
>
>2015-07-15 09:54:07.790675 7f6de9ac4700 2 [debug]
>xio_rdma_management.c:1274 xio_rdma_initial_pool_post_create -
>post_recv conn_setup rx task:0x7f6ddc009400
>
>
>2015-07-15 09:54:07.790680 7f6de9ac4700 20 [trace] xio_nexus.c:384
>xio_nexus_send_setup_req - send setup request
>
>
>2015-07-15 09:54:07.790682 7f6de9ac4700 20 [trace] xio_nexus.c:426
>xio_nexus_send_setup_req - xio_nexus_send_setup_req:
>nexus:0x7f6ddc0160b0, rdma_hndl:0x7f6ddc0162d0
>
>
>2015-07-15 09:54:07.790683 7f6de9ac4700 20 [trace]
>xio_rdma_datapath.c:4180 xio_rdma_send_setup_req - rdma send setup
>request
>
>
>2015-07-15 09:54:07.794359 7f6de9ac4700 20 [trace]
>xio_rdma_datapath.c:4319 xio_rdma_on_setup_msg - setup complete.
>send_buf_sz:17408
>
>
>2015-07-15 09:54:07.794367 7f6de9ac4700 20 [trace] xio_nexus.c:661
>xio_nexus_on_recv_setup_rsp - receiving setup response.
>nexus:0x7f6ddc0160b0
>
>
>2015-07-15 09:54:07.794369 7f6de9ac4700 20 [trace] xio_nexus.c:714
>xio_nexus_on_recv_setup_rsp - xio_nexus_on_recv_setup_rsp:
>nexus:0x7f6ddc0160b0, trans_hndl:0x7f6ddc0162d0
>
>
>2015-07-15 09:54:07.795445 7f6de9ac4700 20 [trace]
>xio_rdma_verbs.c:240 xio_reg_mr_ex_dev - before ibv_reg_mr
>
>
>2015-07-15 09:54:07.798625 7f6de9ac4700 20 [trace]
>xio_rdma_verbs.c:242 xio_reg_mr_ex_dev - after ibv_reg_mr
>
>
>2015-07-15 09:54:07.798633 7f6de9ac4700 20 [trace]
>xio_rdma_verbs.c:240 xio_reg_mr_ex_dev - before ibv_reg_mr
>
>
>2015-07-15 09:54:07.798929 7f6de9ac4700 20 [trace]
>xio_rdma_verbs.c:242 xio_reg_mr_ex_dev - after ibv_reg_mr
>
>
>2015-07-15 09:54:07.798940 7f6de9ac4700 2 [debug]
>xio_rdma_management.c:1572 xio_rdma_primary_pool_slab_pre_create -
>pool buf:0x7f6de25a1000, mr:0x7f6ddc015d40
>
>
>2015-07-15 09:54:07.799371 7f6de9ac4700 2 [debug]
>xio_session_client.c:793 xio_client_on_nexus_event - session:
>[notification] - nexus established. session:0x7f6de40558f0,
>nexus:0x7f6ddc0160b0
>
>
>2015-07-15 09:54:07.799376 7f6de9ac4700 2 [debug]
>xio_connection.c:2030 xio_connection_send_hello_req - send hello
>request. session:0x7f6de40558f0, connection:0x7f6de4056e20
>
>
>2015-07-15 09:54:07.799522 7f6de9ac4700 2 [debug]
>xio_connection.c:2643 xio_on_connection_hello_rsp_recv - recv hello
>response. session:0x7f6de40558f0, connection:0x7f6de4056e20
>
>
>2015-07-15 09:54:07.799530 7f6de9ac4700 2 [debug]
>xio_connection.c:2647 xio_on_connection_hello_rsp_recv - got hello
>response. session:0x7f6de40558f0, connection:0x7f6de4056e20
>
>
>2015-07-15 09:54:07.799533 7f6de9ac4700 4 session event: connection
>established. reason: Success
>
>2015-07-15 09:54:07.799538 7f6de9ac4700 2 connection established
>0x7f6de4056e20 session 0x7f6de40558f0 xcon 0x7f6de4055460
>
>2015-07-15 09:54:07.799541 7f6de9ac4700 2 learned my addr
>10.13.10.189:0/1003368
>
>2015-07-15 09:54:07.799546 7f6de9ac4700 2 client: connected from
>10.13.10.189:49134/0 to 10.13.10.189:6800/0
>
>2015-07-15 09:54:07.799576 7f6de9ac4700 11 on_msg_delivered xcon:
>0x7f6de4055460 session: 0x7f6de40558f0 msg: 0x7f6df014f160 sn: 0 type:
>17 tid: 0 seq: 1
>
>2015-07-15 09:54:07.804553 7f6de9ac4700 10 on_msg_req receive req treq
>0x7f6de8162598 msg_cnt 1 iov_base 0x7f6de31f1060 iov_len 216 nents 1
>conn 0x7f6de4056e20 sess 0x7f6de40558f0 sn 0
>
>2015-07-15 09:54:07.804564 7f6de9ac4700 4 on_msg_req msg_seq.size()=1
>
>2015-07-15 09:54:07.804590 7f6de9ac4700 10 on_msg_req receive req treq
>0x7f6de8160178 msg_cnt 1 iov_base 0x7f6de31e7060 iov_len 216 nents 1
>conn 0x7f6de4056e20 sess 0x7f6de40558f0 sn 1
>
>2015-07-15 09:54:07.804594 7f6de9ac4700 4 on_msg_req msg_seq.size()=1
>
>mon/MonMap.h: In function 'void MonMap::calc_ranks()' thread
>7f6de92c3700 time 2015-07-15 09:54:07.804654
>
>mon/MonMap.h: 47: FAILED assert(addr_name.count(p->second) == 0)
>
> ceph version 9.0.1-1494-g8fc0496
>(8fc049664bc798432e1750da86b1f216f85a842d)
>
> 1: (()+0x12637b) [0x7f6df585037b]
>
> 2: (()+0x1bf86d) [0x7f6df58e986d]
>
> 3: (()+0x1b6709) [0x7f6df58e0709]
>
> 4: (()+0x1b7177) [0x7f6df58e1177]
>
> 5: (()+0x2faa27) [0x7f6df5a24a27]
>
> 6: (()+0x30fd2d) [0x7f6df5a39d2d]
>
> 7: (()+0x3100b0) [0x7f6df5a3a0b0]
>
> 8: (()+0x8182) [0x7f6dfa785182]
>
> 9: (clone()+0x6d) [0x7f6dfa4b247d]
>
> NOTE: a copy of the executable, or `objdump -rdS <executable>` is
>needed to interpret this.
>
>terminate called after throwing an instance of 'ceph::FailedAssertion'
>
>Aborted (core dumped) -> No Idea where this core is! looked at PWD, &
>/var/crash
>
>
>
>Any help / pointers greatly appreciated (-:
>
>-Neo
>
>
>
>On Sun, Jul 12, 2015 at 1:25 AM, Joao Eduardo Luis <joao@suse.de>
>wrote:
>> On 07/10/2015 11:31 PM, kernel neophyte wrote:
>>> Hi,
>>>
>>> I am seeing the following error every time I am trying to manually
>>> deploy ceph cluster and do a ceph -s :
>>>
>>> mon/MonMap.h: In function 'void MonMap::calc_ranks()' thread
>>> 7fb3ccfb6700 time 2015-07-10 15:27:56.004148
>>>
>>> mon/MonMap.h: 47: FAILED assert(addr_name.count(p->second) == 0)
>>
>> Please send us a copy of your monmap and ceph.conf.
>>
>> You should also open a ticket on the tracker.
>>
>> Thanks!
>>
>> -Joao
>>
>>>
>>> ceph version 9.0.1-1445-g4a179ee
>>>(4a179eea527f7cbcf45eed4a63ad0fa8f744fc4a)
>>>
>>> 1: (()+0x12716b) [0x7fb3d53ec16b]
>>>
>>> 2: (()+0x1c0a6d) [0x7fb3d5485a6d]
>>>
>>> 3: (()+0x1b7909) [0x7fb3d547c909]
>>>
>>> 4: (()+0x1b8377) [0x7fb3d547d377]
>>>
>>> 5: (()+0x2fbbe7) [0x7fb3d55c0be7]
>>>
>>> 6: (()+0x310f1d) [0x7fb3d55d5f1d]
>>>
>>> 7: (()+0x3112a0) [0x7fb3d55d62a0]
>>>
>>> 8: (()+0x8182) [0x7fb3d9e05182]
>>>
>>> 9: (clone()+0x6d) [0x7fb3d9b3247d]
>>>
>>> NOTE: a copy of the executable, or `objdump -rdS <executable>` is
>>> needed to interpret this.
>>>
>>> terminate called after throwing an instance of
>>>'ceph::FailedAssertion'
>>>
>>> Aborted (core dumped)
>>>
>>> Any help greatly appreciated :-)
>>>
>>> -Neo
>>> --
>>> To unsubscribe from this list: send the line "unsubscribe
>>>ceph-devel" in
>>> the body of a message to majordomo@vger.kernel.org
>>> More majordomo info at http://vger.kernel.org/majordomo-info.html
>>>
>>
^ permalink raw reply [flat|nested] 4+ messages in thread
end of thread, other threads:[~2015-07-15 19:33 UTC | newest]
Thread overview: 4+ messages (download: mbox.gz follow: Atom feed
-- links below jump to the message on this page --
2015-07-10 22:31 Ceph-mon crashing FAILED assert(addr_name.count(p->second) == 0) kernel neophyte
2015-07-12 8:25 ` Joao Eduardo Luis
2015-07-15 17:12 ` kernel neophyte
2015-07-15 19:19 ` Vu Pham
This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.