* Bug #1047 reproduced
@ 2011-12-01  8:37 Amon Ott
  2011-12-02 11:17 ` Amon Ott
From: Amon Ott @ 2011-12-01  8:37 UTC
  To: ceph-devel

[-- Attachment #1: Type: text/plain, Size: 608 bytes --]

On all four nodes of my test cluster, the MDS crashes with a trace like the
one in bug #1047. An example log and my ceph.conf are attached. The Ceph
server side is built from git master, last commit
ce6572273943ffdca4b7dc5344152d6c35106a2d.

The MDS does not start on any node here; it reliably crashes with that assert.

Amon Ott
-- 
Dr. Amon Ott
m-privacy GmbH           Tel: +49 30 24342334
Am Köllnischen Park 1    Fax: +49 30 24342336
10179 Berlin             http://www.m-privacy.de

Amtsgericht Charlottenburg, HRB 84946

Geschäftsführer:
 Dipl.-Kfm. Holger Maczkowsky,
 Roman Maczkowsky

GnuPG-Key-ID: 0x2DD3A649

[-- Attachment #2: ceph.conf --]
[-- Type: text/plain, Size: 1054 bytes --]

[global]
	pid file = /var/run/ceph/$name.pid
	debug ms = 1
	keyring = /etc/ceph/keyring
	cluster_network = 192.168.111.0/24
[mon]
	mon data = /var/lib/ceph/mon
; Use odd number of monitors, three is good, five or more on big clusters
[mon.0]
	host = tgpro1
	mon addr = 192.168.111.1
[mon.1]
	host = tgpro2
	mon addr = 192.168.111.2
[mon.2]
	host = tgpro3
	mon addr = 192.168.111.3
[mds]
	max mds = 2
[mds.0]
	host = tgpro1
	mds addr = 192.168.111.1
	mds standby replay = true
[mds.1]
	host = tgpro2
	mds addr = 192.168.111.2
	mds standby replay = true
[mds.2]
	host = tgpro3
	mds addr = 192.168.111.3
	mds standby replay = true
[mds.3]
	host = tgpro4
	mds addr = 192.168.111.4
	mds standby replay = true
[osd]
	sudo = true
	osd data = /ceph/data
;	osd journal = /ceph/journal
;	osd journal size = 512
	osd journal = /dev/sda7
	filestore journal = writeahead
[osd.0]
	host = tgpro1
	osd addr = 192.168.111.1
[osd.1]
	host = tgpro2
	osd addr = 192.168.111.2
[osd.2]
	host = tgpro3
	osd addr = 192.168.111.3
[osd.3]
	host = tgpro4
	osd addr = 192.168.111.4

[-- Attachment #3: mds-crash-anchor_map.log --]
[-- Type: text/x-log, Size: 1747 bytes --]

2011-12-01 09:24:48.852444 486c1b70 -- 192.168.111.4:6802/25235 <== mds.0 192.168.111.4:6802/25235 0 ==== mds_table_request(anchortable query 8 bytes) v1 ==== 0+0+0 (0 0 0) 0x113a5240 con 0x110c0000
mds/AnchorServer.cc: In function 'virtual void AnchorServer::handle_query(MMDSTableRequest*)', in thread '486c1b70'
mds/AnchorServer.cc: 249: FAILED assert(anchor_map.count(curino) == 1)
 ceph version  (commit:)
 1: (AnchorServer::handle_query(MMDSTableRequest*)+0x1c2) [0x10dfd272]
 2: (MDSTableServer::handle_request(MMDSTableRequest*)+0xd4) [0x10dfbb54]
 3: (MDS::handle_deferrable_message(Message*)+0xe01) [0x10b97611]
 4: (MDS::_dispatch(Message*)+0x1ae2) [0x10baf402]
 5: (MDS::ms_dispatch(Message*)+0xa5) [0x10bafac5]
 6: (SimpleMessenger::dispatch_entry()+0x7c9) [0x10ec3fd9]
 7: (SimpleMessenger::DispatchThread::entry()+0x3b) [0x10b83f7b]
 8: (Thread::_entry_func(void*)+0x1c) [0x10e7c3bc]
 9: (()+0x5905) [0x4adfe905]
 10: (clone()+0x5e) [0x4a7968ce]
 ceph version  (commit:)
 1: (AnchorServer::handle_query(MMDSTableRequest*)+0x1c2) [0x10dfd272]
 2: (MDSTableServer::handle_request(MMDSTableRequest*)+0xd4) [0x10dfbb54]
 3: (MDS::handle_deferrable_message(Message*)+0xe01) [0x10b97611]
 4: (MDS::_dispatch(Message*)+0x1ae2) [0x10baf402]
 5: (MDS::ms_dispatch(Message*)+0xa5) [0x10bafac5]
 6: (SimpleMessenger::dispatch_entry()+0x7c9) [0x10ec3fd9]
 7: (SimpleMessenger::DispatchThread::entry()+0x3b) [0x10b83f7b]
 8: (Thread::_entry_func(void*)+0x1c) [0x10e7c3bc]
 9: (()+0x5905) [0x4adfe905]
 10: (clone()+0x5e) [0x4a7968ce]
*** Caught signal (Segmentation fault) **
 in thread 486c1b70
 ceph version  (commit:)
 1: (()+0x4703a3) [0x10f613a3]
 2: [0x4ae40400]
 3: (abort()+0xea) [0x4a6f653a]
reraise_fatal: failed to re-raise signal 11
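
For readers comparing this trace with the one in bug #1047, here is a minimal sketch of pulling the failed assertion and the symbol names out of such a log. It is not part of the original report. The function name `summarize` and both regular expressions are assumptions based only on the line shapes visible in the attachment above, and the embedded excerpt is copied from that attachment.

```python
import re

# Excerpt copied from the attached mds-crash-anchor_map.log.
LOG = """\
mds/AnchorServer.cc: In function 'virtual void AnchorServer::handle_query(MMDSTableRequest*)', in thread '486c1b70'
mds/AnchorServer.cc: 249: FAILED assert(anchor_map.count(curino) == 1)
 ceph version  (commit:)
 1: (AnchorServer::handle_query(MMDSTableRequest*)+0x1c2) [0x10dfd272]
 2: (MDSTableServer::handle_request(MMDSTableRequest*)+0xd4) [0x10dfbb54]
 3: (MDS::handle_deferrable_message(Message*)+0xe01) [0x10b97611]
"""

# Hypothetical patterns, written against the line shapes seen in this log only.
ASSERT_RE = re.compile(
    r"^(?P<file>\S+): (?P<line>\d+): FAILED assert\((?P<expr>.*)\)$"
)
FRAME_RE = re.compile(
    r"^\s*(?P<n>\d+): \((?P<sym>.*)\+0x[0-9a-f]+\) \[0x[0-9a-f]+\]$"
)


def summarize(log_text):
    """Return ((file, line, expression), [symbols]) for the first crash found."""
    failure = None
    frames = []
    for line in log_text.splitlines():
        m = ASSERT_RE.match(line)
        if m and failure is None:
            failure = (m["file"], int(m["line"]), m["expr"])
            continue
        m = FRAME_RE.match(line)
        if m:
            frames.append(m["sym"])
    return failure, frames


failure, frames = summarize(LOG)
print(failure)
for sym in frames:
    print("  at", sym)
```

Two logs can then be compared by checking whether the assertion tuple and the leading frames agree, which is less error-prone than comparing raw addresses.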

Thread overview: 14+ messages
2011-12-01  8:37 Bug #1047 reproduced Amon Ott
2011-12-02 11:17 ` Amon Ott
2011-12-02 16:57   ` Sage Weil
2011-12-05 10:21     ` Amon Ott
2011-12-06 13:20       ` Amon Ott
2011-12-21 12:37     ` Amon Ott
2011-12-21 16:18       ` Gregory Farnum
2011-12-21 16:36         ` Amon Ott
2011-12-23  0:27           ` Gregory Farnum
2011-12-23  9:58             ` Amon Ott
2011-12-29 11:30             ` Amon Ott
2012-01-27 14:23               ` Amon Ott
2012-01-27 14:50                 ` Sage Weil
2012-01-27 18:54                   ` Gregory Farnum
