From mboxrd@z Thu Jan 1 00:00:00 1970 From: Guido Winkelmann Subject: Kernel crash Date: Wed, 30 May 2012 19:35:35 +0200 Message-ID: <1934548.axdquzlaJh@pc10> Mime-Version: 1.0 Content-Type: text/plain; charset="us-ascii" Content-Transfer-Encoding: 7Bit Return-path: Received: from unknownsite.de ([62.48.69.106]:47292 "EHLO hartes-hannover.de" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1753290Ab2E3Rfm (ORCPT ); Wed, 30 May 2012 13:35:42 -0400 Received: from pc10.localnet (pc10.asys-h.de [193.98.1.90]) by hartes-hannover.de (Postfix) with ESMTPSA id BD05410C866 for ; Wed, 30 May 2012 19:35:40 +0200 (CEST) Sender: ceph-devel-owner@vger.kernel.org List-ID: To: ceph-devel@vger.kernel.org Hi, I just saw a kernel crash on one of my machines. It had the cephfs from the ceph cluster mounted using the in-kernel client: [522247.751071] [] ? release_sock+0xe3/0x110 [522247.751182] [] __bad_area_nosemaphore+0x1d1/0x1f0 [522247.751290] [] bad_area_nosemaphore+0x13/0x15 [522247.751397] [] do_page_fault+0x416/0x4f0 [522247.751503] [] ? sock_recvmsg+0x11d/0x140 [522247.751611] [] ? cpumask_next_and+0x36/0x50 [522247.751718] [] page_fault+0x25/0x30 [522247.751828] [] ? ceph_x_destroy_authorizer+0x14/0x40 [libceph] [522247.751995] [] get_authorizer+0x6e/0x140 [ceph] [522247.752104] [] ? kernel_recvmsg+0x46/0x60 [522247.752213] [] prepare_write_connect+0x17a/0x270 [libceph] [522247.752378] [] con_work+0x755/0x2c40 [libceph] [522247.752486] [] ? update_rq_clock+0x43/0x1b0 [522247.752598] [] ? ceph_msg_new+0x2d0/0x2d0 [libceph] [522247.752707] [] process_one_work+0x11e/0x470 [522247.752815] [] worker_thread+0x15f/0x360 [522247.752925] [] ? manage_workers+0x230/0x230 [522247.753032] [] kthread+0x93/0xa0 [522247.753137] [] kernel_thread_helper+0x4/0x10 [522247.753245] [] ? kthread_freezable_should_stop+0x70/0x70 [522247.753355] [] ? gs_change+0x13/0x13 [522247.753459] ---[ end trace b9ba686594d99f89 ]--- These lines are all that I could still read on the screen. (Good thing there's Opens Source OCR programs out there...) I do not know how to extract more information about that crash (scrolling up does not work), but I'm leaving the machine like that over night in case someone can tell me. Kernel version was 3.3.6-3.fc16.x86_64, Ceph cluster is version 0.47.2. The crash happened after I issued an rbd command. Another thing that might be related is that I stopped and restarted the entire cluster twice since mounting the cephfs. The first time, I disabled cephx, the second time I enabled it again. Regards, Guido