From mboxrd@z Thu Jan 1 00:00:00 1970 Return-Path: Received: from relay.sgi.com (relay1.corp.sgi.com [137.38.102.111]) by oss.sgi.com (Postfix) with ESMTP id EE69A7F51 for ; Thu, 13 Jun 2013 08:29:43 -0500 (CDT) Received: from cuda.sgi.com (cuda2.sgi.com [192.48.176.25]) by relay1.corp.sgi.com (Postfix) with ESMTP id CA0738F8037 for ; Thu, 13 Jun 2013 06:29:43 -0700 (PDT) Received: from smtp.domeneshop.no (smtp.domeneshop.no [194.63.252.54]) by cuda.sgi.com with ESMTP id dmfPpeTKWNCc02YX (version=TLSv1 cipher=AES256-SHA bits=256 verify=NO) for ; Thu, 13 Jun 2013 06:29:42 -0700 (PDT) Message-ID: <51B9C943.4030900@skagestad.org> Date: Thu, 13 Jun 2013 15:29:39 +0200 From: =?ISO-8859-1?Q?Torbj=F8rn?= MIME-Version: 1.0 Subject: Re: Kernel bug when mounting xfs on dm-crypt + md-raid6 using 3.10-rc5 References: <51B9C28F.2020703@skagestad.org> <51B9C4C9.5000803@sandeen.net> In-Reply-To: <51B9C4C9.5000803@sandeen.net> List-Id: XFS Filesystem from SGI List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Content-Transfer-Encoding: quoted-printable Content-Type: text/plain; charset="iso-8859-1"; Format="flowed" Errors-To: xfs-bounces@oss.sgi.com Sender: xfs-bounces@oss.sgi.com To: Eric Sandeen Cc: xfs@oss.sgi.com On 06/13/2013 03:10 PM, Eric Sandeen wrote: > On 6/13/13 8:01 AM, Torbj=F8rn wrote: >> Hi, >> >> I have a 8 drive md-raid6 + dm-crypt with xfs on top. >> When trying to mount using 3.10-rc5 (ubuntu mainline ppa) I get the foll= owing kernel bug: >> >> [ 1017.056091] SGI XFS with ACLs, security attributes, realtime, large b= lock/inode numbers, no debug enabled >> [ 1017.057607] XFS (dm-11): Mounting Filesystem >> [ 1017.195409] ------------[ cut here ]------------ >> [ 1017.195881] Kernel BUG at ffffffff81485fb2 [verbose debug info unavai= lable] > Hm that's not so helpful :( So we don't have thread info or > line number information. > >> [ 1017.196603] invalid opcode: 0000 [#1] SMP >> [ 1017.197050] Modules linked in: xfs vhost_net macvtap macvlan ip6table= _filter ip6_tables ebtable_nat ebtables ipt_MASQUERADE ipt_REJECT xt_CHECKS= UM sch_prio bridge stp llc xt_state iptable_filter dm_crypt xt_CLASSIFY xt_= tcpudp xt_DSCP iptable_mangle iptable_nat nf_conntrack_ipv4 nf_defrag_ipv4 = nf_nat_ipv4 nf_nat nf_conntrack ip_tables x_tables intel_powerclamp kvm_int= el kvm psmouse serio_raw microcode ppdev lpc_ich mac_hid parport_pc w83627e= hf hwmon_vid coretemp nfsd lp nfs_acl auth_rpcgss nfs parport fscache lockd= sunrpc btrfs zlib_deflate libcrc32c raid1 raid0 multipath linear raid456 a= sync_pq async_xor xor async_memcpy async_raid6_recov raid6_pq async_tx raid= 10 hid_generic usbhid hid ast ttm crc32_pclmul drm_kms_helper ghash_clmulni= _intel drm aesni_intel ablk_helper cryptd lrw gf128mul glue_helper e1000e m= pt2sas i2c_algo_bit ptp sysimgblt sysfillrect pps_core ahci aes_x86_64 sysc= opyarea scsi_transport_sas libahci raid_class video >> [ 1017.206695] CPU: 1 PID: 486 Comm: md0_raid6 Not tainted 3.10.0-031000= rc5-generic #201306082135 >> [ 1017.207603] Hardware name: To be filled by O.E.M. To be filled by O.E= .M./P8B-X series, BIOS 2107 05/04/2012 >> [ 1017.208681] task: ffff88040e509770 ti: ffff88040de2a000 task.ti: ffff= 88040de2a000 >> [ 1017.209498] RIP: 0010:[] [] scsi_= setup_fs_cmnd.part.32+0x82/0x90 > so it crashed in scsi, and nothing in the stack is from xfs. > > Barring weird interactions, I think you need to look elsewhere for the bu= g; > this doesn't look like an xfs problem to me. > > Actually, > https://lkml.org/lkml/2013/6/12/440 looks relevant, which references > https://lkml.org/lkml/2013/5/19/75 > > Guessing this is an md bug. > > -Eric > >> [ 1017.210467] RSP: 0018:ffff88040de2bb68 EFLAGS: 00010046 >> [ 1017.211021] RAX: 0000000000000000 RBX: ffff8804106d4800 RCX: 00000000= 00000002 >> [ 1017.211772] RDX: 0000000000001000 RSI: ffff8803d3b89028 RDI: ffff8804= 106d4800 >> [ 1017.212521] RBP: ffff88040de2bb78 R08: ffff8803d3b88f30 R09: ffff9ef7= 74422900 >> [ 1017.213300] R10: 0000000018422880 R11: 00000000ffffffff R12: ffff8803= d3b89028 >> [ 1017.214054] R13: 0000000000000001 R14: ffff8804106d4800 R15: ffff8804= 1032a800 >> [ 1017.214802] FS: 0000000000000000(0000) GS:ffff88042fc40000(0000) knl= GS:0000000000000000 >> [ 1017.215691] CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033 >> [ 1017.216287] CR2: 00007f9eaa5a4000 CR3: 0000000001c0c000 CR4: 00000000= 001427e0 >> [ 1017.217069] DR0: 0000000000000000 DR1: 0000000000000000 DR2: 00000000= 00000000 >> [ 1017.217819] DR3: 0000000000000000 DR6: 00000000ffff0ff0 DR7: 00000000= 00000400 >> [ 1017.218568] Stack: >> [ 1017.218769] ffff8804106d4800 ffff8803d3b89028 ffff88040de2bb98 fffff= fff81485fef >> [ 1017.219577] ffff8803d3b89028 ffff8804106ac100 ffff88040de2bc08 fffff= fff81496c0f >> [ 1017.220379] ffff88040de2bbd8 ffffffff81328aa5 ffff880400001000 00000= 00018422880 >> [ 1017.221176] Call Trace: >> [ 1017.221415] [] scsi_setup_fs_cmnd+0x2f/0x40 >> [ 1017.222024] [] sd_prep_fn+0xff/0xb00 >> [ 1017.222567] [] ? deadline_remove_request.isra.3+0x= 55/0x90 >> [ 1017.223336] [] blk_peek_request+0xfe/0x270 >> [ 1017.223953] [] scsi_request_fn+0x4f/0x430 >> [ 1017.224546] [] __blk_run_queue+0x37/0x50 >> [ 1017.225145] [] queue_unplugged+0x3d/0xc0 >> [ 1017.225723] [] blk_flush_plug_list+0x183/0x210 >> [ 1017.226360] [] blk_finish_plug+0x18/0x50 >> [ 1017.226943] [] raid5d+0x1b7/0x1d0 [raid456] >> [ 1017.227548] [] md_thread+0x11d/0x170 >> [ 1017.228090] [] ? add_wait_queue+0x60/0x60 >> [ 1017.228681] [] ? md_rdev_init+0x110/0x110 >> [ 1017.229274] [] kthread+0xc0/0xd0 >> [ 1017.229795] [] ? flush_kthread_worker+0xb0/0xb0 >> [ 1017.230468] [] ret_from_fork+0x7c/0xb0 >> [ 1017.231048] [] ? flush_kthread_worker+0xb0/0xb0 >> [ 1017.231719] Code: fd ff ff 5b 41 5c 5d c3 48 8b 00 48 85 c0 74 b7 48 = 8b 40 48 48 85 c0 74 ae ff d0 85 c0 74 a8 eb e2 b8 02 00 00 00 0f 1f 00 eb = d8 <0f> 0b 66 66 66 2e 0f 1f 84 00 00 00 00 00 0f 1f 44 00 00 55 48 >> [ 1017.234403] RIP [] scsi_setup_fs_cmnd.part.32+0x82= /0x90 >> [ 1017.235121] RSP >> [ 1017.482522] ---[ end trace fa18c0d8cd90bd2f ]--- >> >> 3.10-rc4 has the same issue. I have not tried any earlier 3.10 kernels >> The system mounts fine using 3.9.5 (also ubuntu ppa) >> >> If I can provide any other info to help, please let me know. >> >> -- = >> Torbj=F8rn >> >> _______________________________________________ >> xfs mailing list >> xfs@oss.sgi.com >> http://oss.sgi.com/mailman/listinfo/xfs >> Hi, Thanks for the insight Eric. I'll compile a kernel with proper debug info, and see if linux-raid can = make any use of it. -- Torbj=F8rn _______________________________________________ xfs mailing list xfs@oss.sgi.com http://oss.sgi.com/mailman/listinfo/xfs