From mboxrd@z Thu Jan 1 00:00:00 1970 Return-Path: Received: from relay.sgi.com (relay1.corp.sgi.com [137.38.102.111]) by oss.sgi.com (Postfix) with ESMTP id 3A96C7F37 for ; Thu, 13 Jun 2013 08:10:44 -0500 (CDT) Received: from cuda.sgi.com (cuda2.sgi.com [192.48.176.25]) by relay1.corp.sgi.com (Postfix) with ESMTP id 21E2E8F8054 for ; Thu, 13 Jun 2013 06:10:40 -0700 (PDT) Received: from sandeen.net (sandeen.net [63.231.237.45]) by cuda.sgi.com with ESMTP id r9e273HAvgccmEOH for ; Thu, 13 Jun 2013 06:10:39 -0700 (PDT) Message-ID: <51B9C4C9.5000803@sandeen.net> Date: Thu, 13 Jun 2013 08:10:33 -0500 From: Eric Sandeen MIME-Version: 1.0 Subject: Re: Kernel bug when mounting xfs on dm-crypt + md-raid6 using 3.10-rc5 References: <51B9C28F.2020703@skagestad.org> In-Reply-To: <51B9C28F.2020703@skagestad.org> List-Id: XFS Filesystem from SGI List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Content-Type: text/plain; charset="iso-8859-1" Content-Transfer-Encoding: quoted-printable Errors-To: xfs-bounces@oss.sgi.com Sender: xfs-bounces@oss.sgi.com To: =?ISO-8859-1?Q?Torbj=F8rn?= Cc: xfs@oss.sgi.com On 6/13/13 8:01 AM, Torbj=F8rn wrote: > Hi, > = > I have a 8 drive md-raid6 + dm-crypt with xfs on top. > When trying to mount using 3.10-rc5 (ubuntu mainline ppa) I get the follo= wing kernel bug: > = > [ 1017.056091] SGI XFS with ACLs, security attributes, realtime, large bl= ock/inode numbers, no debug enabled > [ 1017.057607] XFS (dm-11): Mounting Filesystem > [ 1017.195409] ------------[ cut here ]------------ > [ 1017.195881] Kernel BUG at ffffffff81485fb2 [verbose debug info unavail= able] Hm that's not so helpful :( So we don't have thread info or line number information. > [ 1017.196603] invalid opcode: 0000 [#1] SMP > [ 1017.197050] Modules linked in: xfs vhost_net macvtap macvlan ip6table_= filter ip6_tables ebtable_nat ebtables ipt_MASQUERADE ipt_REJECT xt_CHECKSU= M sch_prio bridge stp llc xt_state iptable_filter dm_crypt xt_CLASSIFY xt_t= cpudp xt_DSCP iptable_mangle iptable_nat nf_conntrack_ipv4 nf_defrag_ipv4 n= f_nat_ipv4 nf_nat nf_conntrack ip_tables x_tables intel_powerclamp kvm_inte= l kvm psmouse serio_raw microcode ppdev lpc_ich mac_hid parport_pc w83627eh= f hwmon_vid coretemp nfsd lp nfs_acl auth_rpcgss nfs parport fscache lockd = sunrpc btrfs zlib_deflate libcrc32c raid1 raid0 multipath linear raid456 as= ync_pq async_xor xor async_memcpy async_raid6_recov raid6_pq async_tx raid1= 0 hid_generic usbhid hid ast ttm crc32_pclmul drm_kms_helper ghash_clmulni_= intel drm aesni_intel ablk_helper cryptd lrw gf128mul glue_helper e1000e mp= t2sas i2c_algo_bit ptp sysimgblt sysfillrect pps_core ahci aes_x86_64 sysco= pyarea scsi_transport_sas libahci raid_class video > [ 1017.206695] CPU: 1 PID: 486 Comm: md0_raid6 Not tainted 3.10.0-031000r= c5-generic #201306082135 > [ 1017.207603] Hardware name: To be filled by O.E.M. To be filled by O.E.= M./P8B-X series, BIOS 2107 05/04/2012 > [ 1017.208681] task: ffff88040e509770 ti: ffff88040de2a000 task.ti: ffff8= 8040de2a000 > [ 1017.209498] RIP: 0010:[] [] scsi_s= etup_fs_cmnd.part.32+0x82/0x90 so it crashed in scsi, and nothing in the stack is from xfs. Barring weird interactions, I think you need to look elsewhere for the bug; this doesn't look like an xfs problem to me. Actually, https://lkml.org/lkml/2013/6/12/440 looks relevant, which references https://lkml.org/lkml/2013/5/19/75 Guessing this is an md bug. -Eric > [ 1017.210467] RSP: 0018:ffff88040de2bb68 EFLAGS: 00010046 > [ 1017.211021] RAX: 0000000000000000 RBX: ffff8804106d4800 RCX: 000000000= 0000002 > [ 1017.211772] RDX: 0000000000001000 RSI: ffff8803d3b89028 RDI: ffff88041= 06d4800 > [ 1017.212521] RBP: ffff88040de2bb78 R08: ffff8803d3b88f30 R09: ffff9ef77= 4422900 > [ 1017.213300] R10: 0000000018422880 R11: 00000000ffffffff R12: ffff8803d= 3b89028 > [ 1017.214054] R13: 0000000000000001 R14: ffff8804106d4800 R15: ffff88041= 032a800 > [ 1017.214802] FS: 0000000000000000(0000) GS:ffff88042fc40000(0000) knlG= S:0000000000000000 > [ 1017.215691] CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033 > [ 1017.216287] CR2: 00007f9eaa5a4000 CR3: 0000000001c0c000 CR4: 000000000= 01427e0 > [ 1017.217069] DR0: 0000000000000000 DR1: 0000000000000000 DR2: 000000000= 0000000 > [ 1017.217819] DR3: 0000000000000000 DR6: 00000000ffff0ff0 DR7: 000000000= 0000400 > [ 1017.218568] Stack: > [ 1017.218769] ffff8804106d4800 ffff8803d3b89028 ffff88040de2bb98 ffffff= ff81485fef > [ 1017.219577] ffff8803d3b89028 ffff8804106ac100 ffff88040de2bc08 ffffff= ff81496c0f > [ 1017.220379] ffff88040de2bbd8 ffffffff81328aa5 ffff880400001000 000000= 0018422880 > [ 1017.221176] Call Trace: > [ 1017.221415] [] scsi_setup_fs_cmnd+0x2f/0x40 > [ 1017.222024] [] sd_prep_fn+0xff/0xb00 > [ 1017.222567] [] ? deadline_remove_request.isra.3+0x5= 5/0x90 > [ 1017.223336] [] blk_peek_request+0xfe/0x270 > [ 1017.223953] [] scsi_request_fn+0x4f/0x430 > [ 1017.224546] [] __blk_run_queue+0x37/0x50 > [ 1017.225145] [] queue_unplugged+0x3d/0xc0 > [ 1017.225723] [] blk_flush_plug_list+0x183/0x210 > [ 1017.226360] [] blk_finish_plug+0x18/0x50 > [ 1017.226943] [] raid5d+0x1b7/0x1d0 [raid456] > [ 1017.227548] [] md_thread+0x11d/0x170 > [ 1017.228090] [] ? add_wait_queue+0x60/0x60 > [ 1017.228681] [] ? md_rdev_init+0x110/0x110 > [ 1017.229274] [] kthread+0xc0/0xd0 > [ 1017.229795] [] ? flush_kthread_worker+0xb0/0xb0 > [ 1017.230468] [] ret_from_fork+0x7c/0xb0 > [ 1017.231048] [] ? flush_kthread_worker+0xb0/0xb0 > [ 1017.231719] Code: fd ff ff 5b 41 5c 5d c3 48 8b 00 48 85 c0 74 b7 48 8= b 40 48 48 85 c0 74 ae ff d0 85 c0 74 a8 eb e2 b8 02 00 00 00 0f 1f 00 eb d= 8 <0f> 0b 66 66 66 2e 0f 1f 84 00 00 00 00 00 0f 1f 44 00 00 55 48 > [ 1017.234403] RIP [] scsi_setup_fs_cmnd.part.32+0x82/= 0x90 > [ 1017.235121] RSP > [ 1017.482522] ---[ end trace fa18c0d8cd90bd2f ]--- > = > 3.10-rc4 has the same issue. I have not tried any earlier 3.10 kernels > The system mounts fine using 3.9.5 (also ubuntu ppa) > = > If I can provide any other info to help, please let me know. > = > -- = > Torbj=F8rn > = > _______________________________________________ > xfs mailing list > xfs@oss.sgi.com > http://oss.sgi.com/mailman/listinfo/xfs > = _______________________________________________ xfs mailing list xfs@oss.sgi.com http://oss.sgi.com/mailman/listinfo/xfs