From: Daniel Walker <admin@ftwinc.net>
To: linux-raid@vger.kernel.org
Subject: Hard CPU Lockup when accessing MD RAID5
Date: Tue, 12 Apr 2016 21:54:08 +0000 [thread overview]
Message-ID: <570D6E79.1010201@ftwinc.net> (raw)
Im having some issues on a brand new Supermicro server that we have
running in production along side a few other machines which are
identical to this server..
The output from the netconsole attached to the server is here:
Apr 12 21:34:45 [75704.964946] NMI watchdog: Watchdog detected hard
LOCKUP on cpu 6
Apr 12 21:34:45
Apr 12 21:34:45 [75704.964973] Modules linked in:
Apr 12 21:34:45 ipt_REJECT
Apr 12 21:34:45 nf_reject_ipv4
Apr 12 21:34:45 iptable_mangle
Apr 12 21:34:45 tun
Apr 12 21:34:45 netconsole
Apr 12 21:34:45 configfs
Apr 12 21:34:45 xt_multiport
Apr 12 21:34:45 ip6table_filter
Apr 12 21:34:45 ip6_tables
Apr 12 21:34:45 iptable_filter
Apr 12 21:34:45 ip_tables
Apr 12 21:34:45 x_tables
Apr 12 21:34:45 bridge
Apr 12 21:34:45 stp
Apr 12 21:34:45 llc
Apr 12 21:34:45 bonding
Apr 12 21:34:45 ext4
Apr 12 21:34:45 crc16
Apr 12 21:34:45 mbcache
Apr 12 21:34:45 jbd2
Apr 12 21:34:45 raid1
Apr 12 21:34:45 raid0
Apr 12 21:34:45 raid456
Apr 12 21:34:45 async_raid6_recov
Apr 12 21:34:45 async_memcpy
Apr 12 21:34:45 async_pq
Apr 12 21:34:45 async_xor
Apr 12 21:34:45 xor
Apr 12 21:34:45 async_tx
Apr 12 21:34:45 raid6_pq
Apr 12 21:34:45 md_mod
Apr 12 21:34:45 sr_mod
Apr 12 21:34:45 cdrom
Apr 12 21:34:45 usb_storage
Apr 12 21:34:45 hid_generic
Apr 12 21:34:45 usbhid
Apr 12 21:34:45 hid
Apr 12 21:34:45 sg
Apr 12 21:34:45 sd_mod
Apr 12 21:34:45 x86_pkg_temp_thermal
Apr 12 21:34:45 coretemp
Apr 12 21:34:45 crct10dif_pclmul
Apr 12 21:34:45 crc32_pclmul
Apr 12 21:34:45 crc32c_intel
Apr 12 21:34:45 jitterentropy_rng
Apr 12 21:34:45 sha256_ssse3
Apr 12 21:34:45 sha256_generic
Apr 12 21:34:45 hmac
Apr 12 21:34:45 iTCO_wdt
Apr 12 21:34:45 iTCO_vendor_support
Apr 12 21:34:45 drbg
Apr 12 21:34:45 ansi_cprng
Apr 12 21:34:45 aesni_intel
Apr 12 21:34:45 aes_x86_64
Apr 12 21:34:45 lrw
Apr 12 21:34:45 gf128mul
Apr 12 21:34:45 glue_helper
Apr 12 21:34:45 ablk_helper
Apr 12 21:34:45 cryptd
Apr 12 21:34:45 ahci
Apr 12 21:34:45 libahci
Apr 12 21:34:45 sb_edac
Apr 12 21:34:45 libata
Apr 12 21:34:45 igb
Apr 12 21:34:45 megaraid_sas
Apr 12 21:34:45 xhci_pci
Apr 12 21:34:45 ehci_pci
Apr 12 21:34:45 i2c_algo_bit
Apr 12 21:34:45 xhci_hcd
Apr 12 21:34:45 ehci_hcd
Apr 12 21:34:45 edac_core
Apr 12 21:34:45 ptp
Apr 12 21:34:45 mei_me
Apr 12 21:34:45 lpc_ich
Apr 12 21:34:45 i2c_i801
Apr 12 21:34:45 usbcore
Apr 12 21:34:45 pps_core
Apr 12 21:34:45 mfd_core
Apr 12 21:34:45 mei
Apr 12 21:34:45 usb_common
Apr 12 21:34:45 i2c_core
Apr 12 21:34:45 ioatdma
Apr 12 21:34:45 scsi_mod
Apr 12 21:34:45 dca
Apr 12 21:34:45 ipmi_si
Apr 12 21:34:45 ipmi_msghandler
Apr 12 21:34:45 acpi_power_meter
Apr 12 21:34:45 tpm_tis
Apr 12 21:34:45 tpm
Apr 12 21:34:45 processor
Apr 12 21:34:45 button
Apr 12 21:34:45
Apr 12 21:34:45 [75704.965874] CPU: 6 PID: 25339 Comm: main Not tainted
4.4.1 #2
Apr 12 21:34:45 [75704.965916] Hardware name: Supermicro Super
Server/X10DRi-LN4+, BIOS 2.0 12/17/2015
Apr 12 21:34:45 [75704.965979] 0000000000000000
Apr 12 21:34:45 ffffffff812abdf3
Apr 12 21:34:45 0000000000000000
Apr 12 21:34:45 ffffffff810cf5f5
Apr 12 21:34:45
Apr 12 21:34:45 [75704.966054] ffff881ff2870000
Apr 12 21:34:45 ffffffff810fcea2
Apr 12 21:34:45 0000000000000001
Apr 12 21:34:45 ffff881fffcc5e58
Apr 12 21:34:45
Apr 12 21:34:45 [75704.966134] ffff881fffccaf00
Apr 12 21:34:45 ffff881fffccb100
Apr 12 21:34:45 ffff881ff2870000
Apr 12 21:34:45 ffffffff8101bc63
Apr 12 21:34:45
Apr 12 21:34:45 [75704.966211] Call Trace:
Apr 12 21:34:45 [75704.966246] <NMI>
Apr 12 21:34:45 [<ffffffff812abdf3>] ? dump_stack+0x40/0x5d
Apr 12 21:34:45 [75704.966297] [<ffffffff810cf5f5>] ?
watchdog_overflow_callback+0xb5/0xd0
Apr 12 21:34:45 [75704.966339] [<ffffffff810fcea2>] ?
__perf_event_overflow+0x82/0x1c0
Apr 12 21:34:45 [75704.966384] [<ffffffff8101bc63>] ?
intel_pmu_handle_irq+0x1c3/0x3e0
Apr 12 21:34:45 [75704.966431] [<ffffffff8113b5cb>] ?
vunmap_page_range+0x1bb/0x320
Apr 12 21:34:45 [75704.966474] [<ffffffff813213e0>] ?
ghes_copy_tofrom_phys+0x110/0x1d0
Apr 12 21:34:45 [75704.966519] [<ffffffff81014f53>] ?
perf_event_nmi_handler+0x23/0x40
Apr 12 21:34:45 [75704.966560] [<ffffffff81007b85>] ?
nmi_handle+0x65/0x100
Apr 12 21:34:45 [75704.966597] [<ffffffff81007dfe>] ? do_nmi+0x1de/0x360
Apr 12 21:34:45 [75704.970603] [<ffffffff8148f957>] ?
end_repeat_nmi+0x1a/0x1e
Apr 12 21:34:45 [75704.970644] [<ffffffff810862ca>] ?
queued_spin_lock_slowpath+0xea/0x150
Apr 12 21:34:45 [75704.970685] [<ffffffff810862ca>] ?
queued_spin_lock_slowpath+0xea/0x150
Apr 12 21:34:45 [75704.970728] [<ffffffff810862ca>] ?
queued_spin_lock_slowpath+0xea/0x150
Apr 12 21:34:45 [75704.970768] <<EOE>>
Apr 12 21:34:45 [<ffffffffa01b413b>] ? make_request+0x60b/0xbd0 [raid456]
Apr 12 21:34:45 [75704.970838] [<ffffffff810815c0>] ? wait_woken+0x80/0x80
Apr 12 21:34:45 [75704.970878] [<ffffffff81151ec4>] ?
kmem_cache_alloc+0xf4/0x120
Apr 12 21:34:45 [75704.970922] [<ffffffffa017632d>] ?
md_make_request+0xdd/0x220 [md_mod]
Apr 12 21:34:45 [75704.970969] [<ffffffff81219fde>] ?
xfs_map_buffer.isra.12+0x2e/0x60
Apr 12 21:34:45 [75704.971012] [<ffffffff8128691d>] ?
generic_make_request+0xed/0x1d0
Apr 12 21:34:45 [75704.971052] [<ffffffff81286a5a>] ?
submit_bio+0x5a/0x140
Apr 12 21:34:45 [75704.971098] [<ffffffff81113379>] ?
release_pages+0xc9/0x270
Apr 12 21:34:45 [75704.971145] [<ffffffff811a2c01>] ?
do_mpage_readpage+0x2d1/0x640
Apr 12 21:34:45 [75704.971187] [<ffffffff811a304d>] ?
mpage_readpages+0xdd/0x130
Apr 12 21:34:45 [75704.971226] [<ffffffff8121b510>] ?
__xfs_get_blocks+0x750/0x750
Apr 12 21:34:45 [75704.971267] [<ffffffff8121b510>] ?
__xfs_get_blocks+0x750/0x750
Apr 12 21:34:45 [75704.971313] [<ffffffff8114ad45>] ?
alloc_pages_current+0x85/0x110
Apr 12 21:34:45 [75704.971354] [<ffffffff81111d25>] ?
__do_page_cache_readahead+0x165/0x1f0
Apr 12 21:34:45 [75704.971399] [<ffffffff81105902>] ?
pagecache_get_page+0x22/0x1a0
Apr 12 21:34:45 [75704.971441] [<ffffffff8110768c>] ?
filemap_fault+0x37c/0x400
Apr 12 21:34:45 [75704.971481] [<ffffffff8122474b>] ?
xfs_filemap_fault+0x3b/0x80
Apr 12 21:34:45 [75704.971526] [<ffffffff8112d2da>] ? __do_fault+0x3a/0xc0
Apr 12 21:34:45 [75704.971564] [<ffffffff81130883>] ?
handle_mm_fault+0x1063/0x1650
Apr 12 21:34:45 [75704.971614] [<ffffffff8103bdae>] ?
__do_page_fault+0x11e/0x370
Apr 12 21:34:45 [75704.971653] [<ffffffff811aa4ff>] ?
SyS_epoll_wait+0x8f/0xd0
Apr 12 21:34:45 [75704.971694] [<ffffffff8148f64f>] ? page_fault+0x1f/0x30
Apr 12 21:34:45 [75705.493640] NMI watchdog: Watchdog detected hard
LOCKUP on cpu 12
Apr 12 21:34:45
Apr 12 21:34:45 [75705.493668] Modules linked in:
Apr 12 21:34:45 ipt_REJECT
Apr 12 21:34:45 nf_reject_ipv4
Apr 12 21:34:45 iptable_mangle
Apr 12 21:34:45 tun
Apr 12 21:34:45 netconsole
Apr 12 21:34:45 configfs
Apr 12 21:34:45 xt_multiport
Apr 12 21:34:45 ip6table_filter
Apr 12 21:34:45 ip6_tables
Apr 12 21:34:45 iptable_filter
Apr 12 21:34:45 ip_tables
Apr 12 21:34:45 x_tables
Apr 12 21:34:45 bridge
Apr 12 21:34:45 stp
Apr 12 21:34:45 llc
Apr 12 21:34:45 bonding
Apr 12 21:34:45 ext4
Apr 12 21:34:45 crc16
Apr 12 21:34:45 mbcache
Apr 12 21:34:45 jbd2
Apr 12 21:34:45 raid1
Apr 12 21:34:45 raid0
Apr 12 21:34:45 raid456
Apr 12 21:34:45 async_raid6_recov
Apr 12 21:34:45 async_memcpy
Apr 12 21:34:45 async_pq
Apr 12 21:34:45 async_xor
Apr 12 21:34:45 xor
Apr 12 21:34:45 async_tx
Apr 12 21:34:45 raid6_pq
Apr 12 21:34:45 md_mod
Apr 12 21:34:45 sr_mod
Apr 12 21:34:45 cdrom
Apr 12 21:34:45 usb_storage
Apr 12 21:34:45 hid_generic
Apr 12 21:34:45 usbhid
Apr 12 21:34:45 hid
Apr 12 21:34:45 sg
Apr 12 21:34:45 sd_mod
Apr 12 21:34:45 x86_pkg_temp_thermal
Apr 12 21:34:45 coretemp
Apr 12 21:34:45 crct10dif_pclmul
Apr 12 21:34:45 crc32_pclmul
Apr 12 21:34:45 crc32c_intel
Apr 12 21:34:45 jitterentropy_rng
Apr 12 21:34:45 sha256_ssse3
Apr 12 21:34:45 sha256_generic
Apr 12 21:34:45 hmac
Apr 12 21:34:45 iTCO_wdt
Apr 12 21:34:45 iTCO_vendor_support
Apr 12 21:34:45 drbg
Apr 12 21:34:45 ansi_cprng
Apr 12 21:34:45 aesni_intel
Apr 12 21:34:45 aes_x86_64
Apr 12 21:34:45 lrw
Apr 12 21:34:45 gf128mul
Apr 12 21:34:45 glue_helper
Apr 12 21:34:45 ablk_helper
Apr 12 21:34:45 cryptd
Apr 12 21:34:45 ahci
Apr 12 21:34:45 libahci
Apr 12 21:34:45 sb_edac
Apr 12 21:34:45 libata
Apr 12 21:34:45 igb
Apr 12 21:34:45 megaraid_sas
Apr 12 21:34:45 xhci_pci
Apr 12 21:34:45 ehci_pci
Apr 12 21:34:45 i2c_algo_bit
Apr 12 21:34:45 xhci_hcd
Apr 12 21:34:45 ehci_hcd
Apr 12 21:34:45 edac_core
Apr 12 21:34:45 ptp
Apr 12 21:34:45 mei_me
Apr 12 21:34:45 lpc_ich
Apr 12 21:34:45 i2c_i801
Apr 12 21:34:45 usbcore
Apr 12 21:34:45 pps_core
Apr 12 21:34:45 mfd_core
Apr 12 21:34:45 mei
Apr 12 21:34:45 usb_common
Apr 12 21:34:45 i2c_core
Apr 12 21:34:45 ioatdma
Apr 12 21:34:45 scsi_mod
Apr 12 21:34:45 dca
Apr 12 21:34:45 ipmi_si
Apr 12 21:34:45 ipmi_msghandler
Apr 12 21:34:45 acpi_power_meter
Apr 12 21:34:45 tpm_tis
Apr 12 21:34:45 tpm
Apr 12 21:34:45 processor
Apr 12 21:34:45 button
Apr 12 21:34:45
Apr 12 21:34:45 [75705.494688] CPU: 12 PID: 32350 Comm: main Not
tainted 4.4.1 #2
Apr 12 21:34:45 [75705.494728] Hardware name: Supermicro Super
Server/X10DRi-LN4+, BIOS 2.0 12/17/2015
Apr 12 21:34:45 [75705.494790] 0000000000000000
Apr 12 21:34:45 ffffffff812abdf3
Apr 12 21:34:45 0000000000000000
Apr 12 21:34:45 ffffffff810cf5f5
Apr 12 21:34:45
Apr 12 21:34:45 [75705.494886] ffff883ff29a0000
Apr 12 21:34:45 ffffffff810fcea2
Apr 12 21:34:45 0000000000000001
Apr 12 21:34:45 ffff88407fc85e58
Apr 12 21:34:45
Apr 12 21:34:45 [75705.494976] ffff88407fc8af00
Apr 12 21:34:45 ffff88407fc8b100
Apr 12 21:34:45 ffff883ff29a0000
Apr 12 21:34:45 ffffffff8101bc63
Apr 12 21:34:45
Apr 12 21:34:45 [75705.495064] Call Trace:
Apr 12 21:34:45 [75705.495094] <NMI>
Apr 12 21:34:45 [<ffffffff812abdf3>] ? dump_stack+0x40/0x5d
Apr 12 21:34:45 [75705.495150] [<ffffffff810cf5f5>] ?
watchdog_overflow_callback+0xb5/0xd0
Apr 12 21:34:45 [75705.495193] [<ffffffff810fcea2>] ?
__perf_event_overflow+0x82/0x1c0
Apr 12 21:34:45 [75705.495237] [<ffffffff8101bc63>] ?
intel_pmu_handle_irq+0x1c3/0x3e0
Apr 12 21:34:45 [75705.495284] [<ffffffff8113b5cb>] ?
vunmap_page_range+0x1bb/0x320
Apr 12 21:34:45 [75705.495330] [<ffffffff813213e0>] ?
ghes_copy_tofrom_phys+0x110/0x1d0
Apr 12 21:34:45 [75705.495373] [<ffffffff81014f53>] ?
perf_event_nmi_handler+0x23/0x40
Apr 12 21:34:45 [75705.495418] [<ffffffff81007b85>] ?
nmi_handle+0x65/0x100
Apr 12 21:34:45 [75705.495458] [<ffffffff81007d2e>] ? do_nmi+0x10e/0x360
Apr 12 21:34:45 [75705.495497] [<ffffffff8148f957>] ?
end_repeat_nmi+0x1a/0x1e
Apr 12 21:34:45 [75705.495540] [<ffffffff810862ca>] ?
queued_spin_lock_slowpath+0xea/0x150
Apr 12 21:34:45 [75705.495581] [<ffffffff810862ca>] ?
queued_spin_lock_slowpath+0xea/0x150
Apr 12 21:34:45 [75705.495621] [<ffffffff810862ca>] ?
queued_spin_lock_slowpath+0xea/0x150
Apr 12 21:34:45 [75705.495661] <<EOE>>
Apr 12 21:34:45 [<ffffffffa01b413b>] ? make_request+0x60b/0xbd0 [raid456]
Apr 12 21:34:45 [75705.495733] [<ffffffff81282d87>] ?
blk_rq_init+0x87/0xa0
Apr 12 21:34:45 [75705.495771] [<ffffffff81283e3c>] ?
get_request+0x29c/0x6e0
Apr 12 21:34:45 [75705.495812] [<ffffffff810815c0>] ? wait_woken+0x80/0x80
Apr 12 21:34:45 [75705.495853] [<ffffffffa017632d>] ?
md_make_request+0xdd/0x220 [md_mod]
Apr 12 21:34:45 [75705.495898] [<ffffffff8128829e>] ?
blk_queue_bio+0x15e/0x350
Apr 12 21:34:45 [75705.495937] [<ffffffff8128691d>] ?
generic_make_request+0xed/0x1d0
Apr 12 21:34:45 [75705.495978] [<ffffffff81286a5a>] ?
submit_bio+0x5a/0x140
Apr 12 21:34:45 [75705.496018] [<ffffffff811a215e>] ?
mpage_bio_submit+0x1e/0x30
Apr 12 21:34:45 [75705.496057] [<ffffffff811a3076>] ?
mpage_readpages+0x106/0x130
Apr 12 21:34:45 [75705.496102] [<ffffffff8121b510>] ?
__xfs_get_blocks+0x750/0x750
Apr 12 21:34:45 [75705.496144] [<ffffffff8121b510>] ?
__xfs_get_blocks+0x750/0x750
Apr 12 21:34:45 [75705.496185] [<ffffffff8114ad45>] ?
alloc_pages_current+0x85/0x110
Apr 12 21:34:45 [75705.496227] [<ffffffff81111d25>] ?
__do_page_cache_readahead+0x165/0x1f0
Apr 12 21:34:45 [75705.496268] [<ffffffff811344f5>] ? vma_link+0x75/0xb0
Apr 12 21:34:45 [75705.496307] [<ffffffff811120eb>] ?
force_page_cache_readahead+0x9b/0xe0
Apr 12 21:34:45 [75705.496352] [<ffffffff8113f876>] ?
madvise_willneed+0x76/0x140
Apr 12 21:34:45 [75705.496395] [<ffffffff811301ce>] ?
handle_mm_fault+0x9ae/0x1650
Apr 12 21:34:45 [75705.496437] [<ffffffff81133dcb>] ? find_vma+0x5b/0x70
Apr 12 21:34:45 [75705.496476] [<ffffffff8113fc52>] ?
SyS_madvise+0x312/0x6f0
Apr 12 21:34:45 [75705.496515] [<ffffffff8148d9db>] ?
entry_SYSCALL_64_fastpath+0x16/0x6e
Apr 12 21:34:47 [75707.118049] NMI watchdog: Watchdog detected hard
LOCKUP on cpu 15
Apr 12 21:34:47
Apr 12 21:34:47 [75707.118078] Modules linked in:
Apr 12 21:34:47 ipt_REJECT
Apr 12 21:34:47 nf_reject_ipv4
Apr 12 21:34:47 iptable_mangle
Apr 12 21:34:47 tun
Apr 12 21:34:47 netconsole
Apr 12 21:34:47 configfs
Apr 12 21:34:47 xt_multiport
Apr 12 21:34:47 ip6table_filter
Apr 12 21:34:47 ip6_tables
Apr 12 21:34:47 iptable_filter
Apr 12 21:34:47 ip_tables
Apr 12 21:34:47 x_tables
Apr 12 21:34:47 bridge
Apr 12 21:34:47 stp
Apr 12 21:34:47 llc
Apr 12 21:34:47 bonding
Apr 12 21:34:47 ext4
Apr 12 21:34:47 crc16
Apr 12 21:34:47 mbcache
Apr 12 21:34:47 jbd2
Apr 12 21:34:47 raid1
Apr 12 21:34:47 raid0
Apr 12 21:34:47 raid456
Apr 12 21:34:47 async_raid6_recov
Apr 12 21:34:47 async_memcpy
Apr 12 21:34:47 async_pq
Apr 12 21:34:47 async_xor
Apr 12 21:34:47 xor
Apr 12 21:34:47 async_tx
Apr 12 21:34:47 raid6_pq
Apr 12 21:34:47 md_mod
Apr 12 21:34:47 sr_mod
Apr 12 21:34:47 cdrom
Apr 12 21:34:47 usb_storage
Apr 12 21:34:47 hid_generic
Apr 12 21:34:47 usbhid
Apr 12 21:34:47 hid
Apr 12 21:34:47 sg
Apr 12 21:34:47 sd_mod
Apr 12 21:34:47 x86_pkg_temp_thermal
Apr 12 21:34:47 coretemp
Apr 12 21:34:47 crct10dif_pclmul
Apr 12 21:34:47 crc32_pclmul
Apr 12 21:34:47 crc32c_intel
Apr 12 21:34:47 jitterentropy_rng
Apr 12 21:34:47 sha256_ssse3
Apr 12 21:34:47 sha256_generic
Apr 12 21:34:47 hmac
Apr 12 21:34:47 iTCO_wdt
Apr 12 21:34:47 iTCO_vendor_support
Apr 12 21:34:47 drbg
Apr 12 21:34:47 ansi_cprng
Apr 12 21:34:47 aesni_intel
Apr 12 21:34:47 aes_x86_64
Apr 12 21:34:47 lrw
Apr 12 21:34:47 gf128mul
Apr 12 21:34:47 glue_helper
Apr 12 21:34:47 ablk_helper
Apr 12 21:34:47 cryptd
Apr 12 21:34:47 ahci
Apr 12 21:34:47 libahci
Apr 12 21:34:47 sb_edac
Apr 12 21:34:47 libata
Apr 12 21:34:47 igb
Apr 12 21:34:47 megaraid_sas
Apr 12 21:34:47 xhci_pci
Apr 12 21:34:47 ehci_pci
Apr 12 21:34:47 i2c_algo_bit
Apr 12 21:34:47 xhci_hcd
Apr 12 21:34:47 ehci_hcd
Apr 12 21:34:47 edac_core
Apr 12 21:34:47 ptp
Apr 12 21:34:47 mei_me
Apr 12 21:34:47 lpc_ich
Apr 12 21:34:47 i2c_i801
Apr 12 21:34:47 usbcore
Apr 12 21:34:47 pps_core
Apr 12 21:34:47 mfd_core
Apr 12 21:34:47 mei
Apr 12 21:34:47 usb_common
Apr 12 21:34:47 i2c_core
Apr 12 21:34:47 ioatdma
Apr 12 21:34:47 scsi_mod
Apr 12 21:34:47 dca
Apr 12 21:34:47 ipmi_si
Apr 12 21:34:47 ipmi_msghandler
Apr 12 21:34:47 acpi_power_meter
Apr 12 21:34:47 tpm_tis
Apr 12 21:34:47 tpm
Apr 12 21:34:47 processor
Apr 12 21:34:47 button
Apr 12 21:34:47
Apr 12 21:34:47 [75707.119088] CPU: 15 PID: 31940 Comm: main Not
tainted 4.4.1 #2
Apr 12 21:34:47 [75707.119134] Hardware name: Supermicro Super
Server/X10DRi-LN4+, BIOS 2.0 12/17/2015
Apr 12 21:34:47 [75707.119196] 0000000000000000
Apr 12 21:34:47 ffffffff812abdf3
Apr 12 21:34:47 0000000000000000
Apr 12 21:34:47 ffffffff810cf5f5
Apr 12 21:34:47
Apr 12 21:34:47 [75707.119277] ffff883ff2a20000
Apr 12 21:34:47 ffffffff810fcea2
Apr 12 21:34:47 0000000000000001
Apr 12 21:34:47 ffff88407fce5e58
Apr 12 21:34:47
Apr 12 21:34:47 [75707.119360] ffff88407fceaf00
Apr 12 21:34:47 ffff88407fceb100
Apr 12 21:34:47 ffff883ff2a20000
Apr 12 21:34:47 ffffffff8101bc63
Apr 12 21:34:47
Apr 12 21:34:47 [75707.119439] Call Trace:
Apr 12 21:34:47 [75707.119471] <NMI>
Apr 12 21:34:47 [<ffffffff812abdf3>] ? dump_stack+0x40/0x5d
Apr 12 21:34:47 [75707.119527] [<ffffffff810cf5f5>] ?
watchdog_overflow_callback+0xb5/0xd0
Apr 12 21:34:47 [75707.119571] [<ffffffff810fcea2>] ?
__perf_event_overflow+0x82/0x1c0
Apr 12 21:34:47 [75707.119614] [<ffffffff8101bc63>] ?
intel_pmu_handle_irq+0x1c3/0x3e0
Apr 12 21:34:47 [75707.119657] [<ffffffff8113b5cb>] ?
vunmap_page_range+0x1bb/0x320
Apr 12 21:34:47 [75707.119703] [<ffffffff813213e0>] ?
ghes_copy_tofrom_phys+0x110/0x1d0
Apr 12 21:34:47 [75707.119758] [<ffffffff81014f53>] ?
perf_event_nmi_handler+0x23/0x40
Apr 12 21:34:47 [75707.119800] [<ffffffff81007b85>] ?
nmi_handle+0x65/0x100
Apr 12 21:34:47 [75707.119838] [<ffffffff81007d2e>] ? do_nmi+0x10e/0x360
Apr 12 21:34:47 [75707.119878] [<ffffffff8148f957>] ?
end_repeat_nmi+0x1a/0x1e
Apr 12 21:34:47 [75707.119920] [<ffffffff810862ca>] ?
queued_spin_lock_slowpath+0xea/0x150
Apr 12 21:34:47 [75707.119962] [<ffffffff810862ca>] ?
queued_spin_lock_slowpath+0xea/0x150
Apr 12 21:34:47 [75707.120002] [<ffffffff810862ca>] ?
queued_spin_lock_slowpath+0xea/0x150
Apr 12 21:34:47 [75707.120042] <<EOE>>
Apr 12 21:34:47 [<ffffffffa01b413b>] ? make_request+0x60b/0xbd0 [raid456]
Apr 12 21:34:47 [75707.120113] [<ffffffff810815c0>] ? wait_woken+0x80/0x80
Apr 12 21:34:47 [75707.120152] [<ffffffffa017632d>] ?
md_make_request+0xdd/0x220 [md_mod]
Apr 12 21:34:47 [75707.120195] [<ffffffff8128691d>] ?
generic_make_request+0xed/0x1d0
Apr 12 21:34:47 [75707.120236] [<ffffffff81286a5a>] ?
submit_bio+0x5a/0x140
Apr 12 21:34:47 [75707.120277] [<ffffffff8112afaf>] ?
workingset_refault+0x4f/0xa0
Apr 12 21:34:47 [75707.120320] [<ffffffff811a215e>] ?
mpage_bio_submit+0x1e/0x30
Apr 12 21:34:47 [75707.120359] [<ffffffff811a3076>] ?
mpage_readpages+0x106/0x130
Apr 12 21:34:47 [75707.120401] [<ffffffff8121b510>] ?
__xfs_get_blocks+0x750/0x750
Apr 12 21:34:47 [75707.120439] [<ffffffff8121b510>] ?
__xfs_get_blocks+0x750/0x750
Apr 12 21:34:47 [75707.120481] [<ffffffff8114ad45>] ?
alloc_pages_current+0x85/0x110
Apr 12 21:34:47 [75707.120523] [<ffffffff81111d25>] ?
__do_page_cache_readahead+0x165/0x1f0
Apr 12 21:34:47 [75707.120564] [<ffffffff811344f5>] ? vma_link+0x75/0xb0
Apr 12 21:34:47 [75707.120602] [<ffffffff811120c7>] ?
force_page_cache_readahead+0x77/0xe0
Apr 12 21:34:47 [75707.120644] [<ffffffff8113f876>] ?
madvise_willneed+0x76/0x140
Apr 12 21:34:47 [75707.120683] [<ffffffff811301ce>] ?
handle_mm_fault+0x9ae/0x1650
Apr 12 21:34:47 [75707.120722] [<ffffffff81133dcb>] ? find_vma+0x5b/0x70
Apr 12 21:34:47 [75707.120760] [<ffffffff8113fc52>] ?
SyS_madvise+0x312/0x6f0
Apr 12 21:34:47 [75707.120799] [<ffffffff8148d9db>] ?
entry_SYSCALL_64_fastpath+0x16/0x6e
Once this starts, a couple of minutes goes by and the machine locks up
completely.
I have been unable to locate the problem here, anyone that can point me
in the right direction?
Best regards
next reply other threads:[~2016-04-12 21:54 UTC|newest]
Thread overview: 5+ messages / expand[flat|nested] mbox.gz Atom feed top
2016-04-12 21:54 Daniel Walker [this message]
2016-04-13 17:00 ` Hard CPU Lockup when accessing MD RAID5 Shaohua Li
2016-04-20 6:52 ` Daniel Walker
2016-04-20 15:29 ` John Stoffel
2016-04-21 22:47 ` Daniel Walker
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=570D6E79.1010201@ftwinc.net \
--to=admin@ftwinc.net \
--cc=linux-raid@vger.kernel.org \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.