From: keith.busch@intel.com (Keith Busch)
Subject: kernel BUG at nvme/host/pci.c
Date: Tue, 11 Jul 2017 15:45:24 -0400 [thread overview]
Message-ID: <20170711194523.GD4604@localhost.localdomain> (raw)
In-Reply-To: <64da8c4d-2d69-22fa-06a0-aa4f11903e39@pse-consulting.de>
On Tue, Jul 11, 2017@09:44:47AM +0200, Andreas Pflug wrote:
> Tested with 4.12.0, result is
>
> kernel BUG at drivers/nvme/host/pci.c:610!
>
> Kernel seems to recover from that, but I did a reboot anyway.
Ugh, still observing invalid scatter lists on 4.12. Definitely recommend
rebooting after hitting this.
There should only be two possibilities: either the block layer didn't
split a bio that it should have, or it merged two that it shouldn't. To
determine which, could disable merging for NVMe before running your
test? Something like this should accomplish that:
# echo 1 | tee /sys/block/nvme*/queue/nomerge
On a side note, I think we should make the BUG_ON a WARN_ON, and return
an IO error. While it'd fail IO, it should leave the system stable to
do more prodding.
> Jul 11 09:37:28 xen2 [ 110.002253] ------------[ cut here ]------------
> Jul 11 09:37:28 xen2 [ 110.002310] kernel BUG at drivers/nvme/host/pci.c:610!
> Jul 11 09:37:28 xen2 [ 110.002336] invalid opcode: 0000 [#1] SMP
> Jul 11 09:37:28 xen2 [ 110.002357] Modules linked in: xt_physdev br_netfilter iptable_filter xen_netback xen_blkback netconsole configfs bridge xen_gntdev xen_evtchn xenfs xen_privcmd dm_snapshot dm_bufio intel_rapl sb_edac x86_pkg_temp_thermal intel_powerclamp coretemp crct10dif_pclmul crc32_pclmul ghash_clmulni_intel pcbc aesni_intel aes_x86_64 iTCO_wdt crypto_simd iTCO_vendor_support glue_helper mxm_wmi cryptd snd_pcm snd_timer snd soundcore intel_rapl_perf pcspkr ast ttm e1000e drm_kms_helper joydev i2c_i801 ixgbe mei_me nvme drm ehci_pci ptp lpc_ich i2c_algo_bit sg mfd_core ehci_hcd mei pps_core nvme_core mdio ioatdma shpchp dca wmi acpi_power_meter 8021q garp mrp stp llc button ipmi_si ipmi_devintf ipmi_msghandler drbd lru_cache sunrpc ip_tables x_tables autofs4 ext4 crc16 jbd2 fscrypto mbcache raid10 raid456 libcrc32c
> Jul 11 09:37:28 xen2 [ 110.002638] crc32c_generic async_raid6_recov async_memcpy async_pq async_xor xor async_tx evdev hid_generic usbhid hid raid6_pq raid0 multipath linear bcache dm_mod raid1 md_mod sd_mod crc32c_intel ahci libahci xhci_pci xhci_hcd libata usbcore scsi_mod
> Jul 11 09:37:28 xen2 [ 110.002746] CPU: 0 PID: 5522 Comm: 2.hda-0 Tainted: G W 4.12.0pse #2
> Jul 11 09:37:28 xen2 [ 110.002775] Hardware name: Supermicro X10DRi/X10DRI-T, BIOS 2.1 09/13/2016
> Jul 11 09:37:28 xen2 [ 110.002807] task: ffff88015fb3e140 task.stack: ffffc90047b64000
> Jul 11 09:37:28 xen2 [ 110.002838] RIP: e030:nvme_queue_rq+0x644/0x7c0 [nvme]
> Jul 11 09:37:28 xen2 [ 110.002864] RSP: e02b:ffffc90047b67a10 EFLAGS: 00010286
> Jul 11 09:37:28 xen2 [ 110.002889] RAX: 0000000000000008 RBX: 00000000fffff400 RCX: 0000000000001000
> Jul 11 09:37:28 xen2 [ 110.002922] RDX: 0000000000000000 RSI: 0000000000000200 RDI: 0000000000000200
> Jul 11 09:37:28 xen2 [ 110.002954] RBP: 0000000000711000 R08: 0000000000001400 R09: ffff880171a82a00
> Jul 11 09:37:28 xen2 [ 110.002987] R10: 0000000000001000 R11: ffff880161316d00 R12: 0000000000006000
> Jul 11 09:37:28 xen2 [ 110.003019] R13: 0000000000000200 R14: ffff880161316d00 R15: 0000000000000002
> Jul 11 09:37:28 xen2 [ 110.003056] FS: 0000000000000000(0000) GS:ffff880186a00000(0000) knlGS:ffff880186a00000
> Jul 11 09:37:28 xen2 [ 110.003088] CS: e033 DS: 0000 ES: 0000 CR0: 0000000080050033
> Jul 11 09:37:28 xen2 [ 110.003115] CR2: 00007fed265e5fe8 CR3: 000000016eec0000 CR4: 0000000000042660
> Jul 11 09:37:28 xen2 [ 110.003148] Call Trace:
> Jul 11 09:37:28 xen2 [ 110.003169] ? blk_mq_dispatch_rq_list+0x201/0x400
> Jul 11 09:37:28 xen2 [ 110.003193] ? blk_mq_flush_busy_ctxs+0xc1/0x120
> Jul 11 09:37:28 xen2 [ 110.003217] ? blk_mq_sched_dispatch_requests+0x1b1/0x1e0
> Jul 11 09:37:28 xen2 [ 110.003243] ? __blk_mq_delay_run_hw_queue+0x91/0xa0
> Jul 11 09:37:28 xen2 [ 110.003265] ? blk_mq_flush_plug_list+0x184/0x260
> Jul 11 09:37:28 xen2 [ 110.003290] ? blk_flush_plug_list+0xf2/0x280
> Jul 11 09:37:28 xen2 [ 110.003312] ? blk_finish_plug+0x27/0x40
> Jul 11 09:37:28 xen2 [ 110.003335] ? dispatch_rw_block_io+0x732/0x9c0 [xen_blkback]
> Jul 11 09:37:28 xen2 [ 110.003363] ? __do_block_io_op+0x362/0x690 [xen_blkback]
> Jul 11 09:37:28 xen2 [ 110.003393] ? _raw_spin_unlock_irqrestore+0x16/0x20
> Jul 11 09:37:28 xen2 [ 110.003415] ? __do_block_io_op+0x362/0x690 [xen_blkback]
> Jul 11 09:37:28 xen2 [ 110.003442] ? xen_blkif_schedule+0x116/0x7f0 [xen_blkback]
> Jul 11 09:37:28 xen2 [ 110.003469] ? __schedule+0x3cd/0x850
> Jul 11 09:37:28 xen2 [ 110.003488] ? remove_wait_queue+0x60/0x60
> Jul 11 09:37:28 xen2 [ 110.003511] ? kthread+0xfc/0x130
> Jul 11 09:37:28 xen2 [ 110.003530] ? xen_blkif_be_int+0x30/0x30 [xen_blkback]
> Jul 11 09:37:28 xen2 [ 110.003556] ? kthread_create_on_node+0x70/0x70
> Jul 11 09:37:28 xen2 [ 110.003581] ? do_group_exit+0x3a/0xa0
> Jul 11 09:37:28 xen2 [ 110.004573] ? ret_from_fork+0x25/0x30
> Jul 11 09:37:28 xen2 [ 110.005560] Code: ff 4c 89 ef 89 54 24 20 89 4c 24 18 e8 66 e0 e9 c0 8b 54 24 20 48 89 44 24 10 4c 8b 48 10 44 8b 40 18 8b 4c 24 18 e9 74 fd ff ff <0f> 0b 49 8b 77 68 48 8b 3c 24 e8 8d b3 e8 c0 83 e8 01 74 55 41
> Jul 11 09:37:28 xen2 [ 110.007650] RIP: nvme_queue_rq+0x644/0x7c0 [nvme] RSP: ffffc90047b67a10
> Jul 11 09:37:28 xen2 [ 110.008708] ---[ end trace ad956c9e07e27784 ]---
next prev parent reply other threads:[~2017-07-11 19:45 UTC|newest]
Thread overview: 16+ messages / expand[flat|nested] mbox.gz Atom feed top
2017-07-10 18:03 kernel BUG at nvme/host/pci.c Andreas Pflug
2017-07-10 19:08 ` Keith Busch
2017-07-11 7:44 ` Andreas Pflug
2017-07-11 19:45 ` Keith Busch [this message]
2017-07-11 19:44 ` Scott Bauer
2017-07-12 6:06 ` Andreas Pflug
2017-07-12 19:50 ` Keith Busch
2017-07-13 8:46 ` Andreas Pflug
2017-07-13 9:00 ` Sagi Grimberg
2017-07-13 13:47 ` Keith Busch
2017-07-14 16:47 ` Andreas Pflug
2017-07-14 17:08 ` Keith Busch
2017-07-15 8:51 ` Christoph Hellwig
2017-07-15 8:51 ` Christoph Hellwig
2017-07-15 13:34 ` Andreas Pflug
2017-07-15 13:34 ` Andreas Pflug
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=20170711194523.GD4604@localhost.localdomain \
--to=keith.busch@intel.com \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.