From: Alex Williamson <alex.williamson@redhat.com>
To: "Wuzongyong (Euler Dept)" <cordius.wu@huawei.com>
Cc: "qemu-devel@nongnu.org" <qemu-devel@nongnu.org>,
"libvir-list@redhat.com" <libvir-list@redhat.com>,
"Chenhaiwu (Euler)" <chenhaiwu1@huawei.com>,
"Wanzongshun (Vincent)" <wanzongshun@huawei.com>
Subject: Re: [Qemu-devel] The results of lspci are inconsistent between vfio reset pci devices and reset devices by sysfs interafce
Date: Tue, 9 Oct 2018 09:08:01 -0600 [thread overview]
Message-ID: <20181009090801.6f53286d@w520.home> (raw)
In-Reply-To: <9BD73EA91F8E404F851CF3F519B14AA80180A4A5@DGGEMI521-MBX.china.huawei.com>
On Tue, 9 Oct 2018 12:11:29 +0000
"Wuzongyong (Euler Dept)" <cordius.wu@huawei.com> wrote:
> Hi,
>
> I start a virtual machine with commandline:
> /usr/libexec/qemu-kvm --enable-kvm -smp 8 -m 8192 -device vfio-pci,host=0000:81:00.0
>
> Then I pause the qemu process before executing the main_loop function by gdb.
> At this moment, lspci shows the regions are disabled like below:
> 81:00.0 3D controller: NVIDIA Corporation GP100GL [Tesla P100 PCIe 16GB] (rev a1)
> Subsystem: NVIDIA Corporation Device 118f
> Physical Slot: 0-6
> Control: I/O- Mem- BusMaster- SpecCycle- MemWINV- VGASnoop- ParErr- Stepping- SERR- FastB2B- DisINTx+
> Status: Cap+ 66MHz- UDF- FastB2B- ParErr- DEVSEL=fast >TAbort- <TAbort- <MAbort- >SERR- <PERR- INTx-
> Interrupt: pin A routed to IRQ 35
> NUMA node: 1
> Region 0: Memory at c8000000 (32-bit, non-prefetchable) [disabled] [size=16M]
> Region 1: Memory at 27800000000 (64-bit, prefetchable) [disabled] [size=16G]
> Region 3: Memory at 27c00000000 (64-bit, prefetchable) [disabled] [size=32M]
>
> But after the command:
> echo 1 > /sys/bus/pci/devices/0000:81:00.0/reset
> lspci shows the regions are *not* disabled:
> 81:00.0 3D controller: NVIDIA Corporation GP100GL [Tesla P100 PCIe 16GB] (rev a1)
> Subsystem: Huawei Technologies Co., Ltd. Device 2061
> Physical Slot: 0-6
> Control: I/O+ Mem+ BusMaster+ SpecCycle- MemWINV- VGASnoop- ParErr+ Stepping- SERR+ FastB2B- DisINTx-
> Status: Cap+ 66MHz- UDF- FastB2B- ParErr- DEVSEL=fast >TAbort- <TAbort- <MAbort- >SERR- <PERR- INTx-
> Latency: 0, Cache Line Size: 32 bytes
> Interrupt: pin A routed to IRQ 7
> NUMA node: 1
> Region 0: Memory at c8000000 (32-bit, non-prefetchable) [size=16M]
> Region 1: Memory at 27800000000 (64-bit, prefetchable) [size=16G]
> Region 3: Memory at 27c00000000 (64-bit, prefetchable) [size=32M]
>
> AFAIK, qemu performs vfio_pci_reset like the below callstack:
> Qemu:
> vfio_pci_reset
> ioctl(vdev->vbasedev.fd, VFIO_DEVICE_RESET)
> Kernel:
> vfio_pci_ioctl
> pci_try_reset_function
> __pci_reset_function_locked
> pci_parent_bus_reset
> pci_reset_bridge_secondary_bus
>
> and write 1 to the reset interface of sysfs go through the path:
> Kernel:
> reset_store
> pci_reset_function
> __pci_reset_function_locked
> pci_parent_bus_reset
> pci_reset_bridge_secondary_bus
>
> So seem that these two methods are same actually, I am confused why the results are inconsistent.
Maybe there's a misunderstanding here, the kernel PCI reset functions
save and restore config space around the reset. The intention of the
reset is to re-init the internal state of the device while preserving
(via save+restore) the config space. The BARs being disabled is simply
a matter of the Memory bit in the Command register being unset (note
Mem-). Whether this is indicative of some issue depends on whether the
state before reset matches the state after reset, not that the states
after two different paths of triggering a reset are identical.
vfio-pci will hand off the device to the user (QEMU) disabled, so the
states in the first example make sense to me. In the second case, it's
not clear what the starting state is for the device. Was this reset
performed from the starting point of the first case or is the device in
some arbitrary, unknown state prior to reset? Thanks,
Alex
next prev parent reply other threads:[~2018-10-09 15:08 UTC|newest]
Thread overview: 5+ messages / expand[flat|nested] mbox.gz Atom feed top
2018-10-09 12:11 [Qemu-devel] The results of lspci are inconsistent between vfio reset pci devices and reset devices by sysfs interafce Wuzongyong (Euler Dept)
2018-10-09 15:08 ` Alex Williamson [this message]
2018-10-10 1:26 ` Wuzongyong (Euler Dept)
2018-10-10 1:47 ` Wuzongyong (Euler Dept)
2018-10-10 3:18 ` Alex Williamson
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=20181009090801.6f53286d@w520.home \
--to=alex.williamson@redhat.com \
--cc=chenhaiwu1@huawei.com \
--cc=cordius.wu@huawei.com \
--cc=libvir-list@redhat.com \
--cc=qemu-devel@nongnu.org \
--cc=wanzongshun@huawei.com \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).