qemu-devel.nongnu.org archive mirror
 help / color / mirror / Atom feed
From: Jason Wang <jasowang@redhat.com>
To: Cornelia Huck <cohuck@redhat.com>
Cc: qemu-s390x@nongnu.org, qemu-devel@nongnu.org,
	Cindy Lu <lulu@redhat.com>, "Michael S. Tsirkin" <mst@redhat.com>
Subject: Re: [BUG] vhost-vdpa: qemu-system-s390x crashes with second virtio-net-ccw device
Date: Mon, 27 Jul 2020 15:38:12 +0800	[thread overview]
Message-ID: <676ce079-adf6-a279-c2ea-68f43146e2ac@redhat.com> (raw)
In-Reply-To: <20200727084310.7d29ec6d.cohuck@redhat.com>


On 2020/7/27 下午2:43, Cornelia Huck wrote:
> On Sat, 25 Jul 2020 08:40:07 +0800
> Jason Wang <jasowang@redhat.com> wrote:
>
>> On 2020/7/24 下午11:34, Cornelia Huck wrote:
>>> On Fri, 24 Jul 2020 11:17:57 -0400
>>> "Michael S. Tsirkin"<mst@redhat.com>  wrote:
>>>   
>>>> On Fri, Jul 24, 2020 at 04:56:27PM +0200, Cornelia Huck wrote:
>>>>> On Fri, 24 Jul 2020 09:30:58 -0400
>>>>> "Michael S. Tsirkin"<mst@redhat.com>  wrote:
>>>>>       
>>>>>> On Fri, Jul 24, 2020 at 03:27:18PM +0200, Cornelia Huck wrote:
>>>>>>> When I start qemu with a second virtio-net-ccw device (i.e. adding
>>>>>>> -device virtio-net-ccw in addition to the autogenerated device), I get
>>>>>>> a segfault. gdb points to
>>>>>>>
>>>>>>> #0  0x000055d6ab52681d in virtio_net_get_config (vdev=<optimized out>,
>>>>>>>       config=0x55d6ad9e3f80 "RT") at /home/cohuck/git/qemu/hw/net/virtio-net.c:146
>>>>>>> 146	    if (nc->peer->info->type == NET_CLIENT_DRIVER_VHOST_VDPA) {
>>>>>>>
>>>>>>> (backtrace doesn't go further)
>>>>> The core was incomplete, but running under gdb directly shows that it
>>>>> is just a bog-standard config space access (first for that device).
>>>>>
>>>>> The cause of the crash is that nc->peer is not set... no idea how that
>>>>> can happen, not that familiar with that part of QEMU. (Should the code
>>>>> check, or is that really something that should not happen?)
>>>>>
>>>>> What I don't understand is why it is set correctly for the first,
>>>>> autogenerated virtio-net-ccw device, but not for the second one, and
>>>>> why virtio-net-pci doesn't show these problems. The only difference
>>>>> between -ccw and -pci that comes to my mind here is that config space
>>>>> accesses for ccw are done via an asynchronous operation, so timing
>>>>> might be different.
>>>> Hopefully Jason has an idea. Could you post a full command line
>>>> please? Do you need a working guest to trigger this? Does this trigger
>>>> on an x86 host?
>>> Yes, it does trigger with tcg-on-x86 as well. I've been using
>>>
>>> s390x-softmmu/qemu-system-s390x -M s390-ccw-virtio,accel=tcg -cpu qemu,zpci=on
>>> -m 1024 -nographic -device virtio-scsi-ccw,id=scsi0,devno=fe.0.0001
>>> -drive file=/path/to/image,format=qcow2,if=none,id=drive-scsi0-0-0-0
>>> -device scsi-hd,bus=scsi0.0,channel=0,scsi-id=0,lun=0,drive=drive-scsi0-0-0-0,id=scsi0-0-0-0,bootindex=1
>>> -device virtio-net-ccw
>>>
>>> It seems it needs the guest actually doing something with the nics; I
>>> cannot reproduce the crash if I use the old advent calendar moon buggy
>>> image and just add a virtio-net-ccw device.
>>>
>>> (I don't think it's a problem with my local build, as I see the problem
>>> both on my laptop and on an LPAR.)
>>
>> It looks to me we forget the check the existence of peer.
>>
>> Please try the attached patch to see if it works.
> Thanks, that patch gets my guest up and running again. So, FWIW,
>
> Tested-by: Cornelia Huck <cohuck@redhat.com>
>
> Any idea why this did not hit with virtio-net-pci (or the autogenerated
> virtio-net-ccw device)?


It can be hit with virtio-net-pci as well (just start without peer).

For autogenerated virtio-net-cww, I think the reason is that it has 
already had a peer set.

Thanks


>
>



  reply	other threads:[~2020-07-27  7:38 UTC|newest]

Thread overview: 14+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2020-07-24 13:27 [BUG] vhost-vdpa: qemu-system-s390x crashes with second virtio-net-ccw device Cornelia Huck
2020-07-24 13:30 ` Michael S. Tsirkin
2020-07-24 14:56   ` Cornelia Huck
2020-07-24 15:17     ` Michael S. Tsirkin
2020-07-24 15:34       ` Cornelia Huck
2020-07-25  0:40         ` Jason Wang
2020-07-27  6:43           ` Cornelia Huck
2020-07-27  7:38             ` Jason Wang [this message]
2020-07-27  8:41               ` Cornelia Huck
2020-07-27  8:51                 ` Jason Wang
2020-07-27 11:43                   ` Michael S. Tsirkin
2020-07-27 12:44                     ` Jason Wang
2020-07-27 13:16                       ` Michael S. Tsirkin
2020-07-28  4:10                         ` Jason Wang

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=676ce079-adf6-a279-c2ea-68f43146e2ac@redhat.com \
    --to=jasowang@redhat.com \
    --cc=cohuck@redhat.com \
    --cc=lulu@redhat.com \
    --cc=mst@redhat.com \
    --cc=qemu-devel@nongnu.org \
    --cc=qemu-s390x@nongnu.org \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).