From: Francesco Valla <francesco@valla.it>
To: Matias Ezequiel Vara Larsen <mvaralar@redhat.com>
Cc: Marc Kleine-Budde <mkl@pengutronix.de>,
Vincent Mailhol <mailhol@kernel.org>,
Harald Mommer <harald.mommer@oss.qualcomm.com>,
Mikhail Golubev-Ciuchea
<mikhail.golubev-ciuchea@oss.qualcomm.com>,
"Michael S. Tsirkin" <mst@redhat.com>,
Jason Wang <jasowang@redhat.com>,
Xuan Zhuo <xuanzhuo@linux.alibaba.com>,
linux-can@vger.kernel.org, virtualization@lists.linux.dev,
Wolfgang Grandegger <wg@grandegger.com>,
"David S. Miller" <davem@davemloft.net>,
Eric Dumazet <edumazet@google.com>,
Jakub Kicinski <kuba@kernel.org>, Paolo Abeni <pabeni@redhat.com>,
Stefano Garzarella <sgarzare@redhat.com>
Subject: Re: [PATCH v6] can: virtio: Add virtio CAN driver
Date: Thu, 8 Jan 2026 00:00:54 +0100 [thread overview]
Message-ID: <aV7lplXiOehmw2iR@bywater> (raw)
In-Reply-To: <aV6sHofXnZuu84OV@bywater>
On Wed, Jan 07, 2026 at 07:55:26PM +0100, Francesco Valla wrote:
> Hi Matias,
>
> On Wed, Jan 07, 2026 at 05:14:25PM +0100, Matias Ezequiel Vara Larsen wrote:
> > On Fri, Dec 26, 2025 at 4:09 PM Francesco Valla <francesco@valla.it> wrote:
> > >
> > > Hi,
> > >
> > > On Sun, Dec 14, 2025 at 04:25:54PM +0100, Francesco Valla wrote:
> > > > While stress testing this, I noticed that flooding the virtio-can
> > > > interface with packets leads to an hang of the interface itself.
> > > > I am seeing this issuing, at host side:
> > > >
> > > > while true; do cansend can0 123#00; done
> > > >
> > > > with:
> > > >
> > > > - QEMU: the tip of the master branch plus [2]
> > > > - vhost-device: the tip of the main branch
> > > >
> > > > and the following QEMU invocation:
> > > >
> > > > qemu-system-x86_64 -serial mon:stdio \
> > > > -m 2G -smp 2 \
> > > > -kernel $(pwd)/BUILD.bin/arch/x86/boot/bzImage \
> > > > -initrd /home/francesco/SRC/LINUX_KERNEL/initramfs.gz \
> > > > -append "loglevel=7 console=ttyS0" \
> > > > -machine memory-backend=pc.ram \
> > > > -object memory-backend-file,id=pc.ram,size=2G,mem-path=/tmp/pc.ram,share=on \
> > > > -chardev socket,id=can0,path=/tmp/sock-can0 \
> > > > -device vhost-user-can-pci,chardev=can0
> > > >
> > > >
> > > > Restarting the interface (i.e.: ip link set down and the up) does not
> > > > fix the situation.
> > > >
> > > > I'll try to do some more testing during the next days.
> > >
> > > After a deep dive, I _think_ the problem actually lies in vhost-device,
> > > since it is not there (or al least, it seems so) using an alternative
> > > implementation that uses the qemu socketcan support [0] (implementation
> > > which builds on top of the work done by Harald and Mikhail):
> > >
> > > qemu-system-x86_64 -serial mon:stdio \
> > > -m 2G -smp 2 -enable-kvm \
> > > -kernel $(pwd)/BUILD.bin/arch/x86/boot/bzImage \
> > > -initrd /home/francesco/SRC/LINUX_KERNEL/initramfs.gz \
> > > -append "loglevel=7 console=ttyS0" \
> > > -object can-bus,id=canbus0 -object can-host-socketcan,id=canhost0,if=vcan0,canbus=canbus0 \
> > > -device virtio-can-pci,canbus=canbus0
> > >
> > > Unfortunately, my Rust knoweledge is not sufficient to understand the
> > > vhost-device implementation [1]; the issue seems to be related to the
> > > host->guest vring becoming empty and not refilling anymore.
> > >
> >
> > Can you try with
> > https://github.com/MatiasVara/vhost-device/commits/fix-for-923/?
>
> I'll stress test it during the night, but this seems to fix it. Before
> it was reproducible in a consistent manner after mere seconds, while i
> now in a bunch of runs I never reproduced it.
>
> I also agree with your analysis on the commit.
>
Quick update, unfortunately not good: the RX issue is solved, but now I
have a different one: if I send a single message either from the guset
or the host (e.g.: cansend can0 111#00) I get:
[ 16.496923] irq 11: nobody cared (try booting with the "irqpoll" option)
[ 16.511875] CPU: 0 UID: 0 PID: 0 Comm: swapper/0 Not tainted 6.18.0-00002-gded0a4b9da5a #29 PREEMPT(voluntary)
[ 16.511883] Hardware name: QEMU Standard PC (i440FX + PIIX, 1996), BIOS rel-1.17.0-0-gb52ca86e094d-prebuilt.qemu.org 04/01/2014
[ 16.511887] Call Trace:
[ 16.511933] <IRQ>
[ 16.511938] dump_stack_lvl+0x4d/0x70
[ 16.511973] __report_bad_irq+0x30/0xb7
[ 16.511986] note_interrupt.cold+0x28/0x66
[ 16.511988] handle_irq_event+0x6d/0x70
[ 16.512004] handle_fasteoi_irq+0xd5/0x1f0
[ 16.512011] __common_interrupt+0x3f/0xd0
[ 16.512023] ? tick_nohz_irq_exit+0x2e/0x60
[ 16.512035] common_interrupt+0x3b/0x90
[ 16.512057] asm_common_interrupt+0x26/0x40
[ 16.512073] RIP: 0010:handle_softirqs+0x6d/0x270
[ 16.512081] Code: 02 00 01 00 00 89 5c 24 14 48 89 6c 24 08 c7 44 24 10 0a 00 00 00 89 7c 24 04 31 c0 65 66 89 05 01 ce 3e 02 fb bb ff ff ff ff <49> c7 c2 c0 80 a0 a2 44 89 ed 41 0f bc dd 83 c3 01 74 76 8d 43 ff
[ 16.512082] RSP: 0018:ffffb22480003f98 EFLAGS: 00000246
[ 16.512086] RAX: 0000000000000000 RBX: 00000000ffffffff RCX: 0000000000000838
[ 16.512087] RDX: 0000000000000000 RSI: ffffffffa2a0e940 RDI: 0000000000000000
[ 16.512088] RBP: 00000000fffbac3e R08: 0000000000000001 R09: 0000000000000000
[ 16.512088] R10: ffffa2d57da249d0 R11: ffffb22480003ff8 R12: 0000000000000000
[ 16.512091] R13: 0000000000000082 R14: 0000000000000000 R15: 0000000000000000
[ 16.512097] irq_exit_rcu+0x89/0xb0
[ 16.512099] sysvec_apic_timer_interrupt+0x6b/0x80
[ 16.512103] </IRQ>
[ 16.512104] <TASK>
[ 16.512104] asm_sysvec_apic_timer_interrupt+0x1a/0x20
[ 16.512105] RIP: 0010:pv_native_safe_halt+0xf/0x20
[ 16.512107] Code: 2c 81 00 c3 cc cc cc cc 0f 1f 00 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 f3 0f 1e fa eb 07 0f 00 2d 05 cf 18 00 fb f4 <c3> cc cc cc cc 66 2e 0f 1f 84 00 00 00 00 00 66 90 90 90 90 90 90
[ 16.512109] RSP: 0018:ffffffffa2a03e80 EFLAGS: 00000212
[ 16.512110] RAX: ffffa2d5da523000 RBX: ffffffffa2a0e940 RCX: 0000000000000838
[ 16.512111] RDX: 4000000000000000 RSI: 0000000000000087 RDI: 00000000000a722c
[ 16.512111] RBP: 0000000000000000 R08: 00000000000a722c R09: ffffa2d57da249d0
[ 16.512112] R10: ffffa2d57da1bac0 R11: 0000000000000001 R12: 0000000000000000
[ 16.512112] R13: 0000000000000000 R14: 0000000000000000 R15: 0000000000014770
[ 16.512113] default_idle+0x9/0x10
[ 16.512117] default_idle_call+0x2a/0xf0
[ 16.512119] do_idle+0x1cb/0x230
[ 16.512129] cpu_startup_entry+0x24/0x30
[ 16.512130] rest_init+0xbc/0xc0
[ 16.512133] start_kernel+0x6d7/0x6e0
[ 16.512164] x86_64_start_reservations+0x24/0x30
[ 16.512172] x86_64_start_kernel+0xc8/0xd0
[ 16.512173] common_startup_64+0x13e/0x148
[ 16.512181] </TASK>
[ 16.512181] handlers:
[ 16.513166] [<00000000b61218c7>] vp_interrupt
[ 16.515096] Disabling IRQ #11
with IRQ#11 being:
# cat /proc/interrupts
CPU0 CPU1
11: 102218 0 IO-APIC 11-fasteoi virtio0
This cannot be reproduced with the old version of vhost-device. I think
it is due to the removal of the req_rx_buf variable and associated
logic: a vq kick is now being performed at every cycle of the event
loop, even if no processing happened. At guest side, this results in a
IRQ not cared for.
Francesco
next prev parent reply other threads:[~2026-01-07 23:01 UTC|newest]
Thread overview: 32+ messages / expand[flat|nested] mbox.gz Atom feed top
2025-10-29 17:40 [PATCH v6] can: virtio: Add virtio CAN driver Matias Ezequiel Vara Larsen
2025-11-03 21:37 ` Francesco Valla
2025-12-11 17:52 ` Matias Ezequiel Vara Larsen
2025-12-14 15:25 ` Francesco Valla
2025-12-18 19:51 ` Harald Mommer
2025-12-18 23:07 ` Francesco Valla
2026-01-06 17:20 ` Harald Mommer
2025-12-21 13:01 ` Michael S. Tsirkin
2025-12-26 19:45 ` Matias Ezequiel Vara Larsen
2025-12-26 15:08 ` Francesco Valla
2026-01-07 16:14 ` Matias Ezequiel Vara Larsen
2026-01-07 18:55 ` Francesco Valla
2026-01-07 23:00 ` Francesco Valla [this message]
2026-01-07 23:21 ` Matias Ezequiel Vara Larsen
2026-01-08 20:21 ` Matias Ezequiel Vara Larsen
2026-01-09 16:58 ` Francesco Valla
2025-12-26 20:52 ` Matias Ezequiel Vara Larsen
2025-12-26 22:22 ` Francesco Valla
2025-12-29 15:47 ` Matias Ezequiel Vara Larsen
2025-12-29 18:53 ` Matias Ezequiel Vara Larsen
2025-12-29 20:55 ` Francesco Valla
2025-12-31 21:08 ` Matias Ezequiel Vara Larsen
2026-01-06 18:17 ` Matias Ezequiel Vara Larsen
2026-01-06 16:50 ` Harald Mommer
2026-01-06 19:43 ` Oliver Hartkopp
2026-01-06 20:39 ` Francesco Valla
2026-01-07 15:42 ` Oliver Hartkopp
2026-01-07 16:01 ` Matias Ezequiel Vara Larsen
2025-12-12 15:35 ` Matias Ezequiel Vara Larsen
2025-12-14 7:20 ` Michael S. Tsirkin
2025-12-14 14:24 ` Francesco Valla
2025-11-17 9:49 ` Michael S. Tsirkin
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=aV7lplXiOehmw2iR@bywater \
--to=francesco@valla.it \
--cc=davem@davemloft.net \
--cc=edumazet@google.com \
--cc=harald.mommer@oss.qualcomm.com \
--cc=jasowang@redhat.com \
--cc=kuba@kernel.org \
--cc=linux-can@vger.kernel.org \
--cc=mailhol@kernel.org \
--cc=mikhail.golubev-ciuchea@oss.qualcomm.com \
--cc=mkl@pengutronix.de \
--cc=mst@redhat.com \
--cc=mvaralar@redhat.com \
--cc=pabeni@redhat.com \
--cc=sgarzare@redhat.com \
--cc=virtualization@lists.linux.dev \
--cc=wg@grandegger.com \
--cc=xuanzhuo@linux.alibaba.com \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.