From: Akihiko Odaki <akihiko.odaki@daynix.com>
To: Antoine Damhet <adamhet@scaleway.com>
Cc: qemu-devel@nongnu.org, "Michael S. Tsirkin" <mst@redhat.com>,
Jason Wang <jasowang@redhat.com>,
devel@daynix.com, qemu-stable@nongnu.org
Subject: Re: [PATCH] virtio-net: Copy all for dhclient workaround
Date: Sat, 19 Apr 2025 15:56:35 +0900 [thread overview]
Message-ID: <f67a4bd1-9d81-4e81-84a8-8f7e519926b9@daynix.com> (raw)
In-Reply-To: <pssex66ivae3kkxo7rwxo2mnroit7zpnirxis6eu56b2scaj3h@2flhgmzcxvy6>
On 2025/04/11 22:20, Antoine Damhet wrote:
> On Fri, Apr 11, 2025 at 05:01:01PM +0900, Akihiko Odaki wrote:
>> On 2025/04/07 17:29, Antoine Damhet wrote:
>>> On Sat, Apr 05, 2025 at 05:04:28PM +0900, Akihiko Odaki wrote:
>>>> The goal of commit 7987d2be5a8b ("virtio-net: Copy received header to
>>>> buffer") was to remove the need to patch the (const) input buffer with a
>>>> recomputed UDP checksum by copying headers to a RW region and inject the
>>>> checksum there. The patch computed the checksum only from the header
>>>> fields (missing the rest of the payload) producing an invalid one
>>>> and making guests fail to acquire a DHCP lease.
>>>>
>>>> Fix the issue by copying the entire packet instead of only copying the
>>>> headers.
>>>>
>>>> Fixes: 7987d2be5a8b ("virtio-net: Copy received header to buffer")
>>>> Resolves: https://gitlab.com/qemu-project/qemu/-/issues/2727
>>>> Cc: qemu-stable@nongnu.org
>>>> Signed-off-by: Akihiko Odaki <akihiko.odaki@daynix.com>
>>>
>>> Tested-By: Antoine Damhet <adamhet@scaleway.com>
>>>
>>>> ---
>>>> This patch aims to resolves the issue the following one also does:
>>>> https://lore.kernel.org/qemu-devel/20250404151835.328368-1-adamhet@scaleway.com
>>>>
>>>> The difference from the mentioned patch is that this patch also
>>>> preserves that the original intent of regressing change, which is to
>>>> remove the need to patch the (const) input buffer with a recomputed UDP
>>>> checksum.
>>>>
>>>> To Antoine Damhet:
>>>> I confirmed that DHCP is currently not working and this patch fixes the
>>>> issue, but I would appreciate if you also confirm the fix as I already
>>>> have done testing badly for the regressing patch.
>>>
>>> Thanks for the swift response, ideally I'd like a non-regression test in
>>> the testsuite but a quick test showed me that I couldn't easily
>>> reproduce with user networking so unless someone has a great idea it
>>> would be a pain.
>>>
>>>> ---
>>>> hw/net/virtio-net.c | 35 ++++++++++++++++-------------------
>>>> 1 file changed, 16 insertions(+), 19 deletions(-)
>>>>
>>>> diff --git a/hw/net/virtio-net.c b/hw/net/virtio-net.c
>>>> index de87cfadffe1..a920358a89c5 100644
>>>> --- a/hw/net/virtio-net.c
>>>> +++ b/hw/net/virtio-net.c
>>>> @@ -1687,6 +1687,11 @@ static void virtio_net_hdr_swap(VirtIODevice *vdev, struct virtio_net_hdr *hdr)
>>>> virtio_tswap16s(vdev, &hdr->csum_offset);
>>>> }
>>>> +typedef struct Header {
>>>> + struct virtio_net_hdr_v1_hash virtio_net;
>>>> + uint8_t payload[1500];
>>>> +} Header;
>>>> +
>>>> /* dhclient uses AF_PACKET but doesn't pass auxdata to the kernel so
>>>> * it never finds out that the packets don't have valid checksums. This
>>>> * causes dhclient to get upset. Fedora's carried a patch for ages to
>>>> @@ -1701,7 +1706,7 @@ static void virtio_net_hdr_swap(VirtIODevice *vdev, struct virtio_net_hdr *hdr)
>>>> * we should provide a mechanism to disable it to avoid polluting the host
>>>> * cache.
>>>> */
>>>> -static void work_around_broken_dhclient(struct virtio_net_hdr *hdr,
>>>> +static void work_around_broken_dhclient(struct Header *hdr,
>>>> size_t *hdr_len, const uint8_t *buf,
>>>> size_t buf_size, size_t *buf_offset)
>>>> {
>>>> @@ -1711,20 +1716,20 @@ static void work_around_broken_dhclient(struct virtio_net_hdr *hdr,
>>>> buf += *buf_offset;
>>>> buf_size -= *buf_offset;
>>>> - if ((hdr->flags & VIRTIO_NET_HDR_F_NEEDS_CSUM) && /* missing csum */
>>>> - (buf_size >= csum_size && buf_size < 1500) && /* normal sized MTU */
>>>> + if ((hdr->virtio_net.hdr.flags & VIRTIO_NET_HDR_F_NEEDS_CSUM) && /* missing csum */
>>>> + (buf_size >= csum_size && buf_size < sizeof(hdr->payload)) && /* normal sized MTU */
>>>> (buf[12] == 0x08 && buf[13] == 0x00) && /* ethertype == IPv4 */
>>>> (buf[23] == 17) && /* ip.protocol == UDP */
>>>> (buf[34] == 0 && buf[35] == 67)) { /* udp.srcport == bootps */
>>>> - memcpy((uint8_t *)hdr + *hdr_len, buf, csum_size);
>>>> - net_checksum_calculate((uint8_t *)hdr + *hdr_len, csum_size, CSUM_UDP);
>>>> - hdr->flags &= ~VIRTIO_NET_HDR_F_NEEDS_CSUM;
>>>> - *hdr_len += csum_size;
>>>> - *buf_offset += csum_size;
>>>> + memcpy((uint8_t *)hdr + *hdr_len, buf, buf_size);
>>>> + net_checksum_calculate((uint8_t *)hdr + *hdr_len, buf_size, CSUM_UDP);
>>>> + hdr->virtio_net.hdr.flags &= ~VIRTIO_NET_HDR_F_NEEDS_CSUM;
>>>> + *hdr_len += buf_size;
>>>> + *buf_offset += buf_size;
>>>> }
>>>> }
>>>> -static size_t receive_header(VirtIONet *n, struct virtio_net_hdr *hdr,
>>>> +static size_t receive_header(VirtIONet *n, Header *hdr,
>>>> const void *buf, size_t buf_size,
>>>> size_t *buf_offset)
>>>
>>> `receive_header` can now "receive" the whole packet that's kinda
>>> misleading. I though another approach would be to only do the
>>> detection/flag patching from receive_header and recompute the checksum
>>> directly in the final `iov`, this would also eliminate the extra payload
>>> copy.
>>
>> It is possible to avoid copying but I chose not to do that because this is
>> not a hot path and the code complexity required for that does not look
>> worthwhile for me.
>
> Understood and OK.
>
>>
>> But I agree that the names of receive_header() and Header structure are
>> misleading. The reasoning I used to convince myself is that the "Header" is
>> at the head of the packet at least. I'd like to hear if you have an idea of
>> better naming; otherwise I would rather leave it as is.
>
> Maybe we can sidestep this entirely, do we need to do the workaround
> _inside_ `receive_header` ? WDYT of the following pseudocode:
>
> ```
> guest_offset = receive_header(&header);
> iov_from_buf(&header);
> work_around_broken_dhclient(&header, &payload);
> iov_from_buf(&payload);
> ```
net_checksum_calculate() currently needs a contiguous buffer so it needs
to be changed and it also requires one additional iov_from_buf() call.
It's a bit too complicated to workaround the naming problem I think.
>
> If not maybe something along the line of "PacketPrefix" or
> "PacketStart".
Now I'm inclined for "PacketPrefix". In a normal context, "prefix" and
"start" are no different from "header", but in the networking context,
"header" is frequently used to describe the metadata and implies it
doesn't contain data. Usually I don't like to choose wordings according
to such an implied nuance, but avoiding the word "header" here has a
practical value.
I'll probably choose "prefix" instead of "start" since it sounds more
specific than "start".
Regards,
Akihiko Odaki
>
> Regards,
>
next prev parent reply other threads:[~2025-04-19 6:57 UTC|newest]
Thread overview: 8+ messages / expand[flat|nested] mbox.gz Atom feed top
2025-04-05 8:04 [PATCH] virtio-net: Copy all for dhclient workaround Akihiko Odaki
2025-04-07 2:09 ` Jason Wang
2025-04-07 8:29 ` Antoine Damhet
2025-04-11 8:01 ` Akihiko Odaki
2025-04-11 13:20 ` Antoine Damhet
2025-04-19 6:56 ` Akihiko Odaki [this message]
2025-05-12 10:11 ` Michael Tokarev
2025-05-14 2:51 ` Jason Wang
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=f67a4bd1-9d81-4e81-84a8-8f7e519926b9@daynix.com \
--to=akihiko.odaki@daynix.com \
--cc=adamhet@scaleway.com \
--cc=devel@daynix.com \
--cc=jasowang@redhat.com \
--cc=mst@redhat.com \
--cc=qemu-devel@nongnu.org \
--cc=qemu-stable@nongnu.org \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).