From: Vishwanath Seshagiri <vishs@meta.com>
To: Xuan Zhuo <xuanzhuo@linux.alibaba.com>
Cc: "Eugenio Pérez" <eperezma@redhat.com>,
"Andrew Lunn" <andrew+netdev@lunn.ch>,
"David S . Miller" <davem@davemloft.net>,
"Eric Dumazet" <edumazet@google.com>,
"Jakub Kicinski" <kuba@kernel.org>,
"Paolo Abeni" <pabeni@redhat.com>, "David Wei" <dw@davidwei.uk>,
"Matteo Croce" <technoboy85@gmail.com>,
"Ilias Apalodimas" <ilias.apalodimas@linaro.org>,
netdev@vger.kernel.org, virtualization@lists.linux.dev,
linux-kernel@vger.kernel.org, kernel-team@meta.com,
"Michael S . Tsirkin" <mst@redhat.com>,
"Jason Wang" <jasowang@redhat.com>
Subject: Re: [PATCH v6] virtio_net: add page_pool support for buffer allocation
Date: Mon, 9 Feb 2026 11:34:27 -0800
Message-ID: <5184bd60-2a96-451e-8b0a-bc1132b1498d@meta.com>
In-Reply-To: <1770621144.341019-1-xuanzhuo@linux.alibaba.com>
On 2/8/26 11:12 PM, Xuan Zhuo wrote:
>>>>
>>>> +static void virtnet_put_page(struct receive_queue *rq, struct page *page,
>>>> +			      bool allow_direct)
>>>> +{
>>>> +	if (page_pool_page_is_pp(page))
>>>> +		page_pool_put_page(rq->page_pool, page, -1, allow_direct);
>>>> +	else
>>>> +		put_page(page);
>>>> +}
>>>
>>> Why do we need this?
>>> The caller should already know which one should be used, no?
>>>
>>
>> This was added after feedback on v4 asking to unify the alloc/free path
>> checks. But you raise a valid point - callers already know the mode via
>> virtnet_no_page_pool(). I can simplify this to call page_pool_put_page()
>> directly, since virtnet_put_page() is only reached from paths that have
>> already checked we're using page_pool. Would you prefer that?
>
>
> Based on my understanding, the big mode should directly call the Page API, while
> all other modes should directly call the PP API. Therefore, I believe it's
> better for each mode to directly invoke its respective API.
Ack, I will move this into each callsite.
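Something like this at the callsites (untested sketch; the surrounding hunks
are paraphrased, not the exact v6 context):

	mergeable / small receive paths (buffer is page_pool-backed):

	-	virtnet_put_page(rq, page, true);
	+	page_pool_put_page(rq->page_pool, page, -1, true);

	big mode teardown / error paths (plain pages, no page_pool):

	-	virtnet_put_page(rq, page, false);
	+	put_page(page);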
>>>>
>>>> +static int virtnet_create_page_pools(struct virtnet_info *vi)
>>>> +{
>>>> +	int i, err;
>>>> +
>>>> +	if (!vi->mergeable_rx_bufs && vi->big_packets)
>>>> +		return 0;
>>>> +
>>>> +	for (i = 0; i < vi->max_queue_pairs; i++) {
>>>> +		struct receive_queue *rq = &vi->rq[i];
>>>> +		struct page_pool_params pp_params = { 0 };
>>>> +		struct device *dma_dev;
>>>> +
>>>> +		if (rq->page_pool)
>>>> +			continue;
>>>> +
>>>> +		if (rq->xsk_pool)
>>>> +			continue;
>>>> +
>>>> +		pp_params.order = 0;
>>>> +		pp_params.pool_size = virtqueue_get_vring_size(rq->vq);
>>>> +		pp_params.nid = dev_to_node(vi->vdev->dev.parent);
>>>> +		pp_params.netdev = vi->dev;
>>>> +		pp_params.napi = &rq->napi;
>>>> +
>>>> +		/* Check if backend supports DMA API (e.g., vhost, virtio-pci).
>>>> +		 * If so, use page_pool's DMA mapping for premapped buffers.
>>>> +		 * Otherwise (e.g., VDUSE), page_pool only handles allocation.
>>>> +		 */
>>>> +		dma_dev = virtqueue_dma_dev(rq->vq);
>>>> +		if (dma_dev) {
>>>> +			pp_params.dev = dma_dev;
>>>> +			pp_params.flags = PP_FLAG_DMA_MAP;
>>>> +			pp_params.dma_dir = DMA_FROM_DEVICE;
>>>> +			rq->use_page_pool_dma = true;
>>>> +		} else {
>>>> +			pp_params.dev = vi->vdev->dev.parent;
>>>> +			pp_params.flags = 0;
>>>> +			rq->use_page_pool_dma = false;
>>>
>>> Can the page pool handle DMA with vi->vdev->dev.parent?
>>
>> No, we cannot use page_pool DMA with vi->vdev->dev.parent in the VDUSE
>> case because VDUSE uses its own address translation: virtqueue_dma_dev()
>> returns NULL and virtio doesn't use the standard DMA API at all. Now that
>> I think about it, setting pp_params.dev in this branch is unnecessary
>> since it is never accessed. I can remove it if you prefer.
>
> If that's the case, then it is indeed a bit troublesome. I don't know if VDUSE
> has a better solution. What I don't like is use_page_pool_dma -- it
> introduces many branches into the code, making it more chaotic. We may need to
> look for a better unified solution.
>
> Thanks.
VDUSE does not have a DMA device. virtqueue_dma_dev() returns NULL and
virtqueue_map_single_attrs() just returns virt_to_phys(). There's
nothing to map or sync. These branches exist because page_pool's DMA
APIs require a configured DMA device. I can drop the use_page_pool_dma flag
and check pool->dma_map directly to reduce state. However, some amount
of branching is unavoidable unless page_pool adds a no-op variant of its
DMA functions.
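Roughly (untested sketch; assumes struct page_pool's dma_map bit stays
visible to drivers through include/net/page_pool/types.h):

	/* replaces rq->use_page_pool_dma */
	static bool virtnet_rq_pp_maps_dma(const struct receive_queue *rq)
	{
		return rq->page_pool && rq->page_pool->dma_map;
	}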
To reach parity with vhost/virtio-pci, VDUSE would need to implement the
standard DMA API, which conflicts with its fundamental architecture since
it uses its own IOVA translation for userspace access. I don't see a way
to avoid these branches without dropping page_pool for VDUSE, which I'd
prefer not to do. Open to suggestions if I have missed something.
Thread overview: 9+ messages
2026-02-08 17:54 [PATCH v6] virtio_net: add page_pool support for buffer allocation Vishwanath Seshagiri
2026-02-08 18:42 ` Michael S. Tsirkin
2026-02-08 18:56 ` Vishwanath Seshagiri
2026-02-08 19:05 ` Michael S. Tsirkin
2026-02-08 20:40 ` Vishwanath Seshagiri
2026-02-09 2:00 ` Xuan Zhuo
2026-02-09 2:42 ` Vishwanath Seshagiri
2026-02-09 7:12 ` Xuan Zhuo
2026-02-09 19:34 ` Vishwanath Seshagiri [this message]