From: Jason Wang <jasowang@redhat.com>
To: "Peter Xu" <peterx@redhat.com>,
"Daniel P. Berrangé" <berrange@redhat.com>
Cc: "Elena Ufimtseva" <elena.ufimtseva@oracle.com>,
"John G Johnson" <john.g.johnson@oracle.com>,
"Jagannathan Raman" <jag.raman@oracle.com>,
qemu-block@nongnu.org, "Juan Quintela" <quintela@redhat.com>,
"Dr. David Alan Gilbert" <dgilbert@redhat.com>,
qemu-devel@nongnu.org, "Leonardo Bras" <leobras@redhat.com>,
"Paolo Bonzini" <pbonzini@redhat.com>,
"Marc-André Lureau" <marcandre.lureau@redhat.com>,
"Fam Zheng" <fam@euphon.net>
Subject: Re: [PATCH v1 3/3] migration: multifd: Enable zerocopy
Date: Thu, 2 Sep 2021 15:23:14 +0800 [thread overview]
Message-ID: <517ed9ad-7e80-098c-52b4-566c6644df31@redhat.com> (raw)
In-Reply-To: <YS+dxUBrhogJQkEY@t490s>
在 2021/9/1 下午11:35, Peter Xu 写道:
> On Wed, Sep 01, 2021 at 09:53:07AM +0100, Daniel P. Berrangé wrote:
>> On Tue, Aug 31, 2021 at 04:29:09PM -0400, Peter Xu wrote:
>>> On Tue, Aug 31, 2021 at 02:16:42PM +0100, Daniel P. Berrangé wrote:
>>>> On Tue, Aug 31, 2021 at 08:02:39AM -0300, Leonardo Bras wrote:
>>>>> Call qio_channel_set_zerocopy(true) in the start of every multifd thread.
>>>>>
>>>>> Change the send_write() interface of multifd, allowing it to pass down
>>>>> flags for qio_channel_write*().
>>>>>
>>>>> Pass down MSG_ZEROCOPY flag for sending memory pages, while keeping the
>>>>> other data being sent at the default copying approach.
>>>>>
>>>>> Signed-off-by: Leonardo Bras <leobras@redhat.com>
>>>>> ---
>>>>> migration/multifd-zlib.c | 7 ++++---
>>>>> migration/multifd-zstd.c | 7 ++++---
>>>>> migration/multifd.c | 9 ++++++---
>>>>> migration/multifd.h | 3 ++-
>>>>> 4 files changed, 16 insertions(+), 10 deletions(-)
>>>>> @@ -675,7 +676,8 @@ static void *multifd_send_thread(void *opaque)
>>>>> }
>>>>>
>>>>> if (used) {
>>>>> - ret = multifd_send_state->ops->send_write(p, used, &local_err);
>>>>> + ret = multifd_send_state->ops->send_write(p, used, MSG_ZEROCOPY,
>>>>> + &local_err);
>>>> I don't think it is valid to unconditionally enable this feature due to the
>>>> resource usage implications
>>>>
>>>> https://www.kernel.org/doc/html/v5.4/networking/msg_zerocopy.html
>>>>
>>>> "A zerocopy failure will return -1 with errno ENOBUFS. This happens
>>>> if the socket option was not set, the socket exceeds its optmem
>>>> limit or the user exceeds its ulimit on locked pages."
>>>>
>>>> The limit on locked pages is something that looks very likely to be
>>>> exceeded unless you happen to be running a QEMU config that already
>>>> implies locked memory (eg PCI assignment)
>>> Yes it would be great to be a migration capability in parallel to multifd. At
>>> initial phase if it's easy to be implemented on multi-fd only, we can add a
>>> dependency between the caps. In the future we can remove that dependency when
>>> the code is ready to go without multifd. Thanks,
>> Also, I'm wondering how zerocopy support interacts with kernel support
>> for kTLS and multipath-TCP, both of which we want to be able to use
>> with migration.
> Copying Jason Wang for net implications between these features on kernel side
Note that the MSG_ZEROCOPY is contributed by Google :)
> and whether they can be enabled together (MSG_ZEROCOPY, mptcp, kTLS).
I think they can. Anyway kernel can choose to do datacopy when necessary.
Note that the "zerocopy" is probably not correct here. What's better is
"Enable MSG_ZEROCOPY" since:
1) kernel supports various kinds of zerocopy, for TX, it has supported
sendfile() for many years.
2) MSG_ZEROCOPY is only used for TX but not RX
3) TCP rx zerocopy is only supported via mmap() which requires some
extra configurations e.g 4K MTU, driver support for header split etc.
[1] https://www.youtube.com/watch?v=_ZfiQGWFvg0
Thanks
>
> From the safe side we may want to only enable one of them until we prove
> they'll work together I guess..
>
> Not a immediate concern as I don't really think any of them is really
> explicitly supported in qemu.
>
> KTLS may be implicitly included by a new gnutls, but we need to mark TLS and
> ZEROCOPY mutual exclusive anyway because at least the userspace TLS code of
> gnutls won't has a way to maintain the tls buffers used by zerocopy. So at
> least we need some knob to detect whether kTLS is enabled in gnutls.
>
next prev parent reply other threads:[~2021-09-02 7:24 UTC|newest]
Thread overview: 62+ messages / expand[flat|nested] mbox.gz Atom feed top
2021-08-31 11:02 [PATCH v1 0/3] QIOChannel flags + multifd zerocopy Leonardo Bras
2021-08-31 11:02 ` [PATCH v1 1/3] io: Enable write flags for QIOChannel Leonardo Bras
2021-09-01 20:54 ` Eric Blake
2021-09-02 8:26 ` Leonardo Bras Soares Passos
2021-08-31 11:02 ` [PATCH v1 2/3] io: Add zerocopy and errqueue Leonardo Bras
2021-08-31 12:57 ` Daniel P. Berrangé
2021-08-31 20:27 ` Peter Xu
2021-09-01 8:50 ` Daniel P. Berrangé
2021-09-01 15:52 ` Peter Xu
2021-09-01 15:59 ` Daniel P. Berrangé
2021-09-02 7:07 ` Leonardo Bras Soares Passos
2021-09-02 6:59 ` Leonardo Bras Soares Passos
2021-09-07 16:44 ` Peter Xu
2021-09-08 20:13 ` Leonardo Bras Soares Passos
2021-09-08 21:04 ` Peter Xu
2021-09-02 6:38 ` Leonardo Bras Soares Passos
2021-09-02 8:47 ` Daniel P. Berrangé
2021-09-02 9:34 ` Leonardo Bras Soares Passos
2021-09-02 9:49 ` Daniel P. Berrangé
2021-09-02 10:19 ` Leonardo Bras Soares Passos
2021-09-02 10:28 ` Daniel P. Berrangé
2021-09-07 11:06 ` Dr. David Alan Gilbert
2021-09-07 18:09 ` Peter Xu
2021-09-08 8:30 ` Dr. David Alan Gilbert
2021-09-08 15:24 ` Peter Xu
2021-09-09 8:49 ` Dr. David Alan Gilbert
2021-09-08 20:25 ` Leonardo Bras Soares Passos
2021-09-08 21:09 ` Peter Xu
2021-09-08 21:57 ` Daniel P. Berrangé
2021-09-09 2:05 ` Peter Xu
2021-09-09 4:58 ` Leonardo Bras Soares Passos
2021-09-09 16:40 ` Peter Xu
2021-08-31 11:02 ` [PATCH v1 3/3] migration: multifd: Enable zerocopy Leonardo Bras
2021-08-31 13:16 ` Daniel P. Berrangé
2021-08-31 20:29 ` Peter Xu
2021-09-01 8:53 ` Daniel P. Berrangé
2021-09-01 15:35 ` Peter Xu
2021-09-01 15:44 ` Daniel P. Berrangé
2021-09-01 16:01 ` Peter Xu
2021-09-02 7:57 ` Leonardo Bras Soares Passos
2021-09-07 11:13 ` Dr. David Alan Gilbert
2021-09-08 15:26 ` Daniel P. Berrangé
2021-09-02 7:23 ` Jason Wang [this message]
2021-09-02 8:08 ` Leonardo Bras Soares Passos
2021-09-02 7:27 ` Leonardo Bras Soares Passos
2021-09-02 7:22 ` Leonardo Bras Soares Passos
2021-09-02 8:20 ` Daniel P. Berrangé
2021-09-02 8:52 ` Leonardo Bras Soares Passos
2021-09-02 9:20 ` Daniel P. Berrangé
2021-09-02 9:49 ` Leonardo Bras Soares Passos
2021-09-02 9:59 ` Daniel P. Berrangé
2021-09-02 10:25 ` Leonardo Bras Soares Passos
2021-09-07 11:17 ` Dr. David Alan Gilbert
2021-09-07 18:32 ` Peter Xu
2021-09-08 2:59 ` Jason Wang
2021-09-08 3:24 ` Peter Xu
2021-09-08 3:26 ` Jason Wang
2021-09-08 8:19 ` Dr. David Alan Gilbert
2021-09-08 15:19 ` Peter Xu
2021-09-09 1:10 ` Jason Wang
2021-08-31 21:24 ` [PATCH v1 0/3] QIOChannel flags + multifd zerocopy Peter Xu
2021-09-01 19:21 ` Leonardo Bras Soares Passos
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=517ed9ad-7e80-098c-52b4-566c6644df31@redhat.com \
--to=jasowang@redhat.com \
--cc=berrange@redhat.com \
--cc=dgilbert@redhat.com \
--cc=elena.ufimtseva@oracle.com \
--cc=fam@euphon.net \
--cc=jag.raman@oracle.com \
--cc=john.g.johnson@oracle.com \
--cc=leobras@redhat.com \
--cc=marcandre.lureau@redhat.com \
--cc=pbonzini@redhat.com \
--cc=peterx@redhat.com \
--cc=qemu-block@nongnu.org \
--cc=qemu-devel@nongnu.org \
--cc=quintela@redhat.com \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).