From: Sowmini Varadhan <sowmini.varadhan@oracle.com>
To: Willem de Bruijn <willemdebruijn.kernel@gmail.com>
Cc: Network Development <netdev@vger.kernel.org>,
David Miller <davem@davemloft.net>,
Santosh Shilimkar <santosh.shilimkar@oracle.com>
Subject: Re: [PATCH V2 net-next 2/3] rds: deliver zerocopy completion notification with data
Date: Sun, 25 Feb 2018 11:20:27 -0500 [thread overview]
Message-ID: <20180225162027.GA32483@oracle.com> (raw)
In-Reply-To: <CAF=yD-JEYZ=nw0eFNTHJW3zZVCzYRtJ0yGL90_Mgjay4HcVLXQ@mail.gmail.com>
On (02/25/18 10:56), Willem de Bruijn wrote:
> > @@ -91,22 +85,19 @@ static void rds_rm_zerocopy_callback(struct rds_sock *rs,
> > spin_unlock_irqrestore(&q->lock, flags);
> > mm_unaccount_pinned_pages(&znotif->z_mmp);
> > consume_skb(rds_skb_from_znotifier(znotif));
> > - sk->sk_error_report(sk);
> > + /* caller should wake up POLLIN */
>
> sk->sk_data_ready(sk);
yes, this was my first thought, but everything else in rds
is calling rds_wake_sk_sleep (this is even done in
rds_recv_incoming(), which actually queues up the data),
so I chose to align with that model (and call this in the caller
of rds_rm_zerocopy_callback()
> Without the error queue, the struct no longer needs to be an skb,
> per se. Converting to a different struct with list_head is definitely
> a longer patch. But kmalloc will be cheaper than alloc_skb.
> Perhaps something to try (as separate follow-on work).
right, I was thinking along these exact lines as well,
and was already planning a follow-up.
> > + if (!sock_flag(rds_rs_to_sk(rs), SOCK_ZEROCOPY) || !skb_peek(q))
> > + return 0;
>
> Racy read?
Can you elaborate? I only put the skb_peek to quickly
bail for sockets that are not using zerocopy at all-
if you race against something that's queuing data, and
miss it on the peek, the next read/recv should find it.
Am I missing some race?
>
> > +
> > + if (!msg->msg_control ||
>
> I'd move this first, so that the cookie queue need not even be probed
> in the common case.
you mean before the check for SOCK_ZEROCOPY?
> > + msg->msg_controllen < CMSG_SPACE(sizeof(*done)))
> > + return 0;
>
> if caller does not satisfy the contract on controllen size, can be
> more explicit and return an error.
if SOCK_ZEROCOPY has been set, but the recv did not specify a cmsghdr,
you mean?
> > + ncookies = rds_recvmsg_zcookie(rs, msg);
Will take care of the remaining comments in V3.
next prev parent reply other threads:[~2018-02-25 16:20 UTC|newest]
Thread overview: 8+ messages / expand[flat|nested] mbox.gz Atom feed top
2018-02-23 22:08 [PATCH V2 net-next 0/3] RDS: optimized notification for zerocopy completion Sowmini Varadhan
2018-02-23 22:08 ` [PATCH V2 net-next 1/3] selftests/net: revert the zerocopy Rx path for PF_RDS Sowmini Varadhan
2018-02-23 22:08 ` [PATCH V2 net-next 2/3] rds: deliver zerocopy completion notification with data Sowmini Varadhan
2018-02-25 15:56 ` Willem de Bruijn
2018-02-25 16:20 ` Sowmini Varadhan [this message]
2018-02-25 17:36 ` Willem de Bruijn
2018-02-23 22:08 ` [PATCH V2 net-next 3/3] selftests/net: reap zerocopy completions passed up as ancillary data Sowmini Varadhan
2018-02-25 15:56 ` Willem de Bruijn
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=20180225162027.GA32483@oracle.com \
--to=sowmini.varadhan@oracle.com \
--cc=davem@davemloft.net \
--cc=netdev@vger.kernel.org \
--cc=santosh.shilimkar@oracle.com \
--cc=willemdebruijn.kernel@gmail.com \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).