From: Benjamin Poirier <bpoirier@nvidia.com>
To: David Ahern <dsahern@gmail.com>
Cc: Mike Manning <mvrmanning@gmail.com>,
Netdev <netdev@vger.kernel.org>,
Saikrishna Arcot <sarcot@microsoft.com>,
Craig Gallek <kraig@google.com>
Subject: Re: [PATCH] net: prefer socket bound to interface when not in VRF
Date: Tue, 14 Jun 2022 09:39:45 +0900 [thread overview]
Message-ID: <YqfY0WifnVQf++iY@d3> (raw)
In-Reply-To: <cb80378b-a8da-7dea-ea71-eed25a21a345@gmail.com>
On 2022-06-12 21:52 -0600, David Ahern wrote:
[...]
> >
> > Hi Mike,
> >
> > I was looking at this commit, 8d6c414cd2fb ("net: prefer socket bound to
> > interface when not in VRF"), and I get the feeling that it is only
> > partially effective. It works with UDP connected sockets but it doesn't
> > work for TCP and UDP unconnected sockets.
> >
> > The compute_score() functions are a bit misleading. Because of the
> > reuseport shortcut in their callers (inet_lhash2_lookup() and the like),
> > the first socket with score > 0 may be chosen, not necessarily the
> > socket with highest score. In order to prefer certain sockets, I think
> > an approach like commit d894ba18d4e4 ("soreuseport: fix ordering for
> > mixed v4/v6 sockets") would be needed. What do you think?
> >
> > Extra info:
> > 1) fcnal-test.sh results
> >
> > I tried to reproduce the fcnal-test.sh test results quoted above but in
> > my case the test cases already pass at 8d6c414cd2fb^ and 9e9fb7655ed5.
> > Moreover I believe those test cases don't have multiple listening
> > sockets. So that just added to my confusion.
> >
> > Running 9e9fb7655ed5,
> > root@vsid:/src/linux/tools/testing/selftests/net# ./fcnal-test.sh -t use_cases
>
> use_cases group is a catchall for bug reports. You want run all of the
> TCP and UDP cases to cover socket permutations and I know some of those
> cover dual listeners (though I can't remember ATM if that is only the
> MD5 tests).
I was talking specifically about the two test cases quoted in Mike's
original posting, in case that wasn't clear.
> If not, you can add them fairly easily and illustrate your
> point.
What about reuseport_bindtodevice.c from my previous message? Running it
on the current net/master (619c010a6539) shows:
root@vsid:~# ./reuseport_bindtodevice
IPv4 TCP ... fail
IPv4 UDP unconnected ... fail
IPv4 UDP connected ... pass
IPv6 TCP ... fail
IPv6 UDP unconnected ... fail
IPv6 UDP connected ... pass
FAIL
>
> As for compute_score, it does weight device binds a bit higher. TCP:
>
> score = sk->sk_bound_dev_if ? 2 : 1;
>
> UDP:
> if (sk->sk_bound_dev_if)
> score += 4;
I think there was a misunderstanding. A higher score does not lead to a
higher preference for a socket in many cases. That's the pitfall I tried
to describe earlier.
next prev parent reply other threads:[~2022-06-14 0:39 UTC|newest]
Thread overview: 9+ messages / expand[flat|nested] mbox.gz Atom feed top
2021-10-05 13:03 [PATCH] net: prefer socket bound to interface when not in VRF Mike Manning
2021-10-07 14:07 ` Jakub Kicinski
2021-10-07 14:10 ` David Ahern
2021-10-07 14:27 ` Jakub Kicinski
2022-06-13 3:14 ` Benjamin Poirier
2022-06-13 3:52 ` David Ahern
2022-06-14 0:39 ` Benjamin Poirier [this message]
2022-06-26 22:25 ` Mike Manning
2022-06-27 0:05 ` Benjamin Poirier
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=YqfY0WifnVQf++iY@d3 \
--to=bpoirier@nvidia.com \
--cc=dsahern@gmail.com \
--cc=kraig@google.com \
--cc=mvrmanning@gmail.com \
--cc=netdev@vger.kernel.org \
--cc=sarcot@microsoft.com \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox