netdev.vger.kernel.org archive mirror
 help / color / mirror / Atom feed
From: Maciej Fijalkowski <maciej.fijalkowski@intel.com>
To: Andrii Nakryiko <andrii.nakryiko@gmail.com>
Cc: bpf <bpf@vger.kernel.org>, Networking <netdev@vger.kernel.org>,
	"Daniel Borkmann" <daniel@iogearbox.net>,
	"Alexei Starovoitov" <ast@kernel.org>,
	"Björn Töpel" <bjorn.topel@intel.com>,
	"Magnus Karlsson" <magnus.karlsson@intel.com>,
	ciara.loftus@intel.com,
	"john fastabend" <john.fastabend@gmail.com>,
	"Toke Høiland-Jørgensen" <toke@redhat.com>
Subject: Re: [PATCH v2 bpf-next 06/17] libbpf: xsk: use bpf_link
Date: Mon, 22 Mar 2021 21:09:22 +0100	[thread overview]
Message-ID: <20210322200922.GA56104@ranger.igk.intel.com> (raw)
In-Reply-To: <CAEf4Bza-pGTS+vmE5SvuMtEptGxS5wSbW2d0K34nvt9StG3C8A@mail.gmail.com>

On Mon, Mar 15, 2021 at 10:34:11PM -0700, Andrii Nakryiko wrote:
> On Thu, Mar 11, 2021 at 7:42 AM Maciej Fijalkowski
> <maciej.fijalkowski@intel.com> wrote:
> >
> > Currently, if there are multiple xdpsock instances running on a single
> > interface and in case one of the instances is terminated, the rest of
> > them are left in an inoperable state due to the fact of unloaded XDP
> > prog from interface.
> >
> > Consider the scenario below:
> >
> > // load xdp prog and xskmap and add entry to xskmap at idx 10
> > $ sudo ./xdpsock -i ens801f0 -t -q 10
> >
> > // add entry to xskmap at idx 11
> > $ sudo ./xdpsock -i ens801f0 -t -q 11
> >
> > terminate one of the processes and another one is unable to work due to
> > the fact that the XDP prog was unloaded from interface.
> >
> > To address that, step away from setting bpf prog in favour of bpf_link.
> > This means that refcounting of BPF resources will be done automatically
> > by bpf_link itself.
> >
> > When setting up BPF resources during xsk socket creation, check whether
> > bpf_link for a given ifindex already exists via set of calls to
> > bpf_link_get_next_id -> bpf_link_get_fd_by_id -> bpf_obj_get_info_by_fd
> > and comparing the ifindexes from bpf_link and xsk socket.
> >
> > If there's no bpf_link yet, create one for a given XDP prog. If bpf_link
> > is already at a given ifindex and underlying program is not AF-XDP one,
> > bail out or update the bpf_link's prog given the presence of
> > XDP_FLAGS_UPDATE_IF_NOEXIST.
> >
> > If there's netlink-based XDP prog running on a interface, bail out and
> > ask user to do removal by himself.
> >
> > Signed-off-by: Maciej Fijalkowski <maciej.fijalkowski@intel.com>
> > ---
> >  tools/lib/bpf/xsk.c | 139 ++++++++++++++++++++++++++++++++++++++------
> >  1 file changed, 120 insertions(+), 19 deletions(-)
> >
> 
> [...]
> 
> > +static int xsk_link_lookup(struct xsk_ctx *ctx, __u32 *prog_id)
> > +{
> > +       struct bpf_link_info link_info;
> > +       __u32 link_len;
> > +       __u32 id = 0;
> > +       int err;
> > +       int fd;
> > +
> > +       while (true) {
> > +               err = bpf_link_get_next_id(id, &id);
> > +               if (err) {
> > +                       if (errno == ENOENT) {
> > +                               err = 0;
> > +                               break;
> > +                       }
> > +                       pr_warn("can't get next link: %s\n", strerror(errno));
> > +                       break;
> > +               }
> > +
> > +               fd = bpf_link_get_fd_by_id(id);
> > +               if (fd < 0) {
> > +                       if (errno == ENOENT)
> > +                               continue;
> > +                       pr_warn("can't get link by id (%u): %s\n", id, strerror(errno));
> > +                       err = -errno;
> > +                       break;
> > +               }
> > +
> > +               link_len = sizeof(struct bpf_link_info);
> > +               memset(&link_info, 0, link_len);
> > +               err = bpf_obj_get_info_by_fd(fd, &link_info, &link_len);
> > +               if (err) {
> > +                       pr_warn("can't get link info: %s\n", strerror(errno));
> > +                       close(fd);
> > +                       break;
> > +               }
> > +               if (link_info.xdp.ifindex == ctx->ifindex) {
> 
> how do you know you are looking at XDP bpf_link? link_info.xdp.ifindex
> might as well be attach_type for tracing bpf_linke, netns_ino for
> netns bpf_link, and so on. Do check link_info.type before check other
> per-link type properties.

My mistake, good that you brought that up. I'll fix it.

> 
> > +                       ctx->link_fd = fd;
> > +                       *prog_id = link_info.prog_id;
> > +                       break;
> > +               }
> > +               close(fd);
> > +       }
> > +
> > +       return err;
> > +}
> > +
> >  static int __xsk_setup_xdp_prog(struct xsk_socket *_xdp,
> >                                 int *xsks_map_fd)
> >  {
> > @@ -675,8 +777,7 @@ static int __xsk_setup_xdp_prog(struct xsk_socket *_xdp,
> >         __u32 prog_id = 0;
> >         int err;
> >
> > -       err = bpf_get_link_xdp_id(ctx->ifindex, &prog_id,
> > -                                 xsk->config.xdp_flags);
> > +       err = xsk_link_lookup(ctx, &prog_id);
> >         if (err)
> >                 return err;
> >
> > @@ -686,9 +787,12 @@ static int __xsk_setup_xdp_prog(struct xsk_socket *_xdp,
> >                         return err;
> >
> >                 err = xsk_load_xdp_prog(xsk);
> > -               if (err) {
> > +               if (err)
> >                         goto err_load_xdp_prog;
> > -               }
> > +
> > +               err = xsk_create_bpf_link(xsk);
> > +               if (err)
> > +                       goto err_create_bpf_link;
> 
> what about the backwards compatibility with kernels that don't yet
> support bpf_link?

For that I'll be trying to create or lookup bpf_link on loopback device.
If it failed in any way, then use netlink based logic as the underlying
system doesn't support bpf_link.

Once again, thanks for catching it!

> 
> >         } else {
> >                 ctx->prog_fd = bpf_prog_get_fd_by_id(prog_id);
> >                 if (ctx->prog_fd < 0)
> 
> [...]

  reply	other threads:[~2021-03-22 20:20 UTC|newest]

Thread overview: 25+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2021-03-11 15:28 [PATCH v2 bpf-next 00/17] AF_XDP selftests improvements & bpf_link Maciej Fijalkowski
2021-03-11 15:28 ` [PATCH v2 bpf-next 01/17] selftests: xsk: don't call worker_pkt_dump() for stats test Maciej Fijalkowski
2021-03-11 15:28 ` [PATCH v2 bpf-next 02/17] selftests: xsk: remove struct ifaceconfigobj Maciej Fijalkowski
2021-03-11 15:28 ` [PATCH v2 bpf-next 03/17] selftests: xsk: remove unused function Maciej Fijalkowski
2021-03-11 15:28 ` [PATCH v2 bpf-next 04/17] selftests: xsk: remove inline keyword from source file Maciej Fijalkowski
2021-03-11 15:28 ` [PATCH v2 bpf-next 05/17] selftests: xsk: simplify frame traversal in dumping thread Maciej Fijalkowski
2021-03-11 15:28 ` [PATCH v2 bpf-next 06/17] libbpf: xsk: use bpf_link Maciej Fijalkowski
2021-03-16  5:34   ` Andrii Nakryiko
2021-03-22 20:09     ` Maciej Fijalkowski [this message]
2021-03-11 15:29 ` [PATCH v2 bpf-next 07/17] samples: bpf: do not unload prog within xdpsock Maciej Fijalkowski
2021-03-16  5:35   ` Andrii Nakryiko
2021-03-11 15:29 ` [PATCH v2 bpf-next 08/17] selftests: xsk: remove thread for netns switch Maciej Fijalkowski
2021-03-11 15:29 ` [PATCH v2 bpf-next 09/17] selftests: xsk: split worker thread Maciej Fijalkowski
2021-03-11 15:29 ` [PATCH v2 bpf-next 10/17] selftests: xsk: remove Tx synchronization resources Maciej Fijalkowski
2021-03-11 15:29 ` [PATCH v2 bpf-next 11/17] selftests: xsk: refactor teardown/bidi test cases and testapp_validate Maciej Fijalkowski
2021-03-11 15:29 ` [PATCH v2 bpf-next 12/17] selftests: xsk: remove sync_mutex_tx and atomic var Maciej Fijalkowski
2021-03-11 15:29 ` [PATCH v2 bpf-next 13/17] veth: implement ethtool's get_channels() callback Maciej Fijalkowski
2021-03-16  8:44   ` Magnus Karlsson
2021-03-22 20:10     ` Maciej Fijalkowski
2021-03-11 15:29 ` [PATCH v2 bpf-next 14/17] selftests: xsk: implement bpf_link test Maciej Fijalkowski
2021-03-16  5:39   ` Andrii Nakryiko
2021-03-22 20:11     ` Maciej Fijalkowski
2021-03-11 15:29 ` [PATCH v2 bpf-next 15/17] selftests: xsk: remove thread attribute Maciej Fijalkowski
2021-03-11 15:29 ` [PATCH v2 bpf-next 16/17] selftest: xsk: Remove mutex and condition variable Maciej Fijalkowski
2021-03-11 15:29 ` [PATCH v2 bpf-next 17/17] selftests: xsk: Remove unused defines Maciej Fijalkowski

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=20210322200922.GA56104@ranger.igk.intel.com \
    --to=maciej.fijalkowski@intel.com \
    --cc=andrii.nakryiko@gmail.com \
    --cc=ast@kernel.org \
    --cc=bjorn.topel@intel.com \
    --cc=bpf@vger.kernel.org \
    --cc=ciara.loftus@intel.com \
    --cc=daniel@iogearbox.net \
    --cc=john.fastabend@gmail.com \
    --cc=magnus.karlsson@intel.com \
    --cc=netdev@vger.kernel.org \
    --cc=toke@redhat.com \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).