From: Simon Horman <horms@kernel.org>
To: Eric Dumazet <edumazet@google.com>
Cc: "David S. Miller" <davem@davemloft.net>,
Jakub Kicinski <kuba@kernel.org>, Paolo Abeni <pabeni@redhat.com>,
Jamal Hadi Salim <jhs@mojatatu.com>,
Cong Wang <xiyou.wangcong@gmail.com>,
Jiri Pirko <jiri@resnulli.us>,
netdev@vger.kernel.org, eric.dumazet@gmail.com
Subject: Re: [PATCH net-next 01/14] net_sched: sch_fq: implement lockless fq_dump()
Date: Wed, 17 Apr 2024 10:00:46 +0100 [thread overview]
Message-ID: <20240417090046.GB3846178@kernel.org> (raw)
In-Reply-To: <CANn89i+iNKvCv+RPtCa4KOY9DCEQJfGP9xHSedFUbWZHt2DSFw@mail.gmail.com>
On Wed, Apr 17, 2024 at 10:45:09AM +0200, Eric Dumazet wrote:
> On Tue, Apr 16, 2024 at 8:33 PM Eric Dumazet <edumazet@google.com> wrote:
> >
> > On Tue, Apr 16, 2024 at 8:19 PM Simon Horman <horms@kernel.org> wrote:
> > >
> > > On Mon, Apr 15, 2024 at 01:20:41PM +0000, Eric Dumazet wrote:
> > > > Instead of relying on RTNL, fq_dump() can use READ_ONCE()
> > > > annotations, paired with WRITE_ONCE() in fq_change()
> > > >
> > > > Signed-off-by: Eric Dumazet <edumazet@google.com>
> > > > ---
> > > > net/sched/sch_fq.c | 96 +++++++++++++++++++++++++++++-----------------
> > > > 1 file changed, 60 insertions(+), 36 deletions(-)
> > > >
> > > > diff --git a/net/sched/sch_fq.c b/net/sched/sch_fq.c
> > > > index cdf23ff16f40bf244bb822e76016fde44e0c439b..934c220b3f4336dc2f70af74d7758218492b675d 100644
> > > > --- a/net/sched/sch_fq.c
> > > > +++ b/net/sched/sch_fq.c
> > > > @@ -888,7 +888,7 @@ static int fq_resize(struct Qdisc *sch, u32 log)
> > > > fq_rehash(q, old_fq_root, q->fq_trees_log, array, log);
> > > >
> > > > q->fq_root = array;
> > > > - q->fq_trees_log = log;
> > > > + WRITE_ONCE(q->fq_trees_log, log);
> > > >
> > > > sch_tree_unlock(sch);
> > > >
> > > > @@ -931,7 +931,7 @@ static void fq_prio2band_compress_crumb(const u8 *in, u8 *out)
> > > >
> > > > memset(out, 0, num_elems / 4);
> > > > for (i = 0; i < num_elems; i++)
> > > > - out[i / 4] |= in[i] << (2 * (i & 0x3));
> > > > + out[i / 4] |= READ_ONCE(in[i]) << (2 * (i & 0x3));
> > > > }
> > > >
> > >
> > > Hi Eric,
> > >
> > > I am a little unsure about the handling of q->prio2band in this patch.
> > >
> > > It seems to me that fq_prio2band_compress_crumb() is used to
> > > to store values in q->prio2band, and is called (indirectly)
> > > from fq_change() (and directly from fq_init()).
> > >
> > > While fq_prio2band_decompress_crumb() is used to read values
> > > from q->prio2band, and is called from fq_dump().
> > >
> > > So I am wondering if should use WRITE_ONCE() when storing elements
> > > of out. And fq_prio2band_decompress_crumb should use READ_ONCE when
> > > reading elements of in.
> >
> > Yeah, you are probably right, I recall being a bit lazy on this part,
> > thanks !
>
> I will squash in V2 this part :
>
> diff --git a/net/sched/sch_fq.c b/net/sched/sch_fq.c
> index 934c220b3f4336dc2f70af74d7758218492b675d..238974725679327b0a0d483c011e15fc94ab0878
> 100644
> --- a/net/sched/sch_fq.c
> +++ b/net/sched/sch_fq.c
> @@ -106,6 +106,8 @@ struct fq_perband_flows {
> int quantum; /* based on band nr : 576KB, 192KB, 64KB */
> };
>
> +#define FQ_PRIO2BAND_CRUMB_SIZE ((TC_PRIO_MAX + 1) >> 2)
> +
> struct fq_sched_data {
> /* Read mostly cache line */
>
> @@ -122,7 +124,7 @@ struct fq_sched_data {
> u8 rate_enable;
> u8 fq_trees_log;
> u8 horizon_drop;
> - u8 prio2band[(TC_PRIO_MAX + 1) >> 2];
> + u8 prio2band[FQ_PRIO2BAND_CRUMB_SIZE];
> u32 timer_slack; /* hrtimer slack in ns */
>
> /* Read/Write fields. */
> @@ -159,7 +161,7 @@ struct fq_sched_data {
> /* return the i-th 2-bit value ("crumb") */
> static u8 fq_prio2band(const u8 *prio2band, unsigned int prio)
> {
> - return (prio2band[prio / 4] >> (2 * (prio & 0x3))) & 0x3;
> + return (READ_ONCE(prio2band[prio / 4]) >> (2 * (prio & 0x3))) & 0x3;
> }
Thanks Eric,
assuming that it is ok for this version of fq_prio2band() to run
from fq_enqueue(), this update looks good to me.
>
> /*
> @@ -927,11 +929,15 @@ static const struct nla_policy
> fq_policy[TCA_FQ_MAX + 1] = {
> static void fq_prio2band_compress_crumb(const u8 *in, u8 *out)
> {
> const int num_elems = TC_PRIO_MAX + 1;
> + u8 tmp[FQ_PRIO2BAND_CRUMB_SIZE];
> int i;
>
> - memset(out, 0, num_elems / 4);
> + memset(tmp, 0, sizeof(tmp));
> for (i = 0; i < num_elems; i++)
> - out[i / 4] |= READ_ONCE(in[i]) << (2 * (i & 0x3));
> + tmp[i / 4] |= in[i] << (2 * (i & 0x3));
> +
> + for (i = 0; i < FQ_PRIO2BAND_CRUMB_SIZE; i++)
> + WRITE_ONCE(out[i], tmp[i]);
> }
>
> static void fq_prio2band_decompress_crumb(const u8 *in, u8 *out)
>
next prev parent reply other threads:[~2024-04-17 9:00 UTC|newest]
Thread overview: 44+ messages / expand[flat|nested] mbox.gz Atom feed top
2024-04-15 13:20 [PATCH net-next 00/14] net_sched: first series for RTNL-less qdisc dumps Eric Dumazet
2024-04-15 13:20 ` [PATCH net-next 01/14] net_sched: sch_fq: implement lockless fq_dump() Eric Dumazet
2024-04-16 18:19 ` Simon Horman
2024-04-16 18:33 ` Eric Dumazet
2024-04-17 8:45 ` Eric Dumazet
2024-04-17 9:00 ` Simon Horman [this message]
2024-04-17 9:02 ` Eric Dumazet
2024-04-17 9:23 ` Simon Horman
2024-04-15 13:20 ` [PATCH net-next 02/14] net_sched: cake: implement lockless cake_dump() Eric Dumazet
2024-04-17 8:35 ` Simon Horman
2024-04-17 8:54 ` Eric Dumazet
2024-04-17 9:24 ` Simon Horman
2024-04-17 12:25 ` Toke Høiland-Jørgensen
2024-04-15 13:20 ` [PATCH net-next 03/14] net_sched: sch_cbs: implement lockless cbs_dump() Eric Dumazet
2024-04-17 9:27 ` Simon Horman
2024-04-15 13:20 ` [PATCH net-next 04/14] net_sched: sch_choke: implement lockless choke_dump() Eric Dumazet
2024-04-17 13:14 ` Simon Horman
2024-04-17 13:41 ` Eric Dumazet
2024-04-17 14:44 ` Simon Horman
2024-04-15 13:20 ` [PATCH net-next 05/14] net_sched: sch_codel: implement lockless codel_dump() Eric Dumazet
2024-04-17 15:59 ` Simon Horman
2024-04-17 16:05 ` Eric Dumazet
2024-04-17 16:21 ` Simon Horman
2024-04-15 13:20 ` [PATCH net-next 06/14] net_sched: sch_tfs: implement lockless etf_dump() Eric Dumazet
2024-04-17 16:27 ` Simon Horman
2024-04-15 13:20 ` [PATCH net-next 07/14] net_sched: sch_ets: implement lockless ets_dump() Eric Dumazet
2024-04-17 16:54 ` Simon Horman
2024-04-17 17:08 ` Eric Dumazet
2024-04-17 17:17 ` Simon Horman
2024-04-15 13:20 ` [PATCH net-next 08/14] net_sched: sch_fifo: implement lockless __fifo_dump() Eric Dumazet
2024-04-15 13:20 ` [PATCH net-next 09/14] net_sched: sch_fq_codel: implement lockless fq_codel_dump() Eric Dumazet
2024-04-17 17:07 ` Simon Horman
2024-04-17 17:14 ` Eric Dumazet
2024-04-17 17:22 ` Simon Horman
2024-04-15 13:20 ` [PATCH net-next 10/14] net_sched: sch_fq_pie: implement lockless fq_pie_dump() Eric Dumazet
2024-04-17 17:13 ` Simon Horman
2024-04-17 17:15 ` Eric Dumazet
2024-04-17 17:23 ` Simon Horman
2024-04-15 13:20 ` [PATCH net-next 11/14] net_sched: sch_hfsc: implement lockless accesses to q->defcls Eric Dumazet
2024-04-15 13:20 ` [PATCH net-next 12/14] net_sched: sch_hhf: implement lockless hhf_dump() Eric Dumazet
2024-04-17 17:26 ` Simon Horman
2024-04-15 13:20 ` [PATCH net-next 13/14] net_sched: sch_pie: implement lockless pie_dump() Eric Dumazet
2024-04-17 17:28 ` Simon Horman
2024-04-15 13:20 ` [PATCH net-next 14/14] net_sched: sch_skbprio: implement lockless skbprio_dump() Eric Dumazet
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=20240417090046.GB3846178@kernel.org \
--to=horms@kernel.org \
--cc=davem@davemloft.net \
--cc=edumazet@google.com \
--cc=eric.dumazet@gmail.com \
--cc=jhs@mojatatu.com \
--cc=jiri@resnulli.us \
--cc=kuba@kernel.org \
--cc=netdev@vger.kernel.org \
--cc=pabeni@redhat.com \
--cc=xiyou.wangcong@gmail.com \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).