public inbox for netdev@vger.kernel.org
From: Petr Machata <petrm@nvidia.com>
To: Davide Caratti <dcaratti@redhat.com>
Cc: Jamal Hadi Salim <jhs@mojatatu.com>,
	Cong Wang <xiyou.wangcong@gmail.com>,
	Jiri Pirko <jiri@resnulli.us>,
	"David S. Miller" <davem@davemloft.net>,
	"Eric Dumazet" <edumazet@google.com>,
	Jakub Kicinski <kuba@kernel.org>,
	"Paolo Abeni" <pabeni@redhat.com>,
	Simon Horman <horms@kernel.org>,
	Lion Ackermann <nnamrec@gmail.com>,
	Petr Machata <petrm@mellanox.com>, <netdev@vger.kernel.org>,
	Ivan Vecera <ivecera@redhat.com>, Li Shuang <shuali@redhat.com>
Subject: Re: [PATCH net] net/sched: ets: use old 'nbands' while purging unused classes
Date: Fri, 8 Aug 2025 13:44:59 +0200
Message-ID: <87ms8a9fuv.fsf@nvidia.com>
In-Reply-To: <f3b9bacc73145f265c19ab80785933da5b7cbdec.1754581577.git.dcaratti@redhat.com>


Davide Caratti <dcaratti@redhat.com> writes:

> Shuang reported sch_ets test-case [1] crashing in ets_class_qlen_notify()
> after recent changes from Lion [2]. The problem is: in ets_qdisc_change()
> we purge unused DWRR queues; the value of 'q->nbands' is the new one, and
> the cleanup should be done with the old one. The problem has been here since
> my first attempts to fix ets_qdisc_change(), but it surfaced again after the
> recent qdisc len accounting fixes. Fix it by purging idle DWRR queues before
> assigning a new value of 'q->nbands', so that all purge operations find a
> consistent configuration:

First let me just note: that is one sharp paragraph edge!

>  - old 'q->nbands' because it's needed by ets_class_find()

This comes from qdisc_purge_queue(), which calls
qdisc_tree_reduce_backlog(), which calls Qdisc_class_ops.find, i.e.
ets_class_find().

>  - old 'q->nstrict' because it's needed by ets_class_is_strict()

This comes from qdisc_purge_queue(), which calls
qdisc_tree_reduce_backlog(), which calls Qdisc_class_ops.qlen_notify,
i.e. ets_class_qlen_notify(), which in turn calls ets_class_is_strict().

Also, qdisc_purge_queue() calls qdisc_reset(), which calls
Qdisc_ops.reset, i.e. ets_qdisc_reset(), and that accesses both fields.
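
To illustrate why the ordering matters, here is a minimal userspace sketch.
It is not the kernel code: struct toy_sched, toy_class_find(),
toy_purge_band() and toy_change() are hypothetical stand-ins, and the only
assumption carried over is that the class lookup rejects any band beyond
the current 'nbands'. It models only the failed lookup, not the resulting
list corruption or NULL dereference.

```c
#include <assert.h>

#define TOY_MAX_BANDS 16

/* Hypothetical, heavily simplified stand-in for the qdisc private data. */
struct toy_sched {
	unsigned int nbands;          /* number of currently configured bands */
	int notified[TOY_MAX_BANDS];  /* 1 if the purge reached this band */
};

/* Toy class lookup: bands are 1-based here, 0 means "not found". */
static unsigned long toy_class_find(const struct toy_sched *q,
				    unsigned long band)
{
	if (band == 0 || band > q->nbands)
		return 0;
	return band;
}

/* Toy purge of band i (0-based): only works if the lookup succeeds. */
static int toy_purge_band(struct toy_sched *q, unsigned int i)
{
	assert(i < TOY_MAX_BANDS);
	if (!toy_class_find(q, i + 1))
		return 0;
	q->notified[i] = 1;
	return 1;
}

/*
 * Shrink the configuration to 'nbands' bands. With purge_first != 0 the
 * removed bands are purged while the old 'nbands' is still in place; with
 * purge_first == 0 the new value is assigned before the purge loop runs.
 * Returns how many removed bands the purge actually reached.
 */
static unsigned int toy_change(struct toy_sched *q, unsigned int nbands,
			       int purge_first)
{
	unsigned int oldbands = q->nbands;
	unsigned int i, found = 0;

	if (!purge_first)
		q->nbands = nbands;
	for (i = nbands; i < oldbands; i++)
		found += toy_purge_band(q, i);
	q->nbands = nbands;
	return found;
}
```

Starting from 8 bands and shrinking to 4, toy_change(&q, 4, 1) reaches all
four removed bands, whereas toy_change(&q, 4, 0) reaches none of them,
because every lookup is done against the already-reduced 'nbands'.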

>  BUG: kernel NULL pointer dereference, address: 0000000000000000
>  #PF: supervisor read access in kernel mode
>  #PF: error_code(0x0000) - not-present page
>  PGD 0 P4D 0
>  Oops: Oops: 0000 [#1] SMP NOPTI
>  CPU: 62 UID: 0 PID: 39457 Comm: tc Kdump: loaded Not tainted 6.12.0-116.el10.x86_64 #1 PREEMPT(voluntary)
>  Hardware name: Dell Inc. PowerEdge R640/06DKY5, BIOS 2.12.2 07/09/2021
>  RIP: 0010:__list_del_entry_valid_or_report+0x4/0x80
>  Code: ff 4c 39 c7 0f 84 39 19 8e ff b8 01 00 00 00 c3 cc cc cc cc 66 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 f3 0f 1e fa <48> 8b 17 48 8b 4f 08 48 85 d2 0f 84 56 19 8e ff 48 85 c9 0f 84 ab
>  RSP: 0018:ffffba186009f400 EFLAGS: 00010202
>  RAX: 00000000000000d6 RBX: 0000000000000000 RCX: 0000000000000004
>  RDX: ffff9f0fa29b69c0 RSI: 0000000000000000 RDI: 0000000000000000
>  RBP: ffffffffc12c2400 R08: 0000000000000008 R09: 0000000000000004
>  R10: ffffffffffffffff R11: 0000000000000004 R12: 0000000000000000
>  R13: ffff9f0f8cfe0000 R14: 0000000000100005 R15: 0000000000000000
>  FS:  00007f2154f37480(0000) GS:ffff9f269c1c0000(0000) knlGS:0000000000000000
>  CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
>  CR2: 0000000000000000 CR3: 00000001530be001 CR4: 00000000007726f0
>  DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000
>  DR3: 0000000000000000 DR6: 00000000fffe0ff0 DR7: 0000000000000400
>  PKRU: 55555554
>  Call Trace:
>   <TASK>
>   ets_class_qlen_notify+0x65/0x90 [sch_ets]
>   qdisc_tree_reduce_backlog+0x74/0x110
>   ets_qdisc_change+0x630/0xa40 [sch_ets]
>   __tc_modify_qdisc.constprop.0+0x216/0x7f0
>   tc_modify_qdisc+0x7c/0x120
>   rtnetlink_rcv_msg+0x145/0x3f0
>   netlink_rcv_skb+0x53/0x100
>   netlink_unicast+0x245/0x390
>   netlink_sendmsg+0x21b/0x470
>   ____sys_sendmsg+0x39d/0x3d0
>   ___sys_sendmsg+0x9a/0xe0
>   __sys_sendmsg+0x7a/0xd0
>   do_syscall_64+0x7d/0x160
>   entry_SYSCALL_64_after_hwframe+0x76/0x7e
>  RIP: 0033:0x7f2155114084
>  Code: 89 02 b8 ff ff ff ff eb bb 66 2e 0f 1f 84 00 00 00 00 00 0f 1f 00 f3 0f 1e fa 80 3d 25 f0 0c 00 00 74 13 b8 2e 00 00 00 0f 05 <48> 3d 00 f0 ff ff 77 54 c3 0f 1f 00 48 83 ec 28 89 54 24 1c 48 89
>  RSP: 002b:00007fff1fd7a988 EFLAGS: 00000202 ORIG_RAX: 000000000000002e
>  RAX: ffffffffffffffda RBX: 0000560ec063e5e0 RCX: 00007f2155114084
>  RDX: 0000000000000000 RSI: 00007fff1fd7a9f0 RDI: 0000000000000003
>  RBP: 00007fff1fd7aa60 R08: 0000000000000010 R09: 000000000000003f
>  R10: 0000560ee9b3a010 R11: 0000000000000202 R12: 00007fff1fd7aae0
>  R13: 000000006891ccde R14: 0000560ec063e5e0 R15: 00007fff1fd7aad0
>   </TASK>
>
>  [1] https://lore.kernel.org/netdev/e08c7f4a6882f260011909a868311c6e9b54f3e4.1639153474.git.dcaratti@redhat.com/
>  [2] https://lore.kernel.org/netdev/d912cbd7-193b-4269-9857-525bee8bbb6a@gmail.com/
>
> Fixes: 103406b38c60 ("net/sched: Always pass notifications when child class becomes empty")
> Fixes: c062f2a0b04d ("net/sched: sch_ets: don't remove idle classes from the round-robin list")
> Fixes: dcc68b4d8084 ("net: sch_ets: Add a new Qdisc")
> Reported-by: Li Shuang <shuali@redhat.com>
> Closes: https://issues.redhat.com/browse/RHEL-108026
> Co-developed-by: Ivan Vecera <ivecera@redhat.com>
> Signed-off-by: Ivan Vecera <ivecera@redhat.com>
> Signed-off-by: Davide Caratti <dcaratti@redhat.com>

Reviewed-by: Petr Machata <petrm@nvidia.com>

> ---
>  net/sched/sch_ets.c | 11 ++++++-----
>  1 file changed, 6 insertions(+), 5 deletions(-)
>
> diff --git a/net/sched/sch_ets.c b/net/sched/sch_ets.c
> index 037f764822b9..82635dd2cfa5 100644
> --- a/net/sched/sch_ets.c
> +++ b/net/sched/sch_ets.c
> @@ -651,6 +651,12 @@ static int ets_qdisc_change(struct Qdisc *sch, struct nlattr *opt,
>  
>  	sch_tree_lock(sch);
>  
> +	for (i = nbands; i < oldbands; i++) {
> +		if (i >= q->nstrict && q->classes[i].qdisc->q.qlen)
> +			list_del_init(&q->classes[i].alist);
> +		qdisc_purge_queue(q->classes[i].qdisc);
> +	}
> +
>  	WRITE_ONCE(q->nbands, nbands);
>  	for (i = nstrict; i < q->nstrict; i++) {
>  		if (q->classes[i].qdisc->q.qlen) {
> @@ -658,11 +664,6 @@ static int ets_qdisc_change(struct Qdisc *sch, struct nlattr *opt,
>  			q->classes[i].deficit = quanta[i];
>  		}
>  	}
> -	for (i = q->nbands; i < oldbands; i++) {
> -		if (i >= q->nstrict && q->classes[i].qdisc->q.qlen)
> -			list_del_init(&q->classes[i].alist);
> -		qdisc_purge_queue(q->classes[i].qdisc);
> -	}
>  	WRITE_ONCE(q->nstrict, nstrict);
>  	memcpy(q->prio2band, priomap, sizeof(priomap));



Thread overview: 10+ messages
2025-08-07 15:48 [PATCH net] net/sched: ets: use old 'nbands' while purging unused classes Davide Caratti
2025-08-08 11:44 ` Petr Machata [this message]
2025-08-08 18:15 ` Victor Nogueira
2025-08-11  7:49   ` Davide Caratti
2025-08-11  9:53     ` Victor Nogueira
2025-08-11 13:52       ` Victor Nogueira
2025-08-11 16:09         ` Davide Caratti
2025-08-11 17:35           ` Victor Nogueira
2025-08-12  1:26             ` Jakub Kicinski
2025-08-12 11:21             ` Davide Caratti
