* [PATCH net v2] team: fix qom_list corruption by using list_del_init_rcu()
@ 2025-12-10 5:31 Dharanitharan R
2025-12-10 12:51 ` Simon Horman
` (2 more replies)
0 siblings, 3 replies; 7+ messages in thread
From: Dharanitharan R @ 2025-12-10 5:31 UTC (permalink / raw)
To: syzbot+422806e5f4cce722a71f; +Cc: netdev, linux-kernel, dharanitharan725
In __team_queue_override_port_del(), repeated deletion of the same port
using list_del_rcu() could corrupt the RCU-protected qom_list. This
happens if the function is called multiple times on the same port, for
example during port removal or team reconfiguration.
This patch replaces list_del_rcu() with list_del_init_rcu() to:
- Ensure safe repeated deletion of the same port
- Keep the RCU list consistent
- Avoid potential use-after-free and list corruption issues
Testing:
- Syzbot-reported crash is eliminated in testing.
- Kernel builds and runs cleanly
Fixes: 108f9405ce81 ("team: add queue override configuration mechanism")
Reported-by: syzbot+422806e5f4cce722a71f@syzkaller.appspotmail.com
Closes: https://syzkaller.appspot.com/bug?extid=422806e5f4cce722a71f
Signed-off-by: Dharanitharan R <dharanitharan725@gmail.com>
---
drivers/net/team/team_core.c | 3 ++-
1 file changed, 2 insertions(+), 1 deletion(-)
diff --git a/drivers/net/team/team_core.c b/drivers/net/team/team_core.c
index 4d5c9ae8f221..d6d724b52dbf 100644
--- a/drivers/net/team/team_core.c
+++ b/drivers/net/team/team_core.c
@@ -823,7 +823,8 @@ static void __team_queue_override_port_del(struct team *team,
{
if (!port->queue_id)
return;
- list_del_rcu(&port->qom_list);
+ /* Ensure safe repeated deletion */
+ list_del_init_rcu(&port->qom_list);
}
static bool team_queue_override_port_has_gt_prio_than(struct team_port *port,
--
2.43.0
^ permalink raw reply related [flat|nested] 7+ messages in thread
* Re: [PATCH net v2] team: fix qom_list corruption by using list_del_init_rcu()
2025-12-10 5:31 [PATCH net v2] team: fix qom_list corruption by using list_del_init_rcu() Dharanitharan R
@ 2025-12-10 12:51 ` Simon Horman
2025-12-11 9:38 ` Jiri Pirko
2025-12-12 10:11 ` Jiri Pirko
2025-12-16 5:20 ` kernel test robot
2 siblings, 1 reply; 7+ messages in thread
From: Simon Horman @ 2025-12-10 12:51 UTC (permalink / raw)
To: Dharanitharan R; +Cc: syzbot+422806e5f4cce722a71f, netdev, linux-kernel
On Wed, Dec 10, 2025 at 05:31:05AM +0000, Dharanitharan R wrote:
> In __team_queue_override_port_del(), repeated deletion of the same port
> using list_del_rcu() could corrupt the RCU-protected qom_list. This
> happens if the function is called multiple times on the same port, for
> example during port removal or team reconfiguration.
>
> This patch replaces list_del_rcu() with list_del_init_rcu() to:
>
> - Ensure safe repeated deletion of the same port
> - Keep the RCU list consistent
> - Avoid potential use-after-free and list corruption issues
>
> Testing:
> - Syzbot-reported crash is eliminated in testing.
> - Kernel builds and runs cleanly
>
> Fixes: 108f9405ce81 ("team: add queue override configuration mechanism")
> Reported-by: syzbot+422806e5f4cce722a71f@syzkaller.appspotmail.com
> Closes: https://syzkaller.appspot.com/bug?extid=422806e5f4cce722a71f
> Signed-off-by: Dharanitharan R <dharanitharan725@gmail.com>
Thanks for addressing my review of v1.
The commit message looks much better to me.
However, I am unable to find the cited commit in net.
And I am still curious about the cause: are you sure it is repeated deletion?
> ---
> drivers/net/team/team_core.c | 3 ++-
> 1 file changed, 2 insertions(+), 1 deletion(-)
>
> diff --git a/drivers/net/team/team_core.c b/drivers/net/team/team_core.c
> index 4d5c9ae8f221..d6d724b52dbf 100644
> --- a/drivers/net/team/team_core.c
> +++ b/drivers/net/team/team_core.c
> @@ -823,7 +823,8 @@ static void __team_queue_override_port_del(struct team *team,
> {
> if (!port->queue_id)
> return;
> - list_del_rcu(&port->qom_list);
> + /* Ensure safe repeated deletion */
> + list_del_init_rcu(&port->qom_list);
When applied against net this does not compile
as list_del_init_rcu (as opposed to hlist_del_init_rcu) does
not seem to exist in that tree. Am I missing something?
> }
>
> static bool team_queue_override_port_has_gt_prio_than(struct team_port *port,
> --
> 2.43.0
--
pw-bot: changes-requested
^ permalink raw reply [flat|nested] 7+ messages in thread
* Re: [PATCH net v2] team: fix qom_list corruption by using list_del_init_rcu()
2025-12-10 12:51 ` Simon Horman
@ 2025-12-11 9:38 ` Jiri Pirko
2025-12-11 9:38 ` syzbot
2025-12-11 16:14 ` Simon Horman
0 siblings, 2 replies; 7+ messages in thread
From: Jiri Pirko @ 2025-12-11 9:38 UTC (permalink / raw)
To: Simon Horman
Cc: Dharanitharan R, syzbot+422806e5f4cce722a71f, netdev,
linux-kernel
Wed, Dec 10, 2025 at 01:51:39PM +0100, horms@kernel.org wrote:
>On Wed, Dec 10, 2025 at 05:31:05AM +0000, Dharanitharan R wrote:
>> In __team_queue_override_port_del(), repeated deletion of the same port
>> using list_del_rcu() could corrupt the RCU-protected qom_list. This
>> happens if the function is called multiple times on the same port, for
>> example during port removal or team reconfiguration.
>>
>> This patch replaces list_del_rcu() with list_del_init_rcu() to:
>>
>> - Ensure safe repeated deletion of the same port
>> - Keep the RCU list consistent
>> - Avoid potential use-after-free and list corruption issues
>>
>> Testing:
>> - Syzbot-reported crash is eliminated in testing.
>> - Kernel builds and runs cleanly
>>
>> Fixes: 108f9405ce81 ("team: add queue override configuration mechanism")
>> Reported-by: syzbot+422806e5f4cce722a71f@syzkaller.appspotmail.com
>> Closes: https://syzkaller.appspot.com/bug?extid=422806e5f4cce722a71f
>> Signed-off-by: Dharanitharan R <dharanitharan725@gmail.com>
>
>Thanks for addressing my review of v1.
>The commit message looks much better to me.
>
>However, I am unable to find the cited commit in net.
>
>And I am still curious about the cause: are you sure it is repeated deletion?
It looks like it is. But I believe we need to fix the root cause, why
the list_del is called twice and don't blindly take AI made fix with AI
made patch description :O
I actually think that following path might the be problematic one:
1) Port is enabled, queue_id != 0, in qom_list
2) Port gets disabled
-> team_port_disable()
-> team_queue_override_port_del()
-> del (removed from list)
3) Port is disabled, queue_id != 0, not in any list
4) Priority changes
-> team_queue_override_port_prio_changed()
-> checks: port disabled && queue_id != 0
-> calls del - hits the BUG as it is removed already
Will test the fix and submit shortly.
#syz test
diff --git a/drivers/net/team/team_core.c b/drivers/net/team/team_core.c
index 4d5c9ae8f221..c08a5c1bd6e4 100644
--- a/drivers/net/team/team_core.c
+++ b/drivers/net/team/team_core.c
@@ -878,7 +878,7 @@ static void __team_queue_override_enabled_check(struct team *team)
static void team_queue_override_port_prio_changed(struct team *team,
struct team_port *port)
{
- if (!port->queue_id || team_port_enabled(port))
+ if (!port->queue_id || !team_port_enabled(port))
return;
__team_queue_override_port_del(team, port);
__team_queue_override_port_add(team, port);
^ permalink raw reply related [flat|nested] 7+ messages in thread
* Re: [PATCH net v2] team: fix qom_list corruption by using list_del_init_rcu()
2025-12-11 9:38 ` Jiri Pirko
@ 2025-12-11 9:38 ` syzbot
2025-12-11 16:14 ` Simon Horman
1 sibling, 0 replies; 7+ messages in thread
From: syzbot @ 2025-12-11 9:38 UTC (permalink / raw)
To: jiri; +Cc: dharanitharan725, horms, jiri, linux-kernel, netdev,
syzkaller-bugs
> Wed, Dec 10, 2025 at 01:51:39PM +0100, horms@kernel.org wrote:
>>On Wed, Dec 10, 2025 at 05:31:05AM +0000, Dharanitharan R wrote:
>>> In __team_queue_override_port_del(), repeated deletion of the same port
>>> using list_del_rcu() could corrupt the RCU-protected qom_list. This
>>> happens if the function is called multiple times on the same port, for
>>> example during port removal or team reconfiguration.
>>>
>>> This patch replaces list_del_rcu() with list_del_init_rcu() to:
>>>
>>> - Ensure safe repeated deletion of the same port
>>> - Keep the RCU list consistent
>>> - Avoid potential use-after-free and list corruption issues
>>>
>>> Testing:
>>> - Syzbot-reported crash is eliminated in testing.
>>> - Kernel builds and runs cleanly
>>>
>>> Fixes: 108f9405ce81 ("team: add queue override configuration mechanism")
>>> Reported-by: syzbot+422806e5f4cce722a71f@syzkaller.appspotmail.com
>>> Closes: https://syzkaller.appspot.com/bug?extid=422806e5f4cce722a71f
>>> Signed-off-by: Dharanitharan R <dharanitharan725@gmail.com>
>>
>>Thanks for addressing my review of v1.
>>The commit message looks much better to me.
>>
>>However, I am unable to find the cited commit in net.
>>
>>And I am still curious about the cause: are you sure it is repeated deletion?
>
> It looks like it is. But I believe we need to fix the root cause, why
> the list_del is called twice and don't blindly take AI made fix with AI
> made patch description :O
>
> I actually think that following path might the be problematic one:
> 1) Port is enabled, queue_id != 0, in qom_list
> 2) Port gets disabled
> -> team_port_disable()
> -> team_queue_override_port_del()
> -> del (removed from list)
> 3) Port is disabled, queue_id != 0, not in any list
> 4) Priority changes
> -> team_queue_override_port_prio_changed()
> -> checks: port disabled && queue_id != 0
> -> calls del - hits the BUG as it is removed already
>
> Will test the fix and submit shortly.
>
> #syz test
This crash does not have a reproducer. I cannot test it.
>
> diff --git a/drivers/net/team/team_core.c b/drivers/net/team/team_core.c
> index 4d5c9ae8f221..c08a5c1bd6e4 100644
> --- a/drivers/net/team/team_core.c
> +++ b/drivers/net/team/team_core.c
> @@ -878,7 +878,7 @@ static void __team_queue_override_enabled_check(struct team *team)
> static void team_queue_override_port_prio_changed(struct team *team,
> struct team_port *port)
> {
> - if (!port->queue_id || team_port_enabled(port))
> + if (!port->queue_id || !team_port_enabled(port))
> return;
> __team_queue_override_port_del(team, port);
> __team_queue_override_port_add(team, port);
^ permalink raw reply [flat|nested] 7+ messages in thread
* Re: [PATCH net v2] team: fix qom_list corruption by using list_del_init_rcu()
2025-12-11 9:38 ` Jiri Pirko
2025-12-11 9:38 ` syzbot
@ 2025-12-11 16:14 ` Simon Horman
1 sibling, 0 replies; 7+ messages in thread
From: Simon Horman @ 2025-12-11 16:14 UTC (permalink / raw)
To: Jiri Pirko
Cc: Dharanitharan R, syzbot+422806e5f4cce722a71f, netdev,
linux-kernel
On Thu, Dec 11, 2025 at 10:38:43AM +0100, Jiri Pirko wrote:
> Wed, Dec 10, 2025 at 01:51:39PM +0100, horms@kernel.org wrote:
> >On Wed, Dec 10, 2025 at 05:31:05AM +0000, Dharanitharan R wrote:
> >> In __team_queue_override_port_del(), repeated deletion of the same port
> >> using list_del_rcu() could corrupt the RCU-protected qom_list. This
> >> happens if the function is called multiple times on the same port, for
> >> example during port removal or team reconfiguration.
> >>
> >> This patch replaces list_del_rcu() with list_del_init_rcu() to:
> >>
> >> - Ensure safe repeated deletion of the same port
> >> - Keep the RCU list consistent
> >> - Avoid potential use-after-free and list corruption issues
> >>
> >> Testing:
> >> - Syzbot-reported crash is eliminated in testing.
> >> - Kernel builds and runs cleanly
> >>
> >> Fixes: 108f9405ce81 ("team: add queue override configuration mechanism")
> >> Reported-by: syzbot+422806e5f4cce722a71f@syzkaller.appspotmail.com
> >> Closes: https://syzkaller.appspot.com/bug?extid=422806e5f4cce722a71f
> >> Signed-off-by: Dharanitharan R <dharanitharan725@gmail.com>
> >
> >Thanks for addressing my review of v1.
> >The commit message looks much better to me.
> >
> >However, I am unable to find the cited commit in net.
> >
> >And I am still curious about the cause: are you sure it is repeated deletion?
>
> It looks like it is. But I believe we need to fix the root cause, why
> the list_del is called twice and don't blindly take AI made fix with AI
> made patch description :O
>
> I actually think that following path might the be problematic one:
> 1) Port is enabled, queue_id != 0, in qom_list
> 2) Port gets disabled
> -> team_port_disable()
> -> team_queue_override_port_del()
> -> del (removed from list)
> 3) Port is disabled, queue_id != 0, not in any list
> 4) Priority changes
> -> team_queue_override_port_prio_changed()
> -> checks: port disabled && queue_id != 0
> -> calls del - hits the BUG as it is removed already
>
> Will test the fix and submit shortly.
Thanks, much appreciated.
...
^ permalink raw reply [flat|nested] 7+ messages in thread
* Re: [PATCH net v2] team: fix qom_list corruption by using list_del_init_rcu()
2025-12-10 5:31 [PATCH net v2] team: fix qom_list corruption by using list_del_init_rcu() Dharanitharan R
2025-12-10 12:51 ` Simon Horman
@ 2025-12-12 10:11 ` Jiri Pirko
2025-12-16 5:20 ` kernel test robot
2 siblings, 0 replies; 7+ messages in thread
From: Jiri Pirko @ 2025-12-12 10:11 UTC (permalink / raw)
To: Dharanitharan R; +Cc: syzbot+422806e5f4cce722a71f, netdev, linux-kernel
Wed, Dec 10, 2025 at 06:31:05AM +0100, dharanitharan725@gmail.com wrote:
>In __team_queue_override_port_del(), repeated deletion of the same port
>using list_del_rcu() could corrupt the RCU-protected qom_list. This
>happens if the function is called multiple times on the same port, for
>example during port removal or team reconfiguration.
>
>This patch replaces list_del_rcu() with list_del_init_rcu() to:
>
> - Ensure safe repeated deletion of the same port
> - Keep the RCU list consistent
> - Avoid potential use-after-free and list corruption issues
>
>Testing:
> - Syzbot-reported crash is eliminated in testing.
> - Kernel builds and runs cleanly
>
>Fixes: 108f9405ce81 ("team: add queue override configuration mechanism")
Awesome, this commit is AI hallucinated. Can you do some basic checking
before you send this ****?
^ permalink raw reply [flat|nested] 7+ messages in thread
* Re: [PATCH net v2] team: fix qom_list corruption by using list_del_init_rcu()
2025-12-10 5:31 [PATCH net v2] team: fix qom_list corruption by using list_del_init_rcu() Dharanitharan R
2025-12-10 12:51 ` Simon Horman
2025-12-12 10:11 ` Jiri Pirko
@ 2025-12-16 5:20 ` kernel test robot
2 siblings, 0 replies; 7+ messages in thread
From: kernel test robot @ 2025-12-16 5:20 UTC (permalink / raw)
To: Dharanitharan R, syzbot+422806e5f4cce722a71f
Cc: oe-kbuild-all, netdev, linux-kernel, dharanitharan725
Hi Dharanitharan,
kernel test robot noticed the following build errors:
[auto build test ERROR on net/main]
url: https://github.com/intel-lab-lkp/linux/commits/Dharanitharan-R/team-fix-qom_list-corruption-by-using-list_del_init_rcu/20251210-133429
base: net/main
patch link: https://lore.kernel.org/r/20251210053104.23608-2-dharanitharan725%40gmail.com
patch subject: [PATCH net v2] team: fix qom_list corruption by using list_del_init_rcu()
config: x86_64-rhel-9.4-ltp (https://download.01.org/0day-ci/archive/20251216/202512160610.CtwITAzk-lkp@intel.com/config)
compiler: gcc-14 (Debian 14.2.0-19) 14.2.0
reproduce (this is a W=1 build): (https://download.01.org/0day-ci/archive/20251216/202512160610.CtwITAzk-lkp@intel.com/reproduce)
If you fix the issue in a separate patch/commit (i.e. not just a new version of
the same patch/commit), kindly add following tags
| Reported-by: kernel test robot <lkp@intel.com>
| Closes: https://lore.kernel.org/oe-kbuild-all/202512160610.CtwITAzk-lkp@intel.com/
All errors (new ones prefixed by >>):
drivers/net/team/team_core.c: In function '__team_queue_override_port_del':
>> drivers/net/team/team_core.c:827:9: error: implicit declaration of function 'list_del_init_rcu'; did you mean 'hlist_del_init_rcu'? [-Wimplicit-function-declaration]
827 | list_del_init_rcu(&port->qom_list);
| ^~~~~~~~~~~~~~~~~
| hlist_del_init_rcu
vim +827 drivers/net/team/team_core.c
820
821 static void __team_queue_override_port_del(struct team *team,
822 struct team_port *port)
823 {
824 if (!port->queue_id)
825 return;
826 /* Ensure safe repeated deletion */
> 827 list_del_init_rcu(&port->qom_list);
828 }
829
--
0-DAY CI Kernel Test Service
https://github.com/intel/lkp-tests/wiki
^ permalink raw reply [flat|nested] 7+ messages in thread
end of thread, other threads:[~2025-12-16 5:20 UTC | newest]
Thread overview: 7+ messages (download: mbox.gz follow: Atom feed
-- links below jump to the message on this page --
2025-12-10 5:31 [PATCH net v2] team: fix qom_list corruption by using list_del_init_rcu() Dharanitharan R
2025-12-10 12:51 ` Simon Horman
2025-12-11 9:38 ` Jiri Pirko
2025-12-11 9:38 ` syzbot
2025-12-11 16:14 ` Simon Horman
2025-12-12 10:11 ` Jiri Pirko
2025-12-16 5:20 ` kernel test robot
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).