From: Jakub Kicinski <kuba@kernel.org>
To: Kuniyuki Iwashima <kuniyu@google.com>
Cc: edumazet@google.com, fw@strlen.de, netdev@vger.kernel.org,
pabeni@redhat.com, willemb@google.com
Subject: Re: deadlocks on pernet_ops_rwsem
Date: Wed, 1 Oct 2025 12:26:18 -0700
Message-ID: <20251001122618.4cf31f3b@kernel.org>
In-Reply-To: <20251001185310.33321-1-kuniyu@google.com>
On Wed, 1 Oct 2025 18:50:22 +0000 Kuniyuki Iwashima wrote:
> From: Jakub Kicinski <kuba@kernel.org>
> Date: Wed, 1 Oct 2025 10:22:23 -0700
> > To be clear -- AFAICT lockdep misses this.
> >
> > The splat is from the "stuck task" checker.
> >
> > 2 min wait to load a module during test init would definitely be a sign
> > of something going sideways.. but I think it's worse than that, these
> > time out completely and we kill the VM. I think the modprobe is truly
> > stuck here.
> >
> > In one of the splats lockdep was able to say:
> >
> > [ 4302.448228][ T44] INFO: task modprobe:31634 <writer> blocked on an rw-semaphore likely owned by task kworker/u16:0:12 <reader>
> >
> > but most are more useless:
> >
> > [ 4671.090728][ T44] INFO: task modprobe:2342 is blocked on an rw-semaphore, but the owner is not found.
> >
> > (?!?)
>
> Even when it caught the possible owner, lockdep seems confused :/
>
>
> [ 4302.448228][ T44] INFO: task modprobe:31634 <writer> blocked on an rw-semaphore likely owned by task kworker/u16:0:12 <reader>
>
> modprobe:31634 seems to be blocked by kworker/u16:0:12,
>
>
> [ 4302.449035][ T44] task:kworker/u16:0 state:R running task stack:26368 pid:12 tgid:12 ppid:2 task_flags:0x4208060 flags:0x00004000
> [ 4302.449872][ T44] Workqueue: netns cleanup_net
> ...
> [ 4302.460889][ T44] Showing all locks held in the system:
> [ 4302.461368][ T44] 4 locks held by kworker/u16:0/12:
>
> but no lock shows up here for kworker/u16:0/12,
>
>
> [ 4302.461597][ T44] 2 locks held by kworker/u18:0/36:
> [ 4302.461926][ T44] #0: ffff8880010d9d48 ((wq_completion)events_unbound){+.+.}-{0:0}, at: process_one_work+0x7e5/0x1650
> [ 4302.462429][ T44] #1: ffffc9000028fd40 ((work_completion)(&sub_info->work)){+.+.}-{0:0}, at: process_one_work+0xded/0x1650
> [ 4302.463011][ T44] 1 lock held by khungtaskd/44:
> [ 4302.463261][ T44] #0: ffffffffb7b83f80 (rcu_read_lock){....}-{1:3}, at: debug_show_all_locks+0x36/0x260
> [ 4302.463717][ T44] 1 lock held by modprobe/31634:
> [ 4302.463982][ T44] #0: ffffffffb8270430 (pernet_ops_rwsem){++++}-{4:4}, at: register_pernet_subsys+0x1a/0x40
>
> and modprobe/31634 is holding pernet_ops_rwsem ???
>
>
> Was there any update on packages (especially qemu?) used by
> CI around 2025-09-18 ?
No updates according to the logs. The first hit was on Thursday, so I thought
maybe it came from Linus. But looking at the branches, we fast-forwarded in
2025-09-18--21-00, and there were already 2 hits earlier that day
(2025-09-18--03-00, 2025-09-18--15-00).
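
For cross-checking splats like the one quoted above, here is a rough sketch
of a script that compares the "N locks held by" headers against the lock
lines that were actually printed, and records who the hung-task checker
thinks the owner is. It is only an illustration: the names LOG, parse() and
oddities() are made up for this example, the regexes are guesses based on the
lines in this thread (timestamps trimmed), and nothing here is part of the
kernel or the CI tooling.

```python
import re

# Sample lines copied from the splat quoted in this thread (timestamps trimmed).
LOG = """\
INFO: task modprobe:31634 <writer> blocked on an rw-semaphore likely owned by task kworker/u16:0:12 <reader>
4 locks held by kworker/u16:0/12:
2 locks held by kworker/u18:0/36:
 #0: ffff8880010d9d48 ((wq_completion)events_unbound){+.+.}-{0:0}, at: process_one_work+0x7e5/0x1650
 #1: ffffc9000028fd40 ((work_completion)(&sub_info->work)){+.+.}-{0:0}, at: process_one_work+0xded/0x1650
1 lock held by khungtaskd/44:
 #0: ffffffffb7b83f80 (rcu_read_lock){....}-{1:3}, at: debug_show_all_locks+0x36/0x260
1 lock held by modprobe/31634:
 #0: ffffffffb8270430 (pernet_ops_rwsem){++++}-{4:4}, at: register_pernet_subsys+0x1a/0x40
"""

# Assumed line shapes, inferred only from the output above.
HELD = re.compile(r"(\d+) locks? held by (\S+)/(\d+):")
LOCK = re.compile(r"#\d+: [0-9a-f]+ \((.+)\)\{")
BLOCKED = re.compile(
    r"task (\S+):(\d+) <\w+> blocked on an rw-semaphore "
    r"likely owned by task (\S+):(\d+) <\w+>"
)


def parse(log):
    """Return ({pid: (declared_count, [lock names printed])}, [(waiter, owner)])."""
    held = {}
    blocked = []
    current = None  # pid of the most recent "locks held by" header
    for line in log.splitlines():
        if m := BLOCKED.search(line):
            blocked.append((int(m.group(2)), int(m.group(4))))
        elif m := HELD.search(line):
            current = int(m.group(3))
            held[current] = (int(m.group(1)), [])
        elif (m := LOCK.search(line)) and current is not None:
            held[current][1].append(m.group(1))
    return held, blocked


def oddities(held, blocked):
    """List the spots where the dump does not add up."""
    out = []
    for pid, (declared, names) in held.items():
        if declared != len(names):
            out.append(f"pid {pid}: {declared} locks declared, {len(names)} listed")
    for waiter, owner in blocked:
        if held.get(waiter, (0, []))[1]:
            out.append(f"pid {waiter}: reported as blocked but listed with locks")
        if not held.get(owner, (0, []))[1]:
            out.append(f"pid {owner}: reported as owner but no locks listed")
    return out


if __name__ == "__main__":
    for msg in oddities(*parse(LOG)):
        print(msg)
```

The script only flags mismatches in the text of the dump; it does not try to
explain them.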
Thread overview: 7+ messages
2025-10-01 15:20 deadlocks on pernet_ops_rwsem Jakub Kicinski
2025-10-01 16:46 ` Paolo Abeni
2025-10-01 17:22 ` Jakub Kicinski
2025-10-01 18:50 ` Kuniyuki Iwashima
2025-10-01 19:26 ` Jakub Kicinski [this message]
2025-10-01 19:33 ` Willem de Bruijn
2025-10-01 20:30 ` Jakub Kicinski