From: Leon Romanovsky <leon@kernel.org>
To: Petr Mladek <pmladek@suse.com>
Cc: Itay Aveksis <itayav@nvidia.com>,
	John Ogness <john.ogness@linutronix.de>,
	"Michael S. Tsirkin" <mst@redhat.com>,
	Amit Shah <amit@kernel.org>, Steven Rostedt <rostedt@goodmis.org>,
	virtualization@lists.linux-foundation.org,
	Sergey Senozhatsky <sergey.senozhatsky@gmail.com>,
	Ran Rozenstein <ranro@nvidia.com>
Subject: Re: netconsole deadlock with virtnet
Date: Sun, 22 Nov 2020 10:41:52 +0200
Message-ID: <20201122084152.GB3159@unreal>
In-Reply-To: <X7ZrWZ0UJoh+vQXm@alley>

On Thu, Nov 19, 2020 at 01:55:53PM +0100, Petr Mladek wrote:
> On Tue 2020-11-17 09:33:25, Steven Rostedt wrote:
> > On Tue, 17 Nov 2020 12:23:41 +0200
> > Leon Romanovsky <leon@kernel.org> wrote:
> >
> > > Hi,
> > >
> > > Approximately two weeks ago, our regression team started to experience these
> > > netconsole splats. The tested code is Linus's master (-rc4) + netdev net-next
> > > + netdev net-rc.
> > >
> > > Such splats are random and we can't bisect because there is no stable reproducer.
> > >
> > > Any idea what the root cause is?
> > >
> > > [   21.149739]                       __do_sys_finit_module+0xbc/0x12c
> > > [   21.149740]                       __arm64_sys_finit_module+0x28/0x34
> > > [   21.149741]                       el0_svc_common.constprop.0+0x84/0x200
> > > [   21.149742]                       do_el0_svc+0x2c/0x90
> > > [   21.149743]                       el0_svc+0x18/0x50
> > > [   21.149744]                       el0_sync_handler+0xe0/0x350
> > > [   21.149745]                       el0_sync+0x158/0x180
> > > [   21.149746]  }
> > > [   21.149747]  ... key      at: [<ffff8000093d4018>] target_list_lock+0x18/0xfffffffffffff000 [netconsole]
> > > [   21.149748]  ..
> > > [   21.149750] Lost 190 message(s)!
> >
> > It really sucks that we lose 190 messages that would help to decipher this
> > more. :-p
>
> The message comes from the printk_safe code. The size can be
> increased by CONFIG_PRINTK_SAFE_LOG_BUF_SHIFT.
>
> > Because I'm not sure where the xmit_lock is taken while holding the
> > target_list_lock. But the above does show that printk() calls write_msg()
> > while holding the console_lock, and write_msg() takes the target_list_lock.
> >
> > Thus, the fix would either require disabling interrupts every time the
> > xmit_lock is taken, or keeping it from being taken while holding the
> > target_list_lock.
>
> It seems that the missing messages might help to find the root of
> the problem.

Sorry for not being very responsive, I was in an internet-free zone :).

I'll increase CONFIG_PRINTK_SAFE_LOG_BUF_SHIFT from 13 to 26, let's
see what the nightly run gives us.
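
For a sense of scale: the option is a power-of-two shift for a per-CPU
buffer, so going from 13 to 26 is a large jump. A rough sketch of the
arithmetic, assuming the size is simply 1 << shift bytes (the helper
name below is hypothetical, not a kernel symbol):

```python
def printk_safe_buf_bytes(shift: int) -> int:
    """Size in bytes of a buffer configured as a power-of-two shift.

    Assumption: the buffer size is exactly 1 << shift bytes.
    """
    return 1 << shift


if __name__ == "__main__":
    # Compare the old shift (13) with the proposed one (26).
    for shift in (13, 26):
        size = printk_safe_buf_bytes(shift)
        print(f"shift={shift}: {size} bytes ({size // 1024} KiB)")
```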

Thanks

>
> Best Regards,
> Petr

