From: Corey Minyard <minyard@acm.org>
To: "Janne Huttunen (Nokia)" <janne.huttunen@nokia.com>
Cc: "stable@vger.kernel.org" <stable@vger.kernel.org>,
"gregkh@linuxfoundation.org" <gregkh@linuxfoundation.org>
Subject: Re: IPMI related kernel panics since v4.19.286
Date: Tue, 20 Jun 2023 06:54:48 -0500 [thread overview]
Message-ID: <ZJGTiNR0putMfSHu@mail.minyard.net> (raw)
In-Reply-To: <dab0ac65c452ba3af6837b268a1d5d9d280360eb.camel@nokia.com>
On Tue, Jun 20, 2023 at 09:56:50AM +0000, Janne Huttunen (Nokia) wrote:
>
> > It looks like
> >
> > b4a34aa6d "ipmi: Fix how the lower layers are told to watch for
> > messages"
> >
> > was backported to fullfill a dependency for another backport, but
> > there was another change:
> >
> > e1891cffd4c4 "ipmi: Make the smi watcher be disabled immediately
> > when not needed"
> >
> > That is needed to avoid calling a lower layer function with
> > xmit_msgs_lock held. It doesn't apply completely cleanly because of
> > other changes, but you just need to leave in the free_user_work()
> > function and delete the other function in the conflict. In addition
> > to that, you will also need:
> >
> > 383035211c79 "ipmi: move message error checking to avoid deadlock"
> >
> > to fix a bug in that change.
> >
> > Can you try this out?
>
> Yes, sorry for the delay, had a bit of technical problems testing
> your proposed patches. In the meantime we found out that over
> a dozen of our test servers have had the same crash, some of them
> multiple times since the kernel update.
I don't consider this a delay, it was quite speedy.
>
> Anyways, with your proposed patches on top of 4.19.286, I couldn't
> trigger the lockdep warning anymore even in a server that without
> the fixes triggers it very reliably right after the boot. I also
> saw in another very similar server (without the fixes) that it
> took almost 17 hours to get even the lockdep warning. Maybe some
> specific BMC behavior affects this or something? Sadly, that kind
> of diminishes the value of the short duration tests, but at least
> there has so far been zero lockdep warnings with the fixes applied.
> The actual lockups are then way too unpredictable to test reliably
> in any kind of short time frame.
It does depend on what you are doing to the driver, but it sounds like
you are running the same software everywhere. I'm not sure; I've seen
timing do strange things before.
>
> Anyways, looking at e1891cffd4c4, it's right there where the issue
> seems to originate from, so it makes total sense to me that it does
> fix it. I was already kind of looking at it when you confirmed it.
> Thanks for pointing out also the 383035211c79 patch, it might have
> been easily missed.
>
Ok, thank you for testing. I'll prepare a stable kernel request.
-corey
prev parent reply other threads:[~2023-06-20 11:54 UTC|newest]
Thread overview: 6+ messages / expand[flat|nested] mbox.gz Atom feed top
2023-06-19 11:11 IPMI related kernel panics since v4.19.286 Janne Huttunen (Nokia)
2023-06-19 12:23 ` Greg KH
2023-06-19 13:32 ` Corey Minyard
2023-06-19 12:52 ` Corey Minyard
2023-06-20 9:56 ` Janne Huttunen (Nokia)
2023-06-20 11:54 ` Corey Minyard [this message]
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=ZJGTiNR0putMfSHu@mail.minyard.net \
--to=minyard@acm.org \
--cc=gregkh@linuxfoundation.org \
--cc=janne.huttunen@nokia.com \
--cc=stable@vger.kernel.org \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).