From: james.morse@arm.com (James Morse)
To: linux-arm-kernel@lists.infradead.org
Subject: kdump: need help with kexec -p
Date: Thu, 12 Oct 2017 12:40:35 +0100 [thread overview]
Message-ID: <59DF54B3.1050404@arm.com> (raw)
In-Reply-To: <HE1PR04MB12416F54193B1A61BC0652ED974A0@HE1PR04MB1241.eurprd04.prod.outlook.com>
Hi Prabhakar,
(+CC: Akashi Takahiro, who wrote the arm64 kdump support)
On 11/10/17 10:11, Prabhakar Kushwaha wrote:
> We are facing some issues while using kexec -p on ARM64 NXP platforms.
>
> 1) After calling kexec -p, if immediately "panic" is triggered the crash kernel
> does not boot. If we run few commands and wait for atleast (20-30 secs), before
> triggering the panic, the crash kernel boots.
What kernel version do you see this on? Can you log the kernel output in each
case, (do you get a 'bye' message even when the new kernel doesn't boot).
Does 'kexec -p' report success in both cases? ($? == 0)
kdump can take many seconds in purgatory, it checksums the kdump image to check
it didn't get corrupted between 'kexec -p' and crash time, but it doesn't sound
like this is what you're seeing.
> 2) We do not see the issue ("1" ), when we do umount -a, before calling the panic
> after kexec-p.
What filesystems (ext4, nfs etc) do you have mounted, and which ones does
'umount -a' get rid of?
Where are these filesystems stored?
How many CPUs does your platform have?
(...does crashing on a different CPU change the behaviour?)
> taskset -c 1 bash -c "echo c > /proc/sysrq-trigger"
> The issue does not seem to pertain to the NXP software it seems. (because this
> observation has been observed on very simple kernel, where most of the
> controllers have been removed from device tree).
> Also found some info related to this on internet where it is mentioned that
> without un-mounting the mounted filesystems, the boot of next kernel is not
> recommended. (this is in context of kexec -e though)
> https://www.linux.com/news/reboot-racecar-kexec.
This is because the filesystem is marked as mounted on-disk, and there may be
vital data you've written but hasn't made it to the disk yet.
For 'kexec -e' I think it tries to shutdown and reboot, then jumps to the new
kernel instead of calling the firmware. This means all filesystems should be
sync()d, umounted or at least remounted read-only.
For kdump, we've already crashed, so you've already lost data. Its a best effort
can we get to a point where you can debug the original crash.
Thanks,
James
next prev parent reply other threads:[~2017-10-12 11:40 UTC|newest]
Thread overview: 8+ messages / expand[flat|nested] mbox.gz Atom feed top
2017-10-11 9:11 kdump: need help with kexec -p Prabhakar Kushwaha
2017-10-12 11:40 ` James Morse [this message]
2017-10-13 8:36 ` AKASHI Takahiro
2017-10-13 9:41 ` Prabhakar Kushwaha
2017-10-13 10:30 ` James Morse
2017-10-13 11:23 ` Prabhakar Kushwaha
2017-10-23 5:37 ` Prabhakar Kushwaha
2017-10-23 9:33 ` James Morse
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=59DF54B3.1050404@arm.com \
--to=james.morse@arm.com \
--cc=linux-arm-kernel@lists.infradead.org \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.