From: Andrew Cooper <andrew.cooper3@citrix.com>
To: Jan Beulich <JBeulich@suse.com>
Cc: "xen-devel@lists.xensource.com" <xen-devel@lists.xensource.com>,
"Keir (Xen.org)" <keir@xen.org>
Subject: Re: [RFC] KEXEC: allocate crash note buffers at boot time
Date: Wed, 30 Nov 2011 14:01:55 +0000 [thread overview]
Message-ID: <4ED63753.4000007@citrix.com> (raw)
In-Reply-To: <4ED6037302000078000644A3@nat28.tlf.novell.com>
On 30/11/11 09:20, Jan Beulich wrote:
>>>> On 29.11.11 at 19:56, Andrew Cooper <andrew.cooper3@citrix.com> wrote:
>> As I have little to no knowledge of this stage of the boot process, is
>> this a sensible way to be setting up the per_cpu areas? I have a
>> sneaking suspicion that it will fall over if a CPU is onlined after
>> boot, and may also fall over if a CPU is offlined and reonlined later.
>> There appears to be no infrastructure currently in place for this type
>> of initialization, which is quite possibly why the code exists in its
>> current form.
> I actually wonder how you came to those 4 statements you make in
> the description - none of these seem to me like they are really an
> issue (this would instead be plain bugs in Dom0). Did you actually look
> at the existing Dom0 implementation(s)?
>
> Further, while not being a huge waste of memory, it still is one in case
> kexec gets never enabled, especially when considering a Dom0 kernel
> that's being built without CONFIG_KEXEC (or an incapable on, like any
> pv-ops kernel to date). So I also conceptually question that change.
>
> Jan
We (XenServer) have had many cases of the kexec path failing on customer
boxes under weird and seemingly inexplicable circumstances. This is why
I am working on trying to bullet-proofing the entire path.
We have 1 ticket where the contents of a crash note is clearly bogus (a
PRSTATUS is not 2GB long). We have a ticket where it appears that the
kdump kernel has failed to reassemble /proc/vmcore from elfcorehdr as a
few pcpus worth of crash notes are missing. I seem to remember a ticket
from before my time with a crash while writing the crash notes in Xen
itself. We even have a ticket stating that you get different crash notes
depending on whether you crash using the Xen debug keys (crash notes
appear completely bogus) or /proc/sysrq-trigger in dom0 (seems to be fine).
All of these are uncertain on reproducibility (except the final one
which was shown to reproduce on Xen-3.x and not on Xen-4.x so was not
investigated further) and have a habit of being unreproducible on any of
our local hardware, which makes fixing the problems tricky.
So yes - the 4 points I have made are certainly not regular or common
behavior, but given some of the tickets we have, I am fairly sure it is
not a bug-free path. I have checked the 2.6.32 implementation of dom0's
side of this and agree that it looks ok. However, it is my opinion that
relying on a certain hypercalling pattern from dom0 is a perilous route
for Xen, whether it is likely for that pattern to change in the future
or not.
Having said all of this, I agree with your second paragraph. As already
noted in my other email in this thread, I need to change the
implementation of this, so I will key the initial allocation of memory
on whether crashkernel= has been passed. This should be sufficient
indication as to whether the user minds having the space allocated or not.
--
Andrew Cooper - Dom0 Kernel Engineer, Citrix XenServer
T: +44 (0)1223 225 900, http://www.citrix.com
prev parent reply other threads:[~2011-11-30 14:01 UTC|newest]
Thread overview: 23+ messages / expand[flat|nested] mbox.gz Atom feed top
2011-11-29 18:56 [RFC] KEXEC: allocate crash note buffers at boot time Andrew Cooper
2011-11-29 11:19 ` Keir Fraser
2011-11-30 13:14 ` Andrew Cooper
2011-11-30 17:24 ` [RFC] KEXEC: allocate crash note buffers at boot time v2 Andrew Cooper
2011-12-01 9:08 ` Jan Beulich
2011-12-01 9:49 ` Andrew Cooper
2011-12-01 10:01 ` Jan Beulich
2011-12-01 12:29 ` [RFC] KEXEC: allocate crash note buffers at boot time v3 Andrew Cooper
2011-12-01 12:56 ` Jan Beulich
2011-12-01 5:20 ` Keir Fraser
2011-12-01 14:00 ` Andrew Cooper
2011-12-01 13:59 ` Andrew Cooper
2011-12-01 15:14 ` Jan Beulich
2011-12-01 15:02 ` Andrew Cooper
2011-12-01 15:15 ` Jan Beulich
2011-12-01 17:14 ` [RFC] KEXEC: allocate crash note buffers at boot time v4 Andrew Cooper
2011-12-02 8:02 ` Jan Beulich
2011-12-02 12:33 ` Andrew Cooper
2011-12-02 15:19 ` KEXEC: allocate crash note buffers at boot time v5 Andrew Cooper
2011-12-02 16:04 ` Jan Beulich
2011-12-02 16:10 ` KEXEC: allocate crash note buffers at boot time v Andrew Cooper
2011-11-30 9:20 ` [RFC] KEXEC: allocate crash note buffers at boot time Jan Beulich
2011-11-30 14:01 ` Andrew Cooper [this message]
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=4ED63753.4000007@citrix.com \
--to=andrew.cooper3@citrix.com \
--cc=JBeulich@suse.com \
--cc=keir@xen.org \
--cc=xen-devel@lists.xensource.com \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.