qemu-devel.nongnu.org archive mirror
From: Laszlo Ersek <lersek@redhat.com>
To: Christopher Covington <cov@codeaurora.org>
Cc: Ekaterina Tumanova <tumanova@linux.vnet.ibm.com>,
	kexec@lists.infradead.org,
	qemu devel list <qemu-devel@nongnu.org>,
	Qiao Nuohan <qiaonuohan@cn.fujitsu.com>,
	Dave Anderson <anderson@redhat.com>,
	kumagai-atsushi@mxc.nes.nec.co.jp, crash-utility@redhat.com
Subject: Re: [Qemu-devel] uniquely identifying KDUMP files that originate from QEMU
Date: Wed, 12 Nov 2014 15:37:58 +0100
Message-ID: <546370C6.9060702@redhat.com>
In-Reply-To: <54624710.6070306@codeaurora.org>

On 11/11/14 18:27, Christopher Covington wrote:
> On 11/11/2014 06:22 AM, Laszlo Ersek wrote:
>> (Note: I'm not subscribed to either qemu-devel or the kexec list; please
>> keep me CC'd.)
>>
>> QEMU is able to dump the guest's memory in KDUMP format (kdump-zlib,
>> kdump-lzo, kdump-snappy) with the "dump-guest-memory" QMP command.
>>
>> The resultant vmcore is usually analyzed with the "crash" utility.
>>
>> The original tool producing such files is kdump. Unlike the procedure
>> performed by QEMU, kdump runs from *within* the guest (under a kexec'd
>> kdump kernel), and has more information about the original guest kernel
>> state (which is being dumped) than QEMU. To QEMU, the guest kernel state
>> is opaque.
>>
>> For this reason, the kdump preparation logic in QEMU hardcodes a number
>> of fields in the kdump header. The direct issue is the "phys_base"
>> field. Refer to dump.c, functions create_header32(), create_header64(),
>> and "include/sysemu/dump.h", macro PHYS_BASE (with the replacement text
>> "0").
>>
>> http://git.qemu.org/?p=qemu.git;a=blob;f=dump.c;h=9c7dad8f865af3b778589dd0847e450ba9a75b9d;hb=HEAD
>>
>> http://git.qemu.org/?p=qemu.git;a=blob;f=include/sysemu/dump.h;h=7e4ec5c7d96fb39c943d970d1683aa2dc171c933;hb=HEAD
>>
>> This works in most cases, because the guest Linux kernel indeed tends to
>> be loaded at guest-phys address 0. However, when the guest Linux kernel
>> is booted on top of OVMF (which has a somewhat unusual UEFI memory map),
>> then the guest Linux kernel is loaded at 16MB, thereby getting out of
>> sync with the phys_base=0 setting visible in the KDUMP header.
>>
>> This trips up the "crash" utility.
>>
>> Dave worked around the issue in "crash" for ELF format dumps -- "crash"
>> can identify QEMU as the originator of the vmcore by finding the QEMU
>> notes in the ELF vmcore. If those are present, then "crash" employs a
>> heuristic, probing for a phys_base up to 32MB, in 1MB steps.
> 
> What advantages does KDUMP have over ELF?

This has been discussed, but I'd like to give a short perspective from
personal experience.

The more obvious advantage is the smaller size, due to (a) per-page
compression (which preserves random access for "crash"), and (b) zero
page sharing. A smaller dump file is easier to store, and easier to
upload if you're requesting assistance with debugging.
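
To illustrate points (a) and (b), here is a minimal sketch in Python. It
is not the KDUMP on-disk layout and not QEMU's code; the names
PAGE_SIZE, compress_pages() and read_page() are made up for this
example, and it assumes the input length is a multiple of the page size.
It only shows why compressing each page separately keeps random access
cheap, and how all-zero pages can share a single stored copy.

```python
import zlib

PAGE_SIZE = 4096                 # assumed page size for the illustration
ZERO_PAGE = bytes(PAGE_SIZE)     # one page of zero bytes


def compress_pages(memory):
    """Compress each page on its own; store the all-zero page only once.

    Returns (index, blobs): index[n] is the position in blobs that holds
    the compressed payload of page n.
    """
    blobs = []
    index = []
    zero_slot = None             # position of the shared zero page, if any
    for off in range(0, len(memory), PAGE_SIZE):
        page = memory[off:off + PAGE_SIZE]
        if page == ZERO_PAGE:
            if zero_slot is None:
                zero_slot = len(blobs)
                blobs.append(zlib.compress(page))
            index.append(zero_slot)
        else:
            index.append(len(blobs))
            blobs.append(zlib.compress(page))
    return index, blobs


def read_page(index, blobs, pageno):
    """Random access: decompress exactly one page, nothing else."""
    return zlib.decompress(blobs[index[pageno]])


if __name__ == "__main__":
    # Hypothetical 5-page "guest memory": four zero pages and one data page.
    memory = ZERO_PAGE * 3 + b"\x01" * PAGE_SIZE + ZERO_PAGE
    index, blobs = compress_pages(memory)
    print(index, len(blobs), sum(len(b) for b in blobs))
```

A reader that wants page N looks it up in the index and decompresses one
small payload, which is the access pattern "crash" needs.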

The perhaps less obvious advantage is the speed at which qemu writes the
dump. We're talking orders of magnitude, especially on rotational media.
This is because lzo and snappy are *incredibly* fast (put differently:
they incur very little CPU penalty for the same guest RAM size). The CPU
penalty is actually so small that in almost all cases the dumping
procedure stays IO-bound (in my experience: even on an SSD!). Since the
write time then scales with the number of bytes written, combine that
with a potential reduction of 4GB -> 256MB in size: that's roughly a
sixteen-fold speedup.
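
The arithmetic behind that figure, as a small Python sketch. The
throughput value is an arbitrary assumption (it cancels out in the
ratio), and io_bound_write_seconds() is a name invented for this
example; the model simply assumes the write is purely IO-bound, so time
is proportional to bytes written.

```python
GiB = 1024 ** 3
MiB = 1024 ** 2


def io_bound_write_seconds(size_bytes, throughput_bytes_per_s):
    """Time to write a dump when the disk is the only bottleneck."""
    return size_bytes / throughput_bytes_per_s


# Assumed sustained disk throughput; any positive value gives the same ratio.
throughput = 100 * MiB

raw = io_bound_write_seconds(4 * GiB, throughput)           # uncompressed dump
compressed = io_bound_write_seconds(256 * MiB, throughput)  # compressed dump
speedup = raw / compressed

if __name__ == "__main__":
    print(f"raw: {raw:.2f}s, compressed: {compressed:.2f}s, "
          f"speedup: {speedup:.1f}x")
```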

(I'm allowed to praise this qemu feature, I didn't write it. :))

Thanks
Laszlo
