From: Jan Kiszka <jan.kiszka@siemens.com>
To: Huang Ying <ying.huang@intel.com>
Cc: Avi Kivity <avi@redhat.com>,
Marcelo Tosatti <mtosatti@redhat.com>,
Anthony Liguori <aliguori@linux.vnet.ibm.com>,
"qemu-devel@nongnu.org" <qemu-devel@nongnu.org>,
"kvm@vger.kernel.org" <kvm@vger.kernel.org>,
Dean Nelson <dnelson@redhat.com>,
Andi Kleen <andi@firstfloor.org>,
"Wu, Fengguang" <fengguang.wu@intel.com>
Subject: Re: [PATCH uq/master 2/2] MCE, unpoison memory address across reboot
Date: Mon, 17 Jan 2011 10:48:43 +0100 [thread overview]
Message-ID: <4D34107B.6020703@siemens.com> (raw)
In-Reply-To: <1295230088.10748.79.camel@yhuang-dev>
On 2011-01-17 03:08, Huang Ying wrote:
>>>> As indicated, I'm sitting on lots of fixes and refactorings of the MCE
>>>> user space code. How do you test your patches? Any suggestions how to do
>>>> this efficiently would be warmly welcome.
>>>
>>> We use a self-made test script to test. Repository is at:
>>>
>>> git://git.kernel.org/pub/scm/utils/cpu/mce/mce-test.git
>>>
>>> The kvm test script is in kvm sub-directory.
>>>
>>> The qemu patch attached is need by the test script.
>>>
>>
>> Yeah, I already found this yesterday and started reading. I was just
>> searching for p2v in qemu, but now it's clear where it comes from. Will
>> have a look (if you want to preview my changes:
>> git://git.kiszka.org/qemu-kvm.git queues/kvm-upstream).
>>
>> I was almost about to use MADV_HWPOISON instead of the injection module.
>> Is there a way to recover the fake corruption afterward? I think that
>> would allow to move some of the test logic into qemu and avoid p2v which
>> - IIRC - was disliked upstream.
>
> I don't know how to fully recover from MADV_HWPOISON. You can recover
> the virtual address space via qemu_ram_remap() introduced in 1/2 of this
> patchset. But you will lose one or several physical pages for each
> testing. I think that may be not a big issue for a testing machine.
>
> Ccing Andi and Fengguang, they know more than me about MADV_HWPOISON.
"page-types -b hwpoison -x" does the trick of unpoisoning for me. It can
be found at linux/Documentation/vm/page-types.c. So it's quite easy to
set up and clean up a test case based on MADV_HWPOISON IMO. Not sure,
though, if that can simulate all of what you currently do via mce-inject.
>
>> Also, is there a way to simulate corrected errors (BUS_MCEERR_AO)?
>
> BUS_MCEERR_AO is recoverable uncorrected error instead of corrected
> error.
>
> The test script is for BUS_MCEERR_AO and BUS_MCEERR_AR. To see the
> effect of pure BUS_MCEERR_AO, just remove the memory accessing loop
> (memset) in tools/simple_process/simple_process.c.
Yeah, that question was based on lacking knowledge about the different
error types. Meanwhile, I was able to trigger BUS_MCEERR_AO via
MADV_HWPOISON - and also BUS_MCEERR_AR by accessing that page. However,
I did not succeed with using mce-inject so far, thus with mce-test. But
I need to check this again.
Jan
--
Siemens AG, Corporate Technology, CT T DE IT 1
Corporate Competence Center Embedded Linux
WARNING: multiple messages have this Message-ID (diff)
From: Jan Kiszka <jan.kiszka@siemens.com>
To: Huang Ying <ying.huang@intel.com>
Cc: "kvm@vger.kernel.org" <kvm@vger.kernel.org>,
Dean Nelson <dnelson@redhat.com>,
Marcelo Tosatti <mtosatti@redhat.com>,
"qemu-devel@nongnu.org" <qemu-devel@nongnu.org>,
Anthony Liguori <aliguori@linux.vnet.ibm.com>,
Andi Kleen <andi@firstfloor.org>, Avi Kivity <avi@redhat.com>,
"Wu, Fengguang" <fengguang.wu@intel.com>
Subject: [Qemu-devel] Re: [PATCH uq/master 2/2] MCE, unpoison memory address across reboot
Date: Mon, 17 Jan 2011 10:48:43 +0100 [thread overview]
Message-ID: <4D34107B.6020703@siemens.com> (raw)
In-Reply-To: <1295230088.10748.79.camel@yhuang-dev>
On 2011-01-17 03:08, Huang Ying wrote:
>>>> As indicated, I'm sitting on lots of fixes and refactorings of the MCE
>>>> user space code. How do you test your patches? Any suggestions how to do
>>>> this efficiently would be warmly welcome.
>>>
>>> We use a self-made test script to test. Repository is at:
>>>
>>> git://git.kernel.org/pub/scm/utils/cpu/mce/mce-test.git
>>>
>>> The kvm test script is in kvm sub-directory.
>>>
>>> The qemu patch attached is need by the test script.
>>>
>>
>> Yeah, I already found this yesterday and started reading. I was just
>> searching for p2v in qemu, but now it's clear where it comes from. Will
>> have a look (if you want to preview my changes:
>> git://git.kiszka.org/qemu-kvm.git queues/kvm-upstream).
>>
>> I was almost about to use MADV_HWPOISON instead of the injection module.
>> Is there a way to recover the fake corruption afterward? I think that
>> would allow to move some of the test logic into qemu and avoid p2v which
>> - IIRC - was disliked upstream.
>
> I don't know how to fully recover from MADV_HWPOISON. You can recover
> the virtual address space via qemu_ram_remap() introduced in 1/2 of this
> patchset. But you will lose one or several physical pages for each
> testing. I think that may be not a big issue for a testing machine.
>
> Ccing Andi and Fengguang, they know more than me about MADV_HWPOISON.
"page-types -b hwpoison -x" does the trick of unpoisoning for me. It can
be found at linux/Documentation/vm/page-types.c. So it's quite easy to
set up and clean up a test case based on MADV_HWPOISON IMO. Not sure,
though, if that can simulate all of what you currently do via mce-inject.
>
>> Also, is there a way to simulate corrected errors (BUS_MCEERR_AO)?
>
> BUS_MCEERR_AO is recoverable uncorrected error instead of corrected
> error.
>
> The test script is for BUS_MCEERR_AO and BUS_MCEERR_AR. To see the
> effect of pure BUS_MCEERR_AO, just remove the memory accessing loop
> (memset) in tools/simple_process/simple_process.c.
Yeah, that question was based on lacking knowledge about the different
error types. Meanwhile, I was able to trigger BUS_MCEERR_AO via
MADV_HWPOISON - and also BUS_MCEERR_AR by accessing that page. However,
I did not succeed with using mce-inject so far, thus with mce-test. But
I need to check this again.
Jan
--
Siemens AG, Corporate Technology, CT T DE IT 1
Corporate Competence Center Embedded Linux
next prev parent reply other threads:[~2011-01-17 9:49 UTC|newest]
Thread overview: 14+ messages / expand[flat|nested] mbox.gz Atom feed top
2011-01-13 8:34 [PATCH uq/master 2/2] MCE, unpoison memory address across reboot Huang Ying
2011-01-13 8:34 ` [Qemu-devel] " Huang Ying
2011-01-13 9:01 ` Jan Kiszka
2011-01-13 9:01 ` [Qemu-devel] " Jan Kiszka
2011-01-14 1:51 ` Huang Ying
2011-01-14 1:51 ` [Qemu-devel] " Huang Ying
2011-01-14 8:38 ` Jan Kiszka
2011-01-14 8:38 ` [Qemu-devel] " Jan Kiszka
2011-01-17 2:08 ` Huang Ying
2011-01-17 2:08 ` [Qemu-devel] " Huang Ying
2011-01-17 9:48 ` Jan Kiszka [this message]
2011-01-17 9:48 ` Jan Kiszka
2011-01-13 9:29 ` Jan Kiszka
2011-01-13 9:29 ` [Qemu-devel] " Jan Kiszka
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=4D34107B.6020703@siemens.com \
--to=jan.kiszka@siemens.com \
--cc=aliguori@linux.vnet.ibm.com \
--cc=andi@firstfloor.org \
--cc=avi@redhat.com \
--cc=dnelson@redhat.com \
--cc=fengguang.wu@intel.com \
--cc=kvm@vger.kernel.org \
--cc=mtosatti@redhat.com \
--cc=qemu-devel@nongnu.org \
--cc=ying.huang@intel.com \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.