From: Igor Mammedov <imammedo@redhat.com>
To: gengdongjiu <gengdongjiu@huawei.com>
Cc: pbonzini@redhat.com, mst@redhat.com, zhaoshenglong@huawei.com,
peter.maydell@linaro.org, mtosatti@redhat.com, rth@twiddle.net,
ehabkost@redhat.com, james.morse@arm.com,
christoffer.dall@linaro.org, marc.zyngier@arm.com,
kvm@vger.kernel.org, qemu-devel@nongnu.org, qemu-arm@nongnu.org,
huangshaoyu@huawei.com, zhengqiang10@huawei.com,
xuwei5@hisilicon.com
Subject: Re: [Qemu-devel] [PATCH v14 9/9] target-arm: kvm64: handle SIGBUS signal from kernel or KVM
Date: Wed, 3 Jan 2018 14:44:42 +0100 [thread overview]
Message-ID: <20180103144442.21efd2c7@redhat.com> (raw)
In-Reply-To: <fed83a29-d96f-e696-8b94-dc2d1e81ee21@huawei.com>
On Wed, 3 Jan 2018 17:13:45 +0800
gengdongjiu <gengdongjiu@huawei.com> wrote:
> On 2017/12/28 23:07, Igor Mammedov wrote:
> > On Thu, 28 Dec 2017 13:54:18 +0800
> > Dongjiu Geng <gengdongjiu@huawei.com> wrote:
> >
> >> Add SIGBUS signal handler. In this handler, it checks the SIGBUS type,
> >> translates the host VA which is delivered by host to guest PA, then fill
> >> this PA to CPER and fill the CPER to guest APEI GHES memory, finally
> >> notify guest according to the SIGBUS type. There are two kinds of SIGBUS
> >> that QEMU needs to handle, which are BUS_MCEERR_AO and BUS_MCEERR_AR.
> >>
> >> If guest accesses the poisoned memory, it generates Synchronous External
> >> Abort(SEA). Then host kernel gets an APEI notification and call memory_failure()
> >> to unmapped the affected page from the guest's stage2, and SIGBUS_MCEERR_AO
> > s/unmapped/unmap/
> Thanks.
>
> >
> >> is delivered to Qemu's main thread. If Qemu receives this SIGBUS, it will
> >> create a new CPER and add it to guest APEI GHES memory, then notify the
> >> guest with a GPIO-Signal notification.
> > too long sentence, it's hard get what goes on here, pls split it in simple
> > sentences/rephrase so it would be easy to understand behavior.
> I will split it in simple sentences/rephrase.
> Thanks for your detailed review.
>
> >
> >>
> >> When guest hits a PG_hwpoison page, it will trap to KVM as stage2 fault, then a
> >> SIGBUS_MCEERR_AR synchronous signal is delivered to Qemu, Qemu record this error
> >> into guest APEI GHES memory and notify guest using Synchronous-External-Abort(SEA).
> >>
> >> Suggested-by: James Morse <james.morse@arm.com>
> >> Signed-off-by: Dongjiu Geng <gengdongjiu@huawei.com>
> >> ---
> >> Address James's comments to record CPER and notify guest for SIGBUS signal handling.
> >> Shown some discussion in [1].
> >>
> >> [1]:
> >> https://lkml.org/lkml/2017/2/27/246
> >> https://lkml.org/lkml/2017/9/14/241
> >> https://lkml.org/lkml/2017/9/22/499
> >> ---
> >> include/sysemu/kvm.h | 2 +-
> >> target/arm/kvm.c | 2 ++
> >> target/arm/kvm64.c | 34 ++++++++++++++++++++++++++++++++++
> >> 3 files changed, 37 insertions(+), 1 deletion(-)
> >>
> >> diff --git a/include/sysemu/kvm.h b/include/sysemu/kvm.h
> >> index 3a458f5..90c1605 100644
> >> --- a/include/sysemu/kvm.h
> >> +++ b/include/sysemu/kvm.h
> >> @@ -361,7 +361,7 @@ bool kvm_vcpu_id_is_valid(int vcpu_id);
> >> /* Returns VCPU ID to be used on KVM_CREATE_VCPU ioctl() */
> >> unsigned long kvm_arch_vcpu_id(CPUState *cpu);
> >>
> >> -#ifdef TARGET_I386
> >> +#if defined(TARGET_I386) || defined(TARGET_AARCH64)
> >> #define KVM_HAVE_MCE_INJECTION 1
> >> void kvm_arch_on_sigbus_vcpu(CPUState *cpu, int code, void *addr);
> >> #endif
> >> diff --git a/target/arm/kvm.c b/target/arm/kvm.c
> >> index 7c17f0d..9d25f51 100644
> >> --- a/target/arm/kvm.c
> >> +++ b/target/arm/kvm.c
> >> @@ -26,6 +26,7 @@
> >> #include "exec/address-spaces.h"
> >> #include "hw/boards.h"
> >> #include "qemu/log.h"
> >> +#include "exec/ram_addr.h"
> >>
> >> const KVMCapabilityInfo kvm_arch_required_capabilities[] = {
> >> KVM_CAP_LAST_INFO
> >> @@ -182,6 +183,7 @@ int kvm_arch_init(MachineState *ms, KVMState *s)
> >>
> >> cap_has_mp_state = kvm_check_extension(s, KVM_CAP_MP_STATE);
> >>
> >> + qemu_register_reset(kvm_unpoison_all, NULL);
> >> type_register_static(&host_arm_cpu_type_info);
> >>
> >> return 0;
> >> diff --git a/target/arm/kvm64.c b/target/arm/kvm64.c
> >> index c00450d..6955d85 100644
> >> --- a/target/arm/kvm64.c
> >> +++ b/target/arm/kvm64.c
> >> @@ -27,6 +27,9 @@
> >> #include "kvm_arm.h"
> >> #include "internals.h"
> >> #include "hw/arm/arm.h"
> >> +#include "exec/ram_addr.h"
> >> +#include "hw/acpi/acpi-defs.h"
> >> +#include "hw/acpi/hest_ghes.h"
> >>
> >> static bool have_guest_debug;
> >>
> >> @@ -944,6 +947,37 @@ int kvm_arch_get_registers(CPUState *cs)
> >> return ret;
> >> }
> >>
> >> +void kvm_arch_on_sigbus_vcpu(CPUState *c, int code, void *addr)
> >> +{
> >> + ram_addr_t ram_addr;
> >> + hwaddr paddr;
> >> +
> >> + assert(code == BUS_MCEERR_AR || code == BUS_MCEERR_AO);
> >> + if (addr) {
> >> + ram_addr = qemu_ram_addr_from_host(addr);
> >> + if (ram_addr != RAM_ADDR_INVALID &&
> >> + kvm_physical_memory_addr_from_host(c->kvm_state, addr, &paddr)) {
> >> + kvm_hwpoison_page_add(ram_addr);
> >> + if (code == BUS_MCEERR_AR) {
> >> + kvm_cpu_synchronize_state(c);
> >> + ghes_record_errors(ACPI_HEST_NOTIFY_SEA, paddr);
> >> + kvm_inject_arm_sea(c);
> >> + } else if (code == BUS_MCEERR_AO) {
> >> + ghes_record_errors(ACPI_HEST_NOTIFY_GPIO, paddr);
> >> + qemu_hardware_error_notify();
> >> + }
> >> + return;
> >> + }
> >> + fprintf(stderr, "Hardware memory error for memory used by "
> >> + "QEMU itself instead of guest system!\n");
> > not quite sure what above message means,
> When the memory error address belong to QEMU itself, not belong to guest OS.
> it will print above message.
>
> Above message means this memory error happens in QEMU application instead of guest OS.
I'm not really understand what's going here and how it could happen,
so I can't suggest something. Perhaps someone else could comment on it.
> > also fprintf() probably shouldn't be used by new code.
> how about we use error_report()? thanks
I'm not sure what current trend is, but I'd use error_report() vs fprintf()
Also series could benefit from trace-points (I haven't noticed any).
> >
> >> + }
> >> +
> >> + if (code == BUS_MCEERR_AR) {
> >> + fprintf(stderr, "Hardware memory error!\n");
> >> + exit(1);
> >> + }
> >> +}
> >> +
> >> /* C6.6.29 BRK instruction */
> >> static const uint32_t brk_insn = 0xd4200000;
> >>
> >
> >
> > .
> >
>
next prev parent reply other threads:[~2018-01-03 13:45 UTC|newest]
Thread overview: 42+ messages / expand[flat|nested] mbox.gz Atom feed top
2017-12-28 5:54 [Qemu-devel] [PATCH v14 0/9] Add ARMv8 RAS virtualization support in QEMU Dongjiu Geng
2017-12-28 5:54 ` [Qemu-devel] [PATCH v14 1/9] ACPI: add some GHES structures and macros definition Dongjiu Geng
2017-12-28 12:29 ` Igor Mammedov
2018-01-03 10:29 ` gengdongjiu
2017-12-28 5:54 ` [Qemu-devel] [PATCH v14 2/9] ACPI: Add APEI GHES table generation and CPER record support Dongjiu Geng
2017-12-28 14:18 ` Igor Mammedov
2017-12-29 6:33 ` gengdongjiu
2018-01-03 2:21 ` gengdongjiu
2018-01-03 13:31 ` Igor Mammedov
2018-01-04 4:21 ` gengdongjiu
2018-01-09 16:51 ` Peter Maydell
2018-01-10 5:22 ` gengdongjiu
2017-12-28 5:54 ` [Qemu-devel] [PATCH v14 3/9] docs: APEI GHES generation and CPER record description Dongjiu Geng
2017-12-28 5:54 ` [Qemu-devel] [PATCH v14 4/9] ACPI: enable APEI GHES in the configure file Dongjiu Geng
2018-01-09 17:16 ` Peter Maydell
2018-01-10 12:20 ` gengdongjiu
2017-12-28 5:54 ` [Qemu-devel] [PATCH v14 5/9] target-arm: kvm64: inject synchronous External Abort Dongjiu Geng
2017-12-28 13:49 ` Igor Mammedov
2017-12-29 6:27 ` gengdongjiu
2018-01-09 17:30 ` Peter Maydell
2018-01-11 5:59 ` gengdongjiu
2018-01-11 9:53 ` [Qemu-devel] [Qemu-arm] " Peter Maydell
2018-01-11 10:33 ` gengdongjiu
2018-01-13 5:24 ` [Qemu-devel] " gengdongjiu
2018-01-13 8:27 ` gengdongjiu
2017-12-28 5:54 ` [Qemu-devel] [PATCH v14 6/9] Move related hwpoison page functions to accel/kvm/ folder Dongjiu Geng
2017-12-28 5:54 ` [Qemu-devel] [PATCH v14 7/9] ARM: ACPI: Add GPIO notification type for hardware RAS error Dongjiu Geng
2017-12-28 14:53 ` Igor Mammedov
2018-01-03 3:48 ` gengdongjiu
2018-01-03 13:36 ` Igor Mammedov
2018-01-04 4:55 ` gengdongjiu
2017-12-28 5:54 ` [Qemu-devel] [PATCH v14 8/9] hw/arm/virt: Add RAS platform version for migration Dongjiu Geng
2017-12-28 14:58 ` Igor Mammedov
2018-01-03 4:02 ` gengdongjiu
2018-01-09 15:42 ` Peter Maydell
2017-12-28 5:54 ` [Qemu-devel] [PATCH v14 9/9] target-arm: kvm64: handle SIGBUS signal from kernel or KVM Dongjiu Geng
2017-12-28 15:07 ` Igor Mammedov
2018-01-03 9:13 ` gengdongjiu
2018-01-03 13:44 ` Igor Mammedov [this message]
2018-01-04 6:31 ` gengdongjiu
2018-01-09 17:14 ` Peter Maydell
2018-01-10 11:56 ` gengdongjiu
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=20180103144442.21efd2c7@redhat.com \
--to=imammedo@redhat.com \
--cc=christoffer.dall@linaro.org \
--cc=ehabkost@redhat.com \
--cc=gengdongjiu@huawei.com \
--cc=huangshaoyu@huawei.com \
--cc=james.morse@arm.com \
--cc=kvm@vger.kernel.org \
--cc=marc.zyngier@arm.com \
--cc=mst@redhat.com \
--cc=mtosatti@redhat.com \
--cc=pbonzini@redhat.com \
--cc=peter.maydell@linaro.org \
--cc=qemu-arm@nongnu.org \
--cc=qemu-devel@nongnu.org \
--cc=rth@twiddle.net \
--cc=xuwei5@hisilicon.com \
--cc=zhaoshenglong@huawei.com \
--cc=zhengqiang10@huawei.com \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).