From: Jonathan Cameron via <qemu-arm@nongnu.org>
To: Mauro Carvalho Chehab <mchehab+huawei@kernel.org>
Cc: Igor Mammedov <imammedo@redhat.com>,
"Michael S . Tsirkin" <mst@redhat.com>,
Shiju Jose <shiju.jose@huawei.com>, <qemu-arm@nongnu.org>,
<qemu-devel@nongnu.org>, Dongjiu Geng <gengdongjiu1@gmail.com>,
<linux-kernel@vger.kernel.org>
Subject: Re: [PATCH v6 18/19] docs: hest: add new "etc/acpi_table_hest_addr" and update workflow
Date: Fri, 28 Feb 2025 17:36:08 +0800 [thread overview]
Message-ID: <20250228173608.0000779c@huawei.com> (raw)
In-Reply-To: <445f7d41b01ffc5bd79d6fd83bc7c4c621bcbc85.1740671863.git.mchehab+huawei@kernel.org>
On Thu, 27 Feb 2025 17:00:56 +0100
Mauro Carvalho Chehab <mchehab+huawei@kernel.org> wrote:
> While the HEST layout didn't change, there are some internal
> changes related to how offsets are calculated and how memory error
> events are triggered.
>
> Update specs to reflect such changes.
>
> Signed-off-by: Mauro Carvalho Chehab <mchehab+huawei@kernel.org>
One minor editorial suggestion. With that or similar tidy up,
Reviewed-by: Jonathan Cameron <Jonathan.Cameron@huawei.com>
> ---
> docs/specs/acpi_hest_ghes.rst | 28 +++++++++++++++++-----------
> 1 file changed, 17 insertions(+), 11 deletions(-)
>
> diff --git a/docs/specs/acpi_hest_ghes.rst b/docs/specs/acpi_hest_ghes.rst
> index c3e9f8d9a702..4311a9536b21 100644
> --- a/docs/specs/acpi_hest_ghes.rst
> +++ b/docs/specs/acpi_hest_ghes.rst
> @@ -89,12 +89,21 @@ Design Details
> addresses in the "error_block_address" fields with a pointer to the
> respective "Error Status Data Block" in the "etc/hardware_errors" blob.
>
> -(8) QEMU defines a third and write-only fw_cfg blob which is called
> - "etc/hardware_errors_addr". Through that blob, the firmware can send back
> - the guest-side allocation addresses to QEMU. The "etc/hardware_errors_addr"
> - blob contains a 8-byte entry. QEMU generates a single WRITE_POINTER command
> - for the firmware. The firmware will write back the start address of
> - "etc/hardware_errors" blob to the fw_cfg file "etc/hardware_errors_addr".
> +(8) QEMU defines a third and write-only fw_cfg blob to store the location
> + where the error block offsets, read ack registers and CPER records are
> + stored.
> +
> + Up to QEMU 9.2, the location was at "etc/hardware_errors_addr", and
> + contains a GPA for the beginning of "etc/hardware_errors".
> +
> + Newer versions place the location at "etc/acpi_table_hest_addr",
> + pointing to the GPA of the HEST table.
> +
> + Through that such GPA values, the firmware can send back the guest-side
This confuses me.
Via those GPA values...? (maybe?)
> + allocation addresses to QEMU. They contain a 8-byte entry. QEMU generates
> + a single WRITE_POINTER command for the firmware. The firmware will write
> + back the start address of either "etc/hardware_errors" or HEST table at
> + the corresponding fw_cfg file.
>
> (9) When QEMU gets a SIGBUS from the kernel, QEMU writes CPER into corresponding
> "Error Status Data Block", guest memory, and then injects platform specific
> @@ -105,8 +114,5 @@ Design Details
> kernel, on receiving notification, guest APEI driver could read the CPER error
> and take appropriate action.
>
> -(11) kvm_arch_on_sigbus_vcpu() uses source_id as index in "etc/hardware_errors" to
> - find out "Error Status Data Block" entry corresponding to error source. So supported
> - source_id values should be assigned here and not be changed afterwards to make sure
> - that guest will write error into expected "Error Status Data Block" even if guest was
> - migrated to a newer QEMU.
> +(11) kvm_arch_on_sigbus_vcpu() report RAS errors via a SEA notifications,
> + when a SIGBUS event is triggered.
WARNING: multiple messages have this Message-ID (diff)
From: Jonathan Cameron via <qemu-devel@nongnu.org>
To: Mauro Carvalho Chehab <mchehab+huawei@kernel.org>
Cc: Igor Mammedov <imammedo@redhat.com>,
"Michael S . Tsirkin" <mst@redhat.com>,
Shiju Jose <shiju.jose@huawei.com>, <qemu-arm@nongnu.org>,
<qemu-devel@nongnu.org>, Dongjiu Geng <gengdongjiu1@gmail.com>,
<linux-kernel@vger.kernel.org>
Subject: Re: [PATCH v6 18/19] docs: hest: add new "etc/acpi_table_hest_addr" and update workflow
Date: Fri, 28 Feb 2025 17:36:08 +0800 [thread overview]
Message-ID: <20250228173608.0000779c@huawei.com> (raw)
In-Reply-To: <445f7d41b01ffc5bd79d6fd83bc7c4c621bcbc85.1740671863.git.mchehab+huawei@kernel.org>
On Thu, 27 Feb 2025 17:00:56 +0100
Mauro Carvalho Chehab <mchehab+huawei@kernel.org> wrote:
> While the HEST layout didn't change, there are some internal
> changes related to how offsets are calculated and how memory error
> events are triggered.
>
> Update specs to reflect such changes.
>
> Signed-off-by: Mauro Carvalho Chehab <mchehab+huawei@kernel.org>
One minor editorial suggestion. With that or similar tidy up,
Reviewed-by: Jonathan Cameron <Jonathan.Cameron@huawei.com>
> ---
> docs/specs/acpi_hest_ghes.rst | 28 +++++++++++++++++-----------
> 1 file changed, 17 insertions(+), 11 deletions(-)
>
> diff --git a/docs/specs/acpi_hest_ghes.rst b/docs/specs/acpi_hest_ghes.rst
> index c3e9f8d9a702..4311a9536b21 100644
> --- a/docs/specs/acpi_hest_ghes.rst
> +++ b/docs/specs/acpi_hest_ghes.rst
> @@ -89,12 +89,21 @@ Design Details
> addresses in the "error_block_address" fields with a pointer to the
> respective "Error Status Data Block" in the "etc/hardware_errors" blob.
>
> -(8) QEMU defines a third and write-only fw_cfg blob which is called
> - "etc/hardware_errors_addr". Through that blob, the firmware can send back
> - the guest-side allocation addresses to QEMU. The "etc/hardware_errors_addr"
> - blob contains a 8-byte entry. QEMU generates a single WRITE_POINTER command
> - for the firmware. The firmware will write back the start address of
> - "etc/hardware_errors" blob to the fw_cfg file "etc/hardware_errors_addr".
> +(8) QEMU defines a third and write-only fw_cfg blob to store the location
> + where the error block offsets, read ack registers and CPER records are
> + stored.
> +
> + Up to QEMU 9.2, the location was at "etc/hardware_errors_addr", and
> + contains a GPA for the beginning of "etc/hardware_errors".
> +
> + Newer versions place the location at "etc/acpi_table_hest_addr",
> + pointing to the GPA of the HEST table.
> +
> + Through that such GPA values, the firmware can send back the guest-side
This confuses me.
Via those GPA values...? (maybe?)
> + allocation addresses to QEMU. They contain a 8-byte entry. QEMU generates
> + a single WRITE_POINTER command for the firmware. The firmware will write
> + back the start address of either "etc/hardware_errors" or HEST table at
> + the corresponding fw_cfg file.
>
> (9) When QEMU gets a SIGBUS from the kernel, QEMU writes CPER into corresponding
> "Error Status Data Block", guest memory, and then injects platform specific
> @@ -105,8 +114,5 @@ Design Details
> kernel, on receiving notification, guest APEI driver could read the CPER error
> and take appropriate action.
>
> -(11) kvm_arch_on_sigbus_vcpu() uses source_id as index in "etc/hardware_errors" to
> - find out "Error Status Data Block" entry corresponding to error source. So supported
> - source_id values should be assigned here and not be changed afterwards to make sure
> - that guest will write error into expected "Error Status Data Block" even if guest was
> - migrated to a newer QEMU.
> +(11) kvm_arch_on_sigbus_vcpu() report RAS errors via a SEA notifications,
> + when a SIGBUS event is triggered.
WARNING: multiple messages have this Message-ID (diff)
From: Jonathan Cameron <Jonathan.Cameron@huawei.com>
To: Mauro Carvalho Chehab <mchehab+huawei@kernel.org>
Cc: Igor Mammedov <imammedo@redhat.com>,
"Michael S . Tsirkin" <mst@redhat.com>,
Shiju Jose <shiju.jose@huawei.com>, <qemu-arm@nongnu.org>,
<qemu-devel@nongnu.org>, Dongjiu Geng <gengdongjiu1@gmail.com>,
<linux-kernel@vger.kernel.org>
Subject: Re: [PATCH v6 18/19] docs: hest: add new "etc/acpi_table_hest_addr" and update workflow
Date: Fri, 28 Feb 2025 17:36:08 +0800 [thread overview]
Message-ID: <20250228173608.0000779c@huawei.com> (raw)
In-Reply-To: <445f7d41b01ffc5bd79d6fd83bc7c4c621bcbc85.1740671863.git.mchehab+huawei@kernel.org>
On Thu, 27 Feb 2025 17:00:56 +0100
Mauro Carvalho Chehab <mchehab+huawei@kernel.org> wrote:
> While the HEST layout didn't change, there are some internal
> changes related to how offsets are calculated and how memory error
> events are triggered.
>
> Update specs to reflect such changes.
>
> Signed-off-by: Mauro Carvalho Chehab <mchehab+huawei@kernel.org>
One minor editorial suggestion. With that or similar tidy up,
Reviewed-by: Jonathan Cameron <Jonathan.Cameron@huawei.com>
> ---
> docs/specs/acpi_hest_ghes.rst | 28 +++++++++++++++++-----------
> 1 file changed, 17 insertions(+), 11 deletions(-)
>
> diff --git a/docs/specs/acpi_hest_ghes.rst b/docs/specs/acpi_hest_ghes.rst
> index c3e9f8d9a702..4311a9536b21 100644
> --- a/docs/specs/acpi_hest_ghes.rst
> +++ b/docs/specs/acpi_hest_ghes.rst
> @@ -89,12 +89,21 @@ Design Details
> addresses in the "error_block_address" fields with a pointer to the
> respective "Error Status Data Block" in the "etc/hardware_errors" blob.
>
> -(8) QEMU defines a third and write-only fw_cfg blob which is called
> - "etc/hardware_errors_addr". Through that blob, the firmware can send back
> - the guest-side allocation addresses to QEMU. The "etc/hardware_errors_addr"
> - blob contains a 8-byte entry. QEMU generates a single WRITE_POINTER command
> - for the firmware. The firmware will write back the start address of
> - "etc/hardware_errors" blob to the fw_cfg file "etc/hardware_errors_addr".
> +(8) QEMU defines a third and write-only fw_cfg blob to store the location
> + where the error block offsets, read ack registers and CPER records are
> + stored.
> +
> + Up to QEMU 9.2, the location was at "etc/hardware_errors_addr", and
> + contains a GPA for the beginning of "etc/hardware_errors".
> +
> + Newer versions place the location at "etc/acpi_table_hest_addr",
> + pointing to the GPA of the HEST table.
> +
> + Through that such GPA values, the firmware can send back the guest-side
This confuses me.
Via those GPA values...? (maybe?)
> + allocation addresses to QEMU. They contain a 8-byte entry. QEMU generates
> + a single WRITE_POINTER command for the firmware. The firmware will write
> + back the start address of either "etc/hardware_errors" or HEST table at
> + the corresponding fw_cfg file.
>
> (9) When QEMU gets a SIGBUS from the kernel, QEMU writes CPER into corresponding
> "Error Status Data Block", guest memory, and then injects platform specific
> @@ -105,8 +114,5 @@ Design Details
> kernel, on receiving notification, guest APEI driver could read the CPER error
> and take appropriate action.
>
> -(11) kvm_arch_on_sigbus_vcpu() uses source_id as index in "etc/hardware_errors" to
> - find out "Error Status Data Block" entry corresponding to error source. So supported
> - source_id values should be assigned here and not be changed afterwards to make sure
> - that guest will write error into expected "Error Status Data Block" even if guest was
> - migrated to a newer QEMU.
> +(11) kvm_arch_on_sigbus_vcpu() report RAS errors via a SEA notifications,
> + when a SIGBUS event is triggered.
next prev parent reply other threads:[~2025-02-28 9:36 UTC|newest]
Thread overview: 52+ messages / expand[flat|nested] mbox.gz Atom feed top
2025-02-27 16:00 [PATCH v6 00/19] Change ghes to use HEST-based offsets and add support for error inject Mauro Carvalho Chehab
2025-02-27 16:00 ` [PATCH v6 01/19] tests/acpi: virt: add an empty HEST file Mauro Carvalho Chehab
2025-02-28 9:16 ` Jonathan Cameron
2025-02-28 9:16 ` Jonathan Cameron via
2025-02-27 16:00 ` [PATCH v6 02/19] tests/qtest/bios-tables-test: extend to also check HEST table Mauro Carvalho Chehab
2025-02-28 9:17 ` Jonathan Cameron via
2025-02-28 9:17 ` Jonathan Cameron
2025-02-28 9:17 ` Jonathan Cameron via
2025-02-27 16:00 ` [PATCH v6 03/19] tests/acpi: virt: update HEST file with its current data Mauro Carvalho Chehab
2025-02-28 9:18 ` Jonathan Cameron
2025-02-28 9:18 ` Jonathan Cameron via
2025-02-27 16:00 ` [PATCH v6 04/19] acpi/ghes: Cleanup the code which gets ghes ged state Mauro Carvalho Chehab
2025-02-28 9:19 ` Jonathan Cameron
2025-02-28 9:19 ` Jonathan Cameron via
2025-02-28 9:19 ` Jonathan Cameron via
2025-02-27 16:00 ` [PATCH v6 05/19] acpi/ghes: prepare to change the way HEST offsets are calculated Mauro Carvalho Chehab
2025-02-28 9:26 ` Jonathan Cameron via
2025-02-28 9:26 ` Jonathan Cameron
2025-02-28 9:26 ` Jonathan Cameron via
2025-02-28 12:57 ` Igor Mammedov
2025-02-27 16:00 ` [PATCH v6 06/19] acpi/ghes: add a firmware file with HEST address Mauro Carvalho Chehab
2025-02-27 16:00 ` [PATCH v6 07/19] acpi/ghes: Use HEST table offsets when preparing GHES records Mauro Carvalho Chehab
2025-02-27 16:00 ` [PATCH v6 08/19] acpi/ghes: don't hard-code the number of sources for HEST table Mauro Carvalho Chehab
2025-02-27 16:00 ` [PATCH v6 09/19] acpi/ghes: add a notifier to notify when error data is ready Mauro Carvalho Chehab
2025-02-28 12:58 ` Igor Mammedov
2025-02-27 16:00 ` [PATCH v6 10/19] acpi/generic_event_device: Update GHES migration to cover hest addr Mauro Carvalho Chehab
2025-02-27 16:00 ` [PATCH v6 11/19] acpi/generic_event_device: add logic to detect if HEST addr is available Mauro Carvalho Chehab
2025-02-27 16:00 ` [PATCH v6 12/19] acpi/generic_event_device: add an APEI error device Mauro Carvalho Chehab
2025-02-27 16:00 ` [PATCH v6 13/19] tests/acpi: virt: allow acpi table changes at DSDT and HEST tables Mauro Carvalho Chehab
2025-02-28 9:28 ` Jonathan Cameron via
2025-02-28 9:28 ` Jonathan Cameron
2025-02-28 9:28 ` Jonathan Cameron via
2025-02-27 16:00 ` [PATCH v6 14/19] arm/virt: Wire up a GED error device for ACPI / GHES Mauro Carvalho Chehab
2025-02-27 16:00 ` [PATCH v6 15/19] qapi/acpi-hest: add an interface to do generic CPER error injection Mauro Carvalho Chehab
2025-02-27 16:00 ` [PATCH v6 16/19] acpi/generic_event_device.c: enable use_hest_addr for QEMU 10.x Mauro Carvalho Chehab
2025-02-28 9:29 ` Jonathan Cameron via
2025-02-28 9:29 ` Jonathan Cameron
2025-02-28 9:29 ` Jonathan Cameron via
2025-02-28 12:59 ` Igor Mammedov
2025-02-27 16:00 ` [PATCH v6 17/19] tests/acpi: virt: update HEST and DSDT tables Mauro Carvalho Chehab
2025-02-28 9:33 ` Jonathan Cameron via
2025-02-28 9:33 ` Jonathan Cameron
2025-02-28 9:33 ` Jonathan Cameron via
2025-02-28 9:44 ` Mauro Carvalho Chehab
2025-02-28 13:04 ` Igor Mammedov
2025-02-27 16:00 ` [PATCH v6 18/19] docs: hest: add new "etc/acpi_table_hest_addr" and update workflow Mauro Carvalho Chehab
2025-02-28 9:36 ` Jonathan Cameron via [this message]
2025-02-28 9:36 ` Jonathan Cameron
2025-02-28 9:36 ` Jonathan Cameron via
2025-02-28 13:08 ` Igor Mammedov
2025-02-27 16:00 ` [PATCH v6 19/19] scripts/ghes_inject: add a script to generate GHES error inject Mauro Carvalho Chehab
2025-02-27 16:43 ` [PATCH v6 00/19] Change ghes to use HEST-based offsets and add support for " Igor Mammedov
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=20250228173608.0000779c@huawei.com \
--to=qemu-arm@nongnu.org \
--cc=Jonathan.Cameron@huawei.com \
--cc=gengdongjiu1@gmail.com \
--cc=imammedo@redhat.com \
--cc=linux-kernel@vger.kernel.org \
--cc=mchehab+huawei@kernel.org \
--cc=mst@redhat.com \
--cc=qemu-devel@nongnu.org \
--cc=shiju.jose@huawei.com \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.