From: christoffer.dall@arm.com (Christoffer Dall)
To: linux-arm-kernel@lists.infradead.org
Subject: [PATCH 4/4] arm64: KVM: Implement workaround for Cortex-A76 erratum 1165522
Date: Thu, 8 Nov 2018 21:10:54 +0100 [thread overview]
Message-ID: <20181108201054.GG3835@e113682-lin.lund.arm.com> (raw)
In-Reply-To: <ec7d0e34-022f-125e-00b4-b06be23ee2a8@arm.com>
On Thu, Nov 08, 2018 at 06:05:55PM +0000, Marc Zyngier wrote:
> On 06/11/18 08:15, Christoffer Dall wrote:
> > On Mon, Nov 05, 2018 at 02:36:16PM +0000, Marc Zyngier wrote:
> >> Early versions of Cortex-A76 can end-up with corrupt TLBs if they
> >> speculate an AT instruction in during a guest switch while the
> >> S1/S2 system registers are in an inconsistent state.
> >>
> >> Work around it by:
> >> - Mandating VHE
> >> - Make sure that S1 and S2 system registers are consistent before
> >> clearing HCR_EL2.TGE, which allows AT to target the EL1 translation
> >> regime
> >>
> >> These two things together ensure that we cannot hit this erratum.
> >>
> >> Signed-off-by: Marc Zyngier <marc.zyngier@arm.com>
> >> ---
> >> Documentation/arm64/silicon-errata.txt | 1 +
> >> arch/arm64/Kconfig | 12 ++++++++++++
> >> arch/arm64/include/asm/cpucaps.h | 3 ++-
> >> arch/arm64/include/asm/kvm_host.h | 3 +++
> >> arch/arm64/include/asm/kvm_hyp.h | 6 ++++++
> >> arch/arm64/kernel/cpu_errata.c | 8 ++++++++
> >> arch/arm64/kvm/hyp/switch.c | 14 ++++++++++++++
> >> 7 files changed, 46 insertions(+), 1 deletion(-)
> >>
> >> diff --git a/Documentation/arm64/silicon-errata.txt b/Documentation/arm64/silicon-errata.txt
> >> index 76ccded8b74c..04f0bc4690c6 100644
> >> --- a/Documentation/arm64/silicon-errata.txt
> >> +++ b/Documentation/arm64/silicon-errata.txt
> >> @@ -57,6 +57,7 @@ stable kernels.
> >> | ARM | Cortex-A73 | #858921 | ARM64_ERRATUM_858921 |
> >> | ARM | Cortex-A55 | #1024718 | ARM64_ERRATUM_1024718 |
> >> | ARM | Cortex-A76 | #1188873 | ARM64_ERRATUM_1188873 |
> >> +| ARM | Cortex-A76 | #1165522 | ARM64_ERRATUM_1165522 |
> >> | ARM | MMU-500 | #841119,#826419 | N/A |
> >> | | | | |
> >> | Cavium | ThunderX ITS | #22375, #24313 | CAVIUM_ERRATUM_22375 |
> >> diff --git a/arch/arm64/Kconfig b/arch/arm64/Kconfig
> >> index 787d7850e064..a68bc6cc2167 100644
> >> --- a/arch/arm64/Kconfig
> >> +++ b/arch/arm64/Kconfig
> >> @@ -497,6 +497,18 @@ config ARM64_ERRATUM_1188873
> >>
> >> If unsure, say Y.
> >>
> >> +config ARM64_ERRATUM_1165522
> >> + bool "Cortex-A76: Speculative AT instruction using out-of-context translation regime could cause subsequent request to generate an incorrect translation"
> >> + default y
> >> + help
> >> + This option adds work arounds for ARM Cortex-A76 erratum 1165522
> >> +
> >> + Affected Cortex-A76 cores (r0p0, r1p0, r2p0) could end-up with
> >> + corrupted TLBs by speculating an AT instruction during a guest
> >> + context switch.
> >> +
> >> + If unsure, say Y.
> >> +
> >> config CAVIUM_ERRATUM_22375
> >> bool "Cavium erratum 22375, 24313"
> >> default y
> >> diff --git a/arch/arm64/include/asm/cpucaps.h b/arch/arm64/include/asm/cpucaps.h
> >> index 6e2d254c09eb..62d8cd15fdf2 100644
> >> --- a/arch/arm64/include/asm/cpucaps.h
> >> +++ b/arch/arm64/include/asm/cpucaps.h
> >> @@ -54,7 +54,8 @@
> >> #define ARM64_HAS_CRC32 33
> >> #define ARM64_SSBS 34
> >> #define ARM64_WORKAROUND_1188873 35
> >> +#define ARM64_WORKAROUND_1165522 36
> >>
> >> -#define ARM64_NCAPS 36
> >> +#define ARM64_NCAPS 37
> >>
> >> #endif /* __ASM_CPUCAPS_H */
> >> diff --git a/arch/arm64/include/asm/kvm_host.h b/arch/arm64/include/asm/kvm_host.h
> >> index 7d6e974d024a..8f486026ac87 100644
> >> --- a/arch/arm64/include/asm/kvm_host.h
> >> +++ b/arch/arm64/include/asm/kvm_host.h
> >> @@ -435,6 +435,9 @@ static inline bool kvm_arch_sve_requires_vhe(void)
> >> static inline bool kvm_arch_impl_requires_vhe(void)
> >> {
> >> /* Some implementations have defects that confine them to VHE */
> >> + if (cpus_have_cap(ARM64_WORKAROUND_1165522))
> >> + return true;
> >> +
> >> return false;
> >> }
> >>
> >> diff --git a/arch/arm64/include/asm/kvm_hyp.h b/arch/arm64/include/asm/kvm_hyp.h
> >> index 23aca66767f9..9681360d0959 100644
> >> --- a/arch/arm64/include/asm/kvm_hyp.h
> >> +++ b/arch/arm64/include/asm/kvm_hyp.h
> >> @@ -163,6 +163,12 @@ static __always_inline void __hyp_text __load_guest_stage2(struct kvm *kvm)
> >> {
> >> write_sysreg(kvm->arch.vtcr, vtcr_el2);
> >> write_sysreg(kvm->arch.vttbr, vttbr_el2);
> >> +
> >> + /*
> >> + * ARM erratum 1165522 requires the actual execution of the
> >> + * above before we can switch to the guest translation regime.
> >> + */
> >
> > Is it about a guest translation 'regime' or should this just say before
> > we can enable stage 2 translation?
>
> No, this isn't strictly about enabling stage-2 translation. This is
> about making sure that anything that impacts the guest translations is
> actually executed.
>
> I wonder if it would be clearer to move this outside of
> __load_guest_stage2 and make it explicit in the callers of this helper...
I think it makes sense to have this here to explain the alternative.
But it's the 'switch to guest translation regime' thing that bothers me
a bit. Is that an architectural concept (I thought we only had EL1 and
EL2 translation regimes in stage 1, and then stage 2 translations). So
When you say 'guest translation regime' I'm just not entirely sure what
that means, unless I'm missing something.
> >
> >> + asm(ALTERNATIVE("nop", "isb", ARM64_WORKAROUND_1165522));
> >> }
> >>
> >> #endif /* __ARM64_KVM_HYP_H__ */
> >> diff --git a/arch/arm64/kernel/cpu_errata.c b/arch/arm64/kernel/cpu_errata.c
> >> index a509e35132d2..476e738e6c46 100644
> >> --- a/arch/arm64/kernel/cpu_errata.c
> >> +++ b/arch/arm64/kernel/cpu_errata.c
> >> @@ -739,6 +739,14 @@ const struct arm64_cpu_capabilities arm64_errata[] = {
> >> .capability = ARM64_WORKAROUND_1188873,
> >> ERRATA_MIDR_RANGE(MIDR_CORTEX_A76, 0, 0, 2, 0),
> >> },
> >> +#endif
> >> +#ifdef CONFIG_ARM64_ERRATUM_1165522
> >> + {
> >> + /* Cortex-A76 r0p0 to r2p0 */
> >> + .desc = "ARM erratum 1165522",
> >> + .capability = ARM64_WORKAROUND_1165522,
> >> + ERRATA_MIDR_RANGE(MIDR_CORTEX_A76, 0, 0, 2, 0),
> >> + },
> >> #endif
> >> {
> >> }
> >> diff --git a/arch/arm64/kvm/hyp/switch.c b/arch/arm64/kvm/hyp/switch.c
> >> index 51d5d966d9e5..322109183853 100644
> >> --- a/arch/arm64/kvm/hyp/switch.c
> >> +++ b/arch/arm64/kvm/hyp/switch.c
> >> @@ -143,6 +143,13 @@ static void deactivate_traps_vhe(void)
> >> {
> >> extern char vectors[]; /* kernel exception vectors */
> >> write_sysreg(HCR_HOST_VHE_FLAGS, hcr_el2);
> >> +
> >> + /*
> >> + * ARM erratum 1165522 requires the actual execution of the
> >> + * above before we can switch to the host translation regime.
> >> + */
> >
> > same here, is it not rather about disabling stage 2 translation than
> > about the host (EL2 and EL0 stage 1) translation regimes?
> >
> >> + asm(ALTERNATIVE("nop", "isb", ARM64_WORKAROUND_1165522));
> >> +
> >> write_sysreg(CPACR_EL1_DEFAULT, cpacr_el1);
> >> write_sysreg(vectors, vbar_el1);
> >> }
> >> @@ -499,6 +506,13 @@ int kvm_vcpu_run_vhe(struct kvm_vcpu *vcpu)
> >>
> >> sysreg_save_host_state_vhe(host_ctxt);
> >>
> >> + /*
> >> + * ARM erratum 1165522 requires us to have all the translation
> >> + * context in place before we clear HCR_TGE. All the offending
> >> + * guest sysregs are loaded in kvm_vcpu_load_sysregs, and
> >> + * __activate_vm has the stage-2 configuration. Once this is
> >> + * done, __activate_trap clears HCR_TGE (among other things).
> >> + */
> >
> > I'm not this comment is needed or is helpful here. For example, I don't
> > understand what you mean with the offending guest sysregs and how that
>
> TTBR*, TCR, SCTLR... Anything that deals with stage-1 translations.
>
> > relates to the problem of configuring stage 2 before clearing TGE. Is
> > this about gettting the stage 1 configuration in place first?
>
> Not only stage-1. Stage-2 is involved as well, as the CPU could
> otherwise end-up with the wrong translations (bypassing stage-2 altogether).
>
> >
> > If so, could I suggest a reword along the lines of:
> >
> > /*
> > * ARM erratum 1165522 requires us to configure both stage 1 and
> > * stage 2 translation for the guest context before we clear
> > * HCR_EL2.TGE.
> > *
> > * We have already configured the guest's stage 1 translation in
> > * kvm_vcpu_load_sysregs above. We must now call __activate_vm
> > * before __activate_traps, because __activate_vm configures
> > * stage 2 translation, and __activate_traps clear HCR_EL2.TGE
> > * (among other things).
> > */
>
> Works for me (and shows that contrary to what you wrote above, you have
> perfectly understood the problem)!.
>
I may have actually understood the problem by writing up that piece of
commentary.
Great!
Thanks,
Christoffer
next prev parent reply other threads:[~2018-11-08 20:10 UTC|newest]
Thread overview: 22+ messages / expand[flat|nested] mbox.gz Atom feed top
2018-11-05 14:36 [PATCH 0/4] Workaround for Cortex-A76 erratum 1165522 Marc Zyngier
2018-11-05 14:36 ` [PATCH 1/4] KVM: arm64: Rework detection of SVE, !VHE systems Marc Zyngier
2018-11-06 7:52 ` Christoffer Dall
2018-11-08 19:14 ` Dave Martin
2018-11-05 14:36 ` [PATCH 2/4] KVM: arm64: Allow implementations to be confined to using VHE Marc Zyngier
2018-11-06 7:53 ` Christoffer Dall
2018-11-08 17:51 ` Marc Zyngier
2018-11-08 19:23 ` Dave Martin
2018-11-05 14:36 ` [PATCH 3/4] arm64: KVM: Install stage-2 translation before enabling traps on VHE Marc Zyngier
2018-11-06 8:06 ` Christoffer Dall
2018-11-08 17:54 ` Marc Zyngier
2018-11-05 14:36 ` [PATCH 4/4] arm64: KVM: Implement workaround for Cortex-A76 erratum 1165522 Marc Zyngier
2018-11-05 18:34 ` James Morse
2018-11-08 17:18 ` Marc Zyngier
2018-11-09 11:39 ` James Morse
2018-11-06 8:15 ` Christoffer Dall
2018-11-08 18:05 ` Marc Zyngier
2018-11-08 20:10 ` Christoffer Dall [this message]
2018-11-22 16:13 ` Marc Zyngier
2018-11-23 9:57 ` Christoffer Dall
2018-11-21 12:23 ` Julien Grall
2018-11-21 12:58 ` Marc Zyngier
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=20181108201054.GG3835@e113682-lin.lund.arm.com \
--to=christoffer.dall@arm.com \
--cc=linux-arm-kernel@lists.infradead.org \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).