From: sashiko-bot@kernel.org
To: "David Woodhouse" <dwmw2@infradead.org>
Cc: kvm@vger.kernel.org
Subject: Re: [PATCH v5 15/34] KVM: x86: Restructure kvm_guest_time_update() for TSC upscaling
Date: Mon, 08 Jun 2026 18:13:40 +0000 [thread overview]
Message-ID: <20260608181340.A508D1F00893@smtp.kernel.org> (raw)
In-Reply-To: <20260608145455.89187-16-dwmw2@infradead.org>
Thank you for your contribution! Sashiko AI review found 2 potential issue(s) to consider:
- [High] Dropping the `local_irq_save()` protection around the TSC catchup calculation exposes an unprotected Read-Modify-Write (RMW) of `l1_tsc_offset`, leading to guest TSC corruption on systems with unstable TSC.
- [Medium] The commit message claims a WARN is added for `kvm_get_time_and_clockread()` unexpected failures, but the implementation silently handles it.
--
commit f05935e83e34b920673a8039b5e1b6c7443ffe07
Author: David Woodhouse <dwmw@amazon.co.uk>
Restructure kvm_guest_time_update() so that kernel_ns/host_tsc are
always "now" when doing TSC catchup, then swap in the master clock
reference values afterward for the hv_clock.
This makes the TSC upscaling code considerably simpler: the catchup
adjustment is computed as the delta between what the guest TSC *should*
be at "now" and what it actually is, rather than mixing "now" and
"master clock reference" timestamps.
> The seqcount loop now also contains the kvm_get_time_and_clockread()
> call (matching get_kvmclock's pattern), with the same WARN for
> unexpected failure.
[Severity: Medium]
The commit message mentions that a WARN is added for unexpected failures of
kvm_get_time_and_clockread(), but the implementation inside the
CONFIG_X86_64 block seems to silently handle the failure by clearing
use_master_clock. Was the WARN intentionally omitted from the patch?
> diff --git a/arch/x86/kvm/x86.c b/arch/x86/kvm/x86.c
> index 8aae22401046..92e32d720523 100644
> --- a/arch/x86/kvm/x86.c
> +++ b/arch/x86/kvm/x86.c
[ ... ]
> @@ -3363,46 +3363,63 @@ static void kvm_setup_guest_pvclock(struct pvclock_vcpu_time_info *ref_hv_clock,
[ ... ]
> - /* Keep irq disabled to prevent changes to the clock */
> - local_irq_save(flags);
> - tgt_tsc_hz = (u64)get_cpu_tsc_khz() * 1000;
> if (unlikely(tgt_tsc_hz == 0)) {
> - local_irq_restore(flags);
> kvm_make_request(KVM_REQ_CLOCK_UPDATE, v);
> return 1;
> }
> +
> if (!use_master_clock) {
> + unsigned long flags;
> +
> + local_irq_save(flags);
> host_tsc = rdtsc();
> kernel_ns = get_kvmclock_base_ns();
> + local_irq_restore(flags);
> }
>
> - tsc_timestamp = kvm_read_l1_tsc(v, host_tsc);
> -
> /*
> * We may have to catch up the TSC to match elapsed wall clock
> * time for two reasons, even if kvmclock is used.
> @@ -3411,17 +3428,32 @@ int kvm_guest_time_update(struct kvm_vcpu *v)
> * entry to avoid unknown leaps of TSC even when running
> * again on the same CPU. This may cause apparent elapsed
> * time to disappear, and the guest to stand still or run
> - * very slowly.
> + * very slowly.
> */
> if (vcpu->tsc_catchup) {
> - u64 tsc = compute_guest_tsc(v, kernel_ns);
> - if (tsc > tsc_timestamp) {
> - adjust_tsc_offset_guest(v, tsc - tsc_timestamp);
> - tsc_timestamp = tsc;
> - }
> + s64 adjustment;
> +
> + /*
> + * Calculate the delta between what the guest TSC *should* be
> + * and what it actually is according to kvm_read_l1_tsc().
> + */
> + adjustment = compute_guest_tsc(v, kernel_ns) -
> + kvm_read_l1_tsc(v, host_tsc);
> + if (adjustment > 0)
> + adjust_tsc_offset_guest(v, adjustment);
[Severity: High]
Does dropping the local_irq_save() protection around the TSC catchup calculation
expose an unprotected read-modify-write of l1_tsc_offset?
If preemption occurs inside adjust_tsc_offset_guest() after reading the offset:
arch/x86/kvm/x86.c:adjust_tsc_offset_guest() {
u64 tsc_offset = vcpu->arch.l1_tsc_offset;
/* <--- preemption here */
kvm_vcpu_write_tsc_offset(vcpu, tsc_offset + adjustment);
}
And the vCPU migrates to a new CPU with an unstable TSC, KVM's preempt
notifiers would call kvm_arch_vcpu_load() and write a new compensation
offset. When this thread resumes, wouldn't it write back the stale base
offset and overwrite the preempt notifier's compensation?
Since the code only corrects positive adjustments, the corrupted offset
would remain, potentially causing the guest TSC to leap forward.
> }
>
> - local_irq_restore(flags);
--
Sashiko AI review · https://sashiko.dev/#/patchset/20260608145455.89187-1-dwmw2@infradead.org?part=15
next prev parent reply other threads:[~2026-06-08 18:13 UTC|newest]
Thread overview: 62+ messages / expand[flat|nested] mbox.gz Atom feed top
2026-06-08 14:47 [PATCH v5 00/34] Cleaning up the KVM clock mess David Woodhouse
2026-06-08 14:47 ` [PATCH v5 01/34] KVM: x86/xen: Do not corrupt KVM clock in kvm_xen_shared_info_init() David Woodhouse
2026-06-08 14:47 ` [PATCH v5 02/34] KVM: x86: Improve accuracy of KVM clock when TSC scaling is in force David Woodhouse
2026-06-08 14:47 ` [PATCH v5 03/34] UAPI: x86: Move pvclock-abi to UAPI for x86 platforms David Woodhouse
2026-06-08 14:47 ` [PATCH v5 04/34] KVM: x86: Add KVM_[GS]ET_CLOCK_GUEST for accurate KVM clock migration David Woodhouse
2026-06-08 15:33 ` sashiko-bot
2026-06-08 14:47 ` [PATCH v5 05/34] KVM: selftests: Add KVM/PV clock selftest to prove timer correction David Woodhouse
2026-06-08 15:49 ` sashiko-bot
2026-06-08 14:47 ` [PATCH v5 06/34] KVM: x86: Explicitly disable TSC scaling without CONSTANT_TSC David Woodhouse
2026-06-08 14:47 ` [PATCH v5 07/34] KVM: x86: Activate master clock immediately on vCPU creation David Woodhouse
2026-06-08 16:27 ` sashiko-bot
2026-06-08 23:29 ` David Woodhouse
2026-06-08 14:47 ` [PATCH v5 08/34] KVM: x86: Add KVM_VCPU_TSC_SCALE and fix the documentation on TSC migration David Woodhouse
2026-06-08 16:39 ` sashiko-bot
2026-06-08 14:47 ` [PATCH v5 09/34] KVM: x86: Avoid NTP frequency skew for KVM clock on 32-bit host David Woodhouse
2026-06-08 14:47 ` [PATCH v5 10/34] KVM: x86: Fold __get_kvmclock() into get_kvmclock() David Woodhouse
2026-06-08 14:47 ` [PATCH v5 11/34] KVM: x86: Restructure get_kvmclock() David Woodhouse
2026-06-08 14:47 ` [PATCH v5 12/34] KVM: x86: Fix KVM clock precision in get_kvmclock() with TSC scaling David Woodhouse
2026-06-08 17:39 ` sashiko-bot
2026-06-08 23:43 ` David Woodhouse
2026-06-08 14:47 ` [PATCH v5 13/34] KVM: x86: Use get_kvmclock() in kvm_get_wall_clock_epoch() David Woodhouse
2026-06-08 14:47 ` [PATCH v5 14/34] KVM: x86: Fix compute_guest_tsc() to handle negative time deltas David Woodhouse
2026-06-08 17:59 ` sashiko-bot
2026-06-09 0:02 ` David Woodhouse
2026-06-08 14:47 ` [PATCH v5 15/34] KVM: x86: Restructure kvm_guest_time_update() for TSC upscaling David Woodhouse
2026-06-08 18:13 ` sashiko-bot [this message]
2026-06-08 14:47 ` [PATCH v5 16/34] KVM: x86: Simplify and comment kvm_get_time_scale() David Woodhouse
2026-06-08 14:47 ` [PATCH v5 17/34] KVM: x86: Remove implicit rdtsc() from kvm_compute_l1_tsc_offset() David Woodhouse
2026-06-08 14:47 ` [PATCH v5 18/34] KVM: x86: Improve synchronization in kvm_synchronize_tsc() David Woodhouse
2026-06-08 18:39 ` sashiko-bot
2026-06-09 0:14 ` David Woodhouse
2026-06-08 14:48 ` [PATCH v5 19/34] KVM: x86: Kill last_tsc_{nsec,write,offset} fields David Woodhouse
2026-06-08 18:53 ` sashiko-bot
2026-06-09 0:34 ` David Woodhouse
2026-06-08 14:48 ` [PATCH v5 20/34] KVM: x86: Replace nr_vcpus_matched_tsc count with all_vcpus_matched_tsc bool David Woodhouse
2026-06-08 14:48 ` [PATCH v5 21/34] KVM: x86: Allow KVM master clock mode when TSCs are offset from each other David Woodhouse
2026-06-08 19:15 ` sashiko-bot
2026-06-08 14:48 ` [PATCH v5 22/34] KVM: selftests: Add master clock offset test David Woodhouse
2026-06-08 19:26 ` sashiko-bot
2026-06-09 0:50 ` David Woodhouse
2026-06-08 14:48 ` [PATCH v5 23/34] KVM: x86: Factor out kvm_use_master_clock() David Woodhouse
2026-06-08 14:48 ` [PATCH v5 24/34] KVM: x86: Avoid gratuitous global clock updates David Woodhouse
2026-06-08 14:48 ` [PATCH v5 25/34] KVM: x86/xen: Prevent runstate times from becoming negative David Woodhouse
2026-06-08 19:58 ` sashiko-bot
2026-06-09 1:02 ` David Woodhouse
2026-06-08 14:48 ` [PATCH v5 26/34] KVM: x86: Avoid redundant masterclock updates from multiple vCPUs David Woodhouse
2026-06-08 20:11 ` sashiko-bot
2026-06-09 1:34 ` David Woodhouse
2026-06-08 14:48 ` [PATCH v5 27/34] KVM: x86: Remove runtime Xen TSC frequency CPUID update David Woodhouse
2026-06-08 14:48 ` [PATCH v5 28/34] KVM: selftests: Add Xen/generic CPUID timing leaf test David Woodhouse
2026-06-09 0:27 ` sashiko-bot
2026-06-09 7:02 ` David Woodhouse
2026-06-08 14:48 ` [PATCH v5 29/34] KVM: x86: Re-synchronize TSC after KVM_SET_TSC_KHZ David Woodhouse
2026-06-09 0:37 ` sashiko-bot
2026-06-08 14:48 ` [PATCH v5 30/34] KVM: selftests: Add Xen runstate migration test David Woodhouse
2026-06-09 0:50 ` sashiko-bot
2026-06-08 14:48 ` [PATCH v5 31/34] KVM: x86: Use ktime_get_snapshot_id() for master clock David Woodhouse
2026-06-09 1:03 ` sashiko-bot
2026-06-08 14:48 ` [PATCH v5 32/34] KVM: x86: Compute kvmclock base without pvclock_gtod_data David Woodhouse
2026-06-08 14:48 ` [PATCH v5 33/34] KVM: x86: Replace pvclock_gtod_data vclock_mode with boolean David Woodhouse
2026-06-09 1:23 ` sashiko-bot
2026-06-08 14:48 ` [PATCH v5 34/34] KVM: x86: Remove pvclock_gtod_data and private timekeeping code David Woodhouse
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=20260608181340.A508D1F00893@smtp.kernel.org \
--to=sashiko-bot@kernel.org \
--cc=dwmw2@infradead.org \
--cc=kvm@vger.kernel.org \
--cc=sashiko-reviews@lists.linux.dev \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox