Kernel KVM virtualization development
 help / color / mirror / Atom feed
From: sashiko-bot@kernel.org
To: "David Woodhouse" <dwmw2@infradead.org>
Cc: kvm@vger.kernel.org
Subject: Re: [PATCH v5 19/34] KVM: x86: Kill last_tsc_{nsec,write,offset} fields
Date: Mon, 08 Jun 2026 18:53:00 +0000	[thread overview]
Message-ID: <20260608185301.326AC1F00893@smtp.kernel.org> (raw)
In-Reply-To: <20260608145455.89187-20-dwmw2@infradead.org>

Thank you for your contribution! Sashiko AI review found 4 potential issue(s) to consider:

New issues:
- [High] Mathematical flaw in S4 host resume TSC adjustment permanently breaks cross-vCPU synchronization.
- [High] Lockless read-modify-write of `cur_tsc_write` introduces a race condition with concurrent `KVM_SET_TSC` ioctls.
- [Medium] The commit silently changes the logic for TSC matching after an S4 resume without documenting the functional change.

Pre-existing issues:
- [Critical] Unprotected iteration over `vm_list` in CPU hotplug path leads to Use-After-Free.
--

commit d9bfcd24b7d69aa51689545f7274022cd36bbee8
Author: David Woodhouse <dwmw@amazon.co.uk>

KVM: x86: Remove last_tsc_{nsec,write,khz,offset,scaling_ratio}

This commit removes the redundant last_tsc_* variables, as they pointlessly
duplicate the cur_tsc_* values used for TSC generation tracking. The patch
renames the remaining properties to reflect the current generation and
updates the S4 host suspend logic.

> These pointlessly duplicate the cur_tsc_{nsec,write,offset} values.
> The only place they were used was where the TSC is stable and a new
> vCPU is being synchronized to the previous setting, in which case the
> cur_tsc_* value is definitely identical.
> 
> Rename last_tsc_khz and last_tsc_scaling_ratio to cur_tsc_khz and
> cur_tsc_scaling_ratio respectively, since they are properties of the
> current TSC generation.

[Severity: Medium]
Does this commit also introduce a functional change to the S4 host suspend
logic?

The commit message describes a pure cleanup of duplicated variables, but the
patch removes the explicit zeroing of last_tsc_nsec and last_tsc_write (which
disabled TSC matching) and replaces it with a math adjustment to
cur_tsc_write. Should this behavioral change be documented in the commit
message?

[ ... ]

> diff --git a/arch/x86/kvm/x86.c b/arch/x86/kvm/x86.c
> index c8c0633263fb..bbd642e0dc54 100644
> --- a/arch/x86/kvm/x86.c
> +++ b/arch/x86/kvm/x86.c

[ ... ]

> @@ -13543,13 +13539,15 @@ int kvm_arch_enable_virtualization_cpu(void)
>  			}
>  
>  			/*
> -			 * We have to disable TSC offset matching.. if you were
> -			 * booting a VM while issuing an S4 host suspend....
> -			 * you may have some problem.  Solving this issue is
> -			 * left as an exercise to the reader.
> +			 * Adjust the TSC matching reference by the same
> +			 * delta applied to each vCPU's offset, so that
> +			 * future KVM_SET_TSC / vCPU creation still matches
> +			 * correctly against the adjusted TSC timeline.
> +			 * Scale from host to guest TSC rate.
>  			 */

[Severity: Critical]
This isn't a bug introduced by this patch, but does the iteration over vm_list
in kvm_arch_enable_virtualization_cpu() need to hold kvm_lock?

Since kvm_arch_enable_virtualization_cpu() is called from the CPU hotplug
callback, it runs under the CPU hotplug lock but doesn't seem to serialize
against the kvm_lock mutex. If a VM is destroyed concurrently via
kvm_destroy_vm(), could this lockless iteration dereference a freed KVM
instance?

> -			kvm->arch.last_tsc_nsec = 0;
> -			kvm->arch.last_tsc_write = 0;
> +			kvm->arch.cur_tsc_write -=
> +				kvm_scale_tsc(delta_cyc,
> +					      kvm->arch.cur_tsc_scaling_ratio);

[Severity: High]
Does this S4 resume adjustment to cur_tsc_write permanently break cross-vCPU
synchronization by omitting the symmetrical adjustment to cur_tsc_offset?

During S4 resume, existing vCPUs compensate their offsets via delta_cyc.
When a new vCPU is subsequently created or KVM_SET_TSC is called,
kvm_synchronize_tsc() assigns the new vCPU's offset directly from
kvm->arch.cur_tsc_offset:

kvm_synchronize_tsc() {
    ...
    offset = kvm->arch.cur_tsc_offset;
    ...
}

Because cur_tsc_offset lacks the delta_cyc compensation applied to existing
vCPUs, won't the new vCPU's TSC timeline be permanently shifted backward by
delta_cyc relative to the existing vCPUs?


[Severity: High]
Can this lockless read-modify-write operation race with concurrent KVM_SET_TSC
ioctls?

Since CPU hotplug can occur while KVM is running VMs, if the platform
registers a backwards TSC during CPU bringup, this path executes concurrently
with userspace ioctls like KVM_SET_TSC.

KVM_SET_TSC properly acquires kvm->arch.tsc_write_lock before updating
cur_tsc_write and offsets. Could this lockless subtraction tear the 64-bit
value or overwrite a brand-new TSC generation state from KVM_SET_TSC?

>  		}
>  
>  	}

-- 
Sashiko AI review · https://sashiko.dev/#/patchset/20260608145455.89187-1-dwmw2@infradead.org?part=19

  reply	other threads:[~2026-06-08 18:53 UTC|newest]

Thread overview: 62+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2026-06-08 14:47 [PATCH v5 00/34] Cleaning up the KVM clock mess David Woodhouse
2026-06-08 14:47 ` [PATCH v5 01/34] KVM: x86/xen: Do not corrupt KVM clock in kvm_xen_shared_info_init() David Woodhouse
2026-06-08 14:47 ` [PATCH v5 02/34] KVM: x86: Improve accuracy of KVM clock when TSC scaling is in force David Woodhouse
2026-06-08 14:47 ` [PATCH v5 03/34] UAPI: x86: Move pvclock-abi to UAPI for x86 platforms David Woodhouse
2026-06-08 14:47 ` [PATCH v5 04/34] KVM: x86: Add KVM_[GS]ET_CLOCK_GUEST for accurate KVM clock migration David Woodhouse
2026-06-08 15:33   ` sashiko-bot
2026-06-08 14:47 ` [PATCH v5 05/34] KVM: selftests: Add KVM/PV clock selftest to prove timer correction David Woodhouse
2026-06-08 15:49   ` sashiko-bot
2026-06-08 14:47 ` [PATCH v5 06/34] KVM: x86: Explicitly disable TSC scaling without CONSTANT_TSC David Woodhouse
2026-06-08 14:47 ` [PATCH v5 07/34] KVM: x86: Activate master clock immediately on vCPU creation David Woodhouse
2026-06-08 16:27   ` sashiko-bot
2026-06-08 23:29     ` David Woodhouse
2026-06-08 14:47 ` [PATCH v5 08/34] KVM: x86: Add KVM_VCPU_TSC_SCALE and fix the documentation on TSC migration David Woodhouse
2026-06-08 16:39   ` sashiko-bot
2026-06-08 14:47 ` [PATCH v5 09/34] KVM: x86: Avoid NTP frequency skew for KVM clock on 32-bit host David Woodhouse
2026-06-08 14:47 ` [PATCH v5 10/34] KVM: x86: Fold __get_kvmclock() into get_kvmclock() David Woodhouse
2026-06-08 14:47 ` [PATCH v5 11/34] KVM: x86: Restructure get_kvmclock() David Woodhouse
2026-06-08 14:47 ` [PATCH v5 12/34] KVM: x86: Fix KVM clock precision in get_kvmclock() with TSC scaling David Woodhouse
2026-06-08 17:39   ` sashiko-bot
2026-06-08 23:43     ` David Woodhouse
2026-06-08 14:47 ` [PATCH v5 13/34] KVM: x86: Use get_kvmclock() in kvm_get_wall_clock_epoch() David Woodhouse
2026-06-08 14:47 ` [PATCH v5 14/34] KVM: x86: Fix compute_guest_tsc() to handle negative time deltas David Woodhouse
2026-06-08 17:59   ` sashiko-bot
2026-06-09  0:02     ` David Woodhouse
2026-06-08 14:47 ` [PATCH v5 15/34] KVM: x86: Restructure kvm_guest_time_update() for TSC upscaling David Woodhouse
2026-06-08 18:13   ` sashiko-bot
2026-06-08 14:47 ` [PATCH v5 16/34] KVM: x86: Simplify and comment kvm_get_time_scale() David Woodhouse
2026-06-08 14:47 ` [PATCH v5 17/34] KVM: x86: Remove implicit rdtsc() from kvm_compute_l1_tsc_offset() David Woodhouse
2026-06-08 14:47 ` [PATCH v5 18/34] KVM: x86: Improve synchronization in kvm_synchronize_tsc() David Woodhouse
2026-06-08 18:39   ` sashiko-bot
2026-06-09  0:14     ` David Woodhouse
2026-06-08 14:48 ` [PATCH v5 19/34] KVM: x86: Kill last_tsc_{nsec,write,offset} fields David Woodhouse
2026-06-08 18:53   ` sashiko-bot [this message]
2026-06-09  0:34     ` David Woodhouse
2026-06-08 14:48 ` [PATCH v5 20/34] KVM: x86: Replace nr_vcpus_matched_tsc count with all_vcpus_matched_tsc bool David Woodhouse
2026-06-08 14:48 ` [PATCH v5 21/34] KVM: x86: Allow KVM master clock mode when TSCs are offset from each other David Woodhouse
2026-06-08 19:15   ` sashiko-bot
2026-06-08 14:48 ` [PATCH v5 22/34] KVM: selftests: Add master clock offset test David Woodhouse
2026-06-08 19:26   ` sashiko-bot
2026-06-09  0:50     ` David Woodhouse
2026-06-08 14:48 ` [PATCH v5 23/34] KVM: x86: Factor out kvm_use_master_clock() David Woodhouse
2026-06-08 14:48 ` [PATCH v5 24/34] KVM: x86: Avoid gratuitous global clock updates David Woodhouse
2026-06-08 14:48 ` [PATCH v5 25/34] KVM: x86/xen: Prevent runstate times from becoming negative David Woodhouse
2026-06-08 19:58   ` sashiko-bot
2026-06-09  1:02     ` David Woodhouse
2026-06-08 14:48 ` [PATCH v5 26/34] KVM: x86: Avoid redundant masterclock updates from multiple vCPUs David Woodhouse
2026-06-08 20:11   ` sashiko-bot
2026-06-09  1:34     ` David Woodhouse
2026-06-08 14:48 ` [PATCH v5 27/34] KVM: x86: Remove runtime Xen TSC frequency CPUID update David Woodhouse
2026-06-08 14:48 ` [PATCH v5 28/34] KVM: selftests: Add Xen/generic CPUID timing leaf test David Woodhouse
2026-06-09  0:27   ` sashiko-bot
2026-06-09  7:02     ` David Woodhouse
2026-06-08 14:48 ` [PATCH v5 29/34] KVM: x86: Re-synchronize TSC after KVM_SET_TSC_KHZ David Woodhouse
2026-06-09  0:37   ` sashiko-bot
2026-06-08 14:48 ` [PATCH v5 30/34] KVM: selftests: Add Xen runstate migration test David Woodhouse
2026-06-09  0:50   ` sashiko-bot
2026-06-08 14:48 ` [PATCH v5 31/34] KVM: x86: Use ktime_get_snapshot_id() for master clock David Woodhouse
2026-06-09  1:03   ` sashiko-bot
2026-06-08 14:48 ` [PATCH v5 32/34] KVM: x86: Compute kvmclock base without pvclock_gtod_data David Woodhouse
2026-06-08 14:48 ` [PATCH v5 33/34] KVM: x86: Replace pvclock_gtod_data vclock_mode with boolean David Woodhouse
2026-06-09  1:23   ` sashiko-bot
2026-06-08 14:48 ` [PATCH v5 34/34] KVM: x86: Remove pvclock_gtod_data and private timekeeping code David Woodhouse

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=20260608185301.326AC1F00893@smtp.kernel.org \
    --to=sashiko-bot@kernel.org \
    --cc=dwmw2@infradead.org \
    --cc=kvm@vger.kernel.org \
    --cc=sashiko-reviews@lists.linux.dev \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox