* [PATCH 0/2] RCU related fix based on kvm-coco-queue.
@ 2024-11-04 8:41 Yan Zhao
2024-11-04 8:42 ` [PATCH 1/2] KVM: x86/tdp_mmu: Use rcu_dereference() to protect sptep for dereferencing Yan Zhao
2024-11-04 8:43 ` [PATCH 2/2] KVM: guest_memfd: Remove RCU-protected attribute from slot->gmem.file Yan Zhao
0 siblings, 2 replies; 5+ messages in thread
From: Yan Zhao @ 2024-11-04 8:41 UTC (permalink / raw)
To: pbonzini, seanjc; +Cc: linux-kernel, kvm, Yan Zhao
Hi Paolo and Sean,
The two patches are fixups to RCU related issues reported by sparse.
Patch 1 was reported by kernel test robot related to dereference of
mirrored sptep.
Patch 2 was a result of running sparse locally with error message:
"error: incompatible types in comparison expression (different
address spaces)" in kvm_gmem_get_file() and __kvm_gmem_get_pfn().
Initially, I wanted to fix it by using rcu_access_pointer() to
fetch the rcu-protected pointer slot->gmem.file.
However, after further analysis, I think it's not necessary to
rely on RCU to protect slot->gmem.file.
Please kindly take a look to see if they are appropriate.
Thanks
Yan
Rick Edgecombe (1):
KVM: x86/tdp_mmu: Use rcu_dereference() to protect sptep for
dereferencing
Yan Zhao (1):
KVM: guest_memfd: Remove RCU-protected attribute from slot->gmem.file
arch/x86/kvm/mmu/spte.h | 4 ++--
arch/x86/kvm/mmu/tdp_mmu.c | 6 +++---
include/linux/kvm_host.h | 2 +-
virt/kvm/guest_memfd.c | 23 ++++++++++-------------
4 files changed, 16 insertions(+), 19 deletions(-)
base-commit: 49c492a89914b02fa5011d9ea9848318c6c98dd9
--
2.43.2
^ permalink raw reply [flat|nested] 5+ messages in thread* [PATCH 1/2] KVM: x86/tdp_mmu: Use rcu_dereference() to protect sptep for dereferencing 2024-11-04 8:41 [PATCH 0/2] RCU related fix based on kvm-coco-queue Yan Zhao @ 2024-11-04 8:42 ` Yan Zhao 2024-11-04 8:43 ` [PATCH 2/2] KVM: guest_memfd: Remove RCU-protected attribute from slot->gmem.file Yan Zhao 1 sibling, 0 replies; 5+ messages in thread From: Yan Zhao @ 2024-11-04 8:42 UTC (permalink / raw) To: pbonzini, seanjc Cc: linux-kernel, kvm, Rick Edgecombe, kernel test robot, Yan Zhao From: Rick Edgecombe <rick.p.edgecombe@intel.com> Use rcu_dereference() to copy the RCU-protected pointer sptep into a local variable for later dereferencing. This also checks that the dereferencing occurs within the RCU read-side critical section. Change is_mirror_sptep()'s input type from "u64 *" to "tdp_ptep_t" (typedef as "u64 __rcu *") to centralize the call of rcu_dereference(). Opportunistically, since try_cmpxchg64() is now the only place in __tdp_mmu_set_spte_atomic() that dereferences the local variable, move the rcu_dereference() call closer to its point of use. Reported-by: kernel test robot <lkp@intel.com> Closes: https://lore.kernel.org/oe-kbuild-all/202410121644.Eq7zRGPO-lkp@intel.com Co-developed-by: Yan Zhao <yan.y.zhao@intel.com> Signed-off-by: Yan Zhao <yan.y.zhao@intel.com> Signed-off-by: Rick Edgecombe <rick.p.edgecombe@intel.com> --- arch/x86/kvm/mmu/spte.h | 4 ++-- arch/x86/kvm/mmu/tdp_mmu.c | 6 +++--- 2 files changed, 5 insertions(+), 5 deletions(-) diff --git a/arch/x86/kvm/mmu/spte.h b/arch/x86/kvm/mmu/spte.h index 8496a2abbde2..ef322f972948 100644 --- a/arch/x86/kvm/mmu/spte.h +++ b/arch/x86/kvm/mmu/spte.h @@ -267,9 +267,9 @@ static inline struct kvm_mmu_page *root_to_sp(hpa_t root) return spte_to_child_sp(root); } -static inline bool is_mirror_sptep(u64 *sptep) +static inline bool is_mirror_sptep(tdp_ptep_t sptep) { - return is_mirror_sp(sptep_to_sp(sptep)); + return is_mirror_sp(sptep_to_sp(rcu_dereference(sptep))); } static inline bool is_mmio_spte(struct kvm *kvm, u64 spte) diff --git a/arch/x86/kvm/mmu/tdp_mmu.c b/arch/x86/kvm/mmu/tdp_mmu.c index b0e1c4cb3004..2741b6587ec9 100644 --- a/arch/x86/kvm/mmu/tdp_mmu.c +++ b/arch/x86/kvm/mmu/tdp_mmu.c @@ -511,7 +511,7 @@ static int __must_check set_external_spte_present(struct kvm *kvm, tdp_ptep_t sp * page table has been modified. Use FROZEN_SPTE similar to * the zapping case. */ - if (!try_cmpxchg64(sptep, &old_spte, FROZEN_SPTE)) + if (!try_cmpxchg64(rcu_dereference(sptep), &old_spte, FROZEN_SPTE)) return -EBUSY; /* @@ -637,8 +637,6 @@ static inline int __must_check __tdp_mmu_set_spte_atomic(struct kvm *kvm, struct tdp_iter *iter, u64 new_spte) { - u64 *sptep = rcu_dereference(iter->sptep); - /* * The caller is responsible for ensuring the old SPTE is not a FROZEN * SPTE. KVM should never attempt to zap or manipulate a FROZEN SPTE, @@ -662,6 +660,8 @@ static inline int __must_check __tdp_mmu_set_spte_atomic(struct kvm *kvm, if (ret) return ret; } else { + u64 *sptep = rcu_dereference(iter->sptep); + /* * Note, fast_pf_fix_direct_spte() can also modify TDP MMU SPTEs * and does not hold the mmu_lock. On failure, i.e. if a -- 2.43.2 ^ permalink raw reply related [flat|nested] 5+ messages in thread
* [PATCH 2/2] KVM: guest_memfd: Remove RCU-protected attribute from slot->gmem.file 2024-11-04 8:41 [PATCH 0/2] RCU related fix based on kvm-coco-queue Yan Zhao 2024-11-04 8:42 ` [PATCH 1/2] KVM: x86/tdp_mmu: Use rcu_dereference() to protect sptep for dereferencing Yan Zhao @ 2024-11-04 8:43 ` Yan Zhao 2024-12-22 18:38 ` Paolo Bonzini 1 sibling, 1 reply; 5+ messages in thread From: Yan Zhao @ 2024-11-04 8:43 UTC (permalink / raw) To: pbonzini, seanjc; +Cc: linux-kernel, kvm, Yan Zhao Remove the RCU-protected attribute from slot->gmem.file. No need to use RCU primitives rcu_assign_pointer()/synchronize_rcu() to update this pointer. - slot->gmem.file is updated in 3 places: kvm_gmem_bind(), kvm_gmem_unbind(), kvm_gmem_release(). All of them are protected by kvm->slots_lock. - slot->gmem.file is read in 2 paths: (1) kvm_gmem_populate kvm_gmem_get_file __kvm_gmem_get_pfn (2) kvm_gmem_get_pfn kvm_gmem_get_file __kvm_gmem_get_pfn Path (1) kvm_gmem_populate() requires holding kvm->slots_lock, so slot->gmem.file is protected by the kvm->slots_lock in this path. Path (2) kvm_gmem_get_pfn() does not require holding kvm->slots_lock. However, it's also not guarded by rcu_read_lock() and rcu_read_unlock(). So synchronize_rcu() in kvm_gmem_unbind()/kvm_gmem_release() actually will not wait for the readers in kvm_gmem_get_pfn() due to lack of RCU read-side critical section. The path (2) kvm_gmem_get_pfn() is safe without RCU protection because: a) kvm_gmem_bind() is called on a new memslot, before the memslot is visible to kvm_gmem_get_pfn(). b) kvm->srcu ensures that kvm_gmem_unbind() and freeing of a memslot occur after the memslot is no longer visible to kvm_gmem_get_pfn(). c) get_file_active() ensures that kvm_gmem_get_pfn() will not access the stale file if kvm_gmem_release() sets it to NULL. This is because if kvm_gmem_release() occurs before kvm_gmem_get_pfn(), get_file_active() will return NULL; if get_file_active() does not return NULL, kvm_gmem_release() should not occur until after kvm_gmem_get_pfn() releases the file reference. Signed-off-by: Yan Zhao <yan.y.zhao@intel.com> --- include/linux/kvm_host.h | 2 +- virt/kvm/guest_memfd.c | 23 ++++++++++------------- 2 files changed, 11 insertions(+), 14 deletions(-) diff --git a/include/linux/kvm_host.h b/include/linux/kvm_host.h index c7e4f8be3e17..3c3088a9e336 100644 --- a/include/linux/kvm_host.h +++ b/include/linux/kvm_host.h @@ -600,7 +600,7 @@ struct kvm_memory_slot { #ifdef CONFIG_KVM_PRIVATE_MEM struct { - struct file __rcu *file; + struct file *file; pgoff_t pgoff; } gmem; #endif diff --git a/virt/kvm/guest_memfd.c b/virt/kvm/guest_memfd.c index 651c2f08df62..9d9bf3d033bd 100644 --- a/virt/kvm/guest_memfd.c +++ b/virt/kvm/guest_memfd.c @@ -267,9 +267,7 @@ static int kvm_gmem_release(struct inode *inode, struct file *file) filemap_invalidate_lock(inode->i_mapping); xa_for_each(&gmem->bindings, index, slot) - rcu_assign_pointer(slot->gmem.file, NULL); - - synchronize_rcu(); + WRITE_ONCE(slot->gmem.file, NULL); /* * All in-flight operations are gone and new bindings can be created. @@ -298,8 +296,7 @@ static inline struct file *kvm_gmem_get_file(struct kvm_memory_slot *slot) /* * Do not return slot->gmem.file if it has already been closed; * there might be some time between the last fput() and when - * kvm_gmem_release() clears slot->gmem.file, and you do not - * want to spin in the meanwhile. + * kvm_gmem_release() clears slot->gmem.file. */ return get_file_active(&slot->gmem.file); } @@ -510,11 +507,11 @@ int kvm_gmem_bind(struct kvm *kvm, struct kvm_memory_slot *slot, } /* - * No synchronize_rcu() needed, any in-flight readers are guaranteed to - * be see either a NULL file or this new file, no need for them to go - * away. + * memslots of flag KVM_MEM_GUEST_MEMFD are immutable to change, so + * kvm_gmem_bind() must occur on a new memslot. + * Readers are guaranteed to see this new file. */ - rcu_assign_pointer(slot->gmem.file, file); + WRITE_ONCE(slot->gmem.file, file); slot->gmem.pgoff = start; xa_store_range(&gmem->bindings, start, end - 1, slot, GFP_KERNEL); @@ -550,8 +547,7 @@ void kvm_gmem_unbind(struct kvm_memory_slot *slot) filemap_invalidate_lock(file->f_mapping); xa_store_range(&gmem->bindings, start, end - 1, NULL, GFP_KERNEL); - rcu_assign_pointer(slot->gmem.file, NULL); - synchronize_rcu(); + WRITE_ONCE(slot->gmem.file, NULL); filemap_invalidate_unlock(file->f_mapping); fput(file); @@ -563,11 +559,12 @@ static struct folio *__kvm_gmem_get_pfn(struct file *file, pgoff_t index, kvm_pfn_t *pfn, bool *is_prepared, int *max_order) { + struct file *gmem_file = READ_ONCE(slot->gmem.file); struct kvm_gmem *gmem = file->private_data; struct folio *folio; - if (file != slot->gmem.file) { - WARN_ON_ONCE(slot->gmem.file); + if (file != gmem_file) { + WARN_ON_ONCE(gmem_file); return ERR_PTR(-EFAULT); } -- 2.43.2 ^ permalink raw reply related [flat|nested] 5+ messages in thread
* Re: [PATCH 2/2] KVM: guest_memfd: Remove RCU-protected attribute from slot->gmem.file 2024-11-04 8:43 ` [PATCH 2/2] KVM: guest_memfd: Remove RCU-protected attribute from slot->gmem.file Yan Zhao @ 2024-12-22 18:38 ` Paolo Bonzini 2024-12-23 5:43 ` Yan Zhao 0 siblings, 1 reply; 5+ messages in thread From: Paolo Bonzini @ 2024-12-22 18:38 UTC (permalink / raw) To: Yan Zhao, seanjc; +Cc: linux-kernel, kvm On 11/4/24 09:43, Yan Zhao wrote: > Remove the RCU-protected attribute from slot->gmem.file. No need to use RCU > primitives rcu_assign_pointer()/synchronize_rcu() to update this pointer. > > - slot->gmem.file is updated in 3 places: > kvm_gmem_bind(), kvm_gmem_unbind(), kvm_gmem_release(). > All of them are protected by kvm->slots_lock. > > - slot->gmem.file is read in 2 paths: > (1) kvm_gmem_populate > kvm_gmem_get_file > __kvm_gmem_get_pfn > > (2) kvm_gmem_get_pfn > kvm_gmem_get_file > __kvm_gmem_get_pfn > > Path (1) kvm_gmem_populate() requires holding kvm->slots_lock, so > slot->gmem.file is protected by the kvm->slots_lock in this path. > > Path (2) kvm_gmem_get_pfn() does not require holding kvm->slots_lock. > However, it's also not guarded by rcu_read_lock() and rcu_read_unlock(). > So synchronize_rcu() in kvm_gmem_unbind()/kvm_gmem_release() actually > will not wait for the readers in kvm_gmem_get_pfn() due to lack of RCU > read-side critical section. > > The path (2) kvm_gmem_get_pfn() is safe without RCU protection because: > a) kvm_gmem_bind() is called on a new memslot, before the memslot is > visible to kvm_gmem_get_pfn(). > b) kvm->srcu ensures that kvm_gmem_unbind() and freeing of a memslot > occur after the memslot is no longer visible to kvm_gmem_get_pfn(). > c) get_file_active() ensures that kvm_gmem_get_pfn() will not access the > stale file if kvm_gmem_release() sets it to NULL. This is because if > kvm_gmem_release() occurs before kvm_gmem_get_pfn(), get_file_active() > will return NULL; if get_file_active() does not return NULL, > kvm_gmem_release() should not occur until after kvm_gmem_get_pfn() > releases the file reference. Thanks for the analysis, I added some notes: diff --git a/include/linux/kvm_host.h b/include/linux/kvm_host.h index 4ec2564c0d0f..c788d0bd952a 100644 --- a/include/linux/kvm_host.h +++ b/include/linux/kvm_host.h @@ -602,6 +602,11 @@ struct kvm_memory_slot { #ifdef CONFIG_KVM_PRIVATE_MEM struct { + /* + * Writes protected by kvm->slots_lock. Acquiring a + * reference via kvm_gmem_get_file() is protected by + * either kvm->slots_lock or kvm->srcu. + */ struct file *file; pgoff_t pgoff; } gmem; diff --git a/virt/kvm/guest_memfd.c b/virt/kvm/guest_memfd.c index 9d9bf3d033bd..411ff7224caa 100644 --- a/virt/kvm/guest_memfd.c +++ b/virt/kvm/guest_memfd.c @@ -261,6 +261,12 @@ static int kvm_gmem_release(struct inode *inode, struct file *file) * dereferencing the slot for existing bindings needs to be protected * against memslot updates, specifically so that unbind doesn't race * and free the memslot (kvm_gmem_get_file() will return NULL). + * + * Since .release is called only when the reference count is zero, + * after which file_ref_get() and get_file_active() fail, + * kvm_gmem_get_pfn() cannot be using the file concurrently. + * file_ref_put() provides a full barrier, and get_file_active() the + * matching acquire barrier. */ mutex_lock(&kvm->slots_lock); @@ -508,8 +514,8 @@ int kvm_gmem_bind(struct kvm *kvm, struct kvm_memory_slot *slot, /* * memslots of flag KVM_MEM_GUEST_MEMFD are immutable to change, so - * kvm_gmem_bind() must occur on a new memslot. - * Readers are guaranteed to see this new file. + * kvm_gmem_bind() must occur on a new memslot. Because the memslot + * is not visible yet, kvm_gmem_get_pfn() is guaranteed to see the file. */ WRITE_ONCE(slot->gmem.file, file); slot->gmem.pgoff = start; @@ -547,6 +554,11 @@ void kvm_gmem_unbind(struct kvm_memory_slot *slot) filemap_invalidate_lock(file->f_mapping); xa_store_range(&gmem->bindings, start, end - 1, NULL, GFP_KERNEL); + + /* + * synchronize_srcu(&kvm->srcu) ensured that kvm_gmem_get_pfn() + * cannot see this memslot. + */ WRITE_ONCE(slot->gmem.file, NULL); filemap_invalidate_unlock(file->f_mapping); Queued to kvm-coco-queue. Paolo ^ permalink raw reply related [flat|nested] 5+ messages in thread
* Re: [PATCH 2/2] KVM: guest_memfd: Remove RCU-protected attribute from slot->gmem.file 2024-12-22 18:38 ` Paolo Bonzini @ 2024-12-23 5:43 ` Yan Zhao 0 siblings, 0 replies; 5+ messages in thread From: Yan Zhao @ 2024-12-23 5:43 UTC (permalink / raw) To: Paolo Bonzini; +Cc: seanjc, linux-kernel, kvm On Sun, Dec 22, 2024 at 07:38:44PM +0100, Paolo Bonzini wrote: > On 11/4/24 09:43, Yan Zhao wrote: > > Remove the RCU-protected attribute from slot->gmem.file. No need to use RCU > > primitives rcu_assign_pointer()/synchronize_rcu() to update this pointer. > > > > - slot->gmem.file is updated in 3 places: > > kvm_gmem_bind(), kvm_gmem_unbind(), kvm_gmem_release(). > > All of them are protected by kvm->slots_lock. > > > > - slot->gmem.file is read in 2 paths: > > (1) kvm_gmem_populate > > kvm_gmem_get_file > > __kvm_gmem_get_pfn > > > > (2) kvm_gmem_get_pfn > > kvm_gmem_get_file > > __kvm_gmem_get_pfn > > > > Path (1) kvm_gmem_populate() requires holding kvm->slots_lock, so > > slot->gmem.file is protected by the kvm->slots_lock in this path. > > > > Path (2) kvm_gmem_get_pfn() does not require holding kvm->slots_lock. > > However, it's also not guarded by rcu_read_lock() and rcu_read_unlock(). > > So synchronize_rcu() in kvm_gmem_unbind()/kvm_gmem_release() actually > > will not wait for the readers in kvm_gmem_get_pfn() due to lack of RCU > > read-side critical section. > > > > The path (2) kvm_gmem_get_pfn() is safe without RCU protection because: > > a) kvm_gmem_bind() is called on a new memslot, before the memslot is > > visible to kvm_gmem_get_pfn(). > > b) kvm->srcu ensures that kvm_gmem_unbind() and freeing of a memslot > > occur after the memslot is no longer visible to kvm_gmem_get_pfn(). > > c) get_file_active() ensures that kvm_gmem_get_pfn() will not access the > > stale file if kvm_gmem_release() sets it to NULL. This is because if > > kvm_gmem_release() occurs before kvm_gmem_get_pfn(), get_file_active() > > will return NULL; if get_file_active() does not return NULL, > > kvm_gmem_release() should not occur until after kvm_gmem_get_pfn() > > releases the file reference. > > Thanks for the analysis, I added some notes: Thank you for adding those notes! > diff --git a/include/linux/kvm_host.h b/include/linux/kvm_host.h > index 4ec2564c0d0f..c788d0bd952a 100644 > --- a/include/linux/kvm_host.h > +++ b/include/linux/kvm_host.h > @@ -602,6 +602,11 @@ struct kvm_memory_slot { > #ifdef CONFIG_KVM_PRIVATE_MEM > struct { > + /* > + * Writes protected by kvm->slots_lock. Acquiring a > + * reference via kvm_gmem_get_file() is protected by > + * either kvm->slots_lock or kvm->srcu. > + */ > struct file *file; > pgoff_t pgoff; > } gmem; > diff --git a/virt/kvm/guest_memfd.c b/virt/kvm/guest_memfd.c > index 9d9bf3d033bd..411ff7224caa 100644 > --- a/virt/kvm/guest_memfd.c > +++ b/virt/kvm/guest_memfd.c > @@ -261,6 +261,12 @@ static int kvm_gmem_release(struct inode *inode, struct file *file) > * dereferencing the slot for existing bindings needs to be protected > * against memslot updates, specifically so that unbind doesn't race > * and free the memslot (kvm_gmem_get_file() will return NULL). > + * > + * Since .release is called only when the reference count is zero, > + * after which file_ref_get() and get_file_active() fail, > + * kvm_gmem_get_pfn() cannot be using the file concurrently. > + * file_ref_put() provides a full barrier, and get_file_active() the > + * matching acquire barrier. > */ > mutex_lock(&kvm->slots_lock); > @@ -508,8 +514,8 @@ int kvm_gmem_bind(struct kvm *kvm, struct kvm_memory_slot *slot, > /* > * memslots of flag KVM_MEM_GUEST_MEMFD are immutable to change, so > - * kvm_gmem_bind() must occur on a new memslot. > - * Readers are guaranteed to see this new file. > + * kvm_gmem_bind() must occur on a new memslot. Because the memslot > + * is not visible yet, kvm_gmem_get_pfn() is guaranteed to see the file. > */ > WRITE_ONCE(slot->gmem.file, file); > slot->gmem.pgoff = start; > @@ -547,6 +554,11 @@ void kvm_gmem_unbind(struct kvm_memory_slot *slot) > filemap_invalidate_lock(file->f_mapping); > xa_store_range(&gmem->bindings, start, end - 1, NULL, GFP_KERNEL); > + > + /* > + * synchronize_srcu(&kvm->srcu) ensured that kvm_gmem_get_pfn() > + * cannot see this memslot. > + */ > WRITE_ONCE(slot->gmem.file, NULL); > filemap_invalidate_unlock(file->f_mapping); > Queued to kvm-coco-queue. > > Paolo > ^ permalink raw reply [flat|nested] 5+ messages in thread
end of thread, other threads:[~2024-12-23 6:18 UTC | newest] Thread overview: 5+ messages (download: mbox.gz follow: Atom feed -- links below jump to the message on this page -- 2024-11-04 8:41 [PATCH 0/2] RCU related fix based on kvm-coco-queue Yan Zhao 2024-11-04 8:42 ` [PATCH 1/2] KVM: x86/tdp_mmu: Use rcu_dereference() to protect sptep for dereferencing Yan Zhao 2024-11-04 8:43 ` [PATCH 2/2] KVM: guest_memfd: Remove RCU-protected attribute from slot->gmem.file Yan Zhao 2024-12-22 18:38 ` Paolo Bonzini 2024-12-23 5:43 ` Yan Zhao
This is a public inbox, see mirroring instructions for how to clone and mirror all data and code used for this inbox