From: Xiao Guangrong <xiaoguangrong@linux.vnet.ibm.com>
To: Paolo Bonzini <pbonzini@redhat.com>
Cc: gleb@redhat.com, avi.kivity@gmail.com, mtosatti@redhat.com,
linux-kernel@vger.kernel.org, kvm@vger.kernel.org
Subject: Re: [PATCH 6/7] KVM: MMU: document fast invalidate all pages
Date: Wed, 19 Jun 2013 21:07:00 +0800
Message-ID: <51C1ACF4.4040004@linux.vnet.ibm.com>
In-Reply-To: <51C1A327.90900@redhat.com>

On 06/19/2013 08:25 PM, Paolo Bonzini wrote:
> On 19/06/2013 11:09, Xiao Guangrong wrote:
>> Document it to Documentation/virtual/kvm/mmu.txt
>>
>> Signed-off-by: Xiao Guangrong <xiaoguangrong@linux.vnet.ibm.com>
>> ---
>> Documentation/virtual/kvm/mmu.txt | 23 +++++++++++++++++++++++
>> arch/x86/include/asm/kvm_host.h | 5 +++++
>> 2 files changed, 28 insertions(+)
>>
>> diff --git a/Documentation/virtual/kvm/mmu.txt b/Documentation/virtual/kvm/mmu.txt
>> index b5ce7dd..f5c4de9 100644
>> --- a/Documentation/virtual/kvm/mmu.txt
>> +++ b/Documentation/virtual/kvm/mmu.txt
>> @@ -210,6 +210,10 @@ Shadow pages contain the following information:
>> A bitmap indicating which sptes in spt point (directly or indirectly) at
>> pages that may be unsynchronized. Used to quickly locate all unsychronized
>> pages reachable from a given page.
>> + mmu_valid_gen:
>> + It is the generation number of the page which cooperates with
>> + kvm->arch.mmu_valid_gen to fast invalidate all pages.
>> + (see "Fast invalidate all pages" below.)
>
> + mmu_valid_gen:
> + Generation number of the page. It is compared with kvm->arch.mmu_valid_gen
> + during hash table lookup, and used to skip invalidated shadow pages (see
> + "Zapping all pages" below.)
>
>> clear_spte_count:
>> It is only used on 32bit host which helps us to detect whether updating the
>> 64bit spte is complete so that we can avoid reading the truncated value out
>> @@ -373,6 +377,25 @@ causes its write_count to be incremented, thus preventing instantiation of
>> a large spte. The frames at the end of an unaligned memory slot have
>> artificially inflated ->write_counts so they can never be instantiated.
>>
>> +Fast invalidate all pages
>> +===========
>> +For the large memory and large vcpus guests, zapping all pages is a challenge
>> +since they have large number of pages need to be zapped, walking and zapping
>> +these pages are really slow and it should hold mmu-lock which stops the memory
>> +access on all vcpus.
>> +
>> +To make it be more scalable, kvm maintains a global mmu valid
>> +generation-number which is stored in kvm->arch.mmu_valid_gen and every shadow
>> +page stores the current global generation-number into sp->mmu_valid_gen when
>> +it is created.
>> +
>> +When KVM need zap all shadow pages sptes, it just simply increases the global
>> +generation-number then reload root shadow pages on all vcpus. Vcpu will create
>> +a new shadow page table according to current kvm's generation-number. It
>> +ensures the old pages are not used any more. The invalid-gen pages
>> +(sp->mmu_valid_gen != kvm->arch.mmu_valid_gen) are zapped by using lock-break
>> +technique.
>> +
>
> +Zapping all pages (page generation count)
> +=========================================
> +
> +For guests with a large amount of memory, walking and zapping all pages
> +is really slow (because there are a lot of pages), and it also blocks
> +memory accesses of all VCPUs because it needs to hold the MMU lock.
> +
> +To make this more scalable, KVM maintains a global generation number
> +which is stored in kvm->arch.mmu_valid_gen. Every shadow page stores
> +the current global generation number into sp->mmu_valid_gen when it
> +is created. Pages with a mismatching generation number are "obsolete".
> +
> +When KVM needs to zap all shadow pages, it simply increments the global
> +generation number and then reloads the root shadow pages on all VCPUs.
> +As the VCPUs create new shadow page tables, the old pages are not used
> +because of the mismatching generation number.
> +
> +KVM then walks through all pages and zaps obsolete pages. While the zap
> +operation needs to take the MMU lock, the lock can be released periodically
> +so that the VCPUs can make progress.
> +
>
>> Further reading
>> ===============
>>
>> diff --git a/arch/x86/include/asm/kvm_host.h b/arch/x86/include/asm/kvm_host.h
>> index 5eb5382..c4f90f6 100644
>> --- a/arch/x86/include/asm/kvm_host.h
>> +++ b/arch/x86/include/asm/kvm_host.h
>> @@ -222,6 +222,11 @@ struct kvm_mmu_page {
>> int root_count; /* Currently serving as active root */
>> unsigned int unsync_children;
>> unsigned long parent_ptes; /* Reverse mapping for parent_pte */
>> +
>> + /*
>> + * the generation number of the page which cooperates with
>> + * kvm->arch.mmu_valid_gen to fast invalidate all pages.
>> + */
>
> + /* The page is obsolete if mmu_valid_gen != kvm->arch.mmu_valid_gen. */
>
All the changes look fine to me.
I have learned a lot from your sentences, thanks! ;)
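
For readers following the thread, the generation-number scheme discussed
above can be modelled in a few lines of user-space C. This is only a rough
sketch under stated assumptions, not the kernel implementation: struct vm,
struct shadow_page, vm_alloc_page(), invalidate_all(), zap_obsolete() and the
lock_breaks counter are all hypothetical names, the page list is a plain
singly linked list, and the "lock break" is only counted rather than
performed. Only the field name mmu_valid_gen and the obsolete test
(sp->mmu_valid_gen != kvm->arch.mmu_valid_gen) come from the patch.

```c
#include <assert.h>
#include <stdbool.h>
#include <stddef.h>
#include <stdlib.h>

/* Simplified stand-ins for the kernel structures (hypothetical layout). */
struct shadow_page {
	unsigned long mmu_valid_gen;	/* generation copied at creation time */
	struct shadow_page *next;
};

struct vm {
	unsigned long mmu_valid_gen;	/* global generation number */
	struct shadow_page *pages;	/* all shadow pages, newest first */
	size_t nr_pages;
	unsigned int lock_breaks;	/* times the MMU lock would be dropped */
};

/* A page is obsolete if its generation differs from the global one. */
static bool is_obsolete(const struct vm *vm, const struct shadow_page *sp)
{
	return sp->mmu_valid_gen != vm->mmu_valid_gen;
}

/* New pages are stamped with the current global generation. */
static struct shadow_page *vm_alloc_page(struct vm *vm)
{
	struct shadow_page *sp = malloc(sizeof(*sp));

	if (!sp)
		return NULL;
	sp->mmu_valid_gen = vm->mmu_valid_gen;
	sp->next = vm->pages;
	vm->pages = sp;
	vm->nr_pages++;
	return sp;
}

/* Invalidating everything is O(1): every existing page becomes obsolete. */
static void invalidate_all(struct vm *vm)
{
	vm->mmu_valid_gen++;
}

/*
 * Free obsolete pages, pretending to release the lock after every
 * 'batch' zapped pages. Keeping 'link' across a lock break is safe
 * here only because this model is single-threaded.
 */
static size_t zap_obsolete(struct vm *vm, size_t batch)
{
	struct shadow_page **link = &vm->pages;
	size_t zapped = 0, in_batch = 0;

	assert(batch > 0);
	while (*link) {
		struct shadow_page *sp = *link;

		if (!is_obsolete(vm, sp)) {
			link = &sp->next;	/* keep current-generation pages */
			continue;
		}
		*link = sp->next;		/* unlink and free the obsolete page */
		free(sp);
		vm->nr_pages--;
		zapped++;
		if (++in_batch == batch) {
			vm->lock_breaks++;	/* real code would drop and retake the lock */
			in_batch = 0;
		}
	}
	return zapped;
}
```

The point of the model is that invalidate_all() does no per-page work, so the
cost of walking the pages moves into zap_obsolete(), where it can be split
into batches instead of being paid in one long critical section.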