All of lore.kernel.org
 help / color / mirror / Atom feed
From: Gleb Natapov <gleb@redhat.com>
To: Avi Kivity <avi@redhat.com>
Cc: kvm@vger.kernel.org, linux-mm@kvack.org, linux-kernel@vger.kernel.org
Subject: Re: [PATCH 06/11] Inject asynchronous page fault into a guest if page is swapped out.
Date: Mon, 2 Nov 2009 17:41:17 +0200	[thread overview]
Message-ID: <20091102154117.GD27911@redhat.com> (raw)
In-Reply-To: <4AEED70E.4050007@redhat.com>

On Mon, Nov 02, 2009 at 02:56:46PM +0200, Avi Kivity wrote:
> On 11/01/2009 01:56 PM, Gleb Natapov wrote:
> >If guest access swapped out memory do not swap it in from vcpu thread
> >context. Setup slow work to do swapping and send async page fault to
> >a guest.
> >
> >Allow async page fault injection only when guest is in user mode since
> >otherwise guest may be in non-sleepable context and will not be able to
> >reschedule.
> 
> That loses us page cache accesses, which may be the majority of
> accesses in some workloads.
> 
This is addressed later in the patch series.

> If we allow the guest to ignore a fault, and ensure that a second
> access to an apf page from the same vcpu doesn't trigger another
> apf, we can simply ignore the apf in a guest when we can't schedule.
> 
> Probably best done with an enable bit for kernel-mode apfs.
> 
> >Signed-off-by: Gleb Natapov<gleb@redhat.com>
> >---
> >  arch/x86/include/asm/kvm_host.h |   20 +++
> >  arch/x86/kvm/mmu.c              |  243 ++++++++++++++++++++++++++++++++++++++-
> >  arch/x86/kvm/mmutrace.h         |   60 ++++++++++
> >  arch/x86/kvm/paging_tmpl.h      |   16 +++-
> >  arch/x86/kvm/x86.c              |   22 +++-
> 
> Much of the code is generic, please move it to virt/kvm.
> 
OK, Will move generic part to virt.

> >+static void async_pf_execute(struct slow_work *work)
> >+{
> >+	struct page *page[1];
> 
> No need to make it an array, just pass its address.
> 
OK

> >+	struct kvm_mmu_async_pf *apf =
> >+		container_of(work, struct kvm_mmu_async_pf, work);
> >+	wait_queue_head_t *q =&apf->vcpu->wq;
> >+
> >+	might_sleep();
> >+
> >+	down_read(&apf->mm->mmap_sem);
> >+	get_user_pages(current, apf->mm, apf->addr, 1, 1, 0, page, NULL);
> >+	up_read(&apf->mm->mmap_sem);
> >+
> >+	spin_lock(&apf->vcpu->arch.mmu_async_pf_lock);
> >+	list_add_tail(&apf->link,&apf->vcpu->arch.mmu_async_pf_done);
> >+	apf->page = page[0];
> >+	spin_unlock(&apf->vcpu->arch.mmu_async_pf_lock);
> >+
> >+	trace_kvm_mmu_async_pf_executed(apf->addr, apf->page, apf->token,
> >+					apf->gva);
> 
> _completed, but maybe better placed in vcpu context.
> 
> >+
> >+static bool can_do_async_pf(struct kvm_vcpu *vcpu)
> >+{
> >+	struct kvm_segment kvm_seg;
> >+
> >+	if (!vcpu->arch.pv_shm ||
> >+	    !(vcpu->arch.pv_shm->features&  KVM_PV_SHM_FEATURES_ASYNC_PF) ||
> >+	    kvm_event_needs_reinjection(vcpu))
> >+		return false;
> >+
> >+	kvm_get_segment(vcpu,&kvm_seg, VCPU_SREG_CS);
> >+
> >+	/* is userspace code? TODO check VM86 mode */
> >+	return !!(kvm_seg.selector&  3);
> 
> There's a ->get_cpl() which is slightly faster.  Note vm86 is
> perfectly fine for async pf.
> 
OK. But the code is removed by following patches anyway.

> >+static int setup_async_pf(struct kvm_vcpu *vcpu, gva_t gva, gfn_t gfn)
> >+{
> >+	struct kvm_mmu_async_pf *work;
> >+
> >+	/* setup slow work */
> >+
> >+	/* do alloc atomic since if we are going to sleep anyway we
> >+	   may as well sleep faulting in page */
> >+	work = kmem_cache_zalloc(mmu_async_pf_cache, GFP_ATOMIC);
> >+	if (!work)
> >+		return 0;
> >+
> >+	atomic_set(&work->used, 1);
> >+	work->page = NULL;
> >+	work->vcpu = vcpu;
> >+	work->gva = gva;
> >+	work->addr = gfn_to_hva(vcpu->kvm, gfn);
> >+	work->token = (vcpu->arch.async_pf_id++<<  12) | vcpu->vcpu_id;
> 
> The shift truncates async_pf_id.
> 
Will fix.

--
			Gleb.

WARNING: multiple messages have this Message-ID (diff)
From: Gleb Natapov <gleb@redhat.com>
To: Avi Kivity <avi@redhat.com>
Cc: kvm@vger.kernel.org, linux-mm@kvack.org, linux-kernel@vger.kernel.org
Subject: Re: [PATCH 06/11] Inject asynchronous page fault into a guest if page is swapped out.
Date: Mon, 2 Nov 2009 17:41:17 +0200	[thread overview]
Message-ID: <20091102154117.GD27911@redhat.com> (raw)
In-Reply-To: <4AEED70E.4050007@redhat.com>

On Mon, Nov 02, 2009 at 02:56:46PM +0200, Avi Kivity wrote:
> On 11/01/2009 01:56 PM, Gleb Natapov wrote:
> >If guest access swapped out memory do not swap it in from vcpu thread
> >context. Setup slow work to do swapping and send async page fault to
> >a guest.
> >
> >Allow async page fault injection only when guest is in user mode since
> >otherwise guest may be in non-sleepable context and will not be able to
> >reschedule.
> 
> That loses us page cache accesses, which may be the majority of
> accesses in some workloads.
> 
This is addressed later in the patch series.

> If we allow the guest to ignore a fault, and ensure that a second
> access to an apf page from the same vcpu doesn't trigger another
> apf, we can simply ignore the apf in a guest when we can't schedule.
> 
> Probably best done with an enable bit for kernel-mode apfs.
> 
> >Signed-off-by: Gleb Natapov<gleb@redhat.com>
> >---
> >  arch/x86/include/asm/kvm_host.h |   20 +++
> >  arch/x86/kvm/mmu.c              |  243 ++++++++++++++++++++++++++++++++++++++-
> >  arch/x86/kvm/mmutrace.h         |   60 ++++++++++
> >  arch/x86/kvm/paging_tmpl.h      |   16 +++-
> >  arch/x86/kvm/x86.c              |   22 +++-
> 
> Much of the code is generic, please move it to virt/kvm.
> 
OK, Will move generic part to virt.

> >+static void async_pf_execute(struct slow_work *work)
> >+{
> >+	struct page *page[1];
> 
> No need to make it an array, just pass its address.
> 
OK

> >+	struct kvm_mmu_async_pf *apf =
> >+		container_of(work, struct kvm_mmu_async_pf, work);
> >+	wait_queue_head_t *q =&apf->vcpu->wq;
> >+
> >+	might_sleep();
> >+
> >+	down_read(&apf->mm->mmap_sem);
> >+	get_user_pages(current, apf->mm, apf->addr, 1, 1, 0, page, NULL);
> >+	up_read(&apf->mm->mmap_sem);
> >+
> >+	spin_lock(&apf->vcpu->arch.mmu_async_pf_lock);
> >+	list_add_tail(&apf->link,&apf->vcpu->arch.mmu_async_pf_done);
> >+	apf->page = page[0];
> >+	spin_unlock(&apf->vcpu->arch.mmu_async_pf_lock);
> >+
> >+	trace_kvm_mmu_async_pf_executed(apf->addr, apf->page, apf->token,
> >+					apf->gva);
> 
> _completed, but maybe better placed in vcpu context.
> 
> >+
> >+static bool can_do_async_pf(struct kvm_vcpu *vcpu)
> >+{
> >+	struct kvm_segment kvm_seg;
> >+
> >+	if (!vcpu->arch.pv_shm ||
> >+	    !(vcpu->arch.pv_shm->features&  KVM_PV_SHM_FEATURES_ASYNC_PF) ||
> >+	    kvm_event_needs_reinjection(vcpu))
> >+		return false;
> >+
> >+	kvm_get_segment(vcpu,&kvm_seg, VCPU_SREG_CS);
> >+
> >+	/* is userspace code? TODO check VM86 mode */
> >+	return !!(kvm_seg.selector&  3);
> 
> There's a ->get_cpl() which is slightly faster.  Note vm86 is
> perfectly fine for async pf.
> 
OK. But the code is removed by following patches anyway.

> >+static int setup_async_pf(struct kvm_vcpu *vcpu, gva_t gva, gfn_t gfn)
> >+{
> >+	struct kvm_mmu_async_pf *work;
> >+
> >+	/* setup slow work */
> >+
> >+	/* do alloc atomic since if we are going to sleep anyway we
> >+	   may as well sleep faulting in page */
> >+	work = kmem_cache_zalloc(mmu_async_pf_cache, GFP_ATOMIC);
> >+	if (!work)
> >+		return 0;
> >+
> >+	atomic_set(&work->used, 1);
> >+	work->page = NULL;
> >+	work->vcpu = vcpu;
> >+	work->gva = gva;
> >+	work->addr = gfn_to_hva(vcpu->kvm, gfn);
> >+	work->token = (vcpu->arch.async_pf_id++<<  12) | vcpu->vcpu_id;
> 
> The shift truncates async_pf_id.
> 
Will fix.

--
			Gleb.

--
To unsubscribe, send a message with 'unsubscribe linux-mm' in
the body to majordomo@kvack.org.  For more info on Linux MM,
see: http://www.linux-mm.org/ .
Don't email: <a href=mailto:"dont@kvack.org"> email@kvack.org </a>

  reply	other threads:[~2009-11-02 15:41 UTC|newest]

Thread overview: 112+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2009-11-01 11:56 [PATCH 00/11] KVM: Add asynchronous page fault for PV guest Gleb Natapov
2009-11-01 11:56 ` Gleb Natapov
2009-11-01 11:56 ` [PATCH 01/11] Add shared memory hypercall to PV Linux guest Gleb Natapov
2009-11-01 11:56   ` Gleb Natapov
2009-11-02  4:27   ` Rik van Riel
2009-11-02  4:27     ` Rik van Riel
2009-11-02  7:07     ` Gleb Natapov
2009-11-02  7:07       ` Gleb Natapov
2009-11-02 12:18   ` Avi Kivity
2009-11-02 12:18     ` Avi Kivity
2009-11-02 16:18     ` Gleb Natapov
2009-11-02 16:18       ` Gleb Natapov
2009-11-03  5:15       ` Avi Kivity
2009-11-03  5:15         ` Avi Kivity
2009-11-03  7:16         ` Gleb Natapov
2009-11-03  7:16           ` Gleb Natapov
2009-11-03  7:40           ` Avi Kivity
2009-11-03  7:40             ` Avi Kivity
2009-11-01 11:56 ` [PATCH 02/11] Add "handle page fault" PV helper Gleb Natapov
2009-11-01 11:56   ` Gleb Natapov
2009-11-02  9:22   ` Ingo Molnar
2009-11-02  9:22     ` Ingo Molnar
2009-11-02 16:04     ` Gleb Natapov
2009-11-02 16:04       ` Gleb Natapov
2009-11-02 16:12       ` Ingo Molnar
2009-11-02 16:12         ` Ingo Molnar
2009-11-02 16:22         ` Gleb Natapov
2009-11-02 16:22           ` Gleb Natapov
2009-11-02 16:29           ` Ingo Molnar
2009-11-02 16:29             ` Ingo Molnar
2009-11-02 16:31             ` Gleb Natapov
2009-11-02 16:31               ` Gleb Natapov
2009-11-02 17:42             ` Gleb Natapov
2009-11-02 17:42               ` Gleb Natapov
2009-11-08 11:36               ` Ingo Molnar
2009-11-08 11:36                 ` Ingo Molnar
2009-11-08 12:43                 ` Avi Kivity
2009-11-08 12:43                   ` Avi Kivity
2009-11-08 12:51                   ` Ingo Molnar
2009-11-08 12:51                     ` Ingo Molnar
2009-11-08 13:01                     ` Avi Kivity
2009-11-08 13:01                       ` Avi Kivity
2009-11-08 13:05                       ` Ingo Molnar
2009-11-08 13:05                         ` Ingo Molnar
2009-11-08 13:08                         ` Avi Kivity
2009-11-08 13:08                           ` Avi Kivity
2009-11-08 16:44                     ` H. Peter Anvin
2009-11-08 16:44                       ` H. Peter Anvin
2009-11-08 16:47                       ` Ingo Molnar
2009-11-08 16:47                         ` Ingo Molnar
2009-11-02 19:03     ` Rik van Riel
2009-11-02 19:03       ` Rik van Riel
2009-11-02 19:33       ` Avi Kivity
2009-11-02 19:33         ` Avi Kivity
2009-11-02 23:35         ` Rik van Riel
2009-11-02 23:35           ` Rik van Riel
2009-11-03  4:57           ` Avi Kivity
2009-11-03  4:57             ` Avi Kivity
2009-11-03  4:57             ` Avi Kivity
2009-11-05  6:44             ` Tian, Kevin
2009-11-05  6:44               ` Tian, Kevin
2009-11-05  8:22               ` Avi Kivity
2009-11-05  8:22                 ` Avi Kivity
2009-11-05  8:22                 ` Avi Kivity
2009-11-01 11:56 ` [PATCH 03/11] Handle asynchronous page fault in a PV guest Gleb Natapov
2009-11-01 11:56   ` Gleb Natapov
2009-11-02 12:38   ` Avi Kivity
2009-11-02 12:38     ` Avi Kivity
2009-11-02 15:54     ` Gleb Natapov
2009-11-02 15:54       ` Gleb Natapov
2009-11-03 14:14   ` Marcelo Tosatti
2009-11-03 14:14     ` Marcelo Tosatti
2009-11-03 14:25     ` Gleb Natapov
2009-11-03 14:25       ` Gleb Natapov
2009-11-03 14:32       ` Marcelo Tosatti
2009-11-03 14:32         ` Marcelo Tosatti
2009-11-03 14:38         ` Avi Kivity
2009-11-03 14:38           ` Avi Kivity
2009-11-01 11:56 ` [PATCH 04/11] Export __get_user_pages_fast Gleb Natapov
2009-11-01 11:56   ` Gleb Natapov
2009-11-02  9:23   ` Ingo Molnar
2009-11-02  9:23     ` Ingo Molnar
2009-11-01 11:56 ` [PATCH 05/11] Add get_user_pages() variant that fails if major fault is required Gleb Natapov
2009-11-01 11:56   ` Gleb Natapov
2009-11-02 19:05   ` Rik van Riel
2009-11-02 19:05     ` Rik van Riel
2009-11-01 11:56 ` [PATCH 06/11] Inject asynchronous page fault into a guest if page is swapped out Gleb Natapov
2009-11-01 11:56   ` Gleb Natapov
2009-11-02 12:56   ` Avi Kivity
2009-11-02 12:56     ` Avi Kivity
2009-11-02 15:41     ` Gleb Natapov [this message]
2009-11-02 15:41       ` Gleb Natapov
2009-11-01 11:56 ` [PATCH 07/11] Retry fault before vmentry Gleb Natapov
2009-11-01 11:56   ` Gleb Natapov
2009-11-02 13:03   ` Avi Kivity
2009-11-02 13:03     ` Avi Kivity
2009-11-01 11:56 ` [PATCH 08/11] Add "wait for page" hypercall Gleb Natapov
2009-11-01 11:56   ` Gleb Natapov
2009-11-02 13:05   ` Avi Kivity
2009-11-02 13:05     ` Avi Kivity
2009-11-02 15:13     ` Gleb Natapov
2009-11-02 15:13       ` Gleb Natapov
2009-11-02 15:19       ` Avi Kivity
2009-11-02 15:19         ` Avi Kivity
2009-11-01 11:56 ` [PATCH 09/11] Maintain preemptability count even for !CONFIG_PREEMPT kernels Gleb Natapov
2009-11-01 11:56   ` Gleb Natapov
2009-11-02  9:24   ` Ingo Molnar
2009-11-02  9:24     ` Ingo Molnar
2009-11-01 11:56 ` [PATCH 10/11] Handle async PF in non preemptable context Gleb Natapov
2009-11-01 11:56   ` Gleb Natapov
2009-11-01 11:56 ` [PATCH 11/11] Send async PF when guest is not in userspace too Gleb Natapov
2009-11-01 11:56   ` Gleb Natapov

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=20091102154117.GD27911@redhat.com \
    --to=gleb@redhat.com \
    --cc=avi@redhat.com \
    --cc=kvm@vger.kernel.org \
    --cc=linux-kernel@vger.kernel.org \
    --cc=linux-mm@kvack.org \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.