All of lore.kernel.org
 help / color / mirror / Atom feed
From: Avi Kivity <avi@redhat.com>
To: Andrea Arcangeli <aarcange@redhat.com>
Cc: Marcelo Tosatti <mtosatti@redhat.com>, KVM list <kvm@vger.kernel.org>
Subject: Re: mmu_notifers, pte_dirty questions
Date: Mon, 07 Jun 2010 08:09:47 +0300	[thread overview]
Message-ID: <4C0C7F1B.5070802@redhat.com> (raw)
In-Reply-To: <20100606183608.GI28052@random.random>

On 06/06/2010 09:36 PM, Andrea Arcangeli wrote:
> On Sun, Jun 06, 2010 at 03:07:27PM +0300, Avi Kivity wrote:
>    
>> Why no notifer when testing and clearing the dirty bit?
>>
>> (*clear_flush_dirty)(...).
>>
>>      
>>> static int page_mkclean_one(struct page *page, struct vm_area_struct *vma,
>>>                  unsigned long address)
>>> {
>>>      struct mm_struct *mm = vma->vm_mm;
>>>      pte_t *pte;
>>>      spinlock_t *ptl;
>>>      int ret = 0;
>>>
>>>      pte = page_check_address(page, mm, address,&ptl, 1);
>>>      if (!pte)
>>>          goto out;
>>>
>>>      if (pte_dirty(*pte) || pte_write(*pte)) {
>>>          pte_t entry;
>>>
>>>          flush_cache_page(vma, address, pte_pfn(*pte));
>>>          entry = ptep_clear_flush_notify(vma, address, pte);
>>>          entry = pte_wrprotect(entry);
>>>          entry = pte_mkclean(entry);
>>>          set_pte_at(mm, address, pte, entry);
>>>        
>> set_pte_at_notify()?  without this (or clear_flush_dirty) Linux will
>> assume all ptes are now clean; if the guest writes to a page nothing
>> will catch it.
>>
>> ->  with set_pte_at_notify(), we can drop the spte and mark the page as
>> dirty, so the next write will re-instantiate the spte
>> ->  with ->clear_flush_dirty(), we can track the dirty state without
>> dropping the spte.
>>
>>      
>>>          ret = 1;
>>>      }
>>>
>>>      pte_unmap_unlock(pte, ptl);
>>> out:
>>>      return ret;
>>>        
>> I'm probably missing something big as I can't see how this works.
>>      
> Under the PT lock it's safe to keep the PTE zero, just the pte must be
> non zero again before pte_unmap_unlock.
>
> The sptes are all gone by the time ptep_clear_flush_notify returns
> (also gup-fast is stopped with the IPI of the flush) and no spte can
> be established again before pte_unmap_unlock runs, so it's all safe as
> far as I can tell.
>
>    

Somehow  I missed the ptep_clear_flush_notify()...  so all should be fine.

> set_pte_at_notify might prevent a minor fault though.
>    

I'm thinking of how to implement speculative write access for kvm: 
consider a read fault for a writeable page.  We could install a 
writeable spte with the dirty bit clear, and examine the dirty bit at 
pte_clear_flush_notify() time and transfer it to the page flags.  
However I can't see where the mm code checks the pte dirty bit for 
anonymous pages?  Does it assume anonymous pages are always dirty? (they 
could have a clean copy in swap, no?)

-- 
I have a truly marvellous patch that fixes the bug which this
signature is too narrow to contain.


      reply	other threads:[~2010-06-07  5:09 UTC|newest]

Thread overview: 3+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2010-06-06 12:07 mmu_notifers, pte_dirty questions Avi Kivity
2010-06-06 18:36 ` Andrea Arcangeli
2010-06-07  5:09   ` Avi Kivity [this message]

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=4C0C7F1B.5070802@redhat.com \
    --to=avi@redhat.com \
    --cc=aarcange@redhat.com \
    --cc=kvm@vger.kernel.org \
    --cc=mtosatti@redhat.com \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.