From mboxrd@z Thu Jan 1 00:00:00 1970 From: Julien Grall Subject: Re: [PATCH] xen/arm: p2m: Correctly flush TLB in create_p2m_entries Date: Thu, 09 Jan 2014 13:14:25 +0000 Message-ID: <52CEA0B1.3020403@linaro.org> References: <1389200759-22177-1-git-send-email-julien.grall@linaro.org> <1389265175.27473.67.camel@kazak.uk.xensource.com> Mime-Version: 1.0 Content-Type: text/plain; charset="us-ascii"; Format="flowed" Content-Transfer-Encoding: 7bit Return-path: Received: from mail6.bemta5.messagelabs.com ([195.245.231.135]) by lists.xen.org with esmtp (Exim 4.72) (envelope-from ) id 1W1FRN-0004AF-Nk for xen-devel@lists.xenproject.org; Thu, 09 Jan 2014 13:14:29 +0000 Received: by mail-we0-f179.google.com with SMTP id q59so2788034wes.10 for ; Thu, 09 Jan 2014 05:14:28 -0800 (PST) In-Reply-To: <1389265175.27473.67.camel@kazak.uk.xensource.com> List-Unsubscribe: , List-Post: List-Help: List-Subscribe: , Sender: xen-devel-bounces@lists.xen.org Errors-To: xen-devel-bounces@lists.xen.org To: Ian Campbell , Stefano Stabellini Cc: xen-devel@lists.xenproject.org, tim@xen.org, stefano.stabellini@citrix.com, patches@linaro.org List-Id: xen-devel@lists.xenproject.org On 01/09/2014 10:59 AM, Ian Campbell wrote: > On Wed, 2014-01-08 at 17:13 +0000, Stefano Stabellini wrote: >> On Wed, 8 Jan 2014, Julien Grall wrote: >>> The p2m is shared between VCPUs for each domain. Currently Xen only flush >>> TLB on the local PCPU. This could result to mismatch between the mapping in the >>> p2m and TLBs. >>> >>> Flush TLBs used by this domain on every PCPU. >>> >>> Signed-off-by: Julien Grall >> >> The fix makes sense to me. > > Me too. Has anyone had a grep for similar issues? I think flush_xen_data_tlb and flush_xen_text_tlb should also be innershareable. The former one is used by flush_tlb_mask. But ... this function seems badly implement, it's weird to use flush_xen_data_tlb because we are mainly using flush_tlb_mask in common/grant-table.c. Any ideas? > (the reason for getting this wrong is that for cache flushes the "is" > suffix restricts the flush to the IS domain, whereas with tlb flushes > the "is" suffix broadcasts instead of keeping it local, which is a bit > confusing ;-)) > >> >>> --- >>> >>> This is a possible bug fix (found by reading the code) for Xen 4.4. I have >>> added a small optimisation to avoid flushing all TLBs when a VCPU of this >>> domain is running on the current cpu. >>> >>> The downside of this patch is the function can be a little bit slower because >>> Xen is flushing more TLBs. >> >> Yes, I wonder how much slower it is going to be, considering that the flush >> is executed for every iteration of the loop. > > It might be better to set the current VMID to the target domain for the > duration of this function, we'd still need the broadcast but at least we > wouldn't be killing unrelated VMIDs. I can modify the patch to handle that. > > Pulling the flush out of the loop would require great case WRT accesses > from other VCPUs, e.g. you'd have to put the pages on a list (page->list > might be available?) and issue the put_page() after the flush, otherwise > it might get recycled into another domain while the first domain still > has TLB entries for it. > > Or is there always something outside this function which holds another > ref count such that the page definitely won't be freed by the put_page > here? > > Actually, do the existing code not have this issue already? The put_page > is before the flush. If this bug does exist now then I'd be inclined to > consider this a bug fix for 4.4, rather than a potential optimisation > for 4.5. For now we don't take reference when we map/unmap mapping. Most of the time create_p2m_entries is called by common code which take care of having a reference when this function is called. So we should be safe. I would prefer to wait 4.5 for this optimisation (moving the flush outside the loop). > While looking at this function I'm now wondering what happens to the > existing page on ALLOCATE or INSERT, is it leaked? For ALLOCATE page, it's on the domain page list so the page will be freed duing relinquish. For INSERT, except for foreign mapping we don't have refcount for mapping. So the only issue could be with foreign mapping. -- Julien Grall