Re: [PATCH v15 01/11] multicall: add no preemption ability between two calls

xen-devel.lists.xenproject.org archive mirror
 help / color / mirror / Atom feed

From: Chao Peng <chao.p.peng@linux.intel.com>
To: Jan Beulich <JBeulich@suse.com>,
	Andrew Cooper <andrew.cooper3@citrix.com>
Cc: keir@xen.org, Ian.Campbell@citrix.com,
	stefano.stabellini@eu.citrix.com, George.Dunlap@eu.citrix.com,
	Ian.Jackson@eu.citrix.com, xen-devel@lists.xen.org,
	dgdegra@tycho.nsa.gov
Subject: Re: [PATCH v15 01/11] multicall: add no preemption ability between two calls
Date: Thu, 18 Sep 2014 21:45:42 +0800	[thread overview]
Message-ID: <20140918134542.GA6631@pengc-linux> (raw)
In-Reply-To: <5419740C02000078000359A9@mail.emea.novell.com>

On Wed, Sep 17, 2014 at 10:44:12AM +0100, Jan Beulich wrote:
> >>> On 17.09.14 at 11:22, <chao.p.peng@linux.intel.com> wrote:
> > On Fri, Sep 12, 2014 at 10:55:43AM +0800, Chao Peng wrote:
> >> On Wed, Sep 10, 2014 at 12:12:07PM +0100, Andrew Cooper wrote:
> >> > On 10/09/14 11:25, Jan Beulich wrote:
> >> > >>>> On 10.09.14 at 12:15, <andrew.cooper3@citrix.com> wrote:
> >> > >> On 10/09/14 11:07, Jan Beulich wrote:
> >> > >>>>>> On 10.09.14 at 11:43, <andrew.cooper3@citrix.com> wrote:
> >> > >>>> Actually, on further thought, using multicalls like this cannot possibly
> >> > >>>> be correct from a functional point of view.
> >> > >>>>
> >> > >>>> Even with the no preempt flag between a wrmsr/rdmsr hypercall pair,
> >> > >>>> there is no guarantee that accesses to remote cpus msrs won't interleave
> >> > >>>> with a different natural access, clobbering the results of the wrmsr.
> >> > >>>>
> >> > >>>> However this is solved, the wrmsr/rdmsr pair *must* be part of the same
> >> > >>>> synchronous thread of execution on the appropriate cpu.  You can trust
> >> > >>>> that interrupts won't play with these msrs, but you absolutely can't
> >> > >>>> guarantee that IPI/wrmsr/IPI/rdmsr will work.
> >> > >>> Not sure I follow, particularly in the context of the white listing of
> >> > >>> MSRs permitted here (which ought to not include anything the
> >> > >>> hypervisor needs control over).
> >> > >> Consider two dom0 vcpus both using this new multicall mechanism to read
> >> > >> QoS information for different domains, which end up both targeting the
> >> > >> same remote cpu.  They will both end up using IPI/wrmsr/IPI/rdmsr, which
> >> > >> may interleave and clobber the first wrmsr.
> >> > > But that situation doesn't result from the multicall use here - it would
> >> > > equally be the case for an inherently batchable hypercall.
> >> > 
> >> > Indeed - I called out multicall because of the current implementation,
> >> > but I should have been more clear.
> >> > 
> >> > > To deal with
> >> > > that we'd need a wrmsr-then-rdmsr operation, or move the entire
> >> > > execution of the batch onto the target CPU. Since the former would
> >> > > quickly become unwieldy for more complex operations, I think this
> >> > > gets us back to aiming at using continue_hypercall_on_cpu() here.
> >> > 
> >> > Which gets us back to the problem that you cannot use
> >> > copy_{to,from}_guest() after continue_hypercall_on_cpu(), due to being
> >> > in the wrong context.
> >> > 
> >> > 
> >> > I think this requires a step back and rethink.  I can't offhand think of
> >> > any combination of existing bits of infrastructure which will allow this
> >> > to work correctly, which means something new needs designing.
> >> > 
> >> How about this:
> >> 
> >> 1)  Still do the batch in do_platform_op() but add a iteration field in
> >> the interface structure.
> >> 
> >> 2)  Still use on_selected_cpus() but group the adjacent resource_ops
> >> which have a same cpu and NO_PREEMPT set into one and do it as a whole
> >> in the new cpu context.
> >> 
> > Any suggestion for this?
> 
> 1 is ugly (contradicting everything we do elsewhere), but would be a
> last resort option.
> 
> 2 would be perhaps an option if small, non-preemptible batches
> would be handled in do_platform_op() while preemptible larger
> groups then ought to use the multicall interface.
> 
> Option 3 would be to fiddle with the current vCPU's affinity before
> invoking a continuation (perhaps already on the first iteration to
> get onto the needed pCPU).
> 
Thanks Jan.

On further thought, I think we may over design for this.

Why not make it simple and also scalable?
The answer is also simple: do_platform_op() is always non-preemptible.

It can accept one operation or small batch of operations but it
guarantees all the operations are non-preemptible. (eg it never calls
hypercall_create_continuation() )
It's the minimum unit for non-preemptible operation.

If the caller(userspace tool) wants to make preemptible batch calls,
then multicall mechanism can be employed. 
We don't need to add NO_PREEMPT ability for multicall. Just keep it
preemptible.

This is almost option 2 above.

Chao
> 
> 
> _______________________________________________
> Xen-devel mailing list
> Xen-devel@lists.xen.org
> http://lists.xen.org/xen-devel

next prev parent reply	other threads:[~2014-09-18 13:45 UTC|newest]

Thread overview: 36+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2014-09-05  8:37 [PATCH v15 00/11] enable Cache QoS Monitoring (CQM) feature Chao Peng
2014-09-05  8:37 ` [PATCH v15 01/11] multicall: add no preemption ability between two calls Chao Peng
2014-09-05 10:46   ` Andrew Cooper
2014-09-09  6:43     ` Chao Peng
2014-09-09 10:39       ` Jan Beulich
2014-09-09 10:51         ` Andrew Cooper
2014-09-09 11:51           ` Jan Beulich
2014-09-09 12:44             ` Andrew Cooper
2014-09-09 13:15               ` Jan Beulich
2014-09-10  1:32                 ` Chao Peng
2014-09-10  9:43                   ` Andrew Cooper
2014-09-10 10:07                     ` Jan Beulich
2014-09-10 10:15                       ` Andrew Cooper
2014-09-10 10:25                         ` Jan Beulich
2014-09-10 11:12                           ` Andrew Cooper
2014-09-12  2:55                             ` Chao Peng
2014-09-17  9:22                               ` Chao Peng
2014-09-17  9:44                                 ` Jan Beulich
2014-09-18 13:45                                   ` Chao Peng [this message]
2014-09-18 14:22                                     ` Jan Beulich
2014-09-05  8:37 ` [PATCH v15 02/11] x86: add generic resource (e.g. MSR) access hypercall Chao Peng
2014-09-05 10:59   ` Andrew Cooper
2014-09-05 11:49     ` Jan Beulich
2014-09-10  2:55     ` Chao Peng
2014-09-29 18:52       ` Konrad Rzeszutek Wilk
2014-09-30  7:45         ` Jan Beulich
2014-09-05  8:37 ` [PATCH v15 03/11] xsm: add resource operation related xsm policy Chao Peng
2014-09-05  8:37 ` [PATCH v15 04/11] tools: provide interface for generic resource access Chao Peng
2014-09-05  8:37 ` [PATCH v15 05/11] x86: detect and initialize Platform QoS Monitoring feature Chao Peng
2014-09-05 11:05   ` Andrew Cooper
2014-09-05  8:37 ` [PATCH v15 06/11] x86: dynamically attach/detach QoS monitoring service for a guest Chao Peng
2014-09-05  8:37 ` [PATCH v15 07/11] x86: collect global QoS monitoring information Chao Peng
2014-09-05  8:37 ` [PATCH v15 08/11] x86: enable QoS monitoring for each domain RMID Chao Peng
2014-09-05  8:37 ` [PATCH v15 09/11] x86: add QoS monitoring related MSRs in allowed list Chao Peng
2014-09-05  8:37 ` [PATCH v15 10/11] xsm: add platform QoS related xsm policies Chao Peng
2014-09-05  8:37 ` [PATCH v15 11/11] tools: CMDs and APIs for Platform QoS Monitoring Chao Peng

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=20140918134542.GA6631@pengc-linux \
    --to=chao.p.peng@linux.intel.com \
    --cc=George.Dunlap@eu.citrix.com \
    --cc=Ian.Campbell@citrix.com \
    --cc=Ian.Jackson@eu.citrix.com \
    --cc=JBeulich@suse.com \
    --cc=andrew.cooper3@citrix.com \
    --cc=dgdegra@tycho.nsa.gov \
    --cc=keir@xen.org \
    --cc=stefano.stabellini@eu.citrix.com \
    --cc=xen-devel@lists.xen.org \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link

Be sure your reply has a Subject: header at the top and a blank line before the message body.

This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).