All of lore.kernel.org
 help / color / mirror / Atom feed
From: Andrew Cooper <andrew.cooper3@citrix.com>
To: Jan Beulich <JBeulich@suse.com>
Cc: xen-devel <xen-devel@lists.xenproject.org>,
	Keir Fraser <keir@xen.org>, Eddie Dong <eddie.dong@intel.com>,
	Jun Nakajima <jun.nakajima@intel.com>
Subject: Re: [PATCH 1/4] VMX: streamline entry.S code
Date: Mon, 26 Aug 2013 14:22:39 +0100	[thread overview]
Message-ID: <521B569F.1040408@citrix.com> (raw)
In-Reply-To: <521B704C02000078000EE661@nat28.tlf.novell.com>

On 26/08/2013 14:12, Jan Beulich wrote:
>>>> On 26.08.13 at 13:48, Andrew Cooper <andrew.cooper3@citrix.com> wrote:
>> On 26/08/2013 12:01, Jan Beulich wrote:
>>>>> -.globl vmx_asm_do_vmentry
>>>>> -vmx_asm_do_vmentry:
>>>> If you move the ENTRY(vmx_asm_do_vmentry) up from below, you should be
>>>> able to completely drop the jmp in it.
>>> That would be possible, at the expense of added padding. I prefer
>>> it the way it is now, as vmx_asm_do_vmentry is not performance
>>> critical (as being used exactly once per HVM vCPU).
>> There are a number of places where we have ENTRY()-like constructs but
>> don't want the padding with it.
>>
>> Would an __ENTRY() macro go down well?  I can spin a patch for it.
> x86 Linux has GLOBAL() for that purpose - I'd like this better than
> __ENTRY() both from a name space perspective and from
> describing its purpose.

Ok - I will spin a patch.

>
>> My point about re-executing it does still apply.  Looking at the code, I
>> do not believe it is correct to be executing vmx_intr_assist or
>> nvmx_switch_guest multiple times on a context switch to an HVM VCPU. 
>> vmx_intr_assist at the very least has a huge amount of work to do before
>> it considers exiting.
>>
>> It does appear that there is possible interaction between do_softirq()
>> and vmx_intr_assist(), at which point vmx_intr_assist() should be run
>> after do_softirq(), which removes the apparently redundant run with
>> interrupts enabled.
> None of this seems related to the patch anymore - if you think
> there's more stuff that needs changing, let's discuss this in a
> separate thread.

Certainly.

>
>>> The %cr2 write's move is indeed debatable - I tried to get it farther
>>> away from the producer of the data in %rax, but it's not clear
>>> whether that's very useful. The second purpose was to get
>>> something interleaved with the many "pop"s, so that the CPU can
>>> get busy other than just its memory load ports. If controversial
>>> I'm fine with undoing that change.
>> From my understanding of a serialising instruction, it forces the
>> completion of all previous instructions before starting, and prevents
>> the issue of any subsequent instructions until it itself has completed.
>>
>> Therefore, I doubt it has the intended effect.
> Wait - this is again also a separation from the producer of the
> data. Whether modern CPUs can deal with that I'm not sure,
> but it surely doesn't hurt to hide eventual latency.
>
> Jan
>

For non-serialising instructions, it is a good idea (and likely some a
compiler would anyway).  Moving the GET_CURRENT() will probably be quite
effective as most subsequent instructions depend on it.

Serialising instructions on the other hand will not be affected by these
issues (given their nature), but I would prefer to defer judgement to
someone who has a better idea of the microarchitectural implications.

Either way, as the concerns are now just down to playing with the
optimal static instruction scheduling,

Reviewed-by: Andrew Cooper <andrew.cooper3@citrix.com>

  reply	other threads:[~2013-08-26 13:22 UTC|newest]

Thread overview: 60+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2013-08-23 13:58 [PATCH 0/4] HVM: produce better binary code Jan Beulich
2013-08-23 14:01 ` [PATCH 1/4] VMX: streamline entry.S code Jan Beulich
2013-08-26 10:44   ` Andrew Cooper
2013-08-26 11:01     ` Jan Beulich
2013-08-26 11:48       ` Andrew Cooper
2013-08-26 13:12         ` Jan Beulich
2013-08-26 13:22           ` Andrew Cooper [this message]
2013-08-29 11:01   ` Tim Deegan
2013-08-29 12:35     ` Jan Beulich
2013-08-23 14:02 ` [PATCH 2/4] VMX: move various uses of UD2 out of fast paths Jan Beulich
2013-08-23 22:06   ` Andrew Cooper
2013-08-26  8:50     ` Jan Beulich
2013-08-26  9:07       ` Andrew Cooper
2013-08-26  8:58     ` [PATCH v2 " Jan Beulich
2013-08-26  9:09       ` Andrew Cooper
2013-08-29 11:08       ` Tim Deegan
2013-08-23 14:03 ` [PATCH 3/4] VMX: use proper instruction mnemonics if assembler supports them Jan Beulich
2013-08-24 22:18   ` Andrew Cooper
2013-08-26  9:06     ` Jan Beulich
2013-08-26  9:25       ` Andrew Cooper
2013-08-26  9:41         ` Jan Beulich
2013-08-26 10:18         ` [PATCH v3 " Jan Beulich
2013-08-26 13:05           ` Andrew Cooper
2013-08-26 13:20             ` Jan Beulich
2013-08-26 14:03             ` [PATCH v4 " Jan Beulich
2013-08-26 14:18               ` Andrew Cooper
2013-08-26 14:29                 ` Jan Beulich
2013-08-26 15:07                   ` Andrew Cooper
2013-08-26 15:10                     ` Andrew Cooper
2013-08-26 15:30                       ` Jan Beulich
2013-08-26 15:29                     ` Jan Beulich
2013-08-26 15:33                       ` Andrew Cooper
2013-08-26 15:31                 ` [PATCH v5 " Jan Beulich
2013-08-26 15:36                   ` Andrew Cooper
2013-08-29 11:47                   ` Tim Deegan
2013-08-29 12:30                     ` Jan Beulich
2013-08-29 13:11                       ` Tim Deegan
2013-08-29 13:27                         ` Jan Beulich
2013-08-29 14:02                           ` Tim Deegan
2013-08-29 12:45                     ` Jan Beulich
2013-08-29 13:19                       ` Tim Deegan
2013-08-26  9:03   ` [PATCH v2 " Jan Beulich
2013-08-23 14:04 ` [PATCH 4/4] SVM: streamline entry.S code Jan Beulich
2013-08-26 16:20   ` Andrew Cooper
2013-08-26 17:20     ` Keir Fraser
2013-08-26 17:46       ` Andrew Cooper
2013-08-26 21:47   ` Andrew Cooper
2013-08-27  7:38     ` Jan Beulich
2013-08-29 11:56   ` Tim Deegan
2013-09-04 14:39   ` Boris Ostrovsky
2013-09-04 14:50     ` Jan Beulich
2013-09-04 15:09       ` Boris Ostrovsky
2013-09-04 15:20         ` Jan Beulich
2013-09-04 16:42           ` Boris Ostrovsky
2013-09-05  7:10             ` Jan Beulich
2013-09-04 10:06 ` Ping: [PATCH 0/4] HVM: produce better binary code Jan Beulich
2013-09-04 16:16   ` Andrew Cooper
2013-09-04 16:30     ` Tim Deegan
2013-09-05  7:52       ` Jan Beulich
2013-09-05  7:58         ` Tim Deegan

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=521B569F.1040408@citrix.com \
    --to=andrew.cooper3@citrix.com \
    --cc=JBeulich@suse.com \
    --cc=eddie.dong@intel.com \
    --cc=jun.nakajima@intel.com \
    --cc=keir@xen.org \
    --cc=xen-devel@lists.xenproject.org \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.