From: marc.zyngier@arm.com (Marc Zyngier)
To: linux-arm-kernel@lists.infradead.org
Subject: [PATCH 1/2] ARM: KVM: Yield CPU when vcpu executes a WFE
Date: Mon, 07 Oct 2013 17:16:30 +0100 [thread overview]
Message-ID: <5252DE5E.6060700@arm.com> (raw)
In-Reply-To: <52F72A87-9CC3-4B38-ACDA-F5EA66FA7375@suse.de>
On 07/10/13 17:04, Alexander Graf wrote:
>
> On 07.10.2013, at 17:40, Marc Zyngier <marc.zyngier@arm.com> wrote:
>
>> On an (even slightly) oversubscribed system, spinlocks are quickly
>> becoming a bottleneck, as some vcpus are spinning, waiting for a
>> lock to be released, while the vcpu holding the lock may not be
>> running at all.
>>
>> This creates contention, and the observed slowdown is 40x for
>> hackbench. No, this isn't a typo.
>>
>> The solution is to trap blocking WFEs and tell KVM that we're now
>> spinning. This ensures that other vpus will get a scheduling boost,
>> allowing the lock to be released more quickly.
>>
>>> From a performance point of view: hackbench 1 process 1000
>>
>> 2xA15 host (baseline): 1.843s
>>
>> 2xA15 guest w/o patch: 2.083s 4xA15 guest w/o patch: 80.212s
>>
>> 2xA15 guest w/ patch: 2.072s 4xA15 guest w/ patch: 3.202s
>
> I'm confused. You got from 2.083s when not exiting on spin locks to
> 2.072 when exiting on _every_ spin lock that didn't immediately
> succeed. I would've expected to second number to be worse rather than
> better. I assume it's within jitter, I'm still puzzled why you don't
> see any significant drop in performance.
The key is in the ARM ARM:
B1.14.9: "When HCR.TWE is set to 1, and the processor is in a Non-secure
mode other than Hyp mode, execution of a WFE instruction generates a Hyp
Trap exception if, ignoring the value of the HCR.TWE bit, conditions
permit the processor to suspend execution."
So, on a non-overcommitted system, you rarely hit a blocking spinlock,
hence not trapping. Otherwise, performance would go down the drain very
quickly.
And yes, the difference is pretty much noise.
M.
--
Jazz is not dead. It just smells funny...
WARNING: multiple messages have this Message-ID (diff)
From: Marc Zyngier <marc.zyngier@arm.com>
To: Alexander Graf <agraf@suse.de>
Cc: linux-arm-kernel <linux-arm-kernel@lists.infradead.org>,
"kvmarm@lists.cs.columbia.edu" <kvmarm@lists.cs.columbia.edu>,
"kvm@vger.kernel.org mailing list" <kvm@vger.kernel.org>
Subject: Re: [PATCH 1/2] ARM: KVM: Yield CPU when vcpu executes a WFE
Date: Mon, 07 Oct 2013 17:16:30 +0100 [thread overview]
Message-ID: <5252DE5E.6060700@arm.com> (raw)
In-Reply-To: <52F72A87-9CC3-4B38-ACDA-F5EA66FA7375@suse.de>
On 07/10/13 17:04, Alexander Graf wrote:
>
> On 07.10.2013, at 17:40, Marc Zyngier <marc.zyngier@arm.com> wrote:
>
>> On an (even slightly) oversubscribed system, spinlocks are quickly
>> becoming a bottleneck, as some vcpus are spinning, waiting for a
>> lock to be released, while the vcpu holding the lock may not be
>> running at all.
>>
>> This creates contention, and the observed slowdown is 40x for
>> hackbench. No, this isn't a typo.
>>
>> The solution is to trap blocking WFEs and tell KVM that we're now
>> spinning. This ensures that other vpus will get a scheduling boost,
>> allowing the lock to be released more quickly.
>>
>>> From a performance point of view: hackbench 1 process 1000
>>
>> 2xA15 host (baseline): 1.843s
>>
>> 2xA15 guest w/o patch: 2.083s 4xA15 guest w/o patch: 80.212s
>>
>> 2xA15 guest w/ patch: 2.072s 4xA15 guest w/ patch: 3.202s
>
> I'm confused. You got from 2.083s when not exiting on spin locks to
> 2.072 when exiting on _every_ spin lock that didn't immediately
> succeed. I would've expected to second number to be worse rather than
> better. I assume it's within jitter, I'm still puzzled why you don't
> see any significant drop in performance.
The key is in the ARM ARM:
B1.14.9: "When HCR.TWE is set to 1, and the processor is in a Non-secure
mode other than Hyp mode, execution of a WFE instruction generates a Hyp
Trap exception if, ignoring the value of the HCR.TWE bit, conditions
permit the processor to suspend execution."
So, on a non-overcommitted system, you rarely hit a blocking spinlock,
hence not trapping. Otherwise, performance would go down the drain very
quickly.
And yes, the difference is pretty much noise.
M.
--
Jazz is not dead. It just smells funny...
next prev parent reply other threads:[~2013-10-07 16:16 UTC|newest]
Thread overview: 50+ messages / expand[flat|nested] mbox.gz Atom feed top
2013-10-07 15:40 [PATCH 0/2] ARM/arm64: KVM: Yield CPU when vcpu executes a WFE Marc Zyngier
2013-10-07 15:40 ` Marc Zyngier
2013-10-07 15:40 ` [PATCH 1/2] ARM: " Marc Zyngier
2013-10-07 15:40 ` Marc Zyngier
2013-10-07 16:04 ` Alexander Graf
2013-10-07 16:04 ` Alexander Graf
2013-10-07 16:16 ` Marc Zyngier [this message]
2013-10-07 16:16 ` Marc Zyngier
2013-10-07 16:30 ` Alexander Graf
2013-10-07 16:30 ` Alexander Graf
2013-10-07 16:53 ` Gleb Natapov
2013-10-07 16:53 ` Gleb Natapov
2013-10-09 13:09 ` Alexander Graf
2013-10-09 13:09 ` Alexander Graf
2013-10-09 13:26 ` Gleb Natapov
2013-10-09 13:26 ` Gleb Natapov
2013-10-09 14:18 ` Marc Zyngier
2013-10-09 14:18 ` Marc Zyngier
2013-10-09 14:50 ` Anup Patel
2013-10-09 14:50 ` Anup Patel
2013-10-09 14:52 ` Anup Patel
2013-10-09 14:52 ` Anup Patel
2013-10-09 14:59 ` Marc Zyngier
2013-10-09 14:59 ` Marc Zyngier
2013-10-09 15:10 ` Anup Patel
2013-10-09 15:10 ` Anup Patel
2013-10-09 15:17 ` Marc Zyngier
2013-10-09 15:17 ` Marc Zyngier
2013-10-09 15:17 ` Anup Patel
2013-10-09 15:17 ` Anup Patel
2013-10-07 16:55 ` Marc Zyngier
2013-10-07 16:55 ` Marc Zyngier
2013-10-08 11:26 ` Raghavendra KT
2013-10-08 11:26 ` Raghavendra KT
2013-10-08 12:43 ` Marc Zyngier
2013-10-08 12:43 ` Marc Zyngier
2013-10-08 15:02 ` Raghavendra K T
2013-10-08 15:02 ` Raghavendra K T
2013-10-08 15:06 ` Marc Zyngier
2013-10-08 15:06 ` Marc Zyngier
2013-10-08 15:13 ` Raghavendra K T
2013-10-08 15:13 ` Raghavendra K T
2013-10-08 16:09 ` Marc Zyngier
2013-10-08 16:09 ` Marc Zyngier
2013-10-07 15:40 ` [PATCH 2/2] arm64: " Marc Zyngier
2013-10-07 15:40 ` Marc Zyngier
2013-10-07 15:52 ` Bhushan Bharat-R65777
2013-10-07 15:52 ` Bhushan Bharat-R65777
2013-10-07 16:00 ` Marc Zyngier
2013-10-07 16:00 ` Marc Zyngier
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=5252DE5E.6060700@arm.com \
--to=marc.zyngier@arm.com \
--cc=linux-arm-kernel@lists.infradead.org \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.