From: Peter Zijlstra <peterz@infradead.org>
To: Waiman Long <Waiman.Long@hp.com>
Cc: Thomas Gleixner <tglx@linutronix.de>,
Ingo Molnar <mingo@redhat.com>, "H. Peter Anvin" <hpa@zytor.com>,
linux-arch@vger.kernel.org, x86@kernel.org,
linux-kernel@vger.kernel.org,
virtualization@lists.linux-foundation.org,
xen-devel@lists.xenproject.org, kvm@vger.kernel.org,
Paolo Bonzini <paolo.bonzini@gmail.com>,
Konrad Rzeszutek Wilk <konrad.wilk@oracle.com>,
Boris Ostrovsky <boris.ostrovsky@oracle.com>,
"Paul E. McKenney" <paulmck@linux.vnet.ibm.com>,
Rik van Riel <riel@redhat.com>,
Linus Torvalds <torvalds@linux-foundation.org>,
Raghavendra K T <raghavendra.kt@linux.vnet.ibm.com>,
David Vrabel <david.vrabel@citrix.com>,
Oleg Nesterov <oleg@redhat.com>,
Scott J Norton <scott.norton@hp.com>,
Douglas Hatch <doug.hatch@hp.com>
Subject: Re: [PATCH v12 09/11] pvqspinlock, x86: Add para-virtualization support
Date: Fri, 24 Oct 2014 10:47:38 +0200
Message-ID: <20141024084738.GU21513@worktop.programming.kicks-ass.net>
In-Reply-To: <1413483040-58399-10-git-send-email-Waiman.Long@hp.com>
On Thu, Oct 16, 2014 at 02:10:38PM -0400, Waiman Long wrote:
> +static inline void pv_init_node(struct mcs_spinlock *node)
> +{
> +	struct pv_qnode *pn = (struct pv_qnode *)node;
> +
> +	BUILD_BUG_ON(sizeof(struct pv_qnode) > 5*sizeof(struct mcs_spinlock));
> +
> +	if (!pv_enabled())
> +		return;
> +
> +	pn->cpustate = PV_CPU_ACTIVE;
> +	pn->mayhalt = false;
> +	pn->mycpu = smp_processor_id();
> +	pn->head = PV_INVALID_HEAD;
> +}
> @@ -333,6 +393,7 @@ queue:
>  	node += idx;
>  	node->locked = 0;
>  	node->next = NULL;
> +	pv_init_node(node);
>
>  	/*
>  	 * We touched a (possibly) cold cacheline in the per-cpu queue node;
So even if !pv_enabled() the compiler will still have to emit the code
for that inline, which will generate additional register pressure,
icache pressure and lovely stuff like that.

The patch I had used pv-ops for these things that would turn into NOPs
in the regular case and callee-saved function calls for the PV case.

That still does not entirely eliminate the cost, but does reduce it
significantly. Please consider using that.
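
To make the difference concrete, here is a rough userspace sketch of the
shape being suggested. It is not the patch referred to above and not the
kernel's pv-ops API: struct qnode, struct pv_lock_ops_sketch,
pv_ops_sketch_enable() and PV_CPU_ACTIVE_SKETCH are all hypothetical names
made up for illustration. The hot path carries only a call through an ops
slot, and the PV-only initialisation lives out of line. In the kernel the
paravirt patching machinery can rewrite that call site into NOPs on bare
metal; plain C cannot show that step, so here the native case is just an
empty function.

```c
#include <stdbool.h>
#include <stddef.h>

#define PV_CPU_ACTIVE_SKETCH 1	/* hypothetical value, for illustration only */

/* Hypothetical stand-in for the per-CPU queue node; not the kernel layout. */
struct qnode {
	int locked;
	struct qnode *next;
	int cpustate;		/* only meaningful in the PV case */
	bool mayhalt;		/* only meaningful in the PV case */
};

/* Hypothetical ops table, loosely modelled on the idea behind pv-ops. */
struct pv_lock_ops_sketch {
	void (*init_node)(struct qnode *node);
};

/* Native case: nothing to do. The kernel would patch the call into NOPs. */
static void native_init_node(struct qnode *node)
{
	(void)node;
}

/* PV case: the extra state is set up out of line, not in the caller. */
static void pv_init_node_impl(struct qnode *node)
{
	node->cpustate = PV_CPU_ACTIVE_SKETCH;
	node->mayhalt = false;
}

static struct pv_lock_ops_sketch pv_ops_sketch = {
	.init_node = native_init_node,
};

/* Assumed to run once at boot, when a hypervisor has been detected. */
static void pv_ops_sketch_enable(void)
{
	pv_ops_sketch.init_node = pv_init_node_impl;
}

/* Slowpath fragment: the only PV footprint left here is one call. */
static void queue_node_init(struct qnode *node)
{
	node->locked = 0;
	node->next = NULL;
	pv_ops_sketch.init_node(node);
}
```

Calling queue_node_init() before pv_ops_sketch_enable() leaves cpustate
and mayhalt untouched; after it, the same call site initialises them. The
indirect call in this sketch is still a cost in the native case, which is
exactly what the kernel's call-site patching is meant to remove.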