From: will.deacon@arm.com (Will Deacon)
To: linux-arm-kernel@lists.infradead.org
Subject: [PATCH 3/3] rseq/selftests: Add support for arm64
Date: Tue, 26 Jun 2018 16:14:27 +0100 [thread overview]
Message-ID: <20180626151427.GF23375@arm.com> (raw)
In-Reply-To: <501929863.3051.1529950210436.JavaMail.zimbra@efficios.com>
Hi Mathieu,
On Mon, Jun 25, 2018 at 02:10:10PM -0400, Mathieu Desnoyers wrote:
> ----- On Jun 25, 2018, at 1:54 PM, Will Deacon will.deacon at arm.com wrote:
> > +#define __RSEQ_ASM_DEFINE_TABLE(label, version, flags, start_ip, \
> > + post_commit_offset, abort_ip) \
> > + " .pushsection __rseq_table, \"aw\"\n" \
> > + " .balign 32\n" \
> > + __rseq_str(label) ":\n" \
> > + " .long " __rseq_str(version) ", " __rseq_str(flags) "\n" \
> > + " .quad " __rseq_str(start_ip) ", " \
> > + __rseq_str(post_commit_offset) ", " \
> > + __rseq_str(abort_ip) "\n" \
> > + " .popsection\n"
> > +
> > +#define RSEQ_ASM_DEFINE_TABLE(label, start_ip, post_commit_ip, abort_ip) \
> > + __RSEQ_ASM_DEFINE_TABLE(label, 0x0, 0x0, start_ip, \
> > + (post_commit_ip - start_ip), abort_ip)
> > +
> > +#define RSEQ_ASM_STORE_RSEQ_CS(label, cs_label, rseq_cs) \
> > + RSEQ_INJECT_ASM(1) \
> > + " adrp " RSEQ_ASM_TMP_REG ", " __rseq_str(cs_label) "\n" \
> > + " add " RSEQ_ASM_TMP_REG ", " RSEQ_ASM_TMP_REG \
> > + ", :lo12:" __rseq_str(cs_label) "\n" \
> > + " str " RSEQ_ASM_TMP_REG ", %[" __rseq_str(rseq_cs) "]\n" \
> > + __rseq_str(label) ":\n"
> > +
> > +#define RSEQ_ASM_DEFINE_ABORT(label, abort_label) \
> > + " .pushsection __rseq_failure, \"ax\"\n" \
> > + " .long " __rseq_str(RSEQ_SIG) "\n" \
> > + __rseq_str(label) ":\n" \
> > + " b %l[" __rseq_str(abort_label) "]\n" \
> > + " .popsection\n"
>
> Thanks Will for porting rseq to arm64 !
That's ok, it was good fun :)
I'm going to chat with our compiler guys to see if there's any room for
improving the flexibility in the critical section, since having a temporary
in the clobber list is pretty grotty.
> I notice you are using the instructions
>
> adrp
> add
> str
>
> to implement RSEQ_ASM_STORE_RSEQ_CS(). Did you compare
> performance-wise with an approach using a literal pool
> near the instruction pointer like I did on arm32 ?
I didn't, no. Do you have a benchmark to hand so I can give this a go?
The two reasons I didn't go down this route are:
1. It introduces data which is mapped as executable. I don't have a
specific security concern here, but the way things have gone so far
this year, I've realised that I'm not bright enough to anticipate
these things.
2. It introduces a branch over the table on the fast path, which is likely
to have a relatively higher branch misprediction cost on more advanced
CPUs.
I also find it grotty that we emit two tables so that debuggers can cope,
but that's just a cosmetic nit.
> With that approach, this ends up being simply
>
> adr
> str
>
> which provides significantly better performance on my test
> platform over loading a pointer targeting a separate data
> section.
My understanding is that your test platform is based on Cortex-A7, so I'd
be wary about concluding too much about general performance from that CPU
since its a pretty straightforward in-order design.
Will
WARNING: multiple messages have this Message-ID (diff)
From: Will Deacon <will.deacon@arm.com>
To: Mathieu Desnoyers <mathieu.desnoyers@efficios.com>
Cc: linux-arm-kernel <linux-arm-kernel@lists.infradead.org>,
linux-kernel <linux-kernel@vger.kernel.org>,
Arnd Bergmann <arnd@arndb.de>,
Peter Zijlstra <peterz@infradead.org>,
"Paul E. McKenney" <paulmck@linux.vnet.ibm.com>,
Boqun Feng <boqun.feng@gmail.com>,
Catalin Marinas <catalin.marinas@arm.com>,
peter maydell <peter.maydell@linaro.org>,
Mark Rutland <mark.rutland@arm.com>
Subject: Re: [PATCH 3/3] rseq/selftests: Add support for arm64
Date: Tue, 26 Jun 2018 16:14:27 +0100 [thread overview]
Message-ID: <20180626151427.GF23375@arm.com> (raw)
In-Reply-To: <501929863.3051.1529950210436.JavaMail.zimbra@efficios.com>
Hi Mathieu,
On Mon, Jun 25, 2018 at 02:10:10PM -0400, Mathieu Desnoyers wrote:
> ----- On Jun 25, 2018, at 1:54 PM, Will Deacon will.deacon@arm.com wrote:
> > +#define __RSEQ_ASM_DEFINE_TABLE(label, version, flags, start_ip, \
> > + post_commit_offset, abort_ip) \
> > + " .pushsection __rseq_table, \"aw\"\n" \
> > + " .balign 32\n" \
> > + __rseq_str(label) ":\n" \
> > + " .long " __rseq_str(version) ", " __rseq_str(flags) "\n" \
> > + " .quad " __rseq_str(start_ip) ", " \
> > + __rseq_str(post_commit_offset) ", " \
> > + __rseq_str(abort_ip) "\n" \
> > + " .popsection\n"
> > +
> > +#define RSEQ_ASM_DEFINE_TABLE(label, start_ip, post_commit_ip, abort_ip) \
> > + __RSEQ_ASM_DEFINE_TABLE(label, 0x0, 0x0, start_ip, \
> > + (post_commit_ip - start_ip), abort_ip)
> > +
> > +#define RSEQ_ASM_STORE_RSEQ_CS(label, cs_label, rseq_cs) \
> > + RSEQ_INJECT_ASM(1) \
> > + " adrp " RSEQ_ASM_TMP_REG ", " __rseq_str(cs_label) "\n" \
> > + " add " RSEQ_ASM_TMP_REG ", " RSEQ_ASM_TMP_REG \
> > + ", :lo12:" __rseq_str(cs_label) "\n" \
> > + " str " RSEQ_ASM_TMP_REG ", %[" __rseq_str(rseq_cs) "]\n" \
> > + __rseq_str(label) ":\n"
> > +
> > +#define RSEQ_ASM_DEFINE_ABORT(label, abort_label) \
> > + " .pushsection __rseq_failure, \"ax\"\n" \
> > + " .long " __rseq_str(RSEQ_SIG) "\n" \
> > + __rseq_str(label) ":\n" \
> > + " b %l[" __rseq_str(abort_label) "]\n" \
> > + " .popsection\n"
>
> Thanks Will for porting rseq to arm64 !
That's ok, it was good fun :)
I'm going to chat with our compiler guys to see if there's any room for
improving the flexibility in the critical section, since having a temporary
in the clobber list is pretty grotty.
> I notice you are using the instructions
>
> adrp
> add
> str
>
> to implement RSEQ_ASM_STORE_RSEQ_CS(). Did you compare
> performance-wise with an approach using a literal pool
> near the instruction pointer like I did on arm32 ?
I didn't, no. Do you have a benchmark to hand so I can give this a go?
The two reasons I didn't go down this route are:
1. It introduces data which is mapped as executable. I don't have a
specific security concern here, but the way things have gone so far
this year, I've realised that I'm not bright enough to anticipate
these things.
2. It introduces a branch over the table on the fast path, which is likely
to have a relatively higher branch misprediction cost on more advanced
CPUs.
I also find it grotty that we emit two tables so that debuggers can cope,
but that's just a cosmetic nit.
> With that approach, this ends up being simply
>
> adr
> str
>
> which provides significantly better performance on my test
> platform over loading a pointer targeting a separate data
> section.
My understanding is that your test platform is based on Cortex-A7, so I'd
be wary about concluding too much about general performance from that CPU
since its a pretty straightforward in-order design.
Will
next prev parent reply other threads:[~2018-06-26 15:14 UTC|newest]
Thread overview: 24+ messages / expand[flat|nested] mbox.gz Atom feed top
2018-06-25 17:54 [PATCH 0/3] Support rseq on arm64 Will Deacon
2018-06-25 17:54 ` Will Deacon
2018-06-25 17:54 ` [PATCH 1/3] arm64: rseq: Implement backend rseq calls and select HAVE_RSEQ Will Deacon
2018-06-25 17:54 ` Will Deacon
2018-06-26 10:31 ` Mark Rutland
2018-06-26 10:31 ` Mark Rutland
2018-06-25 17:54 ` [PATCH 2/3] asm-generic: unistd.h: Wire up sys_rseq Will Deacon
2018-06-25 17:54 ` Will Deacon
2018-06-25 17:54 ` [PATCH 3/3] rseq/selftests: Add support for arm64 Will Deacon
2018-06-25 17:54 ` Will Deacon
2018-06-25 18:10 ` Mathieu Desnoyers
2018-06-25 18:10 ` Mathieu Desnoyers
2018-06-26 15:14 ` Will Deacon [this message]
2018-06-26 15:14 ` Will Deacon
2018-06-26 16:11 ` Mathieu Desnoyers
2018-06-26 16:11 ` Mathieu Desnoyers
2018-06-28 16:47 ` Will Deacon
2018-06-28 16:47 ` Will Deacon
2018-06-28 20:50 ` Mathieu Desnoyers
2018-06-28 20:50 ` Mathieu Desnoyers
2018-07-02 16:49 ` Will Deacon
2018-07-02 16:49 ` Will Deacon
2018-07-02 17:47 ` Mathieu Desnoyers
2018-07-02 17:47 ` Mathieu Desnoyers
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=20180626151427.GF23375@arm.com \
--to=will.deacon@arm.com \
--cc=linux-arm-kernel@lists.infradead.org \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.