All of lore.kernel.org
 help / color / mirror / Atom feed
From: Jamie Lokier <jamie@shareable.org>
To: Tony Lindgren <tony@atomide.com>
Cc: linux-arm-kernel@lists.infradead.org, linux-omap@vger.kernel.org
Subject: Re: [PATCH 1/2] arm: Replace CONFIG_HAS_TLS_REG with HWCAP_TLS and check for it on V6
Date: Wed, 23 Jun 2010 14:36:36 +0100	[thread overview]
Message-ID: <20100623133636.GC7058@shareable.org> (raw)
In-Reply-To: <20100623073912.GA12255@atomide.com>

Tony Lindgren wrote:
> * Jamie Lokier <jamie@shareable.org> [100622 19:54]:
> > Tony Lindgren wrote:
> > >  __kuser_get_tls:				@ 0xffff0fe0
> > > -
> > > -#if !defined(CONFIG_HAS_TLS_REG) && !defined(CONFIG_TLS_REG_EMUL)
> > > -	ldr	r0, [pc, #(16 - 8)]		@ TLS stored at 0xffff0ff0
> > > -#else
> > > -	mrc	p15, 0, r0, c13, c0, 3		@ read TLS register
> > > -#endif
> > > +	ldr     r0, [pc, #(20 - 8)]		@ software TLS set in 0xffff0ff4?
> > > +	cmp	r0, #0				@ hardware TLS if flag not set
> > > +	mrceq	p15, 0, r0, c13, c0, 3		@ read hardware TLS register
> > > +	ldrne	r0, [pc, #(12 - 8)]		@ software TLS val at 0xffff0ff8
> > >  	usr_ret	lr
> > > -
> > > -	.rep	5
> > > -	.word	0			@ pad up to __kuser_helper_version
> > > -	.endr
> > > +	.word	0				@ non-zero for software TLS
> > > +	.word	0				@ software TLS value
> > 
> > It'd be nice not to waste instructions checking for HWCAP_TLS on archs
> > which definitely don't have it.  I guess it doesn't matter elsewhere;
> > I'd expect this to be a warm path for some programs making extensive
> > use of TLS (I haven't measured though).
> 
> OK, but let's try to figure out a way that does not add more ifdef else
> code as that makes it harder to build support for multiple ARM cores.
>  
> > As it's only a single instruction, and the code is in a writable page
> > already (copied at init), how about just patching the instruction
> > when ELF_HWCAP is set?
> 
> Yeah that can be done for __kuser_get_tls if it's always writable.
> But __switch_to is trickier because of the CONFIG_MMU ifdefs there.

__kuser_get_tls must be writable in kernels where !HAS_TLS_REG is
supported, because the TLS value is written to the same page.

I was thinking of changing *only* __kuser_get_tls, by the way.  Out of
all the different places, that's the only one I'd expect to be a hot
path in some TLS-using programs.

> What if we have optional __switch_to and __kuser_get_tls implementations
> in the mm/proc-*.S files that get copied over the current locations
> if implemented?

As __kuser_get_tls varies by only ones instruction, I don't think
there's any point doing anything other than a single word write, at
the point where the HWCAP is set, with its initial value being the
!HAS_TLS_REG instruction.

For other things like __switch_to and maybe cache maintenance calls,
dmb() etc, I'd suggest first doing a generic asm mechanism like x86's
"alternatives", keeping in mind that one of the alternatives has to be
XIP friendly.

-- Jamie

WARNING: multiple messages have this Message-ID (diff)
From: jamie@shareable.org (Jamie Lokier)
To: linux-arm-kernel@lists.infradead.org
Subject: [PATCH 1/2] arm: Replace CONFIG_HAS_TLS_REG with HWCAP_TLS and check for it on V6
Date: Wed, 23 Jun 2010 14:36:36 +0100	[thread overview]
Message-ID: <20100623133636.GC7058@shareable.org> (raw)
In-Reply-To: <20100623073912.GA12255@atomide.com>

Tony Lindgren wrote:
> * Jamie Lokier <jamie@shareable.org> [100622 19:54]:
> > Tony Lindgren wrote:
> > >  __kuser_get_tls:				@ 0xffff0fe0
> > > -
> > > -#if !defined(CONFIG_HAS_TLS_REG) && !defined(CONFIG_TLS_REG_EMUL)
> > > -	ldr	r0, [pc, #(16 - 8)]		@ TLS stored at 0xffff0ff0
> > > -#else
> > > -	mrc	p15, 0, r0, c13, c0, 3		@ read TLS register
> > > -#endif
> > > +	ldr     r0, [pc, #(20 - 8)]		@ software TLS set in 0xffff0ff4?
> > > +	cmp	r0, #0				@ hardware TLS if flag not set
> > > +	mrceq	p15, 0, r0, c13, c0, 3		@ read hardware TLS register
> > > +	ldrne	r0, [pc, #(12 - 8)]		@ software TLS val at 0xffff0ff8
> > >  	usr_ret	lr
> > > -
> > > -	.rep	5
> > > -	.word	0			@ pad up to __kuser_helper_version
> > > -	.endr
> > > +	.word	0				@ non-zero for software TLS
> > > +	.word	0				@ software TLS value
> > 
> > It'd be nice not to waste instructions checking for HWCAP_TLS on archs
> > which definitely don't have it.  I guess it doesn't matter elsewhere;
> > I'd expect this to be a warm path for some programs making extensive
> > use of TLS (I haven't measured though).
> 
> OK, but let's try to figure out a way that does not add more ifdef else
> code as that makes it harder to build support for multiple ARM cores.
>  
> > As it's only a single instruction, and the code is in a writable page
> > already (copied at init), how about just patching the instruction
> > when ELF_HWCAP is set?
> 
> Yeah that can be done for __kuser_get_tls if it's always writable.
> But __switch_to is trickier because of the CONFIG_MMU ifdefs there.

__kuser_get_tls must be writable in kernels where !HAS_TLS_REG is
supported, because the TLS value is written to the same page.

I was thinking of changing *only* __kuser_get_tls, by the way.  Out of
all the different places, that's the only one I'd expect to be a hot
path in some TLS-using programs.

> What if we have optional __switch_to and __kuser_get_tls implementations
> in the mm/proc-*.S files that get copied over the current locations
> if implemented?

As __kuser_get_tls varies by only ones instruction, I don't think
there's any point doing anything other than a single word write, at
the point where the HWCAP is set, with its initial value being the
!HAS_TLS_REG instruction.

For other things like __switch_to and maybe cache maintenance calls,
dmb() etc, I'd suggest first doing a generic asm mechanism like x86's
"alternatives", keeping in mind that one of the alternatives has to be
XIP friendly.

-- Jamie

  parent reply	other threads:[~2010-06-23 13:36 UTC|newest]

Thread overview: 66+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2010-06-21 13:51 [PATCH 0/2] Make ARMv6 behave with TLS, VFPv3, and NEON Tony Lindgren
2010-06-21 13:51 ` Tony Lindgren
2010-06-21 13:51 ` [PATCH 1/2] arm: Replace CONFIG_HAS_TLS_REG with HWCAP_TLS and check for it on V6 Tony Lindgren
2010-06-21 13:51   ` Tony Lindgren
2010-06-22  9:28   ` Tony Lindgren
2010-06-22  9:28     ` Tony Lindgren
2010-06-22 17:00   ` Jamie Lokier
2010-06-22 17:00     ` Jamie Lokier
2010-06-23  7:39     ` Tony Lindgren
2010-06-23  7:39       ` Tony Lindgren
2010-06-23  8:12       ` Russell King - ARM Linux
2010-06-23  8:12         ` Russell King - ARM Linux
2010-06-23  9:28         ` Tony Lindgren
2010-06-23  9:28           ` Tony Lindgren
2010-06-23  9:32           ` Russell King - ARM Linux
2010-06-23  9:32             ` Russell King - ARM Linux
2010-06-23 13:28           ` Jamie Lokier
2010-06-23 13:28             ` Jamie Lokier
2010-06-23 13:36       ` Jamie Lokier [this message]
2010-06-23 13:36         ` Jamie Lokier
2010-06-23 14:19         ` Nicolas Pitre
2010-06-23 14:19           ` Nicolas Pitre
2010-06-24  0:28           ` Jamie Lokier
2010-06-24  0:28             ` Jamie Lokier
2010-06-29 14:18         ` Tony Lindgren
2010-06-29 14:18           ` Tony Lindgren
2010-06-29 19:20           ` Nicolas Pitre
2010-06-29 19:20             ` Nicolas Pitre
2010-06-30 11:08             ` Tony Lindgren
2010-06-30 11:08               ` Tony Lindgren
2010-06-30 13:17               ` Tony Lindgren
2010-06-30 13:17                 ` Tony Lindgren
2010-06-30 14:42                 ` Nicolas Pitre
2010-06-30 14:42                   ` Nicolas Pitre
2010-07-01  9:25                   ` Tony Lindgren
2010-07-01  9:25                     ` Tony Lindgren
2010-07-01 17:40                     ` Jamie Lokier
2010-07-01 17:40                       ` Jamie Lokier
2010-07-02  2:37                       ` Nicolas Pitre
2010-07-02  2:37                         ` Nicolas Pitre
2010-07-02 10:37                         ` Tony Lindgren
2010-07-02 10:37                           ` Tony Lindgren
2010-07-05 13:55                           ` Tony Lindgren
2010-07-05 13:55                             ` Tony Lindgren
2011-04-08  3:39                             ` Li Li
2011-04-08  3:39                               ` Li Li
2011-04-08 13:19                               ` Nicolas Pitre
2011-04-08 13:19                                 ` Nicolas Pitre
2011-04-08 13:35                                 ` Li Li
2011-04-08 13:35                                   ` Li Li
2011-04-08 14:35                                   ` Jamie Lokier
2011-04-08 14:35                                     ` Jamie Lokier
2011-04-08 14:40                                     ` Li Li
2011-04-08 14:40                                       ` Li Li
2010-06-21 13:51 ` [PATCH 2/2] arm: Make VFPv3 usable on ARMv6 Tony Lindgren
2010-06-21 13:51   ` Tony Lindgren
2010-06-22 12:59   ` Catalin Marinas
2010-06-22 12:59     ` Catalin Marinas
2010-06-22 13:20     ` Tony Lindgren
2010-06-22 13:20       ` Tony Lindgren
2010-06-23  7:57       ` Tony Lindgren
2010-06-23  7:57         ` Tony Lindgren
2010-06-25 13:50         ` Catalin Marinas
2010-06-25 13:50           ` Catalin Marinas
2010-07-01 12:42           ` Tony Lindgren
2010-07-01 12:42             ` Tony Lindgren

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=20100623133636.GC7058@shareable.org \
    --to=jamie@shareable.org \
    --cc=linux-arm-kernel@lists.infradead.org \
    --cc=linux-omap@vger.kernel.org \
    --cc=tony@atomide.com \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.