public inbox for linux-ia64@vger.kernel.org
 help / color / mirror / Atom feed
* speeding up thread-creation
@ 2003-11-21  8:19 David Mosberger
  2003-11-21 17:22 ` Stephane Eranian
  2003-11-21 18:31 ` David Mosberger
  0 siblings, 2 replies; 3+ messages in thread
From: David Mosberger @ 2003-11-21  8:19 UTC (permalink / raw)
  To: linux-ia64

It occurred to me that at present, we're copying lots of state on a
clone2() for absolutely no reason.  Not only that, but the large size
of the "thread_struct" probably also causes poor cache-locality since
the task-structure is effectively split in two, with a large unused
gap in between.  I think it might make sense to move all the large
thread_struct-state (IA-32 registers, pmcs[], pmds[], dbr[], ibr[],
and fph[]) into a separate "thread_lazy" structure and then put that
structure at a place where it doesn't hurt (perhaps above the
thread_info structure).  If I counted right, this state accounts for
2KB so not copying it in copy_process() ought to speed up
thread-creation significantly and avoid stomping needlessly on the L1
d-cache.

Anyone interested in playing with this idea?

	--david

^ permalink raw reply	[flat|nested] 3+ messages in thread

* Re: speeding up thread-creation
  2003-11-21  8:19 speeding up thread-creation David Mosberger
@ 2003-11-21 17:22 ` Stephane Eranian
  2003-11-21 18:31 ` David Mosberger
  1 sibling, 0 replies; 3+ messages in thread
From: Stephane Eranian @ 2003-11-21 17:22 UTC (permalink / raw)
  To: linux-ia64

David,

On Fri, Nov 21, 2003 at 12:19:59AM -0800, David Mosberger wrote:
> It occurred to me that at present, we're copying lots of state on a
> clone2() for absolutely no reason.  Not only that, but the large size
> of the "thread_struct" probably also causes poor cache-locality since
> the task-structure is effectively split in two, with a large unused
> gap in between.  I think it might make sense to move all the large
> thread_struct-state (IA-32 registers, pmcs[], pmds[], dbr[], ibr[],
> and fph[]) into a separate "thread_lazy" structure and then put that
> structure at a place where it doesn't hurt (perhaps above the
> thread_info structure).  If I counted right, this state accounts for
> 2KB so not copying it in copy_process() ought to speed up
> thread-creation significantly and avoid stomping needlessly on the L1
> d-cache.
> 
That looks like an good idea.

I assume you want to rely on the thread's flags to determine if it is worth
copying the thread_lazy structure during a clone. For perfmon, we may need
to have two flags: one that says we are storing information in pmds/pmcs and
one that says we need to context switch the PMU state. Today PM_VALID flag
is used to mean the latter only.

-- 
-Stephane

^ permalink raw reply	[flat|nested] 3+ messages in thread

* Re: speeding up thread-creation
  2003-11-21  8:19 speeding up thread-creation David Mosberger
  2003-11-21 17:22 ` Stephane Eranian
@ 2003-11-21 18:31 ` David Mosberger
  1 sibling, 0 replies; 3+ messages in thread
From: David Mosberger @ 2003-11-21 18:31 UTC (permalink / raw)
  To: linux-ia64

>>>>> On Fri, 21 Nov 2003 09:22:17 -0800, Stephane Eranian <eranian@hpl.hp.com> said:

  Stephane> I assume you want to rely on the thread's flags to
  Stephane> determine if it is worth copying the thread_lazy structure
  Stephane> during a clone.

Yup.  I'd say "necessary" instead of "worth" though.  Also, we know
that the fph partition never needs to be copied (it contains scratch
registers only).

  Stephane> For perfmon, we may need to have two flags: one that says
  Stephane> we are storing information in pmds/pmcs and one that says
  Stephane> we need to context switch the PMU state. Today PM_VALID
  Stephane> flag is used to mean the latter only.

I think that would be fine.  In what cases would you want to clone the
PMU state when PM_VALID is false?

	--david

^ permalink raw reply	[flat|nested] 3+ messages in thread

end of thread, other threads:[~2003-11-21 18:31 UTC | newest]

Thread overview: 3+ messages (download: mbox.gz follow: Atom feed
-- links below jump to the message on this page --
2003-11-21  8:19 speeding up thread-creation David Mosberger
2003-11-21 17:22 ` Stephane Eranian
2003-11-21 18:31 ` David Mosberger

This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox