public inbox for linux-ia64@vger.kernel.org
 help / color / mirror / Atom feed
* speeding up thread-creation
@ 2003-11-21  8:19 David Mosberger
  2003-11-21 17:22 ` Stephane Eranian
  2003-11-21 18:31 ` David Mosberger
  0 siblings, 2 replies; 3+ messages in thread
From: David Mosberger @ 2003-11-21  8:19 UTC (permalink / raw)
  To: linux-ia64

It occurred to me that at present, we're copying lots of state on a
clone2() for absolutely no reason.  Not only that, but the large size
of the "thread_struct" probably also causes poor cache-locality since
the task-structure is effectively split in two, with a large unused
gap in between.  I think it might make sense to move all the large
thread_struct-state (IA-32 registers, pmcs[], pmds[], dbr[], ibr[],
and fph[]) into a separate "thread_lazy" structure and then put that
structure at a place where it doesn't hurt (perhaps above the
thread_info structure).  If I counted right, this state accounts for
2KB so not copying it in copy_process() ought to speed up
thread-creation significantly and avoid stomping needlessly on the L1
d-cache.

Anyone interested in playing with this idea?

	--david

^ permalink raw reply	[flat|nested] 3+ messages in thread

end of thread, other threads:[~2003-11-21 18:31 UTC | newest]

Thread overview: 3+ messages (download: mbox.gz follow: Atom feed
-- links below jump to the message on this page --
2003-11-21  8:19 speeding up thread-creation David Mosberger
2003-11-21 17:22 ` Stephane Eranian
2003-11-21 18:31 ` David Mosberger

This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox