public inbox for linux-kernel@vger.kernel.org
* Is notify_die being overloaded?
@ 2006-04-13 19:46 Robin Holt
  2006-04-15  6:19 ` Keith Owens
  2006-04-17 16:45 ` Keshavamurthy Anil S
  0 siblings, 2 replies; 14+ messages in thread
From: Robin Holt @ 2006-04-13 19:46 UTC (permalink / raw)
  To: linux-kernel
  Cc: Andrew Morton, Anil S Keshavamurthy, Keith Owens, Dean Nelson

notify_die seems to be called both to indicate that the machine is going
down and to report trapped events for a process.

Specifically, the following call notify_die for machine-related
events:
ia64_mca_rendez_int_handler (DIE_MCA_RENDZVOUS_ENTER,
	DIE_MCA_RENDZVOUS_PROCESS, DIE_MCA_RENDZVOUS_LEAVE)
ia64_mca_handler (DIE_MCA_MONARCH_ENTER, DIE_MCA_MONARCH_PROCESS,
	DIE_MCA_MONARCH_LEAVE)
ia64_init_handler (DIE_INIT_ENTER,
	DIE_INIT_{SLAVE|MONARCH}_{ENTER|PROCESS|LEAVE})
ia64_mca_init (DIE_MCA_NEW_TIMEOUT)
machine_restart (DIE_MACHINE_RESTART)
machine_halt (DIE_MACHINE_HALT)
die (DIE_OOPS)


The following seem to be process-related:
ia64_bad_break (DIE_BREAK, DIE_FAULT)
ia64_do_page_fault (DIE_PAGE_FAULT)


Shouldn't these really be separated into two separate notifier chains?
One for OS level die() type activity and another for process faults
which a debugger et al. would want to know about?

The specific concern is some testing we have been doing with an upcoming
OSD release.  We see notify_die being called from ia64_do_page_fault
frequently in our performance samples.  On these machines, xpc has
registered a die notifier and therefore callouts are occurring which have
no relationship to a process's page faulting.  XPC is looking for events
which indicate the OS is stopping.  kdb is also installed on this machine
and has registered a die notifier as well.

Thanks,
Robin Holt

* RE: ia64_do_page_fault shows 19.4% slowdown from notify_die.
@ 2006-04-18 23:40 Luck, Tony
  0 siblings, 0 replies; 14+ messages in thread
From: Luck, Tony @ 2006-04-18 23:40 UTC (permalink / raw)
  To: Keshavamurthy, Anil S, Robin Holt
  Cc: Keith Owens, prasanna, ananth, davem, linux-kernel, Andrew Morton

> 499 nSec/fault ia64_do_page_fault notify_die commented out.
> 501 nSec/fault ia64_do_page_fault with nobody registered.
> 533 nSec/fault notify_die in and just kprobes.
> 596 nSec/fault notify_die in and kdb, kprobes, mca, and xpc loaded.
> 
> The 596 nSec/fault is a 19.4% slowdown.  This is an upcoming OSD beta
> kernel.  It will be representative of what our typical customer will
> have loaded.
> 
> Is this enough justification for breaking notify_die into
> notify_page_fault for the fault path?

I didn't see quite the stability from run to run that your results
suggest.  Running the benchmark five times on the same kernel, I saw
the mean value of the 128 results go from as low as 439 to as high
as 445.  So the difference between commenting in/out the notify_die
call is in the noise.

But comparing the first and last of your results shows that there
is significant slowdown when the notify chain is loaded up with a
ton of stuff, way more than the noise that I see, and I'm glad to see
Anil jumping in to fix this.

-Tony


end of thread, other threads:[~2006-04-19 11:11 UTC | newest]

Thread overview: 14+ messages
2006-04-13 19:46 Is notify_die being overloaded? Robin Holt
2006-04-15  6:19 ` Keith Owens
2006-04-15 10:43   ` Robin Holt
2006-04-17  7:52     ` Keith Owens
2006-04-17 10:51       ` Robin Holt
2006-04-17 11:25         ` Robin Holt
2006-04-18  0:23           ` Keith Owens
2006-04-18 22:16             ` ia64_do_page_fault shows 19.4% slowdown from notify_die Robin Holt
2006-04-18 23:03               ` Keshavamurthy Anil S
2006-04-19  0:30               ` Andi Kleen
2006-04-19 11:11                 ` Robin Holt
2006-04-17 16:50       ` Is notify_die being overloaded? Keshavamurthy Anil S
2006-04-17 16:45 ` Keshavamurthy Anil S
  -- strict thread matches above, loose matches on Subject: below --
2006-04-18 23:40 ia64_do_page_fault shows 19.4% slowdown from notify_die Luck, Tony
