* [PATCH] improve scalability of 2.4.21 fpassist
@ 2003-07-03 20:36 John Hawkes
2003-07-29 23:06 ` Bjorn Helgaas
0 siblings, 1 reply; 2+ messages in thread
From: John Hawkes @ 2003-07-03 20:36 UTC (permalink / raw)
To: linux-ia64
This change has already made its way into linux-2.5.72-ia64-031619.diff.
Here is the diff relative to 2.4.21 + linux-2.4.21-ia64-030702.diff.
This patch *significantly* improves the scalability of handle_fpu_swa()
when multiple CPUs are generating concurrent fp-assist traps.
The original algorithm produces cacheline ping-ponging for the cacheline
that contains fpu_swa_count. This new algorithm only dirties
fpu_sw_count when the printk() is about to occur, rather than on every
fpassist trap, which throttles the ping-ponging to no more than five per
second.
John Hawkes
diff -X /home/hawkes/Patches/ignore.dirs -Naur linux-2.4.21-ia64/arch/ia64/kernel/traps.c linux-2.4.21-ia64-fp/arch/ia64/kernel/traps.c
--- linux-2.4.21-ia64/arch/ia64/kernel/traps.c Thu Jul 3 13:10:46 2003
+++ linux-2.4.21-ia64-fp/arch/ia64/kernel/traps.c Thu Jul 3 13:15:42 2003
@@ -335,8 +335,9 @@
if (jiffies - last_time > 5*HZ)
fpu_swa_count = 0;
- if ((++fpu_swa_count < 5) && !(current->thread.flags & IA64_THREAD_FPEMU_NOPRINT)) {
+ if ((fpu_swa_count < 4) && !(current->thread.flags & IA64_THREAD_FPEMU_NOPRINT)) {
last_time = jiffies;
+ ++fpu_swa_count;
printk(KERN_WARNING "%s(%d): floating-point assist fault at ip %016lx, isr %016lx\n",
current->comm, current->pid, regs->cr_iip + ia64_psr(regs)->ri, isr);
}
^ permalink raw reply [flat|nested] 2+ messages in thread* Re: [PATCH] improve scalability of 2.4.21 fpassist
2003-07-03 20:36 [PATCH] improve scalability of 2.4.21 fpassist John Hawkes
@ 2003-07-29 23:06 ` Bjorn Helgaas
0 siblings, 0 replies; 2+ messages in thread
From: Bjorn Helgaas @ 2003-07-29 23:06 UTC (permalink / raw)
To: linux-ia64
On Thursday 03 July 2003 2:36 pm, John Hawkes wrote:
> This change has already made its way into linux-2.5.72-ia64-031619.diff.
> Here is the diff relative to 2.4.21 + linux-2.4.21-ia64-030702.diff.
>
> This patch *significantly* improves the scalability of handle_fpu_swa()
> when multiple CPUs are generating concurrent fp-assist traps.
> The original algorithm produces cacheline ping-ponging for the cacheline
> that contains fpu_swa_count. This new algorithm only dirties
> fpu_sw_count when the printk() is about to occur, rather than on every
> fpassist trap, which throttles the ping-ponging to no more than five per
> second.
I applied this for 2.4. Thanks!
Bjorn
^ permalink raw reply [flat|nested] 2+ messages in thread
end of thread, other threads:[~2003-07-29 23:06 UTC | newest]
Thread overview: 2+ messages (download: mbox.gz follow: Atom feed
-- links below jump to the message on this page --
2003-07-03 20:36 [PATCH] improve scalability of 2.4.21 fpassist John Hawkes
2003-07-29 23:06 ` Bjorn Helgaas
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox