From: Marc Gonzalez-Sigler <marc.gonzalez-sigler@inria.fr>
To: linux-ia64@vger.kernel.org
Subject: Re: Consistency problem on IPF
Date: Mon, 12 Jul 2004 11:39:06 +0000 [thread overview]
Message-ID: <40F2785A.10903@inria.fr> (raw)
In-Reply-To: <40F2562C.10208@inria.fr>
Hello Duraid,
Did you get the same standard deviation for NP0 and NQ2?
What page size did you use?
When NP0, I assume most of your programs complete in approximately 0.6
seconds?
Marc
Duraid Madina wrote:
> For the record, on HP-UX 11.23 the standard deviation is down around
> 0.1s for NP0, 512. For N\x1024, it's basically zero. This is on a 2-way
> 1.5G/6M system.
>
> Linux has a way to go yet...
>
> Duraid
>
> Erich Focht wrote:
>
>> Hi Marc,
>>
>> usually, if you can solve the problem at user level it is improbable
>> that someone will provide a solution in the kernel. To my knowledge
>> there is no knob for optimizing memory layout in 2.6 and 2.4 results
>> will be similar. What plaform do you use? If it's NUMA, there are more
>> answers...
>>
>> At user level you could:
>> - try an optimized matrix-matrix multiply (BLAS3 DGEMM function from
>> the Intel MKL (math kernel library) ?). I'd expect that to be coded in
>> such a way that the impact of your data layout is reduced/limited.
>> - try using pages from hugetlbfs and keep all data in one page.
>>
>> Regards,
>> Erich
>>
>> On Monday 12 July 2004 11:13, Marc Gonzalez-Sigler wrote:
>>
>>> Hello,
>>>
>>> Several weeks ago, I wrote a naive matrix-matrix multiply program.
>>>
>>> int main(void)
>>> {
>>> static double A[N][N], B[N][N], C[N][N];
>>>
>>> /* Initialize A and B */
>>>
>>> /* Main loop */
>>> for (i=0; i < N; ++i)
>>> for (j=0; j < N; ++j)
>>> for (k=0; k < N; ++k)
>>> C[i][j] += A[i][k]*B[k][j];
>>>
>>> /* Print the sum of all elements of C */
>>> }
>>>
>>> The system:
>>>
>>> $ cat /proc/cpuinfo
>>> processor : 0
>>> vendor : GenuineIntel
>>> arch : IA-64
>>> family : Itanium 2
>>> model : 1
>>> revision : 5
>>> archrev : 0
>>> features : branchlong
>>> cpu number : 0
>>> cpu regs : 4
>>> cpu MHz : 1300.000000
>>> itc MHz : 1300.000000
>>> BogoMIPS : 1946.15
>>>
>>> processor : 1
>>> [same as processor 0]
>>>
>>> $ uname -a
>>> Linux c64 2.6.6 #2 SMP Thu Jun 10 18:03:20 CEST 2004 ia64 GNU/Linux
>>>
>>>
>>> I started with NQ2, which was a bad idea. I ran the same program
>>> 100 times on an empty system, and saw very different execution times.
>>> I tried to pin the program to a single CPU, but the results were
>>> similar.
>>>
>>> NQ2
>>> MIN = 1.190000
>>> MAX = 11.470000
>>> MEAN = 4.686900
>>> MEDIAN = 1.390000
>>> STDDEV = 4.181866
>>>
>>> OK. NQ2 was probably a pathological case. Let us try NP0.
>>>
>>> NP0
>>> MIN = 0.670000
>>> MAX = 1.770000
>>> MEAN = 1.013100
>>> MEDIAN = 0.670000
>>> STDDEV = 0.466653
>>>
>>> Better, but still quite inconsistent...
>>>
>>> The same experiment on a 3.0 GHz Northwood running 2.4.22
>>>
>>> NP0
>>> MEAN = 1.375200
>>> MEDIAN = 1.375000
>>> STDDEV = 0.002825
>>>
>>> Tony Luck, an Intel engineer, told me on a different list this was a
>>> page-coloring issue. Would you agree? Is there a knob in Linux 2.6 to
>>> request a smarter physical page allocation policy? Do you think I
>>> would get similar results if I used 2.4 instead of 2.6?
next prev parent reply other threads:[~2004-07-12 11:39 UTC|newest]
Thread overview: 10+ messages / expand[flat|nested] mbox.gz Atom feed top
2004-07-12 9:13 Consistency problem on IPF Marc Gonzalez-Sigler
2004-07-12 10:06 ` Erich Focht
2004-07-12 11:12 ` Duraid Madina
2004-07-12 11:24 ` Erich Focht
2004-07-12 11:39 ` Marc Gonzalez-Sigler [this message]
2004-07-12 12:01 ` Marc Gonzalez-Sigler
2004-07-12 20:37 ` David Mosberger
2004-07-12 21:49 ` Duraid Madina
2004-07-12 22:05 ` David Mosberger
2004-07-12 22:08 ` Rich Altmaier
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=40F2785A.10903@inria.fr \
--to=marc.gonzalez-sigler@inria.fr \
--cc=linux-ia64@vger.kernel.org \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox