public inbox for linux-kernel@vger.kernel.org
* Re: smp cputime issues
@ 2002-01-02 13:11 Martin Knoblauch
  2002-01-02 15:07 ` M. Edward Borasky
From: Martin Knoblauch @ 2002-01-02 13:11 UTC
  To: linux-kernel; +Cc: hauan

> smp cputime issues
> 
> 
> hello,
> 
>   we are encountering some weird timing behaviour on our linux cluster.
> 
>   specifically: when running 2 copies of selected programs on a
>   dual-cpu system, the cputime reported for each process is up to 25%
>   higher than when the processes are run on their own. however, if running
>   two different jobs on the same machine, both complete with a cputime
>   equal to when run individually. sample timing output attached.
> 
>   profiling confirms that everything slows down approximately to scale.
>   the results reproduce on a range of different machines (see below).
> 
> additional specifications:
>   - kernel version 2.4.16 (with apic enabled)
>   - chipsets: apollo pro 133, apollo pro 266,
>               intel i860, serverworks LE
>   - all jobs require less than 1/10 of physical memory
>   - no significant disk i/o takes place
>   - timing with dtime(), /usr/bin/time and shell built-in time
>   - this behavior is NOT seen for all applications. the worst
>     "offender" spends most of its time doing linear algebra.
> 
>   ideas or info-pointers appreciated. more specs available on request.
> 

 A few points. First, for clarification: do you see the effect on
elapsed time as well? Or are you saying that the CPU time reporting is
broken?

 Second, you mention that you see the effect mainly on linear algebra
code. Could it be that you are memory-bandwidth limited when you run two
copies together? Are you using Intel CPUs (my guess), which have a shared
front-side bus (FSB) that may make memory bandwidth scaling a problem, or
AMD Athlons, which use the Alpha/EV6 bus and should be a bit more friendly?

 Finally, how big is "1/10th of physical" memory? What kind of memory
is it?

Martin
-- 
+-----------------------------------------------------+
|Martin Knoblauch                                     |
|-----------------------------------------------------|
|http://www.knobisoft.de/cats                         |
|-----------------------------------------------------|
|e-mail: knobi@knobisoft.de                           |
+-----------------------------------------------------+
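
[editorial sketch, not part of the original message]

The first question above (is the slowdown visible in elapsed time, or
only in the reported CPU time?) can be checked by recording both values
for the same run. The following is a minimal sketch, assuming a Unix
system with Python available; the helper name time_command is
hypothetical and is not something used in the thread.

```python
import resource
import subprocess
import time


def time_command(argv):
    """Run argv as a child process and return (wall, user, system) seconds.

    Wall time comes from a monotonic clock in the parent; user and system
    CPU time come from the change in RUSAGE_CHILDREN across the run, which
    is updated once the child has been waited for.
    """
    before = resource.getrusage(resource.RUSAGE_CHILDREN)
    start = time.perf_counter()
    subprocess.run(argv, check=True)  # blocks until the child exits
    wall = time.perf_counter() - start
    after = resource.getrusage(resource.RUSAGE_CHILDREN)
    user = after.ru_utime - before.ru_utime
    system = after.ru_stime - before.ru_stime
    return wall, user, system
```

Calling time_command(["./ipopt", "robot_2000.nl"]) once alone and once
while a second copy is running would give the two sets of numbers to
compare. If wall time grows together with user time, the job is
genuinely running slower; if only the CPU figures grow, the accounting
would be the more likely suspect.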

* smp cputime issues
@ 2002-01-02  1:00 Steinar Hauan
  2002-01-02  1:31 ` M. Edward Borasky
From: Steinar Hauan @ 2002-01-02  1:00 UTC
  To: linux-kernel

[-- Attachment #1: Type: TEXT/PLAIN, Size: 1196 bytes --]

hello,

  we are encountering some weird timing behaviour on our linux cluster.

  specifically: when running 2 copies of selected programs on a
  dual-cpu system, the cputime reported for each process is up to 25%
  higher than when the processes are run on their own. however, if running
  two different jobs on the same machine, both complete with a cputime
  equal to when run individually. sample timing output attached.

  profiling confirms that everything slows down approximately to scale.
  the results reproduce on a range of different machines (see below).

additional specifications:
  - kernel version 2.4.16 (with apic enabled)
  - chipsets: apollo pro 133, apollo pro 266,
              intel i860, serverworks LE
  - all jobs require less than 1/10 of physical memory
  - no significant disk i/o takes place
  - timing with dtime(), /usr/bin/time and shell built-in time
  - this behavior is NOT seen for all applications. the worst
    "offender" spends most of its time doing linear algebra.

  ideas or info-pointers appreciated. more specs available on request.

regards,
--
  Steinar Hauan, dept of ChemE  --  hauan@cmu.edu
  Carnegie Mellon University, Pittsburgh PA, USA

[-- Attachment #2: Type: TEXT/PLAIN, Size: 936 bytes --]

output from running a single image copy

[reported by dtime()]

CPU seconds spent in IPOPT and function evaluations =     131.9999982

[reported by /usr/bin/time -v ]

	Command being timed: "./ipopt robot_2000.nl"
	User time (seconds): 134.01
	System time (seconds): 0.36
	Percent of CPU this job got: 99%
	Elapsed (wall clock) time (h:mm:ss or m:ss): 2:14.42
	Average shared text size (kbytes): 0
	Average unshared data size (kbytes): 0
	Average stack size (kbytes): 0
	Average total size (kbytes): 0
	Maximum resident set size (kbytes): 0
	Average resident set size (kbytes): 0
	Major (requiring I/O) page faults: 293
	Minor (reclaiming a frame) page faults: 23352
	Voluntary context switches: 0
	Involuntary context switches: 0
	Swaps: 0
	File system inputs: 0
	File system outputs: 0
	Socket messages sent: 0
	Socket messages received: 0
	Signals delivered: 0
	Page size (bytes): 4096
	Exit status: 0

[-- Attachment #3: Type: TEXT/PLAIN, Size: 940 bytes --]

output from running two images simultaneously

[reported by dtime()]

CPU seconds spent in IPOPT and function evaluations =     157.7000024

[reported by /usr/bin/time -v ]
	Command being timed: "./ipopt robot_2000.nl"
	User time (seconds): 159.81
	System time (seconds): 0.50
	Percent of CPU this job got: 99%
	Elapsed (wall clock) time (h:mm:ss or m:ss): 2:40.41
	Average shared text size (kbytes): 0
	Average unshared data size (kbytes): 0
	Average stack size (kbytes): 0
	Average total size (kbytes): 0
	Maximum resident set size (kbytes): 0
	Average resident set size (kbytes): 0
	Major (requiring I/O) page faults: 293
	Minor (reclaiming a frame) page faults: 23352
	Voluntary context switches: 0
	Involuntary context switches: 0
	Swaps: 0
	File system inputs: 0
	File system outputs: 0
	Socket messages sent: 0
	Socket messages received: 0
	Signals delivered: 0
	Page size (bytes): 4096
	Exit status: 0
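
[editorial sketch, not part of the original message]

The two attachments above can be compared directly. The sketch below
only does arithmetic on the figures reported in them; the helper name
parse_elapsed and the dictionary layout are illustrative assumptions.

```python
def parse_elapsed(text):
    """Convert an 'h:mm:ss' or 'm:ss.ss' string to seconds."""
    seconds = 0.0
    for part in text.split(":"):
        seconds = seconds * 60 + float(part)
    return seconds


# Figures copied from the single-copy attachment.
single = {
    "dtime": 131.9999982,
    "user": 134.01,
    "elapsed": parse_elapsed("2:14.42"),
}

# Figures copied from the two-copies attachment.
dual = {
    "dtime": 157.7000024,
    "user": 159.81,
    "elapsed": parse_elapsed("2:40.41"),
}

# Relative increase of each figure, in percent.
slowdown = {key: 100.0 * (dual[key] / single[key] - 1.0) for key in single}

for key, value in slowdown.items():
    print(f"{key:8s} {value:5.1f}%")
```

All three figures rise by roughly 19 percent, and the elapsed time rises
by about the same fraction as the user time. That would suggest the job
really does run slower when two copies share the machine, rather than
the CPU time being misreported, though two runs are not enough to be
sure.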

