public inbox for linux-kernel@vger.kernel.org
 help / color / mirror / Atom feed
* [BUG]  Serious scheduler starvation
@ 2003-08-17 12:11 Måns Rullgård
  2003-08-17 12:52 ` Con Kolivas
  2003-08-17 12:55 ` Daniel Phillips
  0 siblings, 2 replies; 6+ messages in thread
From: Måns Rullgård @ 2003-08-17 12:11 UTC (permalink / raw)
  To: linux-kernel


I'm reposting this, since I got no response last time.

First the machine details.  It's a Pentium4 running at 2 GHz.  Linux
version 2.6.0-test3 + O16int + softrr.

I just experienced something that might be a scheduler problem.  I was
working in XEmacs, when suddenly the machine became very
unresponsive.  The mouse pointer in X moved sporadically.  I could
switch to a text console and log in, though typing lagged tens of
seconds.  Switching between text consoles was fast, though.  I killed
xemacs, and the system was back to normal.  Further investigation
showed that xemacs was stuck in a nasty regexp match.  If I was quick
enough, I could interrupt it with C-g.

With X and the window manager reniced to -10, they seem to be able to
get their job done.  This leads me to believe that maybe xemacs is
considered interactive, and given too high priority when it suddenly
starts burning the cpu.

I'll try it later with other kernel versions, but right now I don't
want to reboot.

What can I do to collect more information about the problem?

-- 
Måns Rullgård
mru@users.sf.net

^ permalink raw reply	[flat|nested] 6+ messages in thread

* Re: [BUG]  Serious scheduler starvation
  2003-08-17 12:11 [BUG] Serious scheduler starvation Måns Rullgård
@ 2003-08-17 12:52 ` Con Kolivas
  2003-08-17 13:17   ` Måns Rullgård
  2003-08-17 13:43   ` Daniel Phillips
  2003-08-17 12:55 ` Daniel Phillips
  1 sibling, 2 replies; 6+ messages in thread
From: Con Kolivas @ 2003-08-17 12:52 UTC (permalink / raw)
  To: Måns Rullgård, linux-kernel

On Sun, 17 Aug 2003 22:11, Måns Rullgård wrote:
> I'm reposting this, since I got no response last time.
>
> First the machine details.  It's a Pentium4 running at 2 GHz.  Linux
> version 2.6.0-test3 + O16int + softrr.

Softrr ? Which patch? Davide's? Noone has tried to make them compatible 
(yet?). Even so, this may be unrelated to softrr.

> I just experienced something that might be a scheduler problem.  I was

Almost certainly is.

> working in XEmacs, when suddenly the machine became very
> unresponsive.  The mouse pointer in X moved sporadically.  I could
> switch to a text console and log in, though typing lagged tens of
> seconds.  Switching between text consoles was fast, though.  I killed
> xemacs, and the system was back to normal.  Further investigation
> showed that xemacs was stuck in a nasty regexp match.  If I was quick
> enough, I could interrupt it with C-g.
>
> With X and the window manager reniced to -10, they seem to be able to
> get their job done.  This leads me to believe that maybe xemacs is
> considered interactive, and given too high priority when it suddenly
> starts burning the cpu.
>
> I'll try it later with other kernel versions, but right now I don't
> want to reboot.
>
> What can I do to collect more information about the problem?

Run top in batch mode as root reniced to -11 so it doesn't get preempted and 
capture it happening before you kill XEmacs. Then try running XEmacs niced 
+10 and see if it doesn't happen there. Also if it was lucky enough that you 
booted with profiling enabled you could profile it, but top will tell if it's 
a simple scheduler starvation error. 

Con


^ permalink raw reply	[flat|nested] 6+ messages in thread

* Re: [BUG]  Serious scheduler starvation
  2003-08-17 12:11 [BUG] Serious scheduler starvation Måns Rullgård
  2003-08-17 12:52 ` Con Kolivas
@ 2003-08-17 12:55 ` Daniel Phillips
  1 sibling, 0 replies; 6+ messages in thread
From: Daniel Phillips @ 2003-08-17 12:55 UTC (permalink / raw)
  To: Måns Rullgård, linux-kernel

On Sunday 17 August 2003 14:11, Måns Rullgård wrote:
> What can I do to collect more information about the problem?

Look in top's "PRI" column, which is where you see the effects of the dynamic 
priority adjustment.

Regards,

Daniel


^ permalink raw reply	[flat|nested] 6+ messages in thread

* Re: [BUG]  Serious scheduler starvation
  2003-08-17 12:52 ` Con Kolivas
@ 2003-08-17 13:17   ` Måns Rullgård
  2003-08-17 13:43   ` Daniel Phillips
  1 sibling, 0 replies; 6+ messages in thread
From: Måns Rullgård @ 2003-08-17 13:17 UTC (permalink / raw)
  To: linux-kernel

Con Kolivas <kernel@kolivas.org> writes:

>> First the machine details.  It's a Pentium4 running at 2 GHz.  Linux
>> version 2.6.0-test3 + O16int + softrr.
>
> Softrr ? Which patch? Davide's? Noone has tried to make them compatible 
> (yet?). Even so, this may be unrelated to softrr.

Are there more than one.  I'm using something off xmailserver.org.
Anyhow, no softrr tasks were running at the time.

>> What can I do to collect more information about the problem?
>
> Run top in batch mode as root reniced to -11 so it doesn't get preempted and 
> capture it happening before you kill XEmacs. Then try running XEmacs niced 
> +10 and see if it doesn't happen there. Also if it was lucky enough that you 
> booted with profiling enabled you could profile it, but top will
> tell if it's a simple scheduler starvation error.

I'll do that, it's easily reproducible, at least.

-- 
Måns Rullgård
mru@users.sf.net


^ permalink raw reply	[flat|nested] 6+ messages in thread

* Re: [BUG]  Serious scheduler starvation
  2003-08-17 12:52 ` Con Kolivas
  2003-08-17 13:17   ` Måns Rullgård
@ 2003-08-17 13:43   ` Daniel Phillips
  2003-08-17 13:49     ` Con Kolivas
  1 sibling, 1 reply; 6+ messages in thread
From: Daniel Phillips @ 2003-08-17 13:43 UTC (permalink / raw)
  To: Con Kolivas, Måns Rullgård, linux-kernel

On Sunday 17 August 2003 14:52, Con Kolivas wrote:
> On Sun, 17 Aug 2003 22:11, Måns Rullgård wrote:
> > First the machine details.  It's a Pentium4 running at 2 GHz.  Linux
> > version 2.6.0-test3 + O16int + softrr.
>
> Softrr ? Which patch? Davide's? Noone has tried to make them compatible
> (yet?). Even so, this may be unrelated to softrr.

Almost certainly unrelated, since there is no effect unless he runs SCHED_RR 
applications as non-root.

Obviously, he should back the patches out one by one when he does get time to 
reboot.

Regards,

Daniel


^ permalink raw reply	[flat|nested] 6+ messages in thread

* Re: [BUG]  Serious scheduler starvation
  2003-08-17 13:43   ` Daniel Phillips
@ 2003-08-17 13:49     ` Con Kolivas
  0 siblings, 0 replies; 6+ messages in thread
From: Con Kolivas @ 2003-08-17 13:49 UTC (permalink / raw)
  To: Daniel Phillips, Måns Rullgård, linux-kernel

On Sun, 17 Aug 2003 23:43, Daniel Phillips wrote:
> On Sunday 17 August 2003 14:52, Con Kolivas wrote:
> > On Sun, 17 Aug 2003 22:11, Måns Rullgård wrote:
> > > First the machine details.  It's a Pentium4 running at 2 GHz.  Linux
> > > version 2.6.0-test3 + O16int + softrr.
> >
> > Softrr ? Which patch? Davide's? Noone has tried to make them compatible
> > (yet?). Even so, this may be unrelated to softrr.
>
> Almost certainly unrelated, since there is no effect unless he runs
> SCHED_RR applications as non-root.
>
> Obviously, he should back the patches out one by one when he does get time
> to reboot.

No need, there is a known issue in my patches that can cause it. Check the 
email I just sent about a similar issue.

Con


^ permalink raw reply	[flat|nested] 6+ messages in thread

end of thread, other threads:[~2003-08-17 13:43 UTC | newest]

Thread overview: 6+ messages (download: mbox.gz follow: Atom feed
-- links below jump to the message on this page --
2003-08-17 12:11 [BUG] Serious scheduler starvation Måns Rullgård
2003-08-17 12:52 ` Con Kolivas
2003-08-17 13:17   ` Måns Rullgård
2003-08-17 13:43   ` Daniel Phillips
2003-08-17 13:49     ` Con Kolivas
2003-08-17 12:55 ` Daniel Phillips

This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox