public inbox for linux-kernel@vger.kernel.org
 help / color / mirror / Atom feed
* so, no way to kill process? have to reboot?
@ 2001-10-11 21:05 Christopher Friesen
  2001-10-11 23:08 ` James Sutherland
                   ` (3 more replies)
  0 siblings, 4 replies; 7+ messages in thread
From: Christopher Friesen @ 2001-10-11 21:05 UTC (permalink / raw)
  To: linux-kernel


Well, the unkillable process continues on.  Does nobody else have any ideas on
how to kill an unkillable process in the R state thats sucking up all my unused
cpu cycles?

If not I'm going to have to reboot this thing...

Chris

-- 
Chris Friesen                    | MailStop: 043/33/F10  
Nortel Networks                  | work: (613) 765-0557
3500 Carling Avenue              | fax:  (613) 765-2986
Nepean, ON K2H 8E9 Canada        | email: cfriesen@nortelnetworks.com

^ permalink raw reply	[flat|nested] 7+ messages in thread

* Re: so, no way to kill process? have to reboot?
  2001-10-11 21:05 so, no way to kill process? have to reboot? Christopher Friesen
@ 2001-10-11 23:08 ` James Sutherland
  2001-10-12  6:16 ` Paul Mackerras
                   ` (2 subsequent siblings)
  3 siblings, 0 replies; 7+ messages in thread
From: James Sutherland @ 2001-10-11 23:08 UTC (permalink / raw)
  To: Christopher Friesen; +Cc: linux-kernel

On Thu, 11 Oct 2001, Christopher Friesen wrote:

> Well, the unkillable process continues on.  Does nobody else have any ideas on
> how to kill an unkillable process in the R state thats sucking up all my unused
> cpu cycles?
>
> If not I'm going to have to reboot this thing...

Short term hack: renice it to 20, so it doesn't interfere with normal
workload. Also try sending it a SIGSTOP, although I doubt that will work
here. I think strace will fail the same way gdb does, but try that too...


James.
-- 
"Our attitude with TCP/IP is, `Hey, we'll do it, but don't make a big
system, because we can't fix it if it breaks -- nobody can.'"

"TCP/IP is OK if you've got a little informal club, and it doesn't make
any difference if it takes a while to fix it."
		-- Ken Olson, in Digital News, 1988


^ permalink raw reply	[flat|nested] 7+ messages in thread

* Re: so, no way to kill process? have to reboot?
  2001-10-11 21:05 so, no way to kill process? have to reboot? Christopher Friesen
  2001-10-11 23:08 ` James Sutherland
@ 2001-10-12  6:16 ` Paul Mackerras
  2001-10-12 11:36   ` Terje Eggestad
  2001-10-12  8:22 ` Peter Wächtler
  2001-10-12 13:42 ` so, no way to kill process? have to reboot? Christopher Friesen
  3 siblings, 1 reply; 7+ messages in thread
From: Paul Mackerras @ 2001-10-12  6:16 UTC (permalink / raw)
  To: Christopher Friesen; +Cc: linux-kernel

Christopher Friesen writes:

> Well, the unkillable process continues on.  Does nobody else have any ideas on
> how to kill an unkillable process in the R state thats sucking up all my unused
> cpu cycles?

I would suspect that it is actually looping inside the kernel, which
would mean that there indeed was no way to kill it.  You could try
alt-scrolllock on the console and see if you get a register dump of
it, or maybe one of the alt-sysrq magic keys might give you some
information.  But I suspect that rebooting is ultimately going to be
your only solution.

Paul.


^ permalink raw reply	[flat|nested] 7+ messages in thread

* Re: so, no way to kill process? have to reboot?
  2001-10-11 21:05 so, no way to kill process? have to reboot? Christopher Friesen
  2001-10-11 23:08 ` James Sutherland
  2001-10-12  6:16 ` Paul Mackerras
@ 2001-10-12  8:22 ` Peter Wächtler
  2001-10-12 10:06   ` Ability to kill (was: Re: so, no way to kill process? have to reboot?) vda
  2001-10-12 13:42 ` so, no way to kill process? have to reboot? Christopher Friesen
  3 siblings, 1 reply; 7+ messages in thread
From: Peter Wächtler @ 2001-10-12  8:22 UTC (permalink / raw)
  To: Christopher Friesen; +Cc: linux-kernel

Christopher Friesen wrote:
> 
> Well, the unkillable process continues on.  Does nobody else have any ideas on
> how to kill an unkillable process in the R state thats sucking up all my unused
> cpu cycles?
> 
> If not I'm going to have to reboot this thing...
> 

Well, I'd suspect it in "D" state - waiting for some disk I/O to
finish...

But in "R" with your described behavior looks like a bug.
If you care about the CPU time waisted: what about kill -STOP <pid>?

Can you describe your filesystem layout?
I think of a symlink recursion bug or something wrong in /dev/shm
or alike... (no flame, just guessing :)

What are the parameters of "find"?

^ permalink raw reply	[flat|nested] 7+ messages in thread

* Ability to kill (was: Re: so, no way to kill process? have to reboot?)
  2001-10-12  8:22 ` Peter Wächtler
@ 2001-10-12 10:06   ` vda
  0 siblings, 0 replies; 7+ messages in thread
From: vda @ 2001-10-12 10:06 UTC (permalink / raw)
  To: linux-kernel

PW> Well, I'd suspect it in "D" state - waiting for some disk I/O to
PW> finish...

If a process is stuck in D state it's a kernel bug - I
don't think it's ever legitimate to wait forever for something
which could never happen. However, some such bugs are rarely
happening (e.g. a swapin failure due to hdd malfunction)
and thus will unlikely be fixed.

PW> But in "R" with your described behavior looks like a bug.
PW> If you care about the CPU time waisted: what about kill -STOP <pid>?

R state unkillable hang is possible too (infinite loop in kernel
preventing return from a syscall).

In short, in my understanding any syscall should return sooner
or later in order to process to be killable. Anything preventing
that is a kernel bug.

However, I'm not a UNIX guru, I may be wrong.
I really like to be enlightened if I'm wrong.
-- 
Best regards, vda
mailto:vda@port.imtp.ilyichevsk.odessa.ua



^ permalink raw reply	[flat|nested] 7+ messages in thread

* Re: so, no way to kill process? have to reboot?
  2001-10-12  6:16 ` Paul Mackerras
@ 2001-10-12 11:36   ` Terje Eggestad
  0 siblings, 0 replies; 7+ messages in thread
From: Terje Eggestad @ 2001-10-12 11:36 UTC (permalink / raw)
  To: paulus; +Cc: linux-kernel


fre, 2001-10-12 kl. 08:16 skrev Paul Mackerras:
    Christopher Friesen writes:
    
    > Well, the unkillable process continues on.  Does nobody else have any ideas on
    > how to kill an unkillable process in the R state thats sucking up all my unused
    > cpu cycles?
    
    I would suspect that it is actually looping inside the kernel, which
    would mean that there indeed was no way to kill it.  You could try
    alt-scrolllock on the console and see if you get a register dump of
    it, or maybe one of the alt-sysrq magic keys might give you some
    information.  But I suspect that rebooting is ultimately going to be
    your only solution.
    
You might find out if it's looping inside the kernel by doing strace -p
<pid>, if you're stuck in a syscall, I *belive* strace'll tell you.

You wouldn't by any chance be developing a kernel module??

    Paul.
    
    -
    To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
    the body of a message to majordomo@vger.kernel.org
    More majordomo info at  http://vger.kernel.org/majordomo-info.html
    Please read the FAQ at  http://www.tux.org/lkml/
-- 
_________________________________________________________________________

Terje Eggestad                  terje.eggestad@scali.no
Scali Scalable Linux Systems    http://www.scali.com

Olaf Helsets Vei 6              tel:    +47 22 62 89 61 (OFFICE)
P.O.Box 70 Bogerud                      +47 975 31 574  (MOBILE)
N-0621 Oslo                     fax:    +47 22 62 89 51
NORWAY            
_________________________________________________________________________


^ permalink raw reply	[flat|nested] 7+ messages in thread

* Re: so, no way to kill process? have to reboot?
  2001-10-11 21:05 so, no way to kill process? have to reboot? Christopher Friesen
                   ` (2 preceding siblings ...)
  2001-10-12  8:22 ` Peter Wächtler
@ 2001-10-12 13:42 ` Christopher Friesen
  3 siblings, 0 replies; 7+ messages in thread
From: Christopher Friesen @ 2001-10-12 13:42 UTC (permalink / raw)
  To: linux-kernel

"Friesen, Christopher [CAR:3R60:EXCH]" wrote:

Well, I've rebooted the thing.

It appears that whatever it was looping on was in the kernel.  I suspect that it
has something to do with NFS--this is a 2.2.17 kernel and we ran into some
issues with it and NFS on some other systems.

Thanks for the help guys...unfortunately I rebooted before getting the messages
about checking in /proc--it would have been interesting to see what it was
doing.

Chris

-- 
Chris Friesen                    | MailStop: 043/33/F10  
Nortel Networks                  | work: (613) 765-0557
3500 Carling Avenue              | fax:  (613) 765-2986
Nepean, ON K2H 8E9 Canada        | email: cfriesen@nortelnetworks.com

^ permalink raw reply	[flat|nested] 7+ messages in thread

end of thread, other threads:[~2001-10-12 13:40 UTC | newest]

Thread overview: 7+ messages (download: mbox.gz follow: Atom feed
-- links below jump to the message on this page --
2001-10-11 21:05 so, no way to kill process? have to reboot? Christopher Friesen
2001-10-11 23:08 ` James Sutherland
2001-10-12  6:16 ` Paul Mackerras
2001-10-12 11:36   ` Terje Eggestad
2001-10-12  8:22 ` Peter Wächtler
2001-10-12 10:06   ` Ability to kill (was: Re: so, no way to kill process? have to reboot?) vda
2001-10-12 13:42 ` so, no way to kill process? have to reboot? Christopher Friesen

This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox