All of lore.kernel.org
 help / color / mirror / Atom feed
* so, no way to kill process? have to reboot?
@ 2001-10-11 21:05 Christopher Friesen
  2001-10-11 23:08 ` James Sutherland
                   ` (3 more replies)
  0 siblings, 4 replies; 7+ messages in thread
From: Christopher Friesen @ 2001-10-11 21:05 UTC (permalink / raw)
  To: linux-kernel


Well, the unkillable process continues on.  Does nobody else have any ideas on
how to kill an unkillable process in the R state thats sucking up all my unused
cpu cycles?

If not I'm going to have to reboot this thing...

Chris

-- 
Chris Friesen                    | MailStop: 043/33/F10  
Nortel Networks                  | work: (613) 765-0557
3500 Carling Avenue              | fax:  (613) 765-2986
Nepean, ON K2H 8E9 Canada        | email: cfriesen@nortelnetworks.com

^ permalink raw reply	[flat|nested] 7+ messages in thread

* Re: so, no way to kill process? have to reboot?
  2001-10-11 21:05 so, no way to kill process? have to reboot? Christopher Friesen
@ 2001-10-11 23:08 ` James Sutherland
  2001-10-12  6:16 ` Paul Mackerras
                   ` (2 subsequent siblings)
  3 siblings, 0 replies; 7+ messages in thread
From: James Sutherland @ 2001-10-11 23:08 UTC (permalink / raw)
  To: Christopher Friesen; +Cc: linux-kernel

On Thu, 11 Oct 2001, Christopher Friesen wrote:

> Well, the unkillable process continues on.  Does nobody else have any ideas on
> how to kill an unkillable process in the R state thats sucking up all my unused
> cpu cycles?
>
> If not I'm going to have to reboot this thing...

Short term hack: renice it to 20, so it doesn't interfere with normal
workload. Also try sending it a SIGSTOP, although I doubt that will work
here. I think strace will fail the same way gdb does, but try that too...


James.
-- 
"Our attitude with TCP/IP is, `Hey, we'll do it, but don't make a big
system, because we can't fix it if it breaks -- nobody can.'"

"TCP/IP is OK if you've got a little informal club, and it doesn't make
any difference if it takes a while to fix it."
		-- Ken Olson, in Digital News, 1988


^ permalink raw reply	[flat|nested] 7+ messages in thread

* Re: so, no way to kill process? have to reboot?
  2001-10-11 21:05 so, no way to kill process? have to reboot? Christopher Friesen
  2001-10-11 23:08 ` James Sutherland
@ 2001-10-12  6:16 ` Paul Mackerras
  2001-10-12 11:36   ` Terje Eggestad
  2001-10-12  8:22 ` Peter Wächtler
  2001-10-12 13:42 ` so, no way to kill process? have to reboot? Christopher Friesen
  3 siblings, 1 reply; 7+ messages in thread
From: Paul Mackerras @ 2001-10-12  6:16 UTC (permalink / raw)
  To: Christopher Friesen; +Cc: linux-kernel

Christopher Friesen writes:

> Well, the unkillable process continues on.  Does nobody else have any ideas on
> how to kill an unkillable process in the R state thats sucking up all my unused
> cpu cycles?

I would suspect that it is actually looping inside the kernel, which
would mean that there indeed was no way to kill it.  You could try
alt-scrolllock on the console and see if you get a register dump of
it, or maybe one of the alt-sysrq magic keys might give you some
information.  But I suspect that rebooting is ultimately going to be
your only solution.

Paul.


^ permalink raw reply	[flat|nested] 7+ messages in thread

* Re: so, no way to kill process? have to reboot?
  2001-10-11 21:05 so, no way to kill process? have to reboot? Christopher Friesen
  2001-10-11 23:08 ` James Sutherland
  2001-10-12  6:16 ` Paul Mackerras
@ 2001-10-12  8:22 ` Peter Wächtler
  2001-10-12 10:06   ` Ability to kill (was: Re: so, no way to kill process? have to reboot?) vda
  2001-10-12 13:42 ` so, no way to kill process? have to reboot? Christopher Friesen
  3 siblings, 1 reply; 7+ messages in thread
From: Peter Wächtler @ 2001-10-12  8:22 UTC (permalink / raw)
  To: Christopher Friesen; +Cc: linux-kernel

Christopher Friesen wrote:
> 
> Well, the unkillable process continues on.  Does nobody else have any ideas on
> how to kill an unkillable process in the R state thats sucking up all my unused
> cpu cycles?
> 
> If not I'm going to have to reboot this thing...
> 

Well, I'd suspect it in "D" state - waiting for some disk I/O to
finish...

But in "R" with your described behavior looks like a bug.
If you care about the CPU time waisted: what about kill -STOP <pid>?

Can you describe your filesystem layout?
I think of a symlink recursion bug or something wrong in /dev/shm
or alike... (no flame, just guessing :)

What are the parameters of "find"?

^ permalink raw reply	[flat|nested] 7+ messages in thread

* Ability to kill (was: Re: so, no way to kill process? have to reboot?)
  2001-10-12  8:22 ` Peter Wächtler
@ 2001-10-12 10:06   ` vda
  0 siblings, 0 replies; 7+ messages in thread
From: vda @ 2001-10-12 10:06 UTC (permalink / raw)
  To: linux-kernel

PW> Well, I'd suspect it in "D" state - waiting for some disk I/O to
PW> finish...

If a process is stuck in D state it's a kernel bug - I
don't think it's ever legitimate to wait forever for something
which could never happen. However, some such bugs are rarely
happening (e.g. a swapin failure due to hdd malfunction)
and thus will unlikely be fixed.

PW> But in "R" with your described behavior looks like a bug.
PW> If you care about the CPU time waisted: what about kill -STOP <pid>?

R state unkillable hang is possible too (infinite loop in kernel
preventing return from a syscall).

In short, in my understanding any syscall should return sooner
or later in order to process to be killable. Anything preventing
that is a kernel bug.

However, I'm not a UNIX guru, I may be wrong.
I really like to be enlightened if I'm wrong.
-- 
Best regards, vda
mailto:vda@port.imtp.ilyichevsk.odessa.ua



^ permalink raw reply	[flat|nested] 7+ messages in thread

* Re: so, no way to kill process? have to reboot?
  2001-10-12  6:16 ` Paul Mackerras
@ 2001-10-12 11:36   ` Terje Eggestad
  0 siblings, 0 replies; 7+ messages in thread
From: Terje Eggestad @ 2001-10-12 11:36 UTC (permalink / raw)
  To: paulus; +Cc: linux-kernel


fre, 2001-10-12 kl. 08:16 skrev Paul Mackerras:
    Christopher Friesen writes:
    
    > Well, the unkillable process continues on.  Does nobody else have any ideas on
    > how to kill an unkillable process in the R state thats sucking up all my unused
    > cpu cycles?
    
    I would suspect that it is actually looping inside the kernel, which
    would mean that there indeed was no way to kill it.  You could try
    alt-scrolllock on the console and see if you get a register dump of
    it, or maybe one of the alt-sysrq magic keys might give you some
    information.  But I suspect that rebooting is ultimately going to be
    your only solution.
    
You might find out if it's looping inside the kernel by doing strace -p
<pid>, if you're stuck in a syscall, I *belive* strace'll tell you.

You wouldn't by any chance be developing a kernel module??

    Paul.
    
    -
    To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
    the body of a message to majordomo@vger.kernel.org
    More majordomo info at  http://vger.kernel.org/majordomo-info.html
    Please read the FAQ at  http://www.tux.org/lkml/
-- 
_________________________________________________________________________

Terje Eggestad                  terje.eggestad@scali.no
Scali Scalable Linux Systems    http://www.scali.com

Olaf Helsets Vei 6              tel:    +47 22 62 89 61 (OFFICE)
P.O.Box 70 Bogerud                      +47 975 31 574  (MOBILE)
N-0621 Oslo                     fax:    +47 22 62 89 51
NORWAY            
_________________________________________________________________________


^ permalink raw reply	[flat|nested] 7+ messages in thread

* Re: so, no way to kill process? have to reboot?
  2001-10-11 21:05 so, no way to kill process? have to reboot? Christopher Friesen
                   ` (2 preceding siblings ...)
  2001-10-12  8:22 ` Peter Wächtler
@ 2001-10-12 13:42 ` Christopher Friesen
  3 siblings, 0 replies; 7+ messages in thread
From: Christopher Friesen @ 2001-10-12 13:42 UTC (permalink / raw)
  To: linux-kernel

"Friesen, Christopher [CAR:3R60:EXCH]" wrote:

Well, I've rebooted the thing.

It appears that whatever it was looping on was in the kernel.  I suspect that it
has something to do with NFS--this is a 2.2.17 kernel and we ran into some
issues with it and NFS on some other systems.

Thanks for the help guys...unfortunately I rebooted before getting the messages
about checking in /proc--it would have been interesting to see what it was
doing.

Chris

-- 
Chris Friesen                    | MailStop: 043/33/F10  
Nortel Networks                  | work: (613) 765-0557
3500 Carling Avenue              | fax:  (613) 765-2986
Nepean, ON K2H 8E9 Canada        | email: cfriesen@nortelnetworks.com

^ permalink raw reply	[flat|nested] 7+ messages in thread

end of thread, other threads:[~2001-10-12 13:40 UTC | newest]

Thread overview: 7+ messages (download: mbox.gz follow: Atom feed
-- links below jump to the message on this page --
2001-10-11 21:05 so, no way to kill process? have to reboot? Christopher Friesen
2001-10-11 23:08 ` James Sutherland
2001-10-12  6:16 ` Paul Mackerras
2001-10-12 11:36   ` Terje Eggestad
2001-10-12  8:22 ` Peter Wächtler
2001-10-12 10:06   ` Ability to kill (was: Re: so, no way to kill process? have to reboot?) vda
2001-10-12 13:42 ` so, no way to kill process? have to reboot? Christopher Friesen

This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.