From mboxrd@z Thu Jan 1 00:00:00 1970 Return-Path: Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1755486AbZKWPVZ (ORCPT ); Mon, 23 Nov 2009 10:21:25 -0500 Received: (majordomo@vger.kernel.org) by vger.kernel.org id S1755067AbZKWPVX (ORCPT ); Mon, 23 Nov 2009 10:21:23 -0500 Received: from mx1.redhat.com ([209.132.183.28]:53417 "EHLO mx1.redhat.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1755464AbZKWPVU (ORCPT ); Mon, 23 Nov 2009 10:21:20 -0500 Date: Mon, 23 Nov 2009 16:16:12 +0100 From: Oleg Nesterov To: Nick Piggin Cc: Linux Kernel Mailing List , Roland McGrath Subject: Re: Zombie process when ptracing Message-ID: <20091123151612.GC4495@redhat.com> References: <20091119102543.GB5602@wotan.suse.de> <20091120012930.GA3985@redhat.com> <20091123083623.GD5602@wotan.suse.de> MIME-Version: 1.0 Content-Type: text/plain; charset=us-ascii Content-Disposition: inline In-Reply-To: <20091123083623.GD5602@wotan.suse.de> User-Agent: Mutt/1.5.18 (2008-05-17) Sender: linux-kernel-owner@vger.kernel.org List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On 11/23, Nick Piggin wrote: > > On Fri, Nov 20, 2009 at 02:29:30AM +0100, Oleg Nesterov wrote: > > Hi, > > > > On 11/19, Nick Piggin wrote: > > > > > > Running recent git kernel, I have a process stuck in Z state > > > > > > bash ? 0000000000000000 0 3188 3187 0x00000000 > > > ffff88012e24fec8 0000000000000046 0000000000000000 0000000000000012 > > > ffff88012e24fec8 ffff88012e24e000 ffff88012e24ffd8 ffff88012e24e000 > > > 000000000000efc8 ffff88012e24e000 ffff88012ea82090 ffff88012ff78640 > > > Call Trace: > > > [] ? proc_clear_tty+0x5e/0x70 > > > [] ? exit_ptrace+0xb8/0x140 > > > [] do_exit+0x58a/0x7c0 > > > [] do_group_exit+0x3d/0xb0 > > > [] sys_exit_group+0x12/0x20 > > > [] system_call_fastpath+0x16/0x1b > > > > > > This was after stracing a few test programs. > > > > > > It also seems to have lost job control (^C) at the same time. > > > > This can happen if the tracer (strace) itself hangs, zombies > > should go away once the tracer is killed. Or its ->real_parent > > is stopped or hangs... > > > > (I assume you didn't strace /sbin/init) > > No, I straced something else, and all straces seemed to be > killed but bash remained. I was running a script that in > turn launched another process, so I ran it via > strace -ff bash ./script.sh OK, thanks. Hmm. Just noticed the state above == '?'. Looks like sched_show_task() is buggy, it should check ->exit_state for "ZX" from TASK_STATE_TO_CHAR_STR. But this is off-topic. > > If you see a zombie again, could you look at its /ptoc/pid/status? > > OK, any other hints if I see it again? Well, also the contents of /proc/PPid/status and /proc/TracerPid/status may help. And sysrq-t ouput. Otherwise, currently I have no idea where to start. Oleg.