All of lore.kernel.org
 help / color / mirror / Atom feed
From: rwhron@earthlink.net
To: linux-kernel@vger.kernel.org
Cc: akpm@osdl.org
Subject: 2.6.0-test4-mm5 dbench stuck in D state
Date: Thu, 4 Sep 2003 22:38:13 -0400	[thread overview]
Message-ID: <20030905023813.GA29171@rushmore> (raw)

Uniprocessor x86 ext2 on IDE system running 2.6.0-test4-mm5
has dbench stuck in uninterruptible sleep.  -mm4 didn't 
have a problem with the same workload.

dbench 32 was hung.  I did pkill -9 dbench.  Some of the dbench
processes exited.  Some did not.  Below is "ps aux".   The hung
"dbench 32" processes have START time of 02:09.  

Apparently the mother of all dbench processes exited, because the
wrapper script moved on to the next iteration of dbench.  

A few hours later I noticed dbench wasn't making progress on 
another iteration of dbench 32.  pkill -9 ... one process with
start time of 07:49 didn't exit.

A few hours later "dbench 64" was stuck.  The disk access light
was out. The load average steady at 16.  In the ps below,
those processes have START time of 21:49.

ps aux
USER       PID %CPU %MEM   VSZ  RSS TTY      STAT START   TIME COMMAND
root         1  0.0  0.0  1312  168 ?        S    Sep03   0:04 init [3]
root         2  0.0  0.0     0    0 ?        SWN  Sep03   0:00 [ksoftirqd/0]
root         3  0.0  0.0     0    0 ?        SW<  Sep03   0:01 [events/0]
root         4  0.0  0.0     0    0 ?        SW<  Sep03   0:00 [kblockd/0]
root         7  0.0  0.0     0    0 ?        SW   Sep03   0:28 [kswapd0]
root         8  0.0  0.0     0    0 ?        SW<  Sep03   0:00 [aio/0]
root         9  0.0  0.0     0    0 ?        SW<  Sep03   0:00 [aio_fput/0]
root        10  0.0  0.0     0    0 ?        SW   Sep03   0:00 [jfsIO]
root        11  0.0  0.0     0    0 ?        SW   Sep03   0:00 [jfsCommit]
root        12  0.0  0.0     0    0 ?        SW   Sep03   0:00 [jfsSync]
root        13  0.0  0.0     0    0 ?        SW<  Sep03   0:00 [xfslogd/0]
root        14  0.0  0.0     0    0 ?        SW<  Sep03   0:00 [xfsdatad/0]
root        15  0.0  0.0     0    0 ?        SW   Sep03   0:00 [pagebufd]
root        16  0.0  0.0     0    0 ?        SW   Sep03   0:25 [kjournald]
root       403  0.0  0.0  1368  216 ?        S    Sep03   0:00 syslogd -m 0
root       408  0.0  0.0  1300  280 ?        S    Sep03   0:00 klogd -x
rpc        429  0.0  0.0  1448  104 ?        S    Sep03   0:00 portmap
root       542  0.0  0.0  2568  140 ?        S    Sep03   0:00 /usr/sbin/sshd
root       552  0.0  0.0  1284  116 tty1     S    Sep03   0:00 /sbin/mingetty tt
root       553  0.0  0.2  2208 1028 ?        S    Sep03   0:00 login -- root
root       556  0.0  0.1  3440  400 ?        S    Sep03   0:01 /usr/sbin/sshd
rwhron     557  0.0  0.0  2200  144 pts/0    S    Sep03   0:00 -bash
root       601  0.0  0.0  2000  144 pts/0    S    Sep03   0:00 su -
root       602  0.0  0.0  2400  144 pts/0    S    Sep03   0:00 -bash
root       652  0.0  0.1  2216  544 pts/0    S    Sep03   0:00 /bin/bash ./runte
root     16813  0.0  0.0  3352  160 ?        S    Sep03   0:00 /usr/sbin/sshd
rwhron   16814  0.0  0.0  2200  152 pts/1    S    Sep03   0:00 -bash
root     28785  0.0  0.2  2180 1020 pts/0    S    02:09   0:00 /bin/bash /usr/lo
root     28786  0.0  0.1  1568  544 pts/0    S    02:09   0:00 tee -a /root/2.6.
root     28787  0.0  0.2  2180 1040 pts/0    S    02:09   0:00 /bin/bash ./dbrun
root     28790  0.0  0.1  1568  544 pts/0    S    02:09   0:00 tee -a /root/2.6.
root     28797  0.0  0.1  1496  580 pts/0    S    02:09   0:13 ping localhost
root     28816  0.0  0.1  1372  512 pts/0    D    02:09   0:09 ./dbench 32
root     28821  0.0  0.1  1372  512 pts/0    D    02:09   0:03 ./dbench 32
root     28827  0.0  0.1  1372  512 pts/0    D    02:09   0:10 ./dbench 32
root     28831  0.0  0.1  1372  512 pts/0    D    02:09   0:10 ./dbench 32
root     28836  0.0  0.1  1372  512 pts/0    D    02:09   0:10 ./dbench 32
root     28842  0.0  0.1  1372  512 pts/0    D    02:09   0:10 ./dbench 32
root     28889  0.0  0.3  2408 1304 ttyS1    S    07:38   0:27 -bash
root     29133  0.0  0.1  1372  512 pts/0    D    07:49   0:09 ./dbench 32
root     29278  0.0  0.0     0    0 ?        SW   21:48   0:02 [pdflush]
root     29284  0.0  0.1  1364  424 pts/0    S    21:49   0:00 ./dbench 64
root     29290  0.4  0.1  1372  512 pts/0    D    21:49   0:09 ./dbench 64
root     29300  0.3  0.1  1372  512 pts/0    D    21:49   0:08 ./dbench 64
root     29304  0.3  0.1  1372  512 pts/0    D    21:49   0:08 ./dbench 64
root     29308  0.4  0.1  1372  512 pts/0    D    21:49   0:09 ./dbench 64
root     29311  0.4  0.1  1372  512 pts/0    D    21:49   0:09 ./dbench 64
root     29322  0.4  0.1  1372  512 pts/0    D    21:49   0:09 ./dbench 64
root     29324  0.4  0.1  1372  512 pts/0    D    21:49   0:09 ./dbench 64
root     29337  0.3  0.1  1372  512 pts/0    D    21:49   0:08 ./dbench 64
root     29345  0.3  0.1  1372  512 pts/0    D    21:49   0:08 ./dbench 64
root     29422  0.1  0.0     0    0 ?        SW   21:52   0:02 [pdflush]
root     29433  0.0  0.1  2556  680 ttyS1    R    22:26   0:00 ps aux


The behavior is not perfectly repeatable.  :(   To recap, the
sequence of events was like this:

dbench 32    # pkill -9 dbench
dbench 32    # exited normally
dbench 32    # pkill -9 dbench
dbench 32    # pkill -9 dbench
dbench 32    # appeared to exit normally
dbench 64    # some processes don't exit

There is a <sysrq T> trace at:
http://home.earthlink.net/~rwhron/2.6.0-test4-mm5.trace

Is there something else that would help track this down?
2.6.0-test4-mm4 did not exhibit this behavior.

-- 
Randy Hron
http://home.earthlink.net/~rwhron/kernel/bigbox.html


             reply	other threads:[~2003-09-05  2:34 UTC|newest]

Thread overview: 3+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2003-09-05  2:38 rwhron [this message]
2003-09-05  3:44 ` 2.6.0-test4-mm5 dbench stuck in D state Andrew Morton
  -- strict thread matches above, loose matches on Subject: below --
2003-09-05  4:32 rwhron

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=20030905023813.GA29171@rushmore \
    --to=rwhron@earthlink.net \
    --cc=akpm@osdl.org \
    --cc=linux-kernel@vger.kernel.org \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.