From: Jeremy Sanders <jeremy@jeremysanders.net>
To: linux-raid@vger.kernel.org
Subject: stuck tasks
Date: Mon, 12 Apr 2010 11:40:59 +0100
Message-ID: <hputbs$qq2$1@dough.gmane.org>

Hi - I'm not getting any joy with Fedora's bugzilla. Has anyone seen 
problems like this with Fedora 12? Our systems have recently been getting 
stuck while rsyncing data onto an MD device:

https://bugzilla.redhat.com/show_bug.cgi?id=578549

INFO: task kthreadd:2 blocked for more than 120 seconds.
"echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this message.
kthreadd      D 0000000000000002     0     2      0 0x00000000
 ffff88007dbfd4c0 0000000000000046 0000000000000000 0000000a00000000
 ffff880000000001 ffff880079f9b800 ffff88007dbfdfd8 ffff88007dbfdfd8
 ffff88007dbf1b38 000000000000f980 0000000000015740 ffff88007dbf1b38
Call Trace:
 [<ffffffff8107c30d>] ? ktime_get_ts+0x85/0x8e
 [<ffffffff810d604d>] ? sync_page+0x0/0x4a
 [<ffffffff810d604d>] ? sync_page+0x0/0x4a
 [<ffffffff814546f5>] io_schedule+0x43/0x5d
 [<ffffffff810d6093>] sync_page+0x46/0x4a
 [<ffffffff81454c48>] __wait_on_bit+0x48/0x7b
 ...
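
For anyone wanting to pull these warnings out of a longer log, here is a minimal sketch in Python. It assumes the message format shown above ("INFO: task NAME:PID blocked for more than N seconds."); the function name find_hung_tasks is made up for illustration and is not part of any existing tool.

```python
import re
import sys

# Matches the hung-task warning as it appears in the log above, e.g.
# "INFO: task kthreadd:2 blocked for more than 120 seconds."
# The task name may itself contain a colon (e.g. "flush-9:0"), so the
# greedy name group leaves only the final ":<digits>" as the PID.
HUNG_TASK_RE = re.compile(
    r"INFO: task (?P<comm>.+):(?P<pid>\d+) blocked for more than (?P<secs>\d+) seconds"
)


def find_hung_tasks(log_text):
    """Return a list of (name, pid, seconds) for each hung-task warning."""
    results = []
    for line in log_text.splitlines():
        m = HUNG_TASK_RE.search(line)
        if m:
            results.append((m.group("comm"), int(m.group("pid")), int(m.group("secs"))))
    return results


if __name__ == "__main__":
    # Reads log text from standard input and prints one line per warning.
    for comm, pid, secs in find_hung_tasks(sys.stdin.read()):
        print(f"{comm} (pid {pid}) blocked for more than {secs}s")
```

Saved under a name of your choosing, it could be fed the kernel log on standard input, for example from dmesg.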

Several processes end up stuck in the D (uninterruptible sleep) state:
USER       PID %CPU %MEM    VSZ   RSS TTY      STAT START   TIME COMMAND
root         2  0.0  0.0      0     0 ?        D    Mar31   0:07 [kthreadd]
root        14  0.0  0.0      0     0 ?        D    Mar31   9:38 [async/mgr]
root        17  0.0  0.0      0     0 ?        D    Mar31   0:00 [bdi-default]
root        34  0.0  0.0      0     0 ?        D    Mar31  10:06 [kswapd0]
root      5509  0.0  0.3  50732  7900 ?        D    Apr09   0:03 rsync -raHSx --stats --whole-file --numeric-ids --link-dest=/xback2_back1/YY/20100407-000501 --exclude=/lost+found --exclude=.mozilla/*/*/Cache/* XX:/XX_data1/data/YY/ /xback2_back1/YY/20100409-000502/
root     17457  0.0  0.2  61920  5756 ?        D    Apr11   0:00 python /data/soft3/backup/diskbackup/diskbackup.py /data/soft3/backup/diskbackup/main.cfg
root     18402  0.0  0.0      0     0 ?        D    Apr09   0:11 [flush-9:0]
root     20259  0.0  0.1   4284  3424 ?        DN   Apr11   0:00 /usr/sbin/prelink -av -mR -q
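
A listing like the one above can be filtered automatically. The following is a rough sketch, assuming the 11-column `ps aux` layout shown (USER ... STAT START TIME COMMAND) with one process per line; d_state_processes is a hypothetical helper name, not an existing utility.

```python
import subprocess


def d_state_processes(ps_output):
    """Return (pid, stat, command) for rows whose STAT field starts with 'D'.

    Expects the text of `ps aux`, header line included.
    """
    rows = []
    for line in ps_output.splitlines()[1:]:  # skip the header row
        # Split into at most 11 fields so COMMAND keeps its embedded spaces.
        fields = line.split(None, 10)
        if len(fields) < 11 or not fields[1].isdigit():
            continue  # not a complete process row (e.g. a wrapped line)
        stat = fields[7]
        if stat.startswith("D"):
            rows.append((int(fields[1]), stat, fields[10]))
    return rows


if __name__ == "__main__":
    # Runs `ps aux` on the local machine and prints the blocked processes.
    out = subprocess.run(["ps", "aux"], capture_output=True, text=True).stdout
    for pid, stat, command in d_state_processes(out):
        print(pid, stat, command)
```

Matching on the first character of STAT also catches variants such as DN (low-priority, uninterruptible), as seen for prelink above.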

It only seems to affect our MD systems. The kernel is 
2.6.32.10-90.fc12.x86_64. The systems have 3ware 96xx controllers. This 
kernel does have the issue when there are lots of aio processes.

The two affected systems have different file systems: xfs and ext3.

Jeremy



Thread overview: 6+ messages
2010-04-12 10:40 Jeremy Sanders [this message]
2010-04-12 11:06 ` stuck tasks MRK
2010-04-12 11:14   ` Jeremy Sanders
2010-04-12 12:49     ` MRK
2010-04-12 13:02       ` Jeremy Sanders
2010-04-12 13:38         ` MRK
