From: Richard Kennedy <richard@rsk.demon.co.uk>
To: Jens Axboe <jaxboe@fusionio.com>
Cc: Tejun Heo <tj@kernel.org>, Rob Landley <rlandley@parallels.com>,
Pete Clements <clem@clem.clem-digital.net>,
linux-kernel <linux-kernel@vger.kernel.org>,
"linux-ide@vger.kernel.org" <linux-ide@vger.kernel.org>
Subject: Re: Commit 7eaceaccab5f40 causing boot hang.
Date: Mon, 04 Apr 2011 14:47:43 +0100 [thread overview]
Message-ID: <1301924863.8526.9.camel@castor.rsk> (raw)
In-Reply-To: <1301582977.1984.7.camel@castor.rsk>
On Thu, 2011-03-31 at 15:49 +0100, Richard Kennedy wrote:
> On Thu, 2011-03-31 at 15:33 +0200, Jens Axboe wrote:
>[...]
> > >>> Hi Jens,
> > >>>
> > >>> I'm seeing a problem with fio never completing when writing to 2 disks
> > >>> simultaneously. In my test case I'm writing 2Gb to both a LVM volume & a
> > >>> pata drive on x86_64 on a AMD X2. Could this be a related issue?
> > >>>
> > >>> I'm not getting anything reported in the log, lockup detection doesn't
> > >>> report anything either. The write seems to have finished (the disk light
> > >>> activity has stopped) and the cpu cores are both below 10% usage, but
> > >>> fio never returns. The test does complete some times, but it seems to be
> > >>> one 1 in 4.
> > >>
> > >> So when you say PATA, it's /dev/hdaX something as well?
> > >>
> > >>> I'm going to try tracing it and see if I can spot where it's stuck.
> > >>
> > >> Thanks, that would be nice.
> > >>
> > > The second drive is /dev/sdb1 mounted on /opt, both file systems are
> > > ext4.
> >
> > So probably not related. What does the fio job look like?
> >
> fio job file --
> [global]
> pre_read=1
> ioengine=mmap
>
> [f1]
> size=2g
> rw=write
> directory=/home/tests
>
> [f2]
> size=2g
> rw=write
> directory=/opt/tests
>
> Fio gets run from a script that also collects stats but it's been
> running without any problems up until 2.6.39-rc1.
>
Hi Jens
I've upgrade to the latest fio version in the git repo 1.51 and I'm
still seeing this problem.
Fio gets stuck after it writes the 100% complete message and strace on
the processes shows this.
the controlling fio process :-
...
[pid 8439] wait4(8442, 0x7fff848203ac, WNOHANG, NULL) = 0
[pid 8439] nanosleep({0, 10000000}, NULL) = 0
[pid 8439] wait4(8441, 0x7fff848203ac, WNOHANG, NULL) = 0
[pid 8439] wait4(8442, 0x7fff848203ac, WNOHANG, NULL) = 0
[pid 8439] nanosleep({0, 10000000}
& the 2 workers are both stopped here, strace shows only the one line
for each process.
Process 8441 attached - interrupt to quit
futex(0x7f9db76a802c, FUTEX_WAIT_PRIVATE, 2, NULL
Process 8442 attached - interrupt to quit
futex(0x7f9db76a802c, FUTEX_WAIT_PRIVATE, 2, NULL
How do I find out which futex it's waiting for?
Any ideas where I should look next ?
I can run the same test successfully on 2.6.38 so is it worth trying to
bisect this ?
thanks
Richard
next prev parent reply other threads:[~2011-04-04 13:47 UTC|newest]
Thread overview: 32+ messages / expand[flat|nested] mbox.gz Atom feed top
2011-03-29 6:19 Commit 7eaceaccab5f40 causing boot hang Rob Landley
2011-03-29 6:35 ` Jens Axboe
2011-03-29 8:52 ` Rob Landley
2011-03-29 13:59 ` Jens Axboe
2011-03-29 14:13 ` Rob Landley
2011-03-29 14:16 ` Jens Axboe
2011-03-29 15:51 ` Pete Clements
2011-03-30 6:06 ` Rob Landley
2011-03-30 11:38 ` Jens Axboe
2011-03-30 13:52 ` Rob Landley
2011-03-31 9:02 ` Jens Axboe
2011-03-31 9:11 ` Rob Landley
2011-03-31 10:07 ` Jens Axboe
2011-03-31 10:37 ` Jens Axboe
2011-03-31 10:47 ` Rob Landley
2011-03-31 11:09 ` Jens Axboe
2011-04-01 3:11 ` Nicholas A. Bellinger
2011-04-03 7:05 ` Andreas Mohr
2011-04-01 12:26 ` Mark Lord
2011-04-09 2:56 ` Rob Landley
2011-04-09 3:00 ` Jens Axboe
2011-03-31 12:11 ` Tejun Heo
2011-03-31 12:33 ` Jens Axboe
2011-03-31 13:09 ` Richard Kennedy
2011-03-31 13:18 ` Jens Axboe
2011-03-31 13:23 ` Richard Kennedy
2011-03-31 13:33 ` Jens Axboe
2011-03-31 14:49 ` Richard Kennedy
2011-04-04 13:47 ` Richard Kennedy [this message]
2011-04-13 10:25 ` Richard Kennedy
2011-04-13 10:36 ` Jens Axboe
2011-04-06 11:55 ` Rob Landley
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=1301924863.8526.9.camel@castor.rsk \
--to=richard@rsk.demon.co.uk \
--cc=clem@clem.clem-digital.net \
--cc=jaxboe@fusionio.com \
--cc=linux-ide@vger.kernel.org \
--cc=linux-kernel@vger.kernel.org \
--cc=rlandley@parallels.com \
--cc=tj@kernel.org \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).