linux-ide.vger.kernel.org archive mirror
 help / color / mirror / Atom feed
From: Jens Axboe <jaxboe@fusionio.com>
To: Richard Kennedy <richard@rsk.demon.co.uk>
Cc: Tejun Heo <tj@kernel.org>, Rob Landley <rlandley@parallels.com>,
	Pete Clements <clem@clem.clem-digital.net>,
	linux-kernel <linux-kernel@vger.kernel.org>,
	"linux-ide@vger.kernel.org" <linux-ide@vger.kernel.org>
Subject: Re: Commit 7eaceaccab5f40 causing boot hang.
Date: Wed, 13 Apr 2011 12:36:48 +0200	[thread overview]
Message-ID: <4DA57CC0.8010400@fusionio.com> (raw)
In-Reply-To: <1302690300.1993.7.camel@castor.rsk>

On 2011-04-13 12:25, Richard Kennedy wrote:
> On Mon, 2011-04-04 at 14:47 +0100, Richard Kennedy wrote:
>> On Thu, 2011-03-31 at 15:49 +0100, Richard Kennedy wrote:
>>> On Thu, 2011-03-31 at 15:33 +0200, Jens Axboe wrote:
>>> [...]
>>>>>>> Hi Jens,
>>>>>>>
>>>>>>> I'm seeing a problem with fio never completing when writing to 2 disks
>>>>>>> simultaneously. In my test case I'm writing 2Gb to both a LVM volume & a
>>>>>>> pata drive on x86_64 on a AMD X2. Could this be a related issue?
>>>>>>>
>>>>>>> I'm not getting anything reported in the log, lockup detection doesn't
>>>>>>> report anything either. The write seems to have finished (the disk light
>>>>>>> activity has stopped) and the cpu cores are both below 10% usage, but
>>>>>>> fio never returns. The test does complete some times, but it seems to be
>>>>>>> one 1 in 4.
>>>>>>
>>>>>> So when you say PATA, it's /dev/hdaX something as well?
>>>>>>
>>>>>>> I'm going to try tracing it and see if I can spot where it's stuck.
>>>>>>
>>>>>> Thanks, that would be nice.
>>>>>>
>>>>> The second drive is /dev/sdb1 mounted on /opt, both file systems are
>>>>> ext4.
>>>>
>>>> So probably not related. What does the fio job look like?
>>>>
>>> fio job file --
>>> [global]
>>> pre_read=1
>>> ioengine=mmap
>>>
>>> [f1]
>>> size=2g
>>> rw=write
>>> directory=/home/tests
>>>
>>> [f2]
>>> size=2g
>>> rw=write
>>> directory=/opt/tests
>>>
>>> Fio gets run from a script that also collects stats but it's been
>>> running without any problems up until 2.6.39-rc1.
>>>
>> Hi Jens
>> I've upgrade to the latest fio version in the git repo 1.51 and I'm
>> still seeing this problem. 
>>
>> Fio gets stuck after it writes the 100% complete message and strace on
>> the processes shows this.
>>
>> the controlling fio process :- 
>>  ...
>> [pid  8439] wait4(8442, 0x7fff848203ac, WNOHANG, NULL) = 0
>> [pid  8439] nanosleep({0, 10000000}, NULL) = 0
>> [pid  8439] wait4(8441, 0x7fff848203ac, WNOHANG, NULL) = 0
>> [pid  8439] wait4(8442, 0x7fff848203ac, WNOHANG, NULL) = 0
>> [pid  8439] nanosleep({0, 10000000}
>>
>> & the 2 workers are both stopped here, strace shows only the one line
>> for each process.
>>
>> Process 8441 attached - interrupt to quit
>> futex(0x7f9db76a802c, FUTEX_WAIT_PRIVATE, 2, NULL
>>
>>
>> Process 8442 attached - interrupt to quit
>> futex(0x7f9db76a802c, FUTEX_WAIT_PRIVATE, 2, NULL
>>
>> How do I find out which futex it's waiting for? 
>> Any ideas where I should look next ?
>>
>> I can run the same test successfully on 2.6.38 so is it worth trying to
>> bisect this ? 
>>
>> thanks 
>> Richard
>>
> My problem has gone away in v2.6.39-rc3.
> I've just finished bisecting it down to 6de9843dab3f, & that got
> reverted in rc3, so no problem ;)
> 
> (The data corruption caused by that faulty commit was zeroing out the
> shared mutexs in fio & the worker threads were getting stuck on the
> writeout_mutex.)

Great, that's one less regression to worry about :-)


-- 
Jens Axboe


  reply	other threads:[~2011-04-13 10:36 UTC|newest]

Thread overview: 32+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2011-03-29  6:19 Commit 7eaceaccab5f40 causing boot hang Rob Landley
2011-03-29  6:35 ` Jens Axboe
2011-03-29  8:52   ` Rob Landley
2011-03-29 13:59     ` Jens Axboe
2011-03-29 14:13       ` Rob Landley
2011-03-29 14:16         ` Jens Axboe
2011-03-29 15:51           ` Pete Clements
2011-03-30  6:06             ` Rob Landley
2011-03-30 11:38               ` Jens Axboe
2011-03-30 13:52                 ` Rob Landley
2011-03-31  9:02                   ` Jens Axboe
2011-03-31  9:11                     ` Rob Landley
2011-03-31 10:07                       ` Jens Axboe
2011-03-31 10:37                         ` Jens Axboe
2011-03-31 10:47                           ` Rob Landley
2011-03-31 11:09                             ` Jens Axboe
2011-04-01  3:11                               ` Nicholas A. Bellinger
2011-04-03  7:05                                 ` Andreas Mohr
2011-04-01 12:26                               ` Mark Lord
2011-04-09  2:56                               ` Rob Landley
2011-04-09  3:00                                 ` Jens Axboe
2011-03-31 12:11                           ` Tejun Heo
2011-03-31 12:33                             ` Jens Axboe
2011-03-31 13:09                               ` Richard Kennedy
2011-03-31 13:18                                 ` Jens Axboe
2011-03-31 13:23                                   ` Richard Kennedy
2011-03-31 13:33                                     ` Jens Axboe
2011-03-31 14:49                                       ` Richard Kennedy
2011-04-04 13:47                                         ` Richard Kennedy
2011-04-13 10:25                                           ` Richard Kennedy
2011-04-13 10:36                                             ` Jens Axboe [this message]
2011-04-06 11:55                               ` Rob Landley

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=4DA57CC0.8010400@fusionio.com \
    --to=jaxboe@fusionio.com \
    --cc=clem@clem.clem-digital.net \
    --cc=linux-ide@vger.kernel.org \
    --cc=linux-kernel@vger.kernel.org \
    --cc=richard@rsk.demon.co.uk \
    --cc=rlandley@parallels.com \
    --cc=tj@kernel.org \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).