From: Jens Axboe <axboe@suse.de>
To: Warren Togami <wtogami@redhat.com>
Cc: Markus Lidel <Markus.Lidel@shadowconnect.com>,
Arjan van de Ven <arjanv@redhat.com>,
linux-scsi@vger.kernel.org, linux-kernel@vger.kernel.org,
Alan Cox <alan@redhat.com>
Subject: Re: [PATCH] i2o_block Fix, possible CFQ elevator problem?
Date: Tue, 20 Apr 2004 11:05:23 +0200 [thread overview]
Message-ID: <20040420090523.GE25806@suse.de> (raw)
In-Reply-To: <4084E671.4090509@redhat.com>
On Mon, Apr 19 2004, Warren Togami wrote:
> Jens Axboe wrote:
> >>We figured removing error handling was not safe, the previous post was
> >>only reporting test results to ask for more suggestions. I have now
> >>tested your suggested patch above and it seems to crash in the same way
> >>as originally.
> >>
> >>http://togami.com/~warren/archive/2004/i2o_cfq_quad_bonnie2.txt
> >
> >
> >As a temporary safe work-around, you can apply this patch.
> >
> >
> >>This makes me curious, the other elevators lacked this type of error
> >>checking. Did this mean they were possibly allowing data corruption to
> >>happen with buggy drivers like this? Kind of scary! We were lucky to
> >>test this now, because this was one of the first FC kernels that
> >>included cfq by default.
> >
> >
> >Not necessarily, it's most likely a CFQ bug. Otherwise it would have
> >surfaced before :-)
> >
>
> I forgot to mention in the previous reports:
>
> Prior to three of your original suggested cleanups of i2o_block, four
> simultaneous bonnie++'s on four independent arrays would almost
> immediately cause the crash while running elevator=cfq. After those
> three cleanups four simultaneous bonnie++ would survive for a while
> without crashing... until you run "sync" in another terminal. We
> however did not test it enough times to determine if without "sync" it
> can survive the test run. Do you want this tested without "sync"?
Repeat the tests that made it crash. The last patch I sent should work
for you, at least until the real issue is found.
> With the deadline scheduler "sync" would take maybe 30 seconds and
> return. With the cfq scheduler "sync" would be stuck there for much
> longer, then trigger the crash. Markus has suspected that it crashes
> when sync returns, but we have not confirmed that.
Probably not the case. Running sync only initiates the dirty data
flushing, the actual write out happens out of that context. So it's
probably just running the sync that changes the timings a little bit,
triggering the bug.
--
Jens Axboe
next prev parent reply other threads:[~2004-04-20 9:06 UTC|newest]
Thread overview: 13+ messages / expand[flat|nested] mbox.gz Atom feed top
2004-04-19 11:37 [PATCH] i2o_block Fix, possible CFQ elevator problem? Warren Togami
2004-04-19 12:12 ` Jens Axboe
2004-04-20 0:42 ` Warren Togami
[not found] ` <40848159.7090605@togami.com>
2004-04-20 7:08 ` Jens Axboe
2004-04-20 7:58 ` Warren Togami
2004-04-20 8:03 ` Jens Axboe
2004-04-20 8:59 ` Warren Togami
2004-04-20 9:05 ` Jens Axboe [this message]
2004-04-20 10:53 ` Warren Togami
2004-04-20 10:56 ` Jens Axboe
2004-04-20 11:29 ` Warren Togami
2004-04-20 11:34 ` Jens Axboe
2004-04-20 11:38 ` Jens Axboe
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=20040420090523.GE25806@suse.de \
--to=axboe@suse.de \
--cc=Markus.Lidel@shadowconnect.com \
--cc=alan@redhat.com \
--cc=arjanv@redhat.com \
--cc=linux-kernel@vger.kernel.org \
--cc=linux-scsi@vger.kernel.org \
--cc=wtogami@redhat.com \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox