From: Stan Hoeppner <stan@hardwarefreak.com>
To: lists@yazzy.org
Cc: "Mathias Burén" <mathias.buren@gmail.com>, linux-raid@vger.kernel.org
Subject: Re: How to stress test an RAID 6 array?
Date: Mon, 03 Oct 2011 22:56:45 -0500 [thread overview]
Message-ID: <4E8A83FD.3060805@hardwarefreak.com> (raw)
In-Reply-To: <4E89BF73.8020604@yazzy.org>
On 10/3/2011 8:58 AM, Marcin M. Jessa wrote:
> exception Emask 0x0 SAct 0x0 SErr 0x0 action 0x6 frozen
This line is not important ^^^
> ata9.00: failed command: FLUSH CACHE EXT
THIS one is:^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
> That "exception Emask" part pointed me to misc threads where people
> mentioned bugs in the Linux kernel.
According to your dmesg output the kernel believes the drives are not
completing the ATA6 (and later) FLUSH_CACHE_EXT command. hdparm will
confirm your drives drives do support it. FLUSH_CACHE_EXT is sent to a
drive to force data in the cache to hit the platters. This is done for
data consistency and to prevent filesystem corruption due to power
outages, system crashes, and the like.
What you need to figure out is why the apparent flush command faliures
are occurring. The cause will likely be a kernel/driver issue, a
motherboard/sata controller issue, a PSU issue, or a drive issue.
The few instances of this FLUSH_CACHE_EXT error I located seemed to
center somewhere around kernel 2.6.34. IIRC those experiencing this
issue on FC and Ubuntu instantly fixed it with a distro upgrade.
Thus, upgrade your kernel to 2.6.38.8 or later. If that doesn't fix it,
disable the write caches on your array member drives (a very good idea
with non BBU RAID anyway). The proper/preferred way to do this may vary
amongst distros. Adding a boot script containing something like the
following to the appropriate /etc/rc.x directory should do the trick on
all distros:
#!/bin/sh
hdparm -W0 /dev/sda
hdparm -W0 /dev/sdb
hdparm -W0 /dev/sdc
hdparm -W0 /dev/sdd
hdparm -W0 /dev/sde
Reboot. Confirm the write caches are disabled with something like this:
#!/bin/bash
for i in {a..e}
do
echo -n "sd$i: "
hdparm -i /dev/sd$i|grep -i writecache|awk '{ print $2 }'
done
If neither of these suggestions fixes the problem then you may need to
start replacing or adding hardware. At that point I'd recommend
dropping an LSI SAS 9211-8i into your free PCIe x16 slot.
--
Stan
next prev parent reply other threads:[~2011-10-04 3:56 UTC|newest]
Thread overview: 16+ messages / expand[flat|nested] mbox.gz Atom feed top
2011-10-03 13:26 How to stress test an RAID 6 array? Marcin M. Jessa
2011-10-03 13:39 ` Mathias Burén
2011-10-03 13:58 ` Marcin M. Jessa
2011-10-03 14:03 ` Mathias Burén
2011-10-03 14:18 ` Marcin M. Jessa
2011-10-03 14:29 ` Mathias Burén
2011-10-03 15:17 ` Marcin M. Jessa
2011-10-04 4:42 ` Stan Hoeppner
2011-10-04 3:56 ` Stan Hoeppner [this message]
2011-10-04 8:37 ` Marcin M. Jessa
2011-10-05 17:41 ` Stan Hoeppner
2011-10-03 14:24 ` Joe Landman
2011-10-03 15:40 ` Marcin M. Jessa
2011-10-03 20:35 ` Marcin M. Jessa
2011-10-03 16:16 ` maurice
2011-10-08 14:44 ` Gordon Henderson
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=4E8A83FD.3060805@hardwarefreak.com \
--to=stan@hardwarefreak.com \
--cc=linux-raid@vger.kernel.org \
--cc=lists@yazzy.org \
--cc=mathias.buren@gmail.com \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.