From: Danilo Godec <danilo.godec@agenda.si>
To: David Lethe <david@santools.com>
Cc: Linux RAID Mailing List <linux-raid@vger.kernel.org>
Subject: Re: SATA errors?
Date: Wed, 01 Oct 2008 23:36:13 +0200 [thread overview]
Message-ID: <48E3ED4D.6000409@agenda.si> (raw)
In-Reply-To: <A20315AE59B5C34585629E258D76A97C021D052A@34093-C3-EVS3.exchange.rackspace.com>
I don't want to start any holly wars, but I'm not using a RAID
controller. It's just a plain old on-board SATA controller (at least
that's what I think it is):
00:1f.2 SATA controller: Intel Corporation 631xESB/632xESB SATA Storage
Controller AHCI (rev 09)
David Lethe wrote:
> If this was my system, I would ...
> 1) First check into upgrading firmware/bios/drivers of disk controller.
> 2) Look at cron jobs and see if anything that needs capacity runs around
> the time the errors are reported. Something has to run to start this
> off, so you need to find it.
> 3) Use logger and a shell script to try to catch system in the 15 second
> window when you have this problem, and see what programs are running.
> 4) Actually, if this was my system, and if I/O wasn't actually being
> suspended during those 15 seconds, then I probably would do step 1 only,
> and if everything is current, then I would move on and not worry about
> it. Even if you find the offending program, then that doesn't mean
> that the author of the program has or will make an acceptable change in
> their code.
>
1. I will get a new server in a couple of days and then I'll be able to
move the Xen VM's from the 'problematic' server. Then I'll see what can
be updated/upgraded.
2. The errors are pretty much random and there is nothing in the cron at
all. I don't think Xen VM's could do anything with the physical drive,
so their crons shouldn't be relevant.
3. It's not really a problem that we (the users) would feel. It's just
the logs that got me worried (I don't like unexplainable hard drive errors).
4. As said before, I changed the scripts to use 'smartctl' with one of
the other drives. So far it seems better - there hasn't been a single
error in 12 hours.
> The problem you have from SCSI perspective is the bozo who wrote this
> chunk of code did it the wrong way. The CORRECT way to determine
> addressable blocks is to send out the READCAP10, look for return value
> of FFFFFFFFh blocks, then issue a 16-byte (0x9e 0x10) READ CAPACITY,
> because you have > FFFFFFFE blocks on the disk. This architect never
> imagined that the READCAP10 would have to deal with large disks, and
> assumed if there was a problem, then the disk needs to be reset.
If it turns out that 'smartctl' was causing this I'll report it to
'smartmontools' guys.
Thanks for the help, Danilo
next prev parent reply other threads:[~2008-10-01 21:36 UTC|newest]
Thread overview: 10+ messages / expand[flat|nested] mbox.gz Atom feed top
2008-10-01 9:25 SATA errors? Danilo Godec
2008-10-01 10:08 ` Wolfgang Denk
2008-10-01 10:41 ` Danilo Godec
2008-10-01 11:48 ` David Lethe
2008-10-01 12:17 ` David Greaves
2008-10-01 13:11 ` David Lethe
2008-10-01 21:36 ` Danilo Godec [this message]
2008-10-01 22:03 ` Justin Piszcz
2008-10-03 9:52 ` Danilo Godec
2008-10-03 16:13 ` Iustin Pop
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=48E3ED4D.6000409@agenda.si \
--to=danilo.godec@agenda.si \
--cc=david@santools.com \
--cc=linux-raid@vger.kernel.org \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).