All of lore.kernel.org
 help / color / mirror / Atom feed
From: Robert Hancock <hancockrwd@gmail.com>
To: "Håkon Løvdal" <hlovdal@gmail.com>
Cc: linux-ide@vger.kernel.org
Subject: Re: "raid5:md0: read error not correctable (sector 795463080 on sdf1)" error on controller with SIL 3114
Date: Mon, 08 Feb 2010 23:13:33 -0600	[thread overview]
Message-ID: <4B70EEFD.1040603@gmail.com> (raw)
In-Reply-To: <a01a16b51002080311i420f6429ld3545c44c04603eb@mail.gmail.com>

On 02/08/2010 05:11 AM, Håkon Løvdal wrote:
> Hi. I have had some trouble with the machine I want to have as a file server.
>
> After having let the "get raid up and running reliably" project lie
> dormant for some time, I tried again this Friday. After connecting the
> disks, the status was the following: 4 out of 6 disk in a raid6 setup
> were recognised (see log-1). I was able to mount the volume when the
> machine was finished booting.
>
> I then added the two missing disks with mdadm, one of them started
> rebuilding and the other one were not recognised in some way (log-2).
> The rebuild of the disk was successfull (log-3), but later some errors
> occured, see log-4 below, and now only three disks are left in the
> array (log-5).
>
> Are these errors related to Tejun's recent statement "Sil3112/3114 are
> now virtually the only controllers with occassional and unresolved data
> corruption issues."? Disks sda (hosting root file system for os), sdb
> sdc and sdd are connected the motherboard while sde, sdf and sdg are
> connected to a controller card using 3114:

..

> ---BEGIN log-4---
> Feb  6 07:09:57 localhost kernel: ata8.00: exception Emask 0x0 SAct
> 0x0 SErr 0x0 action 0x0
> Feb  6 07:09:57 localhost kernel: ata8.00: BMDMA2 stat 0x6c0009
> Feb  6 07:09:57 localhost kernel: ata8.00: cmd
> 25/00:80:cf:cd:69/00:00:2f:00:00/e0 tag 0 dma 65536 in
> Feb  6 07:09:57 localhost kernel:         res
> 51/40:00:e4:cd:69/00:00:2f:00:00/e0 Emask 0x9 (media error)
> Feb  6 07:09:57 localhost kernel: ata8.00: status: { DRDY ERR }
> Feb  6 07:09:57 localhost kernel: ata8.00: error: { UNC }

That's fairly definitive, uncorrected read error reported by the drive. 
You might want to check its SMART status. Could be a bad drive, or 
potentially other causes like excessive vibration, high temperature, 
power issues..

  reply	other threads:[~2010-02-09  5:13 UTC|newest]

Thread overview: 4+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2010-02-08 11:11 "raid5:md0: read error not correctable (sector 795463080 on sdf1)" error on controller with SIL 3114 Håkon Løvdal
2010-02-09  5:13 ` Robert Hancock [this message]
2010-02-17  2:42   ` Håkon Løvdal
2010-02-20 13:05 ` Håkon Løvdal

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=4B70EEFD.1040603@gmail.com \
    --to=hancockrwd@gmail.com \
    --cc=hlovdal@gmail.com \
    --cc=linux-ide@vger.kernel.org \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.