From: Tejun Heo <htejun@gmail.com>
To: Robert Hancock <hancockr@shaw.ca>
Cc: Norbert Preining <preining@logic.at>,
linux-kernel@vger.kernel.org, linux-ide@vger.kernel.org
Subject: Re: SATA eating my disk, port reset, destroying unrelated data
Date: Mon, 12 Nov 2007 12:57:25 +0900 [thread overview]
Message-ID: <4737CF25.6070100@gmail.com> (raw)
In-Reply-To: <473256BD.1080205@shaw.ca>
Robert Hancock wrote:
> Norbert Preining wrote:
>> ata1.00: exception Emask 0x0 SAct 0x0 SErr 0x400000 action 0x2
>
> Serror 0x400000 means a handshake error. Usually Serror indications are
> due to a hardware problem (bad SATA cable, power or drive problem).
>
>> ata1.00: (BMDMA stat 0x25)
>> ata1.00: cmd 35/00:00:2a:6f:c0/00:04:0c:00:00/e0 tag 0 cdb 0x0 data
>> 524288 out
>> res 51/84:10:1a:72:c0/84:01:0c:00:00/e0 Emask 0x10 (ATA bus
>> error)
And error register value 0x84 - ICRC (CRC error on ATA bus) and ABRT
(command abort) - agrees with what SError indicates. You're
experiencing transmission errors.
>> ata1: soft resetting port
>> ata1: SATA link up 3.0 Gbps (SStatus 123 SControl 300)
>> ata1.00: configured for UDMA/133
>> ata1: EH complete
>>
>> Even worse, sometimes the reset does not work ...
>>
>> ata1: device not ready (errno=-16), forcing hardreset
>> ata1: hard resetting port
>> ata1 SRST failed (errno=-19)
>> ata1: reset failed (errno=-19), retrying in 10 secs
>> ..
>>
>> (typed from a digital photo, nothing remains in the logs)
>>
>> After this I need to do a cold boot otherwise the drive is really in a
>> bad state and not even the bios gets it right.
>
> If even the BIOS cannot reset properly then that also really points to a
> hardware problem..
Also, firmware on some harddrives die solidly when transmission errors
occur. Maybe yours is one. There's nothing much the driver can do
about both the transmission error and the drive locking up afterward.
If the transmission errors occur frequently, the chance is that either
your drive or power supply is dying. As for the drive lock up, maybe
firmware update will help.
* "smartctl -a /dev/sda" might report something interesting.
* Connect the drive to different SATA and power connector and see if the
problem persists.
* Connect the drive to a separate power supply and see if the problem
persists.
--
tejun
next prev parent reply other threads:[~2007-11-12 3:57 UTC|newest]
Thread overview: 3+ messages / expand[flat|nested] mbox.gz Atom feed top
[not found] <fa.49dhbrKdHhj8Seg3o+OTGRYqytk@ifi.uio.no>
2007-11-08 0:22 ` SATA eating my disk, port reset, destroying unrelated data Robert Hancock
2007-11-12 3:57 ` Tejun Heo [this message]
2007-11-04 20:58 Norbert Preining
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=4737CF25.6070100@gmail.com \
--to=htejun@gmail.com \
--cc=hancockr@shaw.ca \
--cc=linux-ide@vger.kernel.org \
--cc=linux-kernel@vger.kernel.org \
--cc=preining@logic.at \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).