linux-ide.vger.kernel.org archive mirror
 help / color / mirror / Atom feed
From: Robert Hancock <hancockrwd@gmail.com>
To: "Török Edwin" <edwintorok@gmail.com>
Cc: linux-ide@vger.kernel.org, linux-kernel@vger.kernel.org
Subject: Re: ata: failed to IDENTIFY / SRST failed (errno = -16) problems on/after booting 2.6.35-rc3
Date: Mon, 05 Jul 2010 17:50:57 -0600	[thread overview]
Message-ID: <4C326FE1.2020005@gmail.com> (raw)
In-Reply-To: <20100705224627.3a158e8c@debian>

On 07/05/2010 01:46 PM, Török Edwin wrote:
> On Sun, 27 Jun 2010 23:23:47 +0300
> Török Edwin<edwintorok@gmail.com>  wrote:
>
>> Hi,
>>
>> Using 2.6.35-rc3 I noticed this in my dmesg (see end of email for full dmesg output)
>> [28144.351747] ata9: drained 65536 bytes to clear DRQ.
>> [28144.460834] ata9.01: exception Emask 0x0 SAct 0x0 SErr 0x0 action 0x6
>> [28144.460838] sr 8:0:1:0: CDB: Prevent/Allow Medium Removal: 1e 00 00
>> 00 00 00 [28144.460846] ata9.01: cmd
>> a0/00:00:00:00:00/00:00:00:00:00/b0 tag 0 [28144.460846]          res
>> 7f/7f:7f:7f:7f:7f/00:00:00:00:00/7f Emask 0x3 (HSM violation)
>> [28144.460849] ata9.01: status: { DRDY DF DRQ ERR } [28144.460867]
>> ata9: soft resetting link
>> ....
>> [32977.433092] ata9: EH complete
>
> The problem has just become worse:
>   - an error occurs on ata9 during boot, taking several minutes to bring
>     up the link:
>
> Jul  5 09:41:49 debian kernel: [   15.824148] ata9.01: qc timeout (cmd
> 0xa1)
> Jul  5 09:41:49 debian kernel: [   15.824155] ata9.01: failed to
> IDENTIFY (I/O error, err_mask=0x4)
> Jul  5 09:41:49 debian kernel: [   20.864007] ata9: link is slow to
> respond, please be patient (ready=0)
> Jul  5 09:41:49 debian kernel: [   25.848007] ata9: device not ready
> (errno=-16), forcing hardreset
> Jul  5 09:41:49 debian kernel: [   31.044007] ata9: link is slow to
> respond, please be patient (ready=0)
> Jul  5 09:41:49 debian kernel: [   41.056006] ata9: link is slow to
> respond, please be patient (ready=0)
> Jul  5 09:41:49 debian kernel: [   51.068007] ata9: link is slow to
> respond, please be patient (ready=0)
> Jul  5 09:41:49 debian kernel: [   74.492148] ata9.00: qc timeout (cmd
> 0xa1)
> Jul  5 09:41:49 debian kernel: [   74.492154] ata9.00: failed to
> IDENTIFY (I/O error, err_mask=0x4)
> Jul  5 09:41:49 debian kernel: [   79.532006] ata9: link is slow to
> respond, please be patient (ready=0)
> Jul  5 09:41:49 debian kernel: [   84.516007] ata9: device not ready
> (errno=-16), forcing hardreset
> Jul  5 09:41:49 debian kernel: [   89.712006] ata9: link is slow to
> respond, please be patient (ready=0)
> Jul  5 09:41:49 debian kernel: [   99.724007] ata9: link is slow to
> respond, please be patient (ready=0)
> Jul  5 09:41:49 debian kernel: [  109.736007] ata9: link is slow to
> respond, please be patient (ready=0)
> Jul  5 09:41:49 debian kernel: [  138.184642] ata9.00: ATAPI: ASUS
> CRW-5232AS, 1.01, max UDMA/33
> Jul  5 09:41:49 debian kernel: [  138.192670] ata9.00: configured for
> UDMA/33
>
>     - sometimes the link never comes up (well never is ~5m, I
> didn't wait longer). it just keeps trying to reset the link saying
> that SRST failed with errno -16 ... endlessly, hence booting is
> impossible.
>
> This is bad: the CDROM is not required to successfully boot (in this
> case anyway), the kernel should IMHO just try reestablishing that link
> in a background thread and finish booting normally.

I think it would if pata_jmicron had parallel scanning enabled, which it 
currently doesn't. It may be able to be turned on, someone just has to 
make sure it's safe for that chipset.

>
> Note that while this DID started to occur soon after I installed
> 2.6.35-rc3 (like 1 bisection run + 5 more boots later), if I now try to
> boot 2.6.34 the same thing happens (i.e. link resets endlessly on boot).
> This has NEVER happened with a kernel<2.6.35-rc3 though .. until
> now.
>
> Also I noticed that the BIOS sometimes hanged during boot (probably
> trying to establish a link to the CDROM too), resetting it a couple of
> times allowed it to reach Linux, but then Linux hanged.
> It could be a hardware failure of the CDROM that just happened to occur
> after I installed 2.6.35-rc3, I don't know.

It does sound like a hardware problem, yes, from those symptoms.

>
> For now I pulled out the power+data cables from my 2 CDROMs so I can at
> least boot. That of course made all these problems go away.
>
> When I have some more time I'll try plugging back the 2 CDROMs one at a
> time, exchange the cables, etc. to see if it is a problem with one of
> the CDROM drives themselves.
>
> In the meantime are there any debug messages I can enable for the next
> time I try booting with the CDROMs?
> Is there any diagnostic I can run from Linux to tell where the problem
> is:
>   - the JMicron PATA controller?
>   - the cables?
>   - the CDROM drive(s) themselves?

It's probably going to be difficult to isolate that problem from 
software, it's likely easiest to remove or swap components until the 
problem goes away.

  reply	other threads:[~2010-07-05 23:51 UTC|newest]

Thread overview: 4+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2010-06-27 20:23 hald-addon-storage causing ata soft resets on CDROM drive? Török Edwin
2010-07-05 19:46 ` ata: failed to IDENTIFY / SRST failed (errno = -16) problems on/after booting 2.6.35-rc3 Török Edwin
2010-07-05 23:50   ` Robert Hancock [this message]
2010-07-10  9:58     ` Török Edvin

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=4C326FE1.2020005@gmail.com \
    --to=hancockrwd@gmail.com \
    --cc=edwintorok@gmail.com \
    --cc=linux-ide@vger.kernel.org \
    --cc=linux-kernel@vger.kernel.org \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).