linux-ide.vger.kernel.org archive mirror
 help / color / mirror / Atom feed
* Re: SATA problems
       [not found]   ` <467173C4.1050500@lbsd.net>
@ 2007-06-14 17:33     ` Jeff Garzik
  2007-06-18  8:05       ` Nigel Kukard
  0 siblings, 1 reply; 4+ messages in thread
From: Jeff Garzik @ 2007-06-14 17:33 UTC (permalink / raw)
  To: Nigel Kukard; +Cc: linux-kernel, IDE/ATA development list

Nigel Kukard wrote:
>>> I'm stumped trying to track down the below intermittent problem.....
>>>
>>> I've confirmed this problem on 2.6.19, 2.6.20 and 2.6.21.

>>> Jun 14 07:55:52 nigel-m2v kernel: ata2.00: exception Emask 0x0 SAct 0x0
>>> SErr 0x0 action 0x2 frozen
>>> Jun 14 07:55:52 nigel-m2v kernel: ata2.00: cmd
>>> ca/00:18:87:e7:00/00:00:00:00:00/e0 tag 0 cdb 0x0 data 12288 out
>>> Jun 14 07:55:52 nigel-m2v kernel:          res
>>> 40/00:00:00:00:00/00:00:00:00:00/00 Emask 0x4 (timeout)
>>> Jun 14 07:55:52 nigel-m2v kernel: ata2: soft resetting port
>>> Jun 14 07:55:52 nigel-m2v kernel: ATA: abnormal status 0x7F on port
>>> 0x0001c807
>>> Jun 14 07:55:52 nigel-m2v kernel: ATA: abnormal status 0x7F on port
>>> 0x0001c807
>>> Jun 14 07:56:22 nigel-m2v kernel: ata2.00: qc timeout (cmd 0xef)
>>> Jun 14 07:56:22 nigel-m2v kernel: ata2.00: failed to set xfermode
>>> (err_mask=0x4)

>> Try 2.6.22-rc4-gitX...

> Is there a patch in particular I can maybe apply? I see you made a
> couple of commits ... my problem is this is also happening on one of my
> production boxes which has a few other patches applied, I'm a bit scared
> of conflicts ... I don't really want to break anything by upgrading the
> kernel.

The two most relevant git commits:

commit 51b94d2a5a90d4800e74d7348bcde098a28f4fb3
Author: Tejun Heo <htejun@gmail.com>
Date:   Fri Jun 8 13:46:55 2007 -0700

     sata_promise: use TF interface for polling NODATA commands

commit 464cf177df7727efcc5506322fc5d0c8b896f545
Author: Tejun Heo <htejun@gmail.com>
Date:   Sun May 27 15:10:40 2007 +0200

     libata: always use polling SETXFER

If you have a git tree local to you, "git-diff-tree -p $COMMIT" will 
extract a patch, otherwise click "raw" after surfing to 
http://git.kernel.org/?p=linux/kernel/git/torvalds/linux-2.6.git;a=commitdiff;h=$COMMIT

Regards,

	Jeff



^ permalink raw reply	[flat|nested] 4+ messages in thread

* Re: SATA problems
  2007-06-14 17:33     ` Jeff Garzik
@ 2007-06-18  8:05       ` Nigel Kukard
  0 siblings, 0 replies; 4+ messages in thread
From: Nigel Kukard @ 2007-06-18  8:05 UTC (permalink / raw)
  To: Jeff Garzik; +Cc: linux-kernel, IDE/ATA development list

[-- Attachment #1: Type: text/plain, Size: 5200 bytes --]

Hi Jeff,

Ok ... second part of my problem. Where should I look in trying to debug
the below problem...

Regards
Nigel

Jun 18 07:59:56 nigel-m2v kernel: ata2.00: exception Emask 0x0 SAct 0x0
SErr 0x0 action 0x2 frozen
Jun 18 07:59:56 nigel-m2v kernel: ata2.00: cmd
ca/00:08:bf:ab:68/00:00:00:00:00/e8 tag 0 cdb 0x0 data 4096 out
Jun 18 07:59:56 nigel-m2v kernel:          res
40/00:00:00:00:00/00:00:00:00:00/00 Emask 0x4 (timeout)
Jun 18 07:59:56 nigel-m2v kernel: ata2: soft resetting port
Jun 18 07:59:56 nigel-m2v kernel: ATA: abnormal status 0x7F on port
0x0001c807
Jun 18 07:59:56 nigel-m2v kernel: ATA: abnormal status 0x7F on port
0x0001c807
Jun 18 07:59:56 nigel-m2v kernel: ata2.00: configured for UDMA/133
Jun 18 07:59:56 nigel-m2v kernel: ata2: EH complete
Jun 18 08:00:26 nigel-m2v kernel: rtc: lost 7740 interrupts
Jun 18 08:00:26 nigel-m2v kernel: ata2.00: exception Emask 0x0 SAct 0x0
SErr 0x0 action 0x2 frozen
Jun 18 08:00:26 nigel-m2v kernel: ata2.00: cmd
ca/00:08:bf:ab:68/00:00:00:00:00/e8 tag 0 cdb 0x0 data 4096 out
Jun 18 08:00:26 nigel-m2v kernel:          res
40/00:00:00:00:00/00:00:00:00:00/00 Emask 0x4 (timeout)
Jun 18 08:00:26 nigel-m2v kernel: ata2: soft resetting port
Jun 18 08:00:26 nigel-m2v kernel: ATA: abnormal status 0x7F on port
0x0001c807
Jun 18 08:00:27 nigel-m2v kernel: ATA: abnormal status 0x7F on port
0x0001c807
Jun 18 08:00:27 nigel-m2v kernel: ata2.00: configured for UDMA/133
Jun 18 08:00:27 nigel-m2v kernel: ata2: EH complete
Jun 18 08:00:57 nigel-m2v kernel: rtc: lost 7741 interrupts
Jun 18 08:00:57 nigel-m2v kernel: ata2.00: exception Emask 0x0 SAct 0x0
SErr 0x0 action 0x2 frozen
Jun 18 08:00:57 nigel-m2v kernel: ata2.00: cmd
ca/00:08:bf:ab:68/00:00:00:00:00/e8 tag 0 cdb 0x0 data 4096 out
Jun 18 08:00:57 nigel-m2v kernel:          res
40/00:00:00:00:00/00:00:00:00:00/00 Emask 0x4 (timeout)
Jun 18 08:00:57 nigel-m2v kernel: ata2: soft resetting port
Jun 18 08:00:57 nigel-m2v kernel: ATA: abnormal status 0x7F on port
0x0001c807
Jun 18 08:00:57 nigel-m2v kernel: ATA: abnormal status 0x7F on port
0x0001c807
Jun 18 08:00:57 nigel-m2v kernel: ata2.00: configured for UDMA/133
Jun 18 08:00:57 nigel-m2v kernel: ata2: EH complete
Jun 18 08:01:27 nigel-m2v kernel: rtc: lost 7740 interrupts
Jun 18 08:01:27 nigel-m2v kernel: ata2.00: limiting speed to UDMA/100:PIO4
Jun 18 08:01:27 nigel-m2v kernel: ata2.00: exception Emask 0x0 SAct 0x0
SErr 0x0 action 0x2 frozen
Jun 18 08:01:27 nigel-m2v kernel: ata2.00: cmd
ca/00:08:bf:ab:68/00:00:00:00:00/e8 tag 0 cdb 0x0 data 4096 out
Jun 18 08:01:27 nigel-m2v kernel:          res
40/00:00:00:00:00/00:00:00:00:00/00 Emask 0x4 (timeout)
Jun 18 08:01:27 nigel-m2v kernel: ata2: soft resetting port
Jun 18 08:01:27 nigel-m2v kernel: ATA: abnormal status 0x7F on port
0x0001c807
Jun 18 08:01:27 nigel-m2v kernel: ATA: abnormal status 0x7F on port
0x0001c807
Jun 18 08:01:27 nigel-m2v kernel: ata2.00: configured for UDMA/100
Jun 18 08:01:27 nigel-m2v kernel: ata2: EH complete
Jun 18 08:01:57 nigel-m2v kernel: rtc: lost 7740 interrupts
Jun 18 08:01:57 nigel-m2v kernel: ata2.00: exception Emask 0x0 SAct 0x0
SErr 0x0 action 0x2 frozen
Jun 18 08:01:57 nigel-m2v kernel: ata2.00: cmd
ca/00:08:bf:ab:68/00:00:00:00:00/e8 tag 0 cdb 0x0 data 4096 out
Jun 18 08:01:57 nigel-m2v kernel:          res
40/00:00:00:00:00/00:00:00:00:00/00 Emask 0x4 (timeout)
Jun 18 08:01:57 nigel-m2v kernel: ata2: soft resetting port
Jun 18 08:01:57 nigel-m2v kernel: ATA: abnormal status 0x7F on port
0x0001c807
Jun 18 08:01:57 nigel-m2v kernel: ATA: abnormal status 0x7F on port
0x0001c807
Jun 18 08:01:57 nigel-m2v kernel: ata2.00: configured for UDMA/100
Jun 18 08:01:57 nigel-m2v kernel: ata2: EH complete
Jun 18 08:02:27 nigel-m2v kernel: rtc: lost 7741 interrupts
Jun 18 08:02:27 nigel-m2v kernel: ata2.00: exception Emask 0x0 SAct 0x0
SErr 0x0 action 0x2 frozen
Jun 18 08:02:27 nigel-m2v kernel: ata2.00: cmd
ca/00:08:bf:ab:68/00:00:00:00:00/e8 tag 0 cdb 0x0 data 4096 out
Jun 18 08:02:27 nigel-m2v kernel:          res
40/00:00:00:00:00/00:00:00:00:00/00 Emask 0x4 (timeout)
Jun 18 08:02:27 nigel-m2v kernel: ata2: soft resetting port
Jun 18 08:02:27 nigel-m2v kernel: ATA: abnormal status 0x7F on port
0x0001c807
Jun 18 08:02:27 nigel-m2v kernel: ATA: abnormal status 0x7F on port
0x0001c807
Jun 18 08:02:28 nigel-m2v kernel: ata2.00: configured for UDMA/100
Jun 18 08:02:28 nigel-m2v kernel: sd 3:0:0:0: SCSI error: return code =
0x08000002
Jun 18 08:02:28 nigel-m2v kernel: sda: Current [descriptor]: sense key=0xb
Jun 18 08:02:28 nigel-m2v kernel:     ASC=0x0 ASCQ=0x0
Jun 18 08:02:28 nigel-m2v kernel: Descriptor sense data with sense
descriptors (in hex):
Jun 18 08:02:28 nigel-m2v kernel:         72 0b 00 00 00 00 00 0c 00 0a
80 00 00 00 00 00
Jun 18 08:02:28 nigel-m2v kernel:         00 00 00 00
Jun 18 08:02:28 nigel-m2v kernel: end_request: I/O error, dev sda,
sector 141077439
Jun 18 08:02:28 nigel-m2v kernel: Buffer I/O error on device sda1,
logical block 17634672
Jun 18 08:02:28 nigel-m2v kernel: lost page write due to I/O error on sda1
Jun 18 08:02:28 nigel-m2v kernel: ata2: EH complete



[-- Attachment #2: OpenPGP digital signature --]
[-- Type: application/pgp-signature, Size: 189 bytes --]

^ permalink raw reply	[flat|nested] 4+ messages in thread

* Re: SATA problems
       [not found]         ` <46D68CC2.3030206@lbsd.net>
@ 2007-09-10  9:02           ` Andrew Morton
  2007-09-13  8:55             ` Tejun Heo
  0 siblings, 1 reply; 4+ messages in thread
From: Andrew Morton @ 2007-09-10  9:02 UTC (permalink / raw)
  To: Nigel Kukard; +Cc: Dave Jones, Jeff Garzik, linux-kernel, linux-ide

On Thu, 30 Aug 2007 09:24:18 +0000 Nigel Kukard <nkukard@lbsd.net> wrote:

> Hrmmm,
> 
> >>  > 
> >>  >  > > Jun 14 07:55:52 nigel-m2v kernel: ATA: abnormal status 0x7F on port
> >>  >  > > 0x0001c807
> >>  >  > > Jun 14 07:55:52 nigel-m2v kernel: ATA: abnormal status 0x7F on port
> >>  >  > > 0x0001c807
> >>  > 
> >>  > Unrelated to the other error, but I've been meaning to ask for a while..
> >>  > If this is 'abnormal', why does every SATA box I've seen do it?
> >>
> >> *crickets*

chirp, chirp.

> >> Should we check for this case explicitly, and not print this?
> >>
> >>   
> >>     
> > After I get the above errors, my entire SATA bus crashes and I need to
> > hard reset the box ... not sure we can just ignore the errors?
> >
> >   
> 
> Appears even with the patch provided a few months ago I'm getting
> freezes. Replaced the HDD & all cables, same errors ... especially
> whilst doing heavy IO.
> 
> Can anyone shed some light?
> 

I think I was told last week that copying the appropriate mailing list will
at least prevent chirping, so let's try that.

Original thread here: http://lkml.org/lkml/2007/6/14/154

> ata2.00: exception Emask 0x0 SAct 0x0 SErr 0x0 action 0x2 frozen
> ata2.00: cmd c8/00:00:9f:c9:cb/00:00:00:00:00/e1 tag 0 cdb 0x0 data
> 131072 in
>          res 40/00:00:00:00:00/00:00:00:00:00/00 Emask 0x4 (timeout)
> ata2: soft resetting port
> ATA: abnormal status 0x7F on port 0x0001c807
> ATA: abnormal status 0x7F on port 0x0001c807
> ata2.00: configured for UDMA/133
> ata2: EH complete
> ata2.00: exception Emask 0x0 SAct 0x0 SErr 0x0 action 0x2 frozen
> ata2.00: cmd c8/00:00:9f:c9:cb/00:00:00:00:00/e1 tag 0 cdb 0x0 data
> 131072 in
>          res 40/00:00:00:00:00/00:00:00:00:00/00 Emask 0x4 (timeout)
> ata2: soft resetting port
> ATA: abnormal status 0x7F on port 0x0001c807
> ATA: abnormal status 0x7F on port 0x0001c807
> ata2.00: configured for UDMA/133
> ata2: EH complete
> ata2.00: exception Emask 0x0 SAct 0x0 SErr 0x0 action 0x2 frozen
> ata2.00: cmd c8/00:00:9f:c9:cb/00:00:00:00:00/e1 tag 0 cdb 0x0 data
> 131072 in
>          res 40/00:00:00:00:00/00:00:00:00:00/00 Emask 0x4 (timeout)
> ata2: soft resetting port
> ATA: abnormal status 0x7F on port 0x0001c807
> ATA: abnormal status 0x7F on port 0x0001c807
> ata2.00: configured for UDMA/133
> ata2: EH complete
> ata2.00: limiting speed to UDMA/100:PIO4
> ata2.00: exception Emask 0x0 SAct 0x0 SErr 0x0 action 0x2 frozen
> ata2.00: cmd c8/00:00:9f:c9:cb/00:00:00:00:00/e1 tag 0 cdb 0x0 data
> 131072 in
>          res 40/00:00:00:00:00/00:00:00:00:00/00 Emask 0x4 (timeout)
> ata2: soft resetting port
> ATA: abnormal status 0x7F on port 0x0001c807
> ATA: abnormal status 0x7F on port 0x0001c807
> ata2.00: configured for UDMA/100
> ata2: EH complete
> ata2.00: exception Emask 0x0 SAct 0x0 SErr 0x0 action 0x2 frozen
> ata2.00: cmd c8/00:00:9f:c9:cb/00:00:00:00:00/e1 tag 0 cdb 0x0 data
> 131072 in
>          res 40/00:00:00:00:00/00:00:00:00:00/00 Emask 0x4 (timeout)
> ata2: soft resetting port
> ATA: abnormal status 0x7F on port 0x0001c807
> ATA: abnormal status 0x7F on port 0x0001c807
> ata2.00: configured for UDMA/100
> ata2: EH complete
> ata2.00: exception Emask 0x0 SAct 0x0 SErr 0x0 action 0x2 frozen
> ata2.00: cmd c8/00:00:9f:c9:cb/00:00:00:00:00/e1 tag 0 cdb 0x0 data
> 131072 in
>          res 40/00:00:00:00:00/00:00:00:00:00/00 Emask 0x4 (timeout)
> ata2: soft resetting port
> ATA: abnormal status 0x7F on port 0x0001c807
> ATA: abnormal status 0x7F on port 0x0001c807
> ata2.00: configured for UDMA/100
> sd 3:0:0:0: SCSI error: return code = 0x08000002
> sda: Current [descriptor]: sense key=0xb
>     ASC=0x0 ASCQ=0x0
> Descriptor sense data with sense descriptors (in hex):
>         72 0b 00 00 00 00 00 0c 00 0a 80 00 00 00 00 00
>         00 00 00 00
> end_request: I/O error, dev sda, sector 30132639
> ata2: EH complete
> ata2.00: exception Emask 0x0 SAct 0x0 SErr 0x0 action 0x2 frozen
> ata2.00: cmd c8/00:00:9f:ca:cb/00:00:00:00:00/e1 tag 0 cdb 0x0 data
> 131072 in
>          res 40/00:00:00:00:00/00:00:00:00:00/00 Emask 0x4 (timeout)
> ata2: soft resetting port
> ATA: abnormal status 0x7F on port 0x0001c807
> ATA: abnormal status 0x7F on port 0x0001c807
> ata2.00: configured for UDMA/100
> ata2: EH complete
> ata2.00: limiting speed to UDMA/33:PIO4
> ata2.00: exception Emask 0x0 SAct 0x0 SErr 0x0 action 0x2 frozen
> ata2.00: cmd c8/00:00:9f:ca:cb/00:00:00:00:00/e1 tag 0 cdb 0x0 data
> 131072 in
>          res 40/00:00:00:00:00/00:00:00:00:00/00 Emask 0x4 (timeout)
> ata2: soft resetting port
> ATA: abnormal status 0x7F on port 0x0001c807
> ATA: abnormal status 0x7F on port 0x0001c807
> ata2.00: configured for UDMA/33
> ata2: EH complete
> ata2.00: exception Emask 0x0 SAct 0x0 SErr 0x0 action 0x2 frozen
> ata2.00: cmd c8/00:00:9f:ca:cb/00:00:00:00:00/e1 tag 0 cdb 0x0 data
> 131072 in
>          res 40/00:00:00:00:00/00:00:00:00:00/00 Emask 0x4 (timeout)
> ata2: soft resetting port
> ATA: abnormal status 0x7F on port 0x0001c807
> ATA: abnormal status 0x7F on port 0x0001c807
> ata2.00: configured for UDMA/33
> ata2: EH complete
> ata2.00: exception Emask 0x0 SAct 0x0 SErr 0x0 action 0x2 frozen
> ata2.00: cmd c8/00:00:9f:ca:cb/00:00:00:00:00/e1 tag 0 cdb 0x0 data
> 131072 in
>          res 40/00:00:00:00:00/00:00:00:00:00/00 Emask 0x4 (timeout)
> ata2: soft resetting port
> ATA: abnormal status 0x7F on port 0x0001c807
> ATA: abnormal status 0x7F on port 0x0001c807
> ata2.00: configured for UDMA/33
> ata2: EH complete
> 
> 
> 

^ permalink raw reply	[flat|nested] 4+ messages in thread

* Re: SATA problems
  2007-09-10  9:02           ` SATA problems Andrew Morton
@ 2007-09-13  8:55             ` Tejun Heo
  0 siblings, 0 replies; 4+ messages in thread
From: Tejun Heo @ 2007-09-13  8:55 UTC (permalink / raw)
  To: Andrew Morton
  Cc: Nigel Kukard, Dave Jones, Jeff Garzik, linux-kernel, linux-ide

Andrew Morton wrote:
> On Thu, 30 Aug 2007 09:24:18 +0000 Nigel Kukard <nkukard@lbsd.net> wrote:
> 
>> Hrmmm,
>>
>>>>  > 
>>>>  >  > > Jun 14 07:55:52 nigel-m2v kernel: ATA: abnormal status 0x7F on port
>>>>  >  > > 0x0001c807
>>>>  >  > > Jun 14 07:55:52 nigel-m2v kernel: ATA: abnormal status 0x7F on port
>>>>  >  > > 0x0001c807
>>>>  > 
>>>>  > Unrelated to the other error, but I've been meaning to ask for a while..
>>>>  > If this is 'abnormal', why does every SATA box I've seen do it?
>>>>
>>>> *crickets*

It's removed (finally).  :-)

-- 
tejun

^ permalink raw reply	[flat|nested] 4+ messages in thread

end of thread, other threads:[~2007-09-13  8:57 UTC | newest]

Thread overview: 4+ messages (download: mbox.gz follow: Atom feed
-- links below jump to the message on this page --
     [not found] <4671366B.7090702@lbsd.net>
     [not found] ` <46716B1D.1070902@garzik.org>
     [not found]   ` <20070614182854.GC1223@redhat.com>
     [not found]     ` <20070618200703.GB13344@redhat.com>
     [not found]       ` <46776223.3050402@lbsd.net>
     [not found]         ` <46D68CC2.3030206@lbsd.net>
2007-09-10  9:02           ` SATA problems Andrew Morton
2007-09-13  8:55             ` Tejun Heo
     [not found] <8vX5T-4hg-5@gated-at.bofh.it>
     [not found] ` <8w0n1-15u-27@gated-at.bofh.it>
     [not found]   ` <467173C4.1050500@lbsd.net>
2007-06-14 17:33     ` Jeff Garzik
2007-06-18  8:05       ` Nigel Kukard

This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).