* T61 SATA error in log
@ 2007-09-27 7:21 Benjamin Herrenschmidt
2007-09-27 20:35 ` Tejun Heo
0 siblings, 1 reply; 4+ messages in thread
From: Benjamin Herrenschmidt @ 2007-09-27 7:21 UTC (permalink / raw)
To: linux-ide@vger.kernel.org; +Cc: Jeff Garzik, Tejun Heo
Saw that popping up in my log today on a brand new T61 thinkpad:
[ 427.712000] ata1.00: exception Emask 0x2 SAct 0x18 SErr 0x0 action 0x2 frozen
[ 427.712000] ata1.00: (spurious completions during NCQ issue=0x0 SAct=0x18 FIS=004040a1:00000024)
[ 427.712000] ata1.00: cmd 61/08:18:f4:74:54/00:00:04:00:00/40 tag 3 cdb 0x0 data 4096 out
[ 427.712000] res 40/00:28:64:e4:51/00:00:04:00:00/40 Emask 0x2 (HSM violation)
[ 427.712000] ata1.00: cmd 61/08:20:84:10:81/00:00:04:00:00/40 tag 4 cdb 0x0 data 4096 out
[ 427.712000] res 40/00:28:64:e4:51/00:00:04:00:00/40 Emask 0x2 (HSM violation)
[ 428.024000] ata1: soft resetting port
[ 428.196000] ata1: SATA link up 1.5 Gbps (SStatus 113 SControl 300)
[ 428.204000] ata1.00: configured for UDMA/100
[ 428.204000] ata1: EH complete
[ 428.204000] sd 0:0:0:0: [sda] 234441648 512-byte hardware sectors (120034 MB)
[ 428.204000] sd 0:0:0:0: [sda] Write Protect is off
[ 428.204000] sd 0:0:0:0: [sda] Mode Sense: 00 3a 00 00
[ 428.204000] sd 0:0:0:0: [sda] Write cache: enabled, read cache: enabled, doesn't support DPO or FUA
Kernel recovered just fine and things seem smooth so far. Is that something I need
to worry about ?
Kernel is ubuntu gutsy's 2.6.22 and controller is:
Cheers,00:1f.2 SATA controller: Intel Corporation 82801HBM/HEM (ICH8M/ICH8M-E) SATA AHCI Controller (rev 03) (prog-if 01 [AHCI 1.0])
Subsystem: Lenovo Unknown device 20a7
Control: I/O+ Mem+ BusMaster+ SpecCycle- MemWINV- VGASnoop- ParErr- Stepping- SERR- FastB2B-
Status: Cap+ 66MHz+ UDF- FastB2B+ ParErr- DEVSEL=medium >TAbort- <TAbort- <MAbort- >SERR- <PERR-
Latency: 0
Interrupt: pin B routed to IRQ 218
Region 0: I/O ports at 1c50 [size=8]
Region 1: I/O ports at 1c44 [size=4]
Region 2: I/O ports at 1c48 [size=8]
Region 3: I/O ports at 1c40 [size=4]
Region 4: I/O ports at 1c20 [size=32]
Region 5: Memory at fe226000 (32-bit, non-prefetchable) [size=2K]
Capabilities: [80] Message Signalled Interrupts: Mask- 64bit- Queue=0/2 Enable+
Address: fee0300c Data: 4152
Capabilities: [70] Power Management version 3
Flags: PMEClk- DSI- D1- D2- AuxCurrent=0mA PME(D0-,D1-,D2-,D3hot+,D3cold-)
Status: D0 PME-Enable- DSel=0 DScale=0 PME-
Capabilities: [a8] #12 [0010]
Cheers,
Ben.
^ permalink raw reply [flat|nested] 4+ messages in thread* Re: T61 SATA error in log
2007-09-27 7:21 T61 SATA error in log Benjamin Herrenschmidt
@ 2007-09-27 20:35 ` Tejun Heo
2007-09-27 23:36 ` Benjamin Herrenschmidt
0 siblings, 1 reply; 4+ messages in thread
From: Tejun Heo @ 2007-09-27 20:35 UTC (permalink / raw)
To: benh; +Cc: linux-ide@vger.kernel.org, Jeff Garzik
Benjamin Herrenschmidt wrote:
> Saw that popping up in my log today on a brand new T61 thinkpad:
>
> [ 427.712000] ata1.00: exception Emask 0x2 SAct 0x18 SErr 0x0 action 0x2 frozen
> [ 427.712000] ata1.00: (spurious completions during NCQ issue=0x0 SAct=0x18 FIS=004040a1:00000024)
> [ 427.712000] ata1.00: cmd 61/08:18:f4:74:54/00:00:04:00:00/40 tag 3 cdb 0x0 data 4096 out
> [ 427.712000] res 40/00:28:64:e4:51/00:00:04:00:00/40 Emask 0x2 (HSM violation)
> [ 427.712000] ata1.00: cmd 61/08:20:84:10:81/00:00:04:00:00/40 tag 4 cdb 0x0 data 4096 out
> [ 427.712000] res 40/00:28:64:e4:51/00:00:04:00:00/40 Emask 0x2 (HSM violation)
> [ 428.024000] ata1: soft resetting port
> [ 428.196000] ata1: SATA link up 1.5 Gbps (SStatus 113 SControl 300)
> [ 428.204000] ata1.00: configured for UDMA/100
> [ 428.204000] ata1: EH complete
> [ 428.204000] sd 0:0:0:0: [sda] 234441648 512-byte hardware sectors (120034 MB)
> [ 428.204000] sd 0:0:0:0: [sda] Write Protect is off
> [ 428.204000] sd 0:0:0:0: [sda] Mode Sense: 00 3a 00 00
> [ 428.204000] sd 0:0:0:0: [sda] Write cache: enabled, read cache: enabled, doesn't support DPO or FUA
>
> Kernel recovered just fine and things seem smooth so far. Is that something I need
> to worry about ?
Please post the result of 'hdparm -I /dev/sda' and you don't need to
worry about it too much.
--
tejun
^ permalink raw reply [flat|nested] 4+ messages in thread
* Re: T61 SATA error in log
2007-09-27 20:35 ` Tejun Heo
@ 2007-09-27 23:36 ` Benjamin Herrenschmidt
2007-09-27 23:50 ` Tejun Heo
0 siblings, 1 reply; 4+ messages in thread
From: Benjamin Herrenschmidt @ 2007-09-27 23:36 UTC (permalink / raw)
To: Tejun Heo; +Cc: linux-ide@vger.kernel.org, Jeff Garzik
On Thu, 2007-09-27 at 13:35 -0700, Tejun Heo wrote:
> Benjamin Herrenschmidt wrote:
> > Saw that popping up in my log today on a brand new T61 thinkpad:
> >
> > [ 427.712000] ata1.00: exception Emask 0x2 SAct 0x18 SErr 0x0 action 0x2 frozen
> > [ 427.712000] ata1.00: (spurious completions during NCQ issue=0x0 SAct=0x18 FIS=004040a1:00000024)
> > [ 427.712000] ata1.00: cmd 61/08:18:f4:74:54/00:00:04:00:00/40 tag 3 cdb 0x0 data 4096 out
> > [ 427.712000] res 40/00:28:64:e4:51/00:00:04:00:00/40 Emask 0x2 (HSM violation)
> > [ 427.712000] ata1.00: cmd 61/08:20:84:10:81/00:00:04:00:00/40 tag 4 cdb 0x0 data 4096 out
> > [ 427.712000] res 40/00:28:64:e4:51/00:00:04:00:00/40 Emask 0x2 (HSM violation)
> > [ 428.024000] ata1: soft resetting port
> > [ 428.196000] ata1: SATA link up 1.5 Gbps (SStatus 113 SControl 300)
> > [ 428.204000] ata1.00: configured for UDMA/100
> > [ 428.204000] ata1: EH complete
> > [ 428.204000] sd 0:0:0:0: [sda] 234441648 512-byte hardware sectors (120034 MB)
> > [ 428.204000] sd 0:0:0:0: [sda] Write Protect is off
> > [ 428.204000] sd 0:0:0:0: [sda] Mode Sense: 00 3a 00 00
> > [ 428.204000] sd 0:0:0:0: [sda] Write cache: enabled, read cache: enabled, doesn't support DPO or FUA
> >
> > Kernel recovered just fine and things seem smooth so far. Is that something I need
> > to worry about ?
>
> Please post the result of 'hdparm -I /dev/sda' and you don't need to
> worry about it too much.
Allright. What is happening exactly ? A fluke on the link ?
> benh@pasglop:~/kernels/linux-2.6$ sudo hdparm -I /dev/sda
/dev/sda:
ATA device, with non-removable media
Model Number: ST9120822AS
Serial Number: 5LZ50702
Firmware Revision: 3.CLF
Standards:
Supported: 7 6 5 4
Likely used: 7
Configuration:
Logical max current
cylinders 16383 16383
heads 16 16
sectors/track 63 63
--
CHS current addressable sectors: 16514064
LBA user addressable sectors: 234441648
LBA48 user addressable sectors: 234441648
device size with M = 1024*1024: 114473 MBytes
device size with M = 1000*1000: 120034 MBytes (120 GB)
Capabilities:
LBA, IORDY(can be disabled)
Queue depth: 32
Standby timer values: spec'd by Standard, no device specific
minimum
R/W multiple sector transfer: Max = 16 Current = ?
Advanced power management level: unknown setting (0x8080)
Recommended acoustic management value: 254, current value: 0
DMA: mdma0 mdma1 mdma2 udma0 udma1 udma2 udma3 udma4 *udma5
Cycle time: min=120ns recommended=120ns
PIO: pio0 pio1 pio2 pio3 pio4
Cycle time: no flow control=240ns IORDY flow control=120ns
Commands/features:
Enabled Supported:
* SMART feature set
Security Mode feature set
* Power Management feature set
* Write cache
* Look-ahead
* Host Protected Area feature set
* WRITE_BUFFER command
* READ_BUFFER command
* DOWNLOAD_MICROCODE
* Advanced Power Management feature set
SET_MAX security extension
* 48-bit Address feature set
* Device Configuration Overlay feature set
* Mandatory FLUSH_CACHE
* FLUSH_CACHE_EXT
* SMART error logging
* SMART self-test
* General Purpose Logging feature set
* IDLE_IMMEDIATE with UNLOAD
* SATA-I signaling speed (1.5Gb/s)
* Native Command Queueing (NCQ)
* Phy event counters
* Device-initiated interface power management
* Software settings preservation
* SMART Command Transport (SCT) feature set
Security:
Master password revision code = 65534
supported
not enabled
not locked
frozen
not expired: security count
supported: enhanced erase
66min for SECURITY ERASE UNIT. 66min for ENHANCED SECURITY ERASE
UNIT.
Checksum: correct
^ permalink raw reply [flat|nested] 4+ messages in thread* Re: T61 SATA error in log
2007-09-27 23:36 ` Benjamin Herrenschmidt
@ 2007-09-27 23:50 ` Tejun Heo
0 siblings, 0 replies; 4+ messages in thread
From: Tejun Heo @ 2007-09-27 23:50 UTC (permalink / raw)
To: benh; +Cc: linux-ide@vger.kernel.org, Jeff Garzik
Benjamin Herrenschmidt wrote:
> Allright. What is happening exactly ? A fluke on the link ?
The drive is sending spurious completions of NCQ commands (ie. it's
sending completions for commands which are not pending). We believe
this happens due to firmware bugs and it matched the reality pretty well
(drives with spurious completions usually had other NCQ related
problems) but recently too many new drives are causing this problem. I
have no idea what's going on. I guess it's about time to contact
harddrive vendors.
>> benh@pasglop:~/kernels/linux-2.6$ sudo hdparm -I /dev/sda
>
> /dev/sda:
>
> ATA device, with non-removable media
> Model Number: ST9120822AS
> Serial Number: 5LZ50702
> Firmware Revision: 3.CLF
Will add to blacklist. Thanks.
--
tejun
^ permalink raw reply [flat|nested] 4+ messages in thread
end of thread, other threads:[~2007-09-27 23:51 UTC | newest]
Thread overview: 4+ messages (download: mbox.gz follow: Atom feed
-- links below jump to the message on this page --
2007-09-27 7:21 T61 SATA error in log Benjamin Herrenschmidt
2007-09-27 20:35 ` Tejun Heo
2007-09-27 23:36 ` Benjamin Herrenschmidt
2007-09-27 23:50 ` Tejun Heo
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).