From: "Lars Michael Jogbäck" <lm@jogback.se>
To: Tejun Heo <htejun@gmail.com>
Cc: linux-ide@vger.kernel.org
Subject: Re: Problems w/ Sil3124 + Port Multiplier
Date: Thu, 03 May 2007 13:04:59 +0200
Message-ID: <4639C1DB.3030501@jogback.se>
In-Reply-To: <4639A6A3.20001@gmail.com>

Tejun Heo wrote:
> Lars Michael Jogbäck wrote:
>   
>>> I think the disk attached to port 0 might be bad.  Please report the
>>> result of 'smartctl -d ata -a /dev/sdX' where sdX is the device attached
>>> to the failing port.
>>>
>>>   
>>>       
>> SMART Error Log Version: 1
>> No Errors Logged
>>     
>
> I was hoping to see some error logs but no.  Hardware_ECC_Recovered
> count seems high (385707184) but I dunno whether the value is normal or
> not.  Different manufacturers use different norms in counting them.  If
> you have other disks of the same model, you can compare the values and
> see whether it's unusually high.
>   
I think that is normal for this kind of disk. Below is the output from
another disk of the same model (this one is attached to a 3ware 9500
controller), and it shows a similarly high value.

smartctl version 5.36 [i686-pc-linux-gnu] Copyright (C) 2002-6 Bruce Allen
Home page is http://smartmontools.sourceforge.net/

=== START OF INFORMATION SECTION ===
Device Model:     SAMSUNG HD501LJ
Serial Number:    S0VVJ1NP300014
Firmware Version: CR100-10
User Capacity:    500,107,862,016 bytes
Device is:        In smartctl database [for details use: -P show]
ATA Version is:   8
ATA Standard is:  Not recognized. Minor revision code: 0x52
Local Time is:    Thu May  3 12:36:32 2007 CEST

==> WARNING: May need -F samsung or -F samsung2 enabled; see manual for details.

SMART support is: Available - device has SMART capability.
SMART support is: Enabled

=== START OF READ SMART DATA SECTION ===
SMART overall-health self-assessment test result: PASSED

General SMART Values:
Offline data collection status:  (0x00) Offline data collection activity
                                        was never started.
                                        Auto Offline Data Collection: Disabled.
Self-test execution status:      (   0) The previous self-test routine completed
                                        without error or no self-test has ever
                                        been run.
Total time to complete Offline
data collection:                 (8852) seconds.
Offline data collection
capabilities:                    (0x5b) SMART execute Offline immediate.
                                        Auto Offline data collection on/off support.
                                        Suspend Offline collection upon new
                                        command.
                                        Offline surface scan supported.
                                        Self-test supported.
                                        No Conveyance Self-test supported.
                                        Selective Self-test supported.
SMART capabilities:            (0x0003) Saves SMART data before entering
                                        power-saving mode.
                                        Supports SMART auto save timer.
Error logging capability:        (0x01) Error logging supported.
                                        General Purpose Logging supported.
Short self-test routine
recommended polling time:        (   2) minutes.
Extended self-test routine
recommended polling time:        ( 151) minutes.

SMART Attributes Data Structure revision number: 16
Vendor Specific SMART Attributes with Thresholds:
ID# ATTRIBUTE_NAME          FLAG     VALUE WORST THRESH TYPE      UPDATED  WHEN_FAILED RAW_VALUE
  1 Raw_Read_Error_Rate     0x000f   100   100   051    Pre-fail  Always       -       3
  3 Spin_Up_Time            0x0007   100   100   015    Pre-fail  Always       -       7104
  4 Start_Stop_Count        0x0032   100   100   000    Old_age   Always       -       7
  5 Reallocated_Sector_Ct   0x0033   253   253   010    Pre-fail  Always       -       0
  7 Seek_Error_Rate         0x000f   253   253   051    Pre-fail  Always       -       0
  8 Seek_Time_Performance   0x0025   253   253   015    Pre-fail  Offline      -       0
  9 Power_On_Hours          0x0032   253   253   000    Old_age   Always       -       97
 10 Spin_Retry_Count        0x0033   253   253   051    Pre-fail  Always       -       0
 11 Calibration_Retry_Count 0x0012   253   253   000    Old_age   Always       -       0
 12 Power_Cycle_Count       0x0032   100   100   000    Old_age   Always       -       7
187 Unknown_Attribute       0x0032   253   253   000    Old_age   Always       -       0
188 Unknown_Attribute       0x0032   253   253   000    Old_age   Always       -       0
190 Unknown_Attribute       0x0022   068   067   000    Old_age   Always       -       32
194 Temperature_Celsius     0x0022   142   139   000    Old_age   Always       -       32
195 Hardware_ECC_Recovered  0x001a   100   100   000    Old_age   Always       -       455228167
196 Reallocated_Event_Count 0x0032   253   253   000    Old_age   Always       -       0
197 Current_Pending_Sector  0x0012   253   253   000    Old_age   Always       -       0
198 Offline_Uncorrectable   0x0030   253   253   000    Old_age   Offline      -       0
199 UDMA_CRC_Error_Count    0x003e   200   200   000    Old_age   Always       -       0
200 Multi_Zone_Error_Rate   0x000a   100   100   000    Old_age   Always       -       0
201 Soft_Read_Error_Rate    0x000a   100   100   000    Old_age   Always       -       0
202 TA_Increase_Count       0x0032   253   253   000    Old_age   Always       -       0

SMART Error Log Version: 1
No Errors Logged

SMART Self-test log structure revision number 1
Num  Test_Description    Status                  Remaining  LifeTime(hours)  LBA_of_first_error
# 1  Short offline       Completed without error       00%        97         -

SMART Selective Self-Test Log Data Structure Revision Number (0) should be 1
SMART Selective self-test log data structure revision number 0
Warning: ATA Specification requires selective self-test log data structure revision number = 1
 SPAN  MIN_LBA  MAX_LBA  CURRENT_TEST_STATUS
    1        0        0  Not_testing
    2        0        0  Not_testing
    3        0        0  Not_testing
    4        0        0  Not_testing
    5        0        0  Not_testing
Selective self-test flags (0x0):
  After scanning selected spans, do NOT read-scan remainder of disk.
If Selective self-test is pending on power-up, resume after 0 minute delay.
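
A minimal Python sketch of the comparison suggested above: it pulls the raw
values out of a smartctl attribute table and compares Hardware_ECC_Recovered
on this disk with the value quoted for the disk on the failing port
(385707184). It assumes each attribute row sits on a single line (mail-wrapped
rows need to be re-joined first) and that the raw value is the tenth column.
The function name and the sample text are illustrative only.

```python
def parse_attributes(smartctl_text):
    """Return {attribute_id: (name, raw_value)} from a smartctl attribute table.

    Assumption: one attribute per line, ten whitespace-separated columns,
    with a plain integer in the RAW_VALUE column.
    """
    attributes = {}
    for line in smartctl_text.splitlines():
        fields = line.split()
        # Attribute rows start with a numeric ID and carry a 0x.... flag.
        if len(fields) >= 10 and fields[0].isdigit() and fields[2].startswith("0x"):
            try:
                raw = int(fields[9])
            except ValueError:
                continue  # skip rows whose raw value is not a plain integer
            attributes[int(fields[0])] = (fields[1], raw)
    return attributes


# Two rows copied from the output above, already on single lines.
SAMPLE = """\
  9 Power_On_Hours          0x0032   253   253   000    Old_age   Always       -       97
195 Hardware_ECC_Recovered  0x001a   100   100   000    Old_age   Always       -       455228167
"""

attrs = parse_attributes(SAMPLE)
ecc_this_disk = attrs[195][1]
ecc_port0_disk = 385707184  # value quoted earlier in the thread
ratio = ecc_this_disk / ecc_port0_disk
print(f"Hardware_ECC_Recovered ratio: {ratio:.2f}")
```

Keying on the attribute ID rather than the name avoids collisions between the
several rows that smartctl labels Unknown_Attribute.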

Also, the error messages changed after a while.

First they changed to this:
May  1 23:52:25 cleopatra kernel: ata5.00: limiting SATA link speed to 1.5 Gbps
May  1 23:52:25 cleopatra kernel: ata5.00: exception Emask 0x0 SAct 0x0 SErr 0x0 action 0x6 frozen
May  1 23:52:25 cleopatra kernel: ata5.00: tag 2 cmd 0xea Emask 0x4 stat 0x40 err 0x0 (timeout)
May  1 23:52:25 cleopatra kernel: ata5.15: hard resetting port
May  1 23:52:27 cleopatra kernel: ata5.15: SATA link up 3.0 Gbps (SStatus 123 SControl 300)
May  1 23:52:27 cleopatra kernel: ata5.00: hard resetting port
May  1 23:52:28 cleopatra kernel: ata5.00: SATA link up 1.5 Gbps (SStatus 113 SControl 310)
May  1 23:52:28 cleopatra kernel: ata5.01: hard resetting port
May  1 23:52:28 cleopatra kernel: ata5.01: SATA link up 3.0 Gbps (SStatus 123 SControl 300)
May  1 23:52:29 cleopatra kernel: ata5.02: hard resetting port
May  1 23:52:29 cleopatra kernel: ata5.02: SATA link up 1.5 Gbps (SStatus 113 SControl 300)
May  1 23:52:29 cleopatra kernel: ata5.03: hard resetting port
May  1 23:52:30 cleopatra kernel: ata5.03: SATA link up 1.5 Gbps (SStatus 113 SControl 300)
May  1 23:52:30 cleopatra kernel: ata5.04: hard resetting port
May  1 23:52:30 cleopatra kernel: ata5.04: SATA link up 1.5 Gbps (SStatus 113 SControl 300)
May  1 23:52:30 cleopatra kernel: ata5.00: configured for UDMA/100
May  1 23:52:30 cleopatra kernel: ata5.01: configured for UDMA/100
May  1 23:52:30 cleopatra kernel: ata5.02: configured for UDMA/100
May  1 23:52:30 cleopatra kernel: ata5.03: configured for UDMA/100
May  1 23:52:30 cleopatra kernel: ata5.04: configured for UDMA/100
May  1 23:52:30 cleopatra kernel: ata5: EH complete
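
The SStatus and SControl numbers in the log above are printed in hex, and as
far as I understand the SATA register layout they split into three 4-bit
fields: DET (bits 0-3), SPD (bits 4-7) and IPM (bits 8-11). A small Python
sketch, with helper names that are purely illustrative, decodes them:

```python
# Field meanings as I read them from the SATA SStatus/SControl definitions;
# only the values that occur in the log above are listed.
DET_MEANING = {
    0: "no device detected / no action requested",
    3: "device present, phy communication established",
}
SPD_MEANING = {
    0: "no speed negotiated / no speed limit",
    1: "Gen1 (1.5 Gbps)",
    2: "Gen2 (3.0 Gbps)",
}


def split_scr(hex_text):
    """Split an SStatus/SControl value, given as hex text, into its fields."""
    value = int(hex_text, 16)
    return {
        "det": value & 0xF,
        "spd": (value >> 4) & 0xF,
        "ipm": (value >> 8) & 0xF,
    }


def describe_speed(hex_text):
    """Return a readable name for the SPD field of a register value."""
    return SPD_MEANING.get(split_scr(hex_text)["spd"], "reserved")


print(split_scr("123"), describe_speed("123"))  # SStatus of ata5.15 and ata5.01
print(split_scr("113"), describe_speed("113"))  # SStatus of ata5.00, ata5.02-04
print(split_scr("310"), describe_speed("310"))  # SControl of ata5.00
```

Read this way, only ata5.00 has a speed limit written to SControl (310, SPD=1),
which matches the "limiting SATA link speed to 1.5 Gbps" message. The other
1.5 Gbps links (SControl 300) negotiated that speed without any limit set.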

and later it showed this:
May  1 23:53:36 cleopatra kernel: ata5.00: limiting speed to UDMA/66
May  1 23:53:36 cleopatra kernel: ata5.00: exception Emask 0x0 SAct 0x0 SErr 0x0 action 0x2 frozen
May  1 23:53:36 cleopatra kernel: ata5.00: tag 0 cmd 0xea Emask 0x4 stat 0x40 err 0x0 (timeout)
May  1 23:53:36 cleopatra kernel: ata5.15: hard resetting port
May  1 23:53:38 cleopatra kernel: ata5.15: SATA link up 3.0 Gbps (SStatus 123 SControl 300)
May  1 23:53:38 cleopatra kernel: ata5.00: hard resetting port
May  1 23:53:39 cleopatra kernel: ata5.00: SATA link up 1.5 Gbps (SStatus 113 SControl 310)
May  1 23:53:39 cleopatra kernel: ata5.01: hard resetting port
May  1 23:53:40 cleopatra kernel: ata5.01: SATA link up 3.0 Gbps (SStatus 123 SControl 300)
May  1 23:53:40 cleopatra kernel: ata5.02: hard resetting port
May  1 23:53:40 cleopatra kernel: ata5.02: SATA link up 1.5 Gbps (SStatus 113 SControl 300)
May  1 23:53:40 cleopatra kernel: ata5.03: hard resetting port
May  1 23:53:41 cleopatra kernel: ata5.03: SATA link up 1.5 Gbps (SStatus 113 SControl 300)
May  1 23:53:41 cleopatra kernel: ata5.04: hard resetting port
May  1 23:53:41 cleopatra kernel: ata5.04: SATA link up 1.5 Gbps (SStatus 113 SControl 300)
May  1 23:53:41 cleopatra kernel: ata5.00: configured for UDMA/66
May  1 23:53:41 cleopatra kernel: ata5.01: configured for UDMA/100
May  1 23:53:41 cleopatra kernel: ata5.02: configured for UDMA/100
May  1 23:53:41 cleopatra kernel: ata5.03: configured for UDMA/100
May  1 23:53:41 cleopatra kernel: ata5.04: configured for UDMA/100
May  1 23:53:41 cleopatra kernel: ata5: EH complete
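
To make the failing command easier to read, here is a rough Python sketch that
decodes the "tag ... cmd ... Emask ... stat ..." line. The lookup tables are
my reading of the ATA command set, the ATA status register and libata's
AC_ERR_* bits, and they cover only a handful of values. The sketch also
assumes the message is on a single line, not wrapped by the mail client.

```python
import re

# Partial tables, assumed from the ATA spec and libata's AC_ERR_* flags.
COMMANDS = {0xE7: "FLUSH CACHE", 0xEA: "FLUSH CACHE EXT"}
STATUS_BITS = {0x80: "BSY", 0x40: "DRDY", 0x20: "DF", 0x08: "DRQ", 0x01: "ERR"}
EMASK_BITS = {
    0x01: "device error",
    0x02: "HSM violation",
    0x04: "timeout",
    0x08: "media error",
    0x10: "ATA bus error",
    0x20: "host bus error",
}

PATTERN = re.compile(
    r"(ata\d+\.\d+): tag (\d+) cmd (0x[0-9a-f]+) "
    r"Emask (0x[0-9a-f]+) stat (0x[0-9a-f]+) err (0x[0-9a-f]+)"
)


def names_for(value, table):
    """List the names of all bits from `table` that are set in `value`."""
    return [name for bit, name in sorted(table.items()) if value & bit]


def decode_failed_command(line):
    """Decode one libata failed-command line, or return None if it does not match."""
    match = PATTERN.search(line)
    if match is None:
        return None
    device, tag, cmd, emask, stat, _err = match.groups()
    cmd_value = int(cmd, 16)
    return {
        "device": device,
        "tag": int(tag),
        "command": COMMANDS.get(cmd_value, hex(cmd_value)),
        "emask": names_for(int(emask, 16), EMASK_BITS),
        "status": names_for(int(stat, 16), STATUS_BITS),
    }


LINE = ("May  1 23:53:36 cleopatra kernel: ata5.00: tag 0 cmd 0xea "
        "Emask 0x4 stat 0x40 err 0x0 (timeout)")
decoded = decode_failed_command(LINE)
print(decoded)
```

If those tables are right, both incidents are a FLUSH CACHE EXT that timed out
while the drive still reported DRDY and no error bit.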

and then it keeps repeating the above roughly once an hour. So something
seems to be wrong in the interface between the drive and the computer.
I can swap the drive (it's in a RAID5 array) for another drive of the
same model if that helps in any way, but I suspect that it will show
the same result.
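
To check the "roughly once an hour" figure against the full syslog, a short
Python sketch can count the exception lines per device and hour. It assumes
the syslog layout shown above (month, day, time, host, "kernel:", device,
message). The function name and the two-line sample are illustrative.

```python
from collections import Counter


def exceptions_per_hour(log_text):
    """Count libata 'exception' lines, keyed by (device, month, day, hour)."""
    counts = Counter()
    for line in log_text.splitlines():
        fields = line.split()
        # Expected: May 1 23:52:25 cleopatra kernel: ata5.00: exception ...
        if len(fields) >= 7 and fields[4] == "kernel:" and fields[6] == "exception":
            month, day, clock = fields[0], fields[1], fields[2]
            device = fields[5].rstrip(":")
            counts[(device, month, int(day), int(clock[:2]))] += 1
    return counts


SAMPLE_LOG = """\
May  1 23:52:25 cleopatra kernel: ata5.00: exception Emask 0x0 SAct 0x0 SErr 0x0 action 0x6 frozen
May  1 23:52:25 cleopatra kernel: ata5.15: hard resetting port
May  1 23:53:36 cleopatra kernel: ata5.00: exception Emask 0x0 SAct 0x0 SErr 0x0 action 0x2 frozen
"""

hourly = exceptions_per_hour(SAMPLE_LOG)
for key, count in sorted(hourly.items()):
    print(key, count)
```

Matching on the "exception" line, not on "(timeout)", keeps the count correct
even when long lines have been wrapped, since the wrap happens after that word.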

Regards,
/LM

