linux-ide.vger.kernel.org archive mirror
 help / color / mirror / Atom feed
* SATA resets via SMART selftest
@ 2008-10-10 20:38 Scott Beardsley
  2008-10-10 21:08 ` Alan Cox
  0 siblings, 1 reply; 4+ messages in thread
From: Scott Beardsley @ 2008-10-10 20:38 UTC (permalink / raw)
  To: linux-ide

I originally posted this to smartmontools but was redirected here.

I am running into a problem with short or long smartctl selftests 
causing a disk reset. I'm using kernel.org v2.6.27 (I've also tried a 
few CentOS kernels) and smartmontools v5.38 (the latest of each). When I 
initiate a short selftest it'll run fine for a couple seconds then the 
iowait jumps up while the disk resets. I don't think this is a disk 
issue since I have 36 identical machines (and they all have this same 
reproducible behavior). I also don't think it is a power problem because 
the disks have seated correctly and the machine stays online when the 
cpus are at 100%. The disks seem to be functioning normally because I 
can read and write the whole disk. After I request a short selftest I 
get this in dmesg:

sd 0:0:0:0: [sda] 976773168 512-byte hardware sectors (500108 MB)
sd 0:0:0:0: [sda] Write Protect is off
sd 0:0:0:0: [sda] Mode Sense: 00 3a 00 00
sd 0:0:0:0: [sda] Write cache: enabled, read cache: enabled, doesn't
support DPO or FUA
ata1.00: exception Emask 0x0 SAct 0x0 SErr 0x0 action 0x6 frozen
ata1.00: cmd ca/00:40:bf:58:00/00:00:00:00:00/e0 tag 0 dma 32768 out
           res 40/00:00:01:4f:c2/00:00:00:00:00/00 Emask 0x4 (timeout)
ata1.00: status: { DRDY }
ata1: link is slow to respond, please be patient (ready=0)
ata1: device not ready (errno=-16), forcing hardreset
ata1: soft resetting link
ata1: link is slow to respond, please be patient (ready=0)
ata1.00: configured for UDMA/133
ata1: EH complete

Here is some smartctl info that might be helpful (you can see it reset 
three times):

# ./smartctl -d ata -a /dev/sda
smartctl version 5.38 [x86_64-unknown-linux-gnu] Copyright (C) 2002-8
Bruce Allen
Home page is http://smartmontools.sourceforge.net/

=== START OF INFORMATION SECTION ===
Device Model:     ST3500320NS
Serial Number:    9QM6YX2A
Firmware Version: SN05
User Capacity:    500,107,862,016 bytes
Device is:        Not in smartctl database [for details use: -P showall]
ATA Version is:   8
ATA Standard is:  ATA-8-ACS revision 4
Local Time is:    Fri Oct 10 12:33:59 2008 CDT
SMART support is: Available - device has SMART capability.
SMART support is: Enabled

=== START OF READ SMART DATA SECTION ===
SMART overall-health self-assessment test result: PASSED

General SMART Values:
Offline data collection status:  (0x82) Offline data collection activity
                                          was completed without error.
                                          Auto Offline Data Collection:
Enabled.
Self-test execution status:      (  41) The self-test routine was
interrupted
                                          by the host with a hard or soft
reset.
Total time to complete Offline
data collection:                 ( 634) seconds.
Offline data collection
capabilities:                    (0x7b) SMART execute Offline immediate.
                                          Auto Offline data collection
on/off support.
                                          Suspend Offline collection 
upon new
                                          command.
                                          Offline surface scan supported.
                                          Self-test supported.
                                          Conveyance Self-test supported.
                                          Selective Self-test supported.
SMART capabilities:            (0x0003) Saves SMART data before entering
                                          power-saving mode.
                                          Supports SMART auto save timer.
Error logging capability:        (0x01) Error logging supported.
                                          General Purpose Logging supported.
Short self-test routine
recommended polling time:        (   1) minutes.
Extended self-test routine
recommended polling time:        ( 114) minutes.
Conveyance self-test routine
recommended polling time:        (   2) minutes.
SCT capabilities:              (0x003d) SCT Status supported.
                                          SCT Feature Control supported.
                                          SCT Data Table supported.

SMART Attributes Data Structure revision number: 10
Vendor Specific SMART Attributes with Thresholds:
ID# ATTRIBUTE_NAME          FLAG     VALUE WORST THRESH TYPE
UPDATED  WHEN_FAILED RAW_VALUE
    1 Raw_Read_Error_Rate     0x000f   072   069   044    Pre-fail
Always       -       20837137
    3 Spin_Up_Time            0x0003   099   099   000    Pre-fail
Always       -       0
    4 Start_Stop_Count        0x0032   100   100   020    Old_age
Always       -       8
    5 Reallocated_Sector_Ct   0x0033   100   100   036    Pre-fail
Always       -       0
    7 Seek_Error_Rate         0x000f   100   253   030    Pre-fail
Always       -       644433
    9 Power_On_Hours          0x0032   100   100   000    Old_age
Always       -       212
   10 Spin_Retry_Count        0x0013   100   100   097    Pre-fail
Always       -       0
   12 Power_Cycle_Count       0x0032   100   037   020    Old_age
Always       -       8
184 Unknown_Attribute       0x0032   100   100   099    Old_age   Always
        -       0
187 Reported_Uncorrect      0x0032   100   100   000    Old_age   Always
        -       0
188 Unknown_Attribute       0x0032   100   100   000    Old_age   Always
        -       0
189 High_Fly_Writes         0x003a   100   100   000    Old_age   Always
        -       0
190 Airflow_Temperature_Cel 0x0022   072   070   045    Old_age   Always
        -       28 (Lifetime Min/Max 28/28)
194 Temperature_Celsius     0x0022   028   040   000    Old_age   Always
        -       28 (0 25 0 0)
195 Hardware_ECC_Recovered  0x001a   035   032   000    Old_age   Always
        -       20837137
197 Current_Pending_Sector  0x0012   100   100   000    Old_age   Always
        -       0
198 Offline_Uncorrectable   0x0010   100   100   000    Old_age
Offline      -       0
199 UDMA_CRC_Error_Count    0x003e   200   200   000    Old_age   Always
        -       0

SMART Error Log Version: 1
No Errors Logged

SMART Self-test log structure revision number 1
Num  Test_Description    Status                  Remaining
LifeTime(hours)  LBA_of_first_error
# 1  Short offline       Interrupted (host reset)      00%       212
       -
# 2  Short offline       Interrupted (host reset)      00%       211
       -
# 3  Extended offline    Interrupted (host reset)      00%       166
       -

SMART Selective self-test log data structure revision number 1
   SPAN  MIN_LBA  MAX_LBA  CURRENT_TEST_STATUS
      1        0        0  Not_testing
      2        0        0  Not_testing
      3        0        0  Not_testing
      4        0        0  Not_testing
      5        0        0  Not_testing
Selective self-test flags (0x0):
    After scanning selected spans, do NOT read-scan remainder of disk.
If Selective self-test is pending on power-up, resume after 0 minute delay.

Any ideas?
Scott


^ permalink raw reply	[flat|nested] 4+ messages in thread

* Re: SATA resets via SMART selftest
  2008-10-10 20:38 SATA resets via SMART selftest Scott Beardsley
@ 2008-10-10 21:08 ` Alan Cox
  2008-10-13 17:18   ` Scott Beardsley
  0 siblings, 1 reply; 4+ messages in thread
From: Alan Cox @ 2008-10-10 21:08 UTC (permalink / raw)
  To: Scott Beardsley; +Cc: linux-ide

On Fri, 10 Oct 2008 13:38:15 -0700
Scott Beardsley <scott@cse.ucdavis.edu> wrote:

> I originally posted this to smartmontools but was redirected here.
> 
> I am running into a problem with short or long smartctl selftests 
> causing a disk reset. I'm using kernel.org v2.6.27 (I've also tried a 
> few CentOS kernels) and smartmontools v5.38 (the latest of each). When I 
> initiate a short selftest it'll run fine for a couple seconds then the 
> iowait jumps up while the disk resets. I don't think this is a disk 
> issue since I have 36 identical machines (and they all have this same 
> reproducible behavior). I also don't think it is a power problem because 
> the disks have seated correctly and the machine stays online when the 
> cpus are at 100%. The disks seem to be functioning normally because I 
> can read and write the whole disk. After I request a short selftest I 
> get this in dmesg:

The timeout set on the command expired so the kernel aborted the command
and recovered the link.

Dump the actual SG_IO command block issued and see what timeout and other
options are set.

^ permalink raw reply	[flat|nested] 4+ messages in thread

* Re: SATA resets via SMART selftest
  2008-10-10 21:08 ` Alan Cox
@ 2008-10-13 17:18   ` Scott Beardsley
  2008-10-14  0:19     ` Scott Beardsley
  0 siblings, 1 reply; 4+ messages in thread
From: Scott Beardsley @ 2008-10-13 17:18 UTC (permalink / raw)
  To: linux-ide

Alan Cox wrote:
> The timeout set on the command expired so the kernel aborted the command
> and recovered the link.

That makes sense. AFAICT the smartctl ioctl commands are completing 
successfully (with a timeout of 6 secs). I think the timeout might be 
with any subsequent ioctl commands but it seems to be triggered when 
smartctl kicks off a test.

For some strange reason I can't get scsi_logging to work correctly 
(likely my fault). Is there a way to log all ioctl's? I compiled the 
kernel with CONFIG_SCSI_LOGGING=y. I tried playing with the 
scsi_logging=all kernel option but no dice. Also I get a write error 
when I try to enable logging on the fly:

# echo "scsi log timeout 7" > /proc/scsi/scsi
-bash: echo: write error: Invalid argument
#

Read on for the poorman's logger (strace).

> Dump the actual SG_IO command block issued and see what timeout and other
> options are set.

I'm using strace and this code (I only changed the timeout)[1] that just 
runs an inquiry with a timeout of 6 secs. I shut down all other active 
daemons and managed to catch an ioctl in the middle of the reset (after 
a few tries). First I ran the smartctl command (v5.36 this time):

# strace -rt -e ioctl smartctl -d ata -t short -r ioctl,10 /dev/sda
smartctl version 5.36 [x86_64-redhat-linux-gnu] Copyright (C) 2002-6 
Bruce Allen
Home page is http://smartmontools.sourceforge.net/


REPORT-IOCTL: DeviceFD=3 Command=IDENTIFY DEVICE
      0.000000 ioctl(3, 0x30d, 0x7fffffe0f930) = 0
      0.000122 ioctl(3, 0x31f, 0x7fffffe0fb30) = 0
REPORT-IOCTL: DeviceFD=3 Command=IDENTIFY DEVICE returned 0

===== [IDENTIFY DEVICE] DATA START (BASE-16) =====
000-015: 5a 0c ff 3f 37 c8 10 00 00 00 00 00 3f 00 00 00
016-031: 00 00 00 00 20 20 20 20 20 20 20 20 20 20 20 20
032-047: 51 39 36 4d 58 59 41 32 00 00 00 00 04 00 4e 53
048-063: 35 30 20 20 20 20 54 53 35 33 30 30 32 33 4e 30
064-079: 20 53 20 20 20 20 20 20 20 20 20 20 20 20 20 20
080-095: 20 20 20 20 20 20 20 20 20 20 20 20 20 20 10 80
096-111: 00 00 00 2f 00 40 00 02 00 02 07 00 ff 3f 10 00
112-127: 3f 00 10 fc fb 00 10 00 ff ff ff 0f 00 00 07 00
128-143: 03 00 78 00 78 00 78 00 78 00 00 00 00 00 00 00
144-159: 00 00 00 00 00 00 1f 00 02 05 00 00 40 00 40 00
160-175: f0 01 29 00 6b 34 01 7d 23 41 69 34 01 bc 23 41
176-191: 7f 40 31 00 31 00 00 00 fe ff 00 00 00 fe 00 00
192-207: 00 00 00 00 00 00 00 00 30 60 38 3a 00 00 00 00
208-223: 00 00 00 00 00 00 00 00 00 50 00 c5 d2 0d 57 1f
224-239: 00 00 00 00 00 00 00 00 00 00 00 00 00 00 1e 40
240-255: 1e 40 00 00 00 00 00 00 00 00 00 00 00 00 00 00
256-271: 21 00 30 60 38 3a 30 60 38 3a 00 00 02 00 40 01
272-287: 00 01 00 50 06 3c 0a 3c 00 00 3c 00 00 00 08 00
288-303: 00 00 00 00 0f 00 80 02 00 00 00 00 08 00 00 00
304-319: 00 00 00 00 00 00 00 00 00 00 00 00 00 27 00 80
320-335: 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00
336-351: 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00
352-367: 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00
368-383: 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00
384-399: 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00
400-415: 00 00 00 00 00 00 00 00 00 00 00 00 3d 00 00 00
416-431: 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00
432-447: 00 00 20 1c 00 00 00 00 02 00 00 00 10 10 00 00
448-463: 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00
464-479: 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00
480-495: 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00
496-511: 00 00 00 00 00 00 00 00 00 00 00 00 00 00 a5 a2
===== [IDENTIFY DEVICE] DATA END (512 Bytes) =====


REPORT-IOCTL: DeviceFD=3 Command=SMART STATUS
      0.014834 ioctl(3, 0x31f, 0x7fffffe0fb40) = 0
REPORT-IOCTL: DeviceFD=3 Command=SMART STATUS returned 0

REPORT-IOCTL: DeviceFD=3 Command=SMART STATUS CHECK
      0.044981 ioctl(3, 0x31e, 0x7fffffe0fb50) = 0
REPORT-IOCTL: DeviceFD=3 Command=SMART STATUS CHECK returned 0

REPORT-IOCTL: DeviceFD=3 Command=SMART READ ATTRIBUTE VALUES
      0.041645 ioctl(3, 0x31f, 0x7fffffe0fb30) = 0
REPORT-IOCTL: DeviceFD=3 Command=SMART READ ATTRIBUTE VALUES returned 0

===== [SMART READ ATTRIBUTE VALUES] DATA START (BASE-16) =====
000-015: 0a 00 01 0f 00 4b 45 a8 ab d5 01 00 00 00 03 03
016-031: 00 63 63 00 00 00 00 00 00 00 04 32 00 64 64 14
032-047: 00 00 00 00 00 00 05 33 00 64 64 00 00 00 00 00
048-063: 00 00 07 0f 00 64 fd 1a 0b 0d 00 00 00 00 09 32
064-079: 00 64 64 1b 01 00 00 00 00 00 0a 13 00 64 64 00
080-095: 00 00 00 00 00 00 0c 32 00 64 25 14 00 00 00 00
096-111: 00 00 b8 32 00 64 64 00 00 00 00 00 00 00 bb 32
112-127: 00 64 64 00 00 00 00 00 00 00 bc 32 00 64 64 00
128-143: 00 00 00 00 00 00 bd 3a 00 64 64 00 00 00 00 00
144-159: 00 00 be 22 00 48 46 1c 00 1c 1c 00 00 00 c2 22
160-175: 00 1c 28 1c 00 00 00 19 00 00 c3 1a 00 23 20 a8
176-191: ab d5 01 00 00 00 c5 12 00 64 64 00 00 00 00 00
192-207: 00 00 c6 10 00 64 64 00 00 00 00 00 00 00 c7 3e
208-223: 00 c8 c8 00 00 00 00 00 00 00 00 00 00 00 00 00
224-239: 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00
240-255: 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00
256-271: 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00
272-287: 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00
288-303: 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00
304-319: 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00
320-335: 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00
336-351: 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00
352-367: 00 00 00 00 00 00 00 00 00 00 82 29 7a 02 00 7b
368-383: 03 00 01 00 01 72 02 00 00 00 00 00 00 00 00 00
384-399: 00 00 05 00 50 0b 00 00 04 01 01 01 01 01 01 01
400-415: 01 00 00 00 00 00 00 00 00 01 00 00 00 00 00 00
416-431: 00 00 00 00 00 00 00 00 4a bc 2c 73 ed 00 00 00
432-447: 00 00 00 00 01 00 a0 00 0f ba 7c 02 00 00 00 00
448-463: e8 9a 8b 49 0b 00 00 00 00 00 00 00 8d d4 e0 02
464-479: 00 00 00 00 00 00 00 00 9d 00 00 00 00 00 00 00
480-495: 11 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00
496-511: 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 5d
===== [SMART READ ATTRIBUTE VALUES] DATA END (512 Bytes) =====


REPORT-IOCTL: DeviceFD=3 Command=SMART READ ATTRIBUTE THRESHOLDS
      0.102167 ioctl(3, 0x31f, 0x7fffffe0fb40) = 0
REPORT-IOCTL: DeviceFD=3 Command=SMART READ ATTRIBUTE THRESHOLDS returned 0

===== [SMART READ ATTRIBUTE THRESHOLDS] DATA START (BASE-16) =====
000-015: 01 00 01 2c 00 00 00 00 00 00 00 00 00 00 03 00
016-031: 00 00 00 00 00 00 00 00 00 00 04 14 00 00 00 00
032-047: 00 00 00 00 00 00 05 24 00 00 00 00 00 00 00 00
048-063: 00 00 07 1e 00 00 00 00 00 00 00 00 00 00 09 00
064-079: 00 00 00 00 00 00 00 00 00 00 0a 61 00 00 00 00
080-095: 00 00 00 00 00 00 0c 14 00 00 00 00 00 00 00 00
096-111: 00 00 b8 63 00 00 00 00 00 00 00 00 00 00 bb 00
112-127: 00 00 00 00 00 00 00 00 00 00 bc 00 00 00 00 00
128-143: 00 00 00 00 00 00 bd 00 00 00 00 00 00 00 00 00
144-159: 00 00 be 2d 00 00 00 00 00 00 00 00 00 00 c2 00
160-175: 00 00 00 00 00 00 00 00 00 00 c3 00 00 00 00 00
176-191: 00 00 00 00 00 00 c5 00 00 00 00 00 00 00 00 00
192-207: 00 00 c6 00 00 00 00 00 00 00 00 00 00 00 c7 00
208-223: 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00
224-239: 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00
240-255: 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00
256-271: 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00
272-287: 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00
288-303: 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00
304-319: 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00
320-335: 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00
336-351: 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00
352-367: 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00
368-383: 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00
384-399: 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00
400-415: 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00
416-431: 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00
432-447: 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00
448-463: 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00
464-479: 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00
480-495: 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00
496-511: 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 c4
===== [SMART READ ATTRIBUTE THRESHOLDS] DATA END (512 Bytes) =====

=== START OF OFFLINE IMMEDIATE AND SELF-TEST SECTION ===
Sending command: "Execute SMART Short self-test routine immediately in 
off-line mode".

REPORT-IOCTL: DeviceFD=3 Command=SMART IMMEDIATE OFFLINE InputParameter=1
      0.015623 ioctl(3, 0x31f, 0x7fffffe0f9f0) = 0
REPORT-IOCTL: DeviceFD=3 Command=SMART IMMEDIATE OFFLINE returned 0
Drive command "Execute SMART Short self-test routine immediately in 
off-line mode" successful.
Testing has begun.
Please wait 1 minutes for test to complete.
Test will complete after Mon Oct 13 11:51:41 2008

Use smartctl -X to abort test.
#


Then I ran the inquiry binary until it hung in the middle of an ioctl:


# strace -rt -e ioctl /root/test /dev/sda
      0.000000 ioctl(3, SG_GET_VERSION_NUM, 0x7fff65e00540) = 0
timeout: 6 secs: Success
      0.000299 ioctl(3, SG_IO, {'S', SG_DXFER_FROM_DEV, cmd[6]=[12, 00, 
00, 00, 60, 00], mx_sb_len=32, iovec_count=0, dxfer_len=96, 
timeout=6000, flags=0

... disk resets here ...

, data[96]=["\0\0\5\2[\0\0\0ATA     ST3500320NS     "...], status=00, 
masked_status=00, sb[0]=[], host_status=0, driver_status=0, resid=0, 
duration=77887, info=0}) = 0
Some of the INQUIRY command's response:
     ATA       ST3500320NS       SN05
INQUIRY duration=77887 millisecs, resid=0
#


You can see it hangs for around 78 secs.
Scott
---------------
[1] http://www.faqs.org/docs/Linux-HOWTO/SCSI-Generic-HOWTO.html#PEXAMPLE

^ permalink raw reply	[flat|nested] 4+ messages in thread

* Re: SATA resets via SMART selftest
  2008-10-13 17:18   ` Scott Beardsley
@ 2008-10-14  0:19     ` Scott Beardsley
  0 siblings, 0 replies; 4+ messages in thread
From: Scott Beardsley @ 2008-10-14  0:19 UTC (permalink / raw)
  To: linux-ide

Scott Beardsley wrote:
> Alan Cox wrote:
>> The timeout set on the command expired so the kernel aborted the command
>> and recovered the link.
> 
> For some strange reason I can't get scsi_logging to work correctly 
> (likely my fault).

Yep, I was looking in the wrong place (and probably at old docs). I had 
to run:

# echo 7 > /proc/sys/dev/scsi/logging_level

Below is that same sequence (SMART short test, then INQUIRYs with 7.5sec 
timeouts this time). The SMART cmd was issued at 1083, the first INQUIRY 
(of a series of 10) is at 1084, the disk finishes resetting at 1168.

Perhaps this is a problem with Seagate's implementation of SMART on this 
disk? hdparm -I states that it does, in fact, support self-tests. The 
timeouts don't seem unusually short (10+sec). I'm stumped why it would 
be timing out.

Scott
---------------------

[ 1064.969694] scsi_add_timer: scmd: ffff88021d4f9800, time: 60000, 
(ffffffffa006f521)
[ 1064.970262] scsi_delete_timer: scmd: ffff88021d4f9800, rtn: 1
[ 1064.970270] scsi_add_timer: scmd: ffff88021d4f91c0, time: 60000, 
(ffffffffa006f521)
[ 1064.970769] scsi_delete_timer: scmd: ffff88021d4f91c0, rtn: 1
[ 1083.788393] scsi_add_timer: scmd: ffff88021d4f91c0, time: 60000, 
(ffffffffa006f521)
[ 1083.797591] scsi_delete_timer: scmd: ffff88021d4f91c0, rtn: 1
[ 1083.797641] scsi_add_timer: scmd: ffff88021d91be80, time: 60000, 
(ffffffffa006f521)
[ 1083.808215] scsi_delete_timer: scmd: ffff88021d91be80, rtn: 1
[ 1083.808358] scsi_add_timer: scmd: ffff88021d91bd40, time: 60000, 
(ffffffffa006f521)
[ 1083.808936] scsi_delete_timer: scmd: ffff88021d91bd40, rtn: 1
[ 1083.808966] scsi_add_timer: scmd: ffff88021d91bc00, time: 60000, 
(ffffffffa006f521)
[ 1083.811553] scsi_delete_timer: scmd: ffff88021d91bc00, rtn: 1
[ 1083.812533] scsi_add_timer: scmd: ffff88021d91bac0, time: 60000, 
(ffffffffa006f521)
[ 1083.812668] scsi_delete_timer: scmd: ffff88021d91bac0, rtn: 1
[ 1083.812678] scsi_add_timer: scmd: ffff88021d91b980, time: 60000, 
(ffffffffa006f521)
[ 1083.812981] scsi_delete_timer: scmd: ffff88021d91b980, rtn: 1
[ 1083.812992] scsi_add_timer: scmd: ffff88021d91bac0, time: 60000, 
(ffffffffa006f521)
[ 1083.813789] scsi_delete_timer: scmd: ffff88021d91bac0, rtn: 1
[ 1083.813921] scsi_block_when_processing_errors: rtn: 1
[ 1083.813940] scsi_add_timer: scmd: ffff88021d91b840, time: 60000, 
(ffffffffa006f521)
[ 1083.814089] scsi_delete_timer: scmd: ffff88021d91b840, rtn: 1
[ 1083.814126] scsi_block_when_processing_errors: rtn: 1
[ 1083.814132] scsi_block_when_processing_errors: rtn: 1
[ 1083.814136] scsi_block_when_processing_errors: rtn: 1
[ 1083.814142] scsi_block_when_processing_errors: rtn: 1
[ 1083.814156] scsi_add_timer: scmd: ffff88021d91b700, time: 10000, 
(ffffffffa006f521)
[ 1083.816710] scsi_delete_timer: scmd: ffff88021d91b700, rtn: 1
[ 1083.822368] scsi_block_when_processing_errors: rtn: 1
[ 1083.822372] scsi_block_when_processing_errors: rtn: 1
[ 1083.822380] scsi_add_timer: scmd: ffff88021d91b5c0, time: 10000, 
(ffffffffa006f521)
[ 1083.878472] scsi_delete_timer: scmd: ffff88021d91b5c0, rtn: 1
[ 1083.878496] scsi_block_when_processing_errors: rtn: 1
[ 1083.878500] scsi_block_when_processing_errors: rtn: 1
[ 1083.878506] scsi_add_timer: scmd: ffff88021d91b480, time: 10000, 
(ffffffffa006f521)
[ 1083.920116] scsi_delete_timer: scmd: ffff88021d91b480, rtn: 1
[ 1083.920139] scsi_block_when_processing_errors: rtn: 1
[ 1083.920143] scsi_block_when_processing_errors: rtn: 1
[ 1083.920150] scsi_add_timer: scmd: ffff88021d91b340, time: 10000, 
(ffffffffa006f521)
[ 1084.012873] scsi_delete_timer: scmd: ffff88021d91b340, rtn: 1
[ 1084.012995] scsi_block_when_processing_errors: rtn: 1
[ 1084.012999] scsi_block_when_processing_errors: rtn: 1
[ 1084.013006] scsi_add_timer: scmd: ffff88021d91b200, time: 10000, 
(ffffffffa006f521)
[ 1084.020169] scsi_delete_timer: scmd: ffff88021d91b200, rtn: 1
[ 1084.020207] scsi_block_when_processing_errors: rtn: 1
[ 1084.020211] scsi_block_when_processing_errors: rtn: 1
[ 1084.020216] scsi_add_timer: scmd: ffff88021d91b0c0, time: 10000, 
(ffffffffa006f521)
[ 1084.021323] scsi_delete_timer: scmd: ffff88021d91b0c0, rtn: 1
[ 1084.023222] scsi_block_when_processing_errors: rtn: 1
[ 1084.023318] scsi_block_when_processing_errors: rtn: 1
[ 1084.023322] scsi_block_when_processing_errors: rtn: 1
[ 1084.023731] scsi_block_when_processing_errors: rtn: 1
[ 1084.023735] scsi_block_when_processing_errors: rtn: 1
[ 1084.023754] scsi_add_timer: scmd: ffff88021d474e40, time: 7500, 
(ffffffffa006f521)
[ 1084.023765] scsi_delete_timer: scmd: ffff88021d474e40, rtn: 1
[ 1084.788506] scsi_block_when_processing_errors: rtn: 1
[ 1084.788568] scsi_block_when_processing_errors: rtn: 1
[ 1084.788571] scsi_block_when_processing_errors: rtn: 1
[ 1084.788856] scsi_block_when_processing_errors: rtn: 1
[ 1084.788859] scsi_block_when_processing_errors: rtn: 1
[ 1084.788872] scsi_add_timer: scmd: ffff88021bd48e80, time: 7500, 
(ffffffffa006f521)
[ 1084.788877] scsi_delete_timer: scmd: ffff88021bd48e80, rtn: 1
[ 1085.716417] scsi_block_when_processing_errors: rtn: 1
[ 1085.716488] scsi_block_when_processing_errors: rtn: 1
[ 1085.716490] scsi_block_when_processing_errors: rtn: 1
[ 1085.716790] scsi_block_when_processing_errors: rtn: 1
[ 1085.716793] scsi_block_when_processing_errors: rtn: 1
[ 1085.716809] scsi_add_timer: scmd: ffff88021d91b980, time: 7500, 
(ffffffffa006f521)
[ 1085.716816] scsi_delete_timer: scmd: ffff88021d91b980, rtn: 1
[ 1086.596181] scsi_block_when_processing_errors: rtn: 1
[ 1086.596277] scsi_block_when_processing_errors: rtn: 1
[ 1086.596279] scsi_block_when_processing_errors: rtn: 1
[ 1086.596631] scsi_block_when_processing_errors: rtn: 1
[ 1086.596634] scsi_block_when_processing_errors: rtn: 1
[ 1086.596697] scsi_add_timer: scmd: ffff88021d91b0c0, time: 7500, 
(ffffffffa006f521)
[ 1086.596703] scsi_delete_timer: scmd: ffff88021d91b0c0, rtn: 1
[ 1087.428274] scsi_block_when_processing_errors: rtn: 1
[ 1087.428369] scsi_block_when_processing_errors: rtn: 1
[ 1087.428372] scsi_block_when_processing_errors: rtn: 1
[ 1087.428724] scsi_block_when_processing_errors: rtn: 1
[ 1087.428727] scsi_block_when_processing_errors: rtn: 1
[ 1087.428788] scsi_add_timer: scmd: ffff88021d91b0c0, time: 7500, 
(ffffffffa006f521)
[ 1087.428794] scsi_delete_timer: scmd: ffff88021d91b0c0, rtn: 1
[ 1088.100442] scsi_block_when_processing_errors: rtn: 1
[ 1088.100509] scsi_block_when_processing_errors: rtn: 1
[ 1088.100512] scsi_block_when_processing_errors: rtn: 1
[ 1088.100801] scsi_block_when_processing_errors: rtn: 1
[ 1088.100804] scsi_block_when_processing_errors: rtn: 1
[ 1088.100820] scsi_add_timer: scmd: ffff88021bd48e80, time: 7500, 
(ffffffffa006f521)
[ 1088.100826] scsi_delete_timer: scmd: ffff88021bd48e80, rtn: 1
[ 1088.580211] scsi_block_when_processing_errors: rtn: 1
[ 1088.580304] scsi_block_when_processing_errors: rtn: 1
[ 1088.580308] scsi_block_when_processing_errors: rtn: 1
[ 1088.580710] scsi_block_when_processing_errors: rtn: 1
[ 1088.580714] scsi_block_when_processing_errors: rtn: 1
[ 1088.580727] scsi_add_timer: scmd: ffff88021d474e40, time: 7500, 
(ffffffffa006f521)
[ 1088.580731] scsi_delete_timer: scmd: ffff88021d474e40, rtn: 1
[ 1089.011690] scsi_block_when_processing_errors: rtn: 1
[ 1089.011754] scsi_block_when_processing_errors: rtn: 1
[ 1089.011757] scsi_block_when_processing_errors: rtn: 1
[ 1089.012053] scsi_block_when_processing_errors: rtn: 1
[ 1089.012056] scsi_block_when_processing_errors: rtn: 1
[ 1089.012069] scsi_add_timer: scmd: ffff88021d91b980, time: 7500, 
(ffffffffa006f521)
[ 1089.012075] scsi_delete_timer: scmd: ffff88021d91b980, rtn: 1
[ 1089.507506] scsi_add_timer: scmd: ffff88021d474e40, time: 60000, 
(ffffffffa006f521)
[ 1149.503878] Waking error handler thread
[ 1149.503883] Error handler scsi_eh_0 waking up
[ 1149.503903] ata1.00: exception Emask 0x0 SAct 0x0 SErr 0x0 action 0x6 
frozen
[ 1149.503909] ata1.00: cmd ca/00:70:ff:5e:00/00:00:00:00:00/e0 tag 0 
dma 57344 out
[ 1149.503911]          res 40/00:00:01:4f:c2/00:00:00:00:00/00 Emask 
0x4 (timeout)
[ 1149.503919] ata1.00: status: { DRDY }
[ 1154.552602] ata1: link is slow to respond, please be patient (ready=0)
[ 1159.550330] ata1: device not ready (errno=-16), forcing hardreset
[ 1159.550336] ata1: soft resetting link
[ 1164.750045] ata1: link is slow to respond, please be patient (ready=0)
[ 1168.108894] ata1.00: configured for UDMA/133
[ 1168.108905] scsi_eh_0: flush retry cmd: ffff88021d474e40
[ 1168.108909] ata1: EH complete
[ 1168.108913] scsi_restart_operations: waking up host to restart
[ 1168.108917] scsi_add_timer: scmd: ffff88021d474e40, time: 60000, 
(ffffffffa006f521)
[ 1168.109002] Error handler scsi_eh_0 sleeping
[ 1168.129442] scsi_delete_timer: scmd: ffff88021d474e40, rtn: 1
[ 1168.129460] scsi_add_timer: scmd: ffff88021d4f9440, time: 30000, 
(ffffffffa006f521)
[ 1168.129464] scsi_delete_timer: scmd: ffff88021d4f9440, rtn: 1
[ 1168.129469] scsi_add_timer: scmd: ffff88021d4f91c0, time: 60000, 
(ffffffffa006f521)
[ 1168.129973] scsi_delete_timer: scmd: ffff88021d4f91c0, rtn: 1
[ 1168.129987] scsi_add_timer: scmd: ffff88021d4f9440, time: 30000, 
(ffffffffa006f521)
[ 1168.129991] scsi_delete_timer: scmd: ffff88021d4f9440, rtn: 1
[ 1168.130003] scsi_add_timer: scmd: ffff88021d4f91c0, time: 60000, 
(ffffffffa006f521)
[ 1168.130007] sd 0:0:0:0: [sda] 976773168 512-byte hardware sectors 
(500108 MB)
[ 1168.130384] scsi_delete_timer: scmd: ffff88021d4f91c0, rtn: 1
[ 1168.130393] scsi_add_timer: scmd: ffff88021d474e40, time: 30000, 
(ffffffffa006f521)
[ 1168.130397] scsi_delete_timer: scmd: ffff88021d474e40, rtn: 1
[ 1168.130405] scsi_add_timer: scmd: ffff88021d4f91c0, time: 60000, 
(ffffffffa006f521)
[ 1168.130458] sd 0:0:0:0: [sda] Write Protect is off
[ 1168.130460] sd 0:0:0:0: [sda] Mode Sense: 00 3a 00 00
[ 1168.130928] scsi_delete_timer: scmd: ffff88021d4f91c0, rtn: 1
[ 1168.130938] scsi_add_timer: scmd: ffff88021d474e40, time: 30000, 
(ffffffffa006f521)
[ 1168.130942] scsi_delete_timer: scmd: ffff88021d474e40, rtn: 1
[ 1168.130951] scsi_add_timer: scmd: ffff88021d4f91c0, time: 60000, 
(ffffffffa006f521)
[ 1168.131462] scsi_delete_timer: scmd: ffff88021d4f91c0, rtn: 1
[ 1168.131471] scsi_add_timer: scmd: ffff88021d4f9940, time: 30000, 
(ffffffffa006f521)
[ 1168.131475] scsi_delete_timer: scmd: ffff88021d4f9940, rtn: 1
[ 1168.131482] scsi_add_timer: scmd: ffff88021d4f91c0, time: 60000, 
(ffffffffa006f521)
[ 1168.131532] sd 0:0:0:0: [sda] Write cache: enabled, read cache: 
enabled, doesn't support DPO or FUA
[ 1168.131899] scsi_delete_timer: scmd: ffff88021d4f91c0, rtn: 1
[ 1168.131903] scsi_block_when_processing_errors: rtn: 1
[ 1168.131907] scsi_add_timer: scmd: ffff88021d4f9940, time: 60000, 
(ffffffffa006f521)
[ 1168.131969] scsi_block_when_processing_errors: rtn: 1
[ 1168.131971] scsi_block_when_processing_errors: rtn: 1
[ 1168.132259] scsi_block_when_processing_errors: rtn: 1
[ 1168.132260] scsi_block_when_processing_errors: rtn: 1
[ 1168.132343] scsi_delete_timer: scmd: ffff88021d4f9940, rtn: 1
[ 1168.132352] scsi_add_timer: scmd: ffff88021d4f9800, time: 60000, 
(ffffffffa006f521)
[ 1168.132854] scsi_delete_timer: scmd: ffff88021d4f9800, rtn: 1
[ 1168.132862] scsi_add_timer: scmd: ffff88021d4f9940, time: 7500, 
(ffffffffa006f521)
[ 1168.132865] scsi_delete_timer: scmd: ffff88021d4f9940, rtn: 1
[ 1168.132889] scsi_add_timer: scmd: ffff88021d4f9940, time: 60000, 
(ffffffffa006f521)
[ 1168.133297] scsi_delete_timer: scmd: ffff88021d4f9940, rtn: 1
[ 1168.133315] scsi_add_timer: scmd: ffff88021d4f9940, time: 60000, 
(ffffffffa006f521)
[ 1168.133737] scsi_delete_timer: scmd: ffff88021d4f9940, rtn: 1
[ 1168.133798] scsi_add_timer: scmd: ffff88021d4f9940, time: 60000, 
(ffffffffa006f521)
[ 1168.134198] scsi_delete_timer: scmd: ffff88021d4f9940, rtn: 1
[ 1168.134220] scsi_add_timer: scmd: ffff88021d4f9940, time: 60000, 
(ffffffffa006f521)
[ 1168.134740] scsi_delete_timer: scmd: ffff88021d4f9940, rtn: 1
[ 1168.134762] scsi_add_timer: scmd: ffff88021d4f9800, time: 60000, 
(ffffffffa006f521)
[ 1168.135126] scsi_block_when_processing_errors: rtn: 1
[ 1168.135182] scsi_delete_timer: scmd: ffff88021d4f9800, rtn: 1
[ 1168.135206] scsi_block_when_processing_errors: rtn: 1
[ 1168.135208] scsi_block_when_processing_errors: rtn: 1
[ 1168.135504] scsi_block_when_processing_errors: rtn: 1
[ 1168.135505] scsi_block_when_processing_errors: rtn: 1
[ 1168.135518] scsi_add_timer: scmd: ffff88021d4f9800, time: 7500, 
(ffffffffa006f521)
[ 1168.135522] scsi_delete_timer: scmd: ffff88021d4f9800, rtn: 1
[ 1168.135807] scsi_add_timer: scmd: ffff88021d4f9800, time: 60000, 
(ffffffffa006f521)
[ 1168.136165] scsi_delete_timer: scmd: ffff88021d4f9800, rtn: 1

^ permalink raw reply	[flat|nested] 4+ messages in thread

end of thread, other threads:[~2008-10-14  0:19 UTC | newest]

Thread overview: 4+ messages (download: mbox.gz follow: Atom feed
-- links below jump to the message on this page --
2008-10-10 20:38 SATA resets via SMART selftest Scott Beardsley
2008-10-10 21:08 ` Alan Cox
2008-10-13 17:18   ` Scott Beardsley
2008-10-14  0:19     ` Scott Beardsley

This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).