* SATA resets via SMART selftest
@ 2008-10-10 20:38 Scott Beardsley
2008-10-10 21:08 ` Alan Cox
0 siblings, 1 reply; 4+ messages in thread
From: Scott Beardsley @ 2008-10-10 20:38 UTC (permalink / raw)
To: linux-ide
I originally posted this to smartmontools but was redirected here.
I am running into a problem with short or long smartctl selftests
causing a disk reset. I'm using kernel.org v2.6.27 (I've also tried a
few CentOS kernels) and smartmontools v5.38 (the latest of each). When I
initiate a short selftest it'll run fine for a couple seconds then the
iowait jumps up while the disk resets. I don't think this is a disk
issue since I have 36 identical machines (and they all have this same
reproducible behavior). I also don't think it is a power problem because
the disks have seated correctly and the machine stays online when the
cpus are at 100%. The disks seem to be functioning normally because I
can read and write the whole disk. After I request a short selftest I
get this in dmesg:
sd 0:0:0:0: [sda] 976773168 512-byte hardware sectors (500108 MB)
sd 0:0:0:0: [sda] Write Protect is off
sd 0:0:0:0: [sda] Mode Sense: 00 3a 00 00
sd 0:0:0:0: [sda] Write cache: enabled, read cache: enabled, doesn't
support DPO or FUA
ata1.00: exception Emask 0x0 SAct 0x0 SErr 0x0 action 0x6 frozen
ata1.00: cmd ca/00:40:bf:58:00/00:00:00:00:00/e0 tag 0 dma 32768 out
res 40/00:00:01:4f:c2/00:00:00:00:00/00 Emask 0x4 (timeout)
ata1.00: status: { DRDY }
ata1: link is slow to respond, please be patient (ready=0)
ata1: device not ready (errno=-16), forcing hardreset
ata1: soft resetting link
ata1: link is slow to respond, please be patient (ready=0)
ata1.00: configured for UDMA/133
ata1: EH complete
Here is some smartctl info that might be helpful (you can see it reset
three times):
# ./smartctl -d ata -a /dev/sda
smartctl version 5.38 [x86_64-unknown-linux-gnu] Copyright (C) 2002-8
Bruce Allen
Home page is http://smartmontools.sourceforge.net/
=== START OF INFORMATION SECTION ===
Device Model: ST3500320NS
Serial Number: 9QM6YX2A
Firmware Version: SN05
User Capacity: 500,107,862,016 bytes
Device is: Not in smartctl database [for details use: -P showall]
ATA Version is: 8
ATA Standard is: ATA-8-ACS revision 4
Local Time is: Fri Oct 10 12:33:59 2008 CDT
SMART support is: Available - device has SMART capability.
SMART support is: Enabled
=== START OF READ SMART DATA SECTION ===
SMART overall-health self-assessment test result: PASSED
General SMART Values:
Offline data collection status: (0x82) Offline data collection activity
was completed without error.
Auto Offline Data Collection:
Enabled.
Self-test execution status: ( 41) The self-test routine was
interrupted
by the host with a hard or soft
reset.
Total time to complete Offline
data collection: ( 634) seconds.
Offline data collection
capabilities: (0x7b) SMART execute Offline immediate.
Auto Offline data collection
on/off support.
Suspend Offline collection
upon new
command.
Offline surface scan supported.
Self-test supported.
Conveyance Self-test supported.
Selective Self-test supported.
SMART capabilities: (0x0003) Saves SMART data before entering
power-saving mode.
Supports SMART auto save timer.
Error logging capability: (0x01) Error logging supported.
General Purpose Logging supported.
Short self-test routine
recommended polling time: ( 1) minutes.
Extended self-test routine
recommended polling time: ( 114) minutes.
Conveyance self-test routine
recommended polling time: ( 2) minutes.
SCT capabilities: (0x003d) SCT Status supported.
SCT Feature Control supported.
SCT Data Table supported.
SMART Attributes Data Structure revision number: 10
Vendor Specific SMART Attributes with Thresholds:
ID# ATTRIBUTE_NAME FLAG VALUE WORST THRESH TYPE
UPDATED WHEN_FAILED RAW_VALUE
1 Raw_Read_Error_Rate 0x000f 072 069 044 Pre-fail
Always - 20837137
3 Spin_Up_Time 0x0003 099 099 000 Pre-fail
Always - 0
4 Start_Stop_Count 0x0032 100 100 020 Old_age
Always - 8
5 Reallocated_Sector_Ct 0x0033 100 100 036 Pre-fail
Always - 0
7 Seek_Error_Rate 0x000f 100 253 030 Pre-fail
Always - 644433
9 Power_On_Hours 0x0032 100 100 000 Old_age
Always - 212
10 Spin_Retry_Count 0x0013 100 100 097 Pre-fail
Always - 0
12 Power_Cycle_Count 0x0032 100 037 020 Old_age
Always - 8
184 Unknown_Attribute 0x0032 100 100 099 Old_age Always
- 0
187 Reported_Uncorrect 0x0032 100 100 000 Old_age Always
- 0
188 Unknown_Attribute 0x0032 100 100 000 Old_age Always
- 0
189 High_Fly_Writes 0x003a 100 100 000 Old_age Always
- 0
190 Airflow_Temperature_Cel 0x0022 072 070 045 Old_age Always
- 28 (Lifetime Min/Max 28/28)
194 Temperature_Celsius 0x0022 028 040 000 Old_age Always
- 28 (0 25 0 0)
195 Hardware_ECC_Recovered 0x001a 035 032 000 Old_age Always
- 20837137
197 Current_Pending_Sector 0x0012 100 100 000 Old_age Always
- 0
198 Offline_Uncorrectable 0x0010 100 100 000 Old_age
Offline - 0
199 UDMA_CRC_Error_Count 0x003e 200 200 000 Old_age Always
- 0
SMART Error Log Version: 1
No Errors Logged
SMART Self-test log structure revision number 1
Num Test_Description Status Remaining
LifeTime(hours) LBA_of_first_error
# 1 Short offline Interrupted (host reset) 00% 212
-
# 2 Short offline Interrupted (host reset) 00% 211
-
# 3 Extended offline Interrupted (host reset) 00% 166
-
SMART Selective self-test log data structure revision number 1
SPAN MIN_LBA MAX_LBA CURRENT_TEST_STATUS
1 0 0 Not_testing
2 0 0 Not_testing
3 0 0 Not_testing
4 0 0 Not_testing
5 0 0 Not_testing
Selective self-test flags (0x0):
After scanning selected spans, do NOT read-scan remainder of disk.
If Selective self-test is pending on power-up, resume after 0 minute delay.
Any ideas?
Scott
^ permalink raw reply [flat|nested] 4+ messages in thread
* Re: SATA resets via SMART selftest
2008-10-10 20:38 SATA resets via SMART selftest Scott Beardsley
@ 2008-10-10 21:08 ` Alan Cox
2008-10-13 17:18 ` Scott Beardsley
0 siblings, 1 reply; 4+ messages in thread
From: Alan Cox @ 2008-10-10 21:08 UTC (permalink / raw)
To: Scott Beardsley; +Cc: linux-ide
On Fri, 10 Oct 2008 13:38:15 -0700
Scott Beardsley <scott@cse.ucdavis.edu> wrote:
> I originally posted this to smartmontools but was redirected here.
>
> I am running into a problem with short or long smartctl selftests
> causing a disk reset. I'm using kernel.org v2.6.27 (I've also tried a
> few CentOS kernels) and smartmontools v5.38 (the latest of each). When I
> initiate a short selftest it'll run fine for a couple seconds then the
> iowait jumps up while the disk resets. I don't think this is a disk
> issue since I have 36 identical machines (and they all have this same
> reproducible behavior). I also don't think it is a power problem because
> the disks have seated correctly and the machine stays online when the
> cpus are at 100%. The disks seem to be functioning normally because I
> can read and write the whole disk. After I request a short selftest I
> get this in dmesg:
The timeout set on the command expired so the kernel aborted the command
and recovered the link.
Dump the actual SG_IO command block issued and see what timeout and other
options are set.
^ permalink raw reply [flat|nested] 4+ messages in thread
* Re: SATA resets via SMART selftest
2008-10-10 21:08 ` Alan Cox
@ 2008-10-13 17:18 ` Scott Beardsley
2008-10-14 0:19 ` Scott Beardsley
0 siblings, 1 reply; 4+ messages in thread
From: Scott Beardsley @ 2008-10-13 17:18 UTC (permalink / raw)
To: linux-ide
Alan Cox wrote:
> The timeout set on the command expired so the kernel aborted the command
> and recovered the link.
That makes sense. AFAICT the smartctl ioctl commands are completing
successfully (with a timeout of 6 secs). I think the timeout might be
with any subsequent ioctl commands but it seems to be triggered when
smartctl kicks off a test.
For some strange reason I can't get scsi_logging to work correctly
(likely my fault). Is there a way to log all ioctl's? I compiled the
kernel with CONFIG_SCSI_LOGGING=y. I tried playing with the
scsi_logging=all kernel option but no dice. Also I get a write error
when I try to enable logging on the fly:
# echo "scsi log timeout 7" > /proc/scsi/scsi
-bash: echo: write error: Invalid argument
#
Read on for the poorman's logger (strace).
> Dump the actual SG_IO command block issued and see what timeout and other
> options are set.
I'm using strace and this code (I only changed the timeout)[1] that just
runs an inquiry with a timeout of 6 secs. I shut down all other active
daemons and managed to catch an ioctl in the middle of the reset (after
a few tries). First I ran the smartctl command (v5.36 this time):
# strace -rt -e ioctl smartctl -d ata -t short -r ioctl,10 /dev/sda
smartctl version 5.36 [x86_64-redhat-linux-gnu] Copyright (C) 2002-6
Bruce Allen
Home page is http://smartmontools.sourceforge.net/
REPORT-IOCTL: DeviceFD=3 Command=IDENTIFY DEVICE
0.000000 ioctl(3, 0x30d, 0x7fffffe0f930) = 0
0.000122 ioctl(3, 0x31f, 0x7fffffe0fb30) = 0
REPORT-IOCTL: DeviceFD=3 Command=IDENTIFY DEVICE returned 0
===== [IDENTIFY DEVICE] DATA START (BASE-16) =====
000-015: 5a 0c ff 3f 37 c8 10 00 00 00 00 00 3f 00 00 00
016-031: 00 00 00 00 20 20 20 20 20 20 20 20 20 20 20 20
032-047: 51 39 36 4d 58 59 41 32 00 00 00 00 04 00 4e 53
048-063: 35 30 20 20 20 20 54 53 35 33 30 30 32 33 4e 30
064-079: 20 53 20 20 20 20 20 20 20 20 20 20 20 20 20 20
080-095: 20 20 20 20 20 20 20 20 20 20 20 20 20 20 10 80
096-111: 00 00 00 2f 00 40 00 02 00 02 07 00 ff 3f 10 00
112-127: 3f 00 10 fc fb 00 10 00 ff ff ff 0f 00 00 07 00
128-143: 03 00 78 00 78 00 78 00 78 00 00 00 00 00 00 00
144-159: 00 00 00 00 00 00 1f 00 02 05 00 00 40 00 40 00
160-175: f0 01 29 00 6b 34 01 7d 23 41 69 34 01 bc 23 41
176-191: 7f 40 31 00 31 00 00 00 fe ff 00 00 00 fe 00 00
192-207: 00 00 00 00 00 00 00 00 30 60 38 3a 00 00 00 00
208-223: 00 00 00 00 00 00 00 00 00 50 00 c5 d2 0d 57 1f
224-239: 00 00 00 00 00 00 00 00 00 00 00 00 00 00 1e 40
240-255: 1e 40 00 00 00 00 00 00 00 00 00 00 00 00 00 00
256-271: 21 00 30 60 38 3a 30 60 38 3a 00 00 02 00 40 01
272-287: 00 01 00 50 06 3c 0a 3c 00 00 3c 00 00 00 08 00
288-303: 00 00 00 00 0f 00 80 02 00 00 00 00 08 00 00 00
304-319: 00 00 00 00 00 00 00 00 00 00 00 00 00 27 00 80
320-335: 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00
336-351: 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00
352-367: 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00
368-383: 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00
384-399: 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00
400-415: 00 00 00 00 00 00 00 00 00 00 00 00 3d 00 00 00
416-431: 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00
432-447: 00 00 20 1c 00 00 00 00 02 00 00 00 10 10 00 00
448-463: 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00
464-479: 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00
480-495: 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00
496-511: 00 00 00 00 00 00 00 00 00 00 00 00 00 00 a5 a2
===== [IDENTIFY DEVICE] DATA END (512 Bytes) =====
REPORT-IOCTL: DeviceFD=3 Command=SMART STATUS
0.014834 ioctl(3, 0x31f, 0x7fffffe0fb40) = 0
REPORT-IOCTL: DeviceFD=3 Command=SMART STATUS returned 0
REPORT-IOCTL: DeviceFD=3 Command=SMART STATUS CHECK
0.044981 ioctl(3, 0x31e, 0x7fffffe0fb50) = 0
REPORT-IOCTL: DeviceFD=3 Command=SMART STATUS CHECK returned 0
REPORT-IOCTL: DeviceFD=3 Command=SMART READ ATTRIBUTE VALUES
0.041645 ioctl(3, 0x31f, 0x7fffffe0fb30) = 0
REPORT-IOCTL: DeviceFD=3 Command=SMART READ ATTRIBUTE VALUES returned 0
===== [SMART READ ATTRIBUTE VALUES] DATA START (BASE-16) =====
000-015: 0a 00 01 0f 00 4b 45 a8 ab d5 01 00 00 00 03 03
016-031: 00 63 63 00 00 00 00 00 00 00 04 32 00 64 64 14
032-047: 00 00 00 00 00 00 05 33 00 64 64 00 00 00 00 00
048-063: 00 00 07 0f 00 64 fd 1a 0b 0d 00 00 00 00 09 32
064-079: 00 64 64 1b 01 00 00 00 00 00 0a 13 00 64 64 00
080-095: 00 00 00 00 00 00 0c 32 00 64 25 14 00 00 00 00
096-111: 00 00 b8 32 00 64 64 00 00 00 00 00 00 00 bb 32
112-127: 00 64 64 00 00 00 00 00 00 00 bc 32 00 64 64 00
128-143: 00 00 00 00 00 00 bd 3a 00 64 64 00 00 00 00 00
144-159: 00 00 be 22 00 48 46 1c 00 1c 1c 00 00 00 c2 22
160-175: 00 1c 28 1c 00 00 00 19 00 00 c3 1a 00 23 20 a8
176-191: ab d5 01 00 00 00 c5 12 00 64 64 00 00 00 00 00
192-207: 00 00 c6 10 00 64 64 00 00 00 00 00 00 00 c7 3e
208-223: 00 c8 c8 00 00 00 00 00 00 00 00 00 00 00 00 00
224-239: 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00
240-255: 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00
256-271: 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00
272-287: 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00
288-303: 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00
304-319: 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00
320-335: 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00
336-351: 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00
352-367: 00 00 00 00 00 00 00 00 00 00 82 29 7a 02 00 7b
368-383: 03 00 01 00 01 72 02 00 00 00 00 00 00 00 00 00
384-399: 00 00 05 00 50 0b 00 00 04 01 01 01 01 01 01 01
400-415: 01 00 00 00 00 00 00 00 00 01 00 00 00 00 00 00
416-431: 00 00 00 00 00 00 00 00 4a bc 2c 73 ed 00 00 00
432-447: 00 00 00 00 01 00 a0 00 0f ba 7c 02 00 00 00 00
448-463: e8 9a 8b 49 0b 00 00 00 00 00 00 00 8d d4 e0 02
464-479: 00 00 00 00 00 00 00 00 9d 00 00 00 00 00 00 00
480-495: 11 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00
496-511: 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 5d
===== [SMART READ ATTRIBUTE VALUES] DATA END (512 Bytes) =====
REPORT-IOCTL: DeviceFD=3 Command=SMART READ ATTRIBUTE THRESHOLDS
0.102167 ioctl(3, 0x31f, 0x7fffffe0fb40) = 0
REPORT-IOCTL: DeviceFD=3 Command=SMART READ ATTRIBUTE THRESHOLDS returned 0
===== [SMART READ ATTRIBUTE THRESHOLDS] DATA START (BASE-16) =====
000-015: 01 00 01 2c 00 00 00 00 00 00 00 00 00 00 03 00
016-031: 00 00 00 00 00 00 00 00 00 00 04 14 00 00 00 00
032-047: 00 00 00 00 00 00 05 24 00 00 00 00 00 00 00 00
048-063: 00 00 07 1e 00 00 00 00 00 00 00 00 00 00 09 00
064-079: 00 00 00 00 00 00 00 00 00 00 0a 61 00 00 00 00
080-095: 00 00 00 00 00 00 0c 14 00 00 00 00 00 00 00 00
096-111: 00 00 b8 63 00 00 00 00 00 00 00 00 00 00 bb 00
112-127: 00 00 00 00 00 00 00 00 00 00 bc 00 00 00 00 00
128-143: 00 00 00 00 00 00 bd 00 00 00 00 00 00 00 00 00
144-159: 00 00 be 2d 00 00 00 00 00 00 00 00 00 00 c2 00
160-175: 00 00 00 00 00 00 00 00 00 00 c3 00 00 00 00 00
176-191: 00 00 00 00 00 00 c5 00 00 00 00 00 00 00 00 00
192-207: 00 00 c6 00 00 00 00 00 00 00 00 00 00 00 c7 00
208-223: 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00
224-239: 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00
240-255: 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00
256-271: 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00
272-287: 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00
288-303: 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00
304-319: 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00
320-335: 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00
336-351: 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00
352-367: 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00
368-383: 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00
384-399: 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00
400-415: 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00
416-431: 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00
432-447: 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00
448-463: 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00
464-479: 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00
480-495: 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00
496-511: 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 c4
===== [SMART READ ATTRIBUTE THRESHOLDS] DATA END (512 Bytes) =====
=== START OF OFFLINE IMMEDIATE AND SELF-TEST SECTION ===
Sending command: "Execute SMART Short self-test routine immediately in
off-line mode".
REPORT-IOCTL: DeviceFD=3 Command=SMART IMMEDIATE OFFLINE InputParameter=1
0.015623 ioctl(3, 0x31f, 0x7fffffe0f9f0) = 0
REPORT-IOCTL: DeviceFD=3 Command=SMART IMMEDIATE OFFLINE returned 0
Drive command "Execute SMART Short self-test routine immediately in
off-line mode" successful.
Testing has begun.
Please wait 1 minutes for test to complete.
Test will complete after Mon Oct 13 11:51:41 2008
Use smartctl -X to abort test.
#
Then I ran the inquiry binary until it hung in the middle of an ioctl:
# strace -rt -e ioctl /root/test /dev/sda
0.000000 ioctl(3, SG_GET_VERSION_NUM, 0x7fff65e00540) = 0
timeout: 6 secs: Success
0.000299 ioctl(3, SG_IO, {'S', SG_DXFER_FROM_DEV, cmd[6]=[12, 00,
00, 00, 60, 00], mx_sb_len=32, iovec_count=0, dxfer_len=96,
timeout=6000, flags=0
... disk resets here ...
, data[96]=["\0\0\5\2[\0\0\0ATA ST3500320NS "...], status=00,
masked_status=00, sb[0]=[], host_status=0, driver_status=0, resid=0,
duration=77887, info=0}) = 0
Some of the INQUIRY command's response:
ATA ST3500320NS SN05
INQUIRY duration=77887 millisecs, resid=0
#
You can see it hangs for around 78 secs.
Scott
---------------
[1] http://www.faqs.org/docs/Linux-HOWTO/SCSI-Generic-HOWTO.html#PEXAMPLE
^ permalink raw reply [flat|nested] 4+ messages in thread
* Re: SATA resets via SMART selftest
2008-10-13 17:18 ` Scott Beardsley
@ 2008-10-14 0:19 ` Scott Beardsley
0 siblings, 0 replies; 4+ messages in thread
From: Scott Beardsley @ 2008-10-14 0:19 UTC (permalink / raw)
To: linux-ide
Scott Beardsley wrote:
> Alan Cox wrote:
>> The timeout set on the command expired so the kernel aborted the command
>> and recovered the link.
>
> For some strange reason I can't get scsi_logging to work correctly
> (likely my fault).
Yep, I was looking in the wrong place (and probably at old docs). I had
to run:
# echo 7 > /proc/sys/dev/scsi/logging_level
Below is that same sequence (SMART short test, then INQUIRYs with 7.5sec
timeouts this time). The SMART cmd was issued at 1083, the first INQUIRY
(of a series of 10) is at 1084, the disk finishes resetting at 1168.
Perhaps this is a problem with Seagate's implementation of SMART on this
disk? hdparm -I states that it does, in fact, support self-tests. The
timeouts don't seem unusually short (10+sec). I'm stumped why it would
be timing out.
Scott
---------------------
[ 1064.969694] scsi_add_timer: scmd: ffff88021d4f9800, time: 60000,
(ffffffffa006f521)
[ 1064.970262] scsi_delete_timer: scmd: ffff88021d4f9800, rtn: 1
[ 1064.970270] scsi_add_timer: scmd: ffff88021d4f91c0, time: 60000,
(ffffffffa006f521)
[ 1064.970769] scsi_delete_timer: scmd: ffff88021d4f91c0, rtn: 1
[ 1083.788393] scsi_add_timer: scmd: ffff88021d4f91c0, time: 60000,
(ffffffffa006f521)
[ 1083.797591] scsi_delete_timer: scmd: ffff88021d4f91c0, rtn: 1
[ 1083.797641] scsi_add_timer: scmd: ffff88021d91be80, time: 60000,
(ffffffffa006f521)
[ 1083.808215] scsi_delete_timer: scmd: ffff88021d91be80, rtn: 1
[ 1083.808358] scsi_add_timer: scmd: ffff88021d91bd40, time: 60000,
(ffffffffa006f521)
[ 1083.808936] scsi_delete_timer: scmd: ffff88021d91bd40, rtn: 1
[ 1083.808966] scsi_add_timer: scmd: ffff88021d91bc00, time: 60000,
(ffffffffa006f521)
[ 1083.811553] scsi_delete_timer: scmd: ffff88021d91bc00, rtn: 1
[ 1083.812533] scsi_add_timer: scmd: ffff88021d91bac0, time: 60000,
(ffffffffa006f521)
[ 1083.812668] scsi_delete_timer: scmd: ffff88021d91bac0, rtn: 1
[ 1083.812678] scsi_add_timer: scmd: ffff88021d91b980, time: 60000,
(ffffffffa006f521)
[ 1083.812981] scsi_delete_timer: scmd: ffff88021d91b980, rtn: 1
[ 1083.812992] scsi_add_timer: scmd: ffff88021d91bac0, time: 60000,
(ffffffffa006f521)
[ 1083.813789] scsi_delete_timer: scmd: ffff88021d91bac0, rtn: 1
[ 1083.813921] scsi_block_when_processing_errors: rtn: 1
[ 1083.813940] scsi_add_timer: scmd: ffff88021d91b840, time: 60000,
(ffffffffa006f521)
[ 1083.814089] scsi_delete_timer: scmd: ffff88021d91b840, rtn: 1
[ 1083.814126] scsi_block_when_processing_errors: rtn: 1
[ 1083.814132] scsi_block_when_processing_errors: rtn: 1
[ 1083.814136] scsi_block_when_processing_errors: rtn: 1
[ 1083.814142] scsi_block_when_processing_errors: rtn: 1
[ 1083.814156] scsi_add_timer: scmd: ffff88021d91b700, time: 10000,
(ffffffffa006f521)
[ 1083.816710] scsi_delete_timer: scmd: ffff88021d91b700, rtn: 1
[ 1083.822368] scsi_block_when_processing_errors: rtn: 1
[ 1083.822372] scsi_block_when_processing_errors: rtn: 1
[ 1083.822380] scsi_add_timer: scmd: ffff88021d91b5c0, time: 10000,
(ffffffffa006f521)
[ 1083.878472] scsi_delete_timer: scmd: ffff88021d91b5c0, rtn: 1
[ 1083.878496] scsi_block_when_processing_errors: rtn: 1
[ 1083.878500] scsi_block_when_processing_errors: rtn: 1
[ 1083.878506] scsi_add_timer: scmd: ffff88021d91b480, time: 10000,
(ffffffffa006f521)
[ 1083.920116] scsi_delete_timer: scmd: ffff88021d91b480, rtn: 1
[ 1083.920139] scsi_block_when_processing_errors: rtn: 1
[ 1083.920143] scsi_block_when_processing_errors: rtn: 1
[ 1083.920150] scsi_add_timer: scmd: ffff88021d91b340, time: 10000,
(ffffffffa006f521)
[ 1084.012873] scsi_delete_timer: scmd: ffff88021d91b340, rtn: 1
[ 1084.012995] scsi_block_when_processing_errors: rtn: 1
[ 1084.012999] scsi_block_when_processing_errors: rtn: 1
[ 1084.013006] scsi_add_timer: scmd: ffff88021d91b200, time: 10000,
(ffffffffa006f521)
[ 1084.020169] scsi_delete_timer: scmd: ffff88021d91b200, rtn: 1
[ 1084.020207] scsi_block_when_processing_errors: rtn: 1
[ 1084.020211] scsi_block_when_processing_errors: rtn: 1
[ 1084.020216] scsi_add_timer: scmd: ffff88021d91b0c0, time: 10000,
(ffffffffa006f521)
[ 1084.021323] scsi_delete_timer: scmd: ffff88021d91b0c0, rtn: 1
[ 1084.023222] scsi_block_when_processing_errors: rtn: 1
[ 1084.023318] scsi_block_when_processing_errors: rtn: 1
[ 1084.023322] scsi_block_when_processing_errors: rtn: 1
[ 1084.023731] scsi_block_when_processing_errors: rtn: 1
[ 1084.023735] scsi_block_when_processing_errors: rtn: 1
[ 1084.023754] scsi_add_timer: scmd: ffff88021d474e40, time: 7500,
(ffffffffa006f521)
[ 1084.023765] scsi_delete_timer: scmd: ffff88021d474e40, rtn: 1
[ 1084.788506] scsi_block_when_processing_errors: rtn: 1
[ 1084.788568] scsi_block_when_processing_errors: rtn: 1
[ 1084.788571] scsi_block_when_processing_errors: rtn: 1
[ 1084.788856] scsi_block_when_processing_errors: rtn: 1
[ 1084.788859] scsi_block_when_processing_errors: rtn: 1
[ 1084.788872] scsi_add_timer: scmd: ffff88021bd48e80, time: 7500,
(ffffffffa006f521)
[ 1084.788877] scsi_delete_timer: scmd: ffff88021bd48e80, rtn: 1
[ 1085.716417] scsi_block_when_processing_errors: rtn: 1
[ 1085.716488] scsi_block_when_processing_errors: rtn: 1
[ 1085.716490] scsi_block_when_processing_errors: rtn: 1
[ 1085.716790] scsi_block_when_processing_errors: rtn: 1
[ 1085.716793] scsi_block_when_processing_errors: rtn: 1
[ 1085.716809] scsi_add_timer: scmd: ffff88021d91b980, time: 7500,
(ffffffffa006f521)
[ 1085.716816] scsi_delete_timer: scmd: ffff88021d91b980, rtn: 1
[ 1086.596181] scsi_block_when_processing_errors: rtn: 1
[ 1086.596277] scsi_block_when_processing_errors: rtn: 1
[ 1086.596279] scsi_block_when_processing_errors: rtn: 1
[ 1086.596631] scsi_block_when_processing_errors: rtn: 1
[ 1086.596634] scsi_block_when_processing_errors: rtn: 1
[ 1086.596697] scsi_add_timer: scmd: ffff88021d91b0c0, time: 7500,
(ffffffffa006f521)
[ 1086.596703] scsi_delete_timer: scmd: ffff88021d91b0c0, rtn: 1
[ 1087.428274] scsi_block_when_processing_errors: rtn: 1
[ 1087.428369] scsi_block_when_processing_errors: rtn: 1
[ 1087.428372] scsi_block_when_processing_errors: rtn: 1
[ 1087.428724] scsi_block_when_processing_errors: rtn: 1
[ 1087.428727] scsi_block_when_processing_errors: rtn: 1
[ 1087.428788] scsi_add_timer: scmd: ffff88021d91b0c0, time: 7500,
(ffffffffa006f521)
[ 1087.428794] scsi_delete_timer: scmd: ffff88021d91b0c0, rtn: 1
[ 1088.100442] scsi_block_when_processing_errors: rtn: 1
[ 1088.100509] scsi_block_when_processing_errors: rtn: 1
[ 1088.100512] scsi_block_when_processing_errors: rtn: 1
[ 1088.100801] scsi_block_when_processing_errors: rtn: 1
[ 1088.100804] scsi_block_when_processing_errors: rtn: 1
[ 1088.100820] scsi_add_timer: scmd: ffff88021bd48e80, time: 7500,
(ffffffffa006f521)
[ 1088.100826] scsi_delete_timer: scmd: ffff88021bd48e80, rtn: 1
[ 1088.580211] scsi_block_when_processing_errors: rtn: 1
[ 1088.580304] scsi_block_when_processing_errors: rtn: 1
[ 1088.580308] scsi_block_when_processing_errors: rtn: 1
[ 1088.580710] scsi_block_when_processing_errors: rtn: 1
[ 1088.580714] scsi_block_when_processing_errors: rtn: 1
[ 1088.580727] scsi_add_timer: scmd: ffff88021d474e40, time: 7500,
(ffffffffa006f521)
[ 1088.580731] scsi_delete_timer: scmd: ffff88021d474e40, rtn: 1
[ 1089.011690] scsi_block_when_processing_errors: rtn: 1
[ 1089.011754] scsi_block_when_processing_errors: rtn: 1
[ 1089.011757] scsi_block_when_processing_errors: rtn: 1
[ 1089.012053] scsi_block_when_processing_errors: rtn: 1
[ 1089.012056] scsi_block_when_processing_errors: rtn: 1
[ 1089.012069] scsi_add_timer: scmd: ffff88021d91b980, time: 7500,
(ffffffffa006f521)
[ 1089.012075] scsi_delete_timer: scmd: ffff88021d91b980, rtn: 1
[ 1089.507506] scsi_add_timer: scmd: ffff88021d474e40, time: 60000,
(ffffffffa006f521)
[ 1149.503878] Waking error handler thread
[ 1149.503883] Error handler scsi_eh_0 waking up
[ 1149.503903] ata1.00: exception Emask 0x0 SAct 0x0 SErr 0x0 action 0x6
frozen
[ 1149.503909] ata1.00: cmd ca/00:70:ff:5e:00/00:00:00:00:00/e0 tag 0
dma 57344 out
[ 1149.503911] res 40/00:00:01:4f:c2/00:00:00:00:00/00 Emask
0x4 (timeout)
[ 1149.503919] ata1.00: status: { DRDY }
[ 1154.552602] ata1: link is slow to respond, please be patient (ready=0)
[ 1159.550330] ata1: device not ready (errno=-16), forcing hardreset
[ 1159.550336] ata1: soft resetting link
[ 1164.750045] ata1: link is slow to respond, please be patient (ready=0)
[ 1168.108894] ata1.00: configured for UDMA/133
[ 1168.108905] scsi_eh_0: flush retry cmd: ffff88021d474e40
[ 1168.108909] ata1: EH complete
[ 1168.108913] scsi_restart_operations: waking up host to restart
[ 1168.108917] scsi_add_timer: scmd: ffff88021d474e40, time: 60000,
(ffffffffa006f521)
[ 1168.109002] Error handler scsi_eh_0 sleeping
[ 1168.129442] scsi_delete_timer: scmd: ffff88021d474e40, rtn: 1
[ 1168.129460] scsi_add_timer: scmd: ffff88021d4f9440, time: 30000,
(ffffffffa006f521)
[ 1168.129464] scsi_delete_timer: scmd: ffff88021d4f9440, rtn: 1
[ 1168.129469] scsi_add_timer: scmd: ffff88021d4f91c0, time: 60000,
(ffffffffa006f521)
[ 1168.129973] scsi_delete_timer: scmd: ffff88021d4f91c0, rtn: 1
[ 1168.129987] scsi_add_timer: scmd: ffff88021d4f9440, time: 30000,
(ffffffffa006f521)
[ 1168.129991] scsi_delete_timer: scmd: ffff88021d4f9440, rtn: 1
[ 1168.130003] scsi_add_timer: scmd: ffff88021d4f91c0, time: 60000,
(ffffffffa006f521)
[ 1168.130007] sd 0:0:0:0: [sda] 976773168 512-byte hardware sectors
(500108 MB)
[ 1168.130384] scsi_delete_timer: scmd: ffff88021d4f91c0, rtn: 1
[ 1168.130393] scsi_add_timer: scmd: ffff88021d474e40, time: 30000,
(ffffffffa006f521)
[ 1168.130397] scsi_delete_timer: scmd: ffff88021d474e40, rtn: 1
[ 1168.130405] scsi_add_timer: scmd: ffff88021d4f91c0, time: 60000,
(ffffffffa006f521)
[ 1168.130458] sd 0:0:0:0: [sda] Write Protect is off
[ 1168.130460] sd 0:0:0:0: [sda] Mode Sense: 00 3a 00 00
[ 1168.130928] scsi_delete_timer: scmd: ffff88021d4f91c0, rtn: 1
[ 1168.130938] scsi_add_timer: scmd: ffff88021d474e40, time: 30000,
(ffffffffa006f521)
[ 1168.130942] scsi_delete_timer: scmd: ffff88021d474e40, rtn: 1
[ 1168.130951] scsi_add_timer: scmd: ffff88021d4f91c0, time: 60000,
(ffffffffa006f521)
[ 1168.131462] scsi_delete_timer: scmd: ffff88021d4f91c0, rtn: 1
[ 1168.131471] scsi_add_timer: scmd: ffff88021d4f9940, time: 30000,
(ffffffffa006f521)
[ 1168.131475] scsi_delete_timer: scmd: ffff88021d4f9940, rtn: 1
[ 1168.131482] scsi_add_timer: scmd: ffff88021d4f91c0, time: 60000,
(ffffffffa006f521)
[ 1168.131532] sd 0:0:0:0: [sda] Write cache: enabled, read cache:
enabled, doesn't support DPO or FUA
[ 1168.131899] scsi_delete_timer: scmd: ffff88021d4f91c0, rtn: 1
[ 1168.131903] scsi_block_when_processing_errors: rtn: 1
[ 1168.131907] scsi_add_timer: scmd: ffff88021d4f9940, time: 60000,
(ffffffffa006f521)
[ 1168.131969] scsi_block_when_processing_errors: rtn: 1
[ 1168.131971] scsi_block_when_processing_errors: rtn: 1
[ 1168.132259] scsi_block_when_processing_errors: rtn: 1
[ 1168.132260] scsi_block_when_processing_errors: rtn: 1
[ 1168.132343] scsi_delete_timer: scmd: ffff88021d4f9940, rtn: 1
[ 1168.132352] scsi_add_timer: scmd: ffff88021d4f9800, time: 60000,
(ffffffffa006f521)
[ 1168.132854] scsi_delete_timer: scmd: ffff88021d4f9800, rtn: 1
[ 1168.132862] scsi_add_timer: scmd: ffff88021d4f9940, time: 7500,
(ffffffffa006f521)
[ 1168.132865] scsi_delete_timer: scmd: ffff88021d4f9940, rtn: 1
[ 1168.132889] scsi_add_timer: scmd: ffff88021d4f9940, time: 60000,
(ffffffffa006f521)
[ 1168.133297] scsi_delete_timer: scmd: ffff88021d4f9940, rtn: 1
[ 1168.133315] scsi_add_timer: scmd: ffff88021d4f9940, time: 60000,
(ffffffffa006f521)
[ 1168.133737] scsi_delete_timer: scmd: ffff88021d4f9940, rtn: 1
[ 1168.133798] scsi_add_timer: scmd: ffff88021d4f9940, time: 60000,
(ffffffffa006f521)
[ 1168.134198] scsi_delete_timer: scmd: ffff88021d4f9940, rtn: 1
[ 1168.134220] scsi_add_timer: scmd: ffff88021d4f9940, time: 60000,
(ffffffffa006f521)
[ 1168.134740] scsi_delete_timer: scmd: ffff88021d4f9940, rtn: 1
[ 1168.134762] scsi_add_timer: scmd: ffff88021d4f9800, time: 60000,
(ffffffffa006f521)
[ 1168.135126] scsi_block_when_processing_errors: rtn: 1
[ 1168.135182] scsi_delete_timer: scmd: ffff88021d4f9800, rtn: 1
[ 1168.135206] scsi_block_when_processing_errors: rtn: 1
[ 1168.135208] scsi_block_when_processing_errors: rtn: 1
[ 1168.135504] scsi_block_when_processing_errors: rtn: 1
[ 1168.135505] scsi_block_when_processing_errors: rtn: 1
[ 1168.135518] scsi_add_timer: scmd: ffff88021d4f9800, time: 7500,
(ffffffffa006f521)
[ 1168.135522] scsi_delete_timer: scmd: ffff88021d4f9800, rtn: 1
[ 1168.135807] scsi_add_timer: scmd: ffff88021d4f9800, time: 60000,
(ffffffffa006f521)
[ 1168.136165] scsi_delete_timer: scmd: ffff88021d4f9800, rtn: 1
^ permalink raw reply [flat|nested] 4+ messages in thread
end of thread, other threads:[~2008-10-14 0:19 UTC | newest]
Thread overview: 4+ messages (download: mbox.gz follow: Atom feed
-- links below jump to the message on this page --
2008-10-10 20:38 SATA resets via SMART selftest Scott Beardsley
2008-10-10 21:08 ` Alan Cox
2008-10-13 17:18 ` Scott Beardsley
2008-10-14 0:19 ` Scott Beardsley
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).