From: Michael Stumpf <mjstumpf@pobox.com>
To: Justin Piszcz <jpiszcz@lucidpixels.com>
Cc: linux-raid@vger.kernel.org
Subject: Re: detecting/correcting _slightly_ flaky disks
Date: Mon, 05 Mar 2007 11:01:49 -0600
Message-ID: <45EC4CFD.3050106@pobox.com>
In-Reply-To: <Pine.LNX.4.64.0703051008190.14012@p34.internal.lan>
This is the drive I think is most suspect. What isn't obvious, because
it isn't listed in the self-test log, is that between #1 and #2 there
was an aborted, hung test. The #4 short test that was aborted was also
a hung test that I eventually aborted manually. I heard clicking from
the drives at that time, but I can't swear it came from this drive.

I'm not sure I fully understand the nuances of this report. If anything
jumps out at you, I'd appreciate a tip on how you read it. (To me, it
looks mostly healthy.)
>
>
> Also, what does smartctl -a /dev/hda for each of your drives show?
>
> Justin.
>
>
=== START OF INFORMATION SECTION ===
Model Family: Western Digital Caviar SE family
Device Model: WDC WD1200JB-75CRA0
Serial Number: WD-WMA8C3115683
Firmware Version: 16.06V76
User Capacity: 120,000,000,000 bytes
Device is: In smartctl database [for details use: -P show]
ATA Version is: 5
ATA Standard is: Exact ATA specification draft version not indicated
Local Time is: Mon Mar 05 10:52:05 2007 CAST
SMART support is: Available - device has SMART capability.
                  Enabled status cached by OS, trying SMART RETURN STATUS cmd.
SMART support is: Enabled
=== START OF READ SMART DATA SECTION ===
SMART overall-health self-assessment test result: PASSED
General SMART Values:
Offline data collection status:  (0x85) Offline data collection activity
                                        was aborted by an interrupting command from host.
                                        Auto Offline Data Collection: Enabled.
Self-test execution status:      (   0) The previous self-test routine completed
                                        without error or no self-test has ever
                                        been run.
Total time to complete Offline
data collection:                 (4680) seconds.
Offline data collection
capabilities:                    (0x3b) SMART execute Offline immediate.
                                        Auto Offline data collection on/off support.
                                        Suspend Offline collection upon new
                                        command.
                                        Offline surface scan supported.
                                        Self-test supported.
                                        Conveyance Self-test supported.
                                        No Selective Self-test supported.
SMART capabilities:            (0x0003) Saves SMART data before entering
                                        power-saving mode.
                                        Supports SMART auto save timer.
Error logging capability:        (0x01) Error logging supported.
                                        No General Purpose Logging support.
Short self-test routine
recommended polling time:        (   2) minutes.
Extended self-test routine
recommended polling time:        (  87) minutes.
Conveyance self-test routine
recommended polling time:        (   5) minutes.
SMART Attributes Data Structure revision number: 16
Vendor Specific SMART Attributes with Thresholds:
ID# ATTRIBUTE_NAME          FLAG     VALUE WORST THRESH TYPE      UPDATED  WHEN_FAILED RAW_VALUE
  1 Raw_Read_Error_Rate     0x000b   200   200   051    Pre-fail  Always       -       0
  3 Spin_Up_Time            0x0007   146   098   021    Pre-fail  Always       -       3491
  4 Start_Stop_Count        0x0032   100   100   040    Old_age   Always       -       399
  5 Reallocated_Sector_Ct   0x0033   200   200   140    Pre-fail  Always       -       0
  7 Seek_Error_Rate         0x000b   200   200   051    Pre-fail  Always       -       0
  9 Power_On_Hours          0x0032   070   070   000    Old_age   Always       -       22147
 10 Spin_Retry_Count        0x0013   100   100   051    Pre-fail  Always       -       0
 11 Calibration_Retry_Count 0x0013   100   100   051    Pre-fail  Always       -       0
 12 Power_Cycle_Count       0x0032   100   100   000    Old_age   Always       -       397
196 Reallocated_Event_Count 0x0032   200   200   000    Old_age   Always       -       0
197 Current_Pending_Sector  0x0012   200   200   000    Old_age   Always       -       0
198 Offline_Uncorrectable   0x0012   200   200   000    Old_age   Always       -       0
199 UDMA_CRC_Error_Count    0x000a   200   253   000    Old_age   Always       -       0
200 Multi_Zone_Error_Rate   0x0009   200   200   051    Pre-fail  Offline      -       0
SMART Error Log Version: 1
No Errors Logged
SMART Self-test log structure revision number 1
Num  Test_Description    Status                  Remaining  LifeTime(hours)  LBA_of_first_error
# 1  Extended offline    Completed without error       00%       299         -
# 2  Extended offline    Interrupted (host reset)      50%       279         -
# 3  Short offline       Completed without error       00%       279         -
# 4  Short offline       Aborted by host               80%       279         -
# 5  Extended offline    Completed without error       00%       102         -
# 6  Extended offline    Completed without error       00%      1026         -
# 7  Extended offline    Completed without error       00%       859         -
# 8  Extended offline    Completed without error       00%       692         -
# 9  Extended offline    Completed without error       00%       525         -
#10  Extended offline    Completed without error       00%       380         -
#11  Extended offline    Completed without error       00%       370         -
Device does not support Selective Self Tests/Logging
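
On the question of how to read the attribute table: the usual rule is
that an attribute counts as failed when its normalized VALUE drops to or
below THRESH (for a nonzero THRESH), and a nonzero RAW_VALUE on the
reallocation, pending-sector, and CRC counters is generally worth a look
even while the normalized value is still fine. The following is a
minimal Python sketch of that reading and is not part of the original
message. The names parse_attributes, flag_attributes, and WATCH_RAW are
hypothetical, the choice of watched IDs is an assumption, and the parser
assumes each attribute row sits on one line with a plain integer raw
value.

```python
import re
from typing import NamedTuple


class Attr(NamedTuple):
    id: int
    name: str
    value: int
    worst: int
    thresh: int
    kind: str  # "Pre-fail" or "Old_age"
    raw: int


# One unwrapped row: ID, name, flag, VALUE, WORST, THRESH, TYPE,
# UPDATED, WHEN_FAILED, RAW_VALUE.
ATTR_ROW = re.compile(
    r"^\s*(\d+)\s+(\S+)\s+0x[0-9a-fA-F]{4}\s+(\d+)\s+(\d+)\s+(\d+)\s+"
    r"(Pre-fail|Old_age)\s+(Always|Offline)\s+(\S+)\s+(\d+)"
)

# Assumption: raw counters worth watching even when VALUE looks healthy.
WATCH_RAW = {5, 196, 197, 198, 199}

# A few rows copied from the report in this message, unwrapped.
SAMPLE = """\
  1 Raw_Read_Error_Rate     0x000b   200   200   051    Pre-fail  Always       -       0
  3 Spin_Up_Time            0x0007   146   098   021    Pre-fail  Always       -       3491
  5 Reallocated_Sector_Ct   0x0033   200   200   140    Pre-fail  Always       -       0
197 Current_Pending_Sector  0x0012   200   200   000    Old_age   Always       -       0
199 UDMA_CRC_Error_Count    0x000a   200   253   000    Old_age   Always       -       0
"""


def parse_attributes(text):
    """Return an Attr for every line that looks like an attribute row."""
    attrs = []
    for line in text.splitlines():
        m = ATTR_ROW.match(line)
        if m:
            attrs.append(Attr(int(m[1]), m[2], int(m[3]), int(m[4]),
                              int(m[5]), m[6], int(m[9])))
    return attrs


def flag_attributes(attrs):
    """Return human-readable notes for attributes that deserve attention."""
    notes = []
    for a in attrs:
        if a.thresh > 0 and a.value <= a.thresh:
            notes.append(f"{a.name}: failing now ({a.value} <= {a.thresh})")
        elif a.thresh > 0 and a.worst <= a.thresh:
            notes.append(f"{a.name}: failed in the past (worst {a.worst})")
        elif a.id in WATCH_RAW and a.raw > 0:
            notes.append(f"{a.name}: raw count is {a.raw}")
    return notes


if __name__ == "__main__":
    for note in flag_attributes(parse_attributes(SAMPLE)) or ["nothing flagged"]:
        print(note)
```

For the rows in this report the sketch flags nothing, which agrees with
the "mostly healthy" reading. It says nothing about the hung self-tests,
because the attribute table does not record them.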