From: Carsten Aulbert <Carsten.Aulbert@aei.mpg.de>
To: "Mathias Burén" <mathias.buren@gmail.com>
Cc: Linux RAID <linux-raid@vger.kernel.org>
Subject: Re: Recovering from two almost simultaneously failed devices in RAID1
Date: Sat, 10 Aug 2013 20:05:42 +0200
Message-ID: <520680F6.1090108@aei.mpg.de>
In-Reply-To: <CADNH=7G=MFJuwrTQ=r4uFgTji87iVr3ou04q7yN50tCojJGKfA@mail.gmail.com>


Hi

On 08/10/2013 07:45 PM, Mathias Burén wrote:
> smartctl -a 

Looks pretty much innocent:

  1 Raw_Read_Error_Rate     0x000a   100   100   000    Old_age   Always       -       0
  2 Throughput_Performance  0x0005   100   100   050    Pre-fail  Offline      -       0
  3 Spin_Up_Time            0x0007   100   100   050    Pre-fail  Always       -       0
  5 Reallocated_Sector_Ct   0x0013   100   100   050    Pre-fail  Always       -       0
  7 Seek_Error_Rate         0x000b   100   100   050    Pre-fail  Always       -       0
  8 Seek_Time_Performance   0x0005   100   100   050    Pre-fail  Offline      -       0
  9 Power_On_Hours          0x0012   100   100   000    Old_age   Always       -       9098
 10 Spin_Retry_Count        0x0013   100   100   050    Pre-fail  Always       -       0
 12 Power_Cycle_Count       0x0012   100   100   000    Old_age   Always       -       13
167 Unknown_Attribute       0x0022   100   100   000    Old_age   Always       -       0
168 Unknown_Attribute       0x0012   100   100   000    Old_age   Always       -       2
169 Unknown_Attribute       0x0013   092   092   010    Pre-fail  Always       -       0
173 Unknown_Attribute       0x0012   169   169   000    Old_age   Always       -       0
175 Program_Fail_Count_Chip 0x0013   100   100   010    Pre-fail  Always       -       0
192 Power-Off_Retract_Count 0x0012   100   100   000    Old_age   Always       -       0
194 Temperature_Celsius     0x0023   073   073   030    Pre-fail  Always       -       27 (Lifetime Min/Max 26/40)
197 Current_Pending_Sector  0x0012   100   100   000    Old_age   Always       -       0
240 Head_Flying_Hours       0x0013   100   100   050    Pre-fail  Always       -       0

SMART Error Log Version: 1
No Errors Logged

SMART Self-test log structure revision number 1
Num  Test_Description    Status                  Remaining  LifeTime(hours)  LBA_of_first_error
# 1  Extended offline    Completed without error       00%      9098         -


and

ID# ATTRIBUTE_NAME          FLAG     VALUE WORST THRESH TYPE      UPDATED  WHEN_FAILED RAW_VALUE
  1 Raw_Read_Error_Rate     0x000a   100   100   000    Old_age   Always       -       0
  2 Throughput_Performance  0x0005   100   100   050    Pre-fail  Offline      -       0
  3 Spin_Up_Time            0x0007   100   100   050    Pre-fail  Always       -       0
  5 Reallocated_Sector_Ct   0x0013   100   100   050    Pre-fail  Always       -       0
  7 Seek_Error_Rate         0x000b   100   100   050    Pre-fail  Always       -       0
  8 Seek_Time_Performance   0x0005   100   100   050    Pre-fail  Offline      -       0
  9 Power_On_Hours          0x0012   100   100   000    Old_age   Always       -       9098
 10 Spin_Retry_Count        0x0013   100   100   050    Pre-fail  Always       -       0
 12 Power_Cycle_Count       0x0012   100   100   000    Old_age   Always       -       12
167 Unknown_Attribute       0x0022   100   100   000    Old_age   Always       -       0
168 Unknown_Attribute       0x0012   100   100   000    Old_age   Always       -       2
169 Unknown_Attribute       0x0013   095   095   010    Pre-fail  Always       -       0
173 Unknown_Attribute       0x0012   169   169   000    Old_age   Always       -       0
175 Program_Fail_Count_Chip 0x0013   100   100   010    Pre-fail  Always       -       0
192 Power-Off_Retract_Count 0x0012   100   100   000    Old_age   Always       -       0
194 Temperature_Celsius     0x0023   070   070   030    Pre-fail  Always       -       30 (Lifetime Min/Max 29/40)
197 Current_Pending_Sector  0x0012   100   100   000    Old_age   Always       -       0
240 Head_Flying_Hours       0x0013   100   100   050    Pre-fail  Always       -       0

SMART Error Log Version: 1
No Errors Logged

SMART Self-test log structure revision number 1
Num  Test_Description    Status                  Remaining  LifeTime(hours)  LBA_of_first_error
# 1  Extended offline    Completed without error       00%      9098         -
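
For checking two such attribute tables mechanically rather than by eye,
a minimal Python sketch could look like the following. It assumes each
attribute row sits on a single line (not wrapped by the mail client).
The names ROW, WATCH, parse_attributes, suspicious and differing_ids are
illustrative only, and the WATCH list is an assumption about which raw
counters are worth flagging, not something smartctl defines. The sample
rows are a subset of the two tables above.

```python
import re

# One unwrapped row of the smartctl attribute table.
ROW = re.compile(
    r"^\s*(?P<id>\d+)\s+(?P<name>\S+)\s+(?P<flag>0x[0-9a-fA-F]{4})\s+"
    r"(?P<value>\d+)\s+(?P<worst>\d+)\s+(?P<thresh>\d+)\s+"
    r"(?P<type>\S+)\s+(?P<updated>\S+)\s+(?P<when_failed>\S+)\s+(?P<raw>\d+)"
)

# Assumed watch list: raw counters that normally stay at zero.
WATCH = {"Reallocated_Sector_Ct", "Current_Pending_Sector", "Spin_Retry_Count"}


def parse_attributes(text):
    """Return {attribute id: fields}; keyed by id because names repeat."""
    attrs = {}
    for line in text.splitlines():
        m = ROW.match(line)
        if m:
            attrs[int(m.group("id"))] = {
                "name": m.group("name"),
                "value": int(m.group("value")),
                "thresh": int(m.group("thresh")),
                "raw": int(m.group("raw")),
            }
    return attrs


def suspicious(attrs):
    """List ids with a nonzero watched raw value or value at/below threshold."""
    hits = []
    for attr_id, a in sorted(attrs.items()):
        if a["name"] in WATCH and a["raw"] > 0:
            hits.append(attr_id)
        elif a["thresh"] > 0 and a["value"] <= a["thresh"]:
            hits.append(attr_id)
    return hits


def differing_ids(a, b):
    """Ids present in both tables whose normalized or raw value differs."""
    return sorted(
        i for i in a.keys() & b.keys()
        if (a[i]["value"], a[i]["raw"]) != (b[i]["value"], b[i]["raw"])
    )


SAMPLE_A = """\
  5 Reallocated_Sector_Ct   0x0013   100   100   050    Pre-fail  Always       -       0
 12 Power_Cycle_Count       0x0012   100   100   000    Old_age   Always       -       13
169 Unknown_Attribute       0x0013   092   092   010    Pre-fail  Always       -       0
194 Temperature_Celsius     0x0023   073   073   030    Pre-fail  Always       -       27 (Lifetime Min/Max 26/40)
"""

SAMPLE_B = """\
  5 Reallocated_Sector_Ct   0x0013   100   100   050    Pre-fail  Always       -       0
 12 Power_Cycle_Count       0x0012   100   100   000    Old_age   Always       -       12
169 Unknown_Attribute       0x0013   095   095   010    Pre-fail  Always       -       0
194 Temperature_Celsius     0x0023   070   070   030    Pre-fail  Always       -       30 (Lifetime Min/Max 29/40)
"""

if __name__ == "__main__":
    a, b = parse_attributes(SAMPLE_A), parse_attributes(SAMPLE_B)
    print("suspicious A:", suspicious(a))
    print("suspicious B:", suspicious(b))
    print("differing ids:", differing_ids(a, b))
```

On these sample rows nothing is flagged, and only the power-cycle
count, attribute 169 and the temperature differ between the two tables.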


I think I need to check the BIOS settings and reseat all internal
connections on Monday, when I'm back in the office. I will also ask
Supermicro whether they know more about this (and perhaps have a new BIOS).

More suggestions?

Cheers

carsten


-- 
Dr. Carsten Aulbert - Max Planck Institute for Gravitational Physics
Callinstrasse 38, 30167 Hannover, Germany
phone/fax: +49 511 762-17185 / -17193
https://wiki.atlas.aei.uni-hannover.de/foswiki/bin/view/ATLAS/WebHome

