* Two system lockups, Magic SysKeys works, RAID6 resyncs on reboot
@ 2015-03-31 15:00 Mark Knecht
From: Mark Knecht @ 2015-03-31 15:00 UTC
To: Linux-RAID
Hi,
In the past week, having moved from a gentoo-sources-3.14.34 kernel
to a gentoo-sources-3.18.10 kernel, I've had two full system lockups
on what has been a very stable, day-to-day machine I use for my
everyday work at home 7 days a week. I've run this machine in this
basic configuration for about 5 years without anything like this
happening. Two in a week is alarming. The Linux part of recovering
hasn't been a big problem, but making sure my Win 7 VMs are OK takes
quite a while. I was running 3 VMs at the time of this morning's
lockup, one of them being new yesterday so it wasn't involved in the
first lockup. The other two RAID1s on the system are not mounted or
affected by this event.
I'm going to try to supply all the stuff I see asked for in most
threads like this. If I miss something please let me know. Attached
are the initramfs setup stuff, mdadm -D & mdadm -E outputs, smartctl
--all for all 5 drives, the output of lsdrv run as root and the
contents of /var/log/messages around the time of the reboot. (7AM)
By my quick read I think the disks look OK but maybe someone with a
deeper skillset can double check that and point out anything I'm
missing. A smartctl -t long test is run on each drive once a week
(probably overkill) and, other than a couple of system shutdowns while
a test was in progress, the smartctl data looks OK to me.
Backups are done every Sunday and the system is functional right
now (I'm writing this email) so I can do more
investigation/backups/study as needed. I don't think there's much
chance of anything really important being lost yet but I'd like to try
and understand what's causing this.
The lockups seem to affect the apps but not the mouse or keyboard.
I can switch to the console and the machine responds to the magic
SysRq sequence REISUB. When the machine reboots there's no obvious
problem; however, the main system drive, a 5-drive RAID6, is in the
process of resyncing by the time I log in and look around. The first
time this happened the resync completed with no problems and this one
is proceeding OK so far.
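Since the console still responds during a lockup, one thing that might be worth capturing before REISUB is a dump of blocked tasks (SysRq+W), which the kernel only honours from the keyboard if the "debugging dumps" bit (8) is set in /proc/sys/kernel/sysrq. The sketch below only decodes that bitmask; `sysrq_allows` is a hypothetical helper name and the bit values are taken from the kernel's sysrq documentation.

```shell
# Return success if a keyboard SysRq function group is enabled.
# $1 = value read from /proc/sys/kernel/sysrq, $2 = bit for the group
# (8 = debugging dumps, 16 = sync, 32 = remount read-only, 128 = reboot).
# A mask of 1 enables everything; 0 disables everything.
sysrq_allows() {
    mask=$1
    bit=$2
    [ "$mask" -eq 1 ] || [ $(( mask & bit )) -ne 0 ]
}
```

Usage would be along the lines of `sysrq_allows "$(cat /proc/sys/kernel/sysrq)" 8 && echo "SysRq+W is available"`. The resulting task dump goes to the kernel log, so it only helps if that log survives the reboot.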
c2RAID6 ~ # eix mdadm
[I] sys-fs/mdadm
Available versions: 3.2.6-r1^t 3.3.1-r2^t{tbz2} ~3.3.2^t {static}
Installed versions: 3.3.1-r2^t{tbz2}(15:11:51 11/14/14)(static)
Homepage: http://neil.brown.name/blog/mdadm
Description: A useful tool for running RAID systems - it
can be used as a replacement for the raidtools
c2RAID6 ~ #
c2RAID6 ~ # cat /proc/mdstat
Personalities : [linear] [raid0] [raid1] [raid10] [raid6] [raid5] [raid4]
md10 : active raid1 sda10[0] sdb10[1]
1430135296 blocks super 1.2 [2/2] [UU]
md4 : active raid1 sdb3[1] sda3[0]
524156736 blocks super 1.2 [2/2] [UU]
md3 : active raid6 sdc3[9] sdg3[5] sdf3[6] sde3[7] sdd3[8]
1452264480 blocks super 1.2 level 6, 16k chunk, algorithm 2 [5/5] [UUUUU]
[====>................] resync = 22.8% (110608304/484088160) finish=78.5min speed=79206K/sec
unused devices: <none>
c2RAID6 ~ #
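For keeping an eye on the resync without rereading the whole file, the progress figure can be pulled out of /proc/mdstat. This is a minimal sketch that assumes the text layout shown above; `mdstat_resync` is a hypothetical helper name.

```shell
# Print "<array> <percent>" for every md array that reports resync progress.
# Reads /proc/mdstat-formatted text on stdin.
mdstat_resync() {
    awk '
        /^md[0-9]+ :/ { array = $1 }          # remember the current array name
        /resync *=/ {
            for (i = 1; i <= NF; i++)
                if ($i ~ /%$/) {              # the field ending in % is the progress
                    sub(/%$/, "", $i)
                    print array, $i
                }
        }
    '
}
```

It would be run as `mdstat_resync < /proc/mdstat`. Arrays whose resync is only pending or delayed have no percentage field and print nothing.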
Again, nothing appears lost at this time but if there's a way to
find out what's causing this I'd very much like to learn how to do
that.
Thanks,
Mark
1) The array is assembled by my hand-written initramfs, which is built
into the kernel.
c2RAID6 ~ # cat /usr/src/initramfs_init_new.sh
#!/bin/busybox sh
rescue_shell() {
echo "-----"
echo "!!!!!"
echo "Something went wrong."
echo "Dropping into busybox"
echo "!!!!!"
echo "-----"
busybox --install -s
exec /bin/sh
}
/bin/busybox --install -s
# Mount the /proc and /sys filesystems.
mount -t proc none /proc
mount -t sysfs none /sys
mount -t devtmpfs none /dev
# Do your stuff here.
echo " "
echo "This init script assembles the RAID6 root as /dev/md3"
echo "mounts it as rootfs and, if there are no problems,"
echo "boots it up, else calls rescue_shell/busybox"
echo " "
mdadm --assemble /dev/md3
# Mount the root filesystem.
mount -o ro /dev/md3 /realroot || rescue_shell
# Clean up.
umount /dev
umount /proc
umount /sys
# Boot the real thing.
exec switch_root /realroot /sbin/init
c2RAID6 ~ #
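In the script above only the mount step falls back to rescue_shell; a failed `mdadm --assemble` is only noticed indirectly when the mount fails. A possible variant, shown purely as a sketch and not what this machine runs, wraps each boot-critical step so the failing command is named. `critical` is a hypothetical helper.

```shell
# Run a boot-critical command; if it fails, say which one and call the
# fallback function named in $1 (rescue_shell in the script above).
critical() {
    fallback=$1
    shift
    "$@" && return 0
    echo "FAILED: $*" >&2
    "$fallback"
}

# Intended use inside the init script (commented out here):
# critical rescue_shell mdadm --assemble /dev/md3
# critical rescue_shell mount -o ro /dev/md3 /realroot
```

This uses only POSIX shell features, so it should behave the same under busybox sh.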
c2RAID6 ~ # cat /usr/src/mdadm_initramfs.conf
ARRAY /dev/md/3 metadata=1.2 UUID=de47f991:86d98467:0637635b:9c6d0591
ARRAY /dev/md/4 metadata=1.2 UUID=c3e562a4:6871679e:2d83518e:fefd237f
ARRAY /dev/md/10 metadata=1.2 UUID=d9624704:b29e1a8e:148b47be:cabad736
c2RAID6 ~ #
c2RAID6 ~ # mdadm -D /dev/md3
/dev/md3:
Version : 1.2
Creation Time : Thu Dec 30 17:40:50 2010
Raid Level : raid6
Array Size : 1452264480 (1384.99 GiB 1487.12 GB)
Used Dev Size : 484088160 (461.66 GiB 495.71 GB)
Raid Devices : 5
Total Devices : 5
Persistence : Superblock is persistent
Update Time : Tue Mar 31 07:11:01 2015
State : active, resyncing
Active Devices : 5
Working Devices : 5
Failed Devices : 0
Spare Devices : 0
Layout : left-symmetric
Chunk Size : 16K
Resync Status : 15% complete
Name : c2stable:3
UUID : de47f991:86d98467:0637635b:9c6d0591
Events : 38369
Number Major Minor RaidDevice State
9 8 35 0 active sync /dev/sdc3
8 8 51 1 active sync /dev/sdd3
7 8 67 2 active sync /dev/sde3
6 8 83 3 active sync /dev/sdf3
5 8 99 4 active sync /dev/sdg3
c2RAID6 ~ #
c2RAID6 ~ # mdadm -E /dev/sdc3
/dev/sdc3:
Magic : a92b4efc
Version : 1.2
Feature Map : 0x0
Array UUID : de47f991:86d98467:0637635b:9c6d0591
Name : c2stable:3
Creation Time : Thu Dec 30 17:40:50 2010
Raid Level : raid6
Raid Devices : 5
Avail Dev Size : 968176345 (461.66 GiB 495.71 GB)
Array Size : 1452264480 (1384.99 GiB 1487.12 GB)
Used Dev Size : 968176320 (461.66 GiB 495.71 GB)
Data Offset : 2048 sectors
Super Offset : 8 sectors
Unused Space : before=1968 sectors, after=25 sectors
State : active
Device UUID : da06e9db:a8a3dffe:feec1851:d0cbdde6
Update Time : Tue Mar 31 07:26:02 2015
Checksum : 909e27f6 - correct
Events : 38372
Layout : left-symmetric
Chunk Size : 16K
Device Role : Active device 0
Array State : AAAAA ('A' == active, '.' == missing, 'R' == replacing)
c2RAID6 ~ #
c2RAID6 ~ # mdadm -E /dev/sdd3
/dev/sdd3:
Magic : a92b4efc
Version : 1.2
Feature Map : 0x0
Array UUID : de47f991:86d98467:0637635b:9c6d0591
Name : c2stable:3
Creation Time : Thu Dec 30 17:40:50 2010
Raid Level : raid6
Raid Devices : 5
Avail Dev Size : 968176345 (461.66 GiB 495.71 GB)
Array Size : 1452264480 (1384.99 GiB 1487.12 GB)
Used Dev Size : 968176320 (461.66 GiB 495.71 GB)
Data Offset : 2048 sectors
Super Offset : 8 sectors
Unused Space : before=1968 sectors, after=25 sectors
State : active
Device UUID : f3dd1d1e:8fb1a087:1060acfe:b8c9a537
Update Time : Tue Mar 31 07:26:02 2015
Checksum : 59ef7dee - correct
Events : 38372
Layout : left-symmetric
Chunk Size : 16K
Device Role : Active device 1
Array State : AAAAA ('A' == active, '.' == missing, 'R' == replacing)
c2RAID6 ~ #
c2RAID6 ~ # mdadm -E /dev/sde3
/dev/sde3:
Magic : a92b4efc
Version : 1.2
Feature Map : 0x0
Array UUID : de47f991:86d98467:0637635b:9c6d0591
Name : c2stable:3
Creation Time : Thu Dec 30 17:40:50 2010
Raid Level : raid6
Raid Devices : 5
Avail Dev Size : 968176345 (461.66 GiB 495.71 GB)
Array Size : 1452264480 (1384.99 GiB 1487.12 GB)
Used Dev Size : 968176320 (461.66 GiB 495.71 GB)
Data Offset : 2048 sectors
Super Offset : 8 sectors
Unused Space : before=1968 sectors, after=25 sectors
State : active
Device UUID : 6bbd236b:77c22abc:da7c8fc2:e29e601e
Update Time : Tue Mar 31 07:26:02 2015
Checksum : 861d6041 - correct
Events : 38372
Layout : left-symmetric
Chunk Size : 16K
Device Role : Active device 2
Array State : AAAAA ('A' == active, '.' == missing, 'R' == replacing)
c2RAID6 ~ #
c2RAID6 ~ # mdadm -E /dev/sdf3
/dev/sdf3:
Magic : a92b4efc
Version : 1.2
Feature Map : 0x0
Array UUID : de47f991:86d98467:0637635b:9c6d0591
Name : c2stable:3
Creation Time : Thu Dec 30 17:40:50 2010
Raid Level : raid6
Raid Devices : 5
Avail Dev Size : 968176345 (461.66 GiB 495.71 GB)
Array Size : 1452264480 (1384.99 GiB 1487.12 GB)
Used Dev Size : 968176320 (461.66 GiB 495.71 GB)
Data Offset : 2048 sectors
Super Offset : 8 sectors
Unused Space : before=1968 sectors, after=25 sectors
State : active
Device UUID : 8a361a4a:ac34f9cc:cee70fc3:5a8bed2f
Update Time : Tue Mar 31 07:26:02 2015
Checksum : 87efa300 - correct
Events : 38372
Layout : left-symmetric
Chunk Size : 16K
Device Role : Active device 3
Array State : AAAAA ('A' == active, '.' == missing, 'R' == replacing)
c2RAID6 ~ #
c2RAID6 ~ # mdadm -E /dev/sdg3
/dev/sdg3:
Magic : a92b4efc
Version : 1.2
Feature Map : 0x0
Array UUID : de47f991:86d98467:0637635b:9c6d0591
Name : c2stable:3
Creation Time : Thu Dec 30 17:40:50 2010
Raid Level : raid6
Raid Devices : 5
Avail Dev Size : 968176320 (461.66 GiB 495.71 GB)
Array Size : 1452264480 (1384.99 GiB 1487.12 GB)
Data Offset : 1392 sectors
Super Offset : 8 sectors
Unused Space : before=1312 sectors, after=0 sectors
State : active
Device UUID : c56ca951:928864db:c2746525:37716872
Update Time : Tue Mar 31 07:26:02 2015
Checksum : 42ba9d48 - correct
Events : 38372
Layout : left-symmetric
Chunk Size : 16K
Device Role : Active device 4
Array State : AAAAA ('A' == active, '.' == missing, 'R' == replacing)
c2RAID6 ~ #
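As a quick consistency check on the five `mdadm -E` dumps above, the per-member event counters can be extracted and compared; here all five read 38372. A minimal sketch, assuming the text layout shown above; `member_events` is a hypothetical helper name, not part of mdadm.

```shell
# Print "<device> <events>" for each member found in concatenated
# `mdadm -E` output read from stdin.
member_events() {
    awk '
        /^\/dev\/.*:$/ { dev = $1; sub(/:$/, "", dev) }   # "/dev/sdc3:" header line
        $1 == "Events" && $2 == ":" { print dev, $3 }
    '
}
```

For example, `mdadm -E /dev/sd[c-g]3 | member_events | awk '{print $2}' | sort -u | wc -l` prints 1 when every member agrees on the event count.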
c2RAID6 ~ # smartctl --all /dev/sdc
smartctl 6.3 2014-07-26 r3976 [x86_64-linux-3.18.10-gentoo] (local build)
Copyright (C) 2002-14, Bruce Allen, Christian Franke, www.smartmontools.org
=== START OF INFORMATION SECTION ===
Model Family: Western Digital RE3 Serial ATA
Device Model: WDC WD5002ABYS-02B1B0
Serial Number: WD-WCASYA846988
LU WWN Device Id: 5 0014ee 2042c3477
Firmware Version: 02.03B03
User Capacity: 500,107,862,016 bytes [500 GB]
Sector Size: 512 bytes logical/physical
Rotation Rate: 7200 rpm
Device is: In smartctl database [for details use: -P show]
ATA Version is: ATA8-ACS (minor revision not indicated)
SATA Version is: SATA 2.5, 3.0 Gb/s
Local Time is: Tue Mar 31 07:30:21 2015 PDT
SMART support is: Available - device has SMART capability.
SMART support is: Enabled
=== START OF READ SMART DATA SECTION ===
SMART overall-health self-assessment test result: PASSED
General SMART Values:
Offline data collection status: (0x84) Offline data collection activity
was suspended by an
interrupting command from host.
Auto Offline Data Collection: Enabled.
Self-test execution status: ( 0) The previous self-test routine completed
without error or no self-test has ever
been run.
Total time to complete Offline
data collection: ( 9480) seconds.
Offline data collection
capabilities: (0x7b) SMART execute Offline immediate.
Auto Offline data collection
on/off support.
Suspend Offline collection upon new
command.
Offline surface scan supported.
Self-test supported.
Conveyance Self-test supported.
Selective Self-test supported.
SMART capabilities: (0x0003) Saves SMART data before entering
power-saving mode.
Supports SMART auto save timer.
Error logging capability: (0x01) Error logging supported.
General Purpose Logging supported.
Short self-test routine
recommended polling time: ( 2) minutes.
Extended self-test routine
recommended polling time: ( 112) minutes.
Conveyance self-test routine
recommended polling time: ( 5) minutes.
SCT capabilities: (0x303f) SCT Status supported.
SCT Error Recovery Control supported.
SCT Feature Control supported.
SCT Data Table supported.
SMART Attributes Data Structure revision number: 16
Vendor Specific SMART Attributes with Thresholds:
ID# ATTRIBUTE_NAME          FLAG     VALUE WORST THRESH TYPE      UPDATED  WHEN_FAILED RAW_VALUE
  1 Raw_Read_Error_Rate     0x002f   200   200   051    Pre-fail  Always       -       0
  3 Spin_Up_Time            0x0027   239   229   021    Pre-fail  Always       -       1033
  4 Start_Stop_Count        0x0032   098   098   000    Old_age   Always       -       2477
  5 Reallocated_Sector_Ct   0x0033   200   200   140    Pre-fail  Always       -       0
  7 Seek_Error_Rate         0x002e   200   200   000    Old_age   Always       -       0
  9 Power_On_Hours          0x0032   075   075   000    Old_age   Always       -       18331
 10 Spin_Retry_Count        0x0032   100   100   000    Old_age   Always       -       0
 11 Calibration_Retry_Count 0x0032   100   100   000    Old_age   Always       -       0
 12 Power_Cycle_Count       0x0032   098   098   000    Old_age   Always       -       2474
192 Power-Off_Retract_Count 0x0032   200   200   000    Old_age   Always       -       48
193 Load_Cycle_Count        0x0032   200   200   000    Old_age   Always       -       2428
194 Temperature_Celsius     0x0022   092   080   000    Old_age   Always       -       55
196 Reallocated_Event_Count 0x0032   200   200   000    Old_age   Always       -       0
197 Current_Pending_Sector  0x0032   200   200   000    Old_age   Always       -       0
198 Offline_Uncorrectable   0x0030   200   200   000    Old_age   Offline      -       0
199 UDMA_CRC_Error_Count    0x0032   200   200   000    Old_age   Always       -       0
200 Multi_Zone_Error_Rate   0x0008   200   200   000    Old_age   Offline      -       0
SMART Error Log Version: 1
No Errors Logged
SMART Self-test log structure revision number 1
Num Test_Description Status Remaining LifeTime(hours) LBA_of_first_error
# 1 Extended offline Completed without error 00% 18262 -
# 2 Extended offline Completed without error 00% 18180 -
# 3 Extended offline Completed without error 00% 18095 -
# 4 Extended offline Completed without error 00% 18011 -
# 5 Extended offline Completed without error 00% 17930 -
# 6 Extended offline Completed without error 00% 17848 -
# 7 Extended offline Completed without error 00% 17682 -
# 8 Extended offline Completed without error 00% 17592 -
# 9 Extended offline Completed without error 00% 17505 -
#10 Extended offline Completed without error 00% 17423 -
#11 Extended offline Completed without error 00% 17340 -
#12 Extended offline Completed without error 00% 17260 -
#13 Extended offline Completed without error 00% 17173 -
#14 Extended offline Completed without error 00% 17128 -
#15 Extended offline Completed without error 00% 17032 -
#16 Extended offline Completed without error 00% 16942 -
#17 Extended offline Completed without error 00% 16865 -
#18 Extended offline Completed without error 00% 16657 -
#19 Extended offline Completed without error 00% 16581 -
#20 Extended offline Completed without error 00% 16489 -
#21 Extended offline Interrupted (host reset) 30% 16397 -
SMART Selective self-test log data structure revision number 1
SPAN MIN_LBA MAX_LBA CURRENT_TEST_STATUS
1 0 0 Not_testing
2 0 0 Not_testing
3 0 0 Not_testing
4 0 0 Not_testing
5 0 0 Not_testing
Selective self-test flags (0x0):
After scanning selected spans, do NOT read-scan remainder of disk.
If Selective self-test is pending on power-up, resume after 0 minute delay.
c2RAID6 ~ #
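To avoid eyeballing every attribute table, the rows most often watched for disk trouble can be filtered for non-zero raw values. This is a sketch that assumes smartctl's usual one-row-per-attribute layout (not the mail-wrapped form); the choice of IDs 1, 5, 197, 198 and 199 is an assumption, raw-value meaning is vendor-specific, and `smart_watchlist` is a hypothetical helper name.

```shell
# From `smartctl -A` or `smartctl --all` text on stdin, print
# "<attribute> <raw value>" for watched attributes whose raw value is non-zero.
smart_watchlist() {
    awk '
        NF >= 10 && $1 ~ /^(1|5|197|198|199)$/ && $NF + 0 > 0 {
            print $2, $NF
        }
    '
}
```

It would be run as `smartctl -A /dev/sdc | smart_watchlist`. For the five drives in this mail, only /dev/sdf's Raw_Read_Error_Rate (raw value 1) would be listed.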
c2RAID6 ~ # smartctl --all /dev/sdd
smartctl 6.3 2014-07-26 r3976 [x86_64-linux-3.18.10-gentoo] (local build)
Copyright (C) 2002-14, Bruce Allen, Christian Franke, www.smartmontools.org
=== START OF INFORMATION SECTION ===
Model Family: Western Digital RE3 Serial ATA
Device Model: WDC WD5002ABYS-02B1B0
Serial Number: WD-WCASYA811977
LU WWN Device Id: 5 0014ee 2042c29e7
Firmware Version: 02.03B03
User Capacity: 500,107,862,016 bytes [500 GB]
Sector Size: 512 bytes logical/physical
Rotation Rate: 7200 rpm
Device is: In smartctl database [for details use: -P show]
ATA Version is: ATA8-ACS (minor revision not indicated)
SATA Version is: SATA 2.5, 3.0 Gb/s
Local Time is: Tue Mar 31 07:30:50 2015 PDT
SMART support is: Available - device has SMART capability.
SMART support is: Enabled
=== START OF READ SMART DATA SECTION ===
SMART overall-health self-assessment test result: PASSED
General SMART Values:
Offline data collection status: (0x84) Offline data collection activity
was suspended by an
interrupting command from host.
Auto Offline Data Collection: Enabled.
Self-test execution status: ( 0) The previous self-test routine completed
without error or no self-test has ever
been run.
Total time to complete Offline
data collection: ( 9480) seconds.
Offline data collection
capabilities: (0x7b) SMART execute Offline immediate.
Auto Offline data collection
on/off support.
Suspend Offline collection upon new
command.
Offline surface scan supported.
Self-test supported.
Conveyance Self-test supported.
Selective Self-test supported.
SMART capabilities: (0x0003) Saves SMART data before entering
power-saving mode.
Supports SMART auto save timer.
Error logging capability: (0x01) Error logging supported.
General Purpose Logging supported.
Short self-test routine
recommended polling time: ( 2) minutes.
Extended self-test routine
recommended polling time: ( 112) minutes.
Conveyance self-test routine
recommended polling time: ( 5) minutes.
SCT capabilities: (0x303f) SCT Status supported.
SCT Error Recovery Control supported.
SCT Feature Control supported.
SCT Data Table supported.
SMART Attributes Data Structure revision number: 16
Vendor Specific SMART Attributes with Thresholds:
ID# ATTRIBUTE_NAME          FLAG     VALUE WORST THRESH TYPE      UPDATED  WHEN_FAILED RAW_VALUE
  1 Raw_Read_Error_Rate     0x002f   200   200   051    Pre-fail  Always       -       0
  3 Spin_Up_Time            0x0027   239   228   021    Pre-fail  Always       -       1033
  4 Start_Stop_Count        0x0032   098   098   000    Old_age   Always       -       2476
  5 Reallocated_Sector_Ct   0x0033   200   200   140    Pre-fail  Always       -       0
  7 Seek_Error_Rate         0x002e   200   200   000    Old_age   Always       -       0
  9 Power_On_Hours          0x0032   075   075   000    Old_age   Always       -       18330
 10 Spin_Retry_Count        0x0032   100   100   000    Old_age   Always       -       0
 11 Calibration_Retry_Count 0x0032   100   100   000    Old_age   Always       -       0
 12 Power_Cycle_Count       0x0032   098   098   000    Old_age   Always       -       2473
192 Power-Off_Retract_Count 0x0032   200   200   000    Old_age   Always       -       48
193 Load_Cycle_Count        0x0032   200   200   000    Old_age   Always       -       2427
194 Temperature_Celsius     0x0022   093   080   000    Old_age   Always       -       54
196 Reallocated_Event_Count 0x0032   200   200   000    Old_age   Always       -       0
197 Current_Pending_Sector  0x0032   200   200   000    Old_age   Always       -       0
198 Offline_Uncorrectable   0x0030   200   200   000    Old_age   Offline      -       0
199 UDMA_CRC_Error_Count    0x0032   200   200   000    Old_age   Always       -       0
200 Multi_Zone_Error_Rate   0x0008   200   200   000    Old_age   Offline      -       0
SMART Error Log Version: 1
No Errors Logged
SMART Self-test log structure revision number 1
Num Test_Description Status Remaining LifeTime(hours) LBA_of_first_error
# 1 Extended offline Completed without error 00% 18272 -
# 2 Extended offline Completed without error 00% 18195 -
# 3 Extended offline Completed without error 00% 18109 -
# 4 Extended offline Completed without error 00% 18023 -
# 5 Extended offline Completed without error 00% 17940 -
# 6 Extended offline Completed without error 00% 17859 -
# 7 Extended offline Completed without error 00% 17779 -
# 8 Extended offline Completed without error 00% 17692 -
# 9 Extended offline Completed without error 00% 17606 -
#10 Extended offline Interrupted (host reset) 30% 17516 -
#11 Extended offline Completed without error 00% 17437 -
#12 Extended offline Completed without error 00% 17352 -
#13 Extended offline Completed without error 00% 17183 -
#14 Extended offline Completed without error 00% 17137 -
#15 Extended offline Completed without error 00% 17045 -
#16 Extended offline Completed without error 00% 16956 -
#17 Extended offline Completed without error 00% 16876 -
#18 Extended offline Completed without error 00% 16794 -
#19 Extended offline Completed without error 00% 16668 -
#20 Extended offline Completed without error 00% 16592 -
#21 Extended offline Completed without error 00% 16501 -
SMART Selective self-test log data structure revision number 1
SPAN MIN_LBA MAX_LBA CURRENT_TEST_STATUS
1 0 0 Not_testing
2 0 0 Not_testing
3 0 0 Not_testing
4 0 0 Not_testing
5 0 0 Not_testing
Selective self-test flags (0x0):
After scanning selected spans, do NOT read-scan remainder of disk.
If Selective self-test is pending on power-up, resume after 0 minute delay.
c2RAID6 ~ #
c2RAID6 ~ # smartctl --all /dev/sde
smartctl 6.3 2014-07-26 r3976 [x86_64-linux-3.18.10-gentoo] (local build)
Copyright (C) 2002-14, Bruce Allen, Christian Franke, www.smartmontools.org
=== START OF INFORMATION SECTION ===
Model Family: Western Digital RE3 Serial ATA
Device Model: WDC WD5002ABYS-02B1B0
Serial Number: WD-WCASYA814051
LU WWN Device Id: 5 0014ee 259812251
Firmware Version: 02.03B03
User Capacity: 500,107,862,016 bytes [500 GB]
Sector Size: 512 bytes logical/physical
Rotation Rate: 7200 rpm
Device is: In smartctl database [for details use: -P show]
ATA Version is: ATA8-ACS (minor revision not indicated)
SATA Version is: SATA 2.5, 3.0 Gb/s
Local Time is: Tue Mar 31 07:31:19 2015 PDT
SMART support is: Available - device has SMART capability.
SMART support is: Enabled
=== START OF READ SMART DATA SECTION ===
SMART overall-health self-assessment test result: PASSED
General SMART Values:
Offline data collection status: (0x84) Offline data collection activity
was suspended by an
interrupting command from host.
Auto Offline Data Collection: Enabled.
Self-test execution status: ( 0) The previous self-test routine completed
without error or no self-test has ever
been run.
Total time to complete Offline
data collection: ( 9480) seconds.
Offline data collection
capabilities: (0x7b) SMART execute Offline immediate.
Auto Offline data collection
on/off support.
Suspend Offline collection upon new
command.
Offline surface scan supported.
Self-test supported.
Conveyance Self-test supported.
Selective Self-test supported.
SMART capabilities: (0x0003) Saves SMART data before entering
power-saving mode.
Supports SMART auto save timer.
Error logging capability: (0x01) Error logging supported.
General Purpose Logging supported.
Short self-test routine
recommended polling time: ( 2) minutes.
Extended self-test routine
recommended polling time: ( 112) minutes.
Conveyance self-test routine
recommended polling time: ( 5) minutes.
SCT capabilities: (0x303f) SCT Status supported.
SCT Error Recovery Control supported.
SCT Feature Control supported.
SCT Data Table supported.
SMART Attributes Data Structure revision number: 16
Vendor Specific SMART Attributes with Thresholds:
ID# ATTRIBUTE_NAME          FLAG     VALUE WORST THRESH TYPE      UPDATED  WHEN_FAILED RAW_VALUE
  1 Raw_Read_Error_Rate     0x002f   200   200   051    Pre-fail  Always       -       0
  3 Spin_Up_Time            0x0027   239   229   021    Pre-fail  Always       -       1050
  4 Start_Stop_Count        0x0032   098   098   000    Old_age   Always       -       2476
  5 Reallocated_Sector_Ct   0x0033   200   200   140    Pre-fail  Always       -       0
  7 Seek_Error_Rate         0x002e   200   200   000    Old_age   Always       -       0
  9 Power_On_Hours          0x0032   075   075   000    Old_age   Always       -       18329
 10 Spin_Retry_Count        0x0032   100   100   000    Old_age   Always       -       0
 11 Calibration_Retry_Count 0x0032   100   100   000    Old_age   Always       -       0
 12 Power_Cycle_Count       0x0032   098   098   000    Old_age   Always       -       2473
192 Power-Off_Retract_Count 0x0032   200   200   000    Old_age   Always       -       47
193 Load_Cycle_Count        0x0032   200   200   000    Old_age   Always       -       2428
194 Temperature_Celsius     0x0022   095   084   000    Old_age   Always       -       52
196 Reallocated_Event_Count 0x0032   200   200   000    Old_age   Always       -       0
197 Current_Pending_Sector  0x0032   200   200   000    Old_age   Always       -       0
198 Offline_Uncorrectable   0x0030   200   200   000    Old_age   Offline      -       0
199 UDMA_CRC_Error_Count    0x0032   200   200   000    Old_age   Always       -       0
200 Multi_Zone_Error_Rate   0x0008   200   200   000    Old_age   Offline      -       0
SMART Error Log Version: 1
No Errors Logged
SMART Self-test log structure revision number 1
Num Test_Description Status Remaining LifeTime(hours) LBA_of_first_error
# 1 Extended offline Completed without error 00% 18286 -
# 2 Extended offline Completed without error 00% 18206 -
# 3 Extended offline Completed without error 00% 18119 -
# 4 Extended offline Completed without error 00% 18034 -
# 5 Extended offline Completed without error 00% 17953 -
# 6 Extended offline Completed without error 00% 17871 -
# 7 Extended offline Completed without error 00% 17790 -
# 8 Extended offline Completed without error 00% 17706 -
# 9 Extended offline Completed without error 00% 17617 -
#10 Extended offline Completed without error 00% 17446 -
#11 Extended offline Completed without error 00% 17362 -
#12 Extended offline Completed without error 00% 17279 -
#13 Extended offline Completed without error 00% 17195 -
#14 Extended offline Completed without error 00% 17061 -
#15 Extended offline Completed without error 00% 16968 -
#16 Extended offline Completed without error 00% 16885 -
#17 Extended offline Completed without error 00% 16678 -
#18 Extended offline Completed without error 00% 16512 -
#19 Extended offline Completed without error 00% 16423 -
#20 Extended offline Interrupted (host reset) 40% 16342 -
#21 Extended offline Completed without error 00% 16265 -
SMART Selective self-test log data structure revision number 1
SPAN MIN_LBA MAX_LBA CURRENT_TEST_STATUS
1 0 0 Not_testing
2 0 0 Not_testing
3 0 0 Not_testing
4 0 0 Not_testing
5 0 0 Not_testing
Selective self-test flags (0x0):
After scanning selected spans, do NOT read-scan remainder of disk.
If Selective self-test is pending on power-up, resume after 0 minute delay.
c2RAID6 ~ #
c2RAID6 ~ # smartctl --all /dev/sdf
smartctl 6.3 2014-07-26 r3976 [x86_64-linux-3.18.10-gentoo] (local build)
Copyright (C) 2002-14, Bruce Allen, Christian Franke, www.smartmontools.org
=== START OF INFORMATION SECTION ===
Model Family: Western Digital RE3 Serial ATA
Device Model: WDC WD5002ABYS-02B1B0
Serial Number: WD-WCASYA837674
LU WWN Device Id: 5 0014ee 2042bc993
Firmware Version: 02.03B03
User Capacity: 500,107,862,016 bytes [500 GB]
Sector Size: 512 bytes logical/physical
Rotation Rate: 7200 rpm
Device is: In smartctl database [for details use: -P show]
ATA Version is: ATA8-ACS (minor revision not indicated)
SATA Version is: SATA 2.5, 3.0 Gb/s
Local Time is: Tue Mar 31 07:31:53 2015 PDT
SMART support is: Available - device has SMART capability.
SMART support is: Enabled
=== START OF READ SMART DATA SECTION ===
SMART overall-health self-assessment test result: PASSED
General SMART Values:
Offline data collection status: (0x84) Offline data collection activity
was suspended by an
interrupting command from host.
Auto Offline Data Collection: Enabled.
Self-test execution status: ( 0) The previous self-test routine completed
without error or no self-test has ever
been run.
Total time to complete Offline
data collection: ( 9480) seconds.
Offline data collection
capabilities: (0x7b) SMART execute Offline immediate.
Auto Offline data collection
on/off support.
Suspend Offline collection upon new
command.
Offline surface scan supported.
Self-test supported.
Conveyance Self-test supported.
Selective Self-test supported.
SMART capabilities: (0x0003) Saves SMART data before entering
power-saving mode.
Supports SMART auto save timer.
Error logging capability: (0x01) Error logging supported.
General Purpose Logging supported.
Short self-test routine
recommended polling time: ( 2) minutes.
Extended self-test routine
recommended polling time: ( 112) minutes.
Conveyance self-test routine
recommended polling time: ( 5) minutes.
SCT capabilities: (0x303f) SCT Status supported.
SCT Error Recovery Control supported.
SCT Feature Control supported.
SCT Data Table supported.
SMART Attributes Data Structure revision number: 16
Vendor Specific SMART Attributes with Thresholds:
ID# ATTRIBUTE_NAME          FLAG     VALUE WORST THRESH TYPE      UPDATED  WHEN_FAILED RAW_VALUE
  1 Raw_Read_Error_Rate     0x002f   200   200   051    Pre-fail  Always       -       1
  3 Spin_Up_Time            0x0027   239   229   021    Pre-fail  Always       -       1033
  4 Start_Stop_Count        0x0032   098   098   000    Old_age   Always       -       2465
  5 Reallocated_Sector_Ct   0x0033   200   200   140    Pre-fail  Always       -       0
  7 Seek_Error_Rate         0x002e   200   200   000    Old_age   Always       -       0
  9 Power_On_Hours          0x0032   076   076   000    Old_age   Always       -       18242
 10 Spin_Retry_Count        0x0032   100   100   000    Old_age   Always       -       0
 11 Calibration_Retry_Count 0x0032   100   100   000    Old_age   Always       -       0
 12 Power_Cycle_Count       0x0032   098   098   000    Old_age   Always       -       2461
192 Power-Off_Retract_Count 0x0032   200   200   000    Old_age   Always       -       62
193 Load_Cycle_Count        0x0032   200   200   000    Old_age   Always       -       2402
194 Temperature_Celsius     0x0022   094   083   000    Old_age   Always       -       53
196 Reallocated_Event_Count 0x0032   200   200   000    Old_age   Always       -       0
197 Current_Pending_Sector  0x0032   200   200   000    Old_age   Always       -       0
198 Offline_Uncorrectable   0x0030   200   200   000    Old_age   Offline      -       0
199 UDMA_CRC_Error_Count    0x0032   200   200   000    Old_age   Always       -       0
200 Multi_Zone_Error_Rate   0x0008   200   200   000    Old_age   Offline      -       0
SMART Error Log Version: 1
No Errors Logged
SMART Self-test log structure revision number 1
Num Test_Description Status Remaining LifeTime(hours) LBA_of_first_error
# 1 Extended offline Interrupted (host reset) 10% 18135 -
# 2 Extended offline Completed without error 00% 18046 -
# 3 Extended offline Completed without error 00% 17962 -
# 4 Extended offline Completed without error 00% 17876 -
# 5 Extended offline Completed without error 00% 17719 -
# 6 Extended offline Completed without error 00% 17632 -
# 7 Extended offline Completed without error 00% 17541 -
# 8 Extended offline Completed without error 00% 17456 -
# 9 Extended offline Completed without error 00% 17372 -
#10 Extended offline Completed without error 00% 17291 -
#11 Extended offline Completed without error 00% 17204 -
#12 Extended offline Completed without error 00% 17124 -
#13 Extended offline Completed without error 00% 16991 -
#14 Extended offline Completed without error 00% 16896 -
#15 Extended offline Completed without error 00% 16808 -
#16 Extended offline Completed without error 00% 16727 -
#17 Extended offline Completed without error 00% 16646 -
#18 Extended offline Completed without error 00% 16608 -
#19 Extended offline Completed without error 00% 16526 -
#20 Extended offline Completed without error 00% 16440 -
#21 Extended offline Completed without error 00% 16350 -
SMART Selective self-test log data structure revision number 1
SPAN MIN_LBA MAX_LBA CURRENT_TEST_STATUS
1 0 0 Not_testing
2 0 0 Not_testing
3 0 0 Not_testing
4 0 0 Not_testing
5 0 0 Not_testing
Selective self-test flags (0x0):
After scanning selected spans, do NOT read-scan remainder of disk.
If Selective self-test is pending on power-up, resume after 0 minute delay.
c2RAID6 ~ #
c2RAID6 ~ # smartctl --all /dev/sdg
smartctl 6.3 2014-07-26 r3976 [x86_64-linux-3.18.10-gentoo] (local build)
Copyright (C) 2002-14, Bruce Allen, Christian Franke, www.smartmontools.org
=== START OF INFORMATION SECTION ===
Model Family: Western Digital RE3 Serial ATA
Device Model: WDC WD5002ABYS-02B1B0
Serial Number: WD-WCASYA831311
LU WWN Device Id: 5 0014ee 259814854
Firmware Version: 02.03B03
User Capacity: 500,107,862,016 bytes [500 GB]
Sector Size: 512 bytes logical/physical
Rotation Rate: 7200 rpm
Device is: In smartctl database [for details use: -P show]
ATA Version is: ATA8-ACS (minor revision not indicated)
SATA Version is: SATA 2.5, 3.0 Gb/s
Local Time is: Tue Mar 31 07:32:25 2015 PDT
SMART support is: Available - device has SMART capability.
SMART support is: Enabled
=== START OF READ SMART DATA SECTION ===
SMART overall-health self-assessment test result: PASSED
General SMART Values:
Offline data collection status: (0x84) Offline data collection activity
was suspended by an
interrupting command from host.
Auto Offline Data Collection: Enabled.
Self-test execution status: ( 0) The previous self-test routine completed
without error or no self-test has ever
been run.
Total time to complete Offline
data collection: ( 9480) seconds.
Offline data collection
capabilities: (0x7b) SMART execute Offline immediate.
Auto Offline data collection
on/off support.
Suspend Offline collection upon new
command.
Offline surface scan supported.
Self-test supported.
Conveyance Self-test supported.
Selective Self-test supported.
SMART capabilities: (0x0003) Saves SMART data before entering
power-saving mode.
Supports SMART auto save timer.
Error logging capability: (0x01) Error logging supported.
General Purpose Logging supported.
Short self-test routine
recommended polling time: ( 2) minutes.
Extended self-test routine
recommended polling time: ( 112) minutes.
Conveyance self-test routine
recommended polling time: ( 5) minutes.
SCT capabilities: (0x303f) SCT Status supported.
SCT Error Recovery Control supported.
SCT Feature Control supported.
SCT Data Table supported.
SMART Attributes Data Structure revision number: 16
Vendor Specific SMART Attributes with Thresholds:
ID# ATTRIBUTE_NAME          FLAG     VALUE WORST THRESH TYPE      UPDATED  WHEN_FAILED RAW_VALUE
  1 Raw_Read_Error_Rate     0x002f   200   200   051    Pre-fail  Always       -       0
  3 Spin_Up_Time            0x0027   239   229   021    Pre-fail  Always       -       1050
  4 Start_Stop_Count        0x0032   098   098   000    Old_age   Always       -       2462
  5 Reallocated_Sector_Ct   0x0033   200   200   140    Pre-fail  Always       -       0
  7 Seek_Error_Rate         0x002e   200   200   000    Old_age   Always       -       0
  9 Power_On_Hours          0x0032   076   076   000    Old_age   Always       -       18241
 10 Spin_Retry_Count        0x0032   100   100   000    Old_age   Always       -       0
 11 Calibration_Retry_Count 0x0032   100   100   000    Old_age   Always       -       0
 12 Power_Cycle_Count       0x0032   098   098   000    Old_age   Always       -       2458
192 Power-Off_Retract_Count 0x0032   200   200   000    Old_age   Always       -       49
193 Load_Cycle_Count        0x0032   200   200   000    Old_age   Always       -       2412
194 Temperature_Celsius     0x0022   093   084   000    Old_age   Always       -       54
196 Reallocated_Event_Count 0x0032   200   200   000    Old_age   Always       -       0
197 Current_Pending_Sector  0x0032   200   200   000    Old_age   Always       -       0
198 Offline_Uncorrectable   0x0030   200   200   000    Old_age   Offline      -       0
199 UDMA_CRC_Error_Count    0x0032   200   200   000    Old_age   Always       -       0
200 Multi_Zone_Error_Rate   0x0008   200   200   000    Old_age   Offline      -       0
SMART Error Log Version: 1
No Errors Logged
SMART Self-test log structure revision number 1
Num Test_Description Status Remaining LifeTime(hours) LBA_of_first_error
# 1 Extended offline Completed without error 00% 18237 -
# 2 Extended offline Completed without error 00% 18160 -
# 3 Extended offline Completed without error 00% 18079 -
# 4 Extended offline Completed without error 00% 17993 -
# 5 Extended offline Completed without error 00% 17908 -
# 6 Extended offline Completed without error 00% 17828 -
# 7 Extended offline Completed without error 00% 17747 -
# 8 Extended offline Completed without error 00% 17667 -
# 9 Extended offline Completed without error 00% 17578 -
#10 Extended offline Completed without error 00% 17491 -
#11 Extended offline Completed without error 00% 17404 -
#12 Extended offline Completed without error 00% 17240 -
#13 Extended offline Completed without error 00% 17157 -
#14 Extended offline Completed without error 00% 17071 -
#15 Extended offline Completed without error 00% 17026 -
#16 Extended offline Completed without error 00% 16931 -
#17 Extended offline Completed without error 00% 16764 -
#18 Extended offline Completed without error 00% 16682 -
#19 Extended offline Completed without error 00% 16555 -
#20 Extended offline Interrupted (host reset) 40% 16475 -
#21 Extended offline Completed without error 00% 16388 -
SMART Selective self-test log data structure revision number 1
SPAN MIN_LBA MAX_LBA CURRENT_TEST_STATUS
1 0 0 Not_testing
2 0 0 Not_testing
3 0 0 Not_testing
4 0 0 Not_testing
5 0 0 Not_testing
Selective self-test flags (0x0):
After scanning selected spans, do NOT read-scan remainder of disk.
If Selective self-test is pending on power-up, resume after 0 minute delay.
c2RAID6 ~ #
c2RAID6 ~ # /home/mark/lsdrv
PCI [ahci] 03:00.0 SATA controller: Marvell Technology Group Ltd.
88SE9123 PCIe SATA 6.0 Gb/s controller (rev 11)
├─scsi 0:0:0:0 ATA WDC WD30EFRX-68E {WD-WCC4N0990937}
│  └─sda: [8:0] Partitioned (gpt) 2.73t
│     ├─sda1: [8:1] (ext2) 200.00m {c0e7f8e7-0871-4b07-80da-ed0fae480491}
│     ├─sda2: [8:2] (swap) 6.00g {9f9d2c4a-6a47-481f-a5be-06b0bb8b0dca}
│     ├─sda3: [8:3] MD raid1 (0/2) 500.00g md4 clean in_sync 'c2RAID6:4' {c3e562a4-6871-679e-2d83-518efefd237f}
│     │  └─md4: [9:4] (ext4) 499.87g 'RAID1root' {08d7f067-ec60-4d80-ab28-39b3eff5896b}
│     └─sda10: [8:10] MD raid1 (0/2) 1.33t md10 clean in_sync 'c2RAID6:videoBack' {d9624704-b29e-1a8e-148b-47becabad736}
│        └─md10: [9:10] (ext4) 1.33t 'VideoLib' {3b25ecce-99f2-4e60-87fa-3c69f3a8331e}
├─scsi 1:0:0:0 ATA WDC WD30EFRX-68E {WD-WCC4N0959002}
│  └─sdb: [8:16] Partitioned (gpt) 2.73t
│     ├─sdb1: [8:17] Empty/Unknown 200.00m
│     ├─sdb2: [8:18] (swap) 6.00g {307ecb02-80cf-4ff3-835b-a7b70a463282}
│     ├─sdb3: [8:19] MD raid1 (1/2) 500.00g md4 clean in_sync 'c2RAID6:4' {c3e562a4-6871-679e-2d83-518efefd237f}
│     └─sdb10: [8:26] MD raid1 (1/2) 1.33t md10 clean in_sync 'c2RAID6:videoBack' {d9624704-b29e-1a8e-148b-47becabad736}
├─scsi 2:x:x:x [Empty]
├─scsi 3:x:x:x [Empty]
├─scsi 4:x:x:x [Empty]
├─scsi 5:x:x:x [Empty]
└─scsi 6:x:x:x [Empty]
PCI [ahci] 06:00.0 SATA controller: JMicron Technology Corp. JMB363
SATA/IDE Controller (rev 03)
├─scsi 8:x:x:x [Empty]
└─scsi 9:x:x:x [Empty]
PCI [ata_piix] 00:1f.2 IDE interface: Intel Corporation 82801JI (ICH10
Family) 4 port SATA IDE Controller #1
├─scsi 10:0:0:0 Optiarc DVD RW AD-7241S {Optiarc_DVD_RW_AD-7241S}
│  └─sr0: [11:0] Empty/Unknown 1.00g
├─scsi 10:0:1:0 ATA WDC WD5002ABYS-0 {WD-WCASYA846988}
│  └─sdc: [8:32] Partitioned (dos) 465.76g
│     ├─sdc1: [8:33] (ext2) 54.88m 'boot' {4efd04f2-839f-4e11-a32d-8fa25129b541}
│     ├─sdc2: [8:34] (swap) 4.01g 'swapA' {ed41217e-abf4-4c60-a985-c0420eb054f3}
│     └─sdc3: [8:35] MD raid6 (0/5) 461.66g md3 active in_sync 'c2stable:3' {de47f991-86d9-8467-0637-635b9c6d0591}
│        └─md3: [9:3] (ext4) 1.35t 'RAID6root' {80c3ebf0-d92f-415f-a6d7-1cdb1ceab593}
│           └─Mounted as /dev/md3 @ /
├─scsi 11:0:0:0 ATA WDC WD5002ABYS-0 {WD-WCASYA811977}
│  └─sdd: [8:48] Partitioned (dos) 465.76g
│     ├─sdd1: [8:49] (ext2) 54.88m {b9dd1fb9-5c7b-459c-a7fc-ca8ebe032fcc}
│     ├─sdd2: [8:50] (swap) 4.01g 'swapB' {2ea74bbb-8c1d-4a32-939d-512db4e40a54}
│     └─sdd3: [8:51] MD raid6 (1/5) 461.66g md3 active in_sync 'c2stable:3' {de47f991-86d9-8467-0637-635b9c6d0591}
└─scsi 11:0:1:0 ATA WDC WD5002ABYS-0 {WD-WCASYA814051}
   └─sde: [8:64] Partitioned (dos) 465.76g
      ├─sde1: [8:65] (ext2) 54.88m {27f276fa-6f62-4769-b9d7-deb511dfc60a}
      ├─sde2: [8:66] (swap) 4.01g 'swapC' {0c6a1bd0-291e-4157-9b6a-67f7801c7a5e}
      └─sde3: [8:67] MD raid6 (2/5) 461.66g md3 active in_sync 'c2stable:3' {de47f991-86d9-8467-0637-635b9c6d0591}
PCI [ata_piix] 00:1f.5 IDE interface: Intel Corporation 82801JI (ICH10
Family) 2 port SATA IDE Controller #2
├─scsi 12:0:0:0 ATA WDC WD5002ABYS-0 {WD-WCASYA837674}
│  └─sdf: [8:80] Partitioned (dos) 465.76g
│     ├─sdf1: [8:81] Empty/Unknown 4.10g
│     └─sdf3: [8:83] MD raid6 (3/5) 461.66g md3 active in_sync 'c2stable:3' {de47f991-86d9-8467-0637-635b9c6d0591}
└─scsi 13:0:0:0 ATA WDC WD5002ABYS-0 {WD-WCASYA831311}
   └─sdg: [8:96] Partitioned (dos) 465.76g
      ├─sdg1: [8:97] (ext2) 4.10g {e7b2dbb0-df11-4bd4-902c-da2221b53e88}
      └─sdg3: [8:99] MD raid6 (4/5) 461.66g md3 active in_sync 'c2stable:3' {de47f991-86d9-8467-0637-635b9c6d0591}
Other Block Devices
├─ram0: [1:0] Empty/Unknown 16.00m
├─ram1: [1:1] Empty/Unknown 16.00m
├─ram2: [1:2] Empty/Unknown 16.00m
├─ram3: [1:3] Empty/Unknown 16.00m
├─ram4: [1:4] Empty/Unknown 16.00m
├─ram5: [1:5] Empty/Unknown 16.00m
├─ram6: [1:6] Empty/Unknown 16.00m
├─ram7: [1:7] Empty/Unknown 16.00m
├─ram8: [1:8] Empty/Unknown 16.00m
├─ram9: [1:9] Empty/Unknown 16.00m
├─ram10: [1:10] Empty/Unknown 16.00m
├─ram11: [1:11] Empty/Unknown 16.00m
├─ram12: [1:12] Empty/Unknown 16.00m
├─ram13: [1:13] Empty/Unknown 16.00m
├─ram14: [1:14] Empty/Unknown 16.00m
└─ram15: [1:15] Empty/Unknown 16.00m
c2RAID6 ~ #
Mar 31 06:19:23 c2RAID6 kdm[2588]: :0[2588]:
pam_ck_connector(kde:session): nox11 mode, ignoring PAM_TTY :0
Mar 31 06:19:32 c2RAID6 dbus[2273]: [system] Activating service
name='org.freedesktop.UPower' (using servicehelper)
Mar 31 06:19:32 c2RAID6 dbus[2273]: [system] Successfully activated
service 'org.freedesktop.UPower'
Mar 31 06:19:32 c2RAID6 dbus[2273]: [system] Activating service
name='org.freedesktop.UDisks2' (using servicehelper)
Mar 31 06:19:33 c2RAID6 udisksd[3062]: udisks daemon version 2.1.3 starting
Mar 31 06:19:33 c2RAID6 dbus[2273]: [system] Successfully activated
service 'org.freedesktop.UDisks2'
Mar 31 06:19:33 c2RAID6 udisksd[3062]: Acquired the name
org.freedesktop.UDisks2 on the system message bus
Mar 31 06:19:33 c2RAID6 dbus[2273]: [system] Activating service
name='org.kde.powerdevil.backlighthelper' (using servicehelper)
Mar 31 06:19:33 c2RAID6 dbus[2273]: [system] Successfully activated
service 'org.kde.powerdevil.backlighthelper'
Mar 31 06:19:47 c2RAID6 polkitd[2954]: Registered Authentication Agent
for unix-session:/org/freedesktop/ConsoleKit/Session1 (system bus name
:1.19 [/usr/lib64/kde4/libexec/polkit-kde-authentication-agent-1],
object path /org/kde/PolicyKit1/AuthenticationAgent, locale
en_US.UTF-8)
Mar 31 06:19:59 c2RAID6 kernel: capability: warning: `VirtualBox' uses
32-bit capabilities (legacy support in use)
Mar 31 06:20:01 c2RAID6 cron[3442]: (root) CMD (test -x
/usr/sbin/run-crons && /usr/sbin/run-crons)
Mar 31 06:20:01 c2RAID6 cron[3441]: (root) CMD (updatedb)
Mar 31 06:20:08 c2RAID6 kernel: device eth0 entered promiscuous mode
Mar 31 06:30:01 c2RAID6 cron[5497]: (root) CMD (test -x
/usr/sbin/run-crons && /usr/sbin/run-crons)
Mar 31 06:40:01 c2RAID6 cron[5911]: (root) CMD (test -x
/usr/sbin/run-crons && /usr/sbin/run-crons)
Mar 31 06:50:01 c2RAID6 cron[7406]: (root) CMD (test -x
/usr/sbin/run-crons && /usr/sbin/run-crons)
Mar 31 07:01:14 c2RAID6 syslog-ng[2227]: syslog-ng starting up; version='3.6.2'
Mar 31 07:01:14 c2RAID6 acpid[2247]: starting up with netlink and the
input layer
Mar 31 07:01:14 c2RAID6 acpid[2247]: 1 rule loaded
Mar 31 07:01:14 c2RAID6 acpid[2247]: waiting for events: event logging is off
Mar 31 07:01:16 c2RAID6 kernel: sky2 0000:07:00.0 eth0: enabling interface
Mar 31 07:01:18 c2RAID6 kernel: sky2 0000:07:00.0 eth0: Link is up at
100 Mbps, full duplex, flow control both
Mar 31 07:01:19 c2RAID6 kernel: nvidia 0000:02:00.0: irq 37 for MSI/MSI-X
Mar 31 07:01:20 c2RAID6 kernel: nvidia 0000:04:00.0: irq 38 for MSI/MSI-X
Mar 31 07:01:19 c2RAID6 acpid[2247]: client connected from 2479[0:0]
--
To unsubscribe from this list: send the line "unsubscribe linux-raid" in
the body of a message to majordomo@vger.kernel.org
More majordomo info at http://vger.kernel.org/majordomo-info.html
* Re: Two system lockups, Magic SysKeys works, RAID6 resyncs on reboot
2015-03-31 15:00 Two system lockups, Magic SysKeys works, RAID6 resyncs on reboot Mark Knecht
@ 2015-03-31 20:30 ` Roman Mamedov
2015-03-31 22:08 ` Mark Knecht
2015-04-06 18:39 ` Mark Knecht
0 siblings, 2 replies; 5+ messages in thread
From: Roman Mamedov @ 2015-03-31 20:30 UTC (permalink / raw)
To: Mark Knecht; +Cc: Linux-RAID
On Tue, 31 Mar 2015 08:00:27 -0700
Mark Knecht <markknecht@gmail.com> wrote:
> md3 : active raid6 sdc3[9] sdg3[5] sdf3[6] sde3[7] sdd3[8]
> 1452264480 blocks super 1.2 level 6, 16k chunk, algorithm 2 [5/5] [UUUUU]
> [====>................] resync = 22.8% (110608304/484088160)
> finish=78.5min speed=79206K/sec
Looks like you are running this without a write intent bitmap, why? It would
at least save you from a full resync each time.
--
With respect,
Roman
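The observation above can be checked mechanically. The following is a minimal sketch, not part of the original exchange: it takes the md3 stanza quoted in this message (with the wrapped progress line rejoined), looks for a `bitmap:` line, which /proc/mdstat prints for arrays that have a write intent bitmap, and recomputes the resync progress and time remaining from the block counts and speed. The function name `summarize` is hypothetical.

```python
import re

# The md3 stanza quoted above, with the wrapped progress line rejoined.
MDSTAT = """\
md3 : active raid6 sdc3[9] sdg3[5] sdf3[6] sde3[7] sdd3[8]
      1452264480 blocks super 1.2 level 6, 16k chunk, algorithm 2 [5/5] [UUUUU]
      [====>................]  resync = 22.8% (110608304/484088160) finish=78.5min speed=79206K/sec
"""


def summarize(mdstat_text):
    """Return (has_bitmap, percent_done, minutes_left) for one array stanza.

    percent_done and minutes_left are None when no resync line is present.
    """
    # Arrays with a write intent bitmap show an extra "bitmap: ..." line.
    has_bitmap = re.search(r"^\s*bitmap:", mdstat_text, re.MULTILINE) is not None

    # "(done/total)" is in 1K blocks; speed is in K/sec.
    m = re.search(r"\((\d+)/(\d+)\).*speed=(\d+)K/sec", mdstat_text)
    if m is None:
        return has_bitmap, None, None

    done, total, speed = (int(g) for g in m.groups())
    percent_done = 100.0 * done / total
    minutes_left = (total - done) / speed / 60.0
    return has_bitmap, percent_done, minutes_left


has_bitmap, percent_done, minutes_left = summarize(MDSTAT)
print(f"bitmap present: {has_bitmap}")
print(f"resync done:    {percent_done:.1f}%")
print(f"time remaining: {minutes_left:.1f} min")
```

The recomputed figures should land close to the 22.8% and 78.5min that the kernel reported, which confirms that the whole 484088160-block component is being walked rather than only the regions that were dirty at the time of the lockup.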
* Re: Two system lockups, Magic SysKeys works, RAID6 resyncs on reboot
2015-03-31 20:30 ` Roman Mamedov
@ 2015-03-31 22:08 ` Mark Knecht
2015-04-01 19:07 ` Mark Knecht
2015-04-06 18:39 ` Mark Knecht
1 sibling, 1 reply; 5+ messages in thread
From: Mark Knecht @ 2015-03-31 22:08 UTC (permalink / raw)
To: Roman Mamedov; +Cc: Linux-RAID
On Tue, Mar 31, 2015 at 1:30 PM, Roman Mamedov <rm@romanrm.net> wrote:
> On Tue, 31 Mar 2015 08:00:27 -0700
> Mark Knecht <markknecht@gmail.com> wrote:
>
>> md3 : active raid6 sdc3[9] sdg3[5] sdf3[6] sde3[7] sdd3[8]
>> 1452264480 blocks super 1.2 level 6, 16k chunk, algorithm 2 [5/5] [UUUUU]
>> [====>................] resync = 22.8% (110608304/484088160)
>> finish=78.5min speed=79206K/sec
>
> Looks like you are running this without a write intent bitmap, why? It would
> at least save you from a full resync each time.
>
> --
> With respect,
> Roman
Honestly, I don't know. The machine was the first I ever built using
RAID for the system drive and I probably didn't find out about it, or
was just trying to keep it simple? Not at all sure.
I'll check it out.
Thanks,
Mark
* Re: Two system lockups, Magic SysKeys works, RAID6 resyncs on reboot
2015-03-31 22:08 ` Mark Knecht
@ 2015-04-01 19:07 ` Mark Knecht
0 siblings, 0 replies; 5+ messages in thread
From: Mark Knecht @ 2015-04-01 19:07 UTC (permalink / raw)
To: Roman Mamedov; +Cc: Linux-RAID
On Tue, Mar 31, 2015 at 3:08 PM, Mark Knecht <markknecht@gmail.com> wrote:
> On Tue, Mar 31, 2015 at 1:30 PM, Roman Mamedov <rm@romanrm.net> wrote:
>> On Tue, 31 Mar 2015 08:00:27 -0700
>> Mark Knecht <markknecht@gmail.com> wrote:
>>
>>> md3 : active raid6 sdc3[9] sdg3[5] sdf3[6] sde3[7] sdd3[8]
>>> 1452264480 blocks super 1.2 level 6, 16k chunk, algorithm 2 [5/5] [UUUUU]
>>> [====>................] resync = 22.8% (110608304/484088160)
>>> finish=78.5min speed=79206K/sec
>>
>> Looks like you are running this without a write intent bitmap, why? It would
>> at least save you from a full resync each time.
>>
>> --
>> With respect,
>> Roman
>
> Honestly, I don't know. The machine was the first I ever built using
> RAID for the system drive and I probably didn't find out about it, or
> was just trying to keep it simple? Not at all sure.
>
> I'll check it out.
>
> Thanks,
> Mark
At this point, having put everything back together, I cannot find any
specific clues about what caused the problem. Possibly the system
might be showing its age. Maybe it's a corner condition in the 3.18
series kernel vs the 3.14 I was running before. No idea. The mdadm
RAID seems to have done its job wonderfully. At least at the Linux
level there's no evidence at this time that anything was lost. Within
my Windows VMs that were running on the mdadm RAID6 at the time of the
Magic Sys Key reboot there seem to be minimal issues. One Windows
program (Schwab's StreetSmart Edge) lost some of my trading
environment configuration and I needed to recall only that file off
the web server to get the platform configured correctly again.
TradeStation was running in a second VM and showed no issues. Both
trading platforms were live at the time of the lockup so I came out of
this really well, and as I have good backups nothing would have been
lost anyway.
As per Roman's question/suggestion I've added write intent bitmaps
to all three RAIDs. Only md3 is mounted most of the time anyway, but
the bitmaps are there to shorten the resync if this problem continues.
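For anyone following along, a bitmap can be added to a running array with mdadm's grow mode (`mdadm --grow --bitmap=internal <device>`). The sketch below only builds and prints those commands so they can be reviewed before being run as root. It assumes internal bitmaps and the device paths /dev/md3, /dev/md4 and /dev/md10, taken from the array names in the lsdrv output; the message does not say which bitmap type or paths were actually used. The helper name `bitmap_commands` is hypothetical.

```python
import shlex

# Assumed device paths, based on the md3/md4/md10 names in the lsdrv output.
ARRAYS = ["/dev/md3", "/dev/md4", "/dev/md10"]


def bitmap_commands(arrays):
    """Build one 'mdadm --grow --bitmap=internal' argv list per array."""
    return [["mdadm", "--grow", "--bitmap=internal", dev] for dev in arrays]


# Dry run: print the commands instead of executing them.
for argv in bitmap_commands(ARRAYS):
    print(shlex.join(argv))
```

Afterwards, a `bitmap:` line should appear under each array in /proc/mdstat.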
If anyone sees anything else in the data I posted yesterday let me
know but everything mdadm specific seems good at this point.
Cheers,
Mark
* Re: Two system lockups, Magic SysKeys works, RAID6 resyncs on reboot
2015-03-31 20:30 ` Roman Mamedov
2015-03-31 22:08 ` Mark Knecht
@ 2015-04-06 18:39 ` Mark Knecht
1 sibling, 0 replies; 5+ messages in thread
From: Mark Knecht @ 2015-04-06 18:39 UTC (permalink / raw)
To: Roman Mamedov; +Cc: Linux-RAID
On Tue, Mar 31, 2015 at 1:30 PM, Roman Mamedov <rm@romanrm.net> wrote:
> On Tue, 31 Mar 2015 08:00:27 -0700
> Mark Knecht <markknecht@gmail.com> wrote:
>
>> md3 : active raid6 sdc3[9] sdg3[5] sdf3[6] sde3[7] sdd3[8]
>> 1452264480 blocks super 1.2 level 6, 16k chunk, algorithm 2 [5/5] [UUUUU]
>> [====>................] resync = 22.8% (110608304/484088160)
>> finish=78.5min speed=79206K/sec
>
> Looks like you are running this without a write intent bitmap, why? It would
> at least save you from a full resync each time.
>
> --
> With respect,
> Roman
I had a third lockup this morning. No idea what's causing them, but
with the write intent bitmap in place the resync finished quickly.
Thanks!
Cheers,
Mark
end of thread, other threads:[~2015-04-06 18:39 UTC | newest]
Thread overview: 5+ messages
2015-03-31 15:00 Two system lockups, Magic SysKeys works, RAID6 resyncs on reboot Mark Knecht
2015-03-31 20:30 ` Roman Mamedov
2015-03-31 22:08 ` Mark Knecht
2015-04-01 19:07 ` Mark Knecht
2015-04-06 18:39 ` Mark Knecht