From mboxrd@z Thu Jan 1 00:00:00 1970 From: Tomasz Chmielewski Subject: checking state of RAID (for automated notifications) Date: Wed, 06 Sep 2006 10:44:24 +0200 Message-ID: <44FE8A68.8060306@wpkg.org> Mime-Version: 1.0 Content-Type: text/plain; charset=ISO-8859-2; format=flowed Content-Transfer-Encoding: 7bit Return-path: Sender: linux-raid-owner@vger.kernel.org To: linux-raid@vger.kernel.org List-Id: linux-raid.ids I would like to have RAID status monitored by nagios. This sounds like a simple script, but I'm not sure what approach is correct. Considering, that the "health" status of /proc/mdstat looks like this: # cat /proc/mdstat Personalities : [raid1] [raid10] md2 : active raid10 sda2[4] sdd2[3] sdc2[2] sdb2[1] 779264640 blocks super 1.0 64K chunks 2 near-copies [4/4] [UUUU] md1 : active raid1 sdd1[1] sdc1[0] 1076224 blocks [2/2] [UU] md0 : active raid1 sdb1[1] sda1[0] 1076224 blocks [2/2] [UU] unused devices: What my script should be checking? Does the number of "U" (8 for this host) letters indicate that RAID is healthy? Or should I count "in_sync" in "cat /sys/block/md*/md/rd*/state"? Perhaps the two approaches are the same, though. What's the best way to determine that the RAID is running fine? -- Tomasz Chmielewski http://wpkg.org