linux-raid.vger.kernel.org archive mirror
 help / color / mirror / Atom feed
* Re: Why does one get mismatches?
@ 2010-01-20 15:03 Jon Hardcastle
  2010-01-20 15:34 ` Brett Russ
  0 siblings, 1 reply; 93+ messages in thread
From: Jon Hardcastle @ 2010-01-20 15:03 UTC (permalink / raw)
  To: linux-raid, Brett Russ

[-- Attachment #1: Type: text/plain, Size: 1512 bytes --]

--- On Wed, 20/1/10, Brett Russ <bruss@netezza.com> wrote:

> From: Brett Russ <bruss@netezza.com>
> Subject: Re: Why does one get mismatches?
> To: linux-raid@vger.kernel.org
> Date: Wednesday, 20 January, 2010, 14:46
> On 01/20/2010 09:34 AM, Jon
> Hardcastle wrote:
> > I will gather the information you require, but so it
> is clear it is a
> > a echo 'check' that is kicking off the ultimate
> mismatch not from
> > boot.
> 
> What do you mean by mismatches detected?  How is this
> observed?
> -BR
> 


cat /sys/block/md4/md/mismatch_cnt

I have a script that i use that runs a 'check' then looks at this value once the check is complete. If it is > 0 it reports this number to me via email and then starts a repair. I am under the impression that the repair - when complete should report (in an ideal world) an identical number indicating that it did indeed find x errors and repaired them. But i am getting differing amount check shows 8 repair shows 12 next run it'll be 24 and 6 so something is up.

I have been able to get the info you asked for using mdadm -E but not with the array stopped. I cant stop it just yet. What would you be looking for in this data?

-----------------------
N: Jon Hardcastle
E: Jon@eHardcastle.com
'Do not worry about tomorrow, for tomorrow will bring worries of its own.'

***********
Please note, I am phasing out jd_hardcastle AT yahoo.com and replacing it with jon AT eHardcastle.com
***********

-----------------------


      

[-- Attachment #2: mdadm-EsdX1.txt --]
[-- Type: text/plain, Size: 8603 bytes --]

/dev/sda1:
          Magic : a92b4efc
        Version : 0.90.00
           UUID : 7438efd1:9e6ca2b5:d6b88274:7003b1d3
  Creation Time : Thu Oct 11 00:01:49 2007
     Raid Level : raid6
  Used Dev Size : 488383936 (465.76 GiB 500.11 GB)
     Array Size : 2441919680 (2328.80 GiB 2500.53 GB)
   Raid Devices : 7
  Total Devices : 7
Preferred Minor : 4

    Update Time : Wed Jan 20 12:54:16 2010
          State : clean
 Active Devices : 7
Working Devices : 7
 Failed Devices : 0
  Spare Devices : 0
       Checksum : b2468ec2 - correct
         Events : 1834225

         Layout : left-symmetric
     Chunk Size : 64K

      Number   Major   Minor   RaidDevice State
this     5       8        1        5      active sync   /dev/sda1

   0     0       8       65        0      active sync   /dev/sde1
   1     1       8       33        1      active sync   /dev/sdc1
   2     2       8       97        2      active sync   /dev/sdg1
   3     3       8       17        3      active sync   /dev/sdb1
   4     4       8       49        4      active sync   /dev/sdd1
   5     5       8        1        5      active sync   /dev/sda1
   6     6       8       81        6      active sync   /dev/sdf1
/dev/sdb1:
          Magic : a92b4efc
        Version : 0.90.00
           UUID : 7438efd1:9e6ca2b5:d6b88274:7003b1d3
  Creation Time : Thu Oct 11 00:01:49 2007
     Raid Level : raid6
  Used Dev Size : 488383936 (465.76 GiB 500.11 GB)
     Array Size : 2441919680 (2328.80 GiB 2500.53 GB)
   Raid Devices : 7
  Total Devices : 7
Preferred Minor : 4

    Update Time : Wed Jan 20 12:54:16 2010
          State : clean
 Active Devices : 7
Working Devices : 7
 Failed Devices : 0
  Spare Devices : 0
       Checksum : b2468ece - correct
         Events : 1834225

         Layout : left-symmetric
     Chunk Size : 64K

      Number   Major   Minor   RaidDevice State
this     3       8       17        3      active sync   /dev/sdb1

   0     0       8       65        0      active sync   /dev/sde1
   1     1       8       33        1      active sync   /dev/sdc1
   2     2       8       97        2      active sync   /dev/sdg1
   3     3       8       17        3      active sync   /dev/sdb1
   4     4       8       49        4      active sync   /dev/sdd1
   5     5       8        1        5      active sync   /dev/sda1
   6     6       8       81        6      active sync   /dev/sdf1
/dev/sdc1:
          Magic : a92b4efc
        Version : 0.90.00
           UUID : 7438efd1:9e6ca2b5:d6b88274:7003b1d3
  Creation Time : Thu Oct 11 00:01:49 2007
     Raid Level : raid6
  Used Dev Size : 488383936 (465.76 GiB 500.11 GB)
     Array Size : 2441919680 (2328.80 GiB 2500.53 GB)
   Raid Devices : 7
  Total Devices : 7
Preferred Minor : 4

    Update Time : Wed Jan 20 12:54:16 2010
          State : clean
 Active Devices : 7
Working Devices : 7
 Failed Devices : 0
  Spare Devices : 0
       Checksum : b2468eda - correct
         Events : 1834225

         Layout : left-symmetric
     Chunk Size : 64K

      Number   Major   Minor   RaidDevice State
this     1       8       33        1      active sync   /dev/sdc1

   0     0       8       65        0      active sync   /dev/sde1
   1     1       8       33        1      active sync   /dev/sdc1
   2     2       8       97        2      active sync   /dev/sdg1
   3     3       8       17        3      active sync   /dev/sdb1
   4     4       8       49        4      active sync   /dev/sdd1
   5     5       8        1        5      active sync   /dev/sda1
   6     6       8       81        6      active sync   /dev/sdf1
/dev/sdd1:
          Magic : a92b4efc
        Version : 0.90.00
           UUID : 7438efd1:9e6ca2b5:d6b88274:7003b1d3
  Creation Time : Thu Oct 11 00:01:49 2007
     Raid Level : raid6
  Used Dev Size : 488383936 (465.76 GiB 500.11 GB)
     Array Size : 2441919680 (2328.80 GiB 2500.53 GB)
   Raid Devices : 7
  Total Devices : 7
Preferred Minor : 4

    Update Time : Wed Jan 20 12:54:16 2010
          State : clean
 Active Devices : 7
Working Devices : 7
 Failed Devices : 0
  Spare Devices : 0
       Checksum : b2468ef0 - correct
         Events : 1834225

         Layout : left-symmetric
     Chunk Size : 64K

      Number   Major   Minor   RaidDevice State
this     4       8       49        4      active sync   /dev/sdd1

   0     0       8       65        0      active sync   /dev/sde1
   1     1       8       33        1      active sync   /dev/sdc1
   2     2       8       97        2      active sync   /dev/sdg1
   3     3       8       17        3      active sync   /dev/sdb1
   4     4       8       49        4      active sync   /dev/sdd1
   5     5       8        1        5      active sync   /dev/sda1
   6     6       8       81        6      active sync   /dev/sdf1
/dev/sde1:
          Magic : a92b4efc
        Version : 0.90.00
           UUID : 7438efd1:9e6ca2b5:d6b88274:7003b1d3
  Creation Time : Thu Oct 11 00:01:49 2007
     Raid Level : raid6
  Used Dev Size : 488383936 (465.76 GiB 500.11 GB)
     Array Size : 2441919680 (2328.80 GiB 2500.53 GB)
   Raid Devices : 7
  Total Devices : 7
Preferred Minor : 4

    Update Time : Wed Jan 20 12:54:16 2010
          State : clean
 Active Devices : 7
Working Devices : 7
 Failed Devices : 0
  Spare Devices : 0
       Checksum : b2468ef8 - correct
         Events : 1834225

         Layout : left-symmetric
     Chunk Size : 64K

      Number   Major   Minor   RaidDevice State
this     0       8       65        0      active sync   /dev/sde1

   0     0       8       65        0      active sync   /dev/sde1
   1     1       8       33        1      active sync   /dev/sdc1
   2     2       8       97        2      active sync   /dev/sdg1
   3     3       8       17        3      active sync   /dev/sdb1
   4     4       8       49        4      active sync   /dev/sdd1
   5     5       8        1        5      active sync   /dev/sda1
   6     6       8       81        6      active sync   /dev/sdf1
/dev/sdf1:
          Magic : a92b4efc
        Version : 0.90.00
           UUID : 7438efd1:9e6ca2b5:d6b88274:7003b1d3
  Creation Time : Thu Oct 11 00:01:49 2007
     Raid Level : raid6
  Used Dev Size : 488383936 (465.76 GiB 500.11 GB)
     Array Size : 2441919680 (2328.80 GiB 2500.53 GB)
   Raid Devices : 7
  Total Devices : 7
Preferred Minor : 4

    Update Time : Wed Jan 20 12:54:16 2010
          State : clean
 Active Devices : 7
Working Devices : 7
 Failed Devices : 0
  Spare Devices : 0
       Checksum : b2468f14 - correct
         Events : 1834225

         Layout : left-symmetric
     Chunk Size : 64K

      Number   Major   Minor   RaidDevice State
this     6       8       81        6      active sync   /dev/sdf1

   0     0       8       65        0      active sync   /dev/sde1
   1     1       8       33        1      active sync   /dev/sdc1
   2     2       8       97        2      active sync   /dev/sdg1
   3     3       8       17        3      active sync   /dev/sdb1
   4     4       8       49        4      active sync   /dev/sdd1
   5     5       8        1        5      active sync   /dev/sda1
   6     6       8       81        6      active sync   /dev/sdf1
/dev/sdg1:
          Magic : a92b4efc
        Version : 0.90.00
           UUID : 7438efd1:9e6ca2b5:d6b88274:7003b1d3
  Creation Time : Thu Oct 11 00:01:49 2007
     Raid Level : raid6
  Used Dev Size : 488383936 (465.76 GiB 500.11 GB)
     Array Size : 2441919680 (2328.80 GiB 2500.53 GB)
   Raid Devices : 7
  Total Devices : 7
Preferred Minor : 4

    Update Time : Wed Jan 20 12:54:16 2010
          State : clean
 Active Devices : 7
Working Devices : 7
 Failed Devices : 0
  Spare Devices : 0
       Checksum : b2468f1c - correct
         Events : 1834225

         Layout : left-symmetric
     Chunk Size : 64K

      Number   Major   Minor   RaidDevice State
this     2       8       97        2      active sync   /dev/sdg1

   0     0       8       65        0      active sync   /dev/sde1
   1     1       8       33        1      active sync   /dev/sdc1
   2     2       8       97        2      active sync   /dev/sdg1
   3     3       8       17        3      active sync   /dev/sdb1
   4     4       8       49        4      active sync   /dev/sdd1
   5     5       8        1        5      active sync   /dev/sda1
   6     6       8       81        6      active sync   /dev/sdf1

^ permalink raw reply	[flat|nested] 93+ messages in thread
* RE: Why does one get mismatches?
@ 2010-02-01 23:14 Jon Hardcastle
  0 siblings, 0 replies; 93+ messages in thread
From: Jon Hardcastle @ 2010-02-01 23:14 UTC (permalink / raw)
  To: Neil Brown, Bill Davidsen; +Cc: Jon, linux-raid

But what if 2 of the three sources say xyz then you can make a guess that has a higher propencity to beng right? I guess this would also work for raid 6.

Incidently the problem of my mismatches was almost certainly badly seated ram... But my tests are ongoing to be sure....



-----Original Message-----
From: Neil Brown <neilb@suse.de>
Sent: 01 February 2010 22:37
To: Bill Davidsen <davidsen@tmr.com>
Cc: Jon@eHardcastle.com; linux-raid@vger.kernel.org
Subject: Re: Why does one get mismatches?

On Mon, 01 Feb 2010 16:18:23 -0500
Bill Davidsen <davidsen@tmr.com> wrote:

> Comment: when there is a three way RAID-1, why doesn't repair *vote* on 
> the correct value instead of just making a guess?
> 

Because truth is not democratic.

(and I defy you to define "correct" in any general way in this context).

NeilBrown


^ permalink raw reply	[flat|nested] 93+ messages in thread
* Re: Why does one get mismatches?
@ 2010-01-25 20:43 greg
  2010-01-25 22:49 ` Steven Haigh
  0 siblings, 1 reply; 93+ messages in thread
From: greg @ 2010-01-25 20:43 UTC (permalink / raw)
  To: Farkas Levente, Steven Haigh; +Cc: Asdo, linux-raid

On Jan 21, 12:48pm, Farkas Levente wrote:
} Subject: Re: Why does one get mismatches?

Good afternoon to everyone, hope the week is starting well.

> On 01/21/2010 11:52 AM, Steven Haigh wrote:
> > On Thu, 21 Jan 2010 09:08:42 +0100, Asdo<asdo@shiftmail.org>  wrote:
> >> Steven Haigh wrote:
> >>> On Wed, 20 Jan 2010 17:43:45 -0500, Brett Russ<bruss@netezza.com>
> > wrote:
> >>>
> >>> CUT!
> >> Might that be a problem of the disks/controllers?
> >> Jon and Steven, what hardware do you have?
> >
> > I'm running some fairly old hardware on this particular server. It's a
> > dual P3 1Ghz.
> >
> > After running a repair on /dev/md2, I now see:
> > # cat /sys/block/md2/md/mismatch_cnt
> > 1536
> >
> > Again, no smart errors, nothing to indicate a disk problem at all :(
> >
> > As this really keeps killing the machine and it is a live system - the
> > only thing I can really think of doing is to break the RAID and just rsync
> > the drives twice daily :\

> the same happened with many people. and we all hate it since it
> cause a huge load at all weekend on most of our servers:-( according
> to redhat it's not a bug:-(

The RAID check/mismatch_count is an example of well intentioned
technology suffering from 'featuritis' by the distributions which is,
as I predicted a couple of times in this forum, causing all sorts of
angst and problems throughout the world.  I've had some posts on this
subject but will summarize in the hopes of giving some background
information which will be useful to people.

There is an issue in the kernel which causes these mismatches.  The
problem seems to be particularly bad with RAID1 arrays.  The
contention is that these mismatches are 'harmless' because they only
occur in areas of the filesystems which are not being used.

The best description is that the buffers containing the data to be
written are not 'pinned' all the way down the I/O stack.  This can
cause the contents of a buffer to be changed while in transit through
the I/O stack.  Thus one copy of a mirror gets a buffer written to it
different then the other side of the mirror.

I've read reasoned discussions about why this occurs with swap over
RAID1 and why its harmless.  I've set to see the same type of reasoned
discussion as to why it is not problematic with a filesystem over
RAID1.  There has been some discussion that its due to high levels of
MMAP activity on the filesystem.

We have confirmed, that at least with RAID1, this all occurs with no
physical corruption on the 'disk drives'.  We implement geographically
mirror storage with RAID1 against two separate data-centers.  At each
data-center the RAID1 'block-device' are RAID5 volumes.  These latter
volumes check out with no errors/mismatch counts etc.  So the issue is
at the RAID1 data abstraction layer.

There do not appear to be any tools which allow one to determine
'where' the mismatches are.  Such a tool, or logging by the kernel,
would be useful for people who want to verify what files, if any, are
affected by the mismatch.  Otherwise running a 'repair' results in the
RAID1 code arbitraily deciding which of the two blocks is the
'correct' one.

So thats sort of a thumbnail sketch of what is going on.  The fact the
distributions chose to implement this without understanding the issues
it presents is a bit problematic.

>    Levente                               "Si vis pacem para bellum!"

Hopefully this information is helpful.

Greg

}-- End of excerpt from Farkas Levente

As always,
Dr. G.W. Wettstein, Ph.D.   Enjellic Systems Development, LLC.
4206 N. 19th Ave.           Specializing in information infra-structure
Fargo, ND  58102            development.
PH: 701-281-1686
FAX: 701-281-3949           EMAIL: greg@enjellic.com
------------------------------------------------------------------------------
"I am returning this otherwise good typing paper to you because
 someone has printed gibberish all over it and put your name at the
 top.
                                -- English Professor, Ohio University

^ permalink raw reply	[flat|nested] 93+ messages in thread
* Fw: Why does one get mismatches?
@ 2010-01-20 11:52 Jon Hardcastle
  2010-02-01 21:18 ` Bill Davidsen
  0 siblings, 1 reply; 93+ messages in thread
From: Jon Hardcastle @ 2010-01-20 11:52 UTC (permalink / raw)
  To: linux-raid

--- On Tue, 19/1/10, Jon Hardcastle <jd_hardcastle@yahoo.com> wrote:

> From: Jon Hardcastle <jd_hardcastle@yahoo.com>
> Subject: Why does one get mismatches?
> To: linux-raid@vger.kernel.org
> Date: Tuesday, 19 January, 2010, 10:04
> Hi,
> 
> I kicked off a check/repair cycle on my machine after i
> moved the phyiscal ordering of my drives around and I am now
> on my second check/repair cycle and it has kept finding
> mismatches.
> 
> Is it correct that the mismatch value after a repair was
> needed should equal the value present after a check? What if
> it doesn't? What does it mean if another check STILL reveals
> mismatches?
> 
> I had something similar after i reshaped from raid 5 to 6 i
> had to run check/repair/check/repair several times before i
> got my 0.
> 
> 

Guys,

Anyone got any suggestions here? I am now on my ~5 check/repair and after a reboot the first check is still returning 8.

All i have done is move the drives around. It is the same controllers/cables/etc 

I really dont like the seeming random nature of what can/does/has caused the mismatches?


      

^ permalink raw reply	[flat|nested] 93+ messages in thread
* Why does one get mismatches?
@ 2010-01-19 10:04 Jon Hardcastle
  2010-01-20 14:19 ` Brett Russ
  0 siblings, 1 reply; 93+ messages in thread
From: Jon Hardcastle @ 2010-01-19 10:04 UTC (permalink / raw)
  To: linux-raid

Hi,

I kicked off a check/repair cycle on my machine after i moved the phyiscal ordering of my drives around and I am now on my second check/repair cycle and it has kept finding mismatches.

Is it correct that the mismatch value after a repair was needed should equal the value present after a check? What if it doesn't? What does it mean if another check STILL reveals mismatches?

I had something similar after i reshaped from raid 5 to 6 i had to run check/repair/check/repair several times before i got my 0.


-----------------------
N: Jon Hardcastle
E: Jon@eHardcastle.com
'Do not worry about tomorrow, for tomorrow will bring worries of its own.'

***********
Please note, I am phasing out jd_hardcastle AT yahoo.com and replacing it with jon AT eHardcastle.com
***********

-----------------------


      

^ permalink raw reply	[flat|nested] 93+ messages in thread

end of thread, other threads:[~2010-03-03 12:03 UTC | newest]

Thread overview: 93+ messages (download: mbox.gz follow: Atom feed
-- links below jump to the message on this page --
2010-01-20 15:03 Why does one get mismatches? Jon Hardcastle
2010-01-20 15:34 ` Brett Russ
2010-01-20 20:44   ` Majed B.
2010-01-20 22:25     ` Brett Russ
2010-01-20 22:30       ` Majed B.
2010-01-20 22:43         ` Brett Russ
2010-01-20 23:01           ` Christopher Chen
2010-01-21  4:17           ` Steven Haigh
2010-01-21  8:08             ` Asdo
2010-01-21 10:52               ` Steven Haigh
2010-01-21 11:48                 ` Farkas Levente
2010-01-21 12:15                   ` Jon Hardcastle
  -- strict thread matches above, loose matches on Subject: below --
2010-02-01 23:14 Jon Hardcastle
2010-01-25 20:43 greg
2010-01-25 22:49 ` Steven Haigh
2010-01-27 21:54   ` Tirumala Reddy Marri
2010-01-28  9:16     ` Jon Hardcastle
2010-01-28 10:29       ` Asdo
2010-01-28 17:20     ` Tirumala Reddy Marri
2010-01-28 18:23       ` Goswin von Brederlow
2010-01-28 19:03         ` Tirumala Reddy Marri
2010-01-28 20:24           ` Goswin von Brederlow
2010-01-29 15:37             ` Jon Hardcastle
2010-01-29 23:52               ` Goswin von Brederlow
2010-01-30 10:39                 ` Jon Hardcastle
2010-02-01 21:10               ` Bill Davidsen
2010-01-20 11:52 Fw: " Jon Hardcastle
2010-02-01 21:18 ` Bill Davidsen
2010-02-01 22:37   ` Neil Brown
2010-02-02 15:11     ` Bill Davidsen
2010-02-03 11:17       ` Goswin von Brederlow
2010-02-11  5:14       ` Neil Brown
2010-02-11 17:51         ` Bryan Mesich
2010-02-16 21:25           ` Bill Davidsen
2010-02-16 21:38             ` Steven Haigh
2010-02-17  3:19               ` Bryan Mesich
2010-02-17 23:05               ` Neil Brown
2010-02-19 15:18                 ` Piergiorgio Sartor
2010-02-19 22:02                   ` Neil Brown
2010-02-19 22:37                     ` Piergiorgio Sartor
2010-02-19 23:34                     ` Asdo
2010-02-20  4:27                       ` Goswin von Brederlow
2010-02-20 11:12                         ` Asdo
2010-02-21 11:13                           ` Goswin von Brederlow
     [not found]                             ` <8754A21825504719B463AD9809E54349@m5>
     [not found]                               ` <20100221194400.GA2570@lazy.lzy>
2010-02-22 13:01                                 ` Asdo
2010-02-22 13:30                                   ` Piergiorgio Sartor
2010-02-22 13:44                                   ` Piergiorgio Sartor
2010-02-24 19:42                               ` Bill Davidsen
2010-02-20  4:23                     ` Goswin von Brederlow
2010-02-24 14:54                     ` Bill Davidsen
2010-02-24 21:37                       ` Neil Brown
2010-02-26 20:48                         ` Bill Davidsen
2010-02-26 21:09                           ` Neil Brown
2010-02-26 22:01                             ` Piergiorgio Sartor
2010-02-26 22:15                             ` Bill Davidsen
2010-02-26 22:21                               ` Piergiorgio Sartor
2010-02-26 22:20                             ` Asdo
2010-02-27  6:01                               ` Michael Evans
2010-02-28  0:01                                 ` Bill Davidsen
2010-02-24 14:46                 ` Bill Davidsen
2010-02-24 16:12                   ` Martin K. Petersen
2010-02-24 18:51                     ` Piergiorgio Sartor
2010-02-24 22:21                       ` Neil Brown
2010-02-25  8:41                         ` Piergiorgio Sartor
2010-03-02  4:57                           ` Neil Brown
2010-03-02 18:49                             ` Piergiorgio Sartor
2010-02-24 21:39                     ` Neil Brown
     [not found]                       ` <4B8640A2.4060307@shiftmail.org>
2010-02-25 10:41                         ` Neil Brown
2010-02-28  8:09                       ` Luca Berra
2010-03-02  5:01                         ` Neil Brown
2010-03-02  7:36                           ` Luca Berra
2010-03-02 10:04                             ` Michael Evans
2010-03-02 11:02                               ` Luca Berra
2010-03-02 12:13                                 ` Michael Evans
2010-03-02 18:14                                 ` Asdo
2010-03-02 18:52                                   ` Piergiorgio Sartor
2010-03-02 23:27                                     ` Asdo
2010-03-03  9:13                                       ` Piergiorgio Sartor
2010-03-03 11:42                                         ` Asdo
2010-03-03 12:03                                           ` Piergiorgio Sartor
2010-03-02 20:17                                   ` Neil Brown
2010-02-24 21:32                   ` Neil Brown
2010-02-25  7:22                     ` Goswin von Brederlow
2010-02-25  7:39                       ` Neil Brown
2010-02-25  8:47                     ` John Robinson
2010-02-25  9:07                       ` Neil Brown
2010-02-11 18:12         ` Piergiorgio Sartor
2010-01-19 10:04 Jon Hardcastle
2010-01-20 14:19 ` Brett Russ
2010-01-20 14:34   ` Jon Hardcastle
2010-01-20 14:46     ` Brett Russ
2010-02-01 20:48       ` Bill Davidsen
2010-01-22 16:22   ` Jon Hardcastle
2010-01-22 16:34     ` Asdo
2010-01-22 17:41     ` Brett Russ

This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).