linux-raid.vger.kernel.org archive mirror
 help / color / mirror / Atom feed
From: NeilBrown <neilb@suse.de>
To: linbloke <linbloke@fastmail.fm>
Cc: CoolCold <coolthecold@gmail.com>,
	Paul Clements <paul.clements@us.sios.com>,
	John Robinson <john.robinson@anonymous.org.uk>,
	Linux RAID <linux-raid@vger.kernel.org>
Subject: Re: possible bug - bitmap dirty pages status
Date: Wed, 16 Nov 2011 13:30:45 +1100	[thread overview]
Message-ID: <20111116133045.2528310b@notabene.brown> (raw)
In-Reply-To: <4EC1A037.4080406@fastmail.fm>

[-- Attachment #1: Type: text/plain, Size: 2266 bytes --]

On Tue, 15 Nov 2011 10:11:51 +1100 linbloke <linbloke@fastmail.fm> wrote:
> Hello,
> 
> Sorry for bumping this thread but I couldn't find any resolution 
> post-dated. I'm seeing the same thing with SLES11 SP1. No matter how 
> long I wait or how often I sync(8), the number of dirty bitmap pages 
> does not reduce to zero - 52 has become the new zero for this array 
> (md101). I've tried writing more data to prod the sync  - the result was 
> an increase in the dirty page count (53/465) and then return to the base 
> count (52/465) after 5seconds. I haven't tried removing the bitmaps and 
> am a little reluctant to unless this would help to diagnose the bug.
> 
> This array is part of a nested array set as mentioned in another mail 
> list thread with the Subject: Rotating RAID 1. Another thing happening 
> with this array is that the top array (md106), the one with the 
> filesystem on it, has the file system exported via NFS to a dozen or so 
> other systems. There has been no activity on this array for at least a 
> couple of minutes.
> 
> I certainly don't feel comfortable that I have created a mirror of the 
> component devices. Can I expect the devices to actually be in sync at 
> this point?

Hi,
 thanks for the report.
 I can understand your discomfort.  Unfortunately I haven't been able to
 discover with any confidence what the problem is, so I cannot completely
 relieve that discomfort.  I have found another possible issue - a race that
 could cause md to forget that it needs to clean out a page of the bitmap.
 I could imagine that causing 1 or maybe 2 pages to be stuck, but I don't
 think it can explain 52.

 Can can check if you actually have a mirror by:
    echo check > /sys/block/md101/md/sync_action
 then wait for that to finish and check ..../mismatch_cnt.
 I'm quite confident that will report 0.  I strongly suspect the problem is
 that we forget to clear pages or bits, not that we forget to use them during
 recovery.

 So don't think that keeping the bitmaps will help in diagnosing the
 problem.   We I need is a sequence of events that is likely to produce the
 problem, and I realise that is hard to come by.

 Sorry that I cannot be more helpful yet.

NeilBrown


[-- Attachment #2: signature.asc --]
[-- Type: application/pgp-signature, Size: 828 bytes --]

  reply	other threads:[~2011-11-16  2:30 UTC|newest]

Thread overview: 13+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2011-08-27  9:58 possible bug - bitmap dirty pages status CoolCold
2011-08-31  9:05 ` CoolCold
2011-08-31 12:30   ` Paul Clements
2011-08-31 12:56     ` John Robinson
2011-08-31 13:16       ` CoolCold
2011-08-31 14:08         ` Paul Clements
2011-08-31 20:16           ` CoolCold
2011-09-01  5:40             ` NeilBrown
2011-11-14 23:11               ` linbloke
2011-11-16  2:30                 ` NeilBrown [this message]
2011-11-21 21:50                   ` linbloke
     [not found]                 ` <CAGqmV7qpQBHLcJ9J9cP1zDw6kp6aLcaCMneFYEgcPOu7doXSMA@mail.gmail.com>
2011-11-16  3:07                   ` NeilBrown
2011-11-16  9:36                     ` CoolCold

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=20111116133045.2528310b@notabene.brown \
    --to=neilb@suse.de \
    --cc=coolthecold@gmail.com \
    --cc=john.robinson@anonymous.org.uk \
    --cc=linbloke@fastmail.fm \
    --cc=linux-raid@vger.kernel.org \
    --cc=paul.clements@us.sios.com \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).