linux-raid.vger.kernel.org archive mirror
 help / color / mirror / Atom feed
From: Christoph Hellwig <hch@infradead.org>
To: Shaohua Li <shli@fb.com>
Cc: Christoph Hellwig <hch@infradead.org>,
	linux-raid@vger.kernel.org, Kernel-team@fb.com,
	songliubraving@fb.com, dan.j.williams@intel.com, neilb@suse.de
Subject: Re: [PATCH 0/5] a caching layer for raid 5/6
Date: Tue, 12 May 2015 00:18:54 -0700	[thread overview]
Message-ID: <20150512071854.GA16731@infradead.org> (raw)
In-Reply-To: <20150511160349.GA545551@devbig257.prn2.facebook.com>

On Mon, May 11, 2015 at 09:03:51AM -0700, Shaohua Li wrote:
> >   - What is the reason for retry_bio_list?  If a driver returns an
> >     I/O error to the higher levels it already has retried and came
> >     to the conclusion this is a permanent error.
> 
> The retry_bio_list is to handle io to cache disk. If IO to cache disk
> has error, it's not a permanent error here. The cache disk is a cache,
> We can still dispatch the IO to its final destination, the raid disks.

How does this work in practice?  We've filled our cache disk with
dirty data, and it now returns non-correctable write errors.  At this
point we had claimed to caller that data is on stable disk, but our
cache disk is toast now.  Is it really a good idea to now start a large
window where we do not actually have the cache data on stable storage
we can get back at but pretent business as usual?

IMHO the only sane way is to shut down the array when write to the cache
disk fail.  Hopefully the disk will still allow reading from it.  Note
that to be on the safe side you'll need a mirrored cache disk anyway.


  reply	other threads:[~2015-05-12  7:18 UTC|newest]

Thread overview: 11+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2015-05-06 23:57 [PATCH 0/5] a caching layer for raid 5/6 Shaohua Li
2015-05-06 23:57 ` [PATCH 1/5] MD: add a new disk role to present cache device Shaohua Li
2015-05-06 23:57 ` [PATCH 2/5] raid5: directly use mddev->queue Shaohua Li
2015-05-06 23:57 ` [PATCH 3/5] A caching layer for RAID5/6 Shaohua Li
2015-05-07 16:52   ` Christoph Hellwig
2015-05-06 23:57 ` [PATCH 4/5] raid5-cache: add some sysfs entries Shaohua Li
2015-05-06 23:57 ` [PATCH 5/5] md: don't allow resize/reshape with cache support Shaohua Li
2015-05-11 12:23 ` [PATCH 0/5] a caching layer for raid 5/6 Christoph Hellwig
2015-05-11 16:03   ` Shaohua Li
2015-05-12  7:18     ` Christoph Hellwig [this message]
2015-05-12 15:23       ` Shaohua Li

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=20150512071854.GA16731@infradead.org \
    --to=hch@infradead.org \
    --cc=Kernel-team@fb.com \
    --cc=dan.j.williams@intel.com \
    --cc=linux-raid@vger.kernel.org \
    --cc=neilb@suse.de \
    --cc=shli@fb.com \
    --cc=songliubraving@fb.com \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).