linux-btrfs.vger.kernel.org archive mirror
 help / color / mirror / Atom feed
From: Anand Jain <anand.jain@oracle.com>
To: Kai Krakow <hurikhan77@gmail.com>, linux-btrfs@vger.kernel.org
Subject: Re: [PATCH 00/13 v3] Introduce device state 'failed', Hot spare and Auto replace
Date: Mon, 4 Apr 2016 14:19:23 +0800	[thread overview]
Message-ID: <5702076B.3050108@oracle.com> (raw)
In-Reply-To: <20160404020043.47ad35bf@jupiter.sol.kaishome.de>



On 04/04/2016 08:00 AM, Kai Krakow wrote:
> Am Sat,  2 Apr 2016 09:30:38 +0800
> schrieb Anand Jain <anand.jain@oracle.com>:
>
>> Auto replace:
>>   Replace happens automatically, that is when there is any write
>>   failed or flush failed, the device will be marked as failed, which
>>   will stop any further IO attempt to that device. And in the next
>>   commit cycle the auto replace will pick the spare device to
>>   replace the failed device. And so the btrfs volume is back to a
>>   healthy state.
>
> Does this also implement "copy-back" - thus, it returns the hot-spare
> device to global hot-spares when the failed device has been replaced?

  Actually no. That means +1x more IO.

> Traditionally, the wording "hot spare" implies that the storage
> system runs a copy-back operation when the failing device has been
> replaced. The "hot spare" device then returns to its original "hot
> spare" function: sitting and waiting to jump in place...



> This also helps to keep your drives order in your storage enclosure.
> Without copy-back, the association between member drive and storage
> enclosure would fluctuate over time and you'd have to constantly update
> documentation.
> Add many identical systems, and you have a lot of
> different system configurations over time with individual
> documentation. This becomes confusing at best, and turns out
> dangerously probably (when pulling the wrong drive due to confusion).

   Noted and I agree, its always been very challenging in this area
   at the data centers.

   But strictly speaking its an enclosure services RFC to
   mitigate the issue of poor SW/CLI UI with hardware slots.
   ZFSSA HW added few more LEDs to help, I am not too sure
   but other recent HWs should have / might have enhancements
   in this area to help easy mapping/management.
   Further for the reason above, we shouldn't fix the problem at
   the wrong side, copy-back involves more IO, but.. I agree
   naming can be better to avoid confusion as such.

> Otherwise, I find "hot spare" misleading and it should be renamed.

  I never thought hot spare would be narrowed to such a specifics.
  When I wrote Solaris cpqarray (it was a very long time back
  though) I remember some RAID+JBOD HW wanted a strict disks ordering
  for _performance-reasons_, which made stronger reasons to have
  copy-back.

  Further in the long term what we need in btrfs is disk based tagging
  and grouping, that is it will help to create a btrfs across mixed
  types of JBODs, and then I hope it shall take care of narrower
  group based spare assignments.

  If there is any further reasons that copy-back should be a feature
  I think its ok to have it as an optional feature.

  About the naming.. the progs called it 'global spare' (device),
  kernel calls is 'spare'. Sorry this email thread called it
  hot spare. I should have paid little more attention here to maintain
  consistency.

  Thanks for the note.

Anand

  parent reply	other threads:[~2016-04-04  6:20 UTC|newest]

Thread overview: 26+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2016-04-02  1:30 [PATCH 00/13 v3] Introduce device state 'failed', Hot spare and Auto replace Anand Jain
2016-04-02  1:30 ` [PATCH 01/13] btrfs: Introduce a new function to check if all chunks a OK for degraded mount Anand Jain
2016-04-02  1:30 ` [PATCH 02/13] btrfs: Do per-chunk check for mount time check Anand Jain
2016-04-02  1:30 ` [PATCH 03/13] btrfs: Do per-chunk degraded check for remount Anand Jain
2016-04-02  1:30 ` [PATCH 04/13] btrfs: Allow barrier_all_devices to do per-chunk device check Anand Jain
2016-04-02  1:30 ` [PATCH 05/13] btrfs: Cleanup num_tolerated_disk_barrier_failures Anand Jain
2016-04-02  1:30 ` [PATCH 06/13] btrfs: introduce BTRFS_FEATURE_INCOMPAT_SPARE_DEV Anand Jain
2016-04-02  1:30 ` [PATCH 07/13] btrfs: add check not to mount a spare device Anand Jain
2016-04-02  1:30 ` [PATCH 08/13] btrfs: support btrfs dev scan for " Anand Jain
2016-04-02  1:30 ` [PATCH 09/13] btrfs: provide framework to get and put a " Anand Jain
2016-04-02  1:30 ` [PATCH 10/13] btrfs: introduce helper functions to perform hot replace Anand Jain
2016-04-02  5:40   ` kbuild test robot
2016-04-07 20:00   ` Yauhen Kharuzhy
2016-04-08  3:58     ` Anand Jain
2016-04-08 22:05   ` Yauhen Kharuzhy
2016-04-12 14:16     ` Anand Jain
2016-04-02  1:30 ` [PATCH 11/13] btrfs: introduce device dynamic state transition to offline or failed Anand Jain
2016-04-02  1:30 ` [PATCH 12/13] btrfs: check device for critical errors and mark failed Anand Jain
2016-04-02  1:30 ` [PATCH 13/13] btrfs: check for failed device and hot replace Anand Jain
2016-04-04  0:00 ` [PATCH 00/13 v3] Introduce device state 'failed', Hot spare and Auto replace Kai Krakow
2016-04-04  4:45   ` Duncan
2016-04-04  6:09     ` Duncan
2016-04-04 20:15     ` Kai Krakow
2016-04-05  3:02       ` Duncan
2016-04-04  6:19   ` Anand Jain [this message]
2016-04-04 20:07     ` Kai Krakow

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=5702076B.3050108@oracle.com \
    --to=anand.jain@oracle.com \
    --cc=hurikhan77@gmail.com \
    --cc=linux-btrfs@vger.kernel.org \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).