Linux-NVME Archive on lore.kernel.org
 help / color / mirror / Atom feed
From: Sagi Grimberg <sagi@grimberg.me>
To: Hannes Reinecke <hare@suse.de>, Keith Busch <kbusch@kernel.org>
Cc: Christoph Hellwig <hch@lst.de>, Keith Busch <keith.busch@wdc.com>,
	linux-nvme@lists.infradead.org,
	Daniel Wagner <daniel.wagner@suse.de>
Subject: Re: [PATCHv3] nvme-mpath: delete disk after last connection
Date: Wed, 5 May 2021 13:40:29 -0700	[thread overview]
Message-ID: <8a274c79-6db8-a21b-e60e-4e73a9d139b5@grimberg.me> (raw)
In-Reply-To: <6a43fe69-e73a-07f8-f050-8116add83074@suse.de>


>>>>> As stated in the v3 review this is an incompatible change.  We'll need
>>>>> the queue_if_no_path attribute first, and default it to on to keep
>>>>> compatability.
>>>>>
>>>>
>>>> That is what I tried the last time, but the direction I got was to treat
>>>> both, NVMe-PCI and NVMe-oF identically:
>>>> (https://lore.kernel.org/linux-nvme/34e5c178-8bc4-68d3-8374-fbc1b451b6e8@grimberg.me/)
>>>
>>> Yes, I'm not sure I understand your comment Christoph. This addresses an
>>> issue with mdraid where hot unplug+replug does not restore the device to
>>> the raid group (pci and fabrics alike), where before multipath this used
>>> to work.
>>>
>>> queue_if_no_path is a dm-multipath feature so I'm not entirely clear
>>> what is the concern? mdraid on nvme (pci/fabrics) used to work a certain
>>> way, with the introduction of nvme-mpath the behavior was broken (as far
>>> as I understand from Hannes).
>>>
>>> My thinking is that if we want queue_if_no_path functionality in nvme
>>> mpath we should have it explicitly stated properly such that people
>>> that actually need it will use it and have mdraid function correctly
>>> again. Also, queue_if_no_path applies really if all the paths are
>>> gone in the sense they are completely removed, and doesn't apply
>>> to controller reset/reconnect.
>>>
>>> I agree we should probably have queue_if_no_path attribute on the
>>> mpath device, but it doesn't sound right to default it to true given
>>> that it breaks mdraid stacking on top of it..
>>
>> If you want "queue_if_no_path" behavior, can't you just set really high
>> reconnect_delay and ctrl_loss_tmo values? That prevents the path from
>> being deleted while it is unreachable, then restart IO on the existing
>> path once connection is re-established.
>>
> Precisely my thinking.
> We _could_ add a queue_if_no_path attribute, but we can also achieve the
> same behaviour by setting the ctrl_loss_tmo value to infinity.
> Provided we can change it on the fly, though; but it not that's easily
> fixed.
> 
> In fact, that's what we recommend to our customers to avoid the bug
> fixed by this patch.

You can change ctrl_loss_tmo on the fly. How does that address the
issue? the original issue is ctrl_loss_tmo expires for fabrics? or
pci unplug (which ctrl_loss_tmo does not apply to it)?

_______________________________________________
Linux-nvme mailing list
Linux-nvme@lists.infradead.org
http://lists.infradead.org/mailman/listinfo/linux-nvme

  reply	other threads:[~2021-05-05 20:40 UTC|newest]

Thread overview: 21+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2021-05-01 12:04 [PATCHv3] nvme-mpath: delete disk after last connection Hannes Reinecke
2021-05-04  8:54 ` Christoph Hellwig
2021-05-04 13:40   ` Hannes Reinecke
2021-05-04 19:54     ` Sagi Grimberg
2021-05-05 15:26       ` Keith Busch
2021-05-05 16:15         ` Hannes Reinecke
2021-05-05 20:40           ` Sagi Grimberg [this message]
2021-05-06  2:50             ` Keith Busch
2021-05-06  6:13             ` Hannes Reinecke
2021-05-06  7:43       ` Christoph Hellwig
2021-05-06  8:42         ` Hannes Reinecke
2021-05-06  9:47           ` Sagi Grimberg
2021-05-06 12:08             ` Christoph Hellwig
2021-05-06 15:54               ` Hannes Reinecke
2021-05-07  6:46                 ` Christoph Hellwig
2021-05-07 17:02                   ` Hannes Reinecke
2021-05-07 17:20                     ` Sagi Grimberg
2021-05-10  6:23                     ` Christoph Hellwig
2021-05-10 13:01                       ` Hannes Reinecke
2021-05-10 13:57                         ` Hannes Reinecke
2021-05-10 14:48                       ` Hannes Reinecke

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=8a274c79-6db8-a21b-e60e-4e73a9d139b5@grimberg.me \
    --to=sagi@grimberg.me \
    --cc=daniel.wagner@suse.de \
    --cc=hare@suse.de \
    --cc=hch@lst.de \
    --cc=kbusch@kernel.org \
    --cc=keith.busch@wdc.com \
    --cc=linux-nvme@lists.infradead.org \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox