From: Benjamin Marzinski <bmarzins@redhat.com>
To: Martin Wilck <martin.wilck@suse.com>
Cc: Christophe Varoqui <christophe.varoqui@opensvc.com>,
dm-devel@lists.linux.dev, Martin Wilck <mwilck@suse.com>
Subject: Re: [PATCH 03/13] multipathd: allow map removal in do_sync_mpp()
Date: Tue, 10 Dec 2024 18:30:57 -0500
Message-ID: <Z1jPMVejcI61qP9z@redhat.com>
In-Reply-To: <20241206233617.382200-4-mwilck@suse.com>
On Sat, Dec 07, 2024 at 12:36:07AM +0100, Martin Wilck wrote:
> We previously didn't allow map removal inside the checker loop. But
> with the late updates to the checkerloop code, it should be safe to orphan
> paths and delete maps even in this situation. We remove such maps everywhere
> else in the code already, whenever refresh_multipath() or setup_multipath()
> is called.
Actually, thinking about this more, what do we gain by proactively
deleting the multipath device if something goes wrong in the checker? If
we successfully reload a device but can't sync it with the kernel,
that's one thing. But that was triggered by a change in the device, and
we know that device-mapper was working when we reloaded it. I'm
leery of possibly deleting the map because of a transient device-mapper
issue. I'm not sure that a check we run repeatedly should delete the
device on an error. We haven't in the past, and as far as I know, that
hasn't caused problems.

Without a clear benefit to doing this, I'm not sure it makes sense.

-Ben
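
A minimal sketch of the conservative behaviour argued for above, roughly what the removed lines of the patch did: a failed periodic sync invalidates the cached path states but leaves the map in place, so the next checker pass can retry. All names here (`demo_map`, `sync_result`, `dm_state`, `demo_periodic_sync`) are hypothetical stand-ins, not the multipath-tools types or API, and the kernel result is passed in as a parameter instead of being queried.

```c
#include <stdbool.h>
#include <stddef.h>

/* Hypothetical stand-ins; these are NOT the multipath-tools types. */
enum sync_result { SYNC_OK, SYNC_ERROR };
enum dm_state { DMSTATE_UNDEF, DMSTATE_ACTIVE };

struct demo_map {
	const char *alias;
	enum dm_state path_state[4];
	size_t n_paths;
};

/*
 * Conservative policy: an error in the periodic sync only marks the
 * cached path states as unknown. The map itself is never removed here,
 * so a transient failure is retried on the next pass.
 * Returns true if the map is in sync with the (simulated) kernel state.
 */
static bool demo_periodic_sync(struct demo_map *m, enum sync_result res)
{
	size_t i;

	if (res == SYNC_OK) {
		/* Simplification: pretend the kernel reports every path active. */
		for (i = 0; i < m->n_paths; i++)
			m->path_state[i] = DMSTATE_ACTIVE;
		return true;
	}
	for (i = 0; i < m->n_paths; i++)
		m->path_state[i] = DMSTATE_UNDEF;
	return false;
}
```

With this policy, one failed pass followed by one successful pass leaves the map intact and its path states refreshed; the cost is that the cached state is unknown in between.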
>
> Signed-off-by: Martin Wilck <mwilck@suse.com>
> ---
> multipathd/main.c | 43 ++++++++++++++++++++-----------------------
> 1 file changed, 20 insertions(+), 23 deletions(-)
>
> diff --git a/multipathd/main.c b/multipathd/main.c
> index 4a28fbb..131dab6 100644
> --- a/multipathd/main.c
> +++ b/multipathd/main.c
> @@ -2446,34 +2446,30 @@ get_new_state(struct path *pp)
> return newstate;
> }
>
> -static void
> -do_sync_mpp(struct vectors * vecs, struct multipath *mpp)
> +/* Returns true if the mpp was deleted */
> +static int
> +do_sync_mpp(struct vectors *vecs, struct multipath *mpp)
> {
> - int i, ret;
> - struct path *pp;
> + int ret;
> +
> + ret = refresh_multipath(vecs, mpp);
> + if (ret)
> + return ret;
>
> - ret = update_multipath_strings(mpp, vecs->pathvec);
> - if (ret != DMP_OK) {
> - condlog(1, "%s: %s", mpp->alias, ret == DMP_NOT_FOUND ?
> - "device not found" :
> - "couldn't synchronize with kernel state");
> - vector_foreach_slot (mpp->paths, pp, i)
> - pp->dmstate = PSTATE_UNDEF;
> - return;
> - }
> set_no_path_retry(mpp);
> + return 0;
> }
>
> -static void
> +static int
> sync_mpp(struct vectors * vecs, struct multipath *mpp, unsigned int ticks)
> {
> if (mpp->sync_tick)
> mpp->sync_tick -= (mpp->sync_tick > ticks) ? ticks :
> mpp->sync_tick;
> if (mpp->sync_tick)
> - return;
> + return 0;
>
> - do_sync_mpp(vecs, mpp);
> + return do_sync_mpp(vecs, mpp);
> }
>
> static int
> @@ -2513,12 +2509,10 @@ update_path_state (struct vectors * vecs, struct path * pp)
> return handle_path_wwid_change(pp, vecs)? CHECK_PATH_REMOVED :
> CHECK_PATH_SKIPPED;
> }
> - if (pp->mpp->synced_count == 0) {
> - do_sync_mpp(vecs, pp->mpp);
> + if (pp->mpp->synced_count == 0 && do_sync_mpp(vecs, pp->mpp))
> /* if update_multipath_strings orphaned the path, quit early */
> - if (!pp->mpp)
> - return CHECK_PATH_SKIPPED;
> - }
> + return CHECK_PATH_SKIPPED;
> +
> if ((newstate != PATH_UP && newstate != PATH_GHOST &&
> newstate != PATH_PENDING) && (pp->state == PATH_DELAYED)) {
> /* If path state become failed again cancel path delay state */
> @@ -3018,8 +3012,11 @@ checkerloop (void *ap)
> mpp->synced_count = 0;
> if (checker_state == CHECKER_STARTING) {
> vector_foreach_slot(vecs->mpvec, mpp, i) {
> - sync_mpp(vecs, mpp, ticks);
> - mpp->prio_update = PRIO_UPDATE_NONE;
> + if (sync_mpp(vecs, mpp, ticks))
> + /* map deleted */
> + i--;
> + else
> + mpp->prio_update = PRIO_UPDATE_NONE;
> }
> vector_foreach_slot(vecs->pathvec, pp, i)
> pp->is_checked = CHECK_PATH_UNCHECKED;
> --
> 2.47.0
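
The `i--` in the last hunk of the quoted patch relies on a common pattern for deleting the current element while iterating by index. Below is a minimal standalone sketch of that pattern. It assumes a simple array-backed vector; `demo_vec`, `demo_del_slot` and `demo_sweep` are hypothetical names, not the multipath-tools vector API, and a negative value stands in for "sync failed, map deleted".

```c
#include <stddef.h>

#define DEMO_MAX 8

/* Hypothetical array-backed vector, for illustration only. */
struct demo_vec {
	int slot[DEMO_MAX];
	size_t len;
};

/* Remove slot i and shift the remaining entries down by one. */
static void demo_del_slot(struct demo_vec *v, size_t i)
{
	for (; i + 1 < v->len; i++)
		v->slot[i] = v->slot[i + 1];
	v->len--;
}

/*
 * Visit every entry and drop the negative ones. After a deletion the
 * next entry has moved into slot i, so i is stepped back before the
 * loop increment advances it again. The index is signed because it
 * becomes -1 for a moment when slot 0 is deleted.
 * Returns the number of entries visited.
 */
static int demo_sweep(struct demo_vec *v)
{
	int visited = 0;
	int i;

	for (i = 0; i < (int)v->len; i++) {
		visited++;
		if (v->slot[i] < 0) {
			demo_del_slot(v, (size_t)i);
			i--;
		}
	}
	return visited;
}
```

Without the decrement, the entry that slides into the freed slot would be skipped for that pass.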