ocfs2-devel.oss.oracle.com archive mirror
 help / color / mirror / Atom feed
From: Mark Fasheh <mfasheh@suse.de>
To: ocfs2-devel@oss.oracle.com
Subject: [Ocfs2-devel] [patch 5/8] ocfs2: do not return	DLM_MIGRATE_RESPONSE_MASTERY_REF to avoid endless, loop during umount
Date: Sun, 30 Mar 2014 19:23:42 -0700	[thread overview]
Message-ID: <20140331022342.GE4488@wotan.suse.de> (raw)
In-Reply-To: <20140319211004.23C7F31C1E1@corp2gmr1-1.hot.corp.google.com>

On Wed, Mar 19, 2014 at 02:10:03PM -0700, Andrew Morton wrote:
> From: jiangyiwen <jiangyiwen@huawei.com>
> Subject: ocfs2: do not return DLM_MIGRATE_RESPONSE_MASTERY_REF to avoid endless,loop during umount
> 
> The following case may lead to endless loop during umount.
> 
> node A         node B               node C       node D
> umount volume,
> migrate lockres1
> to B
>                                                  want to lock lockres1,
>                                                  send
>                                                  MASTER_REQUEST_MSG
>                                                  to C
>                                     init block mle
>                send
>                MIGRATE_REQUEST_MSG
>                to C
>                                     find a block
>                                     mle, and then
>                                     return
>                                     DLM_MIGRATE_RESPONSE_MASTERY_REF
>                                     to B
>                set C in refmap
>                                     umount successfully
>                try to umount, endless
>                loop occurs when migrate
>                lockres1 since C is in
>                refmap
> 
> So we can fix this endless loop case by only returning
> DLM_MIGRATE_RESPONSE_MASTERY_REF if it has a mastery mle when receiving
> MIGRATE_REQUEST_MSG.
> 
> [akpm at linux-foundation.org: coding-style fixes]
> Signed-off-by: jiangyiwen <jiangyiwen@huawei.com>
> Cc: Mark Fasheh <mfasheh@suse.com>
> Cc: Joel Becker <jlbec@evilplan.org>
> Cc: Xue jiufei <xuejiufei@huawei.com>
> Signed-off-by: Andrew Morton <akpm@linux-foundation.org>

Ok, I _think_ I got this race condition, and the patch itself seems sane.

How was this bug hit, and how much testing did you do with this patch? I ask
because dlm changes can sometimes have unintended effects and I really don't
understand that particular code well enough right now to tell with 100%
certainty we didn't mess something else up.

Actually, I'm going to CC Sunil in the hopes he can look at this.
	--Mark


> ---
> 
>  fs/ocfs2/dlm/dlmmaster.c |   14 +++++++++-----
>  1 file changed, 9 insertions(+), 5 deletions(-)
> 
> diff -puN fs/ocfs2/dlm/dlmmaster.c~ocfs2-do-not-return-dlm_migrate_response_mastery_ref-to-avoid-endlessloop-during-umount fs/ocfs2/dlm/dlmmaster.c
> --- a/fs/ocfs2/dlm/dlmmaster.c~ocfs2-do-not-return-dlm_migrate_response_mastery_ref-to-avoid-endlessloop-during-umount
> +++ a/fs/ocfs2/dlm/dlmmaster.c
> @@ -3084,11 +3084,15 @@ static int dlm_add_migration_mle(struct
>  			/* remove it so that only one mle will be found */
>  			__dlm_unlink_mle(dlm, tmp);
>  			__dlm_mle_detach_hb_events(dlm, tmp);
> -			ret = DLM_MIGRATE_RESPONSE_MASTERY_REF;
> -			mlog(0, "%s:%.*s: master=%u, newmaster=%u, "
> -			    "telling master to get ref for cleared out mle "
> -			    "during migration\n", dlm->name, namelen, name,
> -			    master, new_master);
> +			if (tmp->type == DLM_MLE_MASTER) {
> +				ret = DLM_MIGRATE_RESPONSE_MASTERY_REF;
> +				mlog(0, "%s:%.*s: master=%u, newmaster=%u, "
> +						"telling master to get ref "
> +						"for cleared out mle during "
> +						"migration\n", dlm->name,
> +						namelen, name, master,
> +						new_master);
> +			}
>  		}
>  		spin_unlock(&tmp->spinlock);
>  	}
> _
--
Mark Fasheh

      reply	other threads:[~2014-03-31  2:23 UTC|newest]

Thread overview: 2+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2014-03-19 21:10 [Ocfs2-devel] [patch 5/8] ocfs2: do not return DLM_MIGRATE_RESPONSE_MASTERY_REF to avoid endless, loop during umount akpm at linux-foundation.org
2014-03-31  2:23 ` Mark Fasheh [this message]

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=20140331022342.GE4488@wotan.suse.de \
    --to=mfasheh@suse.de \
    --cc=ocfs2-devel@oss.oracle.com \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).