From: Sunil Mushran <sunil.mushran@oracle.com>
To: ocfs2-devel@oss.oracle.com
Subject: [Ocfs2-devel] [PATCH] remove lockres from purge list when we are getting it for creating lock
Date: Thu, 09 Jun 2011 11:34:17 -0700
Message-ID: <4DF11229.2070706@oracle.com>
In-Reply-To: <4DF0576F.1010002@oracle.com>

Different issue. That patch plugged a hole when the master was remote.
In this case the master is local.

In this case, the window is very tiny: we not only need recovery to
kick in within that window, we also need the lockres to be moved to the
purge list within that window.

But it just so happens that ovm, via userdlm, hits createlock a lot, making
this issue more likely to trigger than with the fs.

On 06/08/2011 10:17 PM, Srinivas Eeda wrote:
> I think I have seen this problem in ocfs2-1.2, and it was addressed by
> using a new state, DLM_LOCK_RES_IN_USE. But we didn't merge it into mainline,
> as Sunil suggested we look for a different approach.
>
> http://oss.oracle.com/pipermail/ocfs2-devel/2010-June/006669.html
> http://www.mail-archive.com/ocfs2-devel at oss.oracle.com/msg05733.html
>
> one comment inline
>
> On 6/8/2011 3:04 AM, Wengang Wang wrote:
>> When we are about to purge a lockres, we check whether it is unused.
>> The check includes:
>> 1. no locks attached to the lockres.
>> 2. not dirty.
>> 3. not in recovery.
>> 4. no interested nodes in refmap.
>> If the above four are satisfied, we go ahead and purge it (remove it
>> from the hash table).
>>
>> However, while a "create lock" is in progress, especially when the lockres is
>> owned remotely (spinlocks are dropped while networking), the lockres can satisfy
>> the above four conditions. If it is put on the purge list, it can be purged from
>> the hash table while we are still accessing it (sending a request to the owner,
>> for example). That's not what we want.
>>
>> I have hit such a problem (orabug 12405575).
>> The lockres is already on the purge list (there is a delay before the real purge
>> work) when the create lock request comes. While we are sending the network
>> message to the owner in dlmlock_remote(), the owner crashes. So we get
>> DLM_RECOVERING as the return value and retry dlmlock_remote(). But before the
>> owner crashed, we had purged the lockres, so the lockres becomes stale (on
>> lockres->owner). Thus the code calls dlmlock_remote() infinitely.
>>
>> fix:
>> In dlm_get_lock_resource(), which is called only for the createlock case, we
>> remove the lockres from the purge list if it is there, so that the lockres
>> can't be purged while a createlock is in progress.
>>
>> Signed-off-by: Wengang Wang <wen.gang.wang@oracle.com>
>> ---
>>    fs/ocfs2/dlm/dlmmaster.c |   41 +++++++++++++++++++++++++++++------------
>>    1 files changed, 29 insertions(+), 12 deletions(-)
>>
>> diff --git a/fs/ocfs2/dlm/dlmmaster.c b/fs/ocfs2/dlm/dlmmaster.c
>> index 11eefb8..511d43c 100644
>> --- a/fs/ocfs2/dlm/dlmmaster.c
>> +++ b/fs/ocfs2/dlm/dlmmaster.c
>> @@ -709,28 +709,27 @@ lookup:
>>    	spin_lock(&dlm->spinlock);
>>    	tmpres = __dlm_lookup_lockres_full(dlm, lockid, namelen, hash);
>>    	if (tmpres) {
>> -		int dropping_ref = 0;
>> -
>> -		spin_unlock(&dlm->spinlock);
>> -
>>    		spin_lock(&tmpres->spinlock);
>>    		/* We wait for the other thread that is mastering the resource */
>>    		if (tmpres->owner == DLM_LOCK_RES_OWNER_UNKNOWN) {
>> +			spin_unlock(&dlm->spinlock);
>>    			__dlm_wait_on_lockres(tmpres);
>>    			BUG_ON(tmpres->owner == DLM_LOCK_RES_OWNER_UNKNOWN);
>> +			spin_unlock(&tmpres->spinlock);
>> +			dlm_lockres_put(tmpres);
>> +			tmpres = NULL;
>> +			goto lookup;
>>    		}
>>
>>    		if (tmpres->owner == dlm->node_num) {
>>    			BUG_ON(tmpres->state & DLM_LOCK_RES_DROPPING_REF);
>>    			dlm_lockres_grab_inflight_ref(dlm, tmpres);
>> -		} else if (tmpres->state & DLM_LOCK_RES_DROPPING_REF)
>> -			dropping_ref = 1;
>> -		spin_unlock(&tmpres->spinlock);
>> -
>> -		/* wait until done messaging the master, drop our ref to allow
>> -		 * the lockres to be purged, start over. */
>> -		if (dropping_ref) {
>> -			spin_lock(&tmpres->spinlock);
>> +		} else if (tmpres->state & DLM_LOCK_RES_DROPPING_REF) {
>> +			/*
>> +			 * wait until done messaging the master, drop our ref to
>> +			 * allow the lockres to be purged, start over.
>> +			 */
>> +			spin_unlock(&dlm->spinlock);
>>    			__dlm_wait_on_lockres_flags(tmpres, DLM_LOCK_RES_DROPPING_REF);
>>    			spin_unlock(&tmpres->spinlock);
>>    			dlm_lockres_put(tmpres);
>> @@ -739,6 +738,24 @@ lookup:
>>    		}
>>
>>    		mlog(0, "found in hash!\n");
>> +		/*
>> +		 * we are going to do a create-lock next. so remove the lockres
>> +		 * from purge list to avoid the case that we will access on the
>> +		 * lockres which is already purged out from hash table in
>> +		 * dlm_run_purge_list() path.
>> +		 * otherwise, we could run into a problem:
>> +		 * the owner dead(recovery didn't take care of this lockres
>> +		 * since it's not in hashtable), and the code keeps sending
>> +		 * request to the dead node and getting DLM_RECOVERING and
>> +		 * then retrying infinitely.
>> +		 */
>> +		if (!list_empty(&tmpres->purge)) {
>> +			list_del_init(&tmpres->purge);
>> +			dlm_lockres_put(tmpres);
>> +		}
>> +		
>> +		spin_unlock(&tmpres->spinlock);
>> +		spin_unlock(&dlm->spinlock);
> The lockres could still get added to the purge list at this point, and we
> could still have the same problem? I think we need some mechanism here that
> marks the lockres as in use, which would protect it from being added to the
> purge list.
>>    		if (res)
>>    			dlm_lockres_put(res);
>>    		res = tmpres;
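
To make the discussion above concrete, here is a rough user-space sketch of
the four-condition unused check from the patch description, the purge-list
removal the patch performs, and an in-use marker of the kind the review
comment asks for. It is not the kernel code: struct lockres_model, its
fields, and every function name below are hypothetical, and locking is left
out entirely.

```c
#include <assert.h>
#include <stdbool.h>

/* Hypothetical, simplified stand-in for a DLM lock resource. */
struct lockres_model {
	int  nlocks;        /* 1. locks attached to the lockres */
	bool dirty;         /* 2. marked dirty */
	bool recovering;    /* 3. in recovery */
	int  refmap_nodes;  /* 4. interested nodes in the refmap */
	int  in_use;        /* assumption: create-lock operations in flight */
	bool on_purge_list; /* queued for the delayed purge */
};

/* The four conditions from the patch description, plus the in-use marker. */
static bool lockres_unused(const struct lockres_model *res)
{
	if (res->nlocks > 0)
		return false;
	if (res->dirty)
		return false;
	if (res->recovering)
		return false;
	if (res->refmap_nodes > 0)
		return false;
	if (res->in_use > 0)	/* not part of the patch; the review suggestion */
		return false;
	return true;
}

/* Model of the path that queues an unused lockres for purging. */
static void maybe_queue_for_purge(struct lockres_model *res)
{
	if (lockres_unused(res))
		res->on_purge_list = true;
}

/* Model of the create-lock lookup path. */
static void get_for_createlock(struct lockres_model *res)
{
	res->on_purge_list = false;	/* what the patch does */
	res->in_use++;			/* keeps it from being re-queued */
}

/* Called once the create-lock attempt has finished. */
static void put_after_createlock(struct lockres_model *res)
{
	assert(res->in_use > 0);
	res->in_use--;
}
```

Without the in_use check, a call to maybe_queue_for_purge() right after
get_for_createlock() would put the lockres straight back on the purge list,
which is the hole pointed out in the inline comment.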
