From: Andrea Arcangeli <aarcange@redhat.com>
To: Zhou Chengming <zhouchengming1@huawei.com>
Cc: akpm@linux-foundation.org, hughd@google.com,
kirill.shutemov@linux.intel.com, vbabka@suse.cz,
geliangtang@163.com, minchan@kernel.org, linux-mm@kvack.org,
linux-kernel@vger.kernel.org, guohanjun@huawei.com,
dingtianhong@huawei.com, huawei.libin@huawei.com,
thunder.leizhen@huawei.com, qiuxishi@huawei.com
Subject: Re: [PATCH v2] ksm: fix conflict between mmput and scan_get_next_rmap_item
Date: Fri, 6 May 2016 16:24:31 +0200 [thread overview]
Message-ID: <20160506142431.GA4855@redhat.com> (raw)
In-Reply-To: <1462505256-37301-1-git-send-email-zhouchengming1@huawei.com>
On Fri, May 06, 2016 at 11:27:36AM +0800, Zhou Chengming wrote:
> @@ -1650,16 +1647,22 @@ next_mm:
> */
> hash_del(&slot->link);
> list_del(&slot->mm_list);
> - spin_unlock(&ksm_mmlist_lock);
>
> free_mm_slot(slot);
> clear_bit(MMF_VM_MERGEABLE, &mm->flags);
> up_read(&mm->mmap_sem);
> mmdrop(mm);
> } else {
> - spin_unlock(&ksm_mmlist_lock);
> up_read(&mm->mmap_sem);
> }
> + /*
> + * up_read(&mm->mmap_sem) first because after
> + * spin_unlock(&ksm_mmlist_lock) run, the "mm" may
> + * already have been freed under us by __ksm_exit()
> + * because the "mm_slot" is still hashed and
> + * ksm_scan.mm_slot doesn't point to it anymore.
> + */
> + spin_unlock(&ksm_mmlist_lock);
>
> /* Repeat until we've completed scanning the whole list */
> slot = ksm_scan.mm_slot;
Reviewed-by: Andrea Arcangeli <aarcange@redhat.com>
While the above patch is correct, I would however prefer if you could
update it to keep releasing the ksm_mmlist_lock as before (I'm talking
only about the quoted part, not the other one not quoted), because
it's "strictier" and it better documents that it's only needed up
until:
hash_del(&slot->link);
list_del(&slot->mm_list);
It should be also a bit more scalable but to me this is just about
keeping implicit documentation on the locking by keeping it strict.
The fact up_read happens exactly after clear_bit also avoided me to
overlook that it was really needed, same thing with the
ksm_mmlist_lock after list_del, I'd like to keep it there and just
invert the order of spin_unlock; up_read in the else branch.
That should be enough because after hash_del get_mm_slot will return
NULL so the mmdrop will not happen anymore in __ksm_exit, this is
further explicit by the code doing mmdrop itself just after
up_read.
The SMP race condition is fixed by just the two liner that reverse the
order of spin_unlock; up_read without increasing the size of the
spinlock critical section for the ksm_scan.address == 0 case. This is
also why it wasn't reproducible because it's about 1 instruction window.
Thanks!
Andrea
--
To unsubscribe, send a message with 'unsubscribe linux-mm' in
the body to majordomo@kvack.org. For more info on Linux MM,
see: http://www.linux-mm.org/ .
Don't email: <a href=mailto:"dont@kvack.org"> email@kvack.org </a>
WARNING: multiple messages have this Message-ID (diff)
From: Andrea Arcangeli <aarcange@redhat.com>
To: Zhou Chengming <zhouchengming1@huawei.com>
Cc: akpm@linux-foundation.org, hughd@google.com,
kirill.shutemov@linux.intel.com, vbabka@suse.cz,
geliangtang@163.com, minchan@kernel.org, linux-mm@kvack.org,
linux-kernel@vger.kernel.org, guohanjun@huawei.com,
dingtianhong@huawei.com, huawei.libin@huawei.com,
thunder.leizhen@huawei.com, qiuxishi@huawei.com
Subject: Re: [PATCH v2] ksm: fix conflict between mmput and scan_get_next_rmap_item
Date: Fri, 6 May 2016 16:24:31 +0200 [thread overview]
Message-ID: <20160506142431.GA4855@redhat.com> (raw)
In-Reply-To: <1462505256-37301-1-git-send-email-zhouchengming1@huawei.com>
On Fri, May 06, 2016 at 11:27:36AM +0800, Zhou Chengming wrote:
> @@ -1650,16 +1647,22 @@ next_mm:
> */
> hash_del(&slot->link);
> list_del(&slot->mm_list);
> - spin_unlock(&ksm_mmlist_lock);
>
> free_mm_slot(slot);
> clear_bit(MMF_VM_MERGEABLE, &mm->flags);
> up_read(&mm->mmap_sem);
> mmdrop(mm);
> } else {
> - spin_unlock(&ksm_mmlist_lock);
> up_read(&mm->mmap_sem);
> }
> + /*
> + * up_read(&mm->mmap_sem) first because after
> + * spin_unlock(&ksm_mmlist_lock) run, the "mm" may
> + * already have been freed under us by __ksm_exit()
> + * because the "mm_slot" is still hashed and
> + * ksm_scan.mm_slot doesn't point to it anymore.
> + */
> + spin_unlock(&ksm_mmlist_lock);
>
> /* Repeat until we've completed scanning the whole list */
> slot = ksm_scan.mm_slot;
Reviewed-by: Andrea Arcangeli <aarcange@redhat.com>
While the above patch is correct, I would however prefer if you could
update it to keep releasing the ksm_mmlist_lock as before (I'm talking
only about the quoted part, not the other one not quoted), because
it's "strictier" and it better documents that it's only needed up
until:
hash_del(&slot->link);
list_del(&slot->mm_list);
It should be also a bit more scalable but to me this is just about
keeping implicit documentation on the locking by keeping it strict.
The fact up_read happens exactly after clear_bit also avoided me to
overlook that it was really needed, same thing with the
ksm_mmlist_lock after list_del, I'd like to keep it there and just
invert the order of spin_unlock; up_read in the else branch.
That should be enough because after hash_del get_mm_slot will return
NULL so the mmdrop will not happen anymore in __ksm_exit, this is
further explicit by the code doing mmdrop itself just after
up_read.
The SMP race condition is fixed by just the two liner that reverse the
order of spin_unlock; up_read without increasing the size of the
spinlock critical section for the ksm_scan.address == 0 case. This is
also why it wasn't reproducible because it's about 1 instruction window.
Thanks!
Andrea
next prev parent reply other threads:[~2016-05-06 14:24 UTC|newest]
Thread overview: 6+ messages / expand[flat|nested] mbox.gz Atom feed top
2016-05-06 3:27 [PATCH v2] ksm: fix conflict between mmput and scan_get_next_rmap_item Zhou Chengming
2016-05-06 3:27 ` Zhou Chengming
2016-05-06 14:24 ` Andrea Arcangeli [this message]
2016-05-06 14:24 ` Andrea Arcangeli
2016-05-08 6:03 ` zhouchengming
2016-05-08 6:03 ` zhouchengming
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=20160506142431.GA4855@redhat.com \
--to=aarcange@redhat.com \
--cc=akpm@linux-foundation.org \
--cc=dingtianhong@huawei.com \
--cc=geliangtang@163.com \
--cc=guohanjun@huawei.com \
--cc=huawei.libin@huawei.com \
--cc=hughd@google.com \
--cc=kirill.shutemov@linux.intel.com \
--cc=linux-kernel@vger.kernel.org \
--cc=linux-mm@kvack.org \
--cc=minchan@kernel.org \
--cc=qiuxishi@huawei.com \
--cc=thunder.leizhen@huawei.com \
--cc=vbabka@suse.cz \
--cc=zhouchengming1@huawei.com \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.