From: Wei Yang <richard.weiyang@gmail.com>
To: osalvador@suse.de
Cc: Dave Hansen <dave.hansen@intel.com>,
Michal Hocko <mhocko@suse.com>,
Wei Yang <richard.weiyang@gmail.com>,
akpm@linux-foundation.org, linux-mm@kvack.org,
owner-linux-mm@kvack.org
Subject: Re: [PATCH] mm, sparse: drop pgdat_resize_lock in sparse_add/remove_one_section()
Date: Wed, 28 Nov 2018 00:29:52 +0000 [thread overview]
Message-ID: <20181128002952.x2m33nvlunzij5tk@master> (raw)
In-Reply-To: <4fe3f8203a35ea01c9e0ed87c361465e@suse.de>
On Tue, Nov 27, 2018 at 08:52:14AM +0100, osalvador@suse.de wrote:
>> I think mem_hotplug_lock protects this case these days, though. I don't
>> think we had it in the early days and were just slumming it with the
>> pgdat locks.
>
>Yes, it does.
>
>>
>> I really don't like the idea of removing the lock by just saying it
>> doesn't protect anything without doing some homework first, though. It
>> would actually be really nice to comment the entire call chain from the
>> mem_hotplug_lock acquisition to here. There is precious little
>> commenting in there and it could use some love.
>
>[hot-add operation]
>add_memory_resource : acquire mem_hotplug lock
> arch_add_memory
> add_pages
> __add_pages
> __add_section
> sparse_add_one_section
> sparse_init_one_section
>
>[hot-remove operation]
>__remove_memory : acquire mem_hotplug lock
> arch_remove_memory
> __remove_pages
> __remove_section
> sparse_remove_one_section
>
Thanks for this detailed analysis.
>Both operations are serialized by the mem_hotplug lock, so they cannot step
>on each other's feet.
>
>Now, there seems to be an agreement/thought to remove the global mem_hotplug
>lock, in favor of a range locking for hot-add/remove and online/offline
>stage.
>So, although removing the lock here is pretty straightforward, it does not
>really get us closer to that goal IMHO, if that is what we want to do in the
>end.
>
My current idea is :
we can try to get rid of global mem_hotplug_lock in logical
online/offline phase first, and leave the physical add/remove phase
serialized by mem_hotplug_lock for now.
There are two phase in memory hotplug:
* physical add/remove phase
* logical online/offline phase
Currently, both of them are protected by the global mem_hotplug_lock.
While get rid of the this in logical online/offline phase is a little
easier to me, since this procedure is protected by memory_block_dev's lock.
This ensures there is no pfn over lap during parallel processing.
The physical add/remove phase is a little harder, because it will touch
* memblock
* kernel page table
* node online
* sparse mem
And I don't see a similar lock as memory_block_dev's lock.
Below is the call trace for these two phase and I list some suspicious
point which is not safe without mem_hotplug_lock.
1. physical phase
__add_memory()
register_memory_resource() <- protected by resource_lock
add_memory_resource()
mem_hotplug_begin()
memblock_add_node() <- add to memblock.memory, not safe
__try_online_node() <- not safe, related to node_set_online()
arch_add_memory()
init_memory_mapping() <- not safe
add_pages()
__add_pages()
__add_section()
sparse_add_one_section()
update_end_of_memory_vars() <- not safe
node_set_online(nid) <- need to hold mem_hotplug
__register_one_node(nid)
link_mem_sections()
firmware_map_add_hotplug()
mem_hotplug_done()
2. logical phase
device_lock(memory_block_dev)
online_pages()
mem_hotplug_begin()
mem = find_memory_block() <- not
zone = move_pfn_range()
zone_for_pfn_range();
move_pfn_range_to_zone()
!populated_zone()
setup_zone_pageset(zone)
online_pages_range() <- looks safe
build_all_zonelists() <- not
init_per_zone_wmark_min() <- not
kswapd_run() <- may not
vm_total_pages = nr_free_pagecache_pages()
mem_hotplug_done()
device_unlock(memory_block_dev)
--
Wei Yang
Help you, Help me
next prev parent reply other threads:[~2018-11-28 0:29 UTC|newest]
Thread overview: 38+ messages / expand[flat|nested] mbox.gz Atom feed top
2018-11-27 2:36 [PATCH] mm, sparse: drop pgdat_resize_lock in sparse_add/remove_one_section() Wei Yang
2018-11-27 6:25 ` Michal Hocko
2018-11-27 7:17 ` Dave Hansen
2018-11-27 7:30 ` Michal Hocko
2018-11-27 7:52 ` osalvador
2018-11-27 8:00 ` Michal Hocko
2018-11-27 8:18 ` osalvador
2018-11-28 0:29 ` Wei Yang [this message]
2018-11-28 8:19 ` Oscar Salvador
2018-11-28 8:41 ` Wei Yang
2018-11-28 1:01 ` Wei Yang
2018-11-28 8:47 ` Wei Yang
2018-11-28 9:17 ` Wei Yang
2018-11-28 12:34 ` Michal Hocko
2018-11-28 9:12 ` [PATCH v2] " Wei Yang
2018-11-28 10:28 ` David Hildenbrand
2018-11-29 8:54 ` Michal Hocko
2018-11-29 9:29 ` Wei Yang
2018-11-29 15:53 ` [PATCH v3 1/2] " Wei Yang
2018-11-29 15:53 ` [PATCH v3 2/2] mm, sparse: pass nid instead of pgdat to sparse_add_one_section() Wei Yang
2018-11-29 16:01 ` David Hildenbrand
2018-11-30 1:22 ` Wei Yang
2018-11-30 9:20 ` David Hildenbrand
2018-11-29 17:15 ` Michal Hocko
2018-11-29 23:57 ` Wei Yang
2018-11-29 16:06 ` [PATCH v3 1/2] mm, sparse: drop pgdat_resize_lock in sparse_add/remove_one_section() David Hildenbrand
2018-11-29 17:17 ` Michal Hocko
2018-11-30 4:28 ` Wei Yang
2018-11-30 9:19 ` David Hildenbrand
2018-11-30 9:52 ` Michal Hocko
2018-12-04 8:53 ` Wei Yang
2018-12-01 0:31 ` Wei Yang
2018-12-03 11:25 ` David Hildenbrand
2018-12-03 21:06 ` Wei Yang
2018-11-29 17:14 ` Michal Hocko
2018-12-04 8:56 ` [PATCH v4 " Wei Yang
2018-12-04 8:56 ` [PATCH v4 2/2] mm, sparse: pass nid instead of pgdat to sparse_add_one_section() Wei Yang
2018-12-04 9:24 ` [PATCH v4 1/2] mm, sparse: drop pgdat_resize_lock in sparse_add/remove_one_section() David Hildenbrand
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=20181128002952.x2m33nvlunzij5tk@master \
--to=richard.weiyang@gmail.com \
--cc=akpm@linux-foundation.org \
--cc=dave.hansen@intel.com \
--cc=linux-mm@kvack.org \
--cc=mhocko@suse.com \
--cc=osalvador@suse.de \
--cc=owner-linux-mm@kvack.org \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).