From: "Huang\, Ying" <ying.huang@intel.com>
To: Johannes Weiner <hannes@cmpxchg.org>
Cc: "Huang, Ying" <ying.huang@intel.com>,
Andrew Morton <akpm@linux-foundation.org>,
linux-mm@kvack.org, linux-kernel@vger.kernel.org
Subject: Re: [PATCH -mm -v7 9/9] mm, THP, swap: Delay splitting THP during swap out
Date: Thu, 30 Mar 2017 12:15:13 +0800 [thread overview]
Message-ID: <871stftn72.fsf@yhuang-dev.intel.com> (raw)
In-Reply-To: <20170329171654.GD31821@cmpxchg.org> (Johannes Weiner's message of "Wed, 29 Mar 2017 13:16:54 -0400")
Johannes Weiner <hannes@cmpxchg.org> writes:
> On Tue, Mar 28, 2017 at 01:32:09PM +0800, Huang, Ying wrote:
>> @@ -183,12 +184,53 @@ void __delete_from_swap_cache(struct page *page)
>> ADD_CACHE_INFO(del_total, nr);
>> }
>>
>> +#ifdef CONFIG_THP_SWAP_CLUSTER
>> +int add_to_swap_trans_huge(struct page *page, struct list_head *list)
>> +{
>> + swp_entry_t entry;
>> + int ret = 0;
>> +
>> + /* cannot split, which may be needed during swap in, skip it */
>> + if (!can_split_huge_page(page, NULL))
>> + return -EBUSY;
>> + /* fallback to split huge page firstly if no PMD map */
>> + if (!compound_mapcount(page))
>> + return 0;
>> + entry = get_huge_swap_page();
>> + if (!entry.val)
>> + return 0;
>> + if (mem_cgroup_try_charge_swap(page, entry, HPAGE_PMD_NR)) {
>> + __swapcache_free(entry, true);
>> + return -EOVERFLOW;
>> + }
>> + ret = add_to_swap_cache(page, entry,
>> + __GFP_HIGH | __GFP_NOMEMALLOC|__GFP_NOWARN);
>> + /* -ENOMEM radix-tree allocation failure */
>> + if (ret) {
>> + __swapcache_free(entry, true);
>> + return 0;
>> + }
>> + ret = split_huge_page_to_list(page, list);
>> + if (ret) {
>> + delete_from_swap_cache(page);
>> + return -EBUSY;
>> + }
>> + return 1;
>> +}
>> +#else
>> +static inline int add_to_swap_trans_huge(struct page *page,
>> + struct list_head *list)
>> +{
>> + return 0;
>> +}
>> +#endif
>> +
>> /**
>> * add_to_swap - allocate swap space for a page
>> * @page: page we want to move to swap
>> *
>> * Allocate swap space for the page and add the page to the
>> - * swap cache. Caller needs to hold the page lock.
>> + * swap cache. Caller needs to hold the page lock.
>> */
>> int add_to_swap(struct page *page, struct list_head *list)
>> {
>> @@ -198,6 +240,18 @@ int add_to_swap(struct page *page, struct list_head *list)
>> VM_BUG_ON_PAGE(!PageLocked(page), page);
>> VM_BUG_ON_PAGE(!PageUptodate(page), page);
>>
>> + if (unlikely(PageTransHuge(page))) {
>> + err = add_to_swap_trans_huge(page, list);
>> + switch (err) {
>> + case 1:
>> + return 1;
>> + case 0:
>> + /* fallback to split firstly if return 0 */
>> + break;
>> + default:
>> + return 0;
>> + }
>> + }
>> entry = get_swap_page();
>> if (!entry.val)
>> return 0;
>
> add_to_swap_trans_huge() is too close a copy of add_to_swap(), which
> makes the code error prone for future modifications to the swap slot
> allocation protocol.
>
> This should read:
>
> retry:
> entry = get_swap_page(page);
> if (!entry.val) {
> if (PageTransHuge(page)) {
> split_huge_page_to_list(page, list);
> goto retry;
> }
> return 0;
> }
If the swap space is used up, that is, get_swap_page() cannot allocate
even 1 swap entry for a normal page. We will split THP unnecessarily
with the change, but in the original code, we just skip the THP. There
may be a performance regression here. Similar problem exists for
mem_cgroup_try_charge_swap() too. If the mem cgroup exceeds the swap
limit, the THP will be split unnecessary with the change too.
> And get_swap_page(), mem_cgroup_try_charge_swap() etc. should all
> check PageTransHuge() instead of having extra parameters or separate
> code paths for the huge page case.
>
> In general, don't try to tack this feature onto the side of the
> VM. Because right now, this looks a bit like the hugetlb code, with
> one big branch in the beginning that opens up an alternate
> reality. Instead, these functions should handle THP all the way down
> the stack, and without passing down redundant information.
Yes. We should share the code as much as possible. I just have some
questions as above. Could you help me on that?
Best Regards,
Huang, Ying
--
To unsubscribe, send a message with 'unsubscribe linux-mm' in
the body to majordomo@kvack.org. For more info on Linux MM,
see: http://www.linux-mm.org/ .
Don't email: <a href=mailto:"dont@kvack.org"> email@kvack.org </a>
next prev parent reply other threads:[~2017-03-30 4:15 UTC|newest]
Thread overview: 29+ messages / expand[flat|nested] mbox.gz Atom feed top
2017-03-28 5:32 [PATCH -mm -v7 0/9] THP swap: Delay splitting THP during swapping out Huang, Ying
2017-03-28 5:32 ` [PATCH -mm -v7 1/9] mm, swap: Make swap cluster size same of THP size on x86_64 Huang, Ying
2017-03-28 23:30 ` Kirill A. Shutemov
2017-03-29 1:10 ` Huang, Ying
2017-03-29 16:55 ` Johannes Weiner
2017-03-30 0:45 ` Huang, Ying
2017-03-31 14:56 ` Johannes Weiner
2017-04-01 3:29 ` Huang, Ying
2017-03-28 5:32 ` [PATCH -mm -v7 2/9] mm, memcg: Support to charge/uncharge multiple swap entries Huang, Ying
2017-03-29 16:57 ` Johannes Weiner
2017-03-30 0:53 ` Huang, Ying
2017-03-31 14:59 ` Johannes Weiner
2017-03-28 5:32 ` [PATCH -mm -v7 3/9] mm, THP, swap: Add swap cluster allocate/free functions Huang, Ying
2017-03-28 5:32 ` [PATCH -mm -v7 4/9] mm, THP, swap: Add get_huge_swap_page() Huang, Ying
2017-03-29 17:08 ` Johannes Weiner
2017-03-30 4:28 ` Huang, Ying
2017-03-31 15:24 ` Johannes Weiner
2017-04-01 3:32 ` Huang, Ying
2017-03-28 5:32 ` [PATCH -mm -v7 5/9] mm, THP, swap: Support to clear SWAP_HAS_CACHE for huge page Huang, Ying
2017-03-28 5:32 ` [PATCH -mm -v7 6/9] mm, THP, swap: Support to add/delete THP to/from swap cache Huang, Ying
2017-03-28 5:32 ` [PATCH -mm -v7 7/9] mm, THP: Add can_split_huge_page() Huang, Ying
2017-03-28 5:32 ` [PATCH -mm -v7 8/9] mm, THP, swap: Support to split THP in swap cache Huang, Ying
2017-03-28 5:32 ` [PATCH -mm -v7 9/9] mm, THP, swap: Delay splitting THP during swap out Huang, Ying
2017-03-29 17:16 ` Johannes Weiner
2017-03-30 4:15 ` Huang, Ying [this message]
2017-03-31 14:49 ` Johannes Weiner
2017-04-01 2:54 ` Huang, Ying
2017-03-28 22:13 ` [PATCH -mm -v7 0/9] THP swap: Delay splitting THP during swapping out Andrew Morton
2017-03-29 8:52 ` Huang, Ying
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=871stftn72.fsf@yhuang-dev.intel.com \
--to=ying.huang@intel.com \
--cc=akpm@linux-foundation.org \
--cc=hannes@cmpxchg.org \
--cc=linux-kernel@vger.kernel.org \
--cc=linux-mm@kvack.org \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).