From: Baolin Wang <baolin.wang@linux.alibaba.com>
To: "Barry Song (Xiaomi)" <baohua@kernel.org>, david@kernel.org
Cc: akpm@linux-foundation.org, axelrasmussen@google.com,
dev.jain@arm.com, kasong@tencent.com, lance.yang@linux.dev,
liam@infradead.org, linux-kernel@vger.kernel.org,
linux-mm@kvack.org, ljs@kernel.org, npache@redhat.com,
qi.zheng@linux.dev, ryan.roberts@arm.com, shakeel.butt@linux.dev,
weixugc@google.com, yuanchu@google.com, zhaonanzhe@xiaomi.com,
ziy@nvidia.com
Subject: Re: [RFC PATCH] mm: Avoiding split large folios if swap has no space
Date: Mon, 22 Jun 2026 11:04:14 +0800
Message-ID: <6e89f868-ca7a-484f-aeea-5d8d029714f2@linux.alibaba.com>
In-Reply-To: <20260620081017.89085-1-baohua@kernel.org>
On 6/20/26 4:10 PM, Barry Song (Xiaomi) wrote:
> On Fri, Jun 19, 2026 at 10:04 PM David Hildenbrand (Arm) <david@kernel.org> wrote:
> [...]
>>> /*
>>> * The page can not be swapped.
>>> *
>>> @@ -1280,6 +1289,8 @@ static unsigned int shrink_folio_list(struct list_head *folio_list,
>>>
>>> if (!folio_test_large(folio))
>>> goto activate_locked_split;
>>> + if (!__can_reclaim_anon_pages(memcg, sc))
>>> + goto activate_locked_split;
>>
>> Why are we even trying to allocate swap space if we cannot reclaim such pages?
>> Makes me wonder whether we would want to have that check earlier, before the
>> folio_alloc_swap().
>>
>> Any downsides?
>
> I don't think there are any obvious downsides there. One issue is that
> the memcg may not be passed from reclaim_pages(), so memcg would
> always be NULL. However, the folio could still belong to a memcg
> whose swap quota has been exhausted. In that case, my
> __can_reclaim_anon_pages() will fail when checking whether we can
> swap out. But switching to folio_memcg() also seems awkward.
>
> So I feel Kairui’s suggestion [1] might be the best approach. In
> folio_alloc_swap(), we return -EAGAIN to tell vmscan.c that
> we can split the folio and retry the swap-out.
> Only when there are sufficient swap slots and sufficient memcg swap
> quota do we return -EAGAIN, allowing vmscan to perform a split.
>
> diff --git a/mm/swapfile.c b/mm/swapfile.c
> index 78b49b0658ad..62e2c506ccae 100644
> --- a/mm/swapfile.c
> +++ b/mm/swapfile.c
> @@ -1755,6 +1755,9 @@ int folio_alloc_swap(struct folio *folio)
> VM_WARN_ON_ONCE(1);
> return -EINVAL;
> }
> +
> + if (get_nr_swap_pages() < (1 << order))
> + return -ENOMEM;
Shouldn't this return -EAGAIN? Suppose we try to swap out an order-9
large folio (512 pages) while get_nr_swap_pages() returns 256. In that
case we'd still need to split the order-9 large folio so that at least
part of it can be swapped out and some memory reclaimed.
> }
>
> again:
> @@ -1769,11 +1772,13 @@ int folio_alloc_swap(struct folio *folio)
> }
>
> /* Need to call this even if allocation failed, for MEMCG_SWAP_FAIL. */
> - if (unlikely(mem_cgroup_try_charge_swap(folio)))
> + if (unlikely(mem_cgroup_try_charge_swap(folio))) {
> swap_cache_del_folio(folio);
> + return -ENOMEM;
> + }
>
> if (unlikely(!folio_test_swapcache(folio)))
> - return -ENOMEM;
> + return -EAGAIN;
>
> return 0;
> }
> diff --git a/mm/vmscan.c b/mm/vmscan.c
> index 299b5d9e8836..63e8578454ea 100644
> --- a/mm/vmscan.c
> +++ b/mm/vmscan.c
> @@ -1257,6 +1257,8 @@ static unsigned int shrink_folio_list(struct list_head *folio_list,
> */
> if (folio_test_anon(folio) && folio_test_swapbacked(folio) &&
> !folio_test_swapcache(folio)) {
> + int ret;
> +
> if (!(sc->gfp_mask & __GFP_IO))
> goto keep_locked;
> if (folio_maybe_dma_pinned(folio))
> @@ -1275,10 +1277,10 @@ static unsigned int shrink_folio_list(struct list_head *folio_list,
> split_folio_to_list(folio, folio_list))
> goto activate_locked;
> }
> - if (folio_alloc_swap(folio)) {
> + if ((ret = folio_alloc_swap(folio))) {
Also, please give shmem some love (shmem also calls folio_alloc_swap()
when swapping out) :)