From: Andrew Morton <akpm@linux-foundation.org>
To: cgel.zte@gmail.com
Cc: naoya.horiguchi@nec.com, mhocko@kernel.org, minchan@kernel.org,
hannes@cmpxchg.org, rogerq@kernel.org, linux-mm@kvack.org,
linux-kernel@vger.kernel.org, guo.ziliang@zte.com.cn,
Zeal Robot <zealci@zte.com.cn>,
Ran Xiaokai <ran.xiaokai@zte.com.cn>,
Jiang Xuexin <jiang.xuexin@zte.com.cn>,
Yang Yang <yang.yang29@zte.com.cn>,
Hugh Dickins <hughd@google.com>
Subject: Re: [PATCH linux-next] mm: swap: get rid of deadloop in swapin readahead
Date: Fri, 25 Feb 2022 17:24:40 -0800 [thread overview]
Message-ID: <20220225172440.ec62edf97b405d32061bcb37@linux-foundation.org> (raw)
In-Reply-To: <20220221111749.1928222-1-cgel.zte@gmail.com>
On Mon, 21 Feb 2022 11:17:49 +0000 cgel.zte@gmail.com wrote:
> From: Guo Ziliang <guo.ziliang@zte.com.cn>
>
> In our testing, a deadloop task was found. Through sysrq printing, same
> stack was found every time, as follows:
> __swap_duplicate+0x58/0x1a0
> swapcache_prepare+0x24/0x30
> __read_swap_cache_async+0xac/0x220
> read_swap_cache_async+0x58/0xa0
> swapin_readahead+0x24c/0x628
> do_swap_page+0x374/0x8a0
> __handle_mm_fault+0x598/0xd60
> handle_mm_fault+0x114/0x200
> do_page_fault+0x148/0x4d0
> do_translation_fault+0xb0/0xd4
> do_mem_abort+0x50/0xb0
>
> The reason for the deadloop is that swapcache_prepare() always returns
> EEXIST, indicating that SWAP_HAS_CACHE has not been cleared, so that
> it cannot jump out of the loop. We suspect that the task that clears
> the SWAP_HAS_CACHE flag never gets a chance to run. We try to lower
> the priority of the task stuck in a deadloop so that the task that
> clears the SWAP_HAS_CACHE flag will run. The results show that the
> system returns to normal after the priority is lowered.
>
> In our testing, multiple real-time tasks are bound to the same core,
> and the task in the deadloop is the highest priority task of the
> core, so the deadloop task cannot be preempted.
>
> Although cond_resched() is used by __read_swap_cache_async, it is an
> empty function in the preemptive system and cannot achieve the purpose
> of releasing the CPU. A high-priority task cannot release the CPU
> unless preempted by a higher-priority task. But when this task
> is already the highest priority task on this core, other tasks
> will not be able to be scheduled. So we think we should replace
> cond_resched() with schedule_timeout_uninterruptible(1),
> schedule_timeout_interruptible will call set_current_state
> first to set the task state, so the task will be removed
> from the running queue, so as to achieve the purpose of
> giving up the CPU and prevent it from running in kernel
> mode for too long.
>
> ...
>
> --- a/mm/swap_state.c
> +++ b/mm/swap_state.c
> @@ -478,7 +478,7 @@ struct page *__read_swap_cache_async(swp_entry_t entry, gfp_t gfp_mask,
> * __read_swap_cache_async(), which has set SWAP_HAS_CACHE
> * in swap_map, but not yet added its page to swap cache.
> */
> - cond_resched();
> + schedule_timeout_uninterruptible(1);
> }
>
> /*
Sigh. I guess yes, we should do this, at least in a short-term,
backportable-to-stable way.
But busy-waiting while hoping that someone else will save us isn't an
attractive design. Hugh, have you ever thought about something more
deterministic in there?
Thanks.
next prev parent reply other threads:[~2022-02-26 1:24 UTC|newest]
Thread overview: 9+ messages / expand[flat|nested] mbox.gz Atom feed top
2022-02-21 11:17 [PATCH linux-next] mm: swap: get rid of deadloop in swapin readahead cgel.zte
2022-02-26 1:24 ` Andrew Morton [this message]
2022-03-01 4:07 ` Hugh Dickins
2022-03-02 0:32 ` Andrew Morton
2022-03-02 19:31 ` Hugh Dickins
2022-02-28 7:57 ` Michal Hocko
2022-02-28 15:33 ` Andrew Morton
2022-03-02 9:46 ` Michal Hocko
2022-03-02 20:38 ` Hugh Dickins
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=20220225172440.ec62edf97b405d32061bcb37@linux-foundation.org \
--to=akpm@linux-foundation.org \
--cc=cgel.zte@gmail.com \
--cc=guo.ziliang@zte.com.cn \
--cc=hannes@cmpxchg.org \
--cc=hughd@google.com \
--cc=jiang.xuexin@zte.com.cn \
--cc=linux-kernel@vger.kernel.org \
--cc=linux-mm@kvack.org \
--cc=mhocko@kernel.org \
--cc=minchan@kernel.org \
--cc=naoya.horiguchi@nec.com \
--cc=ran.xiaokai@zte.com.cn \
--cc=rogerq@kernel.org \
--cc=yang.yang29@zte.com.cn \
--cc=zealci@zte.com.cn \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox