From: SeongJae Park <sj@kernel.org>
To: David Hildenbrand <david@redhat.com>
Cc: SeongJae Park <sj@kernel.org>,
akpm@linux-foundation.org, osalvador@suse.de, linux-mm@kvack.org,
linux-kernel@vger.kernel.org
Subject: Re: [PATCH] mm/memory_hotplug: return zero from do_migrate_range() for only success
Date: Wed, 15 Feb 2023 22:33:55 +0000
Message-ID: <20230215223355.102508-1-sj@kernel.org>
In-Reply-To: <1ddc2eff-f1bd-be62-3c62-abe6d539feef@redhat.com>
On Wed, 15 Feb 2023 21:00:50 +0100 David Hildenbrand <david@redhat.com> wrote:
> On 15.02.23 19:03, SeongJae Park wrote:
> > On Wed, 15 Feb 2023 14:16:05 +0100 David Hildenbrand <david@redhat.com> wrote:
> >
> >> On 14.02.23 23:32, SeongJae Park wrote:
> >>> do_migrate_range() usually returns the migrate_pages() return value,
> >>> for which zero means perfect success. If isolation fails for all pages,
> >>> however, it returns the isolate_{lru,movable}_page() return value, or
> >>> zero if all pfns were invalid, hugetlb, or hwpoisoned. So
> >>> do_migrate_range() returning zero means either perfect success or
> >>> one of these special cases of total isolation failure.
> >>>
> >>> Actually, the return value is not checked by any caller, so it might be
> >>> better to simply make it a void function. However, there is a TODO for
> >>> checking the return value.
> >>
> >> I'd prefer to not add more dead code ;) Let's not return an error instead.
> >
> > Makes sense, I will send the next spin soon.
> >
> >>
> >> It's still unclear which kind of fatal migration issues we actually care
> >> about and how to really detect them.
> >
> > What do you think about treating the isolation/migration rate limit
> > (migrate_rs) hit in do_migrate_range() as fatal? It warns for the event
> > already, so definitely a bad sign.
> >
> > If that's not bad enough to be treated as fatal, I think we could have yet
> > another rate limit that is treated as fatal.
>
> IIRC, there are some setups where offlining might take several minutes
> (e.g., heavy O_DIRECT load) and that's to be expected.
>
> So the existing code warns for better debugging, but keeps trying. So
> the ratelimit is rather to not produce too much debug output, not to
> really indicate that something is fatal.
Thank you for the clarification, David!
Thanks,
SJ
>
> --
> Thanks,
>
> David / dhildenb
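
For readers of the archive, the behaviour described above (warn at a limited
rate for debugging, but keep retrying rather than treating the limit as
fatal) could be sketched in userspace C roughly as below. This is only an
illustration under stated assumptions: struct ratelimit, ratelimit_ok(),
try_migrate() and offline_with_retries() are hypothetical stand-ins invented
for this sketch, not the kernel's ratelimit API and not the actual
do_migrate_range() code.

```c
#include <stdbool.h>
#include <stdio.h>

/* Hypothetical stand-in for a ratelimit state: at most `burst`
 * messages are allowed per window of `interval` ticks. */
struct ratelimit {
	long interval;      /* window length in ticks */
	int burst;          /* messages allowed per window */
	long window_start;  /* tick at which the current window began */
	int printed;        /* messages printed in the current window */
	int suppressed;     /* total messages dropped so far */
};

/* Return true if a message may be printed at tick `now`. */
static bool ratelimit_ok(struct ratelimit *rs, long now)
{
	if (now - rs->window_start >= rs->interval) {
		/* A new window begins: reset the per-window budget. */
		rs->window_start = now;
		rs->printed = 0;
	}
	if (rs->printed < rs->burst) {
		rs->printed++;
		return true;
	}
	rs->suppressed++;
	return false;
}

/* Hypothetical migration attempt: fails until tick `succeed_at`. */
static int try_migrate(long now, long succeed_at)
{
	return now >= succeed_at ? 0 : -1;
}

/* Keep retrying until migration succeeds. Hitting the rate limit only
 * suppresses output; it never aborts the loop. Returns the number of
 * warnings that were actually printed. */
static int offline_with_retries(struct ratelimit *rs, long succeed_at)
{
	int warnings = 0;

	for (long now = 0; ; now++) {
		if (try_migrate(now, succeed_at) == 0)
			return warnings;
		if (ratelimit_ok(rs, now)) {
			fprintf(stderr, "migration failed at tick %ld, retrying\n", now);
			warnings++;
		}
	}
}
```

In this sketch a slow offlining (for example under heavy O_DIRECT load, as
mentioned above) just produces a bounded amount of debug output; whether and
how a truly fatal condition should be detected is left open, as in the
thread.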