From: Vlastimil Babka <vbabka@suse.cz>
To: David Rientjes <rientjes@google.com>
Cc: linux-mm@kvack.org, linux-kernel@vger.kernel.org,
Andrew Morton <akpm@linux-foundation.org>,
Greg Thelen <gthelen@google.com>,
Minchan Kim <minchan@kernel.org>, Mel Gorman <mgorman@suse.de>,
Michal Nazarewicz <mina86@mina86.com>,
Naoya Horiguchi <n-horiguchi@ah.jp.nec.com>,
Christoph Lameter <cl@linux.com>, Rik van Riel <riel@redhat.com>
Subject: Re: [RFC PATCH 1/6] mm, compaction: periodically drop lock and restore IRQs in scanners
Date: Thu, 05 Jun 2014 11:05:17 +0200 [thread overview]
Message-ID: <539032CD.9070506@suse.cz> (raw)
In-Reply-To: <alpine.DEB.2.02.1406041628100.18899@chino.kir.corp.google.com>
On 06/05/2014 01:39 AM, David Rientjes wrote:
> On Wed, 4 Jun 2014, Vlastimil Babka wrote:
>
>> diff --git a/mm/compaction.c b/mm/compaction.c
>> index ed7102c..f0fd4b5 100644
>> --- a/mm/compaction.c
>> +++ b/mm/compaction.c
>> @@ -185,47 +185,74 @@ static void update_pageblock_skip(struct compact_control *cc,
>> }
>> #endif /* CONFIG_COMPACTION */
>>
>> -static inline bool should_release_lock(spinlock_t *lock)
>> +/*
>> + * Compaction requires the taking of some coarse locks that are potentially
>> + * very heavily contended. Check if the process needs to be scheduled or
>> + * if the lock is contended. For async compaction, back out if the process
>> + * needs to be scheduled, or the lock cannot be taken immediately. For sync
>> + * compaction, schedule and spin on the lock if needed.
>> + *
>> + * Returns true if the lock is held
>> + * Returns false if the lock is not held and compaction should abort
>> + */
>> +static inline bool compact_trylock_irqsave(spinlock_t *lock,
>> + unsigned long *flags, struct compact_control *cc)
>
> Hmm, what tree is this series based on? It doesn't apply cleanly to
> linux-next, I think you're missing
> mm-compaction-properly-signal-and-act-upon-lock-and-need_sched-contention-fix.patch
> in your tree.
It was 3.15-rc5 and all compaction stuff in -mm applied. I do have the
fix, but perhaps I forgot something else, or linux-next did not have the
fix yet (Andrew picked it up on tuesday I think).
Anyway I saw the pending compaction stuff was just merged so things will
get easier the next round :)
> Is there a performance benefit to doing the inlining here?
I admit I didn't check, I just thought it might make sense since there
are just two call sites. I'll reconsider but I doubt I will be able to
see any runtime difference in this path. Maybe just based on the object
size.
>> {
>> - return need_resched() || spin_is_contended(lock);
>> + if (cc->mode == MIGRATE_ASYNC) {
>> + if (need_resched() || !spin_trylock_irqsave(lock, *flags)) {
>> + cc->contended = true;
>> + return false;
>> + }
>> + } else {
>> + cond_resched();
>
> Why do we need this cond_resched() here if there is already a
> cond_resched() in compact_unlock_should_abort() for non-async compaction?
>
> If this is trying to reschedule right before disabling irqs because
> otherwise spinning on the lock causes irq starvation, then that's a very
> delicate balance and I think we're going to get in trouble later.
I wasn't thinking about real starvation. I guess it's already not so bad
if nobody noticed that the periodical unlock and IRQ enable was broken
for so long and the original patch was adding it based on profiles
looking better, not solving any particular problem encountered.
So here I just thought it wouldn't be too costly to check before taking
the lock, since it might have been up to 32 iterations since the last
check. But it's true it might be pointless since when we hold the lock,
we check once per 32 iterations (with this patch) anyway.
>> + spin_lock_irqsave(lock, *flags);
>> + }
>> +
>> + return true;
>> }
>>
>> /*
>> * Compaction requires the taking of some coarse locks that are potentially
>> - * very heavily contended. Check if the process needs to be scheduled or
>> - * if the lock is contended. For async compaction, back out in the event
>> - * if contention is severe. For sync compaction, schedule.
>> + * very heavily contended. The lock should be periodically unlocked to avoid
>> + * having disabled IRQs for a long time, even when there is nobody waiting on
>> + * the lock. It might also be that allowing the IRQs will result in
>> + * need_resched() becoming true. If scheduling is needed, or somebody else
>> + * has taken the lock, async compaction aborts. Sync compaction schedules.
>> + * Either compaction type will also abort if a fatal signal is pending.
>> + * In either case if the lock was locked, it is dropped and not regained.
>> *
>> - * Returns true if the lock is held.
>> - * Returns false if the lock is released and compaction should abort
>> + * Returns true if compaction should abort due to fatal signal pending, or
>> + * async compaction due to lock contention or need to schedule
>> + * Returns false when compaction can continue (sync compaction might have
>> + * scheduled)
>> */
>> -static bool compact_checklock_irqsave(spinlock_t *lock, unsigned long *flags,
>> - bool locked, struct compact_control *cc)
>> +static inline bool compact_unlock_should_abort(spinlock_t *lock,
>> + unsigned long flags, bool *locked, struct compact_control *cc)
>
> This inlining is also suspicious and I think keeping both of them
> out-of-line for the freeing and migration scanners is going to be the best
> route unless there's some measurable performance benefit I'm not seeing.
OK.
>> {
>> - if (should_release_lock(lock)) {
>> - if (locked) {
>> - spin_unlock_irqrestore(lock, *flags);
>> - locked = false;
>> - }
>> + if (*locked) {
>> + spin_unlock_irqrestore(lock, flags);
>> + *locked = false;
>> + }
>>
>> - /* async aborts if taking too long or contended */
>> - if (cc->mode == MIGRATE_ASYNC) {
>> + if (fatal_signal_pending(current))
>> + return true;
>> +
>> + if (cc->mode == MIGRATE_ASYNC) {
>> + if (need_resched() || spin_is_locked(lock)) {
>> cc->contended = true;
>> - return false;
>> + return true;
>> }
>> -
>> + } else {
>> cond_resched();
>> }
>>
>> - if (!locked)
>> - spin_lock_irqsave(lock, *flags);
>> - return true;
>> + return false;
>> }
>>
>> /*
>> - * Aside from avoiding lock contention, compaction also periodically checks
>> + * Aside from avoiding lock contention, compaction should also periodically checks
>
> Not sure what the purpose of this commentary change is, it's gramatically
> incorrect now.
>
>> * need_resched() and either schedules in sync compaction, or aborts async
>> - * compaction. This is similar to compact_checklock_irqsave() does, but used
>> + * compaction. This is similar to compact_unlock_should_abort() does, but used
>
> This was and still is gramatically incorrect :)
Thanks, will fix.
>
>> * where no lock is concerned.
>> *
>> * Returns false when no scheduling was needed, or sync compaction scheduled.
>> @@ -285,6 +312,16 @@ static unsigned long isolate_freepages_block(struct compact_control *cc,
>> int isolated, i;
>> struct page *page = cursor;
>>
>> + /*
>> + * Periodically drop the lock (if held) regardless of its
>> + * contention, to give chance to IRQs. Abort async compaction
>> + * if contended.
>> + */
>> + if (!(blockpfn % SWAP_CLUSTER_MAX)
>> + && compact_unlock_should_abort(&cc->zone->lock, flags,
>> + &locked, cc))
>> + break;
>> +
>> nr_scanned++;
>> if (!pfn_valid_within(blockpfn))
>> goto isolate_fail;
>> @@ -302,8 +339,9 @@ static unsigned long isolate_freepages_block(struct compact_control *cc,
>> * spin on the lock and we acquire the lock as late as
>> * possible.
>> */
>> - locked = compact_checklock_irqsave(&cc->zone->lock, &flags,
>> - locked, cc);
>> + if (!locked)
>> + locked = compact_trylock_irqsave(&cc->zone->lock,
>> + &flags, cc);
>> if (!locked)
>> break;
>>
>> @@ -523,13 +561,15 @@ isolate_migratepages_range(struct zone *zone, struct compact_control *cc,
>>
>> /* Time to isolate some pages for migration */
>> for (; low_pfn < end_pfn; low_pfn++) {
>> - /* give a chance to irqs before checking need_resched() */
>> - if (locked && !(low_pfn % SWAP_CLUSTER_MAX)) {
>> - if (should_release_lock(&zone->lru_lock)) {
>> - spin_unlock_irqrestore(&zone->lru_lock, flags);
>> - locked = false;
>> - }
>> - }
>> + /*
>> + * Periodically drop the lock (if held) regardless of its
>> + * contention, to give chance to IRQs. Abort async compaction
>> + * if contended.
>> + */
>> + if (!(low_pfn % SWAP_CLUSTER_MAX)
>> + && compact_unlock_should_abort(&zone->lru_lock, flags,
>> + &locked, cc))
>> + break;
>>
>> /*
>> * migrate_pfn does not necessarily start aligned to a
>> @@ -631,10 +671,11 @@ isolate_migratepages_range(struct zone *zone, struct compact_control *cc,
>> page_count(page) > page_mapcount(page))
>> continue;
>>
>> - /* Check if it is ok to still hold the lock */
>> - locked = compact_checklock_irqsave(&zone->lru_lock, &flags,
>> - locked, cc);
>> - if (!locked || fatal_signal_pending(current))
>> + /* If the lock is not held, try to take it */
>> + if (!locked)
>> + locked = compact_trylock_irqsave(&zone->lru_lock,
>> + &flags, cc);
>> + if (!locked)
>> break;
>>
>> /* Recheck PageLRU and PageTransHuge under lock */
--
To unsubscribe, send a message with 'unsubscribe linux-mm' in
the body to majordomo@kvack.org. For more info on Linux MM,
see: http://www.linux-mm.org/ .
Don't email: <a href=mailto:"dont@kvack.org"> email@kvack.org </a>
next prev parent reply other threads:[~2014-06-05 9:05 UTC|newest]
Thread overview: 135+ messages / expand[flat|nested] mbox.gz Atom feed top
2014-05-01 0:45 [patch 1/2] mm, migration: add destination page freeing callback David Rientjes
2014-05-01 0:45 ` [patch 2/2] mm, compaction: return failed migration target pages back to freelist David Rientjes
2014-05-01 5:10 ` Naoya Horiguchi
2014-05-01 21:02 ` David Rientjes
2014-05-01 5:08 ` [patch 1/2] mm, migration: add destination page freeing callback Naoya Horiguchi
[not found] ` <5361d71e.236ec20a.1b3d.ffffc8aeSMTPIN_ADDED_BROKEN@mx.google.com>
2014-05-01 21:02 ` David Rientjes
2014-05-01 21:35 ` [patch v2 1/4] " David Rientjes
2014-05-01 21:35 ` [patch v2 2/4] mm, compaction: return failed migration target pages back to freelist David Rientjes
2014-05-02 10:11 ` Mel Gorman
2014-05-02 15:23 ` Vlastimil Babka
2014-05-02 15:26 ` [PATCH] mm/compaction: do not count migratepages when unnecessary Vlastimil Babka
2014-05-06 21:18 ` Naoya Horiguchi
[not found] ` <1399411134-k43fsr0p@n-horiguchi@ah.jp.nec.com>
2014-05-07 9:33 ` Vlastimil Babka
2014-05-02 15:27 ` [PATCH 2/2] mm/compaction: avoid rescanning pageblocks in isolate_freepages Vlastimil Babka
2014-05-06 22:19 ` Naoya Horiguchi
[not found] ` <1399414778-xakujfb3@n-horiguchi@ah.jp.nec.com>
2014-05-07 9:22 ` Vlastimil Babka
2014-05-02 15:29 ` [PATCH 1/2] mm/compaction: do not count migratepages when unnecessary Vlastimil Babka
2014-05-01 21:35 ` [patch v2 3/4] mm, compaction: add per-zone migration pfn cache for async compaction David Rientjes
2014-05-05 9:34 ` Vlastimil Babka
2014-05-05 9:51 ` David Rientjes
2014-05-05 14:24 ` Vlastimil Babka
2014-05-06 0:29 ` David Rientjes
2014-05-06 11:52 ` Vlastimil Babka
2014-05-01 21:35 ` [patch v2 4/4] mm, thp: do not perform sync compaction on pagefault David Rientjes
2014-05-02 10:22 ` Mel Gorman
2014-05-02 11:22 ` David Rientjes
2014-05-02 11:58 ` Mel Gorman
2014-05-02 20:29 ` David Rientjes
2014-05-05 14:48 ` Vlastimil Babka
2014-05-06 8:55 ` Mel Gorman
2014-05-06 15:05 ` Vlastimil Babka
2014-05-02 10:10 ` [patch v2 1/4] mm, migration: add destination page freeing callback Mel Gorman
2014-05-07 2:22 ` [patch v3 1/6] " David Rientjes
2014-05-07 2:22 ` [patch v3 2/6] mm, compaction: return failed migration target pages back to freelist David Rientjes
2014-05-07 14:14 ` Naoya Horiguchi
2014-05-07 21:15 ` Andrew Morton
2014-05-07 21:21 ` David Rientjes
2014-05-12 8:35 ` Vlastimil Babka
2014-05-07 21:39 ` Greg Thelen
2014-05-12 8:37 ` Vlastimil Babka
2014-05-07 2:22 ` [patch v3 3/6] mm, compaction: add per-zone migration pfn cache for async compaction David Rientjes
2014-05-07 9:34 ` Vlastimil Babka
2014-05-07 20:56 ` Naoya Horiguchi
2014-05-07 2:22 ` [patch v3 4/6] mm, compaction: embed migration mode in compact_control David Rientjes
2014-05-07 9:55 ` Vlastimil Babka
2014-05-07 10:36 ` [patch v4 " David Rientjes
2014-05-09 22:03 ` Andrew Morton
2014-05-07 2:22 ` [patch v3 5/6] mm, thp: avoid excessive compaction latency during fault David Rientjes
2014-05-07 9:39 ` Mel Gorman
2014-05-08 5:30 ` [patch -mm] mm, thp: avoid excessive compaction latency during fault fix David Rientjes
2014-05-13 10:00 ` Vlastimil Babka
2014-05-22 2:49 ` David Rientjes
2014-05-22 8:43 ` Vlastimil Babka
2014-05-07 2:22 ` [patch v3 6/6] mm, compaction: terminate async compaction when rescheduling David Rientjes
2014-05-07 9:41 ` Mel Gorman
2014-05-07 12:09 ` [PATCH v2 1/2] mm/compaction: do not count migratepages when unnecessary Vlastimil Babka
2014-05-07 12:09 ` [PATCH v2 2/2] mm/compaction: avoid rescanning pageblocks in isolate_freepages Vlastimil Babka
2014-05-07 21:47 ` David Rientjes
2014-05-07 22:06 ` Naoya Horiguchi
2014-05-08 5:28 ` Joonsoo Kim
2014-05-12 9:09 ` Vlastimil Babka
2014-05-13 1:15 ` Joonsoo Kim
2014-05-09 15:49 ` Michal Nazarewicz
2014-05-19 10:14 ` Vlastimil Babka
2014-05-22 2:51 ` David Rientjes
2014-05-07 21:44 ` [PATCH v2 1/2] mm/compaction: do not count migratepages when unnecessary David Rientjes
2014-05-09 15:48 ` Michal Nazarewicz
2014-05-12 9:51 ` Vlastimil Babka
2014-05-07 12:10 ` [patch v3 6/6] mm, compaction: terminate async compaction when rescheduling Vlastimil Babka
2014-05-07 21:20 ` Andrew Morton
2014-05-07 21:28 ` David Rientjes
2014-05-08 5:17 ` Joonsoo Kim
2014-05-12 14:15 ` [PATCH] mm, compaction: properly signal and act upon lock and need_sched() contention Vlastimil Babka
2014-05-12 15:34 ` Naoya Horiguchi
[not found] ` <1399908847-ouuxeneo@n-horiguchi@ah.jp.nec.com>
2014-05-12 15:45 ` Vlastimil Babka
2014-05-12 15:53 ` Naoya Horiguchi
2014-05-12 20:28 ` David Rientjes
2014-05-13 8:50 ` Vlastimil Babka
2014-05-13 0:44 ` Joonsoo Kim
2014-05-13 8:54 ` Vlastimil Babka
2014-05-15 2:21 ` Joonsoo Kim
2014-05-16 9:47 ` [PATCH v2] " Vlastimil Babka
2014-05-16 17:33 ` Michal Nazarewicz
2014-05-19 23:37 ` Andrew Morton
2014-05-21 14:13 ` Vlastimil Babka
2014-05-21 20:11 ` Andrew Morton
2014-05-22 3:20 ` compaction is still too expensive for thp (was: [PATCH v2] mm, compaction: properly signal and act upon lock and need_sched() contention) David Rientjes
2014-05-22 8:10 ` compaction is still too expensive for thp Vlastimil Babka
2014-05-22 8:55 ` David Rientjes
2014-05-22 12:03 ` Vlastimil Babka
2014-06-04 0:29 ` [patch -mm 1/3] mm: rename allocflags_to_migratetype for clarity David Rientjes
2014-06-04 0:29 ` [patch -mm 2/3] mm, compaction: pass gfp mask to compact_control David Rientjes
2014-06-04 0:30 ` [patch -mm 3/3] mm, compaction: avoid compacting memory for thp if pageblock cannot become free David Rientjes
2014-06-04 11:04 ` Mel Gorman
2014-06-04 22:02 ` David Rientjes
2014-06-04 16:07 ` Vlastimil Babka
2014-06-04 16:11 ` [RFC PATCH 1/6] mm, compaction: periodically drop lock and restore IRQs in scanners Vlastimil Babka
2014-06-04 16:11 ` [RFC PATCH 2/6] mm, compaction: skip rechecks when lock was already held Vlastimil Babka
2014-06-04 23:46 ` David Rientjes
2014-06-04 16:11 ` [RFC PATCH 3/6] mm, compaction: remember position within pageblock in free pages scanner Vlastimil Babka
2014-06-04 16:11 ` [RFC PATCH 4/6] mm, compaction: skip buddy pages by their order in the migrate scanner Vlastimil Babka
2014-06-05 0:02 ` David Rientjes
2014-06-05 9:24 ` Vlastimil Babka
2014-06-05 21:30 ` David Rientjes
2014-06-06 7:20 ` Vlastimil Babka
2014-06-09 9:09 ` David Rientjes
2014-06-09 11:35 ` Vlastimil Babka
2014-06-09 22:25 ` David Rientjes
2014-06-10 7:26 ` Vlastimil Babka
2014-06-10 23:54 ` David Rientjes
2014-06-11 12:18 ` Vlastimil Babka
2014-06-12 0:21 ` David Rientjes
2014-06-12 11:56 ` Vlastimil Babka
2014-06-12 21:48 ` David Rientjes
2014-06-04 16:11 ` [RFC PATCH 5/6] mm, compaction: try to capture the just-created high-order freepage Vlastimil Babka
2014-06-04 16:11 ` [RFC PATCH 6/6] mm, compaction: don't migrate in blocks that cannot be fully compacted in async direct compaction Vlastimil Babka
2014-06-05 0:08 ` David Rientjes
2014-06-05 15:38 ` Vlastimil Babka
2014-06-05 21:38 ` David Rientjes
2014-06-06 7:33 ` Vlastimil Babka
2014-06-09 9:06 ` David Rientjes
2014-06-12 12:18 ` Vlastimil Babka
2014-06-04 23:39 ` [RFC PATCH 1/6] mm, compaction: periodically drop lock and restore IRQs in scanners David Rientjes
2014-06-05 9:05 ` Vlastimil Babka [this message]
2014-05-22 23:49 ` [PATCH v2] mm, compaction: properly signal and act upon lock and need_sched() contention Kevin Hilman
2014-05-23 2:48 ` Shawn Guo
2014-05-23 8:34 ` Vlastimil Babka
2014-05-23 10:49 ` Shawn Guo
2014-05-23 15:07 ` Kevin Hilman
2014-05-30 16:59 ` Stephen Warren
2014-06-02 13:35 ` Fabio Estevam
2014-06-02 14:33 ` [PATCH -mm] mm, compaction: properly signal and act upon lock and need_sched() contention - fix Vlastimil Babka
2014-06-02 15:18 ` Fabio Estevam
2014-06-02 20:09 ` David Rientjes
2014-05-02 13:16 ` [patch 1/2] mm, migration: add destination page freeing callback Vlastimil Babka
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=539032CD.9070506@suse.cz \
--to=vbabka@suse.cz \
--cc=akpm@linux-foundation.org \
--cc=cl@linux.com \
--cc=gthelen@google.com \
--cc=linux-kernel@vger.kernel.org \
--cc=linux-mm@kvack.org \
--cc=mgorman@suse.de \
--cc=mina86@mina86.com \
--cc=minchan@kernel.org \
--cc=n-horiguchi@ah.jp.nec.com \
--cc=riel@redhat.com \
--cc=rientjes@google.com \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).