From: Minchan Kim <minchan@kernel.org>
To: Mel Gorman <mgorman@suse.de>
Cc: Richard Davies <richard@arachsys.com>, KVM <kvm@vger.kernel.org>,
QEMU-devel <qemu-devel@nongnu.org>,
LKML <linux-kernel@vger.kernel.org>,
Linux-MM <linux-mm@kvack.org>, Avi Kivity <avi@redhat.com>,
Andrew Morton <akpm@linux-foundation.org>,
Shaohua Li <shli@kernel.org>
Subject: Re: [Qemu-devel] [PATCH 5/9] mm: compaction: Acquire the zone->lru_lock as late as possible
Date: Tue, 25 Sep 2012 17:13:27 +0900
Message-ID: <20120925081327.GA7759@bbox>
In-Reply-To: <20120925075105.GC11266@suse.de>
On Tue, Sep 25, 2012 at 08:51:05AM +0100, Mel Gorman wrote:
> On Tue, Sep 25, 2012 at 04:05:17PM +0900, Minchan Kim wrote:
> > Hi Mel,
> >
> > I have a question below.
> >
> > On Fri, Sep 21, 2012 at 11:46:19AM +0100, Mel Gorman wrote:
> > > > Compaction's migrate scanner acquires the zone->lru_lock when scanning a range
> > > of pages looking for LRU pages to acquire. It does this even if there are
> > > no LRU pages in the range. If multiple processes are compacting then this
> > > can cause severe locking contention. To make matters worse commit b2eef8c0
> > > (mm: compaction: minimise the time IRQs are disabled while isolating pages
> > > for migration) releases the lru_lock every SWAP_CLUSTER_MAX pages that are
> > > scanned.
> > >
> > > This patch makes two changes to how the migrate scanner acquires the LRU
> > > lock. First, it only releases the LRU lock every SWAP_CLUSTER_MAX pages if
> > > the lock is contended. This reduces the number of times it unnecessarily
> > > disables and re-enables IRQs. The second is that it defers acquiring the
> > > LRU lock for as long as possible. If there are no LRU pages or the only
> > > LRU pages are transhuge then the LRU lock will not be acquired at all
> > > which reduces contention on zone->lru_lock.
> > >
> > > Signed-off-by: Mel Gorman <mgorman@suse.de>
> > > Acked-by: Rik van Riel <riel@redhat.com>
> > > ---
> > > mm/compaction.c | 63 +++++++++++++++++++++++++++++++++++++------------------
> > > 1 file changed, 43 insertions(+), 20 deletions(-)
> > >
> > > diff --git a/mm/compaction.c b/mm/compaction.c
> > > index 6b55491..a6068ff 100644
> > > --- a/mm/compaction.c
> > > +++ b/mm/compaction.c
> > > @@ -50,6 +50,11 @@ static inline bool migrate_async_suitable(int migratetype)
> > > return is_migrate_cma(migratetype) || migratetype == MIGRATE_MOVABLE;
> > > }
> > >
> > > +static inline bool should_release_lock(spinlock_t *lock)
> > > +{
> > > + return need_resched() || spin_is_contended(lock);
> > > +}
> > > +
> > > /*
> > > * Compaction requires the taking of some coarse locks that are potentially
> > > * very heavily contended. Check if the process needs to be scheduled or
> > > @@ -62,7 +67,7 @@ static inline bool migrate_async_suitable(int migratetype)
> > > static bool compact_checklock_irqsave(spinlock_t *lock, unsigned long *flags,
> > > bool locked, struct compact_control *cc)
> > > {
> > > - if (need_resched() || spin_is_contended(lock)) {
> > > + if (should_release_lock(lock)) {
> > > if (locked) {
> > > spin_unlock_irqrestore(lock, *flags);
> > > locked = false;
> > > @@ -327,7 +332,7 @@ isolate_migratepages_range(struct zone *zone, struct compact_control *cc,
> > > isolate_mode_t mode = 0;
> > > struct lruvec *lruvec;
> > > unsigned long flags;
> > > - bool locked;
> > > + bool locked = false;
> > >
> > > /*
> > > * Ensure that there are not too many pages isolated from the LRU
> > > @@ -347,23 +352,17 @@ isolate_migratepages_range(struct zone *zone, struct compact_control *cc,
> > >
> > > /* Time to isolate some pages for migration */
> > > cond_resched();
> > > - spin_lock_irqsave(&zone->lru_lock, flags);
> > > - locked = true;
> > > for (; low_pfn < end_pfn; low_pfn++) {
> > > struct page *page;
> > >
> > > /* give a chance to irqs before checking need_resched() */
> > > - if (!((low_pfn+1) % SWAP_CLUSTER_MAX)) {
> > > - spin_unlock_irqrestore(&zone->lru_lock, flags);
> > > - locked = false;
> > > + if (locked && !((low_pfn+1) % SWAP_CLUSTER_MAX)) {
> > > + if (should_release_lock(&zone->lru_lock)) {
> > > + spin_unlock_irqrestore(&zone->lru_lock, flags);
> > > + locked = false;
> > > + }
> > > }
> > >
> > > - /* Check if it is ok to still hold the lock */
> > > - locked = compact_checklock_irqsave(&zone->lru_lock, &flags,
> > > - locked, cc);
> > > - if (!locked || fatal_signal_pending(current))
> > > - break;
> > > -
> > > /*
> > > * migrate_pfn does not necessarily start aligned to a
> > > * pageblock. Ensure that pfn_valid is called when moving
> > > @@ -403,21 +402,38 @@ isolate_migratepages_range(struct zone *zone, struct compact_control *cc,
> > > pageblock_nr = low_pfn >> pageblock_order;
> > > if (!cc->sync && last_pageblock_nr != pageblock_nr &&
> > > !migrate_async_suitable(get_pageblock_migratetype(page))) {
> > > - low_pfn += pageblock_nr_pages;
> > > - low_pfn = ALIGN(low_pfn, pageblock_nr_pages) - 1;
> > > - last_pageblock_nr = pageblock_nr;
> > > - continue;
> > > + goto next_pageblock;
> > > }
> > >
> > > + /* Check may be lockless but that's ok as we recheck later */
> > > if (!PageLRU(page))
> > > continue;
> > >
> > > /*
> > > - * PageLRU is set, and lru_lock excludes isolation,
> > > - * splitting and collapsing (collapsing has already
> > > - * happened if PageLRU is set).
> > > + * PageLRU is set. lru_lock normally excludes isolation
> > > + * splitting and collapsing (collapsing has already happened
> > > + * if PageLRU is set) but the lock is not necessarily taken
> > > + * here and it is wasteful to take it just to check transhuge.
> > > + * Check transhuge without lock and skip if it's either a
> > > + * transhuge or hugetlbfs page.
> > > */
> > > if (PageTransHuge(page)) {
> > > + if (!locked)
> > > + goto next_pageblock;
> >
> > Why skip all pages in a pageblock if !locked?
> > Shouldn't we add some comment?
> >
>
> The comment is above the block already. The lru_lock normally excludes
> isolation and splitting. If we do not hold the lock, it's not safe to
> call compound_order so instead we skip the entire pageblock.
I see. To me, your explanation here is clearer than the current comment.
I hope the comment can be made more explicit, something like this:
diff --git a/mm/compaction.c b/mm/compaction.c
index df01b4e..f1d2cc7 100644
--- a/mm/compaction.c
+++ b/mm/compaction.c
@@ -542,8 +542,9 @@ isolate_migratepages_range(struct zone *zone, struct compact_control *cc,
* splitting and collapsing (collapsing has already happened
* if PageLRU is set) but the lock is not necessarily taken
* here and it is wasteful to take it just to check transhuge.
- * Check transhuge without lock and skip if it's either a
- * transhuge or hugetlbfs page.
+ * Check transhuge without lock and *skip* if it's either a
+ * transhuge or hugetlbfs page because it's not safe to call
+ * compound_order.
*/
if (PageTransHuge(page)) {
if (!locked)
Anyway, it's trivial; if anyone thinks it's valuable, please feel free to apply it.
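To make the rule discussed above concrete, here is a minimal userspace sketch, not kernel code. It only illustrates the two ideas from the changelog and from Mel's explanation: the lock is taken lazily, when the first ordinary LRU page is found, and a huge page seen without the lock causes the rest of its pageblock to be skipped. The names `scan_range`, `enum page_kind`, `struct scan_stats`, `PAGEBLOCK_NR_PAGES` and `ALIGN_UP` are hypothetical and exist only for this illustration. The lock is modelled as a plain flag, and the skip arithmetic is a simplification that lands on the last pfn of the current pageblock; it is not the arithmetic used in the patch.

```c
#include <assert.h>
#include <stdbool.h>

/* Assumed tiny pageblock size, chosen only to keep the example readable. */
#define PAGEBLOCK_NR_PAGES 8UL
/* Round x up to a multiple of a (hypothetical helper, not the kernel ALIGN). */
#define ALIGN_UP(x, a) ((((x) + (a) - 1) / (a)) * (a))

/* Simplified page states; a huge page is modelled as a single slot. */
enum page_kind { PAGE_OTHER, PAGE_LRU, PAGE_LRU_HUGE };

struct scan_stats {
	unsigned long isolated;       /* ordinary LRU pages picked up */
	unsigned long lock_acquires;  /* times the simulated lock was taken */
	unsigned long blocks_skipped; /* pageblocks skipped on a lockless huge check */
};

struct scan_stats scan_range(const enum page_kind *pages,
			     unsigned long low_pfn, unsigned long end_pfn)
{
	struct scan_stats st = { 0, 0, 0 };
	bool locked = false; /* stands in for holding zone->lru_lock */

	assert(low_pfn <= end_pfn);

	for (; low_pfn < end_pfn; low_pfn++) {
		enum page_kind kind = pages[low_pfn];

		/* Lockless check: nothing to isolate, so no lock is needed. */
		if (kind == PAGE_OTHER)
			continue;

		if (kind == PAGE_LRU_HUGE) {
			if (!locked) {
				/*
				 * Without the lock the size of the huge page
				 * cannot be read safely, so give up on the
				 * rest of this pageblock instead.
				 */
				low_pfn = ALIGN_UP(low_pfn + 1, PAGEBLOCK_NR_PAGES) - 1;
				st.blocks_skipped++;
			}
			/* With the lock held, step over just this page. */
			continue;
		}

		/* First ordinary LRU page: only now take the lock. */
		if (!locked) {
			locked = true;
			st.lock_acquires++;
		}
		st.isolated++;
	}

	/* The simulated lock would be dropped here if it was taken. */
	return st;
}
```

In this model, a range with no ordinary LRU pages never takes the lock at all, which is the contention saving the changelog describes. The cost is that a huge page found before the lock is held hides any ordinary LRU pages in the same pageblock.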
>
> --
> Mel Gorman
> SUSE Labs
--
Kind regards,
Minchan Kim