Re: [mm/page_alloc] f26b3fa046: netperf.Throughput_Mbps -18.0% regression

All of lore.kernel.org
 help / color / mirror / Atom feed

From: Peter Zijlstra <peterz@infradead.org>
To: lkp@lists.01.org
Subject: Re: [mm/page_alloc] f26b3fa046: netperf.Throughput_Mbps -18.0% regression
Date: Tue, 10 May 2022 21:27:08 +0200	[thread overview]
Message-ID: <20220510192708.GQ76023@worktop.programming.kicks-ass.net> (raw)
In-Reply-To: <CAHk-=wjguW5nxjagV99GHvc_-E_7mSg+LMvGtFjJ9LUSx4Skig@mail.gmail.com>

[-- Attachment #1: Type: text/plain, Size: 1340 bytes --]

On Tue, May 10, 2022 at 11:05:01AM -0700, Linus Torvalds wrote:

> I think from a pure lock standpoint, it's the right thing to do (no
> unnecessary bouncing, with the lock releaser doing just one write, and
> the head waiter spinning on it is doing the right thing).
> 
> But I think this is an example of where you end up having that
> spinning on the lock possibly then being a disturbance on the other
> fields around the lock.
> 
> I wonder if Waiman / PeterZ / Will have any comments on that. Maybe
> that "spin on the lock itself" is just fundamentally the only correct
> thing, but since my initial reaction was "no, we're spinning on the
> mcs node", maybe that would be _possible_?
> 
> We do have a lot of those spinlocks embedded in other data structures
> cases. And if "somebody else is waiting for the lock" contends badly
> with "the lock holder is doing a lot of writes close to the lock",
> then that's not great.

The immediate problem is that we don't always have a node. Notably we
only do the whole MCS queueing thing when there's more than 1 contender.

Always doing the MCS thing had a hefty performance penalty vs the
simpler spinlock implementations for the uncontended and light contended
lock cases (by far the most common scenario) due to the extra cache-miss
of getting an MCS node.

WARNING: multiple messages have this Message-ID (diff)

From: Peter Zijlstra <peterz@infradead.org>
To: Linus Torvalds <torvalds@linux-foundation.org>
Cc: "ying.huang@intel.com" <ying.huang@intel.com>,
	Waiman Long <longman@redhat.com>, Will Deacon <will@kernel.org>,
	Aaron Lu <aaron.lu@intel.com>,
	Mel Gorman <mgorman@techsingularity.net>,
	kernel test robot <oliver.sang@intel.com>,
	Vlastimil Babka <vbabka@suse.cz>,
	Dave Hansen <dave.hansen@linux.intel.com>,
	Jesper Dangaard Brouer <brouer@redhat.com>,
	Michal Hocko <mhocko@kernel.org>,
	Andrew Morton <akpm@linux-foundation.org>,
	LKML <linux-kernel@vger.kernel.org>,
	lkp@lists.01.org, kernel test robot <lkp@intel.com>,
	Feng Tang <feng.tang@intel.com>,
	Zhengjun Xing <zhengjun.xing@linux.intel.com>,
	fengwei.yin@intel.com
Subject: Re: [mm/page_alloc] f26b3fa046: netperf.Throughput_Mbps -18.0% regression
Date: Tue, 10 May 2022 21:27:08 +0200	[thread overview]
Message-ID: <20220510192708.GQ76023@worktop.programming.kicks-ass.net> (raw)
In-Reply-To: <CAHk-=wjguW5nxjagV99GHvc_-E_7mSg+LMvGtFjJ9LUSx4Skig@mail.gmail.com>

On Tue, May 10, 2022 at 11:05:01AM -0700, Linus Torvalds wrote:

> I think from a pure lock standpoint, it's the right thing to do (no
> unnecessary bouncing, with the lock releaser doing just one write, and
> the head waiter spinning on it is doing the right thing).
> 
> But I think this is an example of where you end up having that
> spinning on the lock possibly then being a disturbance on the other
> fields around the lock.
> 
> I wonder if Waiman / PeterZ / Will have any comments on that. Maybe
> that "spin on the lock itself" is just fundamentally the only correct
> thing, but since my initial reaction was "no, we're spinning on the
> mcs node", maybe that would be _possible_?
> 
> We do have a lot of those spinlocks embedded in other data structures
> cases. And if "somebody else is waiting for the lock" contends badly
> with "the lock holder is doing a lot of writes close to the lock",
> then that's not great.

The immediate problem is that we don't always have a node. Notably we
only do the whole MCS queueing thing when there's more than 1 contender.

Always doing the MCS thing had a hefty performance penalty vs the
simpler spinlock implementations for the uncontended and light contended
lock cases (by far the most common scenario) due to the extra cache-miss
of getting an MCS node.

next prev parent reply	other threads:[~2022-05-10 19:27 UTC|newest]

Thread overview: 73+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2022-04-20  1:35 [mm/page_alloc] f26b3fa046: netperf.Throughput_Mbps -18.0% regression kernel test robot
2022-04-20  1:35 ` kernel test robot
2022-04-29 11:29 ` Aaron Lu
2022-04-29 11:29   ` Aaron Lu
2022-04-29 13:39   ` Mel Gorman
2022-04-29 13:39     ` Mel Gorman
2022-05-05  8:27     ` Aaron Lu
2022-05-05  8:27       ` Aaron Lu
2022-05-05 11:09       ` Mel Gorman
2022-05-05 11:09         ` Mel Gorman
2022-05-05 14:29         ` Aaron Lu
2022-05-05 14:29           ` Aaron Lu
2022-05-06  8:40   ` ying.huang
2022-05-06  8:40     ` ying.huang
2022-05-06 12:17     ` Aaron Lu
2022-05-06 12:17       ` Aaron Lu
2022-05-07  0:54       ` ying.huang
2022-05-07  0:54         ` ying.huang
2022-05-07  3:27         ` Aaron Lu
2022-05-07  3:27           ` Aaron Lu
2022-05-07  7:11           ` ying.huang
2022-05-07  7:11             ` ying.huang
2022-05-07  7:31             ` Aaron Lu
2022-05-07  7:31               ` Aaron Lu
2022-05-07  7:44               ` ying.huang
2022-05-07  7:44                 ` ying.huang
2022-05-10  3:43                 ` Aaron Lu
2022-05-10  3:43                   ` Aaron Lu
2022-05-10  6:23                   ` ying.huang
2022-05-10  6:23                     ` ying.huang
2022-05-10 18:05                     ` Linus Torvalds
2022-05-10 18:05                       ` Linus Torvalds
2022-05-10 18:47                       ` Waiman Long
2022-05-10 18:47                         ` Waiman Long
2022-05-10 19:03                         ` Linus Torvalds
2022-05-10 19:03                           ` Linus Torvalds
2022-05-10 19:25                           ` Linus Torvalds
2022-05-10 19:25                             ` Linus Torvalds
2022-05-10 19:46                           ` Waiman Long
2022-05-10 19:46                             ` Waiman Long
2022-05-10 19:27                       ` Peter Zijlstra [this message]
2022-05-10 19:27                         ` Peter Zijlstra
2022-05-11  1:58                       ` ying.huang
2022-05-11  1:58                         ` ying.huang
2022-05-11  2:06                         ` Waiman Long
2022-05-11  2:06                           ` Waiman Long
2022-05-11 11:04                         ` Aaron Lu
2022-05-11 11:04                           ` Aaron Lu
2022-05-12  3:17                           ` ying.huang
2022-05-12  3:17                             ` ying.huang
2022-05-12 12:45                             ` Aaron Lu
2022-05-12 12:45                               ` Aaron Lu
2022-05-12 17:42                               ` Linus Torvalds
2022-05-12 17:42                                 ` Linus Torvalds
2022-05-12 18:06                                 ` Andrew Morton
2022-05-12 18:06                                   ` Andrew Morton
2022-05-12 18:49                                   ` Linus Torvalds
2022-05-12 18:49                                     ` Linus Torvalds
2022-06-14  2:09                                     ` Feng Tang
2022-06-14  2:09                                       ` Feng Tang
2022-05-13  6:19                                 ` ying.huang
2022-05-13  6:19                                   ` ying.huang
2022-05-11  3:40                     ` Aaron Lu
2022-05-11  3:40                       ` Aaron Lu
2022-05-11  7:32                       ` ying.huang
2022-05-11  7:32                         ` ying.huang
2022-05-11  7:53                         ` Aaron Lu
2022-05-11  7:53                           ` Aaron Lu
2022-06-01  2:19                           ` Aaron Lu
2022-06-01  2:19                             ` Aaron Lu
2022-05-11 12:13 ` [mm/page_alloc] f26b3fa046: netperf.Throughput_Mbps -18.0% regression #forregzbot Thorsten Leemhuis
2022-05-13  8:37   ` Thorsten Leemhuis
2022-09-08 11:39     ` Thorsten Leemhuis

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=20220510192708.GQ76023@worktop.programming.kicks-ass.net \
    --to=peterz@infradead.org \
    --cc=lkp@lists.01.org \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link

Be sure your reply has a Subject: header at the top and a blank line before the message body.

This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.