Re: [sched/numa] a43455a1d57: +94.1% proc-vmstat.numa_hint_faults_local

All of lore.kernel.org
 help / color / mirror / Atom feed

From: Rik van Riel <riel@redhat.com>
To: lkp@lists.01.org
Subject: Re: [sched/numa] a43455a1d57: +94.1% proc-vmstat.numa_hint_faults_local
Date: Thu, 31 Jul 2014 02:22:55 -0400	[thread overview]
Message-ID: <53D9E0BF.1020600@redhat.com> (raw)
In-Reply-To: <20140731050454.GA9386@aaronlu.sh.intel.com>

[-- Attachment #1: Type: text/plain, Size: 2997 bytes --]

-----BEGIN PGP SIGNED MESSAGE-----
Hash: SHA1

On 07/31/2014 01:04 AM, Aaron Lu wrote:
> On Wed, Jul 30, 2014 at 10:25:03AM -0400, Rik van Riel wrote:
>> On 07/29/2014 10:14 PM, Aaron Lu wrote:
>>> On Tue, Jul 29, 2014 at 04:04:37PM -0400, Rik van Riel wrote:
>>>> On Tue, 29 Jul 2014 10:17:12 +0200 Peter Zijlstra
>>>> <peterz@infradead.org> wrote:
>>>> 
>>>>>> +#define NUMA_SCALE 1000 +#define NUMA_MOVE_THRESH 50
>>>>> 
>>>>> Please make that 1024, there's no reason not to use power
>>>>> of two here. This base 10 factor thing annoyed me no end
>>>>> already, its time for it to die.
>>>> 
>>>> That's easy enough.  However, it would be good to know
>>>> whether this actually helps with the regression Aaron found
>>>> :)
>>> 
>>> Sorry for the delay.
>>> 
>>> I applied the last patch and queued the hackbench job to the
>>> ivb42 test machine for it to run 5 times, and here is the
>>> result(regarding the proc-vmstat.numa_hint_faults_local
>>> field): 173565 201262 192317 198342 198595 avg: 192816
>>> 
>>> It seems it is still very big than previous kernels.
>> 
>> It looks like a step in the right direction, though.
>> 
>> Could you try running with a larger threshold?
>> 
>>>> +++ b/kernel/sched/fair.c @@ -924,10 +924,12 @@ static inline
>>>> unsigned long group_faults_cpu(struct numa_group *group, int
>>>> nid)
>>>> 
>>>> /* * These return the fraction of accesses done by a
>>>> particular task, or - * task group, on a particular numa
>>>> node.  The group weight is given a - * larger multiplier, in
>>>> order to group tasks together that are almost - * evenly
>>>> spread out between numa nodes. + * task group, on a
>>>> particular numa node.  The NUMA move threshold + * prevents
>>>> task moves with marginal improvement, and is set to 5%. */ 
>>>> +#define NUMA_SCALE 1024 +#define NUMA_MOVE_THRESH (5 *
>>>> NUMA_SCALE / 100)
>> 
>> It would be good to see if changing NUMA_MOVE_THRESH to 
>> (NUMA_SCALE / 8) does the trick.
> 
> With your 2nd patch and the above change, the result is:
> 
> "proc-vmstat.numa_hint_faults_local": [ 199708, 209152, 200638, 
> 187324, 196654 ],
> 
> avg: 198695

OK, so it is still a little higher than your original 162245.

I guess this is to be expected, since the code will be more
successful at placing a task on the right node, which results
in the task scanning its memory more rapidly for a little bit.

Are you seeing any changes in throughput?

- -- 
All rights reversed
-----BEGIN PGP SIGNATURE-----
Version: GnuPG v1

iQEcBAEBAgAGBQJT2eC/AAoJEM553pKExN6DIFMH/23LsoEJ8cUqMTdWUzhXesEb
TW0yncraZ6tDkGHopTU4oFmck93XUUVSJRVjLC3lxvxAIdWt8M4GCbWN8RD1yicX
Ii9s18+2r2vkc30gkIgh2yahaqQUun9sUkuaQ4BaKlbP+hwQzB3OfU1GjR7iStFE
t04krgCAL+xL63H/4mN0Y9ZjOBUz2QYbkspS21+oEWKkFY2FyyQn+hOSnA6lSvqy
o7v4tmC8jtRXsQY+hfy1aOtMUZO5sRcYHOttlxgjE5MbnW/whhsC+oB7cWw646St
LhvhhIykl/g2Bz+E3KbfnREGn5OO7NmEhv3am2Dj5XsNHnEfxYJH/m/aTA4az/s=
=/IeV
-----END PGP SIGNATURE-----

WARNING: multiple messages have this Message-ID (diff)

From: Rik van Riel <riel@redhat.com>
To: Aaron Lu <aaron.lu@intel.com>
Cc: Peter Zijlstra <peterz@infradead.org>,
	LKML <linux-kernel@vger.kernel.org>,
	lkp@01.org, jhladky@redhat.com
Subject: Re: [LKP] [sched/numa] a43455a1d57: +94.1% proc-vmstat.numa_hint_faults_local
Date: Thu, 31 Jul 2014 02:22:55 -0400	[thread overview]
Message-ID: <53D9E0BF.1020600@redhat.com> (raw)
In-Reply-To: <20140731050454.GA9386@aaronlu.sh.intel.com>

-----BEGIN PGP SIGNED MESSAGE-----
Hash: SHA1

On 07/31/2014 01:04 AM, Aaron Lu wrote:
> On Wed, Jul 30, 2014 at 10:25:03AM -0400, Rik van Riel wrote:
>> On 07/29/2014 10:14 PM, Aaron Lu wrote:
>>> On Tue, Jul 29, 2014 at 04:04:37PM -0400, Rik van Riel wrote:
>>>> On Tue, 29 Jul 2014 10:17:12 +0200 Peter Zijlstra
>>>> <peterz@infradead.org> wrote:
>>>> 
>>>>>> +#define NUMA_SCALE 1000 +#define NUMA_MOVE_THRESH 50
>>>>> 
>>>>> Please make that 1024, there's no reason not to use power
>>>>> of two here. This base 10 factor thing annoyed me no end
>>>>> already, its time for it to die.
>>>> 
>>>> That's easy enough.  However, it would be good to know
>>>> whether this actually helps with the regression Aaron found
>>>> :)
>>> 
>>> Sorry for the delay.
>>> 
>>> I applied the last patch and queued the hackbench job to the
>>> ivb42 test machine for it to run 5 times, and here is the
>>> result(regarding the proc-vmstat.numa_hint_faults_local
>>> field): 173565 201262 192317 198342 198595 avg: 192816
>>> 
>>> It seems it is still very big than previous kernels.
>> 
>> It looks like a step in the right direction, though.
>> 
>> Could you try running with a larger threshold?
>> 
>>>> +++ b/kernel/sched/fair.c @@ -924,10 +924,12 @@ static inline
>>>> unsigned long group_faults_cpu(struct numa_group *group, int
>>>> nid)
>>>> 
>>>> /* * These return the fraction of accesses done by a
>>>> particular task, or - * task group, on a particular numa
>>>> node.  The group weight is given a - * larger multiplier, in
>>>> order to group tasks together that are almost - * evenly
>>>> spread out between numa nodes. + * task group, on a
>>>> particular numa node.  The NUMA move threshold + * prevents
>>>> task moves with marginal improvement, and is set to 5%. */ 
>>>> +#define NUMA_SCALE 1024 +#define NUMA_MOVE_THRESH (5 *
>>>> NUMA_SCALE / 100)
>> 
>> It would be good to see if changing NUMA_MOVE_THRESH to 
>> (NUMA_SCALE / 8) does the trick.
> 
> With your 2nd patch and the above change, the result is:
> 
> "proc-vmstat.numa_hint_faults_local": [ 199708, 209152, 200638, 
> 187324, 196654 ],
> 
> avg: 198695

OK, so it is still a little higher than your original 162245.

I guess this is to be expected, since the code will be more
successful at placing a task on the right node, which results
in the task scanning its memory more rapidly for a little bit.

Are you seeing any changes in throughput?

- -- 
All rights reversed
-----BEGIN PGP SIGNATURE-----
Version: GnuPG v1

iQEcBAEBAgAGBQJT2eC/AAoJEM553pKExN6DIFMH/23LsoEJ8cUqMTdWUzhXesEb
TW0yncraZ6tDkGHopTU4oFmck93XUUVSJRVjLC3lxvxAIdWt8M4GCbWN8RD1yicX
Ii9s18+2r2vkc30gkIgh2yahaqQUun9sUkuaQ4BaKlbP+hwQzB3OfU1GjR7iStFE
t04krgCAL+xL63H/4mN0Y9ZjOBUz2QYbkspS21+oEWKkFY2FyyQn+hOSnA6lSvqy
o7v4tmC8jtRXsQY+hfy1aOtMUZO5sRcYHOttlxgjE5MbnW/whhsC+oB7cWw646St
LhvhhIykl/g2Bz+E3KbfnREGn5OO7NmEhv3am2Dj5XsNHnEfxYJH/m/aTA4az/s=
=/IeV
-----END PGP SIGNATURE-----

next prev parent reply	other threads:[~2014-07-31  6:22 UTC|newest]

Thread overview: 66+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
     [not found] <53d70ee6.JsUEmW5dWsv8dev+%fengguang.wu@intel.com>
2014-07-29  5:24 ` [sched/numa] a43455a1d57: +94.1% proc-vmstat.numa_hint_faults_local Aaron Lu
2014-07-29  5:24   ` [LKP] " Aaron Lu
2014-07-29  6:39   ` Rik van Riel
2014-07-29  6:39     ` [LKP] " Rik van Riel
2014-07-29  8:17     ` Peter Zijlstra
2014-07-29  8:17       ` [LKP] " Peter Zijlstra
2014-07-29 20:04       ` Rik van Riel
2014-07-29 20:04         ` [LKP] " Rik van Riel
2014-07-30  2:14         ` Aaron Lu
2014-07-30  2:14           ` [LKP] " Aaron Lu
2014-07-30 14:25           ` Rik van Riel
2014-07-30 14:25             ` [LKP] " Rik van Riel
2014-07-31  5:04             ` Aaron Lu
2014-07-31  5:04               ` [LKP] " Aaron Lu
2014-07-31  6:22               ` Rik van Riel [this message]
2014-07-31  6:22                 ` Rik van Riel
2014-07-31  6:53                 ` Aaron Lu
2014-07-31  6:53                   ` [LKP] " Aaron Lu
2014-07-31  6:42               ` Rik van Riel
2014-07-31  6:42                 ` [LKP] " Rik van Riel
2014-08-05 21:43               ` Rik van Riel
2014-08-05 21:43                 ` [LKP] " Rik van Riel
2014-07-31  8:33           ` Peter Zijlstra
2014-07-31  8:33             ` [LKP] " Peter Zijlstra
2014-07-31  8:56             ` Aaron Lu
2014-07-31  8:56               ` [LKP] " Aaron Lu
2014-07-31 10:42     ` Peter Zijlstra
2014-07-31 10:42       ` [LKP] " Peter Zijlstra
2014-07-31 15:57       ` Peter Zijlstra
2014-07-31 15:57         ` [LKP] " Peter Zijlstra
2014-07-31 16:16         ` Jirka Hladky
2014-07-31 16:16           ` [LKP] " Jirka Hladky
2014-07-31 16:27           ` Peter Zijlstra
2014-07-31 16:27             ` [LKP] " Peter Zijlstra
2014-07-31 16:39             ` Jirka Hladky
2014-07-31 16:39               ` [LKP] " Jirka Hladky
2014-07-31 17:37               ` Peter Zijlstra
2014-07-31 17:37                 ` [LKP] " Peter Zijlstra
2014-08-01 15:02                 ` Peter Zijlstra
2014-08-01 15:02                   ` [LKP] " Peter Zijlstra
2014-08-01 20:46           ` Davidlohr Bueso
2014-08-01 20:46             ` [LKP] " Davidlohr Bueso
2014-08-01 20:48             ` Davidlohr Bueso
2014-08-01 20:48               ` [LKP] " Davidlohr Bueso
2014-08-01 21:30             ` Jirka Hladky
2014-08-01 21:30               ` [LKP] " Jirka Hladky
2014-08-02  4:17               ` Rik van Riel
2014-08-02  4:17                 ` [LKP] " Rik van Riel
2014-08-02  5:28                 ` Jirka Hladky
2014-08-02  5:28                   ` [LKP] " Jirka Hladky
2014-08-02  4:26               ` Peter Zijlstra
2014-08-02  4:26                 ` [LKP] " Peter Zijlstra
2014-08-01  0:18       ` Davidlohr Bueso
2014-08-01  0:18         ` [LKP] " Davidlohr Bueso
2014-08-01  2:03       ` Aaron Lu
2014-08-01  2:03         ` [LKP] " Aaron Lu
2014-08-01  4:03         ` Davidlohr Bueso
2014-08-01  4:03           ` [LKP] " Davidlohr Bueso
2014-08-01  7:29           ` Peter Zijlstra
2014-08-01  7:29             ` [LKP] " Peter Zijlstra
2014-08-01  7:29         ` Peter Zijlstra
2014-08-01  7:29           ` [LKP] " Peter Zijlstra
2014-07-31 23:58           ` Yuyang Du
2014-07-31 23:58             ` [LKP] " Yuyang Du
2014-08-01  8:14           ` Fengguang Wu
2014-08-01  8:14             ` [LKP] " Fengguang Wu

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=53D9E0BF.1020600@redhat.com \
    --to=riel@redhat.com \
    --cc=lkp@lists.01.org \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link

Be sure your reply has a Subject: header at the top and a blank line before the message body.

This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.