All of lore.kernel.org
 help / color / mirror / Atom feed
From: Sebastian Andrzej Siewior <bigeasy-hfZtesqFncYOwBW4kG4KsQ@public.gmane.org>
To: Michal Hocko <mhocko-IBi9RG/b67k@public.gmane.org>
Cc: "Waiman Long" <longman-H+wXaHxf7aLQT0dZR+AlfA@public.gmane.org>,
	cgroups-u79uwXL29TY76Z2rM5mHXA@public.gmane.org,
	linux-mm-Bw31MaZKKs3YtjvyW6yDsg@public.gmane.org,
	"Andrew Morton"
	<akpm-de/tnXTf+JLsfHDXvbKv3WD2FQJk+8+b@public.gmane.org>,
	"Johannes Weiner"
	<hannes-druUgvl0LCNAfugRpC6u6w@public.gmane.org>,
	"Michal Koutný" <mkoutny-IBi9RG/b67k@public.gmane.org>,
	"Peter Zijlstra" <peterz-wEGCiKHe2LqWVfeAwA7xHQ@public.gmane.org>,
	"Thomas Gleixner" <tglx-hfZtesqFncYOwBW4kG4KsQ@public.gmane.org>,
	"Vladimir Davydov"
	<vdavydov.dev-Re5JQEeQqe8AvxtiuMwx3w@public.gmane.org>
Subject: Re: [PATCH 3/4] mm/memcg: Add a local_lock_t for IRQ and TASK object.
Date: Thu, 3 Feb 2022 10:54:07 +0100	[thread overview]
Message-ID: <YfumP3u1VCjKHE3b@linutronix.de> (raw)
In-Reply-To: <YflR3/RuGjYuQZPH-2MMpYkNvuYDjFM9bn6wA6Q@public.gmane.org>

On 2022-02-01 16:29:35 [+0100], Michal Hocko wrote:
> > > Sorry, I know that this all is not really related to your work but if
> > > the original optimization is solely based on artificial benchmarks then
> > > I would rather drop it and also make your RT patchset easier.
> > 
> > Do you have any real-world benchmark in mind? Like something that is
> > already used for testing/ benchmarking and would fit here?
> 
> Anything that even remotely resembles a real allocation heavy workload.

So I figured out that build the kernel as user triggers the allocation
path in_task() and in_interrupt(). I booted a PREEMPT_NONE kernel and
run "perf stat -r 5 b.sh" where b.sh unpacks a kernel and runs a
allmodconfig build on /dev/shm. The slow disk should not be a problem.

With the optimisation:
|  Performance counter stats for './b.sh' (5 runs):
| 
|       43.367.405,59 msec task-clock                #   30,901 CPUs utilized            ( +-  0,01% )
|           7.393.238      context-switches          #  170,499 /sec                     ( +-  0,13% )
|             832.364      cpu-migrations            #   19,196 /sec                     ( +-  0,15% )
|         625.235.644      page-faults               #   14,419 K/sec                    ( +-  0,00% )
| 103.822.081.026.160      cycles                    #    2,394 GHz                      ( +-  0,01% )
|  75.392.684.840.822      stalled-cycles-frontend   #   72,63% frontend cycles idle     ( +-  0,02% )
|  54.971.177.787.990      stalled-cycles-backend    #   52,95% backend cycles idle      ( +-  0,02% )
|  69.543.893.308.966      instructions              #    0,67  insn per cycle
|                                                    #    1,08  stalled cycles per insn  ( +-  0,00% )
|  14.585.269.354.314      branches                  #  336,357 M/sec                    ( +-  0,00% )
|     558.029.270.966      branch-misses             #    3,83% of all branches          ( +-  0,01% )
|  
|            1403,441 +- 0,466 seconds time elapsed  ( +-  0,03% )


With the optimisation disabled:
|  Performance counter stats for './b.sh' (5 runs):
| 
|       43.354.742,31 msec task-clock                #   30,869 CPUs utilized            ( +-  0,01% )
|           7.394.210      context-switches          #  170,601 /sec                     ( +-  0,06% )
|             842.835      cpu-migrations            #   19,446 /sec                     ( +-  0,63% )
|         625.242.341      page-faults               #   14,426 K/sec                    ( +-  0,00% )
| 103.791.714.272.978      cycles                    #    2,395 GHz                      ( +-  0,01% )
|  75.369.652.256.425      stalled-cycles-frontend   #   72,64% frontend cycles idle     ( +-  0,01% )
|  54.947.610.706.450      stalled-cycles-backend    #   52,96% backend cycles idle      ( +-  0,01% )
|  69.529.388.440.691      instructions              #    0,67  insn per cycle
|                                                    #    1,08  stalled cycles per insn  ( +-  0,01% )
|  14.584.515.016.870      branches                  #  336,497 M/sec                    ( +-  0,00% )
|     557.716.885.609      branch-misses             #    3,82% of all branches          ( +-  0,02% )
|  
|             1404,47 +- 1,05 seconds time elapsed  ( +-  0,08% )

I'm still open to a more specific test ;)

Sebastian

WARNING: multiple messages have this Message-ID (diff)
From: Sebastian Andrzej Siewior <bigeasy@linutronix.de>
To: Michal Hocko <mhocko@suse.com>
Cc: "Waiman Long" <longman@redhat.com>,
	cgroups@vger.kernel.org, linux-mm@kvack.org,
	"Andrew Morton" <akpm@linux-foundation.org>,
	"Johannes Weiner" <hannes@cmpxchg.org>,
	"Michal Koutný" <mkoutny@suse.com>,
	"Peter Zijlstra" <peterz@infradead.org>,
	"Thomas Gleixner" <tglx@linutronix.de>,
	"Vladimir Davydov" <vdavydov.dev@gmail.com>
Subject: Re: [PATCH 3/4] mm/memcg: Add a local_lock_t for IRQ and TASK object.
Date: Thu, 3 Feb 2022 10:54:07 +0100	[thread overview]
Message-ID: <YfumP3u1VCjKHE3b@linutronix.de> (raw)
In-Reply-To: <YflR3/RuGjYuQZPH@dhcp22.suse.cz>

On 2022-02-01 16:29:35 [+0100], Michal Hocko wrote:
> > > Sorry, I know that this all is not really related to your work but if
> > > the original optimization is solely based on artificial benchmarks then
> > > I would rather drop it and also make your RT patchset easier.
> > 
> > Do you have any real-world benchmark in mind? Like something that is
> > already used for testing/ benchmarking and would fit here?
> 
> Anything that even remotely resembles a real allocation heavy workload.

So I figured out that build the kernel as user triggers the allocation
path in_task() and in_interrupt(). I booted a PREEMPT_NONE kernel and
run "perf stat -r 5 b.sh" where b.sh unpacks a kernel and runs a
allmodconfig build on /dev/shm. The slow disk should not be a problem.

With the optimisation:
|  Performance counter stats for './b.sh' (5 runs):
| 
|       43.367.405,59 msec task-clock                #   30,901 CPUs utilized            ( +-  0,01% )
|           7.393.238      context-switches          #  170,499 /sec                     ( +-  0,13% )
|             832.364      cpu-migrations            #   19,196 /sec                     ( +-  0,15% )
|         625.235.644      page-faults               #   14,419 K/sec                    ( +-  0,00% )
| 103.822.081.026.160      cycles                    #    2,394 GHz                      ( +-  0,01% )
|  75.392.684.840.822      stalled-cycles-frontend   #   72,63% frontend cycles idle     ( +-  0,02% )
|  54.971.177.787.990      stalled-cycles-backend    #   52,95% backend cycles idle      ( +-  0,02% )
|  69.543.893.308.966      instructions              #    0,67  insn per cycle
|                                                    #    1,08  stalled cycles per insn  ( +-  0,00% )
|  14.585.269.354.314      branches                  #  336,357 M/sec                    ( +-  0,00% )
|     558.029.270.966      branch-misses             #    3,83% of all branches          ( +-  0,01% )
|  
|            1403,441 +- 0,466 seconds time elapsed  ( +-  0,03% )


With the optimisation disabled:
|  Performance counter stats for './b.sh' (5 runs):
| 
|       43.354.742,31 msec task-clock                #   30,869 CPUs utilized            ( +-  0,01% )
|           7.394.210      context-switches          #  170,601 /sec                     ( +-  0,06% )
|             842.835      cpu-migrations            #   19,446 /sec                     ( +-  0,63% )
|         625.242.341      page-faults               #   14,426 K/sec                    ( +-  0,00% )
| 103.791.714.272.978      cycles                    #    2,395 GHz                      ( +-  0,01% )
|  75.369.652.256.425      stalled-cycles-frontend   #   72,64% frontend cycles idle     ( +-  0,01% )
|  54.947.610.706.450      stalled-cycles-backend    #   52,96% backend cycles idle      ( +-  0,01% )
|  69.529.388.440.691      instructions              #    0,67  insn per cycle
|                                                    #    1,08  stalled cycles per insn  ( +-  0,01% )
|  14.584.515.016.870      branches                  #  336,497 M/sec                    ( +-  0,00% )
|     557.716.885.609      branch-misses             #    3,82% of all branches          ( +-  0,02% )
|  
|             1404,47 +- 1,05 seconds time elapsed  ( +-  0,08% )

I'm still open to a more specific test ;)

Sebastian


  parent reply	other threads:[~2022-02-03  9:54 UTC|newest]

Thread overview: 63+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2022-01-25 16:43 [PATCH 0/4] mm/memcg: Address PREEMPT_RT problems instead of disabling it Sebastian Andrzej Siewior
2022-01-25 16:43 ` Sebastian Andrzej Siewior
     [not found] ` <20220125164337.2071854-1-bigeasy-hfZtesqFncYOwBW4kG4KsQ@public.gmane.org>
2022-01-25 16:43   ` [PATCH 1/4] mm/memcg: Disable threshold event handlers on PREEMPT_RT Sebastian Andrzej Siewior
2022-01-25 16:43     ` Sebastian Andrzej Siewior
     [not found]     ` <20220125164337.2071854-2-bigeasy-hfZtesqFncYOwBW4kG4KsQ@public.gmane.org>
2022-01-26 14:40       ` Michal Hocko
2022-01-26 14:40         ` Michal Hocko
     [not found]         ` <YfFddqkAhd1YKqX9-2MMpYkNvuYDjFM9bn6wA6Q@public.gmane.org>
2022-01-26 14:45           ` Sebastian Andrzej Siewior
2022-01-26 14:45             ` Sebastian Andrzej Siewior
     [not found]             ` <YfFegDwQSm9v2Qcu-hfZtesqFncYOwBW4kG4KsQ@public.gmane.org>
2022-01-26 15:04               ` Michal Koutný
2022-01-26 15:04                 ` Michal Koutný
     [not found]                 ` <20220126150455.GC2516-9OudH3eul5jcvrawFnH+a6VXKuFTiq87@public.gmane.org>
2022-01-27 13:36                   ` Sebastian Andrzej Siewior
2022-01-27 13:36                     ` Sebastian Andrzej Siewior
2022-01-26 15:21               ` Michal Hocko
2022-01-26 15:21                 ` Michal Hocko
2022-01-25 16:43   ` [PATCH 2/4] mm/memcg: Protect per-CPU counter by disabling preemption on PREEMPT_RT where needed Sebastian Andrzej Siewior
2022-01-25 16:43     ` Sebastian Andrzej Siewior
     [not found]     ` <20220125164337.2071854-3-bigeasy-hfZtesqFncYOwBW4kG4KsQ@public.gmane.org>
2022-01-26 10:06       ` Vlastimil Babka
2022-01-26 10:06         ` Vlastimil Babka
     [not found]         ` <86eeed07-b7dc-b387-ea4d-1a4a41334fe3-AlSwsSmVLrQ@public.gmane.org>
2022-01-26 11:24           ` Sebastian Andrzej Siewior
2022-01-26 11:24             ` Sebastian Andrzej Siewior
2022-01-26 14:56       ` Michal Hocko
2022-01-26 14:56         ` Michal Hocko
2022-01-25 16:43   ` [PATCH 3/4] mm/memcg: Add a local_lock_t for IRQ and TASK object Sebastian Andrzej Siewior
2022-01-25 16:43     ` Sebastian Andrzej Siewior
     [not found]     ` <20220125164337.2071854-4-bigeasy-hfZtesqFncYOwBW4kG4KsQ@public.gmane.org>
2022-01-26 15:20       ` Michal Hocko
2022-01-26 15:20         ` Michal Hocko
     [not found]         ` <YfFmxH1IXeegNOa9-2MMpYkNvuYDjFM9bn6wA6Q@public.gmane.org>
2022-01-27 11:53           ` Sebastian Andrzej Siewior
2022-01-27 11:53             ` Sebastian Andrzej Siewior
     [not found]             ` <YfKHxKda7bGJmrLJ-hfZtesqFncYOwBW4kG4KsQ@public.gmane.org>
2022-02-01 12:04               ` Michal Hocko
2022-02-01 12:04                 ` Michal Hocko
     [not found]                 ` <YfkhsiWHzsyQSBfl-2MMpYkNvuYDjFM9bn6wA6Q@public.gmane.org>
2022-02-01 12:11                   ` Sebastian Andrzej Siewior
2022-02-01 12:11                     ` Sebastian Andrzej Siewior
     [not found]                     ` <Yfkjjamj09lZn4sA-hfZtesqFncYOwBW4kG4KsQ@public.gmane.org>
2022-02-01 15:29                       ` Michal Hocko
2022-02-01 15:29                         ` Michal Hocko
     [not found]                         ` <YflR3/RuGjYuQZPH-2MMpYkNvuYDjFM9bn6wA6Q@public.gmane.org>
2022-02-03  9:54                           ` Sebastian Andrzej Siewior [this message]
2022-02-03  9:54                             ` Sebastian Andrzej Siewior
     [not found]                             ` <YfumP3u1VCjKHE3b-hfZtesqFncYOwBW4kG4KsQ@public.gmane.org>
2022-02-03 10:09                               ` Michal Hocko
2022-02-03 10:09                                 ` Michal Hocko
     [not found]                                 ` <Yfup9THPcSIPDSoH-2MMpYkNvuYDjFM9bn6wA6Q@public.gmane.org>
2022-02-03 11:09                                   ` Sebastian Andrzej Siewior
2022-02-03 11:09                                     ` Sebastian Andrzej Siewior
2022-02-08 17:58                                   ` Shakeel Butt
2022-02-08 17:58                                     ` Shakeel Butt
     [not found]                                     ` <CALvZod7yovQ5OTWr=k_eiEBVb1LTRvPkbsY8joAtyigQnvBUww-JsoAwUIsXosN+BqQ9rBEUg@public.gmane.org>
2022-02-09  9:17                                       ` Michal Hocko
2022-02-09  9:17                                         ` Michal Hocko
2022-01-26 16:57       ` Vlastimil Babka
2022-01-26 16:57         ` Vlastimil Babka
     [not found]         ` <7f4928b8-16e2-88b3-2688-1519a19653a9-AlSwsSmVLrQ@public.gmane.org>
2022-01-31 15:06           ` Sebastian Andrzej Siewior
2022-01-31 15:06             ` Sebastian Andrzej Siewior
     [not found]             ` <Yff69slA4UTz5Q1Y-hfZtesqFncYOwBW4kG4KsQ@public.gmane.org>
2022-02-03 16:01               ` Vlastimil Babka
2022-02-03 16:01                 ` Vlastimil Babka
     [not found]                 ` <e068646f-c7f2-5876-8577-6ddf93df07d0-AlSwsSmVLrQ@public.gmane.org>
2022-02-08 17:17                   ` Sebastian Andrzej Siewior
2022-02-08 17:17                     ` Sebastian Andrzej Siewior
     [not found]                     ` <YgKlr+sHZPayWKUP-hfZtesqFncYOwBW4kG4KsQ@public.gmane.org>
2022-02-08 17:28                       ` Michal Hocko
2022-02-08 17:28                         ` Michal Hocko
2022-02-09  1:48     ` [mm/memcg] 86895e1e85: WARNING:possible_circular_locking_dependency_detected kernel test robot
2022-02-09  1:48       ` kernel test robot
2022-01-25 16:43   ` [PATCH 4/4] mm/memcg: Allow the task_obj optimization only on non-PREEMPTIBLE kernels Sebastian Andrzej Siewior
2022-01-25 16:43     ` Sebastian Andrzej Siewior
2022-01-25 23:21   ` [PATCH 0/4] mm/memcg: Address PREEMPT_RT problems instead of disabling it Andrew Morton
2022-01-25 23:21     ` Andrew Morton
     [not found]     ` <20220125152146.d7e25afe3b8a6807df6fee3f-de/tnXTf+JLsfHDXvbKv3WD2FQJk+8+b@public.gmane.org>
2022-01-26  7:30       ` Sebastian Andrzej Siewior
2022-01-26  7:30         ` Sebastian Andrzej Siewior
  -- strict thread matches above, loose matches on Subject: below --
2022-07-12 11:22 [PATCH 0/4] Backport MEMCG changes from v5.17 David Oberhollenzer
2022-07-12 11:22 ` [PATCH 3/4] mm/memcg: Add a local_lock_t for IRQ and TASK object David Oberhollenzer

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=YfumP3u1VCjKHE3b@linutronix.de \
    --to=bigeasy-hfztesqfncyowbw4kg4ksq@public.gmane.org \
    --cc=akpm-de/tnXTf+JLsfHDXvbKv3WD2FQJk+8+b@public.gmane.org \
    --cc=cgroups-u79uwXL29TY76Z2rM5mHXA@public.gmane.org \
    --cc=hannes-druUgvl0LCNAfugRpC6u6w@public.gmane.org \
    --cc=linux-mm-Bw31MaZKKs3YtjvyW6yDsg@public.gmane.org \
    --cc=longman-H+wXaHxf7aLQT0dZR+AlfA@public.gmane.org \
    --cc=mhocko-IBi9RG/b67k@public.gmane.org \
    --cc=mkoutny-IBi9RG/b67k@public.gmane.org \
    --cc=peterz-wEGCiKHe2LqWVfeAwA7xHQ@public.gmane.org \
    --cc=tglx-hfZtesqFncYOwBW4kG4KsQ@public.gmane.org \
    --cc=vdavydov.dev-Re5JQEeQqe8AvxtiuMwx3w@public.gmane.org \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.