From: Joe Perches <joe@perches.com>
To: Michal Hocko <mhocko@kernel.org>,
Andrew Morton <akpm@linux-foundation.org>
Cc: Jann Horn <jann@thejh.net>,
linux-mm@kvack.org, LKML <linux-kernel@vger.kernel.org>
Subject: Re: [PATCH 0/2] fs, proc: optimize smaps output formatting
Date: Fri, 19 Aug 2016 10:43:15 -0700
Message-ID: <1471628595.3893.23.camel@perches.com>
In-Reply-To: <1471601580-17999-1-git-send-email-mhocko@kernel.org>
On Fri, 2016-08-19 at 12:12 +0200, Michal Hocko wrote:
> Hi,
> this is rebased on top of next-20160818. Joe has pointed out that
> meminfo is using a similar trick so I have extracted guts of what we
> have already and made it more generic to be usable for smaps as well
> (patch 1). The second patch then replaces seq_printf with seq_write
> and show_val_kb which should have smaller overhead and my measuring (in
> kvm) shows quite a nice improvements. I hope kvm is not playing tricks
> on me but I didn't get to test on a real HW.
Hi Michal.
A few comments:
For the first patch:
I think this isn't worth the expansion in object size (x86-64 defconfig)
$ size fs/proc/meminfo.o*
   text    data     bss     dec     hex filename
   2698       8       0    2706     a92 fs/proc/meminfo.o.new
   2142       8       0    2150     866 fs/proc/meminfo.o.old
Creating a new static in task_mmu would be smaller and faster code.
There are only 3 other uses of %8lu in fs/proc/task_nommu.c and
those use bytes not kB.
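To make the "new static" idea concrete, here is a minimal userspace sketch of a file-local helper that emits the same text as "%8lu kB\n" without going through a printf-style formatter. The name sketch_show_val_kb and its signature are hypothetical and are not taken from either patch; a kernel version would write into the seq_file rather than into a caller-supplied buffer.

```c
#include <assert.h>
#include <stddef.h>
#include <string.h>

/*
 * Hypothetical helper (not from the patches): write `label` followed by
 * `kb` right-aligned in a field of at least 8 characters, then " kB\n".
 * `out` must have room for strlen(label) + 20 digits + 4 + 1 bytes.
 * Returns the number of characters written, excluding the trailing NUL.
 */
static size_t sketch_show_val_kb(char *out, const char *label, unsigned long kb)
{
	char digits[24];	/* enough for a 64-bit unsigned long */
	size_t n = 0;
	size_t len = strlen(label);
	size_t pos;

	assert(out != NULL);

	/* Collect decimal digits, least significant first. */
	do {
		digits[n++] = (char)('0' + kb % 10);
		kb /= 10;
	} while (kb);

	memcpy(out, label, len);
	pos = len;

	/* Left-pad with spaces up to a width of 8, like %8lu. */
	for (size_t pad = n; pad < 8; pad++)
		out[pos++] = ' ';

	/* Emit the digits, most significant first. */
	while (n)
		out[pos++] = digits[--n];

	memcpy(out + pos, " kB\n", 5);	/* 4 characters plus the NUL */
	return pos + 4;
}
```

Whether a helper like this is actually smaller or faster than the shared version is something only a size(1) comparison and a measurement on real hardware can settle.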
There are a few other likely not performance sensitive similar
uses in <arch>/mm
$ git grep -E "seq_printf.*%8lu kB" arch
arch/x86/mm/pageattr.c: seq_printf(m, "DirectMap4k: %8lu kB\n",
arch/x86/mm/pageattr.c: seq_printf(m, "DirectMap2M: %8lu kB\n",
arch/x86/mm/pageattr.c: seq_printf(m, "DirectMap4M: %8lu kB\n",
arch/x86/mm/pageattr.c: seq_printf(m, "DirectMap1G: %8lu kB\n",
arch/s390/mm/pageattr.c: seq_printf(m, "DirectMap4k: %8lu kB\n",
arch/s390/mm/pageattr.c: seq_printf(m, "DirectMap1M: %8lu kB\n",
arch/s390/mm/pageattr.c: seq_printf(m, "DirectMap2G: %8lu kB\n",
For the second patch:
seq_read starts with a PAGE_SIZE buffer and, if that buffer isn't
big enough, it throws the buffer away and redoes the output done to
that point into a new buffer of twice the size (size << 1), repeating
until the buffer is big enough to hold the output.
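The cost of that retry scheme can be estimated with a small userspace model. This is only a sketch: model_doubling, struct retry_cost, and the 4096-byte page size are assumptions made for illustration, not kernel code. It counts how many bytes get formatted in total, including the attempts that are discarded.

```c
#include <assert.h>
#include <stddef.h>

#define SKETCH_PAGE_SIZE 4096UL	/* assumed page size */

/* Result of the model: all names here are hypothetical. */
struct retry_cost {
	unsigned long final_size;	/* buffer size that finally fit */
	unsigned long attempts;		/* number of formatting passes */
	unsigned long bytes_written;	/* bytes formatted, incl. discarded passes */
};

/*
 * Model a buffer that starts at `initial` bytes and doubles whenever
 * `need` bytes of output do not fit. As with the seq_file overflow test,
 * a completely full buffer is treated as overflowed, so the output only
 * fits when need < size.
 */
static struct retry_cost model_doubling(unsigned long need, unsigned long initial)
{
	struct retry_cost c = { initial, 0, 0 };

	assert(initial > 0);

	for (;;) {
		c.attempts++;
		if (need < c.final_size) {
			c.bytes_written += need;	/* this pass succeeds */
			return c;
		}
		c.bytes_written += c.final_size;	/* filled, then discarded */
		c.final_size <<= 1;
	}
}
```

For a 10000-byte output, starting from one page takes three passes and formats 22288 bytes in total, while starting from a 16384-byte buffer takes a single pass of 10000 bytes.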
So I expect this case of multiple pages / megabytes worth of smaps
output (40MB in your pathological case) would be rather faster if
single_open_size were used appropriately for the expected output size.
And this would definitely be faster if seq_has_overflowed() were
used somewhere in the iteration loop.
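A userspace sketch of the early-exit idea follows. The sketch_* names are hypothetical stand-ins that mimic the count == size overflow convention of seq_file; they are not the kernel API. The point is only that once the buffer has overflowed, every further field formatted in that pass is wasted work, so checking inside the loop lets the pass bail out early.

```c
#include <assert.h>
#include <stdbool.h>
#include <stddef.h>
#include <string.h>

/* Hypothetical stand-in for the buffer state kept by a seq_file. */
struct sketch_buf {
	char *buf;
	size_t size;
	size_t count;
};

/* Overflow is recorded by setting count equal to size. */
static bool sketch_has_overflowed(const struct sketch_buf *m)
{
	return m->count == m->size;
}

/* Append a string, or mark the buffer as overflowed if it does not fit. */
static void sketch_puts(struct sketch_buf *m, const char *s)
{
	size_t len = strlen(s);

	assert(m->count <= m->size);

	if (m->count + len < m->size) {
		memcpy(m->buf + m->count, s, len);
		m->count += len;
	} else {
		m->count = m->size;
	}
}

/*
 * Emit `nfields` copies of `field`. With `check` set, stop as soon as the
 * buffer has overflowed. Returns how many fields were actually formatted.
 */
static size_t sketch_show(struct sketch_buf *m, const char *field,
			  size_t nfields, bool check)
{
	size_t done = 0;

	for (size_t i = 0; i < nfields; i++) {
		if (check && sketch_has_overflowed(m))
			break;
		sketch_puts(m, field);
		done++;
	}
	return done;
}
```

With a 16-byte buffer and 100 eight-character fields, the checked loop formats 2 fields before giving up, while the unchecked loop goes through all 100 even though the pass is already doomed to be retried.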