From: Michal Nazarewicz <mina86@mina86.com>
To: Minchan Kim <minchan@kernel.org>,
Andrew Morton <akpm@linux-foundation.org>
Cc: linux-mm@kvack.org, linux-kernel@vger.kernel.org,
Mel Gorman <mgorman@suse.de>, Andy Whitcroft <apw@shadowen.org>,
Alexander Nyberg <alexn@dsv.su.se>,
Randy Dunlap <rdunlap@infradead.org>
Subject: Re: [PATCH v2 2/2] Enhance read_block of page_owner.c
Date: Fri, 11 Jan 2013 17:01:29 +0100 [thread overview]
Message-ID: <xa1t8v7zbteu.fsf@mina86.com> (raw)
In-Reply-To: <1357871401-7075-2-git-send-email-minchan@kernel.org>
[-- Attachment #1: Type: text/plain, Size: 3413 bytes --]
It occurred to me -- and I know it will sound like a heresy -- that
maybe providing an overly long example in C is not the best option here.
Why not page_owner.py with the following content instead (not tested):
#!/usr/bin/python
import collections
import sys
counts = collections.defaultdict(int)
txt = ''
for line in sys.stdin:
if line == '\n':
counts[txt] += 1
txt = ''
else:
txt += line
counts[txt] += 1
for txt, num in sorted(counts.items(), txt=lambda x: x[1]):
if len(txt) > 1:
print '%d times:\n%s' % num, txt
And it's so “long” only because I chose not to read the whole file at
once as in:
counts = collections.defaultdict(int)
for txt in sys.stdin.read().split('\n\n'):
counts[txt] += 1
On Fri, Jan 11 2013, Minchan Kim wrote:
> The read_block reads char one by one until meeting two newline.
> It's not good for the performance and current code isn't good shape
> for readability.
>
> This patch enhances speed and clean up.
>
> Cc: Mel Gorman <mgorman@suse.de>
> Cc: Andy Whitcroft <apw@shadowen.org>
> Cc: Alexander Nyberg <alexn@dsv.su.se>
> Cc: Randy Dunlap <rdunlap@infradead.org>
> Signed-off-by: Michal Nazarewicz <mina86@mina86.com>
> Signed-off-by: Minchan Kim <minchan@kernel.org>
> ---
> Documentation/page_owner.c | 34 +++++++++++++---------------------
> 1 file changed, 13 insertions(+), 21 deletions(-)
>
> diff --git a/Documentation/page_owner.c b/Documentation/page_owner.c
> index 43dde96..96bf481 100644
> --- a/Documentation/page_owner.c
> +++ b/Documentation/page_owner.c
> @@ -28,26 +28,17 @@ static int max_size;
>
> struct block_list *block_head;
>
> -int read_block(char *buf, FILE *fin)
> +int read_block(char *buf, int buf_size, FILE *fin)
> {
> - int ret = 0;
> - int hit = 0;
> - int val;
> - char *curr = buf;
> -
> - for (;;) {
> - val = getc(fin);
> - if (val == EOF) return -1;
> - *curr = val;
> - ret++;
> - if (*curr == '\n' && hit == 1)
> - return ret - 1;
> - else if (*curr == '\n')
> - hit = 1;
> - else
> - hit = 0;
> - curr++;
> + char *curr = buf, *const buf_end = buf + buf_size;
> +
> + while (buf_end - curr > 1 && fgets(curr, buf_end - curr, fin)) {
> + if (*curr == '\n') /* empty line */
> + return curr - buf;
> + curr += strlen(curr);
> }
> +
> + return -1; /* EOF or no space left in buf. */
> }
>
> static int compare_txt(struct block_list *l1, struct block_list *l2)
> @@ -84,10 +75,12 @@ static void add_list(char *buf, int len)
> }
> }
>
> +#define BUF_SIZE 1024
> +
> int main(int argc, char **argv)
> {
> FILE *fin, *fout;
> - char buf[1024];
> + char buf[BUF_SIZE];
> int ret, i, count;
> struct block_list *list2;
> struct stat st;
> @@ -106,11 +99,10 @@ int main(int argc, char **argv)
> list = malloc(max_size * sizeof(*list));
>
> for(;;) {
> - ret = read_block(buf, fin);
> + ret = read_block(buf, BUF_SIZE, fin);
> if (ret < 0)
> break;
>
> - buf[ret] = '\0';
> add_list(buf, ret);
> }
>
> --
> 1.7.9.5
>
--
Best regards, _ _
.o. | Liege of Serenely Enlightened Majesty of o' \,=./ `o
..o | Computer Science, Michał “mina86” Nazarewicz (o o)
ooo +----<email/xmpp: mpn@google.com>--------------ooO--(_)--Ooo--
[-- Attachment #2.1: Type: text/plain, Size: 0 bytes --]
[-- Attachment #2.2: Type: application/pgp-signature, Size: 835 bytes --]
WARNING: multiple messages have this Message-ID (diff)
From: Michal Nazarewicz <mina86@mina86.com>
To: Minchan Kim <minchan@kernel.org>,
Andrew Morton <akpm@linux-foundation.org>
Cc: linux-mm@kvack.org, linux-kernel@vger.kernel.org,
Minchan Kim <minchan@kernel.org>, Mel Gorman <mgorman@suse.de>,
Andy Whitcroft <apw@shadowen.org>,
Alexander Nyberg <alexn@dsv.su.se>,
Randy Dunlap <rdunlap@infradead.org>
Subject: Re: [PATCH v2 2/2] Enhance read_block of page_owner.c
Date: Fri, 11 Jan 2013 17:01:29 +0100 [thread overview]
Message-ID: <xa1t8v7zbteu.fsf@mina86.com> (raw)
In-Reply-To: <1357871401-7075-2-git-send-email-minchan@kernel.org>
[-- Attachment #1: Type: text/plain, Size: 3413 bytes --]
It occurred to me -- and I know it will sound like a heresy -- that
maybe providing an overly long example in C is not the best option here.
Why not page_owner.py with the following content instead (not tested):
#!/usr/bin/python
import collections
import sys
counts = collections.defaultdict(int)
txt = ''
for line in sys.stdin:
if line == '\n':
counts[txt] += 1
txt = ''
else:
txt += line
counts[txt] += 1
for txt, num in sorted(counts.items(), txt=lambda x: x[1]):
if len(txt) > 1:
print '%d times:\n%s' % num, txt
And it's so “long” only because I chose not to read the whole file at
once as in:
counts = collections.defaultdict(int)
for txt in sys.stdin.read().split('\n\n'):
counts[txt] += 1
On Fri, Jan 11 2013, Minchan Kim wrote:
> The read_block reads char one by one until meeting two newline.
> It's not good for the performance and current code isn't good shape
> for readability.
>
> This patch enhances speed and clean up.
>
> Cc: Mel Gorman <mgorman@suse.de>
> Cc: Andy Whitcroft <apw@shadowen.org>
> Cc: Alexander Nyberg <alexn@dsv.su.se>
> Cc: Randy Dunlap <rdunlap@infradead.org>
> Signed-off-by: Michal Nazarewicz <mina86@mina86.com>
> Signed-off-by: Minchan Kim <minchan@kernel.org>
> ---
> Documentation/page_owner.c | 34 +++++++++++++---------------------
> 1 file changed, 13 insertions(+), 21 deletions(-)
>
> diff --git a/Documentation/page_owner.c b/Documentation/page_owner.c
> index 43dde96..96bf481 100644
> --- a/Documentation/page_owner.c
> +++ b/Documentation/page_owner.c
> @@ -28,26 +28,17 @@ static int max_size;
>
> struct block_list *block_head;
>
> -int read_block(char *buf, FILE *fin)
> +int read_block(char *buf, int buf_size, FILE *fin)
> {
> - int ret = 0;
> - int hit = 0;
> - int val;
> - char *curr = buf;
> -
> - for (;;) {
> - val = getc(fin);
> - if (val == EOF) return -1;
> - *curr = val;
> - ret++;
> - if (*curr == '\n' && hit == 1)
> - return ret - 1;
> - else if (*curr == '\n')
> - hit = 1;
> - else
> - hit = 0;
> - curr++;
> + char *curr = buf, *const buf_end = buf + buf_size;
> +
> + while (buf_end - curr > 1 && fgets(curr, buf_end - curr, fin)) {
> + if (*curr == '\n') /* empty line */
> + return curr - buf;
> + curr += strlen(curr);
> }
> +
> + return -1; /* EOF or no space left in buf. */
> }
>
> static int compare_txt(struct block_list *l1, struct block_list *l2)
> @@ -84,10 +75,12 @@ static void add_list(char *buf, int len)
> }
> }
>
> +#define BUF_SIZE 1024
> +
> int main(int argc, char **argv)
> {
> FILE *fin, *fout;
> - char buf[1024];
> + char buf[BUF_SIZE];
> int ret, i, count;
> struct block_list *list2;
> struct stat st;
> @@ -106,11 +99,10 @@ int main(int argc, char **argv)
> list = malloc(max_size * sizeof(*list));
>
> for(;;) {
> - ret = read_block(buf, fin);
> + ret = read_block(buf, BUF_SIZE, fin);
> if (ret < 0)
> break;
>
> - buf[ret] = '\0';
> add_list(buf, ret);
> }
>
> --
> 1.7.9.5
>
--
Best regards, _ _
.o. | Liege of Serenely Enlightened Majesty of o' \,=./ `o
..o | Computer Science, Michał “mina86” Nazarewicz (o o)
ooo +----<email/xmpp: mpn@google.com>--------------ooO--(_)--Ooo--
[-- Attachment #2.1: Type: text/plain, Size: 0 bytes --]
[-- Attachment #2.2: Type: application/pgp-signature, Size: 835 bytes --]
next prev parent reply other threads:[~2013-01-11 16:01 UTC|newest]
Thread overview: 17+ messages / expand[flat|nested] mbox.gz Atom feed top
2013-01-11 2:30 [PATCH v2 1/2] Fix wrong EOF compare Minchan Kim
2013-01-11 2:30 ` Minchan Kim
2013-01-11 2:30 ` [PATCH v2 2/2] Enhance read_block of page_owner.c Minchan Kim
2013-01-11 2:30 ` Minchan Kim
2013-01-11 16:01 ` Michal Nazarewicz [this message]
2013-01-11 16:01 ` Michal Nazarewicz
2013-01-14 2:33 ` Minchan Kim
2013-01-14 2:33 ` Minchan Kim
2013-01-14 8:27 ` Michal Nazarewicz
2013-01-11 14:21 ` [PATCH v2 1/2] Fix wrong EOF compare Michal Nazarewicz
2013-01-11 14:21 ` Michal Nazarewicz
2013-01-13 11:44 ` Rob Landley
2013-01-13 11:44 ` Rob Landley
2013-01-13 18:15 ` Randy Dunlap
2013-01-13 18:15 ` Randy Dunlap
2013-01-31 10:25 ` Rob Landley
2013-01-31 10:25 ` Rob Landley
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=xa1t8v7zbteu.fsf@mina86.com \
--to=mina86@mina86.com \
--cc=akpm@linux-foundation.org \
--cc=alexn@dsv.su.se \
--cc=apw@shadowen.org \
--cc=linux-kernel@vger.kernel.org \
--cc=linux-mm@kvack.org \
--cc=mgorman@suse.de \
--cc=minchan@kernel.org \
--cc=rdunlap@infradead.org \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.