From: Bill Davidsen <davidsen@tmr.com>
To: davids@webmaster.com
Cc: david@lang.hm, linux-kernel@vger.kernel.org
Subject: Re: scheduling oddity on 2.6.20.3 stock
Date: Sat, 02 Jun 2007 18:12:03 -0400 [thread overview]
Message-ID: <4661EB33.8010002@tmr.com> (raw)
In-Reply-To: <MDEHLPKNGKAHNMBLJOLKMEIIDNAC.davids@webmaster.com>
David Schwartz wrote:
>>>>> bunzip2 -c $file.bz2 |gzip -9 >$file.gz
>
> So here are some actual results from a dual P3-1Ghz machine (2.6.21.1,
> CFSv9). First lets time each operation individually:
>
> $ time bunzip2 -k linux-2.6.21.tar.bz2
>
> real 1m5.626s
> user 1m2.240s
> sys 0m3.144s
>
>
> $ time gzip -9 linux-2.6.21.tar
>
> real 1m17.652s
> user 1m15.609s
> sys 0m1.912s
>
> The compress was the most complex (no surprise there) but they are close
> enough that efficient overlap will definitely affect the total wall time. If
> we can both decompress and compress in 1:17, we are optimal. First, let's
> try the normal way:
>
> $ time (bunzip2 -c linux-2.6.21.tar.bz2 | gzip -9 > test1)
>
> real 1m45.051s
> user 2m16.945s
> sys 0m2.752s
>
> 1:45, or 1/3 over optimal. Now, with a 32MB non-blocking cache between the
> two processes ('accel' creates a 32MB cache and uses 'select' to fill from
> stdin and empty to stdout without blocking either direction):
>
> $ time (bunzip2 -c linux-2.6.21.tar.bz2 | ./accel | gzip -9 > test2)
>
> real 1m18.361s
> user 2m19.589s
> sys 0m6.356s
>
> Within testing accuracy of optimal.
>
> So it's not the scheduler. It's the fact that bunzip2/gzip have inadequate
> input/output buffering. I don't think it's unreasonable to consider this a
> defect in those programs.
>
They are hardly designed to optimize this operation...
For a tunable buffer program allowing the buffer size and buffers in the
pool to be set, see www.tmr.com/~public/source program ptbuf. I wrote it
as a proof of concept for a pthreads presentation I was giving, and it
happened to be useful.
--
Bill Davidsen <davidsen@tmr.com>
"We have more to fear from the bungling of the incompetent than from
the machinations of the wicked." - from Slashdot
prev parent reply other threads:[~2007-06-02 22:11 UTC|newest]
Thread overview: 7+ messages / expand[flat|nested] mbox.gz Atom feed top
2007-05-03 1:31 scheduling oddity on 2.6.20.3 stock david
2007-05-03 21:10 ` David Schwartz
2007-05-03 21:25 ` david
2007-05-16 19:49 ` david
2007-05-16 20:06 ` David Schwartz
2007-05-16 20:59 ` David Schwartz
2007-06-02 22:12 ` Bill Davidsen [this message]
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=4661EB33.8010002@tmr.com \
--to=davidsen@tmr.com \
--cc=david@lang.hm \
--cc=davids@webmaster.com \
--cc=linux-kernel@vger.kernel.org \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.