linux-kernel.vger.kernel.org archive mirror
 help / color / mirror / Atom feed
From: Dave Jones <davej@codemonkey.org.uk>
To: linux-kernel@vger.kernel.org
Subject: Re: [CFT] faster athlon/duron memory copy implementation
Date: Thu, 24 Oct 2002 19:56:37 +0100	[thread overview]
Message-ID: <20021024185637.GB10584@suse.de> (raw)
In-Reply-To: <20021024184328.GA5667@himi.org>

On Fri, Oct 25, 2002 at 04:43:28AM +1000, Simon Fowler wrote:
 > <deletia>
 > 
 > copy_page() tests 
 > copy_page function 'warm up run'         took 12855 cycles per page
 > copy_page function '2.4 non MMX'         took 17267 cycles per page
 > copy_page function '2.4 MMX fallback'    took 14930 cycles per page
 > copy_page function '2.4 MMX version'     took 10642 cycles per page
 > copy_page function 'faster_copy'         took 10591 cycles per page
 > copy_page function 'even_faster'         took 13035 cycles per page
 > copy_page function 'no_prefetch'         took 11657 cycles per page
 > Athlon test program $Id: fast.c,v 1.6 2000/09/23 09:05:45 arjan Exp $ 
 > 
 > copy_page() tests 
 > copy_page function 'warm up run'         took 12871 cycles per page
 > copy_page function '2.4 non MMX'         took 18482 cycles per page
 > copy_page function '2.4 MMX fallback'    took 15013 cycles per page
 > copy_page function '2.4 MMX version'     took 10679 cycles per page
 > copy_page function 'faster_copy'         took 12268 cycles per page
 > copy_page function 'even_faster'         took 10789 cycles per page
 > copy_page function 'no_prefetch'         took 11691 cycles per page
 > Athlon test program $Id: fast.c,v 1.6 2000/09/23 09:05:45 arjan Exp $ 
 > 
 > copy_page() tests 
 > copy_page function 'warm up run'         took 13110 cycles per page
 > copy_page function '2.4 non MMX'         took 14958 cycles per page
 > copy_page function '2.4 MMX fallback'    took 14952 cycles per page
 > copy_page function '2.4 MMX version'     took 12864 cycles per page
 > copy_page function 'faster_copy'         took 10581 cycles per page
 > copy_page function 'even_faster'         took 10629 cycles per page
 > copy_page function 'no_prefetch'         took 11607 cycles per page

Wow. The 612 really sucked badly here. I think this is the only
time I've seen 'even_faster' lose.  A few people (myself included)
have in the past talked about making the memory copy routines
do a boot time benchmark somewhat like the RAID code does to deduce
the best.  Seeing results like this makes me really believe this is
the way forward.

With something like that inplace, we could then have seperate
implementations for each processor revision if needbe without
pessimising for earlier revisions.

		Dave

-- 
| Dave Jones.        http://www.codemonkey.org.uk

  parent reply	other threads:[~2002-10-24 18:48 UTC|newest]

Thread overview: 51+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2002-10-24 17:15 [CFT] faster athlon/duron memory copy implementation Manfred Spraul
2002-10-24 17:37 ` Robert Love
2002-10-24 18:05   ` Zach Brown
2002-10-24 17:41 ` Andreas Steinmetz
2002-10-24 17:48 ` Matthias Welk
2002-10-24 19:01   ` erich
2002-10-24 19:11     ` Arjan van de Ven
2002-10-24 19:38     ` Manfred Spraul
2002-10-25  0:59       ` Panagiotis Papadakos
2002-10-24 17:53 ` Roger Luethi
2002-10-24 18:10 ` Daniel Egger
2002-10-24 19:15   ` Florin Iucha
2002-10-24 19:28   ` Manfred Spraul
2002-10-24 19:38     ` Dave Jones
2002-10-24 19:43     ` Ken Witherow
2002-10-25 13:08     ` Daniel Egger
2002-10-24 18:17 ` Eric Lammerts
2002-10-24 18:26 ` David Rees
2002-10-24 18:35 ` Josh McKinney
2002-10-24 18:36 ` Dave Jones
2002-10-24 18:43 ` Simon Fowler
2002-10-24 18:50   ` Simon Fowler
2002-10-24 18:56   ` Dave Jones [this message]
2002-10-24 18:48 ` Ernst Herzberg
2002-10-24 20:09   ` Ed Sweetman
2002-10-24 20:13     ` Robert Love
2002-10-24 20:31       ` Ed Sweetman
2002-10-24 20:49         ` Dave Jones
2002-10-24 20:26     ` Dave Jones
2002-10-25  9:19       ` Måns Rullgård
2002-10-24 19:11 ` Marcus Libäck
2002-10-24 19:19 ` Brian Gerst
2002-10-24 19:31 ` Matthias Schniedermeyer
2002-10-24 19:33 ` Pascal Schmidt
2002-10-24 19:39 ` Olaf Dietsche
2002-10-24 20:27 ` Mike Civil
2002-10-24 20:44 ` Willy TARREAU
2002-10-24 21:46 ` Josh Fryman
2002-10-24 22:18 ` Tim Schmielau
2002-10-24 23:09 ` Hirokazu Takahashi
2002-10-24 23:37 ` Ryan Cumming
2002-10-25  0:10 ` Matthias Andree
2002-10-25  8:35 ` venom
2002-10-25 13:31 ` Denis Vlasenko
2002-10-26 12:11 ` Jurjen Oskam
  -- strict thread matches above, loose matches on Subject: below --
2002-10-24 18:27 Shawn Starr
2002-10-24 20:51 Dieter Nützel
2002-10-24 21:01 ` Dieter Nützel
2002-10-24 21:16 ` Willy TARREAU
2002-10-24 22:01 Harm Verhagen
2002-10-25 16:29 Jorge Bernal "Koke"

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=20021024185637.GB10584@suse.de \
    --to=davej@codemonkey.org.uk \
    --cc=linux-kernel@vger.kernel.org \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).