Linux PARISC architecture development
 help / color / mirror / Atom feed
From: Randolph Chung <randolph@tausq.org>
To: Grant Grundler <grundler@parisc-linux.org>
Cc: parisc-linux@lists.parisc-linux.org
Subject: Re: [parisc-linux] 2.6.10-rc1-pa11 profile data
Date: Thu, 11 Nov 2004 00:11:54 -0800	[thread overview]
Message-ID: <20041111081154.GR15714@tausq.org> (raw)
In-Reply-To: <20041111075431.GB9768@colo.lackof.org>

> I've collect two profiles for -64SMP and will collect
> some UP profiles tomorrow. profiles so far are measuring
> a full kernel build. I expect I'll do the same for -64UP
> kernels too.

hmm.. interesting. top consumers are (with idle loop functions removed)

 40646 flush_kernel_icache_page                 406.4600
  7364 fdsync                                   368.2000
 10567 flush_user_dcache_range_asm              293.5278
 10387 flush_user_icache_range_asm              288.5278
 21409 __clear_user_page_asm                    191.1518
  5356 _spin_lock_irqsave                       111.5833
  1768 fisync                                   110.5000
  1928 _spin_lock                                48.2000
  4255 purge_kernel_dcache_page                  42.5500
   339 $lclu_done                                42.3750
  4089 flush_kernel_dcache_page                  40.8900
  5053 copy_user_page_asm                        33.2434
   569 _write_unlock_irq                         17.7812
   422 _spin_unlock                              17.5833
  1567 find_vma_prev                             16.3229
   181 $lslen_loop                               15.0833
    96 $lslen_done                               12.0000
   996 _write_trylock                            11.3182
   137 $lsfu_loop                                 8.5625
   748 flush_user_dcache_page                     7.4800

we really need to do better at cache flushing..... anybody have any
ideas? :)

but looking at the other ones:
- __clear_user_page_asm can be optimized for 64-bit by writing 8 bytes
  at a time instead of 4
- _spin_lock* needs investigation to see if we have some bad locks
  someplace. lockmeter anybody?
- *lclu* can be rewritten to do better than 1-byte at a time
- copy_user_page_asm can be sped up slightly by using pa_memcpy, but not
  much when i tried last time
- *lslen* can also probably be written in a smarter way... 

i suspect some areas for further investigation are:
- can we do tlb_flush_mm() in a smarter way for SMP?
- can we improve kernel entry time for interrupts (and syscalls) by
  being smarter about what we save on the stack? (i.e. only callee-save
  registers and not all the registers?)

volunteers? :)

randolph
-- 
Randolph Chung
Debian GNU/Linux Developer, hppa/ia64 ports
http://www.tausq.org/
_______________________________________________
parisc-linux mailing list
parisc-linux@lists.parisc-linux.org
http://lists.parisc-linux.org/mailman/listinfo/parisc-linux

  reply	other threads:[~2004-11-11  8:11 UTC|newest]

Thread overview: 25+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2004-11-11  7:54 [parisc-linux] 2.6.10-rc1-pa11 profile data Grant Grundler
2004-11-11  8:11 ` Randolph Chung [this message]
2004-11-11 17:39   ` Carlos O'Donell
2004-11-11 17:42     ` Randolph Chung
2004-11-11 17:50       ` Matthew Wilcox
2004-11-11 17:59         ` Randolph Chung
2004-11-11 18:36           ` Grant Grundler
2004-11-11 18:23   ` Joel Soete
2004-11-11 18:51     ` Randolph Chung
2004-11-26 16:59   ` flush_kernel_[di]cache_page question? [WAS: " Joel Soete
2004-11-26 17:13     ` Randolph Chung
2004-11-26 19:02     ` Grant Grundler
2004-11-28 21:01   ` [id]cache meaning? [Was: [parisc-linux] 2.6.10-rc1-pa11 profile data] Joel Soete
2004-11-28 21:13     ` Matthew Wilcox
2004-11-29  1:14       ` Michael S. Zick
2004-11-29  2:00         ` Matthew Wilcox
2004-12-01 17:44   ` More questions " Joel Soete
2004-12-01 17:56     ` Matthew Wilcox
2004-12-01 18:33       ` Joel Soete
2004-12-03 10:24     ` Joel Soete
2004-12-03 15:41       ` Randolph Chung
2004-12-07 14:42         ` Joel Soete
2004-12-03 15:00   ` *lcul and memory granularity question[Was: " Joel Soete
2004-12-03 15:13     ` Matthew Wilcox
2004-11-12  5:29 ` [parisc-linux] 2.6.10-rc1-pa11 profile data Grant Grundler

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=20041111081154.GR15714@tausq.org \
    --to=randolph@tausq.org \
    --cc=grundler@parisc-linux.org \
    --cc=parisc-linux@lists.parisc-linux.org \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox