From: Jan Kiszka <jan.kiszka@domain.hid>
To: xenomai-core <xenomai@xenomai.org>
Cc: Daniel.Rossier@domain.hid
Subject: [Xenomai-core] [RFC] Micro-optimisations for the libs
Date: Sat, 06 May 2006 10:09:45 +0200 [thread overview]
Message-ID: <445C59C9.8020107@domain.hid> (raw)
[-- Attachment #1: Type: text/plain, Size: 2789 bytes --]
Hi,
[Daniel, I put you in the CC as you showed some interest in this topic.]
as I indicated a some weeks ago, I had a closer look at the code the
user space libs currently produce (on x86). The following considerations
are certainly not worth noticeable microseconds on GHz boxes, but they
may buy us (yet another) few micros on low-end.
First of all, there is some redundant code in the syscall path of each
skin service. This is due to the fact that the function code is
calculated based on the the skin mux id each time a service is invoked.
The mux id has to be shifted and masked in order to combine it with the
constant function code part - this could also easily happen
ahead-of-time, saving code and cycles for each service entry point.
Here is a commented disassembly of some simple native skin service which
only takes one argument.
Function prologue:
460: 55 push %ebp
461: 89 e5 mov %esp,%ebp
463: 57 push %edi
464: 83 ec 10 sub $0x10,%esp
Loading the skin mux-id:
467: a1 00 00 00 00 mov 0x0,%eax
Loading the argument (here: some pointer)
46c: 8b 7d 08 mov 0x8(%ebp),%edi
Calculating the function code:
46f: c1 e0 10 shl $0x10,%eax
472: 25 00 00 ff 00 and $0xff0000,%eax
477: 0d 2b 02 00 08 or $0x800022b,%eax
Saving the code:
47c: 89 45 f8 mov %eax,0xfffffff8(%ebp)
47f: 53 push %ebx
Loading the arguments (here only one):
480: 89 fb mov %edi,%ebx
Restoring the code again, issuing the syscall:
482: 8b 45 f8 mov 0xfffffff8(%ebp),%eax
485: cd 80 int $0x80
487: 5b pop %ebx
Function epilogue:
488: 83 c4 10 add $0x10,%esp
48b: 5f pop %edi
48c: 5d pop %ebp
48d: c3 ret
Looking at this code, I also started thinking about inlining short and
probably heavily-used functions into the user code. This would save the
function prologue/epilogue both in the lib and the user code itself. For
sure, it only makes sense for time-critical functions (think of
mutex_lock/unlock or rt_timer_read). But inlining could be made optional
for the user by providing both the library variant and the inlined
version. The users could then select the preferred one by #defining some
control switch before including the skin headers.
Any thoughts on this? And, almost more important, anyone around willing
to work on these optimisations and evaluate the results? I can't ATM.
Jan
[-- Attachment #2: OpenPGP digital signature --]
[-- Type: application/pgp-signature, Size: 250 bytes --]
next reply other threads:[~2006-05-06 8:09 UTC|newest]
Thread overview: 4+ messages / expand[flat|nested] mbox.gz Atom feed top
2006-05-06 8:09 Jan Kiszka [this message]
2006-05-06 10:37 ` [Xenomai-core] [RFC] Micro-optimisations for the libs Philippe Gerum
2006-05-08 10:51 ` Wolfgang Grandegger
2006-05-08 12:01 ` Jan Kiszka
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=445C59C9.8020107@domain.hid \
--to=jan.kiszka@domain.hid \
--cc=Daniel.Rossier@domain.hid \
--cc=xenomai@xenomai.org \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.