From: Jamie Lokier <lk@tantalophile.demon.co.uk>
To: Zack Weinberg <zack@codesourcery.com>
Cc: linux-kernel@vger.kernel.org, torvalds@transmeta.com
Subject: Re: [PATCH] Set TIF_IRET in more places
Date: Tue, 7 Jan 2003 11:19:05 +0000 [thread overview]
Message-ID: <20030107111905.GA949@bjl1.asuk.net> (raw)
In-Reply-To: <87isx2dktj.fsf@egil.codesourcery.com>
Zack Weinberg wrote:
> Consider SA_RESTORER - there isn't a guarantee that user space will
> use the same code as the kernel's trampoline. glibc happens to, but
> only because GDB has a hardwired idea of what a signal trampoline
> looks like. Of course, you could simply document that sigreturn() is
> another of the system calls that must be made through int 0x80.
Glibc must use the same code as the kernel's trampoline because of
MD_FALLBACK_FRAME_STATE_FOR() in GCC's exception handling... (or
libgcc.so must change).
It explicitly checks for the opcode sequences 0x58b877000000cd80 and
0xb8ad000000cd80 in order to unwind exception frames around a
handled signal. Ugly, isn't it?
> It occurs to me that the kernel-provided signal trampoline could go in
> the page at 0xffff0000, instead of on the user stack, which would
> eliminate the need for glibc to set SA_RESTORER (it's a pure
> optimization).
Yup.
> Tangentially, I've seen people claim that the trampoline ought to be
> able to avoid entering the kernel, although I'm not convinced (how
> does the signal mask get reset, otherwise?)
Welcome to a wonderful if rather unsightly optimisation:
1. libc installs its own handler function for all non-SIG_DFL signals,
and sigaction() mostly updates a table in userspace.
2. The libc signal handler redirects all signals to the application
through a funky trampoline in libc.
3. A signal mask is maintained in userspace. Also, a pending mask
is maintained in userspace.
4. When a signal is delivered, libc's handler function checks the
userspace signal mask. If the signal should be blocked, and it
is possible to block it, it is marked as pending in _userspace_
pending mask, and the userspace signal mask is propagated to the
kernel to prevent further signals queuing up. Any siginfo_t is
also saved for tha signa. Then libc's handler returns, without
calling the application handler (because that is deferred).
5. When a signal is unblocked from the userspace signal mask, if it
is in the userspace pending mask, it is synthetically delivered
by userspace, which creates a context _as if_ the kernel had
delivered the signal.
By this mechanism, calls to unblock signals from the signal mask can
be done without entering the kernel, because the unblocking can be
done lazily.
Voila! sigreturn() can be written to avoid entering the kernel. Note
that this is possible _now_, with no changes to the kernel. It only
requires changes to libc. I think it would work on all architectures,
not just i386. (It may also be possible to do it without libc help,
in the vsyscall page).
-- Jamie
ps. A similar optimisation allows "spin_lock_irqsave" and
"spin_unlock_irqrestore" to avoid using the cli & sti instructions.
Spin locks already modify the preempt_count, so use a bit of that to
hold the synthetic interrupt-disabled flag, at zero cost... :)
next prev parent reply other threads:[~2003-01-07 11:07 UTC|newest]
Thread overview: 15+ messages / expand[flat|nested] mbox.gz Atom feed top
2003-01-06 21:34 [PATCH] Set TIF_IRET in more places Zack Weinberg
2003-01-07 11:19 ` Jamie Lokier [this message]
2003-01-07 17:42 ` Linus Torvalds
2003-01-07 19:27 ` Zack Weinberg
2003-01-08 1:21 ` Richard Henderson
2003-01-08 2:33 ` Linus Torvalds
2003-01-08 3:32 ` Daniel Jacobowitz
2003-01-08 16:29 ` Jamie Lokier
2003-01-09 6:49 ` Zack Weinberg
-- strict thread matches above, loose matches on Subject: below --
2003-01-06 14:46 Luca Barbieri
2003-01-06 16:04 ` Linus Torvalds
2003-01-06 18:17 ` Luca Barbieri
2003-01-06 18:42 ` Ulrich Drepper
2003-01-06 18:49 ` Linus Torvalds
2003-01-06 20:26 ` Luca Barbieri
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=20030107111905.GA949@bjl1.asuk.net \
--to=lk@tantalophile.demon.co.uk \
--cc=linux-kernel@vger.kernel.org \
--cc=torvalds@transmeta.com \
--cc=zack@codesourcery.com \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox