sigaction.2: clarification for SA

linux-man.vger.kernel.org archive mirror
 help / color / mirror / Atom feed

* sigaction.2: clarification for SA_NODEFER needed
@ 2020-11-11  0:42 Heinrich Schuchardt
  2020-11-12 15:37 ` Michael Kerrisk (man-pages)
  0 siblings, 1 reply; 14+ messages in thread
From: Heinrich Schuchardt @ 2020-11-11  0:42 UTC (permalink / raw)
  To: linux-man, Michael Kerrisk

Hello Michael,

I have been writing a handler for SIGILL and SIGSEGV which restarts the
program using execv() if an exception occurs. The handler never returns.

From the description of SA_NODEFER it was not evident that even if the
process is restarted with execv() the signal remains masked if
SA_NODEFER is not set as a flag.

It think this behavior deserves mentioning on the sigaction.2 manpage, e.g.

"Do not prevent the signal from being received from within its own
signal handler. A signal handler call is not terminated by calling
execv() as the pending signal property is inherited by the new process."

On the signal.7 manpage there is a paragraph "Signal mask and pending
signals". Here pending signals are mentioned. There is a sentence
"Between the time when it is generated and when it is delivered a signal
is said to be pending."

To me "delivered" means the instance when the signal handler is called
and not the instance when the signal handler returns. So said sentence
should be reworked, e.g.

"Between the time when it is generated and when the signal handler
returns a signal is said to be pending."

Best regards

Heinrich

^ permalink raw reply	[flat|nested] 14+ messages in thread

* Re: sigaction.2: clarification for SA_NODEFER needed
  2020-11-11  0:42 sigaction.2: clarification for SA_NODEFER needed Heinrich Schuchardt
@ 2020-11-12 15:37 ` Michael Kerrisk (man-pages)
  2020-11-12 16:25   ` Heinrich Schuchardt
  0 siblings, 1 reply; 14+ messages in thread
From: Michael Kerrisk (man-pages) @ 2020-11-12 15:37 UTC (permalink / raw)
  To: Heinrich Schuchardt, linux-man; +Cc: mtk.manpages

Hello Heinrich,

On 11/11/20 1:42 AM, Heinrich Schuchardt wrote:
> Hello Michael,
> 
> I have been writing a handler for SIGILL and SIGSEGV which restarts the
> program using execv() if an exception occurs. The handler never returns.
> 
>>From the description of SA_NODEFER it was not evident that even if the
> process is restarted with execv() the signal remains masked if
> SA_NODEFER is not set as a flag.
> 
> It think this behavior deserves mentioning on the sigaction.2 manpage, e.g.
> 
> "Do not prevent the signal from being received from within its own
> signal handler. A signal handler call is not terminated by calling
> execv() as the pending signal property is inherited by the new process."
> 
> On the signal.7 manpage there is a paragraph "Signal mask and pending
> signals". Here pending signals are mentioned. There is a sentence
> "Between the time when it is generated and when it is delivered a signal
> is said to be pending."
> 
> To me "delivered" means the instance when the signal handler is called
> and not the instance when the signal handler returns. So said sentence
> should be reworked, e.g.
> 
> "Between the time when it is generated and when the signal handler
> returns a signal is said to be pending."

Your description above seems to conflate two concepts: the signal mask
and pending signals. The sentence that you propose reworking is
correct as it stands.

The sequence of events goes like this:

1. Signal is generated

[At this point the signal is pending, but typically the interval
between this step and the next is typically so brief that the fact
that the signal is pending is unobservable]

2. The kernel delivers the signal to the process:

+ The signal is removed from the set of pending signals.
+ The signal is added to the signal mask (unless SA_NOFER
  was specified)
+ The kernel constructs a frame for the signal handler on the
  user-space stack. The return address in that frame points to a
  small piece of code in user-space called the signal trampoline.
+ The kernel passes control back to the process with execution
  commencing at the start of the signal handler.

3. The signal handler executes.

[At this point, the signal is not pending, but it is present in the
signal mask.]

4. The signal handler returns.

5. Control passes to the signal trampoline, which calls
   sigreturn(2).

6. When sigreturn(2) is called, the kernel once more has control
   and restored various pieces of process state (e.g., the signal
   mask) to the values they had before the signal handler was
   invoked.

7. At completion of the sigreturn(2) system call, the kernel passes control
   back to the user-space program with execution recommencing at the
   point where the main program was interrupted by the signal handler.

Now, if you interrupt things before step 4 (e.g., with exec(),
swapcontext(3), or possibly--depending on how sigsetjmp() was 
called--siglongjmp()), then of course the remaining steps are not
performed.

I've added a few words to the SA_NODEFER description to hopefully
further clarify what it does:

       SA_NODEFER
              Do not prevent the signal from being received  from  within
              its  own signal handler (i.e., do not add the signal to the
              thread's signal mask while the handler is executing).

Probably, the signal(7) manual page could say more about all of this.
What would you think of the following text to be added to that page

   Execution of signal handlers
       Whenever  there is a transition from kernel-mode to user-mode exe‐
       cution (e.g., on return from a system  call  or  scheduling  of  a
       thread onto the CPU), the kernel checks whether there is a pending
       signal for which the process has established a signal handler.  If
       there is such a pending signal, the following steps occur:

       1. The  kernel performs the necessary preparatory steps for execu‐
          tion of the signal handler:

          a) The signal is removed from the set of pending signals.

          b) If the thread has defined an alternate signal  stack  (using
             sigaltstack(2)), then that stack is installed.

          c) Various  pieces  of  signal-related context are saved into a
             "hidden" frame that is created on the stack.  The saved  in‐
             formation includes:

             + the  program  counter  register  (i.e., the address of the
               next instruction in the main program that should  be  exe‐
               cuted when the signal handler returns);

             + the thread's current signal mask;

             + the thread's alternate signal stack settings.

          d) The  thread's  signal  mask is adjusted by adding the signal
             (unless the handler was  established  using  the  SA_NODEFER
             flag)  as  well  as  any  additional  signals  specified  in
             act->sa_mask when sigaction(2) was called.

       2. The kernel constructs a frame for the  signal  handler  on  the
          stack.  Within that frame, the return address points to a piece
          of user-space code called the signal trampoline  (described  in
          sigreturn(2)).

       3. The  kernel  passes control back to user-space, where execution
          commences at the start of the signal handler function.

       4. When the signal handler returns, control passes to  the  signal
          trampoline code.

       5. The  signal  trampoline  calls sigreturn(2), a system call that
          uses the information in the "hidden" stack frame to restore the
          thread's  signal  mask  and  alternate  stack settings to their
          state before the signal handler was called.  Upon completion of
          the  call to sigreturn(2), the kernel transfers control back to
          user space, and the thread recommences execution at  the  point
          where it was interrupted by the signal handler.

       Note  that if the signal handler does not return (e.g., control is
       transferred out of the  handler  using  sigsetjmp(3)  or  swapcon‐
       text(3),  or  the  handler executes a new program with execve(2)),
       then the final step is not performed.  In particular, in such sce‐
       narios it is the programmer's responsibility to restore that state
       of the signal mask, if that is desired.
?

Thanks,

Michael
-- 
Michael Kerrisk
Linux man-pages maintainer; http://www.kernel.org/doc/man-pages/
Linux/UNIX System Programming Training: http://man7.org/training/

^ permalink raw reply	[flat|nested] 14+ messages in thread

* Re: sigaction.2: clarification for SA_NODEFER needed
  2020-11-12 15:37 ` Michael Kerrisk (man-pages)
@ 2020-11-12 16:25   ` Heinrich Schuchardt
  2020-11-12 20:45     ` Michael Kerrisk (man-pages)
  0 siblings, 1 reply; 14+ messages in thread
From: Heinrich Schuchardt @ 2020-11-12 16:25 UTC (permalink / raw)
  To: Michael Kerrisk (man-pages), linux-man

On 12.11.20 16:37, Michael Kerrisk (man-pages) wrote:
> Hello Heinrich,
>
> On 11/11/20 1:42 AM, Heinrich Schuchardt wrote:
>> Hello Michael,
>>
>> I have been writing a handler for SIGILL and SIGSEGV which restarts the
>> program using execv() if an exception occurs. The handler never returns.
>>
>> >From the description of SA_NODEFER it was not evident that even if the
>> process is restarted with execv() the signal remains masked if
>> SA_NODEFER is not set as a flag.
>>
>> It think this behavior deserves mentioning on the sigaction.2 manpage, e.g.
>>
>> "Do not prevent the signal from being received from within its own
>> signal handler. A signal handler call is not terminated by calling
>> execv() as the pending signal property is inherited by the new process."
>>
>> On the signal.7 manpage there is a paragraph "Signal mask and pending
>> signals". Here pending signals are mentioned. There is a sentence
>> "Between the time when it is generated and when it is delivered a signal
>> is said to be pending."
>>
>> To me "delivered" means the instance when the signal handler is called
>> and not the instance when the signal handler returns. So said sentence
>> should be reworked, e.g.
>>
>> "Between the time when it is generated and when the signal handler
>> returns a signal is said to be pending."
>
> Your description above seems to conflate two concepts: the signal mask
> and pending signals. The sentence that you propose reworking is
> correct as it stands.
>
> The sequence of events goes like this:
>
> 1. Signal is generated
>
> [At this point the signal is pending, but typically the interval
> between this step and the next is typically so brief that the fact
> that the signal is pending is unobservable]
>
> 2. The kernel delivers the signal to the process:
>
> + The signal is removed from the set of pending signals.
> + The signal is added to the signal mask (unless SA_NOFER
>   was specified)
> + The kernel constructs a frame for the signal handler on the
>   user-space stack. The return address in that frame points to a
>   small piece of code in user-space called the signal trampoline.
> + The kernel passes control back to the process with execution
>   commencing at the start of the signal handler.
>
> 3. The signal handler executes.
>
> [At this point, the signal is not pending, but it is present in the
> signal mask.]
>
> 4. The signal handler returns.
>
> 5. Control passes to the signal trampoline, which calls
>    sigreturn(2).
>
> 6. When sigreturn(2) is called, the kernel once more has control
>    and restored various pieces of process state (e.g., the signal
>    mask) to the values they had before the signal handler was
>    invoked.
>
> 7. At completion of the sigreturn(2) system call, the kernel passes control
>    back to the user-space program with execution recommencing at the
>    point where the main program was interrupted by the signal handler.
>
> Now, if you interrupt things before step 4 (e.g., with exec(),
> swapcontext(3), or possibly--depending on how sigsetjmp() was
> called--siglongjmp()), then of course the remaining steps are not
> performed.
>
> I've added a few words to the SA_NODEFER description to hopefully
> further clarify what it does:
>
>        SA_NODEFER
>               Do not prevent the signal from being received  from  within
>               its  own signal handler (i.e., do not add the signal to the
>               thread's signal mask while the handler is executing).
>
> Probably, the signal(7) manual page could say more about all of this.
> What would you think of the following text to be added to that page
>
>    Execution of signal handlers
>        Whenever  there is a transition from kernel-mode to user-mode exe‐
>        cution (e.g., on return from a system  call  or  scheduling  of  a
>        thread onto the CPU), the kernel checks whether there is a pending
>        signal for which the process has established a signal handler.  If
>        there is such a pending signal, the following steps occur:
>
>        1. The  kernel performs the necessary preparatory steps for execu‐
>           tion of the signal handler:
>
>           a) The signal is removed from the set of pending signals.
>
>           b) If the thread has defined an alternate signal  stack  (using
>              sigaltstack(2)), then that stack is installed.
>
>           c) Various  pieces  of  signal-related context are saved into a
>              "hidden" frame that is created on the stack.  The saved  in‐
>              formation includes:
>
>              + the  program  counter  register  (i.e., the address of the
>                next instruction in the main program that should  be  exe‐
>                cuted when the signal handler returns);
>
>              + the thread's current signal mask;
>
>              + the thread's alternate signal stack settings.
>
>           d) The  thread's  signal  mask is adjusted by adding the signal
>              (unless the handler was  established  using  the  SA_NODEFER
>              flag)  as  well  as  any  additional  signals  specified  in
>              act->sa_mask when sigaction(2) was called.
>
>        2. The kernel constructs a frame for the  signal  handler  on  the
>           stack.  Within that frame, the return address points to a piece
>           of user-space code called the signal trampoline  (described  in
>           sigreturn(2)).
>
>        3. The  kernel  passes control back to user-space, where execution
>           commences at the start of the signal handler function.
>
>        4. When the signal handler returns, control passes to  the  signal
>           trampoline code.
>
>        5. The  signal  trampoline  calls sigreturn(2), a system call that
>           uses the information in the "hidden" stack frame to restore the
>           thread's  signal  mask  and  alternate  stack settings to their
>           state before the signal handler was called.  Upon completion of
>           the  call to sigreturn(2), the kernel transfers control back to
>           user space, and the thread recommences execution at  the  point
>           where it was interrupted by the signal handler.
>
>        Note  that if the signal handler does not return (e.g., control is
>        transferred out of the  handler  using  sigsetjmp(3)  or  swapcon‐
>        text(3),  or  the  handler executes a new program with execve(2)),
>        then the final step is not performed.  In particular, in such sce‐
>        narios it is the programmer's responsibility to restore that state
>        of the signal mask, if that is desired.

Hello Michael,

this text is very helpful.

"Signal mask and pending signals" already mentions that the signal mask
controls the blocking of signals. But maybe you could reiterate this in
1d) and in the note below 5).

Best regards

Heinrich

^ permalink raw reply	[flat|nested] 14+ messages in thread

* Re: sigaction.2: clarification for SA_NODEFER needed
  2020-11-12 16:25   ` Heinrich Schuchardt
@ 2020-11-12 20:45     ` Michael Kerrisk (man-pages)
  2020-11-12 20:57       ` Heinrich Schuchardt
  0 siblings, 1 reply; 14+ messages in thread
From: Michael Kerrisk (man-pages) @ 2020-11-12 20:45 UTC (permalink / raw)
  To: Heinrich Schuchardt, linux-man; +Cc: mtk.manpages

On 11/12/20 5:25 PM, Heinrich Schuchardt wrote:

[...]

> Hello Michael,
> 
> this text is very helpful.
> 
> "Signal mask and pending signals" already mentions that the signal mask
> controls the blocking of signals. But maybe you could reiterate this in
> 1d) and in the note below 5).

Yes, that perhaps does not hurt. Some light tweaks:

   Execution of signal handlers
       Whenever  there is a transition from kernel-mode to user-mode exe‐
       cution (e.g., on return from a system  call  or  scheduling  of  a
       thread onto the CPU), the kernel checks whether there is a pending
       signal for which the process has established a signal handler.  If
       there is such a pending signal, the following steps occur:

       1. The  kernel performs the necessary preparatory steps for execu‐
          tion of the signal handler:

          a) The signal is removed from the set of pending signals.

          b) If the thread has defined an alternate signal  stack  (using
             sigaltstack(2)), then that stack is installed.

          c) Various  pieces  of  signal-related context are saved into a
             "hidden" frame that is created on the stack.  The saved  in‐
             formation includes:

             + the  program  counter  register  (i.e., the address of the
               next instruction in the main program that should  be  exe‐
               cuted when the signal handler returns);

             + the thread's current signal mask;

             + the thread's alternate signal stack settings.

          d) The  thread's  signal  mask is adjusted by adding the signal
             (unless the handler was  established  using  the  SA_NODEFER
             flag)  as  well  as  any  additional  signals  specified  in
             act->sa_mask when sigaction(2) was  called.   These  signals
             are thus blocked while the handler executes.

       2. The  kernel  constructs  a  frame for the signal handler on the
          stack.  Within that frame, the return address points to a piece
          of  user-space  code called the signal trampoline (described in
          sigreturn(2)).

       3. The kernel passes control back to user-space,  where  execution
          commences at the start of the signal handler function.

       4. When  the  signal handler returns, control passes to the signal
          trampoline code.

       5. The signal trampoline calls sigreturn(2), a  system  call  that
          uses the information in the "hidden" stack frame to restore the
          thread's signal mask and  alternate  stack  settings  to  their
          state before the signal handler was called.  Upon completion of
          the call to sigreturn(2), the kernel transfers control back  to
          user  space,  and the thread recommences execution at the point
          where it was interrupted by the signal handler.

       Note that if the signal handler does not return (e.g., control  is
       transferred  out  of  the  handler  using sigsetjmp(3) or swapcon‐
       text(3), or the handler executes a new  program  with  execve(2)),
       then the final step is not performed.  In particular, in such sce‐
       narios it is the programmer's responsibility to restore the  state
       of the signal mask, if it is desired unblock the signals that were
       blocked on entry to the signal handler.

Thanks,

Michael

-- 
Michael Kerrisk
Linux man-pages maintainer; http://www.kernel.org/doc/man-pages/
Linux/UNIX System Programming Training: http://man7.org/training/

^ permalink raw reply	[flat|nested] 14+ messages in thread

* Re: sigaction.2: clarification for SA_NODEFER needed
  2020-11-12 20:45     ` Michael Kerrisk (man-pages)
@ 2020-11-12 20:57       ` Heinrich Schuchardt
  2020-11-12 21:01         ` Michael Kerrisk (man-pages)
  2020-11-16 13:21         ` Dave Martin
  0 siblings, 2 replies; 14+ messages in thread
From: Heinrich Schuchardt @ 2020-11-12 20:57 UTC (permalink / raw)
  To: Michael Kerrisk (man-pages), linux-man; +Cc: mtk.manpages

Am 12. November 2020 21:45:56 MEZ schrieb "Michael Kerrisk (man-pages)" <mtk.manpages@gmail.com>:
>On 11/12/20 5:25 PM, Heinrich Schuchardt wrote:
>
>[...]
>
>> Hello Michael,
>> 
>> this text is very helpful.
>> 
>> "Signal mask and pending signals" already mentions that the signal
>mask
>> controls the blocking of signals. But maybe you could reiterate this
>in
>> 1d) and in the note below 5).
>
>Yes, that perhaps does not hurt. Some light tweaks:
>
>   Execution of signal handlers
>     Whenever  there is a transition from kernel-mode to user-mode exe‐
>     cution (e.g., on return from a system  call  or  scheduling  of  a
>     thread onto the CPU), the kernel checks whether there is a pending
>     signal for which the process has established a signal handler.  If
>       there is such a pending signal, the following steps occur:
>
>     1. The  kernel performs the necessary preparatory steps for execu‐
>          tion of the signal handler:
>
>          a) The signal is removed from the set of pending signals.
>
>        b) If the thread has defined an alternate signal  stack  (using
>             sigaltstack(2)), then that stack is installed.
>
>        c) Various  pieces  of  signal-related context are saved into a
>           "hidden" frame that is created on the stack.  The saved  in‐
>             formation includes:
>
>           + the  program  counter  register  (i.e., the address of the
>             next instruction in the main program that should  be  exe‐
>               cuted when the signal handler returns);
>
>             + the thread's current signal mask;
>
>             + the thread's alternate signal stack settings.
>
>        d) The  thread's  signal  mask is adjusted by adding the signal
>           (unless the handler was  established  using  the  SA_NODEFER
>           flag)  as  well  as  any  additional  signals  specified  in
>           act->sa_mask when sigaction(2) was  called.   These  signals
>             are thus blocked while the handler executes.
>
>     2. The  kernel  constructs  a  frame for the signal handler on the
>        stack.  Within that frame, the return address points to a piece
>        of  user-space  code called the signal trampoline (described in
>          sigreturn(2)).
>
>     3. The kernel passes control back to user-space,  where  execution
>          commences at the start of the signal handler function.
>
>     4. When  the  signal handler returns, control passes to the signal
>          trampoline code.
>
>     5. The signal trampoline calls sigreturn(2), a  system  call  that
>        uses the information in the "hidden" stack frame to restore the
>        thread's signal mask and  alternate  stack  settings  to  their
>        state before the signal handler was called.  Upon completion of
>        the call to sigreturn(2), the kernel transfers control back  to
>        user  space,  and the thread recommences execution at the point
>          where it was interrupted by the signal handler.
>
>     Note that if the signal handler does not return (e.g., control  is
>     transferred  out  of  the  handler  using sigsetjmp(3) or swapcon‐
>     text(3), or the handler executes a new  program  with  execve(2)),
>     then the final step is not performed.  In particular, in such sce‐
>     narios it is the programmer's responsibility to restore the  state
>     of the signal mask, if it is desired unblock the signals that were
>       blocked on entry to the signal handler.

Is there a function to change the signal mask without leaving the handler?

Otherwise looks good to me.

Best regards

Heinrich

>
>Thanks,
>
>Michael


^ permalink raw reply	[flat|nested] 14+ messages in thread

* Re: sigaction.2: clarification for SA_NODEFER needed
  2020-11-12 20:57       ` Heinrich Schuchardt
@ 2020-11-12 21:01         ` Michael Kerrisk (man-pages)
  2020-11-12 21:04           ` Heinrich Schuchardt
  2020-11-16 13:21         ` Dave Martin
  1 sibling, 1 reply; 14+ messages in thread
From: Michael Kerrisk (man-pages) @ 2020-11-12 21:01 UTC (permalink / raw)
  To: Heinrich Schuchardt, linux-man; +Cc: mtk.manpages

> Is there a function to change the signal mask without leaving the handler?

sigprocmask(2).

Thanks,

Michael

-- 
Michael Kerrisk
Linux man-pages maintainer; http://www.kernel.org/doc/man-pages/
Linux/UNIX System Programming Training: http://man7.org/training/

^ permalink raw reply	[flat|nested] 14+ messages in thread

* Re: sigaction.2: clarification for SA_NODEFER needed
  2020-11-12 21:01         ` Michael Kerrisk (man-pages)
@ 2020-11-12 21:04           ` Heinrich Schuchardt
  2020-11-12 21:17             ` Michael Kerrisk (man-pages)
  2020-11-16 13:29             ` Dave Martin
  0 siblings, 2 replies; 14+ messages in thread
From: Heinrich Schuchardt @ 2020-11-12 21:04 UTC (permalink / raw)
  To: Michael Kerrisk (man-pages), linux-man; +Cc: mtk.manpages

Am 12. November 2020 22:01:58 MEZ schrieb "Michael Kerrisk (man-pages)" <mtk.manpages@gmail.com>:
>> Is there a function to change the signal mask without leaving the
>handler?
>
>sigprocmask(2).

You might want to add a link to the function in the note section.

>
>Thanks,
>
>Michael


^ permalink raw reply	[flat|nested] 14+ messages in thread

* Re: sigaction.2: clarification for SA_NODEFER needed
  2020-11-12 21:04           ` Heinrich Schuchardt
@ 2020-11-12 21:17             ` Michael Kerrisk (man-pages)
  2020-11-16 13:29             ` Dave Martin
  1 sibling, 0 replies; 14+ messages in thread
From: Michael Kerrisk (man-pages) @ 2020-11-12 21:17 UTC (permalink / raw)
  To: Heinrich Schuchardt, linux-man; +Cc: mtk.manpages

On 11/12/20 10:04 PM, Heinrich Schuchardt wrote:
> Am 12. November 2020 22:01:58 MEZ schrieb "Michael Kerrisk (man-pages)" <mtk.manpages@gmail.com>:
>>> Is there a function to change the signal mask without leaving the
>> handler?
>>
>> sigprocmask(2).
> 
> You might want to add a link to the function in the note section.
It's already mentioned in the subsection "Signal mask and
pending signals". I think that's probably enough.

Thanks,

Michael

-- 
Michael Kerrisk
Linux man-pages maintainer; http://www.kernel.org/doc/man-pages/
Linux/UNIX System Programming Training: http://man7.org/training/

^ permalink raw reply	[flat|nested] 14+ messages in thread

* Re: sigaction.2: clarification for SA_NODEFER needed
  2020-11-12 21:04           ` Heinrich Schuchardt
  2020-11-12 21:17             ` Michael Kerrisk (man-pages)
@ 2020-11-16 13:29             ` Dave Martin
  1 sibling, 0 replies; 14+ messages in thread
From: Dave Martin @ 2020-11-16 13:29 UTC (permalink / raw)
  To: Heinrich Schuchardt; +Cc: Michael Kerrisk (man-pages), linux-man

On Thu, Nov 12, 2020 at 10:04:08PM +0100, Heinrich Schuchardt wrote:
> Am 12. November 2020 22:01:58 MEZ schrieb "Michael Kerrisk (man-pages)" <mtk.manpages@gmail.com>:
> >> Is there a function to change the signal mask without leaving the
> >handler?
> >
> >sigprocmask(2).
> 
> You might want to add a link to the function in the note section.

Actually, this is best avoided IMHO:

The behaviour of sigprocmask() is unspecified in multithreaded programs,
while pthread_sigmask() is not specified to be safe in signal handlers.
(Yay, POSIX.)

For these reasons, execve()'ing directly from a signal handler is not a
great idea.  It would probably be better to escape from the signal
handler with siglongjmp() or setcontext(), with the target sigjmp_buf or
ucontext previously set up do the execve().

With SA_SIGINFO, you can also update uc->uc_sigmask inside the signal
handler if you want to change the signal mask on return.  But that's
awkward to do portably, since sigaddset() and sigdelset() are not
specified to be safe in signal handlers either.

Cheers
---Dave

^ permalink raw reply	[flat|nested] 14+ messages in thread

* Re: sigaction.2: clarification for SA_NODEFER needed
  2020-11-12 20:57       ` Heinrich Schuchardt
  2020-11-12 21:01         ` Michael Kerrisk (man-pages)
@ 2020-11-16 13:21         ` Dave Martin
  2020-11-17 10:46           ` Michael Kerrisk (man-pages)
  1 sibling, 1 reply; 14+ messages in thread
From: Dave Martin @ 2020-11-16 13:21 UTC (permalink / raw)
  To: Heinrich Schuchardt; +Cc: Michael Kerrisk (man-pages), linux-man

On Thu, Nov 12, 2020 at 09:57:35PM +0100, Heinrich Schuchardt wrote:
> Am 12. November 2020 21:45:56 MEZ schrieb "Michael Kerrisk (man-pages)" <mtk.manpages@gmail.com>:
> >On 11/12/20 5:25 PM, Heinrich Schuchardt wrote:
> >
> >[...]
> >
> >> Hello Michael,
> >> 
> >> this text is very helpful.
> >> 
> >> "Signal mask and pending signals" already mentions that the signal
> >mask
> >> controls the blocking of signals. But maybe you could reiterate this
> >in
> >> 1d) and in the note below 5).
> >
> >Yes, that perhaps does not hurt. Some light tweaks:
> >
> >   Execution of signal handlers
> >     Whenever  there is a transition from kernel-mode to user-mode exe‐
> >     cution (e.g., on return from a system  call  or  scheduling  of  a
> >     thread onto the CPU), the kernel checks whether there is a pending
> >     signal for which the process has established a signal handler.  If
> >       there is such a pending signal, the following steps occur:
> >
> >     1. The  kernel performs the necessary preparatory steps for execu‐
> >          tion of the signal handler:
> >
> >          a) The signal is removed from the set of pending signals.
> >
> >        b) If the thread has defined an alternate signal  stack  (using
> >             sigaltstack(2)), then that stack is installed.
> >
> >        c) Various  pieces  of  signal-related context are saved into a
> >           "hidden" frame that is created on the stack.  The saved  in‐
> >             formation includes:

Can we delete "hidden" here?  (In a sense it's actually less hidden than
a typical compiler function frame, since we do provide an explicit
interface for poking about in the signal frame -- you can't do that with
function frames).

> >
> >           + the  program  counter  register  (i.e., the address of the
> >             next instruction in the main program that should  be  exe‐
> >               cuted when the signal handler returns);

You might also want to add something like:

"Architecture-specific register state required for resuming the
interrupted program."

> >
> >             + the thread's current signal mask;
> >
> >             + the thread's alternate signal stack settings.
> >
> >        d) The  thread's  signal  mask is adjusted by adding the signal
> >           (unless the handler was  established  using  the  SA_NODEFER
> >           flag)  as  well  as  any  additional  signals  specified  in
> >           act->sa_mask when sigaction(2) was  called.   These  signals
> >             are thus blocked while the handler executes.

I'd delete "adjusted" since it adds nothing to the meaning.

Would this also be more readable if the logic is flipped around:

--8<--

Any signals specified in act->sa_mask when registering the handler are
added to the thread's signal mask.  The signal being delivered is also
added to the signal mask, unless SA_NODEFER was specified when
registering the handler.

-->8--

> >
> >     2. The  kernel  constructs  a  frame for the signal handler on the
> >        stack.  Within that frame, the return address points to a piece
> >        of  user-space  code called the signal trampoline (described in
> >          sigreturn(2)).

Not all architectures put the function return information on the stack.

The kernel has to explicitly fix up the pc to run the signal handler
here -- it doesn't happen by magic.  So maybe say for (2):

--8<--

The kernel sets the program counter for the thread to point to the first
instruction of the signal handler, and configures the return address for
this function to point to a piece of user-space code called the signal
trampoline [...].

-->8--

> >
> >     3. The kernel passes control back to user-space,  where  execution
> >          commences at the start of the signal handler function.
> >
> >     4. When  the  signal handler returns, control passes to the signal
> >          trampoline code.
> >
> >     5. The signal trampoline calls sigreturn(2), a  system  call  that
> >        uses the information in the "hidden" stack frame to restore the
> >        thread's signal mask and  alternate  stack  settings  to  their
> >        state before the signal handler was called.  Upon completion of
> >        the call to sigreturn(2), the kernel transfers control back  to
> >        user  space,  and the thread recommences execution at the point
> >          where it was interrupted by the signal handler.
> >
> >     Note that if the signal handler does not return (e.g., control  is
> >     transferred  out  of  the  handler  using sigsetjmp(3) or swapcon‐

siglongjmp(), not sigsetjmp().

> >     text(3), or the handler executes a new  program  with  execve(2)),
> >     then the final step is not performed.  In particular, in such sce‐
> >     narios it is the programmer's responsibility to restore the  state
> >     of the signal mask, if it is desired unblock the signals that were
> >       blocked on entry to the signal handler.

I'm pretty sure sigsetjmp(), and probably setcontext(), _do_ restore the
signal mask.

[...]

Cheers
---Dave

^ permalink raw reply	[flat|nested] 14+ messages in thread

* Re: sigaction.2: clarification for SA_NODEFER needed
  2020-11-16 13:21         ` Dave Martin
@ 2020-11-17 10:46           ` Michael Kerrisk (man-pages)
  2020-11-17 12:05             ` Dave Martin
  0 siblings, 1 reply; 14+ messages in thread
From: Michael Kerrisk (man-pages) @ 2020-11-17 10:46 UTC (permalink / raw)
  To: Dave Martin, Heinrich Schuchardt; +Cc: mtk.manpages, linux-man

Hi Dave,

Thanks a heap for taking a look at the text!

On 11/16/20 2:21 PM, Dave Martin wrote:
> On Thu, Nov 12, 2020 at 09:57:35PM +0100, Heinrich Schuchardt wrote:
>> Am 12. November 2020 21:45:56 MEZ schrieb "Michael Kerrisk (man-pages)" <mtk.manpages@gmail.com>:
>>> On 11/12/20 5:25 PM, Heinrich Schuchardt wrote:
>>>
>>> [...]
>>>
>>>> Hello Michael,
>>>>
>>>> this text is very helpful.
>>>>
>>>> "Signal mask and pending signals" already mentions that the signal
>>> mask
>>>> controls the blocking of signals. But maybe you could reiterate this
>>> in
>>>> 1d) and in the note below 5).
>>>
>>> Yes, that perhaps does not hurt. Some light tweaks:
>>>
>>>   Execution of signal handlers
>>>     Whenever  there is a transition from kernel-mode to user-mode exe‐
>>>     cution (e.g., on return from a system  call  or  scheduling  of  a
>>>     thread onto the CPU), the kernel checks whether there is a pending
>>>     signal for which the process has established a signal handler.  If
>>>       there is such a pending signal, the following steps occur:
>>>
>>>     1. The  kernel performs the necessary preparatory steps for execu‐
>>>          tion of the signal handler:
>>>
>>>          a) The signal is removed from the set of pending signals.
>>>
>>>        b) If the thread has defined an alternate signal  stack  (using
>>>             sigaltstack(2)), then that stack is installed.
>>>
>>>        c) Various  pieces  of  signal-related context are saved into a
>>>           "hidden" frame that is created on the stack.  The saved  in‐
>>>             formation includes:
> 
> Can we delete "hidden" here?  (In a sense it's actually less hidden than
> a typical compiler function frame, since we do provide an explicit
> interface for poking about in the signal frame -- you can't do that with
> function frames).

Yes, fair enough. I removed "hidden".

>>>           + the  program  counter  register  (i.e., the address of the
>>>             next instruction in the main program that should  be  exe‐
>>>               cuted when the signal handler returns);
> 
> You might also want to add something like:
> 
> "Architecture-specific register state required for resuming the
> interrupted program."

Added.

>>>             + the thread's current signal mask;
>>>
>>>             + the thread's alternate signal stack settings.
>>>
>>>        d) The  thread's  signal  mask is adjusted by adding the signal
>>>           (unless the handler was  established  using  the  SA_NODEFER
>>>           flag)  as  well  as  any  additional  signals  specified  in
>>>           act->sa_mask when sigaction(2) was  called.   These  signals
>>>             are thus blocked while the handler executes.
> 
> I'd delete "adjusted" since it adds nothing to the meaning.
> 
> Would this also be more readable if the logic is flipped around:

Well, ummmm, yes it would.

> --8<--
> 
> Any signals specified in act->sa_mask when registering the handler are
> added to the thread's signal mask.  The signal being delivered is also
> added to the signal mask, unless SA_NODEFER was specified when
> registering the handler.

Thanks. Adjusted pretty much as you wrote it.

> -->8--
> 
>>>
>>>     2. The  kernel  constructs  a  frame for the signal handler on the
>>>        stack.  Within that frame, the return address points to a piece
>>>        of  user-space  code called the signal trampoline (described in
>>>          sigreturn(2)).
> 
> Not all architectures put the function return information on the stack.
> 
> The kernel has to explicitly fix up the pc to run the signal handler
> here -- it doesn't happen by magic.  So maybe say for (2):
> 
> --8<--
> 
> The kernel sets the program counter for the thread to point to the first
> instruction of the signal handler, and configures the return address for
> this function to point to a piece of user-space code called the signal
> trampoline [...].

Thanks. Changed pretty much as you suggest.

> -->8--
> 
>>>
>>>     3. The kernel passes control back to user-space,  where  execution
>>>          commences at the start of the signal handler function.
>>>
>>>     4. When  the  signal handler returns, control passes to the signal
>>>          trampoline code.
>>>
>>>     5. The signal trampoline calls sigreturn(2), a  system  call  that
>>>        uses the information in the "hidden" stack frame to restore the
>>>        thread's signal mask and  alternate  stack  settings  to  their
>>>        state before the signal handler was called.  Upon completion of
>>>        the call to sigreturn(2), the kernel transfers control back  to
>>>        user  space,  and the thread recommences execution at the point
>>>          where it was interrupted by the signal handler.
>>>
>>>     Note that if the signal handler does not return (e.g., control  is
>>>     transferred  out  of  the  handler  using sigsetjmp(3) or swapcon‐
> 
> siglongjmp(), not sigsetjmp().

Yep, I spotted that one already and fixed it.

>>>     text(3), or the handler executes a new  program  with  execve(2)),
>>>     then the final step is not performed.  In particular, in such sce‐
>>>     narios it is the programmer's responsibility to restore the  state
>>>     of the signal mask, if it is desired unblock the signals that were
>>>       blocked on entry to the signal handler.
> 
> I'm pretty sure sigsetjmp(), and probably setcontext(), _do_ restore the
> signal mask.

I'd already made adjustment here to note that siglongjmp() may or may
not restore the signal mask. (See below.)

And yes, you are right that those APIs restore the signal mask.
I think I got confused because, as far as I know, swapcontext()
and setcontext() do not restore the alternate signal stack settings. 
(There is no call to sigaltstack() in the glibc implementations, 
nor to sigreturn()--at least not on most implementations.) 
I'm going to skirt the issue by dropping mention of *context().)

Combining your other reply here:

>>>> Is there a function to change the signal mask without leaving the
>>> handler?
>>>
>>> sigprocmask(2).
>>
>> You might want to add a link to the function in the note section.
> 
> Actually, this is best avoided IMHO:
> 
> The behaviour of sigprocmask() is unspecified in multithreaded programs,
> while pthread_sigmask() is not specified to be safe in signal handlers.

I'm not sure I agree. sigprocmask() is explicitly specified as being 
async-signal-safe, which suggests that POSIX blesses its use, at least
in single-threaded programs. And notwithstanding what POSIX says, 
sigprocmask() is safe on Linux/glibc in a MT process (since
pthread_sigmask() is just a simple wrapper for sigprocmask()), and
I'd guess the same is true on many (most?) other implementations as 
well.

> (Yay, POSIX.)
> 
> For these reasons, execve()'ing directly from a signal handler is not a
> great idea.  It would probably be better to escape from the signal
> handler with siglongjmp() or setcontext(), with the target sigjmp_buf or
> ucontext previously set up do the execve().

Well, setcontext() is no longer in POSIX.... (It was removed in
POSIX.1-2008.) And the specification of longjmp() says:

       It is recommended that applications do not call longjmp() or  sig‐
       longjmp()  from  signal handlers. To avoid undefined behavior when
       calling these functions from a  signal  handler,  the  application
       needs to ensure one of the following two things:

        1. After  the  call to longjmp() or siglongjmp() the process only
           calls async-signal-safe functions and does not return from the
           initial call to main().

        2. Any  signal  whose  handler calls longjmp() or siglongjmp() is
           blocked during every call to a non-async-signal-safe function,
           and  no  such  calls are made after returning from the initial
           call to main().

So, in my reading of it, you're no better off than calling
sigprocmask() from the signal handler. Do you agree?

> With SA_SIGINFO, you can also update uc->uc_sigmask inside the signal
> handler if you want to change the signal mask on return.  But that's
> awkward to do portably, since sigaddset() and sigdelset() are not
> specified to be safe in signal handlers either.

I think you've misremembered here. At least as far back as 
POSIX.1-2001, sigaddset() and sigdelset() are specified as
async-signal-safe.

By now, the text has evolved to:

[[
   Execution of signal handlers
       Whenever there is a transition from kernel-mode to user-mode  exe‐
       cution  (e.g.,  on  return  from  a system call or scheduling of a
       thread onto the CPU), the kernel checks whether there is a pending
       signal for which the process has established a signal handler.  If
       there is such a pending signal, the following steps occur:

       1. The kernel performs the necessary preparatory steps for  execu‐
          tion of the signal handler:

          a) The signal is removed from the set of pending signals.

          b) If  the  signal  handler  was  installed by a call to sigac‐
             tion(2) that specified the SA_ONSTACK flag  and  the  thread
             has   defined  an  alternate  signal  stack  (using  sigalt‐
             stack(2)), then that stack is installed.

          c) Various pieces of signal-related context are  saved  into  a
             special  frame  that is created on the stack.  The saved in‐
             formation includes:

             + the program counter register (i.e.,  the  address  of  the
               next  instruction  in the main program that should be exe‐
               cuted when the signal handler returns);

             + architecture-specific register state required for resuming
               the interrupted program;

             + the thread's current signal mask;

             + the thread's alternate signal stack settings.

          d) Any  signals  specified in act->sa_mask when registering the
             handler with sigprocmask(2) are added to the thread's signal
             mask.   The signal being delivered is also added to the sig‐
             nal mask, unless SA_NODEFER was specified  when  registering
             the  handler.  These signals are thus blocked while the han‐
             dler executes.

       2. The kernel constructs a frame for the  signal  handler  on  the
          stack.   The  kernel sets the program counter for the thread to
          point to the first instruction of the signal handler  function,
          and configures the return address for that function to point to
          a piece of user-space code known as the signal trampoline  (de‐
          scribed in sigreturn(2)).

       3. The  kernel  passes control back to user-space, where execution
          commences at the start of the signal handler function.

       4. When the signal handler returns, control passes to  the  signal
          trampoline code.

       5. The  signal  trampoline  calls sigreturn(2), a system call that
          uses the information in the stack frame created in  step  1  to
          restore  the  thread's signal mask and alternate stack settings
          to their state before the signal handler was called.  Upon com‐
          pletion  of the call to sigreturn(2), the kernel transfers con‐
          trol back to user space, and the thread  recommences  execution
          at the point where it was interrupted by the signal handler.

       Note  that if the signal handler does not return (e.g., control is
       transferred out of the handler using siglongjmp(3), or the handler
       executes a new program with execve(2)), then the final step is not
       performed.  In particular, in such scenarios it  is  the  program‐
       mer's  responsibility to restore the state of the signal mask (us‐
       ing sigprocmask(2)), if it is desired to unblock the signals  that
       were  blocked  on  entry  to  the signal handler.  (Note that sig‐
       longjmp(3) may or may not restore the signal  mask,  depending  on
       the savesigs value that was specified in the corresponding call to
       sigsetjmp(3).)
]]

Thanks,

Michael


-- 
Michael Kerrisk
Linux man-pages maintainer; http://www.kernel.org/doc/man-pages/
Linux/UNIX System Programming Training: http://man7.org/training/

^ permalink raw reply	[flat|nested] 14+ messages in thread

* Re: sigaction.2: clarification for SA_NODEFER needed
  2020-11-17 10:46           ` Michael Kerrisk (man-pages)
@ 2020-11-17 12:05             ` Dave Martin
  2020-11-17 17:09               ` Michael Kerrisk (man-pages)
  0 siblings, 1 reply; 14+ messages in thread
From: Dave Martin @ 2020-11-17 12:05 UTC (permalink / raw)
  To: Michael Kerrisk (man-pages); +Cc: Heinrich Schuchardt, linux-man

On Tue, Nov 17, 2020 at 10:46:11AM +0000, Michael Kerrisk (man-pages) wrote:
> Hi Dave,
> 
> Thanks a heap for taking a look at the text!
> 
> On 11/16/20 2:21 PM, Dave Martin wrote:
> > On Thu, Nov 12, 2020 at 09:57:35PM +0100, Heinrich Schuchardt wrote:
> >> Am 12. November 2020 21:45:56 MEZ schrieb "Michael Kerrisk (man-pages)" <mtk.manpages@gmail.com>:
> >>> On 11/12/20 5:25 PM, Heinrich Schuchardt wrote:
> >>>
> >>> [...]
> >>>
> >>>> Hello Michael,
> >>>>
> >>>> this text is very helpful.
> >>>>
> >>>> "Signal mask and pending signals" already mentions that the signal
> >>> mask
> >>>> controls the blocking of signals. But maybe you could reiterate this
> >>> in
> >>>> 1d) and in the note below 5).
> >>>
> >>> Yes, that perhaps does not hurt. Some light tweaks:
> >>>
> >>>   Execution of signal handlers
> >>>     Whenever  there is a transition from kernel-mode to user-mode exe‐
> >>>     cution (e.g., on return from a system  call  or  scheduling  of  a
> >>>     thread onto the CPU), the kernel checks whether there is a pending
> >>>     signal for which the process has established a signal handler.  If
> >>>       there is such a pending signal, the following steps occur:
> >>>
> >>>     1. The  kernel performs the necessary preparatory steps for execu‐
> >>>          tion of the signal handler:
> >>>
> >>>          a) The signal is removed from the set of pending signals.
> >>>
> >>>        b) If the thread has defined an alternate signal  stack  (using
> >>>             sigaltstack(2)), then that stack is installed.
> >>>
> >>>        c) Various  pieces  of  signal-related context are saved into a
> >>>           "hidden" frame that is created on the stack.  The saved  in‐
> >>>             formation includes:
> > 
> > Can we delete "hidden" here?  (In a sense it's actually less hidden than
> > a typical compiler function frame, since we do provide an explicit
> > interface for poking about in the signal frame -- you can't do that with
> > function frames).
> 
> Yes, fair enough. I removed "hidden".
> 
> >>>           + the  program  counter  register  (i.e., the address of the
> >>>             next instruction in the main program that should  be  exe‐
> >>>               cuted when the signal handler returns);
> > 
> > You might also want to add something like:
> > 
> > "Architecture-specific register state required for resuming the
> > interrupted program."
> 
> Added.
> 
> >>>             + the thread's current signal mask;
> >>>
> >>>             + the thread's alternate signal stack settings.
> >>>
> >>>        d) The  thread's  signal  mask is adjusted by adding the signal
> >>>           (unless the handler was  established  using  the  SA_NODEFER
> >>>           flag)  as  well  as  any  additional  signals  specified  in
> >>>           act->sa_mask when sigaction(2) was  called.   These  signals
> >>>             are thus blocked while the handler executes.
> > 
> > I'd delete "adjusted" since it adds nothing to the meaning.
> > 
> > Would this also be more readable if the logic is flipped around:
> 
> Well, ummmm, yes it would.
> 
> > --8<--
> > 
> > Any signals specified in act->sa_mask when registering the handler are
> > added to the thread's signal mask.  The signal being delivered is also
> > added to the signal mask, unless SA_NODEFER was specified when
> > registering the handler.
> 
> Thanks. Adjusted pretty much as you wrote it.
> 
> > -->8--
> > 
> >>>
> >>>     2. The  kernel  constructs  a  frame for the signal handler on the
> >>>        stack.  Within that frame, the return address points to a piece
> >>>        of  user-space  code called the signal trampoline (described in
> >>>          sigreturn(2)).
> > 
> > Not all architectures put the function return information on the stack.
> > 
> > The kernel has to explicitly fix up the pc to run the signal handler
> > here -- it doesn't happen by magic.  So maybe say for (2):
> > 
> > --8<--
> > 
> > The kernel sets the program counter for the thread to point to the first
> > instruction of the signal handler, and configures the return address for
> > this function to point to a piece of user-space code called the signal
> > trampoline [...].
> 
> Thanks. Changed pretty much as you suggest.
> 
> > -->8--
> > 
> >>>
> >>>     3. The kernel passes control back to user-space,  where  execution
> >>>          commences at the start of the signal handler function.
> >>>
> >>>     4. When  the  signal handler returns, control passes to the signal
> >>>          trampoline code.
> >>>
> >>>     5. The signal trampoline calls sigreturn(2), a  system  call  that
> >>>        uses the information in the "hidden" stack frame to restore the
> >>>        thread's signal mask and  alternate  stack  settings  to  their
> >>>        state before the signal handler was called.  Upon completion of
> >>>        the call to sigreturn(2), the kernel transfers control back  to
> >>>        user  space,  and the thread recommences execution at the point
> >>>          where it was interrupted by the signal handler.
> >>>
> >>>     Note that if the signal handler does not return (e.g., control  is
> >>>     transferred  out  of  the  handler  using sigsetjmp(3) or swapcon‐
> > 
> > siglongjmp(), not sigsetjmp().
> 
> Yep, I spotted that one already and fixed it.
> 
> >>>     text(3), or the handler executes a new  program  with  execve(2)),
> >>>     then the final step is not performed.  In particular, in such sce‐
> >>>     narios it is the programmer's responsibility to restore the  state
> >>>     of the signal mask, if it is desired unblock the signals that were
> >>>       blocked on entry to the signal handler.
> > 
> > I'm pretty sure sigsetjmp(), and probably setcontext(), _do_ restore the
> > signal mask.
> 
> I'd already made adjustment here to note that siglongjmp() may or may
> not restore the signal mask. (See below.)
> 
> And yes, you are right that those APIs restore the signal mask.
> I think I got confused because, as far as I know, swapcontext()
> and setcontext() do not restore the alternate signal stack settings. 
> (There is no call to sigaltstack() in the glibc implementations, 
> nor to sigreturn()--at least not on most implementations.) 
> I'm going to skirt the issue by dropping mention of *context().)
> 
> Combining your other reply here:
> 
> >>>> Is there a function to change the signal mask without leaving the
> >>> handler?
> >>>
> >>> sigprocmask(2).
> >>
> >> You might want to add a link to the function in the note section.
> > 
> > Actually, this is best avoided IMHO:
> > 
> > The behaviour of sigprocmask() is unspecified in multithreaded programs,
> > while pthread_sigmask() is not specified to be safe in signal handlers.
> 
> I'm not sure I agree. sigprocmask() is explicitly specified as being 
> async-signal-safe, which suggests that POSIX blesses its use, at least
> in single-threaded programs. And notwithstanding what POSIX says, 
> sigprocmask() is safe on Linux/glibc in a MT process (since
> pthread_sigmask() is just a simple wrapper for sigprocmask()), and
> I'd guess the same is true on many (most?) other implementations as 
> well.

I don't disagree that sigprocmask() is likely to work in practice for
most purposes, but I wonder whether it could have unexpected effects on
the masking of the libc internal signals in some implementations,
particulary when using SIG_SETMASK.

If trying to execve() out of the signal handler, I think there would be
a strong temptation to restore the signal mask the program had on entry
with

	sigprocmask(SIG_SETMASK, &orig_mask, NULL);

say.  If the original signal was taken while in the middle of libc while
some internal signal was blocked then this would unintentionally unblock
that signal, and deadlocks and other badness may happen on return.

In theory pthread_sigmask() could defend against this, but I don't know
whether it actually does in any implementations.


So, IIUC you really must not return after doing something like this
(certainly if you want to be at all portable).

Trying to do asynchronous context switching using swapconxtext() would
fall foul of this (and plenty else).


> > (Yay, POSIX.)
> > 
> > For these reasons, execve()'ing directly from a signal handler is not a
> > great idea.  It would probably be better to escape from the signal
> > handler with siglongjmp() or setcontext(), with the target sigjmp_buf or
> > ucontext previously set up do the execve().
> 
> Well, setcontext() is no longer in POSIX.... (It was removed in

Well, yes.

> POSIX.1-2008.) And the specification of longjmp() says:
> 
>        It is recommended that applications do not call longjmp() or  sig‐
>        longjmp()  from  signal handlers. To avoid undefined behavior when
>        calling these functions from a  signal  handler,  the  application
>        needs to ensure one of the following two things:
> 
>         1. After  the  call to longjmp() or siglongjmp() the process only
>            calls async-signal-safe functions and does not return from the
>            initial call to main().
> 
>         2. Any  signal  whose  handler calls longjmp() or siglongjmp() is
>            blocked during every call to a non-async-signal-safe function,
>            and  no  such  calls are made after returning from the initial
>            call to main().

i.e., basically the same constraints you have to follow if you want to
achieve the same result safely from _within_ the signal handler.

But I take your point: my claim that using siglongjmp() is a better
approach was overstated.  And it's easy to forget that you're still in
signal-handler-like context even after siglongjmp().

> So, in my reading of it, you're no better off than calling
> sigprocmask() from the signal handler. Do you agree?

Yes, agreed.

(The background to my comments in this area is that I've learned from
experience that messing with the signal mask inside a signal handler
tends to create more problems than it solves -- but that doesn't mean
that there are no situations where it is legitimate.)

> > With SA_SIGINFO, you can also update uc->uc_sigmask inside the signal
> > handler if you want to change the signal mask on return.  But that's
> > awkward to do portably, since sigaddset() and sigdelset() are not
> > specified to be safe in signal handlers either.
> 
> I think you've misremembered here. At least as far back as 
> POSIX.1-2001, sigaddset() and sigdelset() are specified as
> async-signal-safe.

Ah, ignore me.  I think I was confusing these functions with something
else here -- yes, they are specified as async-signal-safe in all recent
versions of the standards.  (In reasonable implementations, they have
probably always been safe in practice.)

> By now, the text has evolved to:
> 
> [[
>    Execution of signal handlers
>        Whenever there is a transition from kernel-mode to user-mode  exe‐
>        cution  (e.g.,  on  return  from  a system call or scheduling of a
>        thread onto the CPU), the kernel checks whether there is a pending
>        signal for which the process has established a signal handler.  If

The signal must also be unblocked.

>        there is such a pending signal, the following steps occur:

You might want to comment on what happens if there are multiple
unblocked signals pending -- you can probably refer to signal(7) rather
than writing it all out again here.

>        1. The kernel performs the necessary preparatory steps for  execu‐
>           tion of the signal handler:
> 
>           a) The signal is removed from the set of pending signals.
> 
>           b) If  the  signal  handler  was  installed by a call to sigac‐
>              tion(2) that specified the SA_ONSTACK flag  and  the  thread
>              has   defined  an  alternate  signal  stack  (using  sigalt‐
>              stack(2)), then that stack is installed.

Actually should (b) and (c) be swapped? (c) saves the SP and stack
configuration.

> 
>           c) Various pieces of signal-related context are  saved  into  a
>              special  frame  that is created on the stack.  The saved in‐
>              formation includes:
> 
>              + the program counter register (i.e.,  the  address  of  the
>                next  instruction  in the main program that should be exe‐
>                cuted when the signal handler returns);
> 
>              + architecture-specific register state required for resuming
>                the interrupted program;
> 
>              + the thread's current signal mask;
> 
>              + the thread's alternate signal stack settings.
> 
>           d) Any  signals  specified in act->sa_mask when registering the
>              handler with sigprocmask(2) are added to the thread's signal
>              mask.   The signal being delivered is also added to the sig‐
>              nal mask, unless SA_NODEFER was specified  when  registering
>              the  handler.  These signals are thus blocked while the han‐
>              dler executes.
> 
>        2. The kernel constructs a frame for the  signal  handler  on  the
>           stack.   The  kernel sets the program counter for the thread to
>           point to the first instruction of the signal handler  function,
>           and configures the return address for that function to point to
>           a piece of user-space code known as the signal trampoline  (de‐
>           scribed in sigreturn(2)).
> 
>        3. The  kernel  passes control back to user-space, where execution
>           commences at the start of the signal handler function.
> 
>        4. When the signal handler returns, control passes to  the  signal
>           trampoline code.
> 
>        5. The  signal  trampoline  calls sigreturn(2), a system call that
>           uses the information in the stack frame created in  step  1  to
>           restore  the  thread's signal mask and alternate stack settings

Nit: and everything else too.

Would it make sense to say something like:

"to restore the thread to its state before the signal handler was
called.  The thread's signal mask and alternate signal stack settings
are also restored as part of this procedure."

>           to their state before the signal handler was called.  Upon com‐
>           pletion  of the call to sigreturn(2), the kernel transfers con‐
>           trol back to user space, and the thread  recommences  execution
>           at the point where it was interrupted by the signal handler.
> 
>        Note  that if the signal handler does not return (e.g., control is
>        transferred out of the handler using siglongjmp(3), or the handler
>        executes a new program with execve(2)), then the final step is not
>        performed.  In particular, in such scenarios it  is  the  program‐
>        mer's  responsibility to restore the state of the signal mask (us‐
>        ing sigprocmask(2)), if it is desired to unblock the signals  that
>        were  blocked  on  entry  to  the signal handler.  (Note that sig‐
>        longjmp(3) may or may not restore the signal  mask,  depending  on
>        the savesigs value that was specified in the corresponding call to
>        sigsetjmp(3).)
> ]]

Otherwise looks good to me.

To exec() straight from a signal handler still requires care though in
order to get things into a sane state for the new process, and while
avoiding the program dying in unintended ways on the way.

Doing this safely in a multithreaded program can be hard, to say the
least.


One other wrinkle that might be worth mentioning, since it has confused
me in the past:  There is no magic internal kernel state that is
different when executing a signal handler.  "Being in a signal handler"
is in fact not a meaningful concept to the kernel.  Everything is
tracked in the user registers and on the user stack.  Signal nesting is
only limited by available stack space (and sane software design...)

I'm not sure how to describe this concisely though.

Cheers
---Dave

^ permalink raw reply	[flat|nested] 14+ messages in thread

* Re: sigaction.2: clarification for SA_NODEFER needed
  2020-11-17 12:05             ` Dave Martin
@ 2020-11-17 17:09               ` Michael Kerrisk (man-pages)
  2020-11-17 18:40                 ` Dave Martin
  0 siblings, 1 reply; 14+ messages in thread
From: Michael Kerrisk (man-pages) @ 2020-11-17 17:09 UTC (permalink / raw)
  To: Dave Martin; +Cc: mtk.manpages, Heinrich Schuchardt, linux-man

Hi Dave,

On 11/17/20 1:05 PM, Dave Martin wrote:
> On Tue, Nov 17, 2020 at 10:46:11AM +0000, Michael Kerrisk (man-pages) wrote:
>> Hi Dave,
>>
>> Thanks a heap for taking a look at the text!
>>
>> On 11/16/20 2:21 PM, Dave Martin wrote:
>>> On Thu, Nov 12, 2020 at 09:57:35PM +0100, Heinrich Schuchardt wrote:
>>>> Am 12. November 2020 21:45:56 MEZ schrieb "Michael Kerrisk (man-pages)" <mtk.manpages@gmail.com>:

[...]

>>>>>> Is there a function to change the signal mask without leaving the
>>>>> handler?
>>>>>
>>>>> sigprocmask(2).
>>>>
>>>> You might want to add a link to the function in the note section.
>>>
>>> Actually, this is best avoided IMHO:
>>>
>>> The behaviour of sigprocmask() is unspecified in multithreaded programs,
>>> while pthread_sigmask() is not specified to be safe in signal handlers.
>>
>> I'm not sure I agree. sigprocmask() is explicitly specified as being 
>> async-signal-safe, which suggests that POSIX blesses its use, at least
>> in single-threaded programs. And notwithstanding what POSIX says, 
>> sigprocmask() is safe on Linux/glibc in a MT process (since
>> pthread_sigmask() is just a simple wrapper for sigprocmask()), and
>> I'd guess the same is true on many (most?) other implementations as 
>> well.
> 
> I don't disagree that sigprocmask() is likely to work in practice for
> most purposes, but I wonder whether it could have unexpected effects on
> the masking of the libc internal signals in some implementations,
> particulary when using SIG_SETMASK.
> 
> If trying to execve() out of the signal handler, I think there would be
> a strong temptation to restore the signal mask the program had on entry
> with
> 
> 	sigprocmask(SIG_SETMASK, &orig_mask, NULL);
> 
> say.  If the original signal was taken while in the middle of libc while
> some internal signal was blocked then this would unintentionally unblock
> that signal, and deadlocks and other badness may happen on return.

I understand the theory. But, as far as I can tell, glibc (for example)
does not block the internal signals. So, I think that maybe this
situation can't arise in practice (but of course with no guarantees).
At least that's my reading of the glibc code, but hey, reading glibc
code sometimes make the kernel code look like a walk in the park.

> In theory pthread_sigmask() could defend against this, but I don't know
> whether it actually does in any implementations.

It doesn't look as though glibc's pthread_sigmask() does anything 
along these lines.

> So, IIUC you really must not return after doing something like this
> (certainly if you want to be at all portable).
> 
> Trying to do asynchronous context switching using swapconxtext() would
> fall foul of this (and plenty else).
> 
> 
>>> (Yay, POSIX.)
>>>
>>> For these reasons, execve()'ing directly from a signal handler is not a
>>> great idea.  It would probably be better to escape from the signal
>>> handler with siglongjmp() or setcontext(), with the target sigjmp_buf or
>>> ucontext previously set up do the execve().
>>
>> Well, setcontext() is no longer in POSIX.... (It was removed in
> 
> Well, yes.
> 
>> POSIX.1-2008.) And the specification of longjmp() says:
>>
>>        It is recommended that applications do not call longjmp() or  sig‐
>>        longjmp()  from  signal handlers. To avoid undefined behavior when
>>        calling these functions from a  signal  handler,  the  application
>>        needs to ensure one of the following two things:
>>
>>         1. After  the  call to longjmp() or siglongjmp() the process only
>>            calls async-signal-safe functions and does not return from the
>>            initial call to main().
>>
>>         2. Any  signal  whose  handler calls longjmp() or siglongjmp() is
>>            blocked during every call to a non-async-signal-safe function,
>>            and  no  such  calls are made after returning from the initial
>>            call to main().
> 
> i.e., basically the same constraints you have to follow if you want to
> achieve the same result safely from _within_ the signal handler.

Yes.

> But I take your point: my claim that using siglongjmp() is a better
> approach was overstated.  And it's easy to forget that you're still in
> signal-handler-like context even after siglongjmp().
> 
>> So, in my reading of it, you're no better off than calling
>> sigprocmask() from the signal handler. Do you agree?
> 
> Yes, agreed.

Okay.

> (The background to my comments in this area is that I've learned from
> experience that messing with the signal mask inside a signal handler
> tends to create more problems than it solves -- but that doesn't mean
> that there are no situations where it is legitimate.)

[...]

>> By now, the text has evolved to:
>>
>> [[
>>    Execution of signal handlers
>>        Whenever there is a transition from kernel-mode to user-mode  exe‐
>>        cution  (e.g.,  on  return  from  a system call or scheduling of a
>>        thread onto the CPU), the kernel checks whether there is a pending
>>        signal for which the process has established a signal handler.  If
> 
> The signal must also be unblocked.

Added.

>>        there is such a pending signal, the following steps occur:
> 
> You might want to comment on what happens if there are multiple
> unblocked signals pending -- you can probably refer to signal(7) rather
> than writing it all out again here.

From the context of the email thread, it's not clear, but in
fact the text we are discussion is part of the signal(7) manual
page, so I won't add anything here.

>>        1. The kernel performs the necessary preparatory steps for  execu‐
>>           tion of the signal handler:
>>
>>           a) The signal is removed from the set of pending signals.
>>
>>           b) If  the  signal  handler  was  installed by a call to sigac‐
>>              tion(2) that specified the SA_ONSTACK flag  and  the  thread
>>              has   defined  an  alternate  signal  stack  (using  sigalt‐
>>              stack(2)), then that stack is installed.
> 
> Actually should (b) and (c) be swapped? (c) saves the SP and stack
> configuration.

I wondered about this for quite a while, and concluded that the
order must be as currently described. (I'm not 100% confident about
this though, and I didn't really follow the details in the kernel 
source code.) My reasoning is that suppose  we have an alternate 
signal stack setup for SIGSEGV (to handle the case of overflow of
"standard" stack). The next step (c) creates a stack frame. Surely
the only place where that could be done is on the already installed
alternate stack (since the "standard" stack is exhausted). You may 
be about to educate me, of course.

>>           c) Various pieces of signal-related context are  saved  into  a
>>              special  frame  that is created on the stack.  The saved in‐
>>              formation includes:
>>
>>              + the program counter register (i.e.,  the  address  of  the
>>                next  instruction  in the main program that should be exe‐
>>                cuted when the signal handler returns);
>>
>>              + architecture-specific register state required for resuming
>>                the interrupted program;
>>
>>              + the thread's current signal mask;
>>
>>              + the thread's alternate signal stack settings.
>>
>>           d) Any  signals  specified in act->sa_mask when registering the
>>              handler with sigprocmask(2) are added to the thread's signal
>>              mask.   The signal being delivered is also added to the sig‐
>>              nal mask, unless SA_NODEFER was specified  when  registering
>>              the  handler.  These signals are thus blocked while the han‐
>>              dler executes.
>>
>>        2. The kernel constructs a frame for the  signal  handler  on  the
>>           stack.   The  kernel sets the program counter for the thread to
>>           point to the first instruction of the signal handler  function,
>>           and configures the return address for that function to point to
>>           a piece of user-space code known as the signal trampoline  (de‐
>>           scribed in sigreturn(2)).
>>
>>        3. The  kernel  passes control back to user-space, where execution
>>           commences at the start of the signal handler function.
>>
>>        4. When the signal handler returns, control passes to  the  signal
>>           trampoline code.
>>
>>        5. The  signal  trampoline  calls sigreturn(2), a system call that
>>           uses the information in the stack frame created in  step  1  to
>>           restore  the  thread's signal mask and alternate stack settings
> 
> Nit: and everything else too.
> 
> Would it make sense to say something like:
> 
> "to restore the thread to its state before the signal handler was
> called.  The thread's signal mask and alternate signal stack settings
> are also restored as part of this procedure."

Yes, better. Changed.

>>           to their state before the signal handler was called.  Upon com‐
>>           pletion  of the call to sigreturn(2), the kernel transfers con‐
>>           trol back to user space, and the thread  recommences  execution
>>           at the point where it was interrupted by the signal handler.
>>
>>        Note  that if the signal handler does not return (e.g., control is
>>        transferred out of the handler using siglongjmp(3), or the handler
>>        executes a new program with execve(2)), then the final step is not
>>        performed.  In particular, in such scenarios it  is  the  program‐
>>        mer's  responsibility to restore the state of the signal mask (us‐
>>        ing sigprocmask(2)), if it is desired to unblock the signals  that
>>        were  blocked  on  entry  to  the signal handler.  (Note that sig‐
>>        longjmp(3) may or may not restore the signal  mask,  depending  on
>>        the savesigs value that was specified in the corresponding call to
>>        sigsetjmp(3).)
>> ]]
> 
> Otherwise looks good to me.
> 
> To exec() straight from a signal handler still requires care though in
> order to get things into a sane state for the new process, and while
> avoiding the program dying in unintended ways on the way.
> 
> Doing this safely in a multithreaded program can be hard, to say the
> least.
> 
> 
> One other wrinkle that might be worth mentioning, since it has confused
> me in the past:  There is no magic internal kernel state that is
> different when executing a signal handler.  "Being in a signal handler"
> is in fact not a meaningful concept to the kernel.  Everything is
> tracked in the user registers and on the user stack.  Signal nesting is
> only limited by available stack space (and sane software design...)

(Whisper it every morning: "Kernel memory is a limited, 
nonswappable resource.")

> I'm not sure how to describe this concisely though.

I think you already did a good job. I've taken the text and
reworked it just a little:

       From  the  kernel's point of view, execution of the signal handler
       code is exactly the same as the execution of any other  user-space
       code.   That  is  to  say,  the kernel does not record any special
       state information indicating that the thread is currently excuting
       inside a signal handler.  All necessary state information is main‐
       tained in user-space registers  and  the  user-space  stack.   The
       depth  to which nested signal handlers may be invoked is thus lim‐
       ited only by the user-space stack (and sensible software design!).

Thanks again for your comments, Dave.

Cheers,

Michael

-- 
Michael Kerrisk
Linux man-pages maintainer; http://www.kernel.org/doc/man-pages/
Linux/UNIX System Programming Training: http://man7.org/training/

^ permalink raw reply	[flat|nested] 14+ messages in thread

* Re: sigaction.2: clarification for SA_NODEFER needed
  2020-11-17 17:09               ` Michael Kerrisk (man-pages)
@ 2020-11-17 18:40                 ` Dave Martin
  0 siblings, 0 replies; 14+ messages in thread
From: Dave Martin @ 2020-11-17 18:40 UTC (permalink / raw)
  To: Michael Kerrisk (man-pages); +Cc: Heinrich Schuchardt, linux-man

On Tue, Nov 17, 2020 at 06:09:50PM +0100, Michael Kerrisk (man-pages) wrote:
> Hi Dave,
> 
> On 11/17/20 1:05 PM, Dave Martin wrote:
> > On Tue, Nov 17, 2020 at 10:46:11AM +0000, Michael Kerrisk (man-pages) wrote:
> >> Hi Dave,
> >>
> >> Thanks a heap for taking a look at the text!
> >>
> >> On 11/16/20 2:21 PM, Dave Martin wrote:
> >>> On Thu, Nov 12, 2020 at 09:57:35PM +0100, Heinrich Schuchardt wrote:
> >>>> Am 12. November 2020 21:45:56 MEZ schrieb "Michael Kerrisk (man-pages)" <mtk.manpages@gmail.com>:
> 
> [...]
> 
> >>>>>> Is there a function to change the signal mask without leaving the
> >>>>> handler?
> >>>>>
> >>>>> sigprocmask(2).
> >>>>
> >>>> You might want to add a link to the function in the note section.
> >>>
> >>> Actually, this is best avoided IMHO:
> >>>
> >>> The behaviour of sigprocmask() is unspecified in multithreaded programs,
> >>> while pthread_sigmask() is not specified to be safe in signal handlers.
> >>
> >> I'm not sure I agree. sigprocmask() is explicitly specified as being 
> >> async-signal-safe, which suggests that POSIX blesses its use, at least
> >> in single-threaded programs. And notwithstanding what POSIX says, 
> >> sigprocmask() is safe on Linux/glibc in a MT process (since
> >> pthread_sigmask() is just a simple wrapper for sigprocmask()), and
> >> I'd guess the same is true on many (most?) other implementations as 
> >> well.
> > 
> > I don't disagree that sigprocmask() is likely to work in practice for
> > most purposes, but I wonder whether it could have unexpected effects on
> > the masking of the libc internal signals in some implementations,
> > particulary when using SIG_SETMASK.
> > 
> > If trying to execve() out of the signal handler, I think there would be
> > a strong temptation to restore the signal mask the program had on entry
> > with
> > 
> > 	sigprocmask(SIG_SETMASK, &orig_mask, NULL);
> > 
> > say.  If the original signal was taken while in the middle of libc while
> > some internal signal was blocked then this would unintentionally unblock
> > that signal, and deadlocks and other badness may happen on return.
> 
> I understand the theory. But, as far as I can tell, glibc (for example)
> does not block the internal signals. So, I think that maybe this
> situation can't arise in practice (but of course with no guarantees).
> At least that's my reading of the glibc code, but hey, reading glibc
> code sometimes make the kernel code look like a walk in the park.
> 
> > In theory pthread_sigmask() could defend against this, but I don't know
> > whether it actually does in any implementations.
> 
> It doesn't look as though glibc's pthread_sigmask() does anything 
> along these lines.

That doesn't surprise me too much.

This looks like a case of me inferring a rationale for something that is
actually just a gap in the standards, a known inconsistency, or an
(un)happy accident.


> > So, IIUC you really must not return after doing something like this
> > (certainly if you want to be at all portable).
> > 
> > Trying to do asynchronous context switching using swapconxtext() would
> > fall foul of this (and plenty else).
> > 
> > 
> >>> (Yay, POSIX.)
> >>>
> >>> For these reasons, execve()'ing directly from a signal handler is not a
> >>> great idea.  It would probably be better to escape from the signal
> >>> handler with siglongjmp() or setcontext(), with the target sigjmp_buf or
> >>> ucontext previously set up do the execve().
> >>
> >> Well, setcontext() is no longer in POSIX.... (It was removed in
> > 
> > Well, yes.
> > 
> >> POSIX.1-2008.) And the specification of longjmp() says:
> >>
> >>        It is recommended that applications do not call longjmp() or  sig‐
> >>        longjmp()  from  signal handlers. To avoid undefined behavior when
> >>        calling these functions from a  signal  handler,  the  application
> >>        needs to ensure one of the following two things:
> >>
> >>         1. After  the  call to longjmp() or siglongjmp() the process only
> >>            calls async-signal-safe functions and does not return from the
> >>            initial call to main().
> >>
> >>         2. Any  signal  whose  handler calls longjmp() or siglongjmp() is
> >>            blocked during every call to a non-async-signal-safe function,
> >>            and  no  such  calls are made after returning from the initial
> >>            call to main().
> > 
> > i.e., basically the same constraints you have to follow if you want to
> > achieve the same result safely from _within_ the signal handler.
> 
> Yes.
> 
> > But I take your point: my claim that using siglongjmp() is a better
> > approach was overstated.  And it's easy to forget that you're still in
> > signal-handler-like context even after siglongjmp().
> > 
> >> So, in my reading of it, you're no better off than calling
> >> sigprocmask() from the signal handler. Do you agree?
> > 
> > Yes, agreed.
> 
> Okay.
> 
> > (The background to my comments in this area is that I've learned from
> > experience that messing with the signal mask inside a signal handler
> > tends to create more problems than it solves -- but that doesn't mean
> > that there are no situations where it is legitimate.)
> 
> [...]
> 
> >> By now, the text has evolved to:
> >>
> >> [[
> >>    Execution of signal handlers
> >>        Whenever there is a transition from kernel-mode to user-mode  exe‐
> >>        cution  (e.g.,  on  return  from  a system call or scheduling of a
> >>        thread onto the CPU), the kernel checks whether there is a pending
> >>        signal for which the process has established a signal handler.  If
> > 
> > The signal must also be unblocked.
> 
> Added.
> 
> >>        there is such a pending signal, the following steps occur:
> > 
> > You might want to comment on what happens if there are multiple
> > unblocked signals pending -- you can probably refer to signal(7) rather
> > than writing it all out again here.
> 
> From the context of the email thread, it's not clear, but in
> fact the text we are discussion is part of the signal(7) manual
> page, so I won't add anything here.

For enough.  It's not directly relevant, I guess.


> >>        1. The kernel performs the necessary preparatory steps for  execu‐
> >>           tion of the signal handler:
> >>
> >>           a) The signal is removed from the set of pending signals.
> >>
> >>           b) If  the  signal  handler  was  installed by a call to sigac‐
> >>              tion(2) that specified the SA_ONSTACK flag  and  the  thread
> >>              has   defined  an  alternate  signal  stack  (using  sigalt‐
> >>              stack(2)), then that stack is installed.
> > 
> > Actually should (b) and (c) be swapped? (c) saves the SP and stack
> > configuration.
> 
> I wondered about this for quite a while, and concluded that the
> order must be as currently described. (I'm not 100% confident about
> this though, and I didn't really follow the details in the kernel 
> source code.) My reasoning is that suppose  we have an alternate 
> signal stack setup for SIGSEGV (to handle the case of overflow of
> "standard" stack). The next step (c) creates a stack frame. Surely
> the only place where that could be done is on the already installed
> alternate stack (since the "standard" stack is exhausted). You may 
> be about to educate me, of course.

I see what you mean.

I read "stack is installed" to mean "sp is made to point to that
stack", but I was thinking too much of the implementation details.

I don't think this matters for understanding the effects for
userspace, so I guess it's fine to leave the points in the current
order.

> >>           c) Various pieces of signal-related context are  saved  into  a
> >>              special  frame  that is created on the stack.  The saved in‐
> >>              formation includes:
> >>
> >>              + the program counter register (i.e.,  the  address  of  the
> >>                next  instruction  in the main program that should be exe‐
> >>                cuted when the signal handler returns);
> >>
> >>              + architecture-specific register state required for resuming
> >>                the interrupted program;
> >>
> >>              + the thread's current signal mask;
> >>
> >>              + the thread's alternate signal stack settings.

And I misread this is "the thread's stack settings".  My bad: this is
fine.

Hmm, I wonder how much code actually relies on this feature.

Perhaps this was once considered important for the ucontext API (which
we now know is dodgy to use in signal handlers...)  I can imagine that
ancient userspace threading implementations might have made use of it...

> >>
> >>           d) Any  signals  specified in act->sa_mask when registering the
> >>              handler with sigprocmask(2) are added to the thread's signal
> >>              mask.   The signal being delivered is also added to the sig‐
> >>              nal mask, unless SA_NODEFER was specified  when  registering
> >>              the  handler.  These signals are thus blocked while the han‐
> >>              dler executes.
> >>
> >>        2. The kernel constructs a frame for the  signal  handler  on  the
> >>           stack.   The  kernel sets the program counter for the thread to
> >>           point to the first instruction of the signal handler  function,
> >>           and configures the return address for that function to point to
> >>           a piece of user-space code known as the signal trampoline  (de‐
> >>           scribed in sigreturn(2)).
> >>
> >>        3. The  kernel  passes control back to user-space, where execution
> >>           commences at the start of the signal handler function.
> >>
> >>        4. When the signal handler returns, control passes to  the  signal
> >>           trampoline code.
> >>
> >>        5. The  signal  trampoline  calls sigreturn(2), a system call that
> >>           uses the information in the stack frame created in  step  1  to
> >>           restore  the  thread's signal mask and alternate stack settings
> > 
> > Nit: and everything else too.
> > 
> > Would it make sense to say something like:
> > 
> > "to restore the thread to its state before the signal handler was
> > called.  The thread's signal mask and alternate signal stack settings
> > are also restored as part of this procedure."
> 
> Yes, better. Changed.
> 
> >>           to their state before the signal handler was called.  Upon com‐
> >>           pletion  of the call to sigreturn(2), the kernel transfers con‐
> >>           trol back to user space, and the thread  recommences  execution
> >>           at the point where it was interrupted by the signal handler.
> >>
> >>        Note  that if the signal handler does not return (e.g., control is
> >>        transferred out of the handler using siglongjmp(3), or the handler
> >>        executes a new program with execve(2)), then the final step is not
> >>        performed.  In particular, in such scenarios it  is  the  program‐
> >>        mer's  responsibility to restore the state of the signal mask (us‐
> >>        ing sigprocmask(2)), if it is desired to unblock the signals  that
> >>        were  blocked  on  entry  to  the signal handler.  (Note that sig‐
> >>        longjmp(3) may or may not restore the signal  mask,  depending  on
> >>        the savesigs value that was specified in the corresponding call to
> >>        sigsetjmp(3).)
> >> ]]
> > 
> > Otherwise looks good to me.
> > 
> > To exec() straight from a signal handler still requires care though in
> > order to get things into a sane state for the new process, and while
> > avoiding the program dying in unintended ways on the way.
> > 
> > Doing this safely in a multithreaded program can be hard, to say the
> > least.
> > 
> > 
> > One other wrinkle that might be worth mentioning, since it has confused
> > me in the past:  There is no magic internal kernel state that is
> > different when executing a signal handler.  "Being in a signal handler"
> > is in fact not a meaningful concept to the kernel.  Everything is
> > tracked in the user registers and on the user stack.  Signal nesting is
> > only limited by available stack space (and sane software design...)
> 
> (Whisper it every morning: "Kernel memory is a limited, 
> nonswappable resource.")

Nah, that's what the OOM killer is for.


> > I'm not sure how to describe this concisely though.
> 
> I think you already did a good job. I've taken the text and
> reworked it just a little:
> 
>        From  the  kernel's point of view, execution of the signal handler
>        code is exactly the same as the execution of any other  user-space
>        code.   That  is  to  say,  the kernel does not record any special
>        state information indicating that the thread is currently excuting
>        inside a signal handler.  All necessary state information is main‐
>        tained in user-space registers  and  the  user-space  stack.   The
>        depth  to which nested signal handlers may be invoked is thus lim‐
>        ited only by the user-space stack (and sensible software design!).

Ah, OK.  Works for me.

> Thanks again for your comments, Dave.

No worries

Cheers
---Dave

^ permalink raw reply	[flat|nested] 14+ messages in thread

end of thread, other threads:[~2020-11-17 18:41 UTC | newest]

Thread overview: 14+ messages (download: mbox.gz follow: Atom feed
-- links below jump to the message on this page --
2020-11-11  0:42 sigaction.2: clarification for SA_NODEFER needed Heinrich Schuchardt
2020-11-12 15:37 ` Michael Kerrisk (man-pages)
2020-11-12 16:25   ` Heinrich Schuchardt
2020-11-12 20:45     ` Michael Kerrisk (man-pages)
2020-11-12 20:57       ` Heinrich Schuchardt
2020-11-12 21:01         ` Michael Kerrisk (man-pages)
2020-11-12 21:04           ` Heinrich Schuchardt
2020-11-12 21:17             ` Michael Kerrisk (man-pages)
2020-11-16 13:29             ` Dave Martin
2020-11-16 13:21         ` Dave Martin
2020-11-17 10:46           ` Michael Kerrisk (man-pages)
2020-11-17 12:05             ` Dave Martin
2020-11-17 17:09               ` Michael Kerrisk (man-pages)
2020-11-17 18:40                 ` Dave Martin

This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).