public inbox for intel-gfx@lists.freedesktop.org
 help / color / mirror / Atom feed
* [PATCH igt] core/sighelper: Interrupt everyone in the process group
@ 2016-01-08  8:44 Chris Wilson
  2016-01-11  7:57 ` Daniel Vetter
  0 siblings, 1 reply; 8+ messages in thread
From: Chris Wilson @ 2016-01-08  8:44 UTC (permalink / raw)
  To: intel-gfx

Some stress tests create both the signal helper and a lot of competing
processes. In these tests, the parent is just waiting upon the children,
and the intention is not to keep waking up the waiting parent, but to
keep interrupting the children (as we hope to trigger races in our
kernel code). kill(-pid) sends the signal to all members of the process
group, not just the target pid.

We also switch from using SIGUSR1 to SIGCONT to paper over a race
condition when forking children that saw the default signal action being
run (and thus killing the child).

Signed-off-by: Chris Wilson <chris@chris-wilson.co.uk>
---
 lib/igt_aux.c | 10 ++++++----
 1 file changed, 6 insertions(+), 4 deletions(-)

diff --git a/lib/igt_aux.c b/lib/igt_aux.c
index 4d08d68..f6b5792 100644
--- a/lib/igt_aux.c
+++ b/lib/igt_aux.c
@@ -79,7 +79,7 @@ static void __attribute__((noreturn)) signal_helper_process(pid_t pid)
 	/* Interrupt the parent process at 500Hz, just to be annoying */
 	while (1) {
 		usleep(1000 * 1000 / 500);
-		if (kill(pid, SIGUSR1)) /* Parent has died, so must we. */
+		if (kill(pid, SIGCONT)) /* Parent has died, so must we. */
 			exit(0);
 	}
 }
@@ -93,7 +93,7 @@ static void sig_handler(int i)
  * igt_fork_signal_helper:
  *
  * Fork a child process using #igt_fork_helper to interrupt the parent process
- * with a SIGUSR1 signal at regular quick intervals. The corresponding dummy
+ * with a SIGCONT signal at regular quick intervals. The corresponding dummy
  * signal handler is installed in the parent process.
  *
  * This is useful to exercise ioctl error paths, at least where those can be
@@ -108,10 +108,12 @@ void igt_fork_signal_helper(void)
 	if (igt_only_list_subtests())
 		return;
 
-	signal(SIGUSR1, sig_handler);
+	signal(SIGCONT, sig_handler);
+	setpgrp(); /* define a new process group for ourselves */
 
 	igt_fork_helper(&signal_helper) {
-		signal_helper_process(getppid());
+		signal(SIGCONT, SIG_IGN);
+		signal_helper_process(-getppid());
 	}
 }
 
-- 
2.7.0.rc3

_______________________________________________
Intel-gfx mailing list
Intel-gfx@lists.freedesktop.org
http://lists.freedesktop.org/mailman/listinfo/intel-gfx

^ permalink raw reply related	[flat|nested] 8+ messages in thread

* Re: [PATCH igt] core/sighelper: Interrupt everyone in the process group
  2016-01-08  8:44 [PATCH igt] core/sighelper: Interrupt everyone in the process group Chris Wilson
@ 2016-01-11  7:57 ` Daniel Vetter
  2016-01-11  8:54   ` Chris Wilson
  0 siblings, 1 reply; 8+ messages in thread
From: Daniel Vetter @ 2016-01-11  7:57 UTC (permalink / raw)
  To: Chris Wilson; +Cc: intel-gfx

On Fri, Jan 08, 2016 at 08:44:29AM +0000, Chris Wilson wrote:
> Some stress tests create both the signal helper and a lot of competing
> processes. In these tests, the parent is just waiting upon the children,
> and the intention is not to keep waking up the waiting parent, but to
> keep interrupting the children (as we hope to trigger races in our
> kernel code). kill(-pid) sends the signal to all members of the process
> group, not just the target pid.

I don't really have any clue about unix pgroups, but the -pid disappeared
compared to the previous version.
> 
> We also switch from using SIGUSR1 to SIGCONT to paper over a race
> condition when forking children that saw the default signal action being
> run (and thus killing the child).

I thought I fixed that race by first installing the new signal handler,
then forking. Ok, rechecked and it's the SYS_getpid stuff, so another
race. Still I thought signal handlers would survive a fork?
-Daniel

> 
> Signed-off-by: Chris Wilson <chris@chris-wilson.co.uk>
> ---
>  lib/igt_aux.c | 10 ++++++----
>  1 file changed, 6 insertions(+), 4 deletions(-)
> 
> diff --git a/lib/igt_aux.c b/lib/igt_aux.c
> index 4d08d68..f6b5792 100644
> --- a/lib/igt_aux.c
> +++ b/lib/igt_aux.c
> @@ -79,7 +79,7 @@ static void __attribute__((noreturn)) signal_helper_process(pid_t pid)
>  	/* Interrupt the parent process at 500Hz, just to be annoying */
>  	while (1) {
>  		usleep(1000 * 1000 / 500);
> -		if (kill(pid, SIGUSR1)) /* Parent has died, so must we. */
> +		if (kill(pid, SIGCONT)) /* Parent has died, so must we. */
>  			exit(0);
>  	}
>  }
> @@ -93,7 +93,7 @@ static void sig_handler(int i)
>   * igt_fork_signal_helper:
>   *
>   * Fork a child process using #igt_fork_helper to interrupt the parent process
> - * with a SIGUSR1 signal at regular quick intervals. The corresponding dummy
> + * with a SIGCONT signal at regular quick intervals. The corresponding dummy
>   * signal handler is installed in the parent process.
>   *
>   * This is useful to exercise ioctl error paths, at least where those can be
> @@ -108,10 +108,12 @@ void igt_fork_signal_helper(void)
>  	if (igt_only_list_subtests())
>  		return;
>  
> -	signal(SIGUSR1, sig_handler);
> +	signal(SIGCONT, sig_handler);
> +	setpgrp(); /* define a new process group for ourselves */
>  
>  	igt_fork_helper(&signal_helper) {
> -		signal_helper_process(getppid());
> +		signal(SIGCONT, SIG_IGN);
> +		signal_helper_process(-getppid());
>  	}
>  }
>  
> -- 
> 2.7.0.rc3
> 
> _______________________________________________
> Intel-gfx mailing list
> Intel-gfx@lists.freedesktop.org
> http://lists.freedesktop.org/mailman/listinfo/intel-gfx

-- 
Daniel Vetter
Software Engineer, Intel Corporation
http://blog.ffwll.ch
_______________________________________________
Intel-gfx mailing list
Intel-gfx@lists.freedesktop.org
http://lists.freedesktop.org/mailman/listinfo/intel-gfx

^ permalink raw reply	[flat|nested] 8+ messages in thread

* Re: [PATCH igt] core/sighelper: Interrupt everyone in the process group
  2016-01-11  7:57 ` Daniel Vetter
@ 2016-01-11  8:54   ` Chris Wilson
  2016-01-11  9:06     ` Daniel Vetter
  0 siblings, 1 reply; 8+ messages in thread
From: Chris Wilson @ 2016-01-11  8:54 UTC (permalink / raw)
  To: Daniel Vetter; +Cc: intel-gfx

On Mon, Jan 11, 2016 at 08:57:33AM +0100, Daniel Vetter wrote:
> On Fri, Jan 08, 2016 at 08:44:29AM +0000, Chris Wilson wrote:
> > Some stress tests create both the signal helper and a lot of competing
> > processes. In these tests, the parent is just waiting upon the children,
> > and the intention is not to keep waking up the waiting parent, but to
> > keep interrupting the children (as we hope to trigger races in our
> > kernel code). kill(-pid) sends the signal to all members of the process
> > group, not just the target pid.
> 
> I don't really have any clue about unix pgroups, but the -pid disappeared
> compared to the previous version.

-getppid().

I felt it was clearer to pass along the "negative pid = process group"
after setting up the process group.

> > We also switch from using SIGUSR1 to SIGCONT to paper over a race
> > condition when forking children that saw the default signal action being
> > run (and thus killing the child).
> 
> I thought I fixed that race by first installing the new signal handler,
> then forking. Ok, rechecked and it's the SYS_getpid stuff, so another
> race. Still I thought signal handlers would survive a fork?

So did irc. They didn't appear to as the children would sporadically
die with SIGUSR1.
-Chris

-- 
Chris Wilson, Intel Open Source Technology Centre
_______________________________________________
Intel-gfx mailing list
Intel-gfx@lists.freedesktop.org
http://lists.freedesktop.org/mailman/listinfo/intel-gfx

^ permalink raw reply	[flat|nested] 8+ messages in thread

* Re: [PATCH igt] core/sighelper: Interrupt everyone in the process group
  2016-01-11  8:54   ` Chris Wilson
@ 2016-01-11  9:06     ` Daniel Vetter
  2016-01-11 12:25       ` Dave Gordon
  0 siblings, 1 reply; 8+ messages in thread
From: Daniel Vetter @ 2016-01-11  9:06 UTC (permalink / raw)
  To: Chris Wilson, Daniel Vetter, intel-gfx

On Mon, Jan 11, 2016 at 08:54:59AM +0000, Chris Wilson wrote:
> On Mon, Jan 11, 2016 at 08:57:33AM +0100, Daniel Vetter wrote:
> > On Fri, Jan 08, 2016 at 08:44:29AM +0000, Chris Wilson wrote:
> > > Some stress tests create both the signal helper and a lot of competing
> > > processes. In these tests, the parent is just waiting upon the children,
> > > and the intention is not to keep waking up the waiting parent, but to
> > > keep interrupting the children (as we hope to trigger races in our
> > > kernel code). kill(-pid) sends the signal to all members of the process
> > > group, not just the target pid.
> > 
> > I don't really have any clue about unix pgroups, but the -pid disappeared
> > compared to the previous version.
> 
> -getppid().
> 
> I felt it was clearer to pass along the "negative pid = process group"
> after setting up the process group.

Oh, I was blind ... Yeah looks better, but please add a bigger comment
around that code explaining why we need a group and why we use SIG_CONT.
With that acked-by: me.

Cheers, Daniel

> > > We also switch from using SIGUSR1 to SIGCONT to paper over a race
> > > condition when forking children that saw the default signal action being
> > > run (and thus killing the child).
> > 
> > I thought I fixed that race by first installing the new signal handler,
> > then forking. Ok, rechecked and it's the SYS_getpid stuff, so another
> > race. Still I thought signal handlers would survive a fork?
> 
> So did irc. They didn't appear to as the children would sporadically
> die with SIGUSR1.

Could be that libc is doing something funny, iirc they have piles of fork
helpers to make fork more reliable (breaking locks and stuff like that),
but then in turn break the abstraction.
-Daniel
-- 
Daniel Vetter
Software Engineer, Intel Corporation
http://blog.ffwll.ch
_______________________________________________
Intel-gfx mailing list
Intel-gfx@lists.freedesktop.org
http://lists.freedesktop.org/mailman/listinfo/intel-gfx

^ permalink raw reply	[flat|nested] 8+ messages in thread

* Re: [PATCH igt] core/sighelper: Interrupt everyone in the process group
  2016-01-11  9:06     ` Daniel Vetter
@ 2016-01-11 12:25       ` Dave Gordon
  2016-01-11 12:34         ` Chris Wilson
  0 siblings, 1 reply; 8+ messages in thread
From: Dave Gordon @ 2016-01-11 12:25 UTC (permalink / raw)
  To: Daniel Vetter, Chris Wilson, intel-gfx

On 11/01/16 09:06, Daniel Vetter wrote:
> On Mon, Jan 11, 2016 at 08:54:59AM +0000, Chris Wilson wrote:
>> On Mon, Jan 11, 2016 at 08:57:33AM +0100, Daniel Vetter wrote:
>>> On Fri, Jan 08, 2016 at 08:44:29AM +0000, Chris Wilson wrote:
>>>> Some stress tests create both the signal helper and a lot of competing
>>>> processes. In these tests, the parent is just waiting upon the children,
>>>> and the intention is not to keep waking up the waiting parent, but to
>>>> keep interrupting the children (as we hope to trigger races in our
>>>> kernel code). kill(-pid) sends the signal to all members of the process
>>>> group, not just the target pid.
>>>
>>> I don't really have any clue about unix pgroups, but the -pid disappeared
>>> compared to the previous version.
>>
>> -getppid().
>>
>> I felt it was clearer to pass along the "negative pid = process group"
>> after setting up the process group.
>
> Oh, I was blind ... Yeah looks better, but please add a bigger comment
> around that code explaining why we need a group and why we use SIG_CONT.
> With that acked-by: me.
>
> Cheers, Daniel
>
>>>> We also switch from using SIGUSR1 to SIGCONT to paper over a race
>>>> condition when forking children that saw the default signal action being
>>>> run (and thus killing the child).
>>>
>>> I thought I fixed that race by first installing the new signal handler,
>>> then forking. Ok, rechecked and it's the SYS_getpid stuff, so another
>>> race. Still I thought signal handlers would survive a fork?
>>
>> So did irc. They didn't appear to as the children would sporadically
>> die with SIGUSR1.
>
> Could be that libc is doing something funny, iirc they have piles of fork
> helpers to make fork more reliable (breaking locks and stuff like that),
> but then in turn break the abstraction.
> -Daniel

You could use killpg(pgrp, sig) rather than kill(), just to make it 
clearer that the target is a process group, rather than people having to 
know about the "negative pid" semantics.

I don't think SIGCHLD is a good idea; it has kernel-defined semantics 
beyond just sending a signal. And it may not be delivered at all, if the 
disposition is not "caught". SIGUSR1 was the right thing, really; so it 
would be better to work out how to make that work properly, rather than 
change to a different one.

Signal handlers are (supposed to be) inherited across fork(); signal 
disposition is also inherited, and the set of pending signals of a new 
process is (supposed to be) empty. OTOH a signal can be delivered to the 
child before it returns from the fork(), which may be a bit surprising.

I think the safest way to avoid unexpected signals around a fork() is:

parent calls sigprocmask() to block all interesting signals
parent calls fork() --> child inherits mask
parent calls sigprocmask() to restore the previous mask

	child updates handlers if required
	child calls sigprocmask() to unblock signals

.Dave.
_______________________________________________
Intel-gfx mailing list
Intel-gfx@lists.freedesktop.org
http://lists.freedesktop.org/mailman/listinfo/intel-gfx

^ permalink raw reply	[flat|nested] 8+ messages in thread

* Re: [PATCH igt] core/sighelper: Interrupt everyone in the process group
  2016-01-11 12:25       ` Dave Gordon
@ 2016-01-11 12:34         ` Chris Wilson
  2016-01-11 13:29           ` Dave Gordon
  0 siblings, 1 reply; 8+ messages in thread
From: Chris Wilson @ 2016-01-11 12:34 UTC (permalink / raw)
  To: Dave Gordon; +Cc: intel-gfx

On Mon, Jan 11, 2016 at 12:25:07PM +0000, Dave Gordon wrote:
> On 11/01/16 09:06, Daniel Vetter wrote:
> >On Mon, Jan 11, 2016 at 08:54:59AM +0000, Chris Wilson wrote:
> >>On Mon, Jan 11, 2016 at 08:57:33AM +0100, Daniel Vetter wrote:
> >>>On Fri, Jan 08, 2016 at 08:44:29AM +0000, Chris Wilson wrote:
> >>>>Some stress tests create both the signal helper and a lot of competing
> >>>>processes. In these tests, the parent is just waiting upon the children,
> >>>>and the intention is not to keep waking up the waiting parent, but to
> >>>>keep interrupting the children (as we hope to trigger races in our
> >>>>kernel code). kill(-pid) sends the signal to all members of the process
> >>>>group, not just the target pid.
> >>>
> >>>I don't really have any clue about unix pgroups, but the -pid disappeared
> >>>compared to the previous version.
> >>
> >>-getppid().
> >>
> >>I felt it was clearer to pass along the "negative pid = process group"
> >>after setting up the process group.
> >
> >Oh, I was blind ... Yeah looks better, but please add a bigger comment
> >around that code explaining why we need a group and why we use SIG_CONT.
> >With that acked-by: me.
> >
> >Cheers, Daniel
> >
> >>>>We also switch from using SIGUSR1 to SIGCONT to paper over a race
> >>>>condition when forking children that saw the default signal action being
> >>>>run (and thus killing the child).
> >>>
> >>>I thought I fixed that race by first installing the new signal handler,
> >>>then forking. Ok, rechecked and it's the SYS_getpid stuff, so another
> >>>race. Still I thought signal handlers would survive a fork?
> >>
> >>So did irc. They didn't appear to as the children would sporadically
> >>die with SIGUSR1.
> >
> >Could be that libc is doing something funny, iirc they have piles of fork
> >helpers to make fork more reliable (breaking locks and stuff like that),
> >but then in turn break the abstraction.
> >-Daniel
> 
> You could use killpg(pgrp, sig) rather than kill(), just to make it
> clearer that the target is a process group, rather than people
> having to know about the "negative pid" semantics.
> 
> I don't think SIGCHLD is a good idea; it has kernel-defined
> semantics beyond just sending a signal. And it may not be delivered
> at all, if the disposition is not "caught". SIGUSR1 was the right
> thing, really; so it would be better to work out how to make that
> work properly, rather than change to a different one.

SIGCONT not SIGCHLD. And the deposition is supposed to be fully under
our control any way.
 
> Signal handlers are (supposed to be) inherited across fork(); signal
> disposition is also inherited, and the set of pending signals of a
> new process is (supposed to be) empty. OTOH a signal can be
> delivered to the child before it returns from the fork(), which may
> be a bit surprising.
> 
> I think the safest way to avoid unexpected signals around a fork() is:
> 
> parent calls sigprocmask() to block all interesting signals
> parent calls fork() --> child inherits mask
> parent calls sigprocmask() to restore the previous mask

I tried that.
-Chris

-- 
Chris Wilson, Intel Open Source Technology Centre
_______________________________________________
Intel-gfx mailing list
Intel-gfx@lists.freedesktop.org
http://lists.freedesktop.org/mailman/listinfo/intel-gfx

^ permalink raw reply	[flat|nested] 8+ messages in thread

* Re: [PATCH igt] core/sighelper: Interrupt everyone in the process group
  2016-01-11 12:34         ` Chris Wilson
@ 2016-01-11 13:29           ` Dave Gordon
  2016-01-11 13:41             ` Chris Wilson
  0 siblings, 1 reply; 8+ messages in thread
From: Dave Gordon @ 2016-01-11 13:29 UTC (permalink / raw)
  To: Chris Wilson, Daniel Vetter, intel-gfx

On 11/01/16 12:34, Chris Wilson wrote:
> On Mon, Jan 11, 2016 at 12:25:07PM +0000, Dave Gordon wrote:
>> On 11/01/16 09:06, Daniel Vetter wrote:
>>> On Mon, Jan 11, 2016 at 08:54:59AM +0000, Chris Wilson wrote:
>>>> On Mon, Jan 11, 2016 at 08:57:33AM +0100, Daniel Vetter wrote:
>>>>> On Fri, Jan 08, 2016 at 08:44:29AM +0000, Chris Wilson wrote:
>>>>>> Some stress tests create both the signal helper and a lot of competing
>>>>>> processes. In these tests, the parent is just waiting upon the children,
>>>>>> and the intention is not to keep waking up the waiting parent, but to
>>>>>> keep interrupting the children (as we hope to trigger races in our
>>>>>> kernel code). kill(-pid) sends the signal to all members of the process
>>>>>> group, not just the target pid.
>>>>>
>>>>> I don't really have any clue about unix pgroups, but the -pid disappeared
>>>>> compared to the previous version.
>>>>
>>>> -getppid().
>>>>
>>>> I felt it was clearer to pass along the "negative pid = process group"
>>>> after setting up the process group.
>>>
>>> Oh, I was blind ... Yeah looks better, but please add a bigger comment
>>> around that code explaining why we need a group and why we use SIG_CONT.
>>> With that acked-by: me.
>>>
>>> Cheers, Daniel
>>>
>>>>>> We also switch from using SIGUSR1 to SIGCONT to paper over a race
>>>>>> condition when forking children that saw the default signal action being
>>>>>> run (and thus killing the child).
>>>>>
>>>>> I thought I fixed that race by first installing the new signal handler,
>>>>> then forking. Ok, rechecked and it's the SYS_getpid stuff, so another
>>>>> race. Still I thought signal handlers would survive a fork?
>>>>
>>>> So did irc. They didn't appear to as the children would sporadically
>>>> die with SIGUSR1.
>>>
>>> Could be that libc is doing something funny, iirc they have piles of fork
>>> helpers to make fork more reliable (breaking locks and stuff like that),
>>> but then in turn break the abstraction.
>>> -Daniel
>>
>> You could use killpg(pgrp, sig) rather than kill(), just to make it
>> clearer that the target is a process group, rather than people
>> having to know about the "negative pid" semantics.
>>
>> I don't think SIGCHLD is a good idea; it has kernel-defined
>> semantics beyond just sending a signal. And it may not be delivered
>> at all, if the disposition is not "caught". SIGUSR1 was the right
>> thing, really; so it would be better to work out how to make that
>> work properly, rather than change to a different one.
>
> SIGCONT not SIGCHLD. And the deposition is supposed to be fully under
> our control any way.

Oops, yes, I meant SIGCONT has kernel-defined semantics, etc ...

Catching SIGCONT is ... unusual. Because you can't catch SIGSTOP, you 
don't normally have any reason to catch SIGCONT.

Actually, SIGCONT is even more bizarre than SIGCHLD as sending a SIGCONT 
to a process can result in a SIGCHLD being sent to its parent.

>> Signal handlers are (supposed to be) inherited across fork(); signal
>> disposition is also inherited, and the set of pending signals of a
>> new process is (supposed to be) empty. OTOH a signal can be
>> delivered to the child before it returns from the fork(), which may
>> be a bit surprising.
>>
>> I think the safest way to avoid unexpected signals around a fork() is:
>>
>> parent calls sigprocmask() to block all interesting signals
>> parent calls fork() --> child inherits mask
>> parent calls sigprocmask() to restore the previous mask
>
> I tried that.
> -Chris

Are we using signal(2) to install the handlers? 'Cos that's archaic and 
has known unfixable race conditions. The Linux kernel supplies SysV 
signal semantics by default, which means the disposition gets reset 
before the handler is called, so a double signal kills the program. The 
glibc signal(3) wrapper provides BSD semantics which are slightly less 
problematic; but libc5 signal(3) implements SysV.

The proper answer is usually to use sigaction(2) instead. Then the race 
conditions don't (shouldn't) occur, at least if the user code gets all 
the options right.

.Dave.
_______________________________________________
Intel-gfx mailing list
Intel-gfx@lists.freedesktop.org
http://lists.freedesktop.org/mailman/listinfo/intel-gfx

^ permalink raw reply	[flat|nested] 8+ messages in thread

* Re: [PATCH igt] core/sighelper: Interrupt everyone in the process group
  2016-01-11 13:29           ` Dave Gordon
@ 2016-01-11 13:41             ` Chris Wilson
  0 siblings, 0 replies; 8+ messages in thread
From: Chris Wilson @ 2016-01-11 13:41 UTC (permalink / raw)
  To: Dave Gordon; +Cc: intel-gfx

On Mon, Jan 11, 2016 at 01:29:12PM +0000, Dave Gordon wrote:
> On 11/01/16 12:34, Chris Wilson wrote:
> >On Mon, Jan 11, 2016 at 12:25:07PM +0000, Dave Gordon wrote:
> >>On 11/01/16 09:06, Daniel Vetter wrote:
> >>>On Mon, Jan 11, 2016 at 08:54:59AM +0000, Chris Wilson wrote:
> >>>>On Mon, Jan 11, 2016 at 08:57:33AM +0100, Daniel Vetter wrote:
> >>>>>On Fri, Jan 08, 2016 at 08:44:29AM +0000, Chris Wilson wrote:
> >>>>>>Some stress tests create both the signal helper and a lot of competing
> >>>>>>processes. In these tests, the parent is just waiting upon the children,
> >>>>>>and the intention is not to keep waking up the waiting parent, but to
> >>>>>>keep interrupting the children (as we hope to trigger races in our
> >>>>>>kernel code). kill(-pid) sends the signal to all members of the process
> >>>>>>group, not just the target pid.
> >>>>>
> >>>>>I don't really have any clue about unix pgroups, but the -pid disappeared
> >>>>>compared to the previous version.
> >>>>
> >>>>-getppid().
> >>>>
> >>>>I felt it was clearer to pass along the "negative pid = process group"
> >>>>after setting up the process group.
> >>>
> >>>Oh, I was blind ... Yeah looks better, but please add a bigger comment
> >>>around that code explaining why we need a group and why we use SIG_CONT.
> >>>With that acked-by: me.
> >>>
> >>>Cheers, Daniel
> >>>
> >>>>>>We also switch from using SIGUSR1 to SIGCONT to paper over a race
> >>>>>>condition when forking children that saw the default signal action being
> >>>>>>run (and thus killing the child).
> >>>>>
> >>>>>I thought I fixed that race by first installing the new signal handler,
> >>>>>then forking. Ok, rechecked and it's the SYS_getpid stuff, so another
> >>>>>race. Still I thought signal handlers would survive a fork?
> >>>>
> >>>>So did irc. They didn't appear to as the children would sporadically
> >>>>die with SIGUSR1.
> >>>
> >>>Could be that libc is doing something funny, iirc they have piles of fork
> >>>helpers to make fork more reliable (breaking locks and stuff like that),
> >>>but then in turn break the abstraction.
> >>>-Daniel
> >>
> >>You could use killpg(pgrp, sig) rather than kill(), just to make it
> >>clearer that the target is a process group, rather than people
> >>having to know about the "negative pid" semantics.
> >>
> >>I don't think SIGCHLD is a good idea; it has kernel-defined
> >>semantics beyond just sending a signal. And it may not be delivered
> >>at all, if the disposition is not "caught". SIGUSR1 was the right
> >>thing, really; so it would be better to work out how to make that
> >>work properly, rather than change to a different one.
> >
> >SIGCONT not SIGCHLD. And the deposition is supposed to be fully under
> >our control any way.
> 
> Oops, yes, I meant SIGCONT has kernel-defined semantics, etc ...
> 
> Catching SIGCONT is ... unusual. Because you can't catch SIGSTOP,
> you don't normally have any reason to catch SIGCONT.
> 
> Actually, SIGCONT is even more bizarre than SIGCHLD as sending a
> SIGCONT to a process can result in a SIGCHLD being sent to its
> parent.

Really? That's a nuiscance but nothing more. I'm only trying to paper
over a bug here :)
 
> >>Signal handlers are (supposed to be) inherited across fork(); signal
> >>disposition is also inherited, and the set of pending signals of a
> >>new process is (supposed to be) empty. OTOH a signal can be
> >>delivered to the child before it returns from the fork(), which may
> >>be a bit surprising.
> >>
> >>I think the safest way to avoid unexpected signals around a fork() is:
> >>
> >>parent calls sigprocmask() to block all interesting signals
> >>parent calls fork() --> child inherits mask
> >>parent calls sigprocmask() to restore the previous mask
> >
> >I tried that.
> >-Chris
> 
> Are we using signal(2) to install the handlers? 'Cos that's archaic
> and has known unfixable race conditions. The Linux kernel supplies
> SysV signal semantics by default, which means the disposition gets
> reset before the handler is called, so a double signal kills the
> program. The glibc signal(3) wrapper provides BSD semantics which
> are slightly less problematic; but libc5 signal(3) implements SysV.

We are using both, but in for the sighelper interrupt we are using
signal - but a long time before we fork the test children.

Worth a shot as much as anything else...
-Chris

-- 
Chris Wilson, Intel Open Source Technology Centre
_______________________________________________
Intel-gfx mailing list
Intel-gfx@lists.freedesktop.org
http://lists.freedesktop.org/mailman/listinfo/intel-gfx

^ permalink raw reply	[flat|nested] 8+ messages in thread

end of thread, other threads:[~2016-01-11 13:41 UTC | newest]

Thread overview: 8+ messages (download: mbox.gz follow: Atom feed
-- links below jump to the message on this page --
2016-01-08  8:44 [PATCH igt] core/sighelper: Interrupt everyone in the process group Chris Wilson
2016-01-11  7:57 ` Daniel Vetter
2016-01-11  8:54   ` Chris Wilson
2016-01-11  9:06     ` Daniel Vetter
2016-01-11 12:25       ` Dave Gordon
2016-01-11 12:34         ` Chris Wilson
2016-01-11 13:29           ` Dave Gordon
2016-01-11 13:41             ` Chris Wilson

This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox