From mboxrd@z Thu Jan 1 00:00:00 1970 From: Oleg Nesterov Subject: Re: [PATCH 2/3] drm/scheduler: Don't call wait_event_killable for signaled process. Date: Mon, 30 Apr 2018 18:00:06 +0200 Message-ID: <20180430160006.GB10583@redhat.com> References: <87muxsbmkp.fsf@xmission.com> <8840ac96-50c4-f94d-eb7c-f007940163f3@amd.com> <877eowa5qh.fsf@xmission.com> <20180425135552.GD7592@redhat.com> <20180425171757.GA10441@redhat.com> <874ljyu98e.fsf@xmission.com> Mime-Version: 1.0 Content-Type: text/plain; charset=iso-8859-1 Content-Transfer-Encoding: 8bit Return-path: Content-Disposition: inline In-Reply-To: Sender: linux-kernel-owner@vger.kernel.org To: Andrey Grodzovsky Cc: christian.koenig@amd.com, "Eric W. Biederman" , David.Panariti@amd.com, amd-gfx@lists.freedesktop.org, linux-kernel@vger.kernel.org, Alexander.Deucher@amd.com, akpm@linux-foundation.org List-Id: amd-gfx.lists.freedesktop.org On 04/30, Andrey Grodzovsky wrote: > > What about changing PF_SIGNALED to  PF_EXITING in > drm_sched_entity_do_release > > -       if ((current->flags & PF_SIGNALED) && current->exit_code == SIGKILL) > +      if ((current->flags & PF_EXITING) && current->exit_code == SIGKILL) let me repeat, please don't use task->exit_code. And in fact this check is racy. But this doesn't matter. Say, we can trivially add SIGNAL_GROUP_KILLED_BY_SIGKILL, or do something else, but I fail to understand what are you trying to do. Suppose that the check above is correct in that it is true iff the task is exiting and it was killed by SIGKILL. What about the "else" branch which does r = wait_event_killable(sched->job_scheduled, ...) ? Once again, fatal_signal_pending() (or even signal_pending()) is not well defined after the exiting task passes exit_signals(). So wait_event_killable() can fail because fatal_signal_pending() is true; and this can happen even if it was not killed. Or it can block and SIGKILL won't be able to wake it up. > If SIGINT was sent then it's SIGINT, Yes, but see above. in this case fatal_signal_pending() will be likely true so wait_event_killable() will fail unless condition is already true. Oleg.