All of lore.kernel.org
 help / color / mirror / Atom feed
* + make-get_user_pages-interruptible.patch added to -mm tree
@ 2008-11-25 23:44 akpm
  0 siblings, 0 replies; 6+ messages in thread
From: akpm @ 2008-11-25 23:44 UTC (permalink / raw)
  To: mm-commits; +Cc: yinghan, hugh, matthew, menage, nickpiggin, penberg, rientjes


The patch titled
     mm: make get_user_pages interruptible
has been added to the -mm tree.  Its filename is
     make-get_user_pages-interruptible.patch

Before you just go and hit "reply", please:
   a) Consider who else should be cc'ed
   b) Prefer to cc a suitable mailing list as well
   c) Ideally: find the original patch on the mailing list and do a
      reply-to-all to that, adding suitable additional cc's

*** Remember to use Documentation/SubmitChecklist when testing your code ***

See http://userweb.kernel.org/~akpm/stuff/added-to-mm.txt to find
out what to do about this

The current -mm tree may be found at http://userweb.kernel.org/~akpm/mmotm/

------------------------------------------------------
Subject: mm: make get_user_pages interruptible
From: Ying Han <yinghan@google.com>

The initial implementation of checking TIF_MEMDIE covers the cases of OOM
killing.  If the process has been OOM killed, the TIF_MEMDIE is set and it
return immediately.  This patch includes:

1.  add the case that the SIGKILL is sent by user processes.  The
   process can try to get_user_pages() unlimited memory even if a user
   process has sent a SIGKILL to it(maybe a monitor find the process
   exceed its memory limit and try to kill it).  In the old
   implementation, the SIGKILL won't be handled until the get_user_pages()
   returns.

2.  change the return value to be ERESTARTSYS.  It makes no sense to
   return ENOMEM if the get_user_pages returned by getting a SIGKILL
   signal.  Considering the general convention for a system call
   interrupted by a signal is ERESTARTNOSYS, so the current return value
   is consistant to that.

Signed-off-by: Paul Menage <menage@google.com>
Signed-off-by:Ying Han <yinghan@google.com>
Acked-by: Pekka Enberg <penberg@cs.helsinki.fi>
Cc: Nick Piggin <nickpiggin@yahoo.com.au>
Cc: Hugh Dickins <hugh@veritas.com>
Cc: Matthew Wilcox <matthew@wil.cx>
Cc: David Rientjes <rientjes@google.com>
Signed-off-by: Andrew Morton <akpm@linux-foundation.org>
---

 include/linux/sched.h |    1 +
 kernel/signal.c       |    2 +-
 mm/memory.c           |   10 +++++-----
 3 files changed, 7 insertions(+), 6 deletions(-)

diff -puN include/linux/sched.h~make-get_user_pages-interruptible include/linux/sched.h
--- a/include/linux/sched.h~make-get_user_pages-interruptible
+++ a/include/linux/sched.h
@@ -1750,6 +1750,7 @@ extern void wake_up_new_task(struct task
 extern void sched_fork(struct task_struct *p, int clone_flags);
 extern void sched_dead(struct task_struct *p);
 
+extern int sigkill_pending(struct task_struct *tsk);
 extern void proc_caches_init(void);
 extern void flush_signals(struct task_struct *);
 extern void ignore_signals(struct task_struct *);
diff -puN kernel/signal.c~make-get_user_pages-interruptible kernel/signal.c
--- a/kernel/signal.c~make-get_user_pages-interruptible
+++ a/kernel/signal.c
@@ -1516,7 +1516,7 @@ static inline int may_ptrace_stop(void)
  * Return nonzero if there is a SIGKILL that should be waking us up.
  * Called with the siglock held.
  */
-static int sigkill_pending(struct task_struct *tsk)
+int sigkill_pending(struct task_struct *tsk)
 {
 	return	sigismember(&tsk->pending.signal, SIGKILL) ||
 		sigismember(&tsk->signal->shared_pending.signal, SIGKILL);
diff -puN mm/memory.c~make-get_user_pages-interruptible mm/memory.c
--- a/mm/memory.c~make-get_user_pages-interruptible
+++ a/mm/memory.c
@@ -1219,12 +1219,12 @@ int __get_user_pages(struct task_struct 
 			struct page *page;
 
 			/*
-			 * If tsk is ooming, cut off its access to large memory
-			 * allocations. It has a pending SIGKILL, but it can't
-			 * be processed until returning to user space.
+			 * If we have a pending SIGKILL, don't keep
+			 * allocating memory.
 			 */
-			if (unlikely(test_tsk_thread_flag(tsk, TIF_MEMDIE)))
-				return i ? i : -ENOMEM;
+			if (unlikely(sigkill_pending(current) ||
+					sigkill_pending(tsk)))
+				return i ? i : -ERESTARTSYS;
 
 			if (write)
 				foll_flags |= FOLL_WRITE;
_

Patches currently in -mm which might be from yinghan@google.com are

make-get_user_pages-interruptible.patch


^ permalink raw reply	[flat|nested] 6+ messages in thread

* Re: + make-get_user_pages-interruptible.patch added to -mm tree
@ 2008-11-27 20:04 Oleg Nesterov
  2008-12-01 23:00 ` Ying Han
  0 siblings, 1 reply; 6+ messages in thread
From: Oleg Nesterov @ 2008-11-27 20:04 UTC (permalink / raw)
  To: Ying Han
  Cc: Paul Menage, Ying Han, Pekka Enberg, Nick Piggin, Hugh Dickins,
	Matthew Wilcox, David Rientjes, Andrew Morton, linux-kernel

Ying Han wrote:
>
> -static int sigkill_pending(struct task_struct *tsk)
> +int sigkill_pending(struct task_struct *tsk)
>  {
>  	return	sigismember(&tsk->pending.signal, SIGKILL) ||
>  		sigismember(&tsk->signal->shared_pending.signal, SIGKILL);
> diff -puN mm/memory.c~make-get_user_pages-interruptible mm/memory.c
> --- a/mm/memory.c~make-get_user_pages-interruptible
> +++ a/mm/memory.c
> @@ -1219,12 +1219,12 @@ int __get_user_pages(struct task_struct 
>  			struct page *page;
>
>  			/*
> -			 * If tsk is ooming, cut off its access to large memory
> -			 * allocations. It has a pending SIGKILL, but it can't
> -			 * be processed until returning to user space.
> +			 * If we have a pending SIGKILL, don't keep
> +			 * allocating memory.
>  			 */
> -			if (unlikely(test_tsk_thread_flag(tsk, TIF_MEMDIE)))
> -				return i ? i : -ENOMEM;
> +			if (unlikely(sigkill_pending(current) ||
> +					sigkill_pending(tsk)))

Please do not export/use sigkill_pending(). It is "private" for ptrace_stop()
(and actually should die imho).

We have fatal_signal_pending() for that.

Oleg.


^ permalink raw reply	[flat|nested] 6+ messages in thread

* Re: + make-get_user_pages-interruptible.patch added to -mm tree
  2008-11-27 20:04 + make-get_user_pages-interruptible.patch added to -mm tree Oleg Nesterov
@ 2008-12-01 23:00 ` Ying Han
  2008-12-01 23:09   ` Andrew Morton
  0 siblings, 1 reply; 6+ messages in thread
From: Ying Han @ 2008-12-01 23:00 UTC (permalink / raw)
  To: Oleg Nesterov
  Cc: Paul Menage, Pekka Enberg, Nick Piggin, Hugh Dickins,
	Matthew Wilcox, David Rientjes, Andrew Morton, linux-kernel

Thanks Oleg, i looked at the code again and this is a reasonable
change. I will make the change on the patch.
Andrew,
    should i make a patch based on current change or make the same
patch as [V6]?

thanks
--Ying

On Thu, Nov 27, 2008 at 12:04 PM, Oleg Nesterov <oleg@redhat.com> wrote:
> Ying Han wrote:
>>
>> -static int sigkill_pending(struct task_struct *tsk)
>> +int sigkill_pending(struct task_struct *tsk)
>>  {
>>       return  sigismember(&tsk->pending.signal, SIGKILL) ||
>>               sigismember(&tsk->signal->shared_pending.signal, SIGKILL);
>> diff -puN mm/memory.c~make-get_user_pages-interruptible mm/memory.c
>> --- a/mm/memory.c~make-get_user_pages-interruptible
>> +++ a/mm/memory.c
>> @@ -1219,12 +1219,12 @@ int __get_user_pages(struct task_struct
>>                       struct page *page;
>>
>>                       /*
>> -                      * If tsk is ooming, cut off its access to large memory
>> -                      * allocations. It has a pending SIGKILL, but it can't
>> -                      * be processed until returning to user space.
>> +                      * If we have a pending SIGKILL, don't keep
>> +                      * allocating memory.
>>                        */
>> -                     if (unlikely(test_tsk_thread_flag(tsk, TIF_MEMDIE)))
>> -                             return i ? i : -ENOMEM;
>> +                     if (unlikely(sigkill_pending(current) ||
>> +                                     sigkill_pending(tsk)))
>
> Please do not export/use sigkill_pending(). It is "private" for ptrace_stop()
> (and actually should die imho).
>
> We have fatal_signal_pending() for that.
>
> Oleg.
>
>

^ permalink raw reply	[flat|nested] 6+ messages in thread

* Re: + make-get_user_pages-interruptible.patch added to -mm tree
  2008-12-01 23:00 ` Ying Han
@ 2008-12-01 23:09   ` Andrew Morton
  2008-12-01 23:17     ` Ying Han
  0 siblings, 1 reply; 6+ messages in thread
From: Andrew Morton @ 2008-12-01 23:09 UTC (permalink / raw)
  To: Ying Han
  Cc: oleg, menage, penberg, nickpiggin, hugh, matthew, rientjes,
	linux-kernel

On Mon, 1 Dec 2008 15:00:14 -0800
Ying Han <yinghan@google.com> wrote:

> On Thu, Nov 27, 2008 at 12:04 PM, Oleg Nesterov <oleg@redhat.com> wrote:
> > Ying Han wrote:
> >>
> >> -static int sigkill_pending(struct task_struct *tsk)
> >> +int sigkill_pending(struct task_struct *tsk)
> >>  {
> >>       return  sigismember(&tsk->pending.signal, SIGKILL) ||
> >>               sigismember(&tsk->signal->shared_pending.signal, SIGKILL);
> >> diff -puN mm/memory.c~make-get_user_pages-interruptible mm/memory.c
> >> --- a/mm/memory.c~make-get_user_pages-interruptible
> >> +++ a/mm/memory.c
> >> @@ -1219,12 +1219,12 @@ int __get_user_pages(struct task_struct
> >>                       struct page *page;
> >>
> >>                       /*
> >> -                      * If tsk is ooming, cut off its access to large memory
> >> -                      * allocations. It has a pending SIGKILL, but it can't
> >> -                      * be processed until returning to user space.
> >> +                      * If we have a pending SIGKILL, don't keep
> >> +                      * allocating memory.
> >>                        */
> >> -                     if (unlikely(test_tsk_thread_flag(tsk, TIF_MEMDIE)))
> >> -                             return i ? i : -ENOMEM;
> >> +                     if (unlikely(sigkill_pending(current) ||
> >> +                                     sigkill_pending(tsk)))
> >
> > Please do not export/use sigkill_pending(). It is "private" for ptrace_stop()
> > (and actually should die imho).
> >
> > We have fatal_signal_pending() for that.
> >
> > Oleg.
> >

(top-posting repaired..)

> Thanks Oleg, i looked at the code again and this is a reasonable
> change. I will make the change on the patch.
> Andrew,
>     should i make a patch based on current change or make the same
> patch as [V6]?

I don't mind either way.  I guess a fresh new patch would simplify
review for everyone.

Note that fatal_signal_pending() is not an exact replacement for
sigkill_pending() - fatal_signal_pending() doesn't test the shared
pending signals.



^ permalink raw reply	[flat|nested] 6+ messages in thread

* Re: + make-get_user_pages-interruptible.patch added to -mm tree
  2008-12-01 23:09   ` Andrew Morton
@ 2008-12-01 23:17     ` Ying Han
  2008-12-02 13:26       ` Oleg Nesterov
  0 siblings, 1 reply; 6+ messages in thread
From: Ying Han @ 2008-12-01 23:17 UTC (permalink / raw)
  To: Andrew Morton
  Cc: oleg, menage, penberg, nickpiggin, hugh, matthew, rientjes,
	linux-kernel

thanks Andrew.


On Mon, Dec 1, 2008 at 3:09 PM, Andrew Morton <akpm@linux-foundation.org> wrote:
> On Mon, 1 Dec 2008 15:00:14 -0800
> Ying Han <yinghan@google.com> wrote:
>
>> On Thu, Nov 27, 2008 at 12:04 PM, Oleg Nesterov <oleg@redhat.com> wrote:
>> > Ying Han wrote:
>> >>
>> >> -static int sigkill_pending(struct task_struct *tsk)
>> >> +int sigkill_pending(struct task_struct *tsk)
>> >>  {
>> >>       return  sigismember(&tsk->pending.signal, SIGKILL) ||
>> >>               sigismember(&tsk->signal->shared_pending.signal, SIGKILL);
>> >> diff -puN mm/memory.c~make-get_user_pages-interruptible mm/memory.c
>> >> --- a/mm/memory.c~make-get_user_pages-interruptible
>> >> +++ a/mm/memory.c
>> >> @@ -1219,12 +1219,12 @@ int __get_user_pages(struct task_struct
>> >>                       struct page *page;
>> >>
>> >>                       /*
>> >> -                      * If tsk is ooming, cut off its access to large memory
>> >> -                      * allocations. It has a pending SIGKILL, but it can't
>> >> -                      * be processed until returning to user space.
>> >> +                      * If we have a pending SIGKILL, don't keep
>> >> +                      * allocating memory.
>> >>                        */
>> >> -                     if (unlikely(test_tsk_thread_flag(tsk, TIF_MEMDIE)))
>> >> -                             return i ? i : -ENOMEM;
>> >> +                     if (unlikely(sigkill_pending(current) ||
>> >> +                                     sigkill_pending(tsk)))
>> >
>> > Please do not export/use sigkill_pending(). It is "private" for ptrace_stop()
>> > (and actually should die imho).
>> >
>> > We have fatal_signal_pending() for that.
>> >
>> > Oleg.
>> >
>
> (top-posting repaired..)
>
>> Thanks Oleg, i looked at the code again and this is a reasonable
>> change. I will make the change on the patch.
>> Andrew,
>>     should i make a patch based on current change or make the same
>> patch as [V6]?
>
> I don't mind either way.  I guess a fresh new patch would simplify
> review for everyone.
>
> Note that fatal_signal_pending() is not an exact replacement for
> sigkill_pending() - fatal_signal_pending() doesn't test the shared
> pending signals.
>
yeah, i noticed that and that is why i used the sigkill_pending
initially. But after a deeper look
at the signal code, i found for SIGKILL ( signal we care about in this
patch ), tsk->pending is a
superset of tsk->signal->shared_pending.


--Ying
>
>

^ permalink raw reply	[flat|nested] 6+ messages in thread

* Re: + make-get_user_pages-interruptible.patch added to -mm tree
  2008-12-01 23:17     ` Ying Han
@ 2008-12-02 13:26       ` Oleg Nesterov
  0 siblings, 0 replies; 6+ messages in thread
From: Oleg Nesterov @ 2008-12-02 13:26 UTC (permalink / raw)
  To: Ying Han
  Cc: Andrew Morton, menage, penberg, nickpiggin, hugh, matthew,
	rientjes, linux-kernel

On 12/01, Ying Han wrote:
>
> On Mon, Dec 1, 2008 at 3:09 PM, Andrew Morton <akpm@linux-foundation.org> wrote:
> >
> > Note that fatal_signal_pending() is not an exact replacement for
> > sigkill_pending() - fatal_signal_pending() doesn't test the shared
> > pending signals.
> >
> yeah, i noticed that and that is why i used the sigkill_pending
> initially. But after a deeper look
> at the signal code, i found for SIGKILL ( signal we care about in this
> patch ), tsk->pending is a
> superset of tsk->signal->shared_pending.

Yes. If we have SIGKILL in ->shared_pending, we must have it in
in ->pending too.

The only exception is when we send SIGKILL to the already dying task
which has already dequeued SIGKILL from ->pending. And we do have a
special case: coredumping. elf_core_dump() does get_user_pages()
when the process is "almost" dead, it has SIGNAL_GROUP_EXIT.

So, if get_user_pages() calls sigkill_pending() instead of
fatal_signal_pending(), this can help. But:

	- this relies on the fact that we always queue SIGKILL
	  even if the task is dead. This behaviour can be changed.

	- this doesn't stop the coredumping, it will continue
	  with DUMP_SEEK(). Yes, this is better if we want to
	  stop this thread to populate the memory, but I think
	  we should just fix the coredumping - it should be
	  interruptible.

Actually, the patch exists:
	http://marc.info/?l=linux-kernel&m=121665710711931

Oleg.


^ permalink raw reply	[flat|nested] 6+ messages in thread

end of thread, other threads:[~2008-12-02 13:28 UTC | newest]

Thread overview: 6+ messages (download: mbox.gz follow: Atom feed
-- links below jump to the message on this page --
2008-11-27 20:04 + make-get_user_pages-interruptible.patch added to -mm tree Oleg Nesterov
2008-12-01 23:00 ` Ying Han
2008-12-01 23:09   ` Andrew Morton
2008-12-01 23:17     ` Ying Han
2008-12-02 13:26       ` Oleg Nesterov
  -- strict thread matches above, loose matches on Subject: below --
2008-11-25 23:44 akpm

This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.