public inbox for linux-kernel@vger.kernel.org
 help / color / mirror / Atom feed
From: george anzinger <george@mvista.com>
To: Robert Love <rml@tech9.net>, Andrew Morton <akpm@zip.com.au>,
	george@mvista.com, linux-kernel@vger.kernel.org,
	Linus Torvalds <torvalds@transmeta.com>
Subject: Re: [PATCH] preemptive kernel behavior change: don't be rude
Date: Thu, 04 Apr 2002 17:51:13 -0800	[thread overview]
Message-ID: <3CAD0311.CFCCCDB@mvista.com> (raw)
In-Reply-To: <Pine.LNX.4.33.0204041554350.26177-100000@penguin.transmeta.com> <1017965029.23629.685.camel@phantasy>

Robert Love wrote:
> 
> On Thu, 2002-04-04 at 18:55, Linus Torvalds wrote:
> > Fair enough. Send me a patch to look at.
> 
> Linus et all,
> 
> Here we go:
> 
> - do not manually set task->state
> - instead, in preempt_schedule, set a flag in preempt_count that
>   denotes that this task is entering schedule off a kernel preemption.
> - use this flag in schedule to jump to pick_next_task
> - in preempt_schedule, upon return from schedule, unset the flag
> - have entry.S just call preempt_schedule and not duplicate this work,
>   as Linus suggested.  I agree.  Note this makes debugging easier as
>   we keep a single point of entry for kernel preemptions.
> 
> The result: we can safely preempt non-TASK_RUNNING tasks.  If one is
> preempted, we can safely survive schedule because we won't handle the
> special casing of non-TASK_RUNNING at the top of schedule.  Thus other
> tasks can run as desired and our non-TASK_RUNNING task will eventually
> be rescheduled, in its original state, and complete happily.
> 
> This is the behavior we have in the 2.4 patches and 2.5 until
> ~2.5.6-pre.  This works.  It requires no other changes elsewhere (it
> actually removes some special-casing Ingo did in the signal code).
> 
> This patch works in theory and compiles, but received minimal testing.
> 
>         Robert Love
> 
> diff -urN linux-2.5.8-pre1/arch/i386/kernel/entry.S linux/arch/i386/kernel/entry.S
> --- linux-2.5.8-pre1/arch/i386/kernel/entry.S   Wed Apr  3 20:57:24 2002
> +++ linux/arch/i386/kernel/entry.S      Thu Apr  4 18:32:03 2002
> @@ -240,9 +240,7 @@
>         jnz restore_all
>         incl TI_PRE_COUNT(%ebx)
>         sti
> -       movl TI_TASK(%ebx), %ecx                # ti->task
> -       movl $0,(%ecx)                  # current->state = TASK_RUNNING
> -       call SYMBOL_NAME(schedule)
> +       call SYMBOL_NAME(preempt_schedule)
>         jmp ret_from_intr
>  #endif
> 
> diff -urN linux-2.5.8-pre1/arch/i386/kernel/ptrace.c linux/arch/i386/kernel/ptrace.c
> --- linux-2.5.8-pre1/arch/i386/kernel/ptrace.c  Wed Apr  3 20:57:24 2002
> +++ linux/arch/i386/kernel/ptrace.c     Thu Apr  4 18:31:17 2002
> @@ -455,11 +455,9 @@
>            between a syscall stop and SIGTRAP delivery */
>         current->exit_code = SIGTRAP | ((current->ptrace & PT_TRACESYSGOOD)
>                                         ? 0x80 : 0);
> -       preempt_disable();
>         current->state = TASK_STOPPED;
>         notify_parent(current, SIGCHLD);
>         schedule();
> -       preempt_enable();
>         /*
>          * this isn't the same as continuing with a signal, but it will do
>          * for normal use.  strace only continues with a signal if the
> diff -urN linux-2.5.8-pre1/arch/i386/kernel/signal.c linux/arch/i386/kernel/signal.c
> --- linux-2.5.8-pre1/arch/i386/kernel/signal.c  Wed Apr  3 20:57:24 2002
> +++ linux/arch/i386/kernel/signal.c     Thu Apr  4 18:31:00 2002
> @@ -610,11 +610,9 @@
>                 if ((current->ptrace & PT_PTRACED) && signr != SIGKILL) {
>                         /* Let the debugger run.  */
>                         current->exit_code = signr;
> -                       preempt_disable();
>                         current->state = TASK_STOPPED;
>                         notify_parent(current, SIGCHLD);
>                         schedule();
> -                       preempt_enable();
> 
>                         /* We're back.  Did the debugger cancel the sig?  */
>                         if (!(signr = current->exit_code))
> @@ -669,14 +667,12 @@
> 
>                         case SIGSTOP: {
>                                 struct signal_struct *sig;
> +                               current->state = TASK_STOPPED;
>                                 current->exit_code = signr;
>                                 sig = current->parent->sig;
> -                               preempt_disable();
> -                               current->state = TASK_STOPPED;
>                                 if (sig && !(sig->action[SIGCHLD-1].sa.sa_flags & SA_NOCLDSTOP))
>                                         notify_parent(current, SIGCHLD);
>                                 schedule();
> -                               preempt_enable();
>                                 continue;
>                         }
> 
> diff -urN linux-2.5.8-pre1/include/linux/sched.h linux/include/linux/sched.h
> --- linux-2.5.8-pre1/include/linux/sched.h      Wed Apr  3 20:57:27 2002
> +++ linux/include/linux/sched.h Thu Apr  4 18:29:53 2002
> @@ -91,6 +91,7 @@
>  #define TASK_UNINTERRUPTIBLE   2
>  #define TASK_ZOMBIE            4
>  #define TASK_STOPPED           8
> +#define PREEMPT_ACTIVE         0x4000000
> 
>  #define __set_task_state(tsk, state_value)             \
>         do { (tsk)->state = (state_value); } while (0)
> diff -urN linux-2.5.8-pre1/kernel/sched.c linux/kernel/sched.c
> --- linux-2.5.8-pre1/kernel/sched.c     Wed Apr  3 20:57:37 2002
> +++ linux/kernel/sched.c        Thu Apr  4 18:29:24 2002
> @@ -764,6 +764,13 @@
>         prev->sleep_timestamp = jiffies;
>         spin_lock_irq(&rq->lock);
> 
> +       /*
> +        * if entering from preempt_schedule, off a kernel preemption,
> +        * go straight to picking the next task.
> +        */
> +       if (unlikely(preempt_get_count() & PREEMPT_ACTIVE))
> +               goto pick_next_task;
> +
>         switch (prev->state) {
>         case TASK_INTERRUPTIBLE:
>                 if (unlikely(signal_pending(prev))) {
> @@ -775,7 +782,7 @@
>         case TASK_RUNNING:
>                 ;
>         }
> -#if CONFIG_SMP
> +#if CONFIG_SMP || CONFIG_PREEMPT
>  pick_next_task:
>  #endif
>         if (unlikely(!rq->nr_running)) {
> @@ -843,8 +850,11 @@
>  {
>         if (unlikely(preempt_get_count()))
>                 return;

This line implies that entry.S calls with preempt count of 0.  It use to
call with 1 or was that WAY back there?  

If this is the way it is and it works, then the += and -= below can be
changed to = and the second reference to PREEMPT_ACTIVE becomes 0.

I think I would rather have entry.S set preempt_count to PREEMPT_ACTIVE
with the interrupt system off, turn it on, make the call directly to
schedule(), and then set to zero on return.  I really am concerned with
taking an interrupt during the call, i.e. between the interrupt on and
the store below.  This can lead to stack overflow rather easily.

> -       current->state = TASK_RUNNING;
> +
> +       current_thread_info()->preempt_count += PREEMPT_ACTIVE;
>         schedule();
> +       current_thread_info()->preempt_count -= PREEMPT_ACTIVE;
> +       barrier();
>  }
>  #endif /* CONFIG_PREEMPT */
> 

-- 
George Anzinger   george@mvista.com
High-res-timers:  http://sourceforge.net/projects/high-res-timers/
Real time sched:  http://sourceforge.net/projects/rtsched/
Preemption patch: http://www.kernel.org/pub/linux/kernel/people/rml

  reply	other threads:[~2002-04-05  1:55 UTC|newest]

Thread overview: 32+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2002-04-04 11:59 Patch: linux-2.5.8-pre1/kernel/exit.c change caused BUG() at boot time Adam J. Richter
2002-04-04 12:56 ` Stelian Pop
2002-04-04 13:40   ` Alessandro Suardi
2002-04-04 16:23 ` David C. Hansen
2002-04-04 18:28   ` Dave Hansen
2002-04-04 18:51     ` Robert Love
2002-04-04 19:14       ` Linus Torvalds
2002-04-04 19:26         ` Robert Love
2002-04-04 19:41           ` Dave Hansen
2002-04-04 20:02             ` Patch: linux-2.5.8-pre1/kernel/exit.c change caused BUG() atboot time george anzinger
2002-04-04 20:54           ` Andrew Morton
2002-04-04 21:34           ` Patch: linux-2.5.8-pre1/kernel/exit.c change caused BUG() at boot time Roger Larsson
2002-04-04 22:38             ` Andrew Morton
2002-04-04 22:42               ` Linus Torvalds
2002-04-04 22:54                 ` Patch: linux-2.5.8-pre1/kernel/exit.c change caused BUG() atboot time Andrew Morton
2002-04-04 23:07                 ` Patch: linux-2.5.8-pre1/kernel/exit.c change caused BUG() at boot time Robert Love
2002-04-04 23:42                   ` Linus Torvalds
2002-04-04 23:47                     ` Robert Love
2002-04-04 23:55                       ` Linus Torvalds
2002-04-05  0:03                         ` [PATCH] preemptive kernel behavior change: don't be rude Robert Love
2002-04-05  1:51                           ` george anzinger [this message]
2002-04-05  2:06                             ` Robert Love
2002-04-04 22:55               ` Patch: linux-2.5.8-pre1/kernel/exit.c change caused BUG() at boot time Robert Love
2002-04-04 23:10                 ` Patch: linux-2.5.8-pre1/kernel/exit.c change caused BUG() atboot time Andrew Morton
2002-04-04 23:16                   ` Robert Love
2002-04-04 19:13     ` Patch: linux-2.5.8-pre1/kernel/exit.c change caused BUG() at boot time Linus Torvalds
2002-04-04 19:16       ` Robert Love
2002-04-04 19:45         ` Linus Torvalds
2002-04-04 20:09           ` Patch: linux-2.5.8-pre1/kernel/exit.c change caused BUG() atboot time george anzinger
2002-04-04 19:48       ` george anzinger
     [not found] <Pine.LNX.4.33.0204041740220.7731-100000@penguin.transmeta.com>
2002-04-05  3:09 ` [PATCH] preemptive kernel behavior change: don't be rude Robert Love
2002-04-05 20:03   ` Roger Larsson

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=3CAD0311.CFCCCDB@mvista.com \
    --to=george@mvista.com \
    --cc=akpm@zip.com.au \
    --cc=linux-kernel@vger.kernel.org \
    --cc=rml@tech9.net \
    --cc=torvalds@transmeta.com \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox