Re: schedule() BUG - Steve Scott

Linux MIPS Architecture development
 help / color / mirror / Atom feed

From: "Steve Scott" <steve.scott@pioneer-pdt.com>
To: <jsun@mvista.com>
Cc: <linux-mips@linux-mips.org>, <craig.mautner@pioneer-pdt.com>
Subject: Re: schedule() BUG
Date: Mon, 6 Oct 2003 19:05:06 -0700	[thread overview]
Message-ID: <017601c38c77$6d7225a0$2256fea9@janelle> (raw)
In-Reply-To: FJEIIOCBFAIOIDNKLPFJCECODAAA.koji.kawachi@pioneer-pdt.com

[-- Attachment #1: Type: text/plain, Size: 4188 bytes --]

We tried the fault.c patch Jun suggested, but it didn't solve the problem we were
having with the BUG() in schedule(). The patch at the beginning of
except_vec3_generic for the Vr5432 bug had previously been installed.

While chasing the BUG() in schedule(), though, we ran across another BUG() in
alloc_skb() in ...linux/net/core/skbuff.c. :

    alloc_skb called nonatomically from interrupt 80117acc
    kernel BUG at skbuff.c:179!

We changed the way sock_init_data initializes the 'allocation' field and
were able to get past this one (see attached sock.c.patch). We're not sure
if this fix needs to be permanent, or if it's just a temporary workaround.

For the schedule() BUG(), all evidence that we collected pointed to some
interrupt causing us to reenter schedule() (i.e., somehow schedule() was
called during an interrupt handler). We suspected something being run
from the timer interrupt bottom half, but were never able to prove it. We
also thought a remote possibility might be a pipeline hazard in the MIPS
causing the EPC register not to update on a nested exception, but NEC says
that can't happen on the Vr5432 that we're using...

We finally worked around the schedule BUG() by disabling interrupts
during the context switch in schedule(). This workaround required changes
in linux/kernel/sched.c and linux/arch/mips/kernel/r4k_switch.S (see attached
patches).

--steve

> 
> 
> -----Original Message-----
> From: linux-mips-bounce@linux-mips.org
> [mailto:linux-mips-bounce@linux-mips.org]On Behalf Of Jun Sun
> Sent: Wednesday, October 01, 2003 4:50 PM
> To: Craig Mautner
> Cc: linux-mips@linux-mips.org; jsun@mvista.com
> Subject: Re: schedule() BUG
> 
> 
> On Fri, Sep 12, 2003 at 11:04:16AM -0700, Craig Mautner wrote:
> > We are using mips-linux 2.4.17, gcc 3.2.1 (MontaVista) and crashing in
> > schedule():
> >
> > Unable to handle kernel paging request at virtual address 00000000, epc ==
> > 800153c0, ra == 800153c0
> > $0 : 00000000 9001f800 0000001b 00000000 0000001a 83f56000 8298f4a0
> 0000001f
> > $8 : 00000001 ffffe2e0 000022e0 00000000 fffffff9 ffffffff 0000000a
> 00000002
> > $16: 00000000 00000000 82af0000 8298f4a0 83f56000 00000000 80008000
> 00000000
> > $24: 82af1dc2 00000002                   82af0000 82af1ef8 82af1ef8
> 800153c0
> > epc  : 800153c0    Not tainted
> >
> > The code is:
> >
> >     {
> >       struct mm_struct *mm = next->mm;
> >       struct mm_struct *oldmm = prev->active_mm;
> >       if (!mm) {
> >            if (next->active_mm) BUG();   <- this is where we crash
> >            next->active_mm = oldmm;
> >            atomic_inc(&oldmm->mm_count);
> >            enter_lazy_tlb(oldmm, next, this_cpu);
> >       }
> >         .
> >         .
> >         .
> >
> > This seems to happen in our case when 'next' points to 'kswapd' although
> we
> > think it could happen when switching to any kernel task (i.e. those tasks
> > with mm==NULL).
> >
> > We think the culprit is that we are taking an interrupt and rescheduling
> > while at a vulnerable point in 'schedule()'. Interrupts are enabled in
> line
> > 743. If we get an interrupt any time after line 785:
> >
> >            next->active_mm = oldmm;
> >
> > but before line 806
> >
> > __schedule_tail()
> >
> > completes the swap, the interrupt can force 'schedule()' to be reentered
> via
> > 'ret_from_intr()'.
> >
> > If so, 'kswapd's 'active_mm' field will be left non-zero, but 'current'
> will
> > not have been set to point to 'kswapd'. The next time 'schedule()' tries
> to
> > switch to 'kswapd', 'next' points to 'kswapd', and
> >
> >         next->mm == NULL
> >         next->active_mm != NULL
> >
> > which is detected as an invalid state, so we hit the BUG.
> >
> > Some questions:
> > Are we looking at this correctly?
> > Has anyone ever seen this before?
> > Is there a published fix?
> >
> > Thanks,
> >
> > -Craig
> >
> 
> This is an known problem.  Please try the attached patch.
> 
> On R5432 CPU, there is also an hardware bug which can cause the same
> problem.  Please double-check vec3_generic to see if workaround is
> at the beginning of the handler.
> 
> BTW, 2.4.17 is an old kernel. You really need to upgrade.
> 
> Jun
> 
> 
> 

[-- Attachment #2: sock.c.patch --]
[-- Type: application/octet-stream, Size: 341 bytes --]

Index: /home/sscott/work/Software/linux/net/core/sock.c
===================================================================
RCS file: /usr/local/CVS/V4000/Software/linux/net/core/sock.c,v
retrieving revision 1.1.1.1
diff -r1.1.1.1 sock.c
1175c1175
< 	sk->allocation	=	GFP_KERNEL;
---
> 	sk->allocation	=	GFP_ATOMIC; /*GFP_KERNEL;*/

[-- Attachment #3: r4k_switch.S.patch --]
[-- Type: application/octet-stream, Size: 378 bytes --]

Index: /home/sscott/work/Software/linux/arch/mips/kernel/r4k_switch.S
===================================================================
RCS file: /usr/local/CVS/V4000/Software/linux/arch/mips/kernel/r4k_switch.S,v
retrieving revision 1.1.1.1
diff -r1.1.1.1 r4k_switch.S
38a39
> 	ori	t1, 0x1		/* srs - assume ints disabled in schedule(). Reenable when task resumes */


[-- Attachment #4: sched.c.patch --]
[-- Type: application/octet-stream, Size: 391 bytes --]

Index: /home/sscott/work/Software/linux/kernel/sched.c
===================================================================
RCS file: /usr/local/CVS/V4000/Software/linux/kernel/sched.c,v
retrieving revision 1.1.1.1
diff -r1.1.1.1 sched.c
743c743
< 	spin_unlock_irq(&runqueue_lock);
---
> /*srs	spin_unlock_irq(&runqueue_lock); */
747a748
> /*srs*/	spin_unlock_irq(&runqueue_lock);

WARNING: multiple messages have this Message-ID (diff)

From: "Steve Scott" <steve.scott@pioneer-pdt.com>
To: jsun@mvista.com
Cc: linux-mips@linux-mips.org, craig.mautner@pioneer-pdt.com
Subject: Re: schedule() BUG
Date: Mon, 6 Oct 2003 19:05:06 -0700	[thread overview]
Message-ID: <017601c38c77$6d7225a0$2256fea9@janelle> (raw)
Message-ID: <20031007020506.orvgKRE_jfHhHCov2C5pHniXg7-tqstY8zVURcu_enk@z> (raw)
In-Reply-To: FJEIIOCBFAIOIDNKLPFJCECODAAA.koji.kawachi@pioneer-pdt.com

[-- Attachment #1: Type: text/plain, Size: 4188 bytes --]

We tried the fault.c patch Jun suggested, but it didn't solve the problem we were
having with the BUG() in schedule(). The patch at the beginning of
except_vec3_generic for the Vr5432 bug had previously been installed.

While chasing the BUG() in schedule(), though, we ran across another BUG() in
alloc_skb() in ...linux/net/core/skbuff.c. :

    alloc_skb called nonatomically from interrupt 80117acc
    kernel BUG at skbuff.c:179!

We changed the way sock_init_data initializes the 'allocation' field and
were able to get past this one (see attached sock.c.patch). We're not sure
if this fix needs to be permanent, or if it's just a temporary workaround.

For the schedule() BUG(), all evidence that we collected pointed to some
interrupt causing us to reenter schedule() (i.e., somehow schedule() was
called during an interrupt handler). We suspected something being run
from the timer interrupt bottom half, but were never able to prove it. We
also thought a remote possibility might be a pipeline hazard in the MIPS
causing the EPC register not to update on a nested exception, but NEC says
that can't happen on the Vr5432 that we're using...

We finally worked around the schedule BUG() by disabling interrupts
during the context switch in schedule(). This workaround required changes
in linux/kernel/sched.c and linux/arch/mips/kernel/r4k_switch.S (see attached
patches).

--steve

> 
> 
> -----Original Message-----
> From: linux-mips-bounce@linux-mips.org
> [mailto:linux-mips-bounce@linux-mips.org]On Behalf Of Jun Sun
> Sent: Wednesday, October 01, 2003 4:50 PM
> To: Craig Mautner
> Cc: linux-mips@linux-mips.org; jsun@mvista.com
> Subject: Re: schedule() BUG
> 
> 
> On Fri, Sep 12, 2003 at 11:04:16AM -0700, Craig Mautner wrote:
> > We are using mips-linux 2.4.17, gcc 3.2.1 (MontaVista) and crashing in
> > schedule():
> >
> > Unable to handle kernel paging request at virtual address 00000000, epc ==
> > 800153c0, ra == 800153c0
> > $0 : 00000000 9001f800 0000001b 00000000 0000001a 83f56000 8298f4a0
> 0000001f
> > $8 : 00000001 ffffe2e0 000022e0 00000000 fffffff9 ffffffff 0000000a
> 00000002
> > $16: 00000000 00000000 82af0000 8298f4a0 83f56000 00000000 80008000
> 00000000
> > $24: 82af1dc2 00000002                   82af0000 82af1ef8 82af1ef8
> 800153c0
> > epc  : 800153c0    Not tainted
> >
> > The code is:
> >
> >     {
> >       struct mm_struct *mm = next->mm;
> >       struct mm_struct *oldmm = prev->active_mm;
> >       if (!mm) {
> >            if (next->active_mm) BUG();   <- this is where we crash
> >            next->active_mm = oldmm;
> >            atomic_inc(&oldmm->mm_count);
> >            enter_lazy_tlb(oldmm, next, this_cpu);
> >       }
> >         .
> >         .
> >         .
> >
> > This seems to happen in our case when 'next' points to 'kswapd' although
> we
> > think it could happen when switching to any kernel task (i.e. those tasks
> > with mm==NULL).
> >
> > We think the culprit is that we are taking an interrupt and rescheduling
> > while at a vulnerable point in 'schedule()'. Interrupts are enabled in
> line
> > 743. If we get an interrupt any time after line 785:
> >
> >            next->active_mm = oldmm;
> >
> > but before line 806
> >
> > __schedule_tail()
> >
> > completes the swap, the interrupt can force 'schedule()' to be reentered
> via
> > 'ret_from_intr()'.
> >
> > If so, 'kswapd's 'active_mm' field will be left non-zero, but 'current'
> will
> > not have been set to point to 'kswapd'. The next time 'schedule()' tries
> to
> > switch to 'kswapd', 'next' points to 'kswapd', and
> >
> >         next->mm == NULL
> >         next->active_mm != NULL
> >
> > which is detected as an invalid state, so we hit the BUG.
> >
> > Some questions:
> > Are we looking at this correctly?
> > Has anyone ever seen this before?
> > Is there a published fix?
> >
> > Thanks,
> >
> > -Craig
> >
> 
> This is an known problem.  Please try the attached patch.
> 
> On R5432 CPU, there is also an hardware bug which can cause the same
> problem.  Please double-check vec3_generic to see if workaround is
> at the beginning of the handler.
> 
> BTW, 2.4.17 is an old kernel. You really need to upgrade.
> 
> Jun
> 
> 
> 

[-- Attachment #2: sock.c.patch --]
[-- Type: application/octet-stream, Size: 341 bytes --]

Index: /home/sscott/work/Software/linux/net/core/sock.c
===================================================================
RCS file: /usr/local/CVS/V4000/Software/linux/net/core/sock.c,v
retrieving revision 1.1.1.1
diff -r1.1.1.1 sock.c
1175c1175
< 	sk->allocation	=	GFP_KERNEL;
---
> 	sk->allocation	=	GFP_ATOMIC; /*GFP_KERNEL;*/

[-- Attachment #3: r4k_switch.S.patch --]
[-- Type: application/octet-stream, Size: 378 bytes --]

Index: /home/sscott/work/Software/linux/arch/mips/kernel/r4k_switch.S
===================================================================
RCS file: /usr/local/CVS/V4000/Software/linux/arch/mips/kernel/r4k_switch.S,v
retrieving revision 1.1.1.1
diff -r1.1.1.1 r4k_switch.S
38a39
> 	ori	t1, 0x1		/* srs - assume ints disabled in schedule(). Reenable when task resumes */


[-- Attachment #4: sched.c.patch --]
[-- Type: application/octet-stream, Size: 391 bytes --]

Index: /home/sscott/work/Software/linux/kernel/sched.c
===================================================================
RCS file: /usr/local/CVS/V4000/Software/linux/kernel/sched.c,v
retrieving revision 1.1.1.1
diff -r1.1.1.1 sched.c
743c743
< 	spin_unlock_irq(&runqueue_lock);
---
> /*srs	spin_unlock_irq(&runqueue_lock); */
747a748
> /*srs*/	spin_unlock_irq(&runqueue_lock);

next      parent reply	other threads:[~2003-10-07  2:02 UTC|newest]

Thread overview: 13+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
     [not found] <FJEIIOCBFAIOIDNKLPFJCECODAAA.koji.kawachi@pioneer-pdt.com>
2003-10-07  2:05 ` Steve Scott [this message]
2003-10-07  2:05   ` schedule() BUG Steve Scott
2003-10-08 16:29   ` Ralf Baechle
2003-09-12 18:04 Craig Mautner
2003-09-12 18:04 ` Craig Mautner
2003-09-13 16:30 ` Craig Mautner
2003-09-13 16:30   ` Craig Mautner
2003-09-15 18:59 ` Craig Mautner
2003-09-15 18:59   ` Craig Mautner
2003-10-01 23:50 ` Jun Sun
2003-10-02  0:09   ` Ralf Baechle
2003-10-02  0:39     ` Jun Sun
2003-10-02  4:28   ` Daniel Jacobowitz

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to='017601c38c77$6d7225a0$2256fea9@janelle' \
    --to=steve.scott@pioneer-pdt.com \
    --cc=craig.mautner@pioneer-pdt.com \
    --cc=jsun@mvista.com \
    --cc=linux-mips@linux-mips.org \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link

Be sure your reply has a Subject: header at the top and a blank line before the message body.

This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox