linux-arm-kernel.lists.infradead.org archive mirror
 help / color / mirror / Atom feed
From: dirk.behme@googlemail.com (Dirk Behme)
To: linux-arm-kernel@lists.infradead.org
Subject: SMP issues on ARM11 MPCore
Date: Thu, 7 Jan 2010 14:45:11 +0100	[thread overview]
Message-ID: <1687fa361001070545q73e6738apd6ce5650316eca26@mail.gmail.com> (raw)
In-Reply-To: <1687fa361001070250l62628061h4bebaca1c03d2638@mail.gmail.com>

On Thu, Jan 7, 2010 at 11:50 AM, Dirk Behme <dirk.behme@googlemail.com> wrote:
> On Thu, Dec 24, 2009 at 4:27 PM, mkl lin <mkl0301@hotmail.com> wrote:
>>
>> hi,
>>
>> I'm using ARM11 MPCore with 2 CPU, Linux-2.6.31.1, SMP enabled, L1 enabled, L2 disabled
>>
>> Under SMP environment, I have observed following issues:
>>
>> case 1
>> Sometimes, console became extremely slow, print 1 character for 1-2 seconds
>> RVDS say that both CPU are idling. ?kernel seems find because messages ?response to inserting USB flash is quick and correct.
>>
>> case 2
>> Sometimes, ?the Linux console halt and canot accept any input.
>> RVDS say that both CPU are idling. ?kernel seems fine because messages ?response to inserting USB flash is quick and correct.
>>
>> case 3
>> Sometimes, the test stop with no reason or some fault like segmantation fault and return to console prompt or login prompt.
>>
>> case 4
>> Sometimes,
>> the test stop with no reason, but not returning to console prompt. The
>> console can accept input, but no further response, nor prompt.
>>
>> RVDS says that one CPU is idling, the other is in IRQ context, at
>> entry-armv.S(676) after __pabt_usr, seems like it's keeps getting
>> prefetch abort.
>>
>>
>> I can duplicate case 1 by keep inserting a simple test module.
>>
>> -------
>>
>> #include <linux/init.h>
>> #include <linux/module.h>
>>
>>
>> static int __init MYDRIVER_init(void)
>> {
>>
>> printk("%s: \n",__func__);
>> return 0;
>> }
>>
>> static void __exit MYDRIVER_exit(void)
>> {
>> printk("%s: \n",__func__);
>> }
>>
>> MODULE_AUTHOR("Mac Lin");
>> MODULE_DESCRIPTION("MYDRIVER");
>> MODULE_LICENSE("GPL");
>>
>> module_init(MYDRIVER_init);
>> module_exit(MYDRIVER_exit);
>> -------
>
> We tested above simple module on a 4 CPU ARM11 MPCore system using
> 2.6.32, too. Script to do this is very simple
>
>> cat test.sh
> insmod module.ko
> rmmod module.ko
> insmod module.ko
> ....
>
> This immediately results in [1] ?(user_debug=31 enabled).
>
> Doing some further investigation, we found that [1] happens only if
> module init and exit function are _both_ marked with __init/__exit.
> Removing __init or __exit or both seems to make the test (~50 times)
> run fine.
>
> Any further proposals for testing or debugging?
>
> Best regards
>
> Dirk
>
> [1] ~ # ./test.sh
> ?MYDRIVER_init
> ?MYDRIVER_exit
> ?Unable to handle kernel NULL pointer dereference at virtual address 00000000
> ?pgd = cc37c000
> ?[00000000] *pgd=8c38f031, *pte=00000000, *ppte=00000000
> ?Internal error: Oops: 17 [#1] SMP
> ?last sysfs file:
> ?Modules linked in: [last unloaded: module]
> ?CPU: 0 ? ?Not tainted ?(2.6.32-00012-g89b993e-dirty #17)
> ?PC is at strcmp+0x8/0x34
> ?LR is at sysfs_find_dirent+0x18/0x38
> ?pc : [<c094bd8c>] ? ?lr : [<c08ede4c>] ? ?psr: a0000013
> ?sp : cc26bef8 ?ip : 00000000 ?fp : 00000000
> ?r10: 40025000 ?r9 : cc26a000 ?r8 : 00000880
> ?r7 : cc26bf44 ?r6 : 00000000 ?r5 : 00000000 ?r4 : cc326428
> ?r3 : 000009b3 ?r2 : 00000000 ?r1 : 00000000 ?r0 : 00000000
> ?Flags: NzCv ?IRQs on ?FIQs on ?Mode SVC_32 ?ISA ARM ?Segment user
> ?Control: 08c5787d ?Table: 8c37c00a ?DAC: 00000015
> ?Process rmmod (pid: 378, stack limit = 0xcc26a270)
> ?Stack: (0xcc26bef8 to 0xcc26c000)
> ?bee0: ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? cc3263f8 00000000
> ?bf00: 00000000 c08ec918 cc3263f8 00000000 00000000 00000000 00000000 ccd961e0
> ?bf20: 00000000 c087de50 bf000074 bf000074 00000000 c087e214 00000000 c087e550
> ?bf40: 00000001 75646f6d cc00656c cc37d000 cc25ba20 cc25ba64 cc25ba54 00000000
> ?bf60: 40025000 00000000 40025000 00000001 cc26a000 4002501c bef869c4 0089e8e8
> ?bf80: bf000074 00000880 cc26bf8c 00000000 be00656c 400256c0 00000000 00000081
> ?bfa0: c0842224 c08420a0 be00656c 400256c0 bef86b80 00000880 00000000 75646f6d
> ?bfc0: be00656c 400256c0 00000000 00000081 00000059 00000000 40025000 00000000
> ?bfe0: bef86b80 bef86b70 00012fe0 400e7740 60000010 bef86b80 00000000 00000000
> ?[<c094bd8c>] (strcmp+0x8/0x34) from [<c08ede4c>] (sysfs_find_dirent+0x18/0x38)
> ?[<c08ede4c>] (sysfs_find_dirent+0x18/0x38) from [<c08ec918>]
> (sysfs_hash_and_remove+0x28/0x60)
> ?[<c08ec918>] (sysfs_hash_and_remove+0x28/0x60) from [<c087de50>]
> (free_notes_attrs+0x2c/0x4c)
> ?[<c087de50>] (free_notes_attrs+0x2c/0x4c) from [<c087e214>]
> (free_module+0x2c/0xdc)
> ?[<c087e214>] (free_module+0x2c/0xdc) from [<c087e550>]
> (sys_delete_module+0x214/0x250)
> ?[<c087e550>] (sys_delete_module+0x214/0x250) from [<c08420a0>]
> (ret_fast_syscall+0x0/0x2c)

Debugging of above call stack shows:

strcmp() fails due to both parameters, cs and ct being NULL.

Function free_module() gets 'struct module *mod' as parameter. This
has an element

mod->notes_attrs->attrs[0].attr.name

In case of module's init and exit function are _both_ marked with
__init/__exit, this name is NULL. This is then parsed down by above
call stack and let strcmp() fail.

In case init and/or exit function in above module don't have
__init/__exit, name is a valid pointer.

Any idea what might cause this?

Many thanks and best regards

Dirk

  reply	other threads:[~2010-01-07 13:45 UTC|newest]

Thread overview: 14+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2009-12-24 15:27 SMP issues on ARM11 MPCore mkl lin
2009-12-24 20:11 ` Dirk Behme
2009-12-25  3:13   ` mkl lin
2010-01-01 20:58 ` Russell King - ARM Linux
2010-01-03  8:56   ` mkl lin
2010-01-03 17:29     ` mkl lin
2010-01-05 16:02   ` mkl lin
2010-01-08 17:44     ` George G. Davis
2010-01-07 10:50 ` Dirk Behme
2010-01-07 13:45   ` Dirk Behme [this message]
2010-01-07 14:13     ` Russell King - ARM Linux
2010-01-07 14:39     ` Uwe Kleine-König
2010-01-08 13:14       ` Dirk Behme
2010-01-08 15:09         ` mkl lin

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=1687fa361001070545q73e6738apd6ce5650316eca26@mail.gmail.com \
    --to=dirk.behme@googlemail.com \
    --cc=linux-arm-kernel@lists.infradead.org \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).