public inbox for linux-kernel@vger.kernel.org
* 7ad6e9435596f692ff65f399da12816c94960185 breaks AMD f10h.
@ 2010-09-17 12:56 Conny Seidel
  2010-09-17 13:41 ` Andi Kleen
  2010-09-19  0:14 ` Huang Ying
  0 siblings, 2 replies; 5+ messages in thread
From: Conny Seidel @ 2010-09-17 12:56 UTC (permalink / raw)
  To: Huang Ying; +Cc: Andi Kleen, Len Brown, LKML, borislav.petkov@


[-- Attachment #1.1: Type: text/plain, Size: 9825 bytes --]

Hi,

Kernel v2.6.36-rc4-134-g03a7ab0 breaks with the OOPS below.

Reverting the commit in the subject fixes the issue.
The problem shows up because APEI is selected by default on X86 (X86 && ACPI);
maybe a vendor check in hest_init() would make sense.

Kernel config is attached.

Thx,
Conny

[    2.280657] ------------[ cut here ]------------
[    2.284611] WARNING: at fs/sysfs/dir.c:451 sysfs_add_one+0x87/0x9b()
[    2.290001] Hardware name: Toonie
[    2.293298] sysfs: cannot create duplicate filename '/devices/platform/GHES.9'
[    2.300001] Modules linked in:
[    2.303053] Pid: 1, comm: swapper Not tainted 2.6.36-rc2-autotest #1
[    2.310001] Call Trace:
[    2.312437]  [<ffffffff810f8a32>] ? sysfs_add_one+0x87/0x9b
[    2.320005]  [<ffffffff81032b86>] warn_slowpath_common+0x80/0x99
[    2.325983]  [<ffffffff81032c35>] warn_slowpath_fmt+0x41/0x43
[    2.330002]  [<ffffffff810f8a32>] sysfs_add_one+0x87/0x9b
[    2.335374]  [<ffffffff810f9364>] create_dir+0x6a/0xa9
[    2.340002]  [<ffffffff810f9437>] sysfs_create_dir+0x94/0xb0
[    2.345636]  [<ffffffff811e3b55>] kobject_add_internal+0x129/0x204
[    2.350002]  [<ffffffff811e3d08>] kobject_add_varg+0x41/0x4d
[    2.360002]  [<ffffffff811e3dcf>] kobject_add+0x64/0x66
[    2.365202]  [<ffffffff811e39bd>] ? kobject_get+0x1a/0x21
[    2.370004]  [<ffffffff812644f9>] device_add+0x13c/0x64c
[    2.375291]  [<ffffffff8126a3cb>] ? pm_runtime_init+0xdd/0xe1
[    2.380002]  [<ffffffff812685de>] platform_device_add+0x14e/0x1aa
[    2.386066]  [<ffffffff8122b7e3>] hest_parse_ghes+0x4a/0x7f
[    2.390002]  [<ffffffff8122b799>] ? hest_parse_ghes+0x0/0x7f
[    2.395632]  [<ffffffff8122b773>] apei_hest_parse+0xc3/0xe9
[    2.400004]  [<ffffffff81bacdd6>] ? hest_init+0x0/0x12f
[    2.405202]  [<ffffffff81baceaf>] hest_init+0xd9/0x12f
[    2.410004]  [<ffffffff81000212>] do_one_initcall+0x7c/0x130
[    2.420004]  [<ffffffff81b8367a>] kernel_init+0x131/0x1ba
[    2.425378]  [<ffffffff81003614>] kernel_thread_helper+0x4/0x10
[    2.430002]  [<ffffffff81b83549>] ? kernel_init+0x0/0x1ba
[    2.435375]  [<ffffffff81003610>] ? kernel_thread_helper+0x0/0x10
[    2.440006] ---[ end trace 4eaa2a86a8e2da22 ]---
[    2.444602] kobject_add_internal failed for GHES.9 with -EEXIST, don't try to register things with the same name in the same directory.
[    2.450001] Pid: 1, comm: swapper Tainted: G        W   2.6.36-rc2-autotest #1
[    2.460001] Call Trace:
[    2.462434]  [<ffffffff811e3bff>] kobject_add_internal+0x1d3/0x204
[    2.470002]  [<ffffffff811e3d08>] kobject_add_varg+0x41/0x4d
[    2.480002]  [<ffffffff811e3dcf>] kobject_add+0x64/0x66
[    2.485202]  [<ffffffff811e39bd>] ? kobject_get+0x1a/0x21
[    2.490002]  [<ffffffff812644f9>] device_add+0x13c/0x64c
[    2.495287]  [<ffffffff8126a3cb>] ? pm_runtime_init+0xdd/0xe1
[    2.500002]  [<ffffffff812685de>] platform_device_add+0x14e/0x1aa
[    2.506064]  [<ffffffff8122b7e3>] hest_parse_ghes+0x4a/0x7f
[    2.510001]  [<ffffffff8122b799>] ? hest_parse_ghes+0x0/0x7f
[    2.520003]  [<ffffffff8122b773>] apei_hest_parse+0xc3/0xe9
[    2.525547]  [<ffffffff81bacdd6>] ? hest_init+0x0/0x12f
[    2.530002]  [<ffffffff81baceaf>] hest_init+0xd9/0x12f
[    2.535114]  [<ffffffff81000212>] do_one_initcall+0x7c/0x130
[    2.540002]  [<ffffffff81b8367a>] kernel_init+0x131/0x1ba
[    2.545375]  [<ffffffff81003614>] kernel_thread_helper+0x4/0x10
[    2.550002]  [<ffffffff81b83549>] ? kernel_init+0x0/0x1ba
[    2.555374]  [<ffffffff81003610>] ? kernel_thread_helper+0x0/0x10
[    2.560006] BUG: unable to handle kernel paging request at ffffeae3800553b0
[    2.566959] IP: [<ffffffff810a8dfb>] kfree+0x42/0x22f
[    2.570000] PGD 0
[    2.570000] Oops: 0000 [#1] SMP
[    2.570000] last sysfs file:
[    2.570000] CPU 0
[    2.570000] Modules linked in:
[    2.570000]
[    2.570000] Pid: 1, comm: swapper Tainted: G        W   2.6.36-rc2-autotest #1 Toonie/Toonie
[    2.570000] RIP: 0010:[<ffffffff810a8dfb>]  [<ffffffff810a8dfb>] kfree+0x42/0x22f
[    2.570000] RSP: 0018:ffff8804364b9d20  EFLAGS: 00010082
[    2.570000] RAX: ffffeae3800553b0 RBX: ffff880435cde400 RCX: ffff88043672e780
[    2.570000] RDX: ffffea0000000000 RSI: 0000000000000037 RDI: ffffc9000185a8a4
[    2.570000] RBP: ffff8804364b9d70 R08: ffff8804364b9b40 R09: ffff8804364b99f0
[    2.570000] R10: 0000000000000000 R11: ffff880435cde410 R12: ffff880435cde458
[    2.570000] R13: ffffffff81b03730 R14: ffff88043676e600 R15: 0000000000000000
[    2.570000] FS:  0000000000000000(0000) GS:ffff880001e00000(0000) knlGS:0000000000000000
[    2.570000] CS:  0010 DS: 0000 ES: 0000 CR0: 000000008005003b
[    2.570000] CR2: ffffeae3800553b0 CR3: 0000000001acf000 CR4: 00000000000006f0
[    2.570000] DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000
[    2.570000] DR3: 0000000000000000 DR6: 00000000ffff0ff0 DR7: 0000000000000400
[    2.570000] Process swapper (pid: 1, threadinfo ffff8804364b8000, task ffff8804364b77d0)
[    2.570000] Stack:
[    2.570000]  ffff880437c030c0 ffffc9000185a8a4 0000000000000282 00000000ffffffea
[    2.570000] <0> ffff8804364b9da0 ffff880435cde400 ffff880435cde458 ffffffff81b03730
[    2.570000] <0> ffff88043676e600 0000000000000000 ffff8804364b9d90 ffffffff81268764
[    2.570000] Call Trace:
[    2.570000]  [<ffffffff81268764>] platform_device_release+0x1c/0x34
[    2.570000]  [<ffffffff812632f1>] device_release+0x49/0x7e
[    2.570000]  [<ffffffff811e38c3>] kobject_release+0xf9/0x1d9
[    2.570000]  [<ffffffff811e37ca>] ? kobject_release+0x0/0x1d9
[    2.570000]  [<ffffffff811e4b6a>] kref_put+0x43/0x4f
[    2.570000]  [<ffffffff811e3729>] kobject_put+0x47/0x4c
[    2.570000]  [<ffffffff81263253>] put_device+0x12/0x14
[    2.570000]  [<ffffffff8126878e>] platform_device_put+0x12/0x14
[    2.570000]  [<ffffffff8122b807>] hest_parse_ghes+0x6e/0x7f
[    2.570000]  [<ffffffff8122b799>] ? hest_parse_ghes+0x0/0x7f
[    2.570000]  [<ffffffff8122b773>] apei_hest_parse+0xc3/0xe9
[    2.570000]  [<ffffffff81bacdd6>] ? hest_init+0x0/0x12f
[    2.570000]  [<ffffffff81baceaf>] hest_init+0xd9/0x12f
[    2.570000]  [<ffffffff81000212>] do_one_initcall+0x7c/0x130
[    2.570000]  [<ffffffff81b8367a>] kernel_init+0x131/0x1ba
[    2.570000]  [<ffffffff81003614>] kernel_thread_helper+0x4/0x10
[    2.570000]  [<ffffffff81b83549>] ? kernel_init+0x0/0x1ba
[    2.570000]  [<ffffffff81003610>] ? kernel_thread_helper+0x0/0x10
[    2.570000] Code: 7d b8 0f 86 01 02 00 00 9c 8f 45 c0 fa 48 8b 7d b8 e8 36 d1 f7 ff 48 c1 e8 0c 48 ba 00 00 00 00 00 ea ff ff 48 6b c0 38 48 01 d0 <66> 83 38 00 79 04 48 8b 40 10 66 83 38 00 79 04 48 8b 40 10 80
[    2.570000] RIP  [<ffffffff810a8dfb>] kfree+0x42/0x22f
[    2.570000]  RSP <ffff8804364b9d20>
[    2.570000] CR2: ffffeae3800553b0
[    2.570000] ---[ end trace 4eaa2a86a8e2da23 ]---
[    2.570006] Kernel panic - not syncing: Attempted to kill init!
[    2.580002] Pid: 1, comm: swapper Tainted: G      D W   2.6.36-rc2-autotest #1
[    2.587187] Call Trace:
[    2.589622]  [<ffffffff8168455d>] panic+0x8c/0x196
[    2.590003]  [<ffffffff81035c8f>] do_exit+0x70/0x6c1
[    2.594943]  [<ffffffff8103398f>] ? kmsg_dump+0x116/0x130
[    2.600003]  [<ffffffff81687a89>] oops_end+0xb2/0xba
[    2.604944]  [<ffffffff810228d9>] no_context+0x1f3/0x202
[    2.610002]  [<ffffffff81022aa2>] __bad_area_nosemaphore+0x1ba/0x1e0
[    2.620002]  [<ffffffff81003610>] ? kernel_thread_helper+0x0/0x10
[    2.626067]  [<ffffffff816846a3>] ? printk+0x3c/0x3e
[    2.630004]  [<ffffffff8104c0b4>] ? up+0x34/0x39
[    2.634599]  [<ffffffff81003610>] ? kernel_thread_helper+0x0/0x10
[    2.640003]  [<ffffffff8105843b>] ? __module_text_address+0xd/0x5b
[    2.646155]  [<ffffffff81022ad6>] bad_area_nosemaphore+0xe/0x10
[    2.650003]  [<ffffffff816899dd>] do_page_fault+0x1d7/0x3b9
[    2.660002]  [<ffffffff810052aa>] ? dump_trace+0x316/0x328
[    2.665462]  [<ffffffff81686f5f>] page_fault+0x1f/0x30
[    2.670002]  [<ffffffff810a8dfb>] ? kfree+0x42/0x22f
[    2.674943]  [<ffffffff81268764>] platform_device_release+0x1c/0x34
[    2.680002]  [<ffffffff812632f1>] device_release+0x49/0x7e
[    2.685462]  [<ffffffff811e38c3>] kobject_release+0xf9/0x1d9
[    2.690001]  [<ffffffff811e37ca>] ? kobject_release+0x0/0x1d9
[    2.695720]  [<ffffffff811e4b6a>] kref_put+0x43/0x4f
[    2.700002]  [<ffffffff811e3729>] kobject_put+0x47/0x4c
[    2.705202]  [<ffffffff81263253>] put_device+0x12/0x14
[    2.710002]  [<ffffffff8126878e>] platform_device_put+0x12/0x14
[    2.720002]  [<ffffffff8122b807>] hest_parse_ghes+0x6e/0x7f
[    2.725548]  [<ffffffff8122b799>] ? hest_parse_ghes+0x0/0x7f
[    2.730002]  [<ffffffff8122b773>] apei_hest_parse+0xc3/0xe9
[    2.735546]  [<ffffffff81bacdd6>] ? hest_init+0x0/0x12f
[    2.740002]  [<ffffffff81baceaf>] hest_init+0xd9/0x12f
[    2.745114]  [<ffffffff81000212>] do_one_initcall+0x7c/0x130
[    2.750002]  [<ffffffff81b8367a>] kernel_init+0x131/0x1ba
[    2.755374]  [<ffffffff81003614>] kernel_thread_helper+0x4/0x10
[    2.760003]  [<ffffffff81b83549>] ? kernel_init+0x0/0x1ba
[    2.770002]  [<ffffffff81003610>] ? kernel_thread_helper+0x0/0x10


##
##################################################################
# Email : conny.seidel@amd.com            GnuPG-Key : 0xA6AB055D #
# Fingerprint: 17C4 5DB2 7C4C C1C7 1452 8148 F139 7C09 A6AB 055D #
##################################################################
# Advanced Micro Devices GmbH Einsteinring 24 85609 Dornach      #
# General Managers: Alberto Bozzo, Andrew Bowd                   #
# Registration: Dornach, Landkr. Muenchen; Registerger. Muenchen #
#               HRB Nr. 43632                                    #
##################################################################

[-- Attachment #1.2: config.gz --]
[-- Type: application/x-gzip, Size: 15448 bytes --]



* Re: 7ad6e9435596f692ff65f399da12816c94960185 breaks AMD f10h.
  2010-09-17 12:56 7ad6e9435596f692ff65f399da12816c94960185 breaks AMD f10h Conny Seidel
@ 2010-09-17 13:41 ` Andi Kleen
  2010-09-17 14:50   ` Borislav Petkov
  2010-09-19  0:14 ` Huang Ying
  1 sibling, 1 reply; 5+ messages in thread
From: Andi Kleen @ 2010-09-17 13:41 UTC (permalink / raw)
  To: Conny Seidel; +Cc: Huang Ying, Len Brown, LKML, borislav.petkov@

  On 9/17/2010 2:56 PM, Conny Seidel wrote:
> Hi,
>
> Kernel v2.6.36-rc4-134-g03a7ab0 breaks with the OOPS below.
>
> Reverting the commitid in the subject fixes the issue.
> This is because APEI is selected by default on X86 (X86 && ACPI),
> maybe a vendor check in the hest_init() might make sense.

It looks like your system has a duplicated hest source ID. They are supposed
to have unique identifiers.

The kernel should not oops of course, but it's really a BIOS bug imho.

-Andi



* Re: 7ad6e9435596f692ff65f399da12816c94960185 breaks AMD f10h.
  2010-09-17 13:41 ` Andi Kleen
@ 2010-09-17 14:50   ` Borislav Petkov
  0 siblings, 0 replies; 5+ messages in thread
From: Borislav Petkov @ 2010-09-17 14:50 UTC (permalink / raw)
  To: Andi Kleen; +Cc: Conny Seidel, Huang Ying, Len Brown, LKML

From: Andi Kleen <ak@linux.intel.com>
Date: Fri, Sep 17, 2010 at 03:41:19PM +0200

>  On 9/17/2010 2:56 PM, Conny Seidel wrote:
> >Hi,
> >
> >Kernel v2.6.36-rc4-134-g03a7ab0 breaks with the OOPS below.
> >
> >Reverting the commitid in the subject fixes the issue.
> >This is because APEI is selected by default on X86 (X86 && ACPI),
> >maybe a vendor check in the hest_init() might make sense.
> 
> It looks like your system has a duplicated hest source ID. They are supposed
> to have unique identifiers.
> 
> The kernel should not oops of course, but it's really a BIOS bug imho.

We're looking into the BIOS issue. But right you are, the kernel should
not oops on a BIOS bug - it's not like BIOS bugs are something which
never happens :)

-- 
Regards/Gruss,
Boris.

Advanced Micro Devices GmbH
Einsteinring 24, 85609 Dornach
General Managers: Alberto Bozzo, Andrew Bowd
Registration: Dornach, Gemeinde Aschheim, Landkreis Muenchen
Registergericht Muenchen, HRB Nr. 43632



* Re: 7ad6e9435596f692ff65f399da12816c94960185 breaks AMD f10h.
  2010-09-17 12:56 7ad6e9435596f692ff65f399da12816c94960185 breaks AMD f10h Conny Seidel
  2010-09-17 13:41 ` Andi Kleen
@ 2010-09-19  0:14 ` Huang Ying
  2010-09-19 13:09   ` Conny Seidel
  1 sibling, 1 reply; 5+ messages in thread
From: Huang Ying @ 2010-09-19  0:14 UTC (permalink / raw)
  To: Conny Seidel, Brown, Len; +Cc: Andi Kleen, LKML, borislav.petkov@, Jin Dongming

Hi, Conny,

On Fri, 2010-09-17 at 20:56 +0800, Conny Seidel wrote:
> Hi,
> 
> Kernel v2.6.36-rc4-134-g03a7ab0 breaks with the OOPS below.
> 
> Reverting the commitid in the subject fixes the issue.
> This is because APEI is selected by default on X86 (X86 && ACPI),
> maybe a vendor check in the hest_init() might make sense.

It seems that Dongming has already reported and fixed this bug. Please try
this fix:

http://lkml.org/lkml/2010/9/2/52

Hi, Len,

Can you apply the BUG FIX?

Best Regards,
Huang Ying




* Re: 7ad6e9435596f692ff65f399da12816c94960185 breaks AMD f10h.
  2010-09-19  0:14 ` Huang Ying
@ 2010-09-19 13:09   ` Conny Seidel
  0 siblings, 0 replies; 5+ messages in thread
From: Conny Seidel @ 2010-09-19 13:09 UTC (permalink / raw)
  To: Huang Ying; +Cc: Brown, Len, Andi Kleen, LKML, borislav.petkov@, Jin Dongming

[-- Attachment #1: Type: text/plain, Size: 1298 bytes --]

On Sat, 18 Sep 2010 20:14:06 -0400
Huang Ying <ying.huang@intel.com> wrote:

>Hi, Conny,
>
>On Fri, 2010-09-17 at 20:56 +0800, Conny Seidel wrote:
>> Hi,
>>
>> Kernel v2.6.36-rc4-134-g03a7ab0 breaks with the OOPS below.
>>
>> Reverting the commitid in the subject fixes the issue.
>> This is because APEI is selected by default on X86 (X86 && ACPI),
>> maybe a vendor check in the hest_init() might make sense.
>
>It seems that Dongming has already reported and fixed this bug. Please try
>this fix:
>
>http://lkml.org/lkml/2010/9/2/52
>
>Hi, Len,
>
>Can you apply the BUG FIX?
>
>Best Regards,
>Huang Ying
>
>
The fix is working, thanks a lot.

Conny




