From: "Justin P. Mattock" <justinmattock-Re5JQEeQqe8AvxtiuMwx3w@public.gmane.org>
To: Stefan Richter
	<stefanr-MtYdepGKPcBMYopoZt5u/LNAH6kLmebB@public.gmane.org>
Cc: Dan Carpenter <error27-Re5JQEeQqe8AvxtiuMwx3w@public.gmane.org>,
	linux1394-devel-5NWGOfrQmneRv+LV9MX5uipxlwaOVQ5f@public.gmane.org,
	"Rafael J. Wysocki" <rjw-KKrjLPT3xs0@public.gmane.org>,
	Linux Kernel Mailing List
	<linux-kernel-u79uwXL29TY76Z2rM5mHXA@public.gmane.org>,
	Kernel Testers List
	<kernel-testers-u79uwXL29TY76Z2rM5mHXA@public.gmane.org>
Subject: Re: ohci1394_dma=early crash since 2.6.32 (was Re: [Bug #14487] PANIC: early exception 08 rip 246:10 error ffffffff810251b5 cr2 0)
Date: Mon, 01 Feb 2010 12:59:33 -0800
Message-ID: <4B6740B5.5070601@gmail.com>
In-Reply-To: <4B673233.8000300-MtYdepGKPcBMYopoZt5u/LNAH6kLmebB@public.gmane.org>

On 02/01/10 11:57, Stefan Richter wrote:
> Justin P. Mattock wrote:
>> On 02/01/10 04:54, Dan Carpenter wrote:
>>> On Sun, Jan 31, 2010 at 05:39:22PM -0800, Justin P. Mattock wrote:
>>>> On 01/31/10 16:43, Rafael J. Wysocki wrote:
>>>>> This message has been generated automatically as a part of a report
>>>>> of regressions introduced between 2.6.31 and 2.6.32.
>>>>>
>>>>> The following bug entry is on the current list of known regressions
>>>>> introduced between 2.6.31 and 2.6.32.  Please verify if it still should
>>>>> be listed and let me know (either way).
>>>>>
>>>>>
>>>>> Bug-Entry	: http://bugzilla.kernel.org/show_bug.cgi?id=14487
>>>>> Subject	: PANIC: early exception 08 rip 246:10 error ffffffff810251b5 cr2 0
>>>>> Submitter	: Justin P. Mattock<justinmattock-Re5JQEeQqe8AvxtiuMwx3w@public.gmane.org>
>>>>> Date	: 2009-10-23 16:45 (101 days old)
>>>>> References	: http://lkml.org/lkml/2009/10/23/252
> [...]
>>>> yeah still hitting this.
> [...]
>>> I've added the linux1394-devel people to the CC list.
>
> Thanks.  Alas the original author is MIA, and the bug seems to be tied
> to the early platform setup code (rather than OHCI 1394 device specific
> code) about which I for one am clueless.
>
> The listed MAINTAINERS contact of init_ohci1394_dma.c is linux1394-devel
> and me, but a good deal of this driver is very x86 platform specific.
> (There was some interest in making it useful for other architectures, but
> this would merely mean that the respective architecture people need to
> keep an eye on their parts of this driver.)
>
>>> Justin has found an issue: when he boots with ohci1394_dma=early,
>>> his computer crashes.
>>>
>>> He can get it to boot by modifying drivers/ieee1394/init_ohci1394_dma.c:
> [...]
>
> This modification and some others in the LKML thread from October simply
> cause init_ohci1394_controller() to be skipped for all devices.
>
> init_ohci1394_controller() is simple enough:
>
> static inline void __init init_ohci1394_controller(int num, int slot,
> int func)
> {
> 	unsigned long ohci_base;
> 	struct ti_ohci ohci;
>
> 	printk(KERN_INFO "init_ohci1394_dma: initializing OHCI-1394"
> 			 " at %02x:%02x.%x\n", num, slot, func);
>
> 	ohci_base = read_pci_config(num, slot, func,
> 		PCI_BASE_ADDRESS_0+(0<<2)) & PCI_BASE_ADDRESS_MEM_MASK;
>
> 	set_fixmap_nocache(FIX_OHCI1394_BASE, ohci_base);
>
> 	ohci.registers = (void *)fix_to_virt(FIX_OHCI1394_BASE);
>
> 	init_ohci1394_reset_and_init_dma(&ohci);
> }
>
> Justin, you already established that read_pci_config is not the point
> where it crashes, right?
>
> set_fixmap_nocache() and fix_to_virt() frighten me because I don't know
> what they do. :-)
>
> The rest, init_ohci1394_reset_and_init_dma(), is something which I can
> easily follow.  There is just a bunch of register reads and writes with
> occasional mdelays.  This /could/ be a cause of the crash too if the
> controller is inspired to do something dangerous in there --- meaning,
> if the OHCI 1394 controller starts to write something per DMA into
> memory.  However, we do not switch on any DMA context except for the
> so-called physical DMA unit which only springs into action if a remote
> FireWire-attached console instructs it to do so.
>
> I am noticing one point where init_ohci1394_dma.c violates the OHCI 1394
> specification:  OHCI1394_HCControl_linkEnable is switched on while the
> OHCI1394_ConfigROMmap register is still invalid.  This register needs to
> contain a physical address of a 1kB sized, 1kB aligned memory region
> which allows DMA_TO_DEVICE.  So, since this is a read-only DMA, I am
> tempted to say that this potential issue should not be a cause for a
> kernel crash.
>
> (Side note, the OHCI 1394 spec is freely available, see
> http://ieee1394.wiki.kernel.org/index.php/Specifications#OHCI_Release_1.1.2C_January_6.2C_2000
> )
>
>
> Justin Mattock wrote on 2009-10-27 in http://lkml.org/lkml/2009/10/27/335:
>> o.k. you should be able to view
>> this:(let me know if you can't and I can
>> manually write out, and in time find a public
>> photo sharing suite to make things easier).
>>
>> http://www.flickr.com/photos/44066293@N08/4050317695
>>
>> When this happens I see lots of messages from the print
>> during boot, then this happens.
>
> (Now that a bugzilla.kernel.org ticket exists for this you can also use
> bugzilla.kernel.org to publish screenshots if you have an account there.)
>
> This screenshot looks like ___alloc_bootmem_node is the issue here, or
> am I mistaken about what the order of functions in the backtrace means?


Cool, thanks for the assistance and info on this.
(I'll have to read through the OHCI 1394 specification.)

As for __alloc_bootmem_node, I have not looked into that yet
(I can read up on it today).

What I was looking at was:
set_fixmap_nocache(FIX_OHCI1394_BASE, ohci_base);

which led me to arch/x86/include/asm/fixmap.h and made me suspect
that I was hitting something with FIXADDR_TOP, because the system is
a pure64 build. (Reading through fixmap.h, there is a mention that
vsyscall only covers 32-bit, which makes me think this might be it.)
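
For reference, the address arithmetic behind fix_to_virt() can be sketched
in user space. This is only a minimal sketch, not the kernel's code: it
assumes the fixmap layout in which slot idx sits idx pages below
FIXADDR_TOP, and the two "top" constants are illustrative values that
should be checked against the fixmap.h of the tree in question. All
sketch_* and SKETCH_* names are hypothetical.

```c
#include <assert.h>
#include <stdint.h>

#define SKETCH_PAGE_SHIFT 12	/* 4 KiB pages, as on x86 */

/* Illustrative fixmap tops only (assumptions, verify in your tree). */
#define SKETCH_TOP_32 UINT64_C(0xfffff000)
#define SKETCH_TOP_64 UINT64_C(0xffffffffffdff000)

/*
 * Same shape as the kernel's __fix_to_virt() macro: fixmap slots grow
 * downward from the top address, one page per index.
 */
static uint64_t sketch_fix_to_virt(uint64_t fixaddr_top, unsigned int idx)
{
	/* The top of the fixmap area is expected to be page aligned. */
	assert((fixaddr_top & ((UINT64_C(1) << SKETCH_PAGE_SHIFT) - 1)) == 0);

	return fixaddr_top - ((uint64_t)idx << SKETCH_PAGE_SHIFT);
}
```

Calling sketch_fix_to_virt(SKETCH_TOP_32, 3) and
sketch_fix_to_virt(SKETCH_TOP_64, 3) shows that the same slot index lands
at very different virtual addresses on the two layouts, so a wrong
FIXADDR_TOP or a wrong index would move the mapping to an unexpected page.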

And also:

init_ohci1394_reset_and_init_dma(&ohci);

(In the bug report I have a temporary patch which gets me up and
running to do early debugging; there you will see that both calls are
commented out.)

(As for yesterday's 0xffffffffffffffff, I was just experimenting. Google
gives me no info on the difference between 8 f's and 16 f's; I was under
the impression that it is x86_32 versus x86_64 for the PCI address.)
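
On the 8 f's versus 16 f's question, a small user-space sketch may help.
It assumes that the early PCI config read returns a 32-bit value on both
x86_32 and x86_64, and that only the variable holding the result
(unsigned long ohci_base) changes width between the two. The mask mirrors
PCI_BASE_ADDRESS_MEM_MASK; the sketch_* names and any sample BAR values
are hypothetical.

```c
#include <assert.h>
#include <stdint.h>

/* Same effect as PCI_BASE_ADDRESS_MEM_MASK: clear the low 4 flag bits. */
#define SKETCH_BAR_MEM_MASK (~UINT64_C(0x0f))

/* All-ones is what a 32-bit config read returns when no device answers. */
static int sketch_bar_is_absent(uint32_t bar)
{
	return bar == UINT32_MAX;
}

/*
 * Widen a 32-bit memory BAR to 64 bits and strip the flag bits.
 * Widening zero-extends, so the result has at most 8 significant
 * hex digits no matter how wide the destination variable is.
 */
static uint64_t sketch_bar_to_base(uint32_t bar)
{
	/* Bit 0 set would mean an I/O BAR, which this sketch does not handle. */
	assert((bar & 0x1) == 0);

	return (uint64_t)bar & SKETCH_BAR_MEM_MASK;
}
```

With a made-up BAR of 0xfebff004, sketch_bar_to_base() gives 0xfebff000 on
either architecture. A value of 0xffffffff (8 f's) is the usual "nothing
there" answer from a single config read, whereas 16 f's cannot come from
one 32-bit read at all.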

As for bugzilla.kernel.org, I'll have to set up an
account there (flickr is nice, but having a bug report
photo next to pics of my vacation isn't).

In general I'm thinking this has to do with the arch (but I could be wrong),
because one LFS system I built was x86_32, which worked fine, and
the next one is a pure64 build, which triggers this.

Thanks for the info/help.

Justin P. Mattock
