From: Theo Veenker <T.J.G.Veenker@domain.hid>
To: Philippe Gerum <rpm@xenomai.org>
Cc: Xenomai help <xenomai@xenomai.org>
Subject: Re: [Xenomai-help] kernel 2.6.32.11 with xenomai 2.5.3 fails to boot on ubuntu lucid system
Date: Mon, 23 Aug 2010 09:15:31 +0200 [thread overview]
Message-ID: <4C722013.1050301@domain.hid> (raw)
In-Reply-To: <1282498608.1961.38.camel@domain.hid>
Philippe Gerum wrote:
> On Fri, 2010-08-20 at 15:01 +0200, Philippe Gerum wrote:
>> On Fri, 2010-08-20 at 14:31 +0200, Theo Veenker wrote:
>>> Philippe Gerum wrote:
>>>> On Mon, 2010-08-16 at 21:14 +0200, Theo Veenker wrote:
>>>>> On 08/16/2010 04:26 PM, Theo Veenker wrote:
>>>>>> Gilles Chanteperdrix wrote:
>>>>>>> Theo Veenker wrote:
>>>>>>>> Hi,
>>>>>>>>
>>>>>>>> I want to upgrade all our PC's from Ubuntu hardy to lucid and in the
>>>>>>>> process
>>>>>>>> I'm also going from kernel 2.6.29.5 with Xenomai 2.4.8 to kernel
>>>>>>>> 2.6.32.11
>>>>>>>> with Xenomai 2.5.3.
>>>>>>>>
>>>>>>>> I first built and tested the 2.6.32.11 kernel with 2.5.3 on my hardy
>>>>>>>> system
>>>>>>>> and all went fine. But the problem is it just doesn't run on the
>>>>>>>> lucid distro.
>>>>>>> This, I do not understand, the kernel does not need any support from the
>>>>>>> distribution for booting, how can the same kernel boot with one
>>>>>>> distribution, and not with the other? When you say the "same kernel", do
>>>>>>> you mean the exact same zImage or bzImage, or do you mean the kernel
>>>>>>> with the same configuration, but with a different compiler, or only the
>>>>>>> version is identical?
>>>>>>>
>>>>>> It is a complete mystery to me either. I compiled my kernel into a deb
>>>>>> package
>>>>>> and installed the very same deb package on three machines:
>>>>>> MSI p45 neo3 with Hardy on it -> works OK
>>>>>> MSI p45 neo3 with Ludid on it -> nothing (works fine with regular kernel)
>>>>>> MSI 945P with Lucid on it: -> nothing (works fine with regular kernel)
>>>>>>
>>>>>> I'll try the suggestions posted and keep you informed.
>>>>> OK. Connected a terminal to catch early kernel messages. Still no output
>>>>> unfortunately (with the regular kernel I do get output on the terminal,
>>>>> so the connection works).
>>>>>
>>>>> Meanwhile also built and tested kernel 2.6.32.15 + xenomai 2.5.4. Still nothing.
>>>>> I'm clueless. I'm running Xenomai for years on dozens of systems and I've
>>>>> never run into problems like this. I think I'll have to sit down and take a
>>>>> close look at what I'm doing. I've always built my kernels using make-kpkg,
>>>>> maybe that somehow introduces a problem here. I'll try without it.
>>>>>
>>>>> (unfortunately/luckily I have to work from home for a few days so I can't
>>>>> get to the test system until later this week)
>>>> I failed to reproduce the issue yet, but it very much looks like an
>>>> I-pipe bug. Could you try the following config variants when time
>>>> allows:
>>>>
>>>> - on 2.6.32.11 or .15, disable CONFIG_SMP, enable CONFIG_X86_UP_APIC
>>>> only (*).
>>>> - on 2.6.32.11 or .15, disable CONFIG_SMP, enable CONFIG_X86_UP_APIC and
>>>> CONFIG_X86_UP_IOAPIC (*).
>>>> - on 2.6.32.7, use your normal CONFIG_SMP config, with this patch in:
>>>> http://download.gna.org/adeos/patches/v2.6/x86/older/adeos-ipipe-2.6.32.7-x86-2.5-01.patch
>>>>
>>>> (*) you need to switch off CONFIG_SMP first, to see those knobs appear
>>>> in the "processor type and features" menu.
>>>>
>>>> The fact that you did see the panic blinking signal at least once tends
>>>> to point the finger at some access fault the kernel tries to recover
>>>> without success, rather than a sudden freeze. It must happen early
>>>> enough during the boot process, for the console not to be available yet
>>>> for reporting what the kernel whines about.
>>>>
>>>> We don't know yet if that bug is either the consequence of some
>>>> interrupt delivery, and/or induced by code only involved in SMP. Those
>>>> test configs may help in discovering this.
>>>>
>>>> TIA,
>>>>
>>> Here are my results. I've built 5 kernels:
>>> K1: 2.6.32.15 (without the adeos patch applied)
>>> K2: 2.6.32.15 + 2.5.4
>>> K3: 2.6.32.15 + 2.5.4 CONFIG_SMP off, CONFIG_X86_UP_API on, CONFIG_XENOMAI off, CONFIG_IPIPE on
>>> K4: 2.6.32.15 + 2.5.4 as (3) with CONFIG_X86_UP_IOAPIC on
>>> K5: 2.6.32.7 with adeos-ipipe-2.6.32.7-x86-2.5-01.patch
>>>
>>> I now tested these kernels on four systems:
>>> A1: MSI 945P with Ubuntu 8.04
>>> A2: MSI 945P with Ubuntu 10.04
>>> B1: MSI p45 neo3 with Ubuntu 8.04
>>> B2: MSI p45 neo3 with Ubuntu 10.04
>>>
>>> A1 and A2 are identical systems from the same batch and B1 and B2 also.
>>>
>>> What worked:
>>> A1 A2 B1 B2
>>> ----------------------------------------------------------------
>>> K1 Y Y Y Y
>>> K2 Y N/Y Y N
>>> K3 Y N/Y Y N
>>> K4 Y N/Y Y N
>>> K5 Y N/Y Y N
>>>
>>> The No/Yes cases means on this system sometimes the kernel would boot
>>> (same as others have reported before). In the No cases I got no ouput
>>> on the attached console.
>>>
>>> Stange as it may be I still do see a strong correlation between the OS
>>> version and whether an adeos patched kernel works or not.
>> I should be able to confirm reasonably soon that the bug depends on
>> having an interrupt pending or not before the kernel entry point is
>> reached. This may depend on whether the bootloader clears the PIC and
>> when before jumping to the kernel start address. It will also depend on
>> the behavior of some device involved in the boot sequence, such as the
>> disk controller, for pending that IRQ or not.
>>
>> If this assumption turns to be correct, please make sure to send me your
>> congrats for having authored the silly piece of code I have right in
>> front of me now, that randomly turns boot screens into black holes since
>> 2003 or so.
>
> This took longer than expected since there were multiple holes to plug
> for this issue, but here is the patch that makes my box always boot
> properly again with 2.6.32 kernels :
>
> http://git.denx.de/?p=ipipe-2.6.git;a=commit;h=d7ed9ab34a265d6486504d20ee794548ce6011d3
>
> The fix is included in these patches, which are on their way to xenomai
> mainline:
>
> http://download.gna.org/adeos/patches/v2.6/x86/older/adeos-ipipe-2.6.32.15-x86-2.7-02.patch
> http://download.gna.org/adeos/patches/v2.6/x86/older/adeos-ipipe-2.6.32.20-x86-2.7-02.patch
> http://download.gna.org/adeos/patches/v2.6/x86/adeos-ipipe-2.6.34.5-x86-2.7-03.patch
Yeehaa, it works! Just tested with 2.6.32.15 and the machines with Ubuntu 10.04
(the ones using grub2) now boot fine.
Philippe, I bow my head in gratitude. Many thanks everbody.
Theo
next prev parent reply other threads:[~2010-08-23 7:15 UTC|newest]
Thread overview: 53+ messages / expand[flat|nested] mbox.gz Atom feed top
2010-07-20 7:43 [Xenomai-help] kernel 2.6.32.11 with xenomai 2.5.3 fails to boot on ubuntu lucid system Theo Veenker
2010-07-20 10:14 ` Stefan Kisdaroczi
2010-07-20 11:18 ` Theo Veenker
2010-07-20 11:21 ` Gilles Chanteperdrix
2010-07-20 16:32 ` Theo Veenker
2010-08-16 11:22 ` Theo Veenker
2010-08-16 11:39 ` Gilles Chanteperdrix
2010-08-16 13:48 ` Theo Veenker
2010-08-16 13:53 ` Gilles Chanteperdrix
2010-08-16 14:11 ` Theo Veenker
2010-08-16 11:40 ` Stefan Kisdaroczi
2010-08-16 11:59 ` Gilles Chanteperdrix
2010-08-16 12:00 ` Hemal C.Bavishi
2010-08-16 13:55 ` Theo Veenker
2010-08-16 12:45 ` Gilles Chanteperdrix
2010-08-16 14:26 ` Theo Veenker
2010-08-16 19:14 ` Theo Veenker
2010-08-16 22:39 ` Gilles Chanteperdrix
2010-08-17 10:27 ` Philippe Gerum
2010-08-17 13:51 ` Hemal C.Bavishi
2010-08-17 15:24 ` Stefan Kisdaroczi
2010-08-18 7:09 ` Gilles Chanteperdrix
2010-08-18 9:03 ` Paul
2010-08-18 9:06 ` Gilles Chanteperdrix
2010-08-19 15:21 ` Stefan Kisdaroczi
2010-08-19 15:28 ` Gilles Chanteperdrix
2010-08-19 17:13 ` Stefan Kisdaroczi
2010-08-19 18:49 ` Gilles Chanteperdrix
2010-08-20 16:43 ` Stefan Kisdaroczi
2010-08-18 18:45 ` [Xenomai-help] kernel 2.6.32.11 with xenomai 2.5.3 fails to boot on ubuntu lucid system [PATCH] Stefan Kisdaroczi
2010-08-18 18:58 ` Stefan Kisdaroczi
2010-08-18 21:08 ` Paul
2010-08-17 17:01 ` [Xenomai-help] kernel 2.6.32.11 with xenomai 2.5.3 fails to boot on ubuntu lucid system Philippe Gerum
2010-08-17 17:43 ` Stefan Kisdaroczi
2010-08-17 18:06 ` Jan Kiszka
2010-08-18 12:38 ` Stefan Kisdaroczi
2010-08-18 8:27 ` Philippe Gerum
2010-08-18 12:11 ` Stefan Kisdaroczi
2010-08-18 13:54 ` Stefan Kisdaroczi
2010-08-22 17:42 ` Philippe Gerum
2010-08-23 11:59 ` Stefan Kisdaroczi
2010-08-18 14:53 ` Philippe Gerum
2010-08-18 18:09 ` Philippe Gerum
2010-08-18 23:21 ` Gilles Chanteperdrix
2010-08-18 23:25 ` Gilles Chanteperdrix
2010-08-19 5:18 ` Philippe Gerum
2010-08-20 12:31 ` Theo Veenker
2010-08-20 12:34 ` Gilles Chanteperdrix
2010-08-20 13:34 ` Theo Veenker
2010-08-20 13:01 ` Philippe Gerum
2010-08-22 17:36 ` Philippe Gerum
2010-08-23 7:15 ` Theo Veenker [this message]
2010-08-21 9:32 ` Daniele Nicolodi
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=4C722013.1050301@domain.hid \
--to=t.j.g.veenker@domain.hid \
--cc=rpm@xenomai.org \
--cc=xenomai@xenomai.org \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.