From: Anders Blomdell <anders.blomdell@control.lth.se>
To: Jan Kiszka <jan.kiszka@siemens.com>
Cc: Xenomai <xenomai@xenomai.org>
Subject: Re: [Xenomai] kernel BUG at arch/x86/kernel/ipipe.c:589! on motherboard DX79SI
Date: Mon, 25 Feb 2013 16:17:03 +0100 [thread overview]
Message-ID: <512B806F.6010004@control.lth.se> (raw)
In-Reply-To: <512B7B1E.7090900@siemens.com>
On 2013-02-25 15:54, Jan Kiszka wrote:
> On 2013-02-25 15:39, Anders Blomdell wrote:
>> On 2013-02-25 13:27, Gilles Chanteperdrix wrote:
>>> On 02/25/2013 11:18 AM, Anders Blomdell wrote:
>>>
>>>> On 2013-02-15 16:26, Jan Kiszka wrote:
>>>>> On 2013-02-15 16:15, Anders Blomdell wrote:
>>>>>> Hi,
>>>>>>
>>>>>> I have a DX79SI that dies with "kernel BUG at
>>>>>> arch/x86/kernel/ipipe.c:589!" when running Xenomai. This is not very
>>>>>> surprising since when running the system with an ordinary kernel thera
>>>>>> are a few 'do_IRQ: X.Y No irq handler for vector (irq -1)' each day.
>>>>>>
>>>>>> Question is if it would be possible to do something less fatal than
>>>>>> 'BUG_ON(irq < 0);' in the code below:
>>>>>
>>>>> This remains a bug that has to be understood.
>>>>>
>>>>>>
>>>>>> int __ipipe_handle_irq(struct pt_regs *regs)
>>>>>> {
>>>>>> struct ipipe_percpu_data *p = __ipipe_this_cpu_ptr(&ipipe_percpu);
>>>>>> int irq, vector = regs->orig_ax, flags = 0;
>>>>>> struct pt_regs *tick_regs;
>>>>>>
>>>>>> if (likely(vector < 0)) {
>>>>>> irq = __this_cpu_read(vector_irq[~vector]);
>>>>>> BUG_ON(irq < 0);
>>>>>> } else { /* Software-generated. */
>>>>>> irq = vector;
>>>>>> flags = IPIPE_IRQF_NOACK;
>>>>>> }
>>>>>
>>>>> Kernel 3.5.7 with latest I-pipe?
>>>> Yes.
>>>>
>>>>> This is the second report of this kind,
>>>>> see [1] for the discussion and suggestions. If you don't have KGDB and
>>>>> that kind enabled, try Gilles' instrumentations.
>>>> After a running xenomai five and a half day on a DX58SO motherboard, the
>>>> system crashed, leaving a single 'do_IRQ: 2.166 No irq handler for
>>>> vector (irq -1)' on our logserver.
>>>>
>>>> I'm planning to put in Gilles instrumentations and change the BUG_ON to
>>>> a WARN_ON/WARN, but what should I return after that (my guess is a
>>>> 'return 1', but waiting a week to be proved wrong would be a waste of
>>>> time :-).
>>>
>>>
>>> Returning 1 is incorrect:
>>> - you should probably jump to the end of the __ipipe_handle_irq function
>>> - if the irq is irq 7, meaning a spurious irq, Linux should handle it,
>>> so, __ipipe_dispatch_irq should be called.
>> OK, so you mean that I'm probably lokking at two different problems
>> DX58SO: a spurious interrupt (irq==7) passes through __ipipe_handle_irq
>> without triggering BUG_ON, but something else breaks.
>> DX79SI: some (spurious?) interrupt results in irq < 0, triggering BUG_ON
>>
>> Would the following changes be what you have in mind:
>>
>> if (likely(vector < 0)) {
>> irq = __this_cpu_read(vector_irq[~vector]);
>> if (irq < 0) {
>> WARN(irq < 0, "irq(%d) < 0", irq);
>
> Again, that's only an instrumentation to help finding the bug.
I know (have to crawl before i can walk :-))
My understanding of the code is that if irq < 0, should not call
__ipipe_dispatch_irq, since its irq argument is unsigned, but perhaps
the MAYDAY and IPIPE_STALL_FLAG stuff should be executed (moving the
out: label up a few lines).
I guess that I don't need to consider the special case when
p->hrtimer_irq == -1 and irq == -1?
> According to my reading of the code, Linux should behave incorrectly
> over invalid vector_irq entries as well.
The difference on DX79SI being that Linux (3.6.11) only logs do_IRQ,
while Xenomai (Linux 3.5.7/xenomai-2.6.2.1) gives a BUG_ON which makes
all filesystems read-only, and after that everything more or less
freezes :-(
>
> Jan
>
>> goto out:
>> }
>> } else { /* Software-generated. */
>> irq = vector;
>> flags = IPIPE_IRQF_NOACK;
>> }
>> ...
>> out:
>> return 1;
>>
>>
>>
>>
Regards
Anders
--
Anders Blomdell Email: anders.blomdell@control.lth.se
Department of Automatic Control
Lund University Phone: +46 46 222 4625
P.O. Box 118 Fax: +46 46 138118
SE-221 00 Lund, Sweden
prev parent reply other threads:[~2013-02-25 15:17 UTC|newest]
Thread overview: 11+ messages / expand[flat|nested] mbox.gz Atom feed top
2013-02-15 15:15 [Xenomai] kernel BUG at arch/x86/kernel/ipipe.c:589! on motherboard DX79SI Anders Blomdell
2013-02-15 15:26 ` Jan Kiszka
2013-02-15 15:34 ` Jan Kiszka
2013-02-25 10:18 ` Anders Blomdell
2013-02-25 11:11 ` Jan Kiszka
2013-02-25 11:53 ` Anders Blomdell
2013-02-25 12:27 ` Gilles Chanteperdrix
2013-02-25 14:39 ` Anders Blomdell
2013-02-25 14:53 ` Philippe Gerum
2013-02-25 14:54 ` Jan Kiszka
2013-02-25 15:17 ` Anders Blomdell [this message]
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=512B806F.6010004@control.lth.se \
--to=anders.blomdell@control.lth.se \
--cc=jan.kiszka@siemens.com \
--cc=xenomai@xenomai.org \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.