From: "Kevin D. Kissell" <kevink@paralogos.com>
To: Glyn Astill <glynastill@yahoo.co.uk>
Cc: linux-mips@linux-mips.org
Subject: Re: Qube2 slowly dies
Date: Fri, 12 Jun 2009 12:45:26 -0700 [thread overview]
Message-ID: <4A32B056.5060707@paralogos.com> (raw)
In-Reply-To: <1099.9578.qm@web23606.mail.ird.yahoo.com>
[-- Attachment #1: Type: text/plain, Size: 2970 bytes --]
Glyn Astill wrote:
>> From: Kevin D. Kissell <kevink@paralogos.com>
>>
>> Your description sounds an awful lot
>> like failures I've seen when
>> interrupts get lost or blocked for some reason (could be
>> hardware, the
>> kernel, or some interaction between them). Have you
>> looked at
>> to see if "Spurious" interrupts are
>> occurring, or if
>> the rate of serviced timer and I/O interrupts decreases or
>> increases as
>> the system degrades?
>>
>
> No I haven't checked - but I will. What would I be looking for that would stick out as "spurious"? The type of interrupt, qty or random interrupts appearing and dissapearing?
>
There's a separate counter, and /proc/interrupts report, for spurious
interrupts.
>
>
>> When the system becomes unresponsive, by any
>> chance does it "wake up" after 10-20 minutes (the time for
>> the Count
>> register to wrap)?
>>
>>
>
> Not that I've noticed, I just see it degrade further and further untill it dies over the course of an hour or so.
>
>
>> If other Qube2s don't exhibit this behavior with a given
>> Linux kernel,
>> but yours does, and yet yours runs NetBSD OK, it suggests
>> that there's a
>> difference in interrupt setup/handling between the two
>> systems that just
>> happens to work around a hardware problem on your board.
>>
>
> I'm sure that's a valid possibility, however I do have two of these machines and I have tried both with the same results.
>
Ah. I had misunderstood your messages to have stated that you had one
Qube2 that exhibited the behavior while others did not. In the actual
case, it definitely sounds like a kernel interrupt management problem,
either at the level of the interrupt controller support code or some bit
of low-level management of the Status.IM interrupt mask. If you can
force the kernel to dump the state of the Status and Cause registers, as
well as that of whatever outboard interrupt controller is on that thing,
that would be good. I used to have a hook in the NMI handler of my
Malta kernels for that, which was useful when I was debugging the SMTC
interrupt support, which was pretty subtle and nasty. And why this
failure mode sounds vaguely familiar. ;o) The interrupt
ack/mask/enable machinery has changed and standardized (for the better)
since the Qube2 was a current product, and the controller "chip"
struct/functions being used may not in fact be entirely correct for the
platform, e.g. you may have non-atomic changes to interrupt masks being
done that screw up in the presence of nested service.
> I also had a problem back when I tried etch with the 2.6.18 kernel, however in this case I saw no degraded performance at all, however after a some of hours of activity (anywhere between 2 and 24+) it'd just fall on it's ass.
>
That's not a very scientific description of a failure. I mean, did the
Qube2 literally jump off the table? ;o)
Regards,
Kevin K.
[-- Attachment #2: Type: text/html, Size: 3964 bytes --]
next prev parent reply other threads:[~2009-06-12 19:45 UTC|newest]
Thread overview: 7+ messages / expand[flat|nested] mbox.gz Atom feed top
2009-06-11 8:54 Qube2 slowly dies Glyn Astill
2009-06-12 19:45 ` Kevin D. Kissell [this message]
-- strict thread matches above, loose matches on Subject: below --
2009-06-15 21:34 Glyn Astill
2009-06-10 14:24 Glyn Astill
2009-06-10 14:04 Glyn Astill
2009-06-10 14:12 ` Florian Fainelli
2009-06-11 3:39 ` Kevin D. Kissell
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=4A32B056.5060707@paralogos.com \
--to=kevink@paralogos.com \
--cc=glynastill@yahoo.co.uk \
--cc=linux-mips@linux-mips.org \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).