All of lore.kernel.org
 help / color / mirror / Atom feed
From: Stelian Pop <stelian.pop@domain.hid>
To: Heikki Lindholm <holindho@domain.hid>
Cc: Jan Kiszka <jan.kiszka@domain.hid>, xenomai@xenomai.org
Subject: Re: [Xenomai-core] latency kernel part crashes on ppc64
Date: Sun, 08 Jan 2006 22:02:29 +0100	[thread overview]
Message-ID: <1136754149.17443.21.camel@domain.hid> (raw)
In-Reply-To: <43C14453.3040907@domain.hid>

Le dimanche 08 janvier 2006 à 18:56 +0200, Heikki Lindholm a écrit :

> >>Some recent changes (*cough* RTDM benchmark driver *cough*) broke kernel
> >>mode benchmarking for ppc64. Previously klatency worked fine, but now
> >>latency -t 1 crashes somewhere in xnpod_schedule. Jan, any pending
> >>patches a comin'?

So it seems I'm not alone. 

I have done some additionnal debugging on this issue in the last days. I
still haven't find the bug but I narrowed it down a bit.
> 
> > 
> > Nope, it should work as it is. But as Stelian also reported problems on
> > his fresh ARM port with the in-kernel test, I cannot exclude that there
> > /might/ be a problem in the benchmark.
> > 
> > As I don't have any ppc64 hanging around somewhere, we will have to go
> > through this together. Things I would like to know:
> 
> Dammit, I hoped you'd whip up a fix just from me noting a problem. Well, 
> all right then, I'll play along...;)
> 
> >  o When and how does it crash? At start-up immediately? Or after a
> >    while?
> 
> I inserted some serial debug prints and it gets two passes to 
> eval_outer_loop done (enter/exit function). After that it freezes. 

It freezes exactly upon the invocation of rtdm_event_pulse() which
causes a scheduling. In xnpod_schedule, the scheduler queue has been
corrupted and this causes the illegal accesses.

> Without the debug printing it dies with kernel access of illegal memory 
> at xnpod_schedule, which btw. has been quite a common place to die.

Same for me.

> >  o Are there any details / backtraces available with the crash?
> 
> Becaktrace limits to xnpod_schedule if I remember right.

Same for me. But very often I don't even get a backtrace, it just hangs.

> >  o Does -t2 work?
> 
> Umm. Probably not. See below.

Heikki said in a later mail that it works for him, and so it does for me
too.

> >  o What happens if your disable "rtdm_event_pulse(&ctx->result_event);"
> >    in eval_outer_loop (thus no signalling of intermediate results during
> >    the test)? Does it still crash, maybe later during cleanup now?

> Doesn't freeze and can be exited with ctrl-c and even re-run.

Same for me.

Some additionnal information: I've disabled FPU handling in Xeno and it
doesn't change anything, it still crashes.

As I said before, the old klatency test does work reliably for me, with
the latest Xenomai.

I tried moving the 'display' thread into the kernel, and in this
configuration it does no longer crash.

I've started simplifying the code trying to get to the simplest code
which does have the problem. The results is at
http://www.popies.net/tmp/xenobug/bug.tgz if somebody wants to take a
look.

I'll be working on this again tomorrow...

Stelian.
-- 
Stelian Pop <stelian.pop@domain.hid>
Open Wide



  parent reply	other threads:[~2006-01-08 21:02 UTC|newest]

Thread overview: 20+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2006-01-08 14:34 [Xenomai-core] latency kernel part crashes on ppc64 Heikki Lindholm
2006-01-08 15:17 ` Jan Kiszka
2006-01-08 16:56   ` Heikki Lindholm
2006-01-08 18:03     ` Jan Kiszka
2006-01-08 18:31       ` Heikki Lindholm
2006-01-08 18:55         ` Jan Kiszka
2006-01-08 18:43       ` Heikki Lindholm
2006-01-08 19:23     ` Gilles Chanteperdrix
2006-01-08 21:02     ` Stelian Pop [this message]
2006-01-08 22:06       ` Jan Kiszka
2006-01-09  2:51         ` Philippe Gerum
2006-01-09  8:15           ` Jan Kiszka
2006-01-09  8:38             ` Philippe Gerum
2006-01-09 22:23               ` Gilles Chanteperdrix
2006-01-10  9:06                 ` Philippe Gerum
2006-01-11 22:11                   ` Gilles Chanteperdrix
2006-01-11 22:35                     ` Jan Kiszka
2006-01-11 23:07                       ` [Xenomai-core] latency kernel part fixed Philippe Gerum
2006-01-12  9:15                         ` Wolfgang Grandegger
2006-01-12 12:52                       ` [Xenomai-core] latency kernel part crashes on ppc64 Gilles Chanteperdrix

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=1136754149.17443.21.camel@domain.hid \
    --to=stelian.pop@domain.hid \
    --cc=holindho@domain.hid \
    --cc=jan.kiszka@domain.hid \
    --cc=xenomai@xenomai.org \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.