From: Ingo Molnar <mingo@elte.hu>
To: Fernando Lopez-Lezcano <nando@ccrma.Stanford.EDU>
Cc: William Weston <weston@lysdexia.org>,
cc@ccrma.Stanford.EDU, linux-kernel@vger.kernel.org,
Thomas Gleixner <tglx@linutronix.de>,
david singleton <dsingleton@mvista.com>,
Steven Rostedt <rostedt@goodmis.org>,
Rui Nuno Capela <rncbc@rncbc.org>,
Mark Knecht <markknecht@gmail.com>
Subject: Re: 2.6.14-rc4-rt7
Date: Sat, 22 Oct 2005 05:58:51 +0200
Message-ID: <20051022035851.GC12751@elte.hu>
In-Reply-To: <1129937138.5001.4.camel@cmn3.stanford.edu>
* Fernando Lopez-Lezcano <nando@ccrma.Stanford.EDU> wrote:
> Here's one with rc5-rt3:
>
> Oct 21 15:01:46 cmn3 kernel: BUG: ktimer expired short without user
> signal! (hald-addon-stor:4309)
and no "BUG: foo:1234 waking up bar:4321, expiring ktimer short" message
prior to that? Very weird, this line:
> Oct 21 15:01:46 cmn3 kernel: .. expires: 1012/751245500
> Oct 21 15:01:46 cmn3 kernel: .. expired: 1012/750908115
> Oct 21 15:01:46 cmn3 kernel: .. at line: 942
suggests that the ktimer was expired by ktimer_try_to_cancel() /
ktimer_cancel(), in ktimer_schedule(). I.e. something must have woken
the task early. Probably this theory of mine is incorrect then. I'll try to
extend the debug info a bit: it would be interesting to see a 'timer
inserted at' timestamp as well (was it shortly before the problem
happened?), and a 'which PID cancelled the timer' info.
a heavy-hitting but complex-to-set-up solution would be to add a serial
console, and to enable WAKEUP_TIMING+LATENCY_TRACING in the .config, and
to edit kernel/latency.c to initialize the default value of the
following variables:
	int wakeup_timing = 0;
	int trace_all_cpus = 1;
	int trace_freerunning = 1;
	int trace_print_at_crash = 1;
	int trace_user_triggered = 1;
these variables are in the top portion of latency.c. Important: if you
try this then you should probably also enable IGNORE_PRINTK_LOGLEVEL,
which will improve mass-output to the serial console. Another important
thing is to add a stop_trace() call to kernel/ktimers.c's
check_ktimer_signal() function:
	unlock_ktimer_base(timer, &flags);
	stop_trace();
	printk("BUG: ktimer expired short without user signal! (%s:%d)\n",
		current->comm, current->pid);
(otherwise all the trace output you'd be getting would be boring printk
related trace entries.)
this will cause the dump_stack() to also output thousands of trace
entries - all the kernel activity (from all CPUs) that preceded the
ktimer problem. Hopefully this pinpoints the bug.
> In both cases the machine goes catatonic, I don't know if right after
> this or not. It responds to the SysRQ key but that's pretty much it, I
> should probably try to get a serial console going somehow.
would it be easy for you to try the UP kernel? One possibility is that
this is some sort of SMP/APIC-timer related problem.
Ingo