linux-omap.vger.kernel.org archive mirror
 help / color / mirror / Atom feed
* CPU_IDLE causes random reboots on custom 4430
@ 2012-09-21 22:49 Chris Hoffmann
  2012-09-22  5:45 ` Shilimkar, Santosh
  0 siblings, 1 reply; 5+ messages in thread
From: Chris Hoffmann @ 2012-09-21 22:49 UTC (permalink / raw)
  To: linux-omap

Hi,

We're trying to get a custom 4430 board (aka. nook tablet with OMAP4430 
ES2.3 HS TWL6030 ES2.1) working with p-android-omap-3.0 on android jelly 
bean. The board works quite well, but we experience random hangs and the 
watchdog kicks the board to reboot.

The problem is that kmsg_last doesn't not show any useful information 
and the fiq debugger refuses to work when the device is locked up. We 
also enabled various kernel lock debugging configs to get more info ... 
in vain.

Experimenting with it for a while (removing non-standard drivers like 
accel, charger, battery, etc) we found that removing CONFIG_CPU_IDLE or 
setting disallow_smp_idle in cpuidle44xx.c seems to cure the random 
lockups and make the board work fine (apart from the maybe obvious 
consequences that it doesn't go to deep sleep anymore).

Further below a few sample infos from the board.

We'd very much appreciate any further help on how to proceed to find the 
root cause and happy to provide any more info from sys, debug or 
omapconf tool.

Rgds,
Chris

A dmesg from a normal boot:
https://dl.dropbox.com/u/21820416/CPU_IDLE_CRASH/dmesg.log

Kernel source:
https://github.com/Kuzma30/kernel3NookTablet/tree/jellybean

omapconf --cpuinfo
https://dl.dropbox.com/u/21820416/CPU_IDLE_CRASH/omapconf.cpuinfo

cat /sys/kernel/debug/clock/summary
https://dl.dropbox.com/u/21820416/CPU_IDLE_CRASH/clock.summary

... what else can we provide to track this down?

^ permalink raw reply	[flat|nested] 5+ messages in thread

* Re: CPU_IDLE causes random reboots on custom 4430
  2012-09-21 22:49 CPU_IDLE causes random reboots on custom 4430 Chris Hoffmann
@ 2012-09-22  5:45 ` Shilimkar, Santosh
  2012-09-22 17:11   ` Chris Hoffmann
  0 siblings, 1 reply; 5+ messages in thread
From: Shilimkar, Santosh @ 2012-09-22  5:45 UTC (permalink / raw)
  To: Chris Hoffmann; +Cc: linux-omap

On Sat, Sep 22, 2012 at 4:19 AM, Chris Hoffmann <chrmhoffmann@gmail.com> wrote:
> Hi,
>
> We're trying to get a custom 4430 board (aka. nook tablet with OMAP4430
> ES2.3 HS TWL6030 ES2.1) working with p-android-omap-3.0 on android jelly
> bean. The board works quite well, but we experience random hangs and the
> watchdog kicks the board to reboot.
>
On the same kernel, you should have support for the persistent log. You might
want to check the output. That should give you pointers on what CPU was
doing before the freeze which resulted in reboot.

Regards
Santosh

^ permalink raw reply	[flat|nested] 5+ messages in thread

* Re: CPU_IDLE causes random reboots on custom 4430
  2012-09-22  5:45 ` Shilimkar, Santosh
@ 2012-09-22 17:11   ` Chris Hoffmann
  2012-09-23 16:11     ` Shilimkar, Santosh
  0 siblings, 1 reply; 5+ messages in thread
From: Chris Hoffmann @ 2012-09-22 17:11 UTC (permalink / raw)
  To: Shilimkar, Santosh; +Cc: Chris Hoffmann, linux-omap

On 09/22/2012 07:45 AM, Shilimkar, Santosh wrote:
> On Sat, Sep 22, 2012 at 4:19 AM, Chris Hoffmann <chrmhoffmann@gmail.com> wrote:
>> Hi,
>>
>> We're trying to get a custom 4430 board (aka. nook tablet with OMAP4430
>> ES2.3 HS TWL6030 ES2.1) working with p-android-omap-3.0 on android jelly
>> bean. The board works quite well, but we experience random hangs and the
>> watchdog kicks the board to reboot.
>>
> On the same kernel, you should have support for the persistent log. You might
> want to check the output. That should give you pointers on what CPU was
> doing before the freeze which resulted in reboot.

Hi,

I have some problems to provide logs. If I add -DDEBUG to cpuidle44xx.o 
the problem doesn't seem to occur. It could be that printk-ing 
alleviates the issue.

Also the watchdog seems to shutdown the device rather than rebooting it 
(or it hangs?) and then I can't provide /proc/last_kmsg.

How could I provide more info?

Rgds,
Chris




^ permalink raw reply	[flat|nested] 5+ messages in thread

* Re: CPU_IDLE causes random reboots on custom 4430
  2012-09-22 17:11   ` Chris Hoffmann
@ 2012-09-23 16:11     ` Shilimkar, Santosh
  2012-09-25 21:10       ` Christian Hoffmann
  0 siblings, 1 reply; 5+ messages in thread
From: Shilimkar, Santosh @ 2012-09-23 16:11 UTC (permalink / raw)
  To: Chris Hoffmann; +Cc: linux-omap

On Sat, Sep 22, 2012 at 10:41 PM, Chris Hoffmann <chrmhoffmann@gmail.com> wrote:
> On 09/22/2012 07:45 AM, Shilimkar, Santosh wrote:
>>
>> On Sat, Sep 22, 2012 at 4:19 AM, Chris Hoffmann <chrmhoffmann@gmail.com>
>> wrote:
>>>
>>> Hi,
>>>
>>> We're trying to get a custom 4430 board (aka. nook tablet with OMAP4430
>>> ES2.3 HS TWL6030 ES2.1) working with p-android-omap-3.0 on android jelly
>>> bean. The board works quite well, but we experience random hangs and the
>>> watchdog kicks the board to reboot.
>>>
>> On the same kernel, you should have support for the persistent log. You
>> might
>> want to check the output. That should give you pointers on what CPU was
>> doing before the freeze which resulted in reboot.
>
>
> Hi,
>
> I have some problems to provide logs. If I add -DDEBUG to cpuidle44xx.o the
> problem doesn't seem to occur. It could be that printk-ing alleviates the
> issue.
>
> Also the watchdog seems to shutdown the device rather than rebooting it (or
> it hangs?) and then I can't provide /proc/last_kmsg.
>
> How could I provide more info?
>
Check if you have "/sys/kernel/debug/persistent_trace" available on
your kernel. This generally helps whenever there are hangs, the last
call stack is stored on memory and on the reboot it can be cat'ed to
see if some useful information about hang is available.

Regards
Santosh

^ permalink raw reply	[flat|nested] 5+ messages in thread

* Re: CPU_IDLE causes random reboots on custom 4430
  2012-09-23 16:11     ` Shilimkar, Santosh
@ 2012-09-25 21:10       ` Christian Hoffmann
  0 siblings, 0 replies; 5+ messages in thread
From: Christian Hoffmann @ 2012-09-25 21:10 UTC (permalink / raw)
  To: Shilimkar, Santosh; +Cc: Chris Hoffmann, linux-omap

On 09/23/2012 06:11 PM, Shilimkar, Santosh wrote:
> On Sat, Sep 22, 2012 at 10:41 PM, Chris Hoffmann <chrmhoffmann@gmail.com> wrote:
>> On 09/22/2012 07:45 AM, Shilimkar, Santosh wrote:
>>>
>>> On Sat, Sep 22, 2012 at 4:19 AM, Chris Hoffmann <chrmhoffmann@gmail.com>
>>> wrote:
>>>>
>>>> Hi,
>>>>
>>>> We're trying to get a custom 4430 board (aka. nook tablet with OMAP4430
>>>> ES2.3 HS TWL6030 ES2.1) working with p-android-omap-3.0 on android jelly
>>>> bean. The board works quite well, but we experience random hangs and the
>>>> watchdog kicks the board to reboot.
>>>>
>>> On the same kernel, you should have support for the persistent log. You
>>> might
>>> want to check the output. That should give you pointers on what CPU was
>>> doing before the freeze which resulted in reboot.
>>
>>
>> Hi,
>>
>> I have some problems to provide logs. If I add -DDEBUG to cpuidle44xx.o the
>> problem doesn't seem to occur. It could be that printk-ing alleviates the
>> issue.
>>
>> Also the watchdog seems to shutdown the device rather than rebooting it (or
>> it hangs?) and then I can't provide /proc/last_kmsg.
>>
>> How could I provide more info?
>>
> Check if you have "/sys/kernel/debug/persistent_trace" available on
> your kernel. This generally helps whenever there are hangs, the last
> call stack is stored on memory and on the reboot it can be cat'ed to
> see if some useful information about hang is available.

Hi Santosh, all,

the p-android-omap-3.0 doesn't have the persistent_trace but I was able 
to backport it from 3.4 without major issues (only tricky part is that 
in p-android-3.4 there's no apparent user of that device in omapzoom 
kernel).

The problem is now that the omap-watchdog doesn't kick the device to 
reboot but rather to shutdown, so I still have no trace. Soft-rebooting 
shows that I can get persistent_trace.

Switching the persistent tracer to ecc=true does not help as it seems to 
overload the device completely when activating the tracing.

Rgds,
Chris




^ permalink raw reply	[flat|nested] 5+ messages in thread

end of thread, other threads:[~2012-09-25 21:10 UTC | newest]

Thread overview: 5+ messages (download: mbox.gz follow: Atom feed
-- links below jump to the message on this page --
2012-09-21 22:49 CPU_IDLE causes random reboots on custom 4430 Chris Hoffmann
2012-09-22  5:45 ` Shilimkar, Santosh
2012-09-22 17:11   ` Chris Hoffmann
2012-09-23 16:11     ` Shilimkar, Santosh
2012-09-25 21:10       ` Christian Hoffmann

This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).