From: Carsten Emde <C.Emde@osadl.org>
To: Sebastian Andrzej Siewior <bigeasy@linutronix.de>
Cc: Linux RT Users <linux-rt-users@vger.kernel.org>
Subject: Re: [OSADL QA 3.18.9-rt5 #1]
Date: Sat, 11 Apr 2015 03:35:24 +0200 [thread overview]
Message-ID: <55287A5C.8020402@osadl.org> (raw)
In-Reply-To: <20150410123634.GA3057@linutronix.de>
Hi Sebastian,
>>> an Intel Bay Trail board (Intel(R) Celeron(R) CPU J1900 @ 1.99GHz) at
>>> the OSADL QA Farm rack #b/slot #6 (https://www.osadl.org/?id=1894) stops
>>> working every 12 to 36 hours. The only way to get the board back to work
>> [..]
> Could you try this:
> --
> Subject: [PATCH] kernel/irq_work: fix no_hz deadlock
>
> Invoking NO_HZ's irq_work callback from timer irq is not working very
> well if the callback decides to invoke hrtimer_cancel():
>
> |hrtimer_try_to_cancel+0x55/0x5f
> |hrtimer_cancel+0x16/0x28
> |tick_nohz_restart+0x17/0x72
> |__tick_nohz_full_check+0x8e/0x93
> |nohz_full_kick_work_func+0xe/0x10
> |irq_work_run_list+0x39/0x57
> |irq_work_tick+0x60/0x67
> |update_process_times+0x57/0x67
> |tick_sched_handle+0x4a/0x59
> |tick_sched_timer+0x3b/0x64
> |__run_hrtimer+0x7a/0x149
> |hrtimer_interrupt+0x1cc/0x2c5
>
> and here we deadlock while waiting for the lock which we are holding.
> To fix this I'm doing the same thing that upstream is doing: is the
> irq_work dedicated IRQ and use it only for what is marked as "hirq"
> which should only be the FULL_NO_HZ related work.
> Signed-off-by: Sebastian Andrzej Siewior <bigeasy@linutronix.de>
> [..]
Thanks a lot! Applied the patch and restarted the box. Given the fact
that it took up to 36 hours until the board stopped, we unfortunately
need to see at least one week of crash-free operation, before we may
consider the bug as fixed.
-Carsten.
next prev parent reply other threads:[~2015-04-11 1:40 UTC|newest]
Thread overview: 11+ messages / expand[flat|nested] mbox.gz Atom feed top
2015-04-07 22:52 [OSADL QA 3.18.9-rt5 #1] Carsten Emde
2015-04-09 12:37 ` Sebastian Andrzej Siewior
2015-04-09 16:53 ` Sebastian Andrzej Siewior
2015-04-10 12:36 ` Sebastian Andrzej Siewior
2015-04-11 1:35 ` Carsten Emde [this message]
2015-04-20 21:22 ` [RESOLVED OSADL " Carsten Emde
2015-05-12 0:15 ` [OSADL " Steven Rostedt
2015-05-13 8:12 ` Sebastian Andrzej Siewior
2015-05-13 15:23 ` Steven Rostedt
2015-05-13 16:34 ` Steven Rostedt
2015-06-11 15:27 ` Sebastian Andrzej Siewior
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=55287A5C.8020402@osadl.org \
--to=c.emde@osadl.org \
--cc=bigeasy@linutronix.de \
--cc=linux-rt-users@vger.kernel.org \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.