From: Frederic Weisbecker <frederic@kernel.org>
To: luanshi <zhangliguang@linux.alibaba.com>
Cc: Frederic Weisbecker <fweisbec@gmail.com>,
Thomas Gleixner <tglx@linutronix.de>,
Ingo Molnar <mingo@kernel.org>,
linux-kernel@vger.kernel.org
Subject: Re: tick/sched: iowait_sleeptime resides in tick_cpu_sched structure is a negative value
Date: Thu, 6 Jan 2022 14:40:07 +0100
Message-ID: <20220106134007.GA1313863@lothringen>
In-Reply-To: <074a28e8-3794-5fee-f2ab-f08787149481@linux.alibaba.com>
On Thu, Jan 06, 2022 at 05:14:02PM +0800, luanshi wrote:
> Hi All,
>
> We encountered a hard lockup problem on an ARM server. After some debugging
> we found that:
>
> PID: 0 TASK: ffff0400064de300 CPU: 126 COMMAND: "swapper/126"
> #0 [ffff8000250f3c90] __crash_kexec at ffff80001013a064
> #1 [ffff8000250f3e30] panic at ffff800010afd028
> #2 [ffff8000250f3f10] nmi_panic at ffff80001004a9e0
> #3 [ffff8000250f3f20] watchdog_hardlockup_check at ffff80001017b7f0
> #4 [ffff8000250f3f40] sdei_watchdog_callback at ffff80001003c9a4
> #5 [ffff8000250f3f50] sdei_event_handler at ffff8000108816b8
> #6 [ffff8000250f3f80] _sdei_handler at ffff800010b1c2e4
> #7 [ffff8000250f3fd0] __sdei_handler at ffff800010b139b0
> #8 [ffff8000250f3ff0] __sdei_asm_handler at ffff800010014c18
> --- <IRQ stack> ---
> #9 [ffff800013973f10] __cpu_do_idle at ffff800010b13764
> #10 [ffff800013973f20] arch_cpu_idle at ffff800010b137ec
> #11 [ffff800013973f30] default_idle_call at ffff800010b1b5cc
> #12 [ffff800013973f50] cpuidle_idle_call at ffff80001009648c
> #13 [ffff800013973f90] do_idle at ffff8000100965b0
> #14 [ffff800013973fc0] cpu_startup_entry at ffff8000100967d4
> #15 [ffff800013973fe0] secondary_start_kernel at ffff800010026bb0
>
>
> per_cpu(tick_cpu_sched, 126) = $1 = {
> sched_timer = {
> node = {
> node = {
> __rb_parent_color = 18446603337117384112,
> rb_right = 0x0,
> rb_left = 0x0
> },
> expires = 6108564000000
> },
> _softexpires = 6108564000000,
> function = 0xffff800010122ec0 <tick_sched_timer>,
> base = 0xffff04473bbcc780,
> state = 1 '\001',
> is_rel = 0 '\000',
> is_soft = 0 '\000',
> is_hard = 1 '\001'
> },
> check_clocks = 0,
> nohz_mode = NOHZ_MODE_INACTIVE,
> inidle = 1,
> tick_stopped = 0,
> idle_active = 1,
> do_timer_last = 0,
> got_idle_tick = 1,
> last_tick = 0,
> next_tick = 0,
> idle_jiffies = 0,
> idle_calls = 0,
> idle_sleeps = 0,
> idle_entrytime = 5012087709249,
> idle_waketime = 0,
> idle_exittime = 0,
> idle_sleeptime = 4936136669951,
> iowait_sleeptime = -1942739704,
> last_jiffies = 0,
> timer_expires = 0,
> timer_expires_base = 0,
> next_timer = 0,
> idle_expires = 0,
> tick_dep_mask = {
> counter = 0
> }
> }
>
> iowait_sleeptime = -1942739704,
> ----------
> iowait_sleeptime is supposed to be monotonically increasing. Under what
> circumstances can iowait_sleeptime become negative?
>
> For detailed information:
> https://bugzilla.kernel.org/show_bug.cgi?id=215458
>
>
> Can you give me some suggestions for debugging?
Racy updates can happen when cpufreq or "cat /proc/stat" update these stats
concurrently. But for that to happen nohz needs to be running, and I see that
this CPU has nohz_mode = NOHZ_MODE_INACTIVE. Perhaps that is only the case for
this CPU, though.
What is the value of tick_nohz_active in your dump?
>
>
>
> Thanks,
> Liguang
>