From: kernel test robot <oliver.sang@intel.com>
To: Peter Zijlstra <peterz@infradead.org>
Cc: <oe-lkp@lists.linux.dev>, <lkp@intel.com>,
<linux-kernel@vger.kernel.org>, <sched-ext@lists.linux.dev>,
<aubrey.li@linux.intel.com>, <yu.c.chen@intel.com>,
<oliver.sang@intel.com>
Subject: [peterz-queue:sched/cleanup] [sched] 13ea439400: WARNING:at_kernel/sched/deadline.c:#task_contending
Date: Tue, 23 Sep 2025 16:44:58 +0800 [thread overview]
Message-ID: <202509231601.ffbd7efd-lkp@intel.com> (raw)
Hello,
kernel test robot noticed "WARNING:at_kernel/sched/deadline.c:#task_contending" on:
commit: 13ea43940095719bc22d2e129712519fdc307c45 ("sched: Fold sched_class::switch{ing,ed}_{to,from}() into the change pattern")
https://git.kernel.org/cgit/linux/kernel/git/peterz/queue.git sched/cleanup
in testcase: stress-ng
version: stress-ng-x86_64-480b81369-1_20250919
with following parameters:
nr_threads: 100%
testtime: 60s
test: cyclic
cpufreq_governor: performance
config: x86_64-rhel-9.4
compiler: gcc-14
test machine: 224 threads 2 sockets Intel(R) Xeon(R) Platinum 8480+ (Sapphire Rapids) with 256G memory
(please refer to attached dmesg/kmsg for entire log/backtrace)
If you fix the issue in a separate patch/commit (i.e. not just a new version of
the same patch/commit), kindly add following tags
| Reported-by: kernel test robot <oliver.sang@intel.com>
| Closes: https://lore.kernel.org/oe-lkp/202509231601.ffbd7efd-lkp@intel.com
we also observed various issues on this commit while parent is quite clean.
=========================================================================================
tbox_group/testcase/rootfs/kconfig/compiler/nr_threads/testtime/test/cpufreq_governor:
lkp-spr-2sp1/stress-ng/debian-13-x86_64-20250902.cgz/x86_64-rhel-9.4/gcc-14/100%/60s/cyclic/performance
f0b336327a1be3b1 13ea43940095719bc22d2e12971
---------------- ---------------------------
fail:runs %reproduction fail:runs
| | |
:10 20% 2:10 dmesg.BUG:soft_lockup-CPU##stuck_for#s![diskstats:#]
:10 10% 1:10 dmesg.BUG:soft_lockup-CPU##stuck_for#s![numa-vmstat:#]
:10 10% 1:10 dmesg.BUG:soft_lockup-CPU##stuck_for#s![slabinfo:#]
:10 40% 4:10 dmesg.BUG:soft_lockup-CPU##stuck_for#s![stress-ng-cycli:#]
:10 20% 2:10 dmesg.BUG:soft_lockup-CPU##stuck_for#s![systemd-journal:#]
:10 10% 1:10 dmesg.BUG:workqueue_lockup-pool
:10 10% 1:10 dmesg.INFO:rcu_sched_detected_stalls_on_CPUs/tasks
:10 10% 1:10 dmesg.INFO:task_blocked_for_more_than#seconds
:10 90% 9:10 dmesg.Kernel_panic-not_syncing:softlockup:hung_tasks
:10 10% 1:10 dmesg.RIP:__get_user_8
:10 50% 5:10 dmesg.RIP:__sched_setscheduler
:10 10% 1:10 dmesg.RIP:__schedule
:10 10% 1:10 dmesg.RIP:__x64_sys_clock_nanosleep
:10 10% 1:10 dmesg.RIP:calculate_sigpending
:10 10% 1:10 dmesg.RIP:count_memcg_events
:10 60% 6:10 dmesg.RIP:cpuidle_enter_state
:10 90% 9:10 dmesg.RIP:dequeue_task_dl
:10 10% 1:10 dmesg.RIP:do_nanosleep
:10 10% 1:10 dmesg.RIP:do_user_addr_fault
:10 50% 5:10 dmesg.RIP:enqueue_dl_entity
:10 80% 8:10 dmesg.RIP:finish_task_switch
:10 10% 1:10 dmesg.RIP:folio_add_file_rmap_ptes
:10 70% 7:10 dmesg.RIP:inactive_task_timer
:10 90% 9:10 dmesg.RIP:migrate_task_rq_dl
:10 10% 1:10 dmesg.RIP:native_queued_spin_lock_slowpath
:10 20% 2:10 dmesg.RIP:next_uptodate_folio
:10 90% 9:10 dmesg.RIP:smp_call_function_many_cond
:10 90% 9:10 dmesg.RIP:task_contending
:10 80% 8:10 dmesg.RIP:task_non_contending
:10 10% 1:10 dmesg.RIP:x64_sys_call
:10 90% 9:10 dmesg.WARNING:at_kernel/sched/deadline.c:#dequeue_task_dl
:10 50% 5:10 dmesg.WARNING:at_kernel/sched/deadline.c:#enqueue_dl_entity
:10 70% 7:10 dmesg.WARNING:at_kernel/sched/deadline.c:#inactive_task_timer
:10 90% 9:10 dmesg.WARNING:at_kernel/sched/deadline.c:#migrate_task_rq_dl
:10 90% 9:10 dmesg.WARNING:at_kernel/sched/deadline.c:#task_contending
:10 80% 8:10 dmesg.WARNING:at_kernel/sched/deadline.c:#task_non_contending
[ 41.677103][ C194] ------------[ cut here ]------------
[ 41.295461][ C1] Scheduler tracepoints stat_sleep, stat_iowait, stat_blocked and stat_runtime require the kernel parameter schedstats=enable or kernel.sched_schedstats=1
[ 41.677280][ C147] ------------[ cut here ]------------
[ 41.673528][ T0] Scheduler tracepoints stat_sleep, stat_iowait, stat_blocked and stat_runtime require the kernel parameter schedstats=enable or kernel.sched_schedstats=1
[ 41.677090][T10156] Scheduler tracepoints stat_sleep, stat_iowait, stat_blocked and stat_runtime require the kernel parameter schedstats=enable or kernel.sched_schedstats=1
[ 41.677282][ C147] WARNING: CPU: 147 PID: 0 at kernel/sched/deadline.c:227 task_contending (kernel/sched/deadline.c:227 kernel/sched/deadline.c:287 kernel/sched/deadline.c:504)
[ 41.677096][T10156] sched: DL replenish lagged too much
[ 41.677288][ C147] Modules linked in: intel_rapl_msr intel_rapl_common intel_uncore_frequency intel_uncore_frequency_common intel_ifs i10nm_edac skx_edac_common nfit libnvdimm x86_pkg_temp_thermal intel_powerclamp coretemp kvm_intel btrfs blake2b_generic kvm xor dax_hmem snd_pcm raid6_pq irqbypass cxl_acpi ast snd_timer ghash_clmulni_intel cxl_port binfmt_misc pmt_telemetry rapl pmt_discovery drm_client_lib intel_th_gth intel_cstate snd isst_if_mmio iaa_crypto ipmi_ssif qat_4xxx nvme isst_if_mbox_pci intel_sdsi pmt_class cxl_core drm_shmem_helper mei_me intel_th_pci i2c_i801 soundcore idxd intel_qat intel_uncore einj pcspkr isst_if_common nvme_core cdc_ether i2c_ismt intel_th mei i2c_smbus intel_vsec idxd_bus acpi_power_meter wmi drm_kms_helper crc8 ipmi_si acpi_ipmi ipmi_devintf ipmi_msghandler acpi_pad pinctrl_emmitsburg pfr_update pfr_telemetry drm fuse nfnetlink
[ 41.677103][ C194] ------------[ cut here ]------------
[ 41.677338][ C147] CPU: 147 UID: 0 PID: 0 Comm: swapper/147 Tainted: G S 6.17.0-rc4-00012-g13ea43940095 #1 VOLUNTARY
for Remote File[ 29.908797][ T2053] EDAC MC5: Giving out device to module i10nm_edac controller Intel_10nm Socket#1 IMC#1: DEV 0000:fe:0d.0 (INTERRUPT)
[ 41.677280][ C147] ------------[ cut here ]------------
[ 41.677341][ C147] Tainted: [S]=CPU_OUT_OF_SPEC
[ 41.677282][ C147] WARNING: CPU: 147 PID: 0 at kernel/sched/deadline.c:227 task_contending (kernel/sched/deadline.c:227 kernel/sched/deadline.c:287 kernel/sched/deadline.c:504)
[ 41.677342][ C147] Hardware name: Intel Corporation D50DNP1SBB/D50DNP1SBB, BIOS SE5C7411.86B.8118.D04.2206151341 06/15/2022
[ 41.677344][ C147] RIP: 0010:task_contending (kernel/sched/deadline.c:227 kernel/sched/deadline.c:287 kernel/sched/deadline.c:504)
[ 41.677288][ C147] Modules linked in: intel_rapl_msr intel_rapl_common intel_uncore_frequency intel_uncore_frequency_common intel_ifs i10nm_edac skx_edac_common nfit libnvdimm x86_pkg_temp_thermal intel_powerclamp coretemp kvm_intel btrfs blake2b_generic kvm xor dax_hmem snd_pcm raid6_pq irqbypass cxl_acpi ast snd_timer ghash_clmulni_intel cxl_port binfmt_misc pmt_telemetry rapl pmt_discovery drm_client_lib intel_th_gth intel_cstate snd isst_if_mmio iaa_crypto ipmi_ssif qat_4xxx nvme isst_if_mbox_pci intel_sdsi pmt_class cxl_core drm_shmem_helper mei_me intel_th_pci i2c_i801 soundcore idxd intel_qat intel_uncore einj pcspkr isst_if_common nvme_core cdc_ether i2c_ismt intel_th mei i2c_smbus intel_vsec idxd_bus acpi_power_meter wmi drm_kms_helper crc8 ipmi_si acpi_ipmi ipmi_devintf ipmi_msghandler acpi_pad pinctrl_emmitsburg pfr_update pfr_telemetry drm fuse nfnetlink
[ 29.924285][ T2053] EDAC MC6: Giving out device to module i10nm_edac controller Intel_10nm Socket#1 IMC#2: DEV 0000:fe:0e.0 (INTERRUPT)
[ 41.677347][ C147] Code: 78 fe ff ff 83 f8 01 74 28 85 c0 7f bd be 03 00 00 00 5b e9 a7 de 64 00 0f 0b eb 88 0f 0b e9 23 ff ff ff 0f 0b e9 43 ff ff ff <0f> 0b e9 49 ff ff ff 48 c7 c6 50 e4 34 81 48 8d bb a8 12 00 00 5b
All code
========
0: 78 fe js 0x0
2: ff (bad)
3: ff 83 f8 01 74 28 incl 0x287401f8(%rbx)
9: 85 c0 test %eax,%eax
b: 7f bd jg 0xffffffffffffffca
d: be 03 00 00 00 mov $0x3,%esi
12: 5b pop %rbx
13: e9 a7 de 64 00 jmp 0x64debf
18: 0f 0b ud2
1a: eb 88 jmp 0xffffffffffffffa4
1c: 0f 0b ud2
1e: e9 23 ff ff ff jmp 0xffffffffffffff46
23: 0f 0b ud2
25: e9 43 ff ff ff jmp 0xffffffffffffff6d
2a:* 0f 0b ud2 <-- trapping instruction
2c: e9 49 ff ff ff jmp 0xffffffffffffff7a
31: 48 c7 c6 50 e4 34 81 mov $0xffffffff8134e450,%rsi
38: 48 8d bb a8 12 00 00 lea 0x12a8(%rbx),%rdi
3f: 5b pop %rbx
Code starting with the faulting instruction
===========================================
0: 0f 0b ud2
2: e9 49 ff ff ff jmp 0xffffffffffffff50
7: 48 c7 c6 50 e4 34 81 mov $0xffffffff8134e450,%rsi
e: 48 8d bb a8 12 00 00 lea 0x12a8(%rbx),%rdi
15: 5b pop %rbx
[ 41.677338][ C147] CPU: 147 UID: 0 PID: 0 Comm: swapper/147 Tainted: G S 6.17.0-rc4-00012-g13ea43940095 #1 VOLUNTARY
[ 41.677350][ C147] RSP: 0018:ffa000000eec4e00 EFLAGS: 00010087
[ 41.677341][ C147] Tainted: [S]=CPU_OUT_OF_SPEC
[ 41.677353][ C147] RAX: ff11001fe02f1140 RBX: ff110040477d42b0 RCX: 0000000000000093
0m] Reached targ[ 29.954811][ T2053] EDAC i10nm: v0.0.6
[ 41.677354][ C147] RDX: 00000000000e6666 RSI: 0000000000000000 RDI: ff110040477d42b0
[ 41.677342][ C147] Hardware name: Intel Corporation D50DNP1SBB/D50DNP1SBB, BIOS SE5C7411.86B.8118.D04.2206151341 06/15/2022
[ 41.677344][ C147] RIP: 0010:task_contending (kernel/sched/deadline.c:227 kernel/sched/deadline.c:287 kernel/sched/deadline.c:504)
[ 41.677355][ C147] RBP: 0000000000100009 R08: 0000000000017701 R09: 0000000000000093
Startin[ 29.979254][ T20] intel_rapl_common: Found RAPL domain package
[ 41.677347][ C147] Code: 78 fe ff ff 83 f8 01 74 28 85 c0 7f bd be 03 00 00 00 5b e9 a7 de 64 00 0f 0b eb 88 0f 0b e9 23 ff ff ff 0f 0b e9 43 ff ff ff <0f> 0b e9 49 ff ff ff 48 c7 c6 50 e4 34 81 48 8d bb a8 12 00 00 5b
All code
========
0: 78 fe js 0x0
2: ff (bad)
3: ff 83 f8 01 74 28 incl 0x287401f8(%rbx)
9: 85 c0 test %eax,%eax
b: 7f bd jg 0xffffffffffffffca
d: be 03 00 00 00 mov $0x3,%esi
12: 5b pop %rbx
13: e9 a7 de 64 00 jmp 0x64debf
18: 0f 0b ud2
1a: eb 88 jmp 0xffffffffffffffa4
1c: 0f 0b ud2
1e: e9 23 ff ff ff jmp 0xffffffffffffff46
23: 0f 0b ud2
25: e9 43 ff ff ff jmp 0xffffffffffffff6d
2a:* 0f 0b ud2 <-- trapping instruction
2c: e9 49 ff ff ff jmp 0xffffffffffffff7a
31: 48 c7 c6 50 e4 34 81 mov $0xffffffff8134e450,%rsi
38: 48 8d bb a8 12 00 00 lea 0x12a8(%rbx),%rdi
3f: 5b pop %rbx
Code starting with the faulting instruction
===========================================
0: 0f 0b ud2
2: e9 49 ff ff ff jmp 0xffffffffffffff50
7: 48 c7 c6 50 e4 34 81 mov $0xffffffff8134e450,%rsi
e: 48 8d bb a8 12 00 00 lea 0x12a8(%rbx),%rdi
15: 5b pop %rbx
[ 41.677357][ C147] R10: 0000000000000000 R11: ffa000000eec4ff8 R12: 0000000000000001
[ 41.677358][ C147] R13: ff110040477d42b0 R14: 0000000000000088 R15: ffa000000eec4ed0
[ 41.677350][ C147] RSP: 0018:ffa000000eec4e00 EFLAGS: 00010087
[ 41.677359][ C147] FS: 0000000000000000(0000) GS:ff1100205c4a2000(0000) knlGS:0000000000000000
[ 41.677353][ C147] RAX: ff11001fe02f1140 RBX: ff110040477d42b0 RCX: 0000000000000093
[ 41.677361][ C147] CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033
[ 41.677354][ C147] RDX: 00000000000e6666 RSI: 0000000000000000 RDI: ff110040477d42b0
[ 30.004528][ T354] intel_rapl_common: Found RAPL domain package
[ 41.677355][ C147] RBP: 0000000000100009 R08: 0000000000017701 R09: 0000000000000093
[ 41.677362][ C147] CR2: 00007f7ff20b8620 CR3: 00000020a4f97003 CR4: 0000000000f73ef0
[ 30.020455][ T354] intel_rapl_common: Found RAPL domain dram
[ 41.677357][ C147] R10: 0000000000000000 R11: ffa000000eec4ff8 R12: 0000000000000001
[ 41.677364][ C147] PKRU: 55555554
[ 41.677358][ C147] R13: ff110040477d42b0 R14: 0000000000000088 R15: ffa000000eec4ed0
[ 41.677365][ C147] Call Trace:
[ 41.677359][ C147] FS: 0000000000000000(0000) GS:ff1100205c4a2000(0000) knlGS:0000000000000000
[ 41.677367][ C147] <IRQ>
[ 41.677370][ C147] enqueue_dl_entity (kernel/sched/deadline.c:62 kernel/sched/deadline.c:80 kernel/sched/deadline.c:1036 kernel/sched/deadline.c:2022)
[ 41.677361][ C147] CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033
LKP: ttyS0: 2334: current_version: 90000c0, target_version: 90000c0
[ 41.677362][ C147] CR2: 00007f7ff20b8620 CR3: 00000020a4f97003 CR4: 0000000000f73ef0
[ 41.677373][ C147] enqueue_task_dl (kernel/sched/deadline.c:62 kernel/sched/deadline.c:2127)
[ 41.677364][ C147] PKRU: 55555554
[ 41.677376][ C147] enqueue_task (kernel/sched/core.c:2090)
[ 41.677365][ C147] Call Trace:
[ 41.677378][ C147] ttwu_do_activate (kernel/sched/core.c:3717)
LKP: ttyS0: 2334: skip deploy intel ucode as ucode is same
LKP: ttyS0: 2334: Kernel tests: Boot OK!
[ 41.677367][ C147] <IRQ>
[ 41.677381][ C147] try_to_wake_up (kernel/sched/sched.h:1864 kernel/sched/core.c:3980 kernel/sched/core.c:4303)
[ 41.677370][ C147] enqueue_dl_entity (kernel/sched/deadline.c:62 kernel/sched/deadline.c:80 kernel/sched/deadline.c:1036 kernel/sched/deadline.c:2022)
[ 41.677384][ C147] ? __pfx_hrtimer_wakeup (kernel/time/hrtimer.c:1990)
LKP: ttyS0: 2334: HOSTNAME lkp-spr-2sp1, MAC a4:bf:01:91:28:12, kernel 6.17.0-rc4-00012-g13ea43940095 1
[ 41.677373][ C147] enqueue_task_dl (kernel/sched/deadline.c:62 kernel/sched/deadline.c:2127)
[ 41.677388][ C147] hrtimer_wakeup (kernel/time/hrtimer.c:2000)
[ 41.677390][ C147] __hrtimer_run_queues (kernel/time/hrtimer.c:1761 kernel/time/hrtimer.c:1825)
[ 41.677376][ C147] enqueue_task (kernel/sched/core.c:2090)
[ 41.677393][ C147] hrtimer_interrupt (kernel/time/hrtimer.c:1890)
[ 41.677378][ C147] ttwu_do_activate (kernel/sched/core.c:3717)
[ 41.677381][ C147] try_to_wake_up (kernel/sched/sched.h:1864 kernel/sched/core.c:3980 kernel/sched/core.c:4303)
[ 41.677395][ C147] __sysvec_apic_timer_interrupt (arch/x86/kernel/apic/apic.c:1039 arch/x86/kernel/apic/apic.c:1056)
[ 41.677384][ C147] ? __pfx_hrtimer_wakeup (kernel/time/hrtimer.c:1990)
[ 41.677400][ C147] sysvec_apic_timer_interrupt (arch/x86/kernel/apic/apic.c:1050 (discriminator 35) arch/x86/kernel/apic/apic.c:1050 (discriminator 35))
[ 41.677402][ C147] </IRQ>
[ 41.677388][ C147] hrtimer_wakeup (kernel/time/hrtimer.c:2000)
[ 36.334175][ T2362] is_virt=false
[ 41.677403][ C147] <TASK>
[ 41.677390][ C147] __hrtimer_run_queues (kernel/time/hrtimer.c:1761 kernel/time/hrtimer.c:1825)
[ 41.677405][ C147] asm_sysvec_apic_timer_interrupt (arch/x86/include/asm/idtentry.h:702)
[ 36.334178][ T2362]
[ 41.677393][ C147] hrtimer_interrupt (kernel/time/hrtimer.c:1890)
[ 41.677408][ C147] RIP: 0010:cpuidle_enter_state (drivers/cpuidle/cpuidle.c:292)
[ 41.677395][ C147] __sysvec_apic_timer_interrupt (arch/x86/kernel/apic/apic.c:1039 arch/x86/kernel/apic/apic.c:1056)
The kernel config and materials to reproduce are available at:
https://download.01.org/0day-ci/archive/20250923/202509231601.ffbd7efd-lkp@intel.com
--
0-DAY CI Kernel Test Service
https://github.com/intel/lkp-tests/wiki
reply other threads:[~2025-09-23 8:45 UTC|newest]
Thread overview: [no followups] expand[flat|nested] mbox.gz Atom feed
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=202509231601.ffbd7efd-lkp@intel.com \
--to=oliver.sang@intel.com \
--cc=aubrey.li@linux.intel.com \
--cc=linux-kernel@vger.kernel.org \
--cc=lkp@intel.com \
--cc=oe-lkp@lists.linux.dev \
--cc=peterz@infradead.org \
--cc=sched-ext@lists.linux.dev \
--cc=yu.c.chen@intel.com \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.