From: Dave Young <dyoung@redhat.com>
To: Roberto Ricci <io@r-ricci.it>
Cc: ebiederm@xmission.com, rafael@kernel.org, pavel@ucw.cz,
ytcoode@gmail.com, kexec@lists.infradead.org,
linux-pm@vger.kernel.org, akpm@linux-foundation.org,
regressions@lists.linux.dev, linux-kernel@vger.kernel.org
Subject: Re: [REGRESSION] Kernel booted via kexec fails to resume from hibernation
Date: Mon, 27 Jan 2025 10:39:41 +0800
Message-ID: <Z5bx7ZHNcyc5fM_L@darkstar.users.ipa.redhat.com>
In-Reply-To: <Z4WFjBVHpndct7br@desktop0a>
Hi
On 01/13/25 at 10:28pm, Roberto Ricci wrote:
> After rebooting the system via kexec, hibernating and rebooting the machine, this oops occurs:
>
[snip]
>
> I will send the kernel config and dmesg in replies to this email.
>
I tried your config (with some unneeded driver-related options removed), but the resulting kernel does not boot on my KVM guest.
First I saw a panic in the ftrace path. After I rebuilt the kernel without ftrace, it panicked again, this time in a KVM-related code path.
Neither panic is related to kexec at all, so I suspect your bug is not kexec-specific.
Ftrace panic:
----
[ 0.215515] software IO TLB: area num 4.
[ 0.298815] SLUB: HWalign=64, Order=0-3, MinObjects=0, CPUs=4, Nodes=1
Poking KASLR using RDRAND RDTSC...
[ 0.299813] ftrace: allocating 40381 entries in 158 pages
[ 0.300603] Oops: general protection fault, probably for non-canonical address 0xdffffc0000008a30: 0000 [#1] PREEMPT SMP KASAN NOPTI
[ 0.301877] KASAN: probably user-memory-access in range [0x0000000000045180-0x0000000000045187]
[ 0.302734] CPU: 0 UID: 0 PID: 0 Comm: swapper Not tainted 6.13.0_ricci+ #913
[ 0.303401] Hardware name: QEMU Standard PC (Q35 + ICH9, 2009), BIOS 1.16.3-2.fc40 04/01/2014
[ 0.304187] RIP: 0010:__text_poke+0x370/0x840
[ 0.304591] Code: c6 48 8b 04 24 48 8d 78 08 e8 cc fb ff ff 48 c7 c0 80 51 04 00 48 8b 0d 96 32 d9 02 48 ba 00 00 00 00 00 fc ff df 48 c1 e8 03 <0f> b6 04 10 84 c0 74 06 0f 8e cc 03 00 00 65 0f b6 1d aa 3c ed 6e
[ 0.306646] RSP: 0000:ffffffff94007d88 EFLAGS: 00010006
[ 0.307139] RAX: 0000000000008a30 RBX: ffffffff93f0e070 RCX: ffff8881000744c0
[ 0.307839] RDX: dffffc0000000000 RSI: 8000000072001063 RDI: ffff88810006d0d8
[ 0.308634] RBP: ffffffff91001b24 R08: 0000000000000001 R09: fffffbfff2800f91
[ 0.309359] R10: 0000000000000003 R11: 0000000000000100 R12: 0000000000000005
[ 0.309994] R13: 0000000000000b24 R14: 0000000000000b29 R15: 8000000000000063
[ 0.310631] FS: 0000000000000000(0000) GS:ffff88815b400000(0000) knlGS:0000000000000000
[ 0.311351] CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033
[ 0.311867] CR2: ffff88817ffff000 CR3: 0000000075070001 CR4: 00000000000200b0
[ 0.312510] Call Trace:
[ 0.312731] <TASK>
[ 0.312922] ? __die_body.cold+0x19/0x2b
[ 0.313280] ? die_addr+0x46/0x70
[ 0.313582] ? exc_general_protection+0x150/0x240
[ 0.314010] ? asm_exc_general_protection+0x26/0x30
[ 0.314454] ? trace_initcall_start_cb+0x4/0x90
[ 0.314868] ? __text_poke+0x370/0x840
[ 0.315211] ? __text_poke+0x2ab/0x840
[ 0.315552] ? __pfx_text_poke_memcpy+0x10/0x10
[ 0.315965] ? __pfx___text_poke+0x10/0x10
[ 0.316345] ? mutex_lock+0x87/0xe0
[ 0.316662] ? __pfx_mutex_lock+0x10/0x10
[ 0.317096] ? ftrace_now+0xc0/0xd0
[ 0.317431] ? __pfx_ftrace_now+0x10/0x10
[ 0.317813] ? trace_initcall_start_cb+0x4/0x90
[ 0.318245] ftrace_modify_code_direct+0x58/0x70
[ 0.318696] ftrace_process_locs+0x514/0xb70
[ 0.319105] ftrace_init+0x80/0x150
[ 0.319458] start_kernel+0x181/0x3c0
[ 0.319802] x86_64_start_reservations+0x24/0x30
[ 0.320219] x86_64_start_kernel+0x84/0x90
[ 0.320584] common_startup_64+0x13e/0x141
[ 0.320952] </TASK>
[ 0.321149] Modules linked in:
[ 0.321423] ---[ end trace 0000000000000000 ]---
[ 0.321832] RIP: 0010:__text_poke+0x370/0x840
[ 0.322227] Code: c6 48 8b 04 24 48 8d 78 08 e8 cc fb ff ff 48 c7 c0 80 51 04 00 48 8b 0d 96 32 d9 02 48 ba 00 00 00 00 00 fc ff df 48 c1 e8 03 <0f> b6 04 10 84 c0 74 06 0f 8e cc 03 00 00 65 0f b6 1d aa 3c ed 6e
[ 0.323906] RSP: 0000:ffffffff94007d88 EFLAGS: 00010006
[ 0.324383] RAX: 0000000000008a30 RBX: ffffffff93f0e070 RCX: ffff8881000744c0
[ 0.325025] RDX: dffffc0000000000 RSI: 8000000072001063 RDI: ffff88810006d0d8
[ 0.325681] RBP: ffffffff91001b24 R08: 0000000000000001 R09: fffffbfff2800f91
[ 0.326332] R10: 0000000000000003 R11: 0000000000000100 R12: 0000000000000005
[ 0.326986] R13: 0000000000000b24 R14: 0000000000000b29 R15: 8000000000000063
[ 0.327658] FS: 0000000000000000(0000) GS:ffff88815b400000(0000) knlGS:0000000000000000
[ 0.328411] CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033
[ 0.328931] CR2: ffff88817ffff000 CR3: 0000000075070001 CR4: 00000000000200b0
[ 0.329595] Kernel panic - not syncing: Attempted to kill the idle task!
[ 0.330278] ---[ end Kernel panic - not syncing: Attempted to kill the idle task! ]---
kvm panic:
-------
[ 0.222231] rcu: srcu_init: Setting srcu_struct sizes based on contention.
[ 0.222824] kfence: initialized - using 2097152 bytes for 255 objects at 0x(____ptrval____)-0x(____ptrval____)
[ 0.223470] Oops: general protection fault, probably for non-canonical address 0xdffffc0000005356: 0000 [#1] PREEMPT SMP KASAN NOPTI
[ 0.224200] KASAN: probably user-memory-access in range [0x0000000000029ab0-0x0000000000029ab7]
[ 0.224747] CPU: 0 UID: 0 PID: 0 Comm: swapper/0 Not tainted 6.13.0_ricci+ #917
[ 0.225223] Hardware name: QEMU Standard PC (Q35 + ICH9, 2009), BIOS 1.16.3-2.fc40 04/01/2014
[ 0.225779] RIP: 0010:__sysvec_kvm_asyncpf_interrupt+0x64/0x110
[ 0.226169] Code: 7c e4 4a 48 c7 c7 40 35 27 b7 e8 67 c2 cd 01 48 c7 c0 b0 9a 02 00 48 ba 00 00 00 00 00 fc ff df 48 89 c1 83 e0 07 48 c1 e9 03 <0f> b6 14 11 38 c2 7f 08 84 d2 0f 85 80 00 00 00 65 0f b6 1d 54 61
[ 0.227327] RSP: 0000:ffff888157009fe0 EFLAGS: 00010006
[ 0.227657] RAX: 0000000000000000 RBX: ffffffffb7a07e68 RCX: 0000000000005356
[ 0.228088] RDX: dffffc0000000000 RSI: ffffffffb7273540 RDI: ffffffffb740be40
[ 0.228523] RBP: 0000000000000000 R08: 0000000000000001 R09: ffffed102ae07c08
[ 0.228960] R10: ffff88815703e047 R11: ffff888157009ff8 R12: 0000000000000000
[ 0.229390] R13: 0000000000000000 R14: 0000000000000000 R15: 0000000000000000
[ 0.229848] FS: 0000000000000000(0000) GS:ffff888157000000(0000) knlGS:0000000000000000
[ 0.230375] CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033
[ 0.231089] CR2: ffff88815d201000 CR3: 000000015ba66001 CR4: 00000000000200b0
[ 0.231567] Call Trace:
[ 0.231727] <IRQ>
[ 0.231859] ? __die_body.cold+0x19/0x26
[ 0.232125] ? die_addr+0x41/0x70
[ 0.232354] ? exc_general_protection+0x150/0x240
[ 0.232663] ? asm_exc_general_protection+0x26/0x30
[ 0.232988] ? __sysvec_kvm_asyncpf_interrupt+0x64/0x110
[ 0.233341] sysvec_kvm_asyncpf_interrupt+0x9b/0xe0
[ 0.233668] </IRQ>
[ 0.233805] <TASK>
[ 0.233942] asm_sysvec_kvm_asyncpf_interrupt+0x1a/0x20
[ 0.234295] RIP: 0010:__x86_return_thunk+0x0/0x10
[ 0.234606] Code: 01 00 00 00 cc e8 01 00 00 00 cc 48 81 c4 80 00 00 00 65 48 c7 05 3c 54 14 49 ff ff ff ff c3 cc 66 2e 0f 1f 84 00 00 00 00 00 <c3> 90 90 90 90 cc cc cc cc cc cc cc cc cc cc cc e9 eb ff ff ff 0f
[ 0.235828] RSP: 0000:ffffffffb7a07f18 EFLAGS: 00000296
[ 0.236168] RAX: 0000000000000096 RBX: 0000000000000000 RCX: 1ffff1102ae07af0
[ 0.236677] RDX: dffffc0000000000 RSI: 0000000000000004 RDI: 000000000000000f
[ 0.237148] RBP: 00000000000000b0 R08: 0000000000000001 R09: 0000000000000000
[ 0.237610] R10: ffff8881571c1e40 R11: 0000000000000007 R12: 0000000000000000
[ 0.238071] R13: 0000000000000000 R14: 0000000000000000 R15: 0000000000094770
[ 0.238547] start_kernel+0x263/0x3a0
[ 0.238787] x86_64_start_reservations+0x24/0x30
[ 0.239088] x86_64_start_kernel+0x84/0x90
[ 0.239355] common_startup_64+0x13e/0x141
[ 0.239637] </TASK>
[ 0.239786] Modules linked in:
[ 0.239987] ---[ end trace 0000000000000000 ]---
[ 0.240287] RIP: 0010:__sysvec_kvm_asyncpf_interrupt+0x64/0x110
[ 0.240676] Code: 7c e4 4a 48 c7 c7 40 35 27 b7 e8 67 c2 cd 01 48 c7 c0 b0 9a 02 00 48 ba 00 00 00 00 00 fc ff df 48 89 c1 83 e0 07 48 c1 e9 03 <0f> b6 14 11 38 c2 7f 08 84 d2 0f 85 80 00 00 00 65 0f b6 1d 54 61
[ 0.241896] RSP: 0000:ffff888157009fe0 EFLAGS: 00010006
[ 0.242242] RAX: 0000000000000000 RBX: ffffffffb7a07e68 RCX: 0000000000005356
[ 0.242708] RDX: dffffc0000000000 RSI: ffffffffb7273540 RDI: ffffffffb740be40
[ 0.243194] RBP: 0000000000000000 R08: 0000000000000001 R09: ffffed102ae07c08
[ 0.243659] R10: ffff88815703e047 R11: ffff888157009ff8 R12: 0000000000000000
[ 0.244122] R13: 0000000000000000 R14: 0000000000000000 R15: 0000000000000000
[ 0.244595] FS: 0000000000000000(0000) GS:ffff888157000000(0000) knlGS:0000000000000000
[ 0.245121] CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033
[ 0.245502] CR2: ffff88815d201000 CR3: 000000015ba66001 CR4: 00000000000200b0
[ 0.245968] Kernel panic - not syncing: Fatal exception in interrupt
[ 0.246411] ---[ end Kernel panic - not syncing: Fatal exception in interrupt ]---
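For reference, the two "non-canonical address" values above are consistent with KASAN shadow lookups for the small addresses that KASAN reports. The following is a minimal sketch of that arithmetic, assuming the usual x86_64 generic KASAN mapping (shadow = (addr >> 3) + offset) and taking the offset from the RDX value shown in both register dumps. The function names are hypothetical and used only for illustration.

```python
# Sketch of the generic KASAN address-to-shadow mapping on x86_64.
# Assumption: the offset equals the RDX value in both oopses above.
KASAN_SHADOW_OFFSET = 0xDFFFFC0000000000
KASAN_SHADOW_SCALE_SHIFT = 3  # one shadow byte covers 8 bytes of memory
MASK64 = 0xFFFFFFFFFFFFFFFF


def shadow_addr(addr: int) -> int:
    """Return the shadow byte address KASAN checks for addr (hypothetical helper)."""
    return ((addr >> KASAN_SHADOW_SCALE_SHIFT) + KASAN_SHADOW_OFFSET) & MASK64


def shadowed_range(shadow: int) -> tuple[int, int]:
    """Return the inclusive 8-byte range covered by one shadow byte."""
    start = ((shadow - KASAN_SHADOW_OFFSET) & MASK64) << KASAN_SHADOW_SCALE_SHIFT
    return start, start + 7


if __name__ == "__main__":
    # Addresses taken from the "KASAN: probably user-memory-access" lines.
    for addr in (0x45180, 0x29AB0):
        print(f"{addr:#x} -> shadow {shadow_addr(addr):#x}")
    # Faulting address from the ftrace oops, mapped back to the accessed range.
    lo, hi = shadowed_range(0xDFFFFC0000008A30)
    print(f"range [{lo:#018x}-{hi:#018x}]")
```

Under this assumption, both faults happen while reading the shadow byte for a very low address (0x45180 and 0x29ab0), not a kernel pointer.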
You can find the kernel config (with ftrace enabled) here:
https://people.redhat.com/~ruyang/snakeyear/panic-ftrace.config
Thanks
Dave