linux-pm.vger.kernel.org archive mirror
 help / color / mirror / Atom feed
From: Dave Young <dyoung@redhat.com>
To: Roberto Ricci <io@r-ricci.it>
Cc: ebiederm@xmission.com, rafael@kernel.org, pavel@ucw.cz,
	ytcoode@gmail.com, kexec@lists.infradead.org,
	linux-pm@vger.kernel.org, akpm@linux-foundation.org,
	regressions@lists.linux.dev, linux-kernel@vger.kernel.org
Subject: Re: [REGRESSION] Kernel booted via kexec fails to resume from hibernation
Date: Mon, 27 Jan 2025 10:39:41 +0800	[thread overview]
Message-ID: <Z5bx7ZHNcyc5fM_L@darkstar.users.ipa.redhat.com> (raw)
In-Reply-To: <Z4WFjBVHpndct7br@desktop0a>

Hi
On 01/13/25 at 10:28pm, Roberto Ricci wrote:
> After rebooting the system via kexec, hibernating and rebooting the machine, this oops occurs:
> 
[snip]
> 
> I will send the kernel config and dmesg in replies to this email.
> 

I tried your config (removed some config driver related which is not useful), but it can not boot on my kvm guest.
Firstly I saw a panic in ftrace path,  then I rebuilt the kernel without ftrace, it panicked again but in kvm related code path.
Both are not related to kexec at all so I suspect your bug is not kexec specific.

Ftrace panic:
----
    0.215515] software IO TLB: area num 4.
[    0.298815] SLUB: HWalign=64, Order=0-3, MinObjects=0, CPUs=4, Nodes=1
Poking KASLR using RDRAND RDTSC...
[    0.299813] ftrace: allocating 40381 entries in 158 pages
[    0.300603] Oops: general protection fault, probably for non-canonical address 0xdffffc0000008a30: 0000 [#1] PREEMPT SMP KASAN NOPTI
[    0.301877] KASAN: probably user-memory-access in range [0x0000000000045180-0x0000000000045187]
[    0.302734] CPU: 0 UID: 0 PID: 0 Comm: swapper Not tainted 6.13.0_ricci+ #913
[    0.303401] Hardware name: QEMU Standard PC (Q35 + ICH9, 2009), BIOS 1.16.3-2.fc40 04/01/2014
[    0.304187] RIP: 0010:__text_poke+0x370/0x840
[    0.304591] Code: c6 48 8b 04 24 48 8d 78 08 e8 cc fb ff ff 48 c7 c0 80 51 04 00 48 8b 0d 96 32 d9 02 48 ba 00 00 00 00 00 fc ff df 48 c1 e8 03 <0f> b6 04 10 84 c0 74 06 0f 8e cc 03 00 00 65 0f b6 1d aa 3c ed 6e
[    0.306646] RSP: 0000:ffffffff94007d88 EFLAGS: 00010006
[    0.307139] RAX: 0000000000008a30 RBX: ffffffff93f0e070 RCX: ffff8881000744c0
[    0.307839] RDX: dffffc0000000000 RSI: 8000000072001063 RDI: ffff88810006d0d8
[    0.308634] RBP: ffffffff91001b24 R08: 0000000000000001 R09: fffffbfff2800f91
[    0.309359] R10: 0000000000000003 R11: 0000000000000100 R12: 0000000000000005
[    0.309994] R13: 0000000000000b24 R14: 0000000000000b29 R15: 8000000000000063
[    0.310631] FS:  0000000000000000(0000) GS:ffff88815b400000(0000) knlGS:0000000000000000
[    0.311351] CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
[    0.311867] CR2: ffff88817ffff000 CR3: 0000000075070001 CR4: 00000000000200b0
[    0.312510] Call Trace:
[    0.312731]  <TASK>
[    0.312922]  ? __die_body.cold+0x19/0x2b
[    0.313280]  ? die_addr+0x46/0x70
[    0.313582]  ? exc_general_protection+0x150/0x240
[    0.314010]  ? asm_exc_general_protection+0x26/0x30
[    0.314454]  ? trace_initcall_start_cb+0x4/0x90
[    0.314868]  ? __text_poke+0x370/0x840
[    0.315211]  ? __text_poke+0x2ab/0x840
[    0.315552]  ? __pfx_text_poke_memcpy+0x10/0x10
[    0.315965]  ? __pfx___text_poke+0x10/0x10
[    0.316345]  ? mutex_lock+0x87/0xe0
[    0.316662]  ? __pfx_mutex_lock+0x10/0x10
[    0.317096]  ? ftrace_now+0xc0/0xd0
[    0.317431]  ? __pfx_ftrace_now+0x10/0x10
[    0.317813]  ? trace_initcall_start_cb+0x4/0x90
[    0.318245]  ftrace_modify_code_direct+0x58/0x70
[    0.318696]  ftrace_process_locs+0x514/0xb70
[    0.319105]  ftrace_init+0x80/0x150
[    0.319458]  start_kernel+0x181/0x3c0
[    0.319802]  x86_64_start_reservations+0x24/0x30
[    0.320219]  x86_64_start_kernel+0x84/0x90
[    0.320584]  common_startup_64+0x13e/0x141
[    0.320952]  </TASK>
[    0.321149] Modules linked in:
[    0.321423] ---[ end trace 0000000000000000 ]---
[    0.321832] RIP: 0010:__text_poke+0x370/0x840
[    0.322227] Code: c6 48 8b 04 24 48 8d 78 08 e8 cc fb ff ff 48 c7 c0 80 51 04 00 48 8b 0d 96 32 d9 02 48 ba 00 00 00 00 00 fc ff df 48 c1 e8 03 <0f> b6 04 10 84 c0 74 06 0f 8e cc 03 00 00 65 0f b6 1d aa 3c ed 6e
[    0.323906] RSP: 0000:ffffffff94007d88 EFLAGS: 00010006
[    0.324383] RAX: 0000000000008a30 RBX: ffffffff93f0e070 RCX: ffff8881000744c0
[    0.325025] RDX: dffffc0000000000 RSI: 8000000072001063 RDI: ffff88810006d0d8
[    0.325681] RBP: ffffffff91001b24 R08: 0000000000000001 R09: fffffbfff2800f91
[    0.326332] R10: 0000000000000003 R11: 0000000000000100 R12: 0000000000000005
[    0.326986] R13: 0000000000000b24 R14: 0000000000000b29 R15: 8000000000000063
[    0.327658] FS:  0000000000000000(0000) GS:ffff88815b400000(0000) knlGS:0000000000000000
[    0.328411] CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
[    0.328931] CR2: ffff88817ffff000 CR3: 0000000075070001 CR4: 00000000000200b0
[    0.329595] Kernel panic - not syncing: Attempted to kill the idle task!
[    0.330278] ---[ end Kernel panic - not syncing: Attempted to kill the idle task! ]---

kvm panic:
-------
[    0.222231] rcu: srcu_init: Setting srcu_struct sizes based on contention.
[    0.222824] kfence: initialized - using 2097152 bytes for 255 objects at 0x(____ptrval____)-0x(____ptrval____)
[    0.223470] Oops: general protection fault, probably for non-canonical address 0xdffffc0000005356: 0000 [#1] PREEMPT SMP KASAN NOPTI
[    0.224200] KASAN: probably user-memory-access in range [0x0000000000029ab0-0x0000000000029ab7]
[    0.224747] CPU: 0 UID: 0 PID: 0 Comm: swapper/0 Not tainted 6.13.0_ricci+ #917
[    0.225223] Hardware name: QEMU Standard PC (Q35 + ICH9, 2009), BIOS 1.16.3-2.fc40 04/01/2014
[    0.225779] RIP: 0010:__sysvec_kvm_asyncpf_interrupt+0x64/0x110
[    0.226169] Code: 7c e4 4a 48 c7 c7 40 35 27 b7 e8 67 c2 cd 01 48 c7 c0 b0 9a 02 00 48 ba 00 00 00 00 00 fc ff df 48 89 c1 83 e0 07 48 c1 e9 03 <0f> b6 14 11 38 c2 7f 08 84 d2 0f 85 80 00 00 00 65 0f b6 1d 54 61
[    0.227327] RSP: 0000:ffff888157009fe0 EFLAGS: 00010006
[    0.227657] RAX: 0000000000000000 RBX: ffffffffb7a07e68 RCX: 0000000000005356
[    0.228088] RDX: dffffc0000000000 RSI: ffffffffb7273540 RDI: ffffffffb740be40
[    0.228523] RBP: 0000000000000000 R08: 0000000000000001 R09: ffffed102ae07c08
[    0.228960] R10: ffff88815703e047 R11: ffff888157009ff8 R12: 0000000000000000
[    0.229390] R13: 0000000000000000 R14: 0000000000000000 R15: 0000000000000000
[    0.229848] FS:  0000000000000000(0000) GS:ffff888157000000(0000) knlGS:0000000000000000
[    0.230375] CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
[    0.231089] CR2: ffff88815d201000 CR3: 000000015ba66001 CR4: 00000000000200b0
[    0.231567] Call Trace:
[    0.231727]  <IRQ>
[    0.231859]  ? __die_body.cold+0x19/0x26
[    0.232125]  ? die_addr+0x41/0x70
[    0.232354]  ? exc_general_protection+0x150/0x240
[    0.232663]  ? asm_exc_general_protection+0x26/0x30
[    0.232988]  ? __sysvec_kvm_asyncpf_interrupt+0x64/0x110
[    0.233341]  sysvec_kvm_asyncpf_interrupt+0x9b/0xe0
[    0.233668]  </IRQ>
[    0.233805]  <TASK>
[    0.233942]  asm_sysvec_kvm_asyncpf_interrupt+0x1a/0x20
[    0.234295] RIP: 0010:__x86_return_thunk+0x0/0x10
[    0.234606] Code: 01 00 00 00 cc e8 01 00 00 00 cc 48 81 c4 80 00 00 00 65 48 c7 05 3c 54 14 49 ff ff ff ff c3 cc 66 2e 0f 1f 84 00 00 00 00 00 <c3> 90 90 90 90 cc cc cc cc cc cc cc cc cc cc cc e9 eb ff ff ff 0f
[    0.235828] RSP: 0000:ffffffffb7a07f18 EFLAGS: 00000296
[    0.236168] RAX: 0000000000000096 RBX: 0000000000000000 RCX: 1ffff1102ae07af0
[    0.236677] RDX: dffffc0000000000 RSI: 0000000000000004 RDI: 000000000000000f
[    0.237148] RBP: 00000000000000b0 R08: 0000000000000001 R09: 0000000000000000
[    0.237610] R10: ffff8881571c1e40 R11: 0000000000000007 R12: 0000000000000000
[    0.238071] R13: 0000000000000000 R14: 0000000000000000 R15: 0000000000094770
[    0.238547]  start_kernel+0x263/0x3a0
[    0.238787]  x86_64_start_reservations+0x24/0x30
[    0.239088]  x86_64_start_kernel+0x84/0x90
[    0.239355]  common_startup_64+0x13e/0x141
[    0.239637]  </TASK>
[    0.239786] Modules linked in:
[    0.239987] ---[ end trace 0000000000000000 ]---
[    0.240287] RIP: 0010:__sysvec_kvm_asyncpf_interrupt+0x64/0x110
[    0.240676] Code: 7c e4 4a 48 c7 c7 40 35 27 b7 e8 67 c2 cd 01 48 c7 c0 b0 9a 02 00 48 ba 00 00 00 00 00 fc ff df 48 89 c1 83 e0 07 48 c1 e9 03 <0f> b6 14 11 38 c2 7f 08 84 d2 0f 85 80 00 00 00 65 0f b6 1d 54 61
[    0.241896] RSP: 0000:ffff888157009fe0 EFLAGS: 00010006
[    0.242242] RAX: 0000000000000000 RBX: ffffffffb7a07e68 RCX: 0000000000005356
[    0.242708] RDX: dffffc0000000000 RSI: ffffffffb7273540 RDI: ffffffffb740be40
[    0.243194] RBP: 0000000000000000 R08: 0000000000000001 R09: ffffed102ae07c08
[    0.243659] R10: ffff88815703e047 R11: ffff888157009ff8 R12: 0000000000000000
[    0.244122] R13: 0000000000000000 R14: 0000000000000000 R15: 0000000000000000
[    0.244595] FS:  0000000000000000(0000) GS:ffff888157000000(0000) knlGS:0000000000000000
[    0.245121] CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
[    0.245502] CR2: ffff88815d201000 CR3: 000000015ba66001 CR4: 00000000000200b0
[    0.245968] Kernel panic - not syncing: Fatal exception in interrupt
[    0.246411] ---[ end Kernel panic - not syncing: Fatal exception in interrupt ]---


You can find the kernel config here (with the ftrace enabled):
https://people.redhat.com/~ruyang/snakeyear/panic-ftrace.config

Thanks
Dave


  parent reply	other threads:[~2025-01-27  2:39 UTC|newest]

Thread overview: 37+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2025-01-13 21:28 [REGRESSION] Kernel booted via kexec fails to resume from hibernation Roberto Ricci
2025-01-13 21:31 ` Roberto Ricci
2025-01-14  3:42   ` Baoquan He
2025-04-01 12:59   ` msizanoen
2025-04-03 22:00     ` Roberto Ricci
2025-04-04  2:54       ` msizanoen
2025-04-04  4:56         ` msizanoen
2025-04-04  5:50           ` msizanoen
2025-04-04 20:39             ` Roberto Ricci
2025-04-05  5:15             ` msizanoen
2025-04-04 20:00         ` Roberto Ricci
2025-01-13 21:32 ` Roberto Ricci
2025-01-13 23:17 ` Andrew Morton
2025-01-14 13:19   ` Roberto Ricci
2025-01-14 13:16 ` Roberto Ricci
2025-01-15  4:04   ` Baoquan He
2025-01-15 12:00     ` Roberto Ricci
2025-01-16 11:52       ` Roberto Ricci
2025-01-17  1:55         ` Baoquan He
2025-01-17  3:41           ` Baoquan He
2025-01-17  7:52             ` Roberto Ricci
2025-01-16  9:54     ` Yuntao Wang
2025-01-22  9:45 ` RuiRui Yang
2025-01-22 13:01   ` Roberto Ricci
2025-01-27  2:39 ` Dave Young [this message]
2025-01-27  2:42   ` Dave Young
2025-03-09 17:09     ` Donald
2025-03-29  0:14     ` Roberto Ricci
2025-03-29  0:14       ` Roberto Ricci
2025-03-29  0:15       ` Roberto Ricci
2025-03-29  1:44       ` Baoquan He
2025-03-29 20:30         ` Roberto Ricci
2025-03-29 20:33           ` Roberto Ricci
2025-03-31  3:22           ` Dave Young
2025-04-03 21:59             ` Roberto Ricci
2025-04-04 23:31           ` Roberto Ricci
2025-04-04 23:37             ` Roberto Ricci

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=Z5bx7ZHNcyc5fM_L@darkstar.users.ipa.redhat.com \
    --to=dyoung@redhat.com \
    --cc=akpm@linux-foundation.org \
    --cc=ebiederm@xmission.com \
    --cc=io@r-ricci.it \
    --cc=kexec@lists.infradead.org \
    --cc=linux-kernel@vger.kernel.org \
    --cc=linux-pm@vger.kernel.org \
    --cc=pavel@ucw.cz \
    --cc=rafael@kernel.org \
    --cc=regressions@lists.linux.dev \
    --cc=ytcoode@gmail.com \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).