linux-perf-users.vger.kernel.org archive mirror
 help / color / mirror / Atom feed
* [BUG] Guest OSes die simultaneously (bisected)
@ 2024-01-03 22:22 Paul E. McKenney
  2024-01-03 23:27 ` Paul E. McKenney
  2024-01-04 10:01 ` Breno Leitao
  0 siblings, 2 replies; 12+ messages in thread
From: Paul E. McKenney @ 2024-01-03 22:22 UTC (permalink / raw)
  To: Like Xu, Andi Kleen, Kan Liang, Luwei Kang, Peter Zijlstra,
	Paolo Bonzini
  Cc: linux-perf-users, linux-kernel, kvm, Breno Leitao,
	Arnaldo Carvalho de Melo, Ingo Molnar

Hello!

Since some time between v5.19 and v6.4, long-running rcutorture tests
would (rarely but intolerably often) have all guests on a given host die
simultaneously with something like an instruction fault or a segmentation
violation.

Each bisection step required 20 hosts running 10 hours each, and
this eventually fingered commit c59a1f106f5c ("KVM: x86/pmu: Add
IA32_PEBS_ENABLE MSR emulation for extended PEBS").  Although this commit
is certainly messing with things that could possibly cause all manner
of mischief, I don't immediately see a smoking gun.  Except that the
commit prior to this one is rock solid.

Just to make things a bit more exciting, bisection in mainline proved
to be problematic due to bugs of various kinds that hid this one.  I was
therefore forced to bisect among the commits backported to the internal
v5.19-based kernel, which fingered the backported version of the patch
called out above.

Please note that this is not (yet) an emergency.  I will just continue
to run rcutorture on v5.19-based hypervisors in the meantime.

Any suggestions for debugging or fixing?

							Thanx, Paul

^ permalink raw reply	[flat|nested] 12+ messages in thread

end of thread, other threads:[~2024-01-04 19:23 UTC | newest]

Thread overview: 12+ messages (download: mbox.gz follow: Atom feed
-- links below jump to the message on this page --
2024-01-03 22:22 [BUG] Guest OSes die simultaneously (bisected) Paul E. McKenney
2024-01-03 23:27 ` Paul E. McKenney
2024-01-04  0:24   ` Sean Christopherson
2024-01-04  1:00     ` Paul E. McKenney
2024-01-04 14:50       ` Paul E. McKenney
2024-01-04 14:59         ` Paolo Bonzini
2024-01-04 16:06           ` Paul E. McKenney
2024-01-04 16:32             ` Paolo Bonzini
2024-01-04 17:25               ` Sean Christopherson
2024-01-04 19:23               ` Paul E. McKenney
2024-01-04 17:07             ` Andi Kleen
2024-01-04 10:01 ` Breno Leitao

This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).