* sporadic virtio_blk errors and "vcpu not ready for apic_round_robin"
@ 2009-02-06 8:00 Michael Tokarev
0 siblings, 0 replies; only message in thread
From: Michael Tokarev @ 2009-02-06 8:00 UTC (permalink / raw)
To: KVM list
Hello
Since quite some time, I'm seeing sporadic I/O errors in guests
running ontop of virtio_blk devices. The information I have is
quite bare: guest usually shows something like:
Feb 6 02:47:34 hobbit kernel: end_request: I/O error, dev vda, sector 9786968
Feb 6 02:47:34 hobbit kernel: Buffer I/O error on device vda7, logical block 473367
Feb 6 02:47:34 hobbit kernel: lost page write due to I/O error on vda7
Feb 6 02:47:34 hobbit kernel: Aborting journal on device vda7.
Feb 6 02:47:35 hobbit kernel: ext3_abort called.
Feb 6 02:47:35 hobbit kernel: EXT3-fs error (device vda7): ext3_journal_start_sb: Detected aborted journal
Feb 6 02:47:35 hobbit kernel: Remounting filesystem read-only
After this point, the system is still live but the corresponding
block device stops working. I can umount the device, but any
attempt to remount it tells the device is *busy*, and using,
say, cfdisk on it (just starting, attempting to READ the partition
table) results in a kernel OOPS after about a 2 mins of inactivity.
At which time host displays a series of
vcpu not ready for apic_round_robin
messages (about 20 of them).
I'm trying to capture the OOPS right now. But obviously the problem
is elsewhere, since that OOPS is far after the original issue (the
I/O errors).
It happens sporadically, sometimes the guest is running for a week,
sometimes (as here) it crashed after several hours of uptime. It
does not relate to system activity either, as far as I can see --
happens on either high or slightly-loaded system, and may happen
on mostly idle guest system while another high-loaded guest is
running at the same time.
The host is running 2.6.27.10 x86-64 on a AMD Phenom 9750 processor,
AMD 780G/SB700 chipset. Using stock kvm modules. Userspace is
32bits kvm-83. Guests are linux systems running 2.6.27.10 or .14,
32bits, uniprocessor.
After seeing this link -- https://bugs.launchpad.net/ubuntu/+source/kvm/+bug/246175 ,
I disabled cpufreq on host. Bit it didn't help.
The issue persists since about a month or two (difficult to say as the
problem is very sporadic). I *think* kvm-72 (for example) exposed the
same problem on this host/guest combination, but I'm not sure.
Any pointers on how to debug the prob, or, even better, if it's a known
issue, is very welcome -- this is a production system and it becomes
quite.. unstable.
Thanks!
/mjt
^ permalink raw reply [flat|nested] only message in thread
only message in thread, other threads:[~2009-02-06 8:00 UTC | newest]
Thread overview: (only message) (download: mbox.gz follow: Atom feed
-- links below jump to the message on this page --
2009-02-06 8:00 sporadic virtio_blk errors and "vcpu not ready for apic_round_robin" Michael Tokarev
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox