public inbox for kvm@vger.kernel.org
 help / color / mirror / Atom feed
From: Marcelo Tosatti <mtosatti@redhat.com>
To: Alexander Graf <agraf@suse.de>
Cc: Avi Kivity <avi@redhat.com>,
	"kvm@vger.kernel.org" <kvm@vger.kernel.org>,
	Joerg Roedel <joerg.roedel@amd.com>,
	Sheng Yang <sheng@linux.intel.com>
Subject: Re: KVM guest crashes
Date: Sat, 24 Jan 2009 11:06:01 -0200	[thread overview]
Message-ID: <20090124130601.GA3485@amt.cnet> (raw)
In-Reply-To: <7D5F5FE3-E7C9-4352-9125-5624F08F94E3@suse.de>

On Sat, Jan 24, 2009 at 08:42:06AM +0100, Alexander Graf wrote:
>> rarely now). You can use the no_timer_check kernel option to bypass  
>> it.
>
> Ok :-). Thanks. The logic in the kernel for this is really stupid  
> (basing timing on clock speed). What about disabling the check if we  
> detect KVM?

Yes, this is an option. We've talked about it before, but no patch was
merged. The RHEL5.3 kernel skips those checks when it detects VMWare 
or KVM hypervisors.

We should understand what is happening to fix the fullvirt/old guest
case. For the in-kernel PIT, I believe there is a bug somewhere, either
in PIT itself or in the interaction with IOAPIC (failure to inject
interrupts for some reason). I started debugging it by constantly
reboot'ing an SMP guest but my testbox died. Hope to get back to it
soon.

>> Regarding the corruption problem, I have a few questions:
>>
>> - It is SMP specific (ie both kernel/userspace irqchip fail).
>> 	- which means UP guests are stable with both kernel/user
>> 	  irqchip.
>
> I have not been able to reproduce any of my issues with UP. I have to  
> admit that I only tried UP with in-kernel irqchip.

OK.

>> The "Stuck ??" messages seem to be coming from smpboot.c. So for some
>> reason vcpu's are being reset. Don't seem to be a triple fault because
>> in that case all vcpu's would be reset (so yes, the vcpu was really on
>> BIOS code).
>
> Hm. I know that OSX turns off CPUs it doesn't need as an alternative to 
> deep-sleep. Does Linux do that too?

Not that I know of, unless you offline CPU's manually, which does not
seem to be the case.

>> Suggest the following:
>> - Confirm the problem happens with root on ext3 filesystem (can't you
>>  mount the CIFS and copy the data over to a local guest disk to
>>  simulate similar load?).
>
> I had Stuck ?? messages without networking, but if it helps I can try  
> that too. In the project we're using this for we do things over cifs, so 
> that's why I built the test case around it.

OK. Just trying to decrease the variables involved. I'll setup a machine
to run a similar load next week.

>> - Check that the kernel text is not corrupted. Save the "good" kernel
>>  text with QEMU's "pmemsave" or "memsave" (you can see start/end in
>>  the symbols _text/_etext, /proc/kallsyms) after booting. After you
>>  see the crash, save the "bad" kernel text, compare. This can give
>>  additional clues (or not).
>
> Good idea - I'll try.
>
>> Also, you mentioned "other reports" previously, can you point to them,
>> please?
>
> Yes, will do later. I gotta run now! Thanks for the reply - it's good to 
> know this isn't getting ignored :-).

Have a good weekend.


  reply	other threads:[~2009-01-24 13:06 UTC|newest]

Thread overview: 18+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2009-01-20 15:49 KVM guest crashes Alexander Graf
2009-01-20 20:07 ` Avi Kivity
2009-01-20 20:20   ` Alexander Graf
2009-01-21  8:14   ` Alexander Graf
2009-01-21  9:05     ` Avi Kivity
2009-01-21  9:36       ` Avi Kivity
2009-01-21 10:44         ` Alexander Graf
2009-01-22 20:29         ` Alexander Graf
2009-01-22 20:36           ` Alexander Graf
2009-01-22 20:55             ` Alexander Graf
2009-01-23 16:36               ` Alexander Graf
2009-01-23 22:36           ` Marcelo Tosatti
2009-01-24  7:42             ` Alexander Graf
2009-01-24 13:06               ` Marcelo Tosatti [this message]
2009-01-24 14:30                 ` Alexander Graf
2009-01-26 15:53             ` Alexander Graf
2009-01-26 16:21               ` Marcelo Tosatti
2009-01-26 16:33                 ` Alexander Graf

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=20090124130601.GA3485@amt.cnet \
    --to=mtosatti@redhat.com \
    --cc=agraf@suse.de \
    --cc=avi@redhat.com \
    --cc=joerg.roedel@amd.com \
    --cc=kvm@vger.kernel.org \
    --cc=sheng@linux.intel.com \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox