All of lore.kernel.org
 help / color / mirror / Atom feed
From: Brad Campbell <lists2009@fnarfbargle.com>
To: Nadav Amit <nadav.amit@gmail.com>, Paolo Bonzini <pbonzini@redhat.com>
Cc: "Saso Slavicic" <saso.linux@astim.si>,
	"kvm list" <kvm@vger.kernel.org>,
	"Radim Krčmář" <rkrcmar@redhat.com>
Subject: Re: XP machine freeze
Date: Mon, 20 Apr 2015 00:50:40 +0800	[thread overview]
Message-ID: <5533DCE0.4080606@fnarfbargle.com> (raw)
In-Reply-To: <45D7B761-5F3B-4A1B-8057-6C77693A308B@gmail.com>


On 19/04/15 23:48, Nadav Amit wrote:
> Brad Campbell <lists2009@fnarfbargle.com> wrote:
>
>> On 13/04/15 22:02, Paolo Bonzini wrote:
>>> On 13/04/2015 14:45, Brad Campbell wrote:
>>>> G'day Paolo,
>>>>
>>>> Yes, on AMD and I've tried hard to reproduce it on Intel and been unable
>>>> to thus far.
>>>>
>>>> Now you mention it may be AMD specific, I have a spare motherboard and
>>>> processor sitting in a drawer. I'll bolt it together tomorrow and see if
>>>> I can reproduce it on another AMD machine. Two machines should let me
>>>> test it twice as fast.
>>>>
>>>> I got a fail this afternoon, so I'm due to reboot tonight. I'll just
>>>> revert that one suspect commit from a known bad kernel and see if that
>>>> cleans it up. If not then I'll work through the remainder of the
>>>> information in your mail. I really appreciate the attention you've paid
>>>> to this, it has been a frustrating bug for me because I'm in a position
>>>> of not knowing what I don't know, and obviously doing something wrong in
>>>> very long bisection processes.
>>> Actually, if you have time to change your course of action, please
>>> revert the one that Nadav pointed out (f210f7572bed, KVM: x86:
>>> Fix lost interrupt on irr_pending race) or cherry-pick it on top of 3.17.
>>>
>>> Paolo
>> Ok, I think we have a winner. Patch manually plopped on top of vanilla 3.17. It has never gone for anywhere near this long on a bad kernel.
>>
>> brad@srv:~$ uptime
>> 23:24:48 up 6 days,  1:01,  3 users,  load average: 1.48, 1.95, 2.48
>>
>> So this patch went into the kernel during the 3.19 release cycle? Affected kernels 3.16-3.18?
> Actually, the original bug seemed to be introduced by commit
> 33e4c68656a2e461b296ce714ec322978de85412 "KVM: Optimize searching for
> highest IRR”. So the bug goes all the way back to 2.6.32. The race that this
> patch fixes just became more apparent (i.e., likely to happen) on 3.16. It
> is fixed in 3.19.

And I can confidently state that over the years I've seen this happen a 
number of times, but in each case I was using qemu with an SDL console 
as a user-interactive VM, and a moving the mouse would restore network 
connectivity. It was obviously seriously exacerbated by something that 
went into 3.16.

I really appreciate the assistance in pinning this down. At the next 
excuse for a reboot I'll upgrade the server to a 3.19.x kernel and call 
it done.

Regards,
Brad

-- 
Dolphins are so intelligent that within a few weeks they can
train Americans to stand at the edge of the pool and throw them
fish.


  reply	other threads:[~2015-04-19 16:51 UTC|newest]

Thread overview: 25+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2015-03-16 15:10 XP machine freeze Saso Slavicic
2015-03-19  0:51 ` Marcelo Tosatti
2015-03-30 16:19   ` Saso Slavicic
2015-03-22 15:31 ` Brad Campbell
2015-03-30 21:11   ` Paolo Bonzini
2015-03-31  0:27     ` Brad Campbell
2015-03-31  6:29       ` Saso Slavicic
2015-03-31  7:18         ` Brad Campbell
2015-03-31  8:56           ` Paolo Bonzini
2015-03-31 11:16             ` Brad Campbell
2015-03-31 11:23               ` Paolo Bonzini
2015-04-04 10:55                 ` Brad Campbell
2015-04-13  4:07     ` Brad Campbell
2015-04-13 12:38       ` Paolo Bonzini
2015-04-13 12:45         ` Brad Campbell
2015-04-13 14:02           ` Paolo Bonzini
2015-04-13 14:25             ` Brad Campbell
2015-04-19 15:27             ` Brad Campbell
2015-04-19 15:48               ` Nadav Amit
2015-04-19 16:50                 ` Brad Campbell [this message]
2015-04-19 17:16                   ` Paolo Bonzini
2015-04-13 12:47         ` Saso Slavicic
2015-04-13 13:33         ` Radim Krčmář
2015-04-13 13:34         ` Nadav Amit
2015-04-13 14:01           ` Paolo Bonzini

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=5533DCE0.4080606@fnarfbargle.com \
    --to=lists2009@fnarfbargle.com \
    --cc=kvm@vger.kernel.org \
    --cc=nadav.amit@gmail.com \
    --cc=pbonzini@redhat.com \
    --cc=rkrcmar@redhat.com \
    --cc=saso.linux@astim.si \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.