public inbox for linux-kernel@vger.kernel.org
 help / color / mirror / Atom feed
From: Ingo Molnar <mingo@elte.hu>
To: Kevin Shanahan <kmshanah@ucwb.org.au>
Cc: "Rafael J. Wysocki" <rjw@sisk.pl>,
	Linux Kernel Mailing List <linux-kernel@vger.kernel.org>,
	Kernel Testers List <kernel-testers@vger.kernel.org>,
	Mike Galbraith <efault@gmx.de>,
	bugme-daemon@bugzilla.kernel.org,
	Steven Rostedt <rostedt@goodmis.org>,
	Peter Zijlstra <a.p.zijlstra@chello.nl>
Subject: Re: [Bug #12465] KVM guests stalling on 2.6.28 (bisected) [Bug 12465]
Date: Sun, 15 Feb 2009 11:04:54 +0100	[thread overview]
Message-ID: <20090215100454.GA30314@elte.hu> (raw)
In-Reply-To: <1234691284.4967.23.camel@kulgan.wumi.org.au>


* Kevin Shanahan <kmshanah@ucwb.org.au> wrote:

> On Sat, 2009-02-14 at 21:50 +0100, Rafael J. Wysocki wrote:
> > This message has been generated automatically as a part of a report
> > of regressions introduced between 2.6.27 and 2.6.28.
> > 
> > The following bug entry is on the current list of known regressions
> > introduced between 2.6.27 and 2.6.28.  Please verify if it still should
> > be listed and let me know (either way).
> 
> Yes, this should still be listed.
> 
> I just tested against 2.6.29-rc5 and the problem is as bad as ever
> (perhaps worse?)
> 
> --- hermes-old.wumi.org.au ping statistics ---
> 900 packets transmitted, 448 received, +317 errors, 50% packet loss, time 899845ms
> rtt min/avg/max/mdev = 0.131/420.015/10890.699/1297.022 ms, pipe 11

i looked at the trace you did earlier and which you uploaded to:

  http://disenchant.net/tmp/bug-12465/trace-1/

Here is one 3 seconds (!) latency:

 0)  qemu-sy-4237  |               |      kvm_vcpu_block() {
 0)  qemu-sy-4237  |               |        kvm_cpu_has_interrupt() {
 0)  qemu-sy-4237  |               |          kvm_apic_has_interrupt() {
 0)  qemu-sy-4237  |   0.291 us    |          }
 0)  qemu-sy-4237  |               |          kvm_apic_accept_pic_intr() {
 0)  qemu-sy-4237  |   0.291 us    |          }
 0)  qemu-sy-4237  |   1.476 us    |        }
 0)  qemu-sy-4237  |               |        kvm_cpu_has_pending_timer() {
 0)  qemu-sy-4237  |   0.325 us    |        }
 0)  qemu-sy-4237  |               |        kvm_arch_vcpu_runnable() {
 0)  qemu-sy-4237  |   0.288 us    |        }
 0)  qemu-sy-4237  |               |        kvm_arch_vcpu_put() {
 0)  qemu-sy-4237  |   0.415 us    |        }
 0)  qemu-sy-4237  |               |        schedule() {
 0)  qemu-sy-4237  |               |          wakeup_preempt_entity() {
 0)  qemu-sy-4237  |   0.300 us    |          }
 ------------------------------------------
 0)  qemu-sy-4237  =>   ksoftir-4   
 ------------------------------------------

 0)   ksoftir-4    | ! 3010470 us |  }
 ------------------------------------------
 0)   ksoftir-4    =>  qemu-sy-4355 
 ------------------------------------------

 0)  qemu-sy-4355  |   1.575 us    |          }
 0)  qemu-sy-4355  |   6.520 us    |        }
 0)  qemu-sy-4355  |   7.121 us    |      }
 0)  qemu-sy-4355  |               |      __wake_up() {
 0)  qemu-sy-4355  |               |        __wake_up_common() {
 0)  qemu-sy-4355  |               |          autoremove_wake_function() {
 0)  qemu-sy-4355  |               |            default_wake_function() {

qemu-sy-4237 has been scheduled away, and the system appeared to have done
nothing in the meantime. That's not something that really looks like a
scheduler regression - there is nothing the scheduler can do if KVM
decides to block a task.

It would be nice to enhance this single-CPU trace some more - to more
surgically see what is going on. Firstly, absolute timestamps would be
nice:

  echo funcgraph-abstime  > trace_options
  echo funcgraph-proc     > trace_options

as it's a bit hard to see the global timescale of events.

Secondly, not all events are included - in particular i dont really see
the points when packets are passed. Would it be possible to add a tracing
hypercall so that the guest kernel can inject trace events that can be seen
on the native-side trace? Regarding ping latencies really just two things
matter: the loopback network device's rx and tx path. We should trace the
outgoing sequence number and the incoming sequence number of IP packets,
and inject that to the host side. This way we can correlate the delays
precisely.

	Ingo

  reply	other threads:[~2009-02-15 10:05 UTC|newest]

Thread overview: 71+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2009-02-14 20:48 2.6.29-rc5: Reported regressions 2.6.27 -> 2.6.28 Rafael J. Wysocki
2009-02-14 20:48 ` [Bug #12061] snd_hda_intel: power_save: sound cracks on powerdown Rafael J. Wysocki
2009-02-14 20:50 ` [Bug #12209] oldish top core dumps (in its meminfo() function) Rafael J. Wysocki
2009-02-14 20:50 ` [Bug #12208] uml is very slow on 2.6.28 host Rafael J. Wysocki
2009-02-22 13:58   ` Américo Wang
2009-02-23 14:27     ` Miklos Szeredi
2009-02-14 20:50 ` [Bug #12160] networking oops after resume from s2ram (2.6.28-rc6) Rafael J. Wysocki
2009-02-14 20:50 ` [Bug #12265] FPU emulation broken in 2.6.28-rc8 ? Rafael J. Wysocki
2009-02-14 23:23   ` Ingo Molnar
2009-02-14 20:50 ` [Bug #12337] ~100 extra wakeups reported by powertop Rafael J. Wysocki
2009-02-14 23:35   ` Alberto Gonzalez
2009-02-15 14:20     ` Rafael J. Wysocki
2009-02-14 20:50 ` [Bug #12263] Sata soft reset filling log Rafael J. Wysocki
2009-02-15 20:47   ` Justin Madru
2009-02-15 21:21     ` Rafael J. Wysocki
2009-02-15 22:30       ` Ingo Molnar
2009-02-15 23:12         ` Rafael J. Wysocki
2009-02-16 15:18           ` Sergei Shtylyov
2009-02-16 15:21             ` Ingo Molnar
2009-02-16 15:21             ` Sergei Shtylyov
2009-02-16 15:31               ` Sergei Shtylyov
2009-02-16 19:23                 ` Justin Madru
2009-02-16 19:42                   ` Sergei Shtylyov
2009-02-16 21:40                     ` Justin Madru
2009-02-17 11:19                       ` Hugh Dickins
2009-02-17 19:08                         ` Justin Madru
2009-02-18  1:03                           ` Sergei Shtylyov
2009-02-18  6:42                             ` Justin Madru
2009-02-14 20:50 ` [Bug #12224] journal activity on inactive partition causes inactive harddrive spinup Rafael J. Wysocki
2009-02-23 12:22   ` Theodore Tso
2009-02-23 14:36     ` Rafael J. Wysocki
2009-02-14 20:50 ` [Bug #12395] 2.6.28-rc9: oprofile regression Rafael J. Wysocki
2009-02-14 20:50 ` [Bug #12401] 2.6.28 regression: xbacklight broken on ThinkPad X61s Rafael J. Wysocki
2009-02-15 13:44   ` Matthew Garrett
2009-02-15 14:38     ` Rafael J. Wysocki
2009-02-15 22:16       ` Tino Keitel
2009-02-16  1:16         ` Matthew Garrett
2009-02-16 12:37           ` Ingo Molnar
2009-02-16 12:42             ` Matthew Garrett
2009-02-14 20:50 ` [Bug #12403] TTY problem on linux-2.6.28-rc7 Rafael J. Wysocki
2009-02-16 16:12   ` Aristeu Rozanski
2009-02-16 20:42     ` Rafael J. Wysocki
2009-02-14 20:50 ` [Bug #12393] debugging in dosemu causes lots of 'scheduling while atomic' Rafael J. Wysocki
2009-02-14 20:50 ` [Bug #12406] 2.6.28 thinks that my PS/2 mouse is a touchpad Rafael J. Wysocki
2009-02-15  6:14   ` Alexander E. Patrakov
2009-02-15 14:40     ` Rafael J. Wysocki
2009-02-14 20:50 ` [Bug #12405] oops in __bounce_end_io_read under kvm Rafael J. Wysocki
2009-02-14 20:50 ` [Bug #12404] Oops in 2.6.28-rc9 and -rc8 -- mtrr issues / e1000e Rafael J. Wysocki
2009-02-14 20:50 ` [Bug #12409] NULL pointer dereference at get_stats() Rafael J. Wysocki
2009-02-14 20:50 ` [Bug #12407] Kernel 2.6.28 regression: Hang after hibernate Rafael J. Wysocki
2009-02-14 20:50 ` [Bug #12408] Funny problem with 2.6.28: Kernel stalls Rafael J. Wysocki
2009-02-14 20:50 ` [Bug #12411] 2.6.28: BUG in r8169 Rafael J. Wysocki
2009-02-14 20:50 ` [Bug #12500] r8169: NETDEV WATCHDOG: eth0 (r8169): transmit timed out Rafael J. Wysocki
2009-02-14 20:50 ` [Bug #12421] GPF on 2.6.28 and 2.6.28-rc9-git3, e1000e and e1000 issues Rafael J. Wysocki
2009-02-14 20:50 ` [Bug #12465] KVM guests stalling on 2.6.28 (bisected) Rafael J. Wysocki
2009-02-15  9:48   ` [Bug #12465] KVM guests stalling on 2.6.28 (bisected) [Bug 12465] Kevin Shanahan
2009-02-15 10:04     ` Ingo Molnar [this message]
2009-02-22 10:39       ` [Bug #12465] KVM guests stalling on 2.6.28 (bisected) [bug 12465] Kevin Shanahan
2009-02-22 17:27         ` Ingo Molnar
2009-02-23 11:38       ` [Bug #12465] KVM guests stalling on 2.6.28 (bisected) [Bug 12465] Kevin Shanahan
2009-02-14 20:50 ` [Bug #12559] Huawei E169 doesn't work as mass storage anymore Rafael J. Wysocki
2009-02-14 20:50 ` [Bug #12612] hard lockup when interrupting cdda2wav Rafael J. Wysocki
2009-02-17 17:16   ` Matthias Reichl
2009-02-17 20:23     ` Rafael J. Wysocki
2009-02-19 13:49       ` FUJITA Tomonori
2009-02-14 20:50 ` [Bug #12619] Regression 2.6.28 and last - boot failed Rafael J. Wysocki
2009-02-14 20:50 ` [Bug #12614] WOL with forcedeth broken since f55c21fd9a92a444e55ad1ca4e4732d56661bf2e Rafael J. Wysocki
2009-02-14 20:50 ` [Bug #12690] DPMS (LCD powersave, poweroff) don't work Rafael J. Wysocki
2009-02-14 20:50 ` [Bug #12645] DMI low-memory-protect quirk causes resume hang on Samsung NC10 Rafael J. Wysocki
2009-02-14 20:50 ` [Bug #12658] ThrustMaster Firestorm Dual Power 3 Gamepads stopped working Rafael J. Wysocki
2009-02-14 20:50 ` [Bug #12634] video distortion and lockup with i830 video chip and 2.6.28.3 Rafael J. Wysocki

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=20090215100454.GA30314@elte.hu \
    --to=mingo@elte.hu \
    --cc=a.p.zijlstra@chello.nl \
    --cc=bugme-daemon@bugzilla.kernel.org \
    --cc=efault@gmx.de \
    --cc=kernel-testers@vger.kernel.org \
    --cc=kmshanah@ucwb.org.au \
    --cc=linux-kernel@vger.kernel.org \
    --cc=rjw@sisk.pl \
    --cc=rostedt@goodmis.org \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox