xen-devel.lists.xenproject.org archive mirror
* cpuidle causing Dom0 soft lockups
@ 2010-01-21  9:51 Jan Beulich
From: Jan Beulich @ 2010-01-21  9:51 UTC (permalink / raw)
  To: xen-devel

On large systems where Dom0 boots with (significantly) more than
32 vCPU-s, we have received multiple reports that C-state management,
which is now enabled by default, causes soft lockups that usually
prevent the boot from completing.

The observations are:

Reducing the number of vCPU-s (or pCPU-s) far enough makes the
systems work.

max_cstate=0 makes the systems work.

max_cstate=1 makes the problem less severe on one (bigger) system,
and eliminates it completely on another (smaller) one.

When appearing to hang, all vCPU-s are in Dom0's timer_interrupt(),
and all (sometimes all but one) are attempting to acquire xtime_lock.
However, due to our use of ticket locks we can verify that this is not
a deadlock (repeatedly sending '0' shows forward progress, as the
tickets [visible on the stack] continue to increase). Additionally, there
is always one vCPU that has its polling event channel (used for
waking the next waiting vCPU when a lock becomes available)
signaled.

In one case (but not in the other) it is always the same vCPU that
is apparently taking very long to wake up from the polling request.
This may be coincidence, but output after sending 'c' also indicates
a significantly higher (about 3 times) usage value for C2 than the
second highest one; the duration printed is roughly the same for
all CPUs.
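
The "not a deadlock" argument above relies on how ticket locks work:
each waiter draws an increasing ticket, and the lock is handed over in
ticket order, so tickets that keep increasing between two snapshots
mean the lock is still changing hands. Below is a minimal sketch of
that idea using C11 atomics. It is not the actual Dom0 kernel code:
struct ticket_lock, ticket_lock_acquire(), ticket_lock_release() and
made_progress() are hypothetical names chosen for illustration, and the
sketch spins where the kernel described above polls an event channel.

```c
#include <assert.h>
#include <stdatomic.h>
#include <stdbool.h>

/* Hypothetical ticket lock; not the real Dom0 kernel structure. */
struct ticket_lock {
    atomic_uint next_ticket;  /* ticket handed to the next arrival */
    atomic_uint now_serving;  /* ticket currently allowed to own the lock */
};

/* Draw a ticket and wait until it is being served; returns the ticket. */
static unsigned ticket_lock_acquire(struct ticket_lock *l)
{
    unsigned my = atomic_fetch_add(&l->next_ticket, 1);

    while (atomic_load(&l->now_serving) != my)
        ;  /* plain spin; the kernel in question polls an event channel */
    return my;
}

/* Hand the lock to the holder of the next ticket. */
static void ticket_lock_release(struct ticket_lock *l)
{
    /* The lock must be held: at least one ticket is outstanding. */
    assert(atomic_load(&l->now_serving) != atomic_load(&l->next_ticket));
    atomic_fetch_add(&l->now_serving, 1);
}

/*
 * Compare two snapshots of a ticket counter taken some time apart.
 * A later value that has advanced (modulo wraparound) means the lock
 * was handed over in between, i.e. the system is slow, not deadlocked.
 */
static bool made_progress(unsigned earlier, unsigned later)
{
    return (int)(later - earlier) > 0;
}
```

Sampling now_serving twice and passing both values to made_progress()
mirrors what repeatedly sending '0' does by hand in the observation
above.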

While I don't know this code well, it would seem that we're suffering
from extremely long wakeup times. This suggests that there likely is
a (performance) problem even with smaller numbers of vCPU-s.
Hence, unless this can be fixed before 4.0 is released, I would suggest
disabling C-state management by default again.

I can provide full logs if needed.

Jan

