From: Marek Marczykowski <marmarek@mimuw.edu.pl>
To: Konrad Rzeszutek Wilk <konrad.wilk@oracle.com>
Cc: "xen-devel@lists.xensource.com" <xen-devel@lists.xensource.com>,
Joanna Rutkowska <joanna@invisiblethingslab.com>,
Rafal Wojtczuk <rafal@invisiblethingslab.com>
Subject: Re: xen-4.1: PV domain hanging at startup, jiffies stopped
Date: Wed, 31 Aug 2011 18:27:54 +0200 [thread overview]
Message-ID: <4E5E610A.1010501@mimuw.edu.pl> (raw)
In-Reply-To: <4E5D1B57.7040106@mimuw.edu.pl>
[-- Attachment #1.1: Type: text/plain, Size: 3213 bytes --]
On 30.08.2011 19:18, Marek Marczykowski wrote:
> On 29.08.2011 22:59, Konrad Rzeszutek Wilk wrote:
>> Ok, but I am still unsure where it is hanging in DomU. Can you run with
>> 'console=hvc0 debug initcall_debug loglevel=8 earlyprintk=xen' to get an idea
>> of what is stuck in the guest?
>
> With "initcall_debug" parameter problem does not appear (at least for
> 200 domU starts)... It looks like race condition which doesn't happens
> on slowed down kernel (by printing lots of debug info). This also
> explains why this bug appears only on fast hardware.
>
>> You might also have better luck using
>> 'xenctx' to get a stack trace of what is hangning in the guest.
>> (you will need the System.map file from the guest's kernel.. but that should
>> be fairly easy to extract).
>
> xenctx didn't provide any useful data :/ It always shows following trace
> for hanged domU:
> -----------------
> rip: ffffffff810013aa hypercall_page+0x3aa
> flags: 00001246 i z p
> rsp: ffffffff81801ee0
> rax: 0000000000000000 rcx: ffffffff810013aa rdx: 0000000000000000
> rbx: ffffffff81800010 rsi: 00000000deadbeef rdi: 00000000deadbeef
> rbp: ffffffff81801ef8 r8: 0000000000000000 r9: 0000000000000000
> r10: 0000000000000000 r11: 0000000000000246 r12: 0000000000000000
> r13: 0000000000000000 r14: ffffffffffffffff r15: 0000000000000000
> cs: e033 ss: e02b ds: 0000 es: 0000
> fs: 0000 @ 0000000000000000
> gs: 0000 @ ffff880018ee7000/0000000000000000
> Code (instr addr ffffffff810013aa)
> cc cc cc cc cc cc cc cc cc cc cc 51 41 53 b8 1d 00 00 00 0f 05 <41> 5b
> 59 c3 cc cc cc cc cc cc cc
>
>
> Stack:
> 0000000000000000 0000000000000000 ffffffff810072a0 ffffffff81801f18
> ffffffff81012528 ffffffff81800010 ffffffff8185a2a0 ffffffff81801f38
> ffffffff81009faf 0000000000000000 6db6db6db6db6db7 ffffffff81801f48
> ffffffff813fb388 ffffffff81801f88 ffffffff81875c79 ffffffff81801f88
>
> Call Trace:
> [<ffffffff810013aa>] hypercall_page+0x3aa <--
> [<ffffffff810072a0>] xen_safe_halt+0x10
> [<ffffffff81012528>] default_idle+0x58
> [<ffffffff81009faf>] cpu_idle+0x5f
> [<ffffffff813fb388>] rest_init+0x68
> [<ffffffff81875c79>] start_kernel+0x36f
> [<ffffffff81875346>] x86_64_start_reservations+0x131
> [<ffffffff81878245>] xen_start_kernel+0x5f1
> ------------------
>
> I've collected few more messages from successful and failed domU starts.
> The only difference is the place where "Switched to NOHz mode on CPU #0"
> appears and existence of "CE: xen increased min_delta_ns to ..." and
> "CE: Reprogramming failure. Giving up" messages.
>
> I think it can be related to:
> http://lists.xensource.com/archives/html/xen-devel/2010-07/msg00649.html
> (this was on HVM not PV, but looks similar)
>
> I've tried also xenpm set-max-cstate 0 and tsc_mode=1 in domU config,
> but it doesn't help. Also pinning vcpu doesn't help (this domUs have
> only 1 vcpu). Is 'xenpm set-max-cstate 0' the same as booting xen with
> max_cstate=0?
Looks like tsc_mode=2 solves the problem.
--
Pozdrawiam / Best Regards,
Marek Marczykowski | RLU #390519
marmarek at mimuw edu pl | xmpp:marmarek at staszic waw pl
[-- Attachment #1.2: S/MIME Cryptographic Signature --]
[-- Type: application/pkcs7-signature, Size: 5842 bytes --]
[-- Attachment #2: Type: text/plain, Size: 138 bytes --]
_______________________________________________
Xen-devel mailing list
Xen-devel@lists.xensource.com
http://lists.xensource.com/xen-devel
next prev parent reply other threads:[~2011-08-31 16:27 UTC|newest]
Thread overview: 12+ messages / expand[flat|nested] mbox.gz Atom feed top
2011-08-28 13:13 xen-4.1: PV domain hanging at startup, jiffies stopped Marek Marczykowski
2011-08-29 20:07 ` Konrad Rzeszutek Wilk
2011-08-29 20:21 ` Marek Marczykowski
2011-08-29 20:59 ` Konrad Rzeszutek Wilk
2011-08-29 21:28 ` Pasi Kärkkäinen
2011-08-30 17:18 ` Marek Marczykowski
2011-08-31 16:27 ` Marek Marczykowski [this message]
2011-08-31 20:00 ` Dan Magenheimer
2011-08-31 20:49 ` Marek Marczykowski
2011-08-31 21:01 ` Keir Fraser
2011-08-31 21:13 ` Marek Marczykowski
2011-08-31 22:07 ` Keir Fraser
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=4E5E610A.1010501@mimuw.edu.pl \
--to=marmarek@mimuw.edu.pl \
--cc=joanna@invisiblethingslab.com \
--cc=konrad.wilk@oracle.com \
--cc=rafal@invisiblethingslab.com \
--cc=xen-devel@lists.xensource.com \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).