All of lore.kernel.org
 help / color / mirror / Atom feed
From: Andrew Cooper <andrew.cooper3@citrix.com>
To: Jan Beulich <JBeulich@suse.com>,
	"Marcos E. Matsunaga" <Marcos.Matsunaga@oracle.com>
Cc: xen-devel@lists.xenproject.org
Subject: Re: EFI Xen unstable crashes on Dell E6410 when calling efi_get_time.
Date: Wed, 22 Oct 2014 12:01:08 +0100	[thread overview]
Message-ID: <54478E74.3010408@citrix.com> (raw)
In-Reply-To: <5447A1790200007800040F54@mail.emea.novell.com>

On 22/10/14 11:22, Jan Beulich wrote:
>>>> On 22.10.14 at 11:45, <andrew.cooper3@citrix.com> wrote:
>> On 22/10/14 01:29, Marcos E. Matsunaga wrote:
>>> I went out and got the serial cable. Attached is the full output.
>>>
>>>
>>> On 10/21/2014 05:06 PM, Marcos E. Matsunaga wrote:
>>>> Folks,
>>>>
>>>> I am trying to boot Xen using efibootmgr on a Dell E6410 laptop with
>>>> 4GB RAM, running an Intel I5 dual core with VT and all the
>>>> virtualization options enabled.
>>>>
>>>> It crashes almost immediately. I am working on getting the serial
>>>> console up so that I can get a more detailed stack.
>>>>
>>>> A screenshot of the console is attached.
>>>>
>>>> The xen.cfg file is:
>>>>
>>>> [global]
>>>> default=xen
>>>>
>>>> [xen]
>>>> options=console=vga,com1 com1=115200,8n1 dom0_max_vcpus=2 vga="qxl"
>>>> kernel=vmlinuz-3.8.13-48.el7uek.Other_EFI_v1.x86_64
>>>> root=UUID=917bfc7f-8d9c-4acf-a98a-a9f558daccf2  ro console=hvc0
>>>> enforcing=0 biosdevname=0 earlyprintk=xen nomodeset
>>>> ramdisk=initramfs-3.8.13-48.el7uek.Other_EFI_v1.x86_64.img
>>>>
>>>>
>>>> The codepath is "(gdb) x/20i get_cmos_time
>>>>    0xffff82d080188825 <get_cmos_time>:  push   %rbp
>>>>    0xffff82d080188826 <get_cmos_time+1>:        mov %rsp,%rbp
>>>>    0xffff82d080188829 <get_cmos_time+4>:        push   %r12
>>>>    0xffff82d08018882b <get_cmos_time+6>:        push   %rbx
>>>>    0xffff82d08018882c <get_cmos_time+7>:        cmpb
>>>> $0x0,0xb620d(%rip)        # 0xffff82d08023ea40 <efi_enabled>
>>>>    0xffff82d080188833 <get_cmos_time+14>:       je 0xffff82d080188843
>>>> <get_cmos_time+30>
>>>>    0xffff82d080188835 <get_cmos_time+16>:       callq
>>>> 0xffff82d080100069 <efi_get_time>"
>>>>
>> Ok - there are two separate bugs here.
>>
>> The first is that we call into the efi runtime via efi_rs->GetTime, and
>> a pagefault happens for the instruction at 0x00000000db25a33d for the
>> virtual address 0x00000000fed1f410
>>
>> The memory map looks quite weird, but the faulting address is covered in
>> this range.
>>
>> (XEN)  00000fed1c000-00000fed1ffff type=11 attr=8000000000000000
>>
>> So I would expect it to be mapped into the EFI pagetables.
> Then you must have missed
>
> (XEN) Unknown cachability for MFNs 0xfed1c-0xfed1f
>
> which means no mapping got established (as we don't know what
> cachability attributes to give to it).
>
> This is a firmware bug.

I had indeed missed the secondary meaning of that message.

>> The second is that once the pagefault has happened, we trap back into
>> Xen and attempt to do a pagetable walk, falling over an assertion in
>> map_domain_page().
>>
>> For EFI calls, we run on the efi pagetables, not the idle pagetables, so
>> I am not surprised that the assertion has failed.  I suspect that the
>> pagefault hander for hypervisor faults needs to become wise to the fact
>> that we may receive a fault when calling into the firmware.  As all the
>> efi pagetables are xenheap pages, there is nothing conceptually wrong
>> with using map_domain_page() to do the walk.
> I'm not sure it's worth taking care of this special case. But yes, if
> we really want to, extending the condition to also consider
> efi_l4_pgtable would seem the right thing to do.

I think being able to do a pagetable walk from an EFI fault would be
useful, even if only to aid debugging.  In this case, a non-debug build
would successfully perform the walk.

I have had a quick go, but it is rather hard to get the efi_l4_pgtable
symbol available to use in domain_page.c without some gross extern'ing. 
It would be a nice fix if anyone has sufficient tuits.

~Andrew

  reply	other threads:[~2014-10-22 11:01 UTC|newest]

Thread overview: 14+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2014-10-21 21:06 EFI Xen unstable crashes on Dell E6410 when calling efi_get_time Marcos E. Matsunaga
2014-10-22  0:29 ` Marcos E. Matsunaga
2014-10-22  9:45   ` Andrew Cooper
2014-10-22 10:22     ` Jan Beulich
2014-10-22 11:01       ` Andrew Cooper [this message]
2014-10-22 14:15         ` Jan Beulich
2014-10-22 14:22           ` Andrew Cooper
2014-10-23 14:03         ` Jan Beulich
2014-10-24  2:37           ` Marcos E. Matsunaga
2014-10-24  9:15             ` Jan Beulich
2014-10-27 21:07               ` Marcos E. Matsunaga
2014-10-28  7:34                 ` Jan Beulich
2014-10-22 13:12       ` Konrad Rzeszutek Wilk
2014-10-22 14:19         ` Jan Beulich

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=54478E74.3010408@citrix.com \
    --to=andrew.cooper3@citrix.com \
    --cc=JBeulich@suse.com \
    --cc=Marcos.Matsunaga@oracle.com \
    --cc=xen-devel@lists.xenproject.org \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.