public inbox for kexec@lists.infradead.org
 help / color / mirror / Atom feed
From: ebiederm@xmission.com (Eric W. Biederman)
To: "H. Peter Anvin" <hpa@zytor.com>
Cc: kexec@lists.infradead.org, Neil Horman <nhorman@tuxdriver.com>
Subject: Re: Question regardin intel64 arch and page table setup
Date: Wed, 11 Aug 2010 14:51:39 -0700	[thread overview]
Message-ID: <m1zkwsn7h0.fsf@fess.ebiederm.org> (raw)
In-Reply-To: <4C6301C2.5020808@zytor.com> (H. Peter Anvin's message of "Wed, 11 Aug 2010 13:02:10 -0700")

"H. Peter Anvin" <hpa@zytor.com> writes:

> On 08/11/2010 12:47 PM, Neil Horman wrote:
>> Hey all-
>> 	I've got a question regarding x86_64 and how linux uses the paging
>> hardware.  I'm tinkering with ways to get kexec to boot a new kernel on panic
>> without leaving long mode.  The idea being that if we can do that, then we don't
>> need to store the new kdump kernel below the 4G physical limit for 32 bit
>> systems.  In doing this though, I figured I would have to re-initalize the page
>> table with an identity mapped set of page tables to cover all of ram and load
>> that into cr3.  My question is, is it safe to do so while paging is enabled.
>> The docs I've read are unclear on that and if I have to disable paging that
>> automatically drops me out of long mode, which is bad.  I would think its safe
>> to do, since I imagined we had to do on context switches in the scheduler, but
>> the __switch_to implementation for x86_64 sems to do nothing but update the task
>> register.  Intel vol 3a says we need to update cr3, but I don't see where that
>> happens, so I'm not sure if theres some automated bit that does a cr3 update
>> safely when we write tr.
>> 
>> 	Anywho, any guidance, clarification would be appreciated.  Thanks!
>> Neil
>> 
>
> It is definitely safe to load a new CR3 while paging is done; it is done
> all the time.  The currently executing page needs to be mapped to the
> same physical and virtual address in most kernels.
>
> However, there are a *LOT* of issues with having a kernel that is
> completely above 4 GiB.  For one thing, a lot of device drivers simply
> will not work if there is no memory below 4 GiB awavilable to the
> kernel.   As such, I don't think you will be successful in this
> project.

A couple of pieces.
1) The kernel side of kexec and kexec on panic does not leave long mode.
   Long mode is left by the glue code in /sbin/kexec.

2) I agree about the DMA limitation however there are enough systems
   with iommu's these days you may be able to get it to work.

3) I would start just getting the normal kexec case to work.
   The 64bit kernel does support starting at the 64bit entry point,
   but I don't think it has been tested if loaded above 4G.

   It certainly should work and as time goes by I expect running
   a kernel above 4G to become an increasingly interesting use case.
   So it is certainly worth play with.

   But as Peter says having a kernel completely above 4GiB has is likely
   to uncover a lot of baked in assumptions so we real problems might
   result.

   Hmm.  On the normal kexec side you don't loose the low 4GiB so that
   case should be a lot easier to bootstrap with.  Once it works with
   the low 4GiB you can add a mem= or whatever to disable using the low
   4GiB and see what happens.

Have fun.

Eric

_______________________________________________
kexec mailing list
kexec@lists.infradead.org
http://lists.infradead.org/mailman/listinfo/kexec

  reply	other threads:[~2010-08-11 21:52 UTC|newest]

Thread overview: 10+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2010-08-11 19:47 Question regardin intel64 arch and page table setup Neil Horman
2010-08-11 20:02 ` H. Peter Anvin
2010-08-11 21:51   ` Eric W. Biederman [this message]
2010-08-11 21:54     ` Eric W. Biederman
2010-08-11 22:02     ` H. Peter Anvin
2010-08-12  0:22       ` Eric W. Biederman
2010-08-12  1:05   ` Neil Horman
2010-08-12  1:46     ` H. Peter Anvin
2010-08-12  1:53     ` H. Peter Anvin
2010-08-12  3:21     ` Eric W. Biederman

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=m1zkwsn7h0.fsf@fess.ebiederm.org \
    --to=ebiederm@xmission.com \
    --cc=hpa@zytor.com \
    --cc=kexec@lists.infradead.org \
    --cc=nhorman@tuxdriver.com \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox