* arm64 boot requirements @ 2015-11-30 1:45 Carl van Schaik 2015-12-01 11:02 ` Mark Rutland 0 siblings, 1 reply; 10+ messages in thread From: Carl van Schaik @ 2015-11-30 1:45 UTC (permalink / raw) To: linux-arm-kernel In commit bd00cd5f8c8c3c282bb1e1eac6a6679a4f808091, the idmap_pg_dir and swapper_pg_dir where moved from before the kernel to after it. The problem is that these symbols fall outside the range covered by the ELF file - outside of any section. A bootloader which loads the kernel ELF file and dynamically determines where to place the DTB, may try place it after the kernel. We've just run into this problem and the DTB gets overwritten as soon as the pagetables are created. I'd suggest that the kernel either: A. document this boot requirement for where not to load a DTB B. update the vmlinux.lds.S such that these symbols (and _end) are properly covered by a section in the ELF, and thus preventing this issue. thanks, Carl -------------- next part -------------- A non-text attachment was scrubbed... Name: 0001-arm64-place-initial-page-tables-in-ELF-section.patch Type: text/x-patch Size: 1323 bytes Desc: not available URL: <http://lists.infradead.org/pipermail/linux-arm-kernel/attachments/20151130/9de49ebc/attachment.bin> -------------- next part -------------- A non-text attachment was scrubbed... Name: smime.p7s Type: application/pkcs7-signature Size: 4267 bytes Desc: S/MIME Cryptographic Signature URL: <http://lists.infradead.org/pipermail/linux-arm-kernel/attachments/20151130/9de49ebc/attachment.p7s> ^ permalink raw reply [flat|nested] 10+ messages in thread
* arm64 boot requirements 2015-11-30 1:45 arm64 boot requirements Carl van Schaik @ 2015-12-01 11:02 ` Mark Rutland 2015-12-01 11:52 ` Ard Biesheuvel ` (2 more replies) 0 siblings, 3 replies; 10+ messages in thread From: Mark Rutland @ 2015-12-01 11:02 UTC (permalink / raw) To: linux-arm-kernel Hi, On Mon, Nov 30, 2015 at 12:45:18PM +1100, Carl van Schaik wrote: > In commit bd00cd5f8c8c3c282bb1e1eac6a6679a4f808091, the idmap_pg_dir > and swapper_pg_dir where moved from before the kernel to after it. > > The problem is that these symbols fall outside the range covered by > the ELF file - outside of any section. > > A bootloader which loads the kernel ELF file and dynamically > determines where to place the DTB, may try place it after the > kernel. We've just run into this problem and the DTB gets > overwritten as soon as the pagetables are created. We had similar issues with the BSS when booting Image files prior to this and commit a2c1d73b94ed49f5 ("arm64: Update the Image header"). Since then, the image_size field in the Image header tells you how much memory the kernel may clobber (including the BSS and page tables). Prior to that, the page tables were below the kernel, and also not described in any ELF section. Others booting the kernel vmlinux haven't reported similar issues, so I assume that either they are parsing the Image header, or getting lucky. Parsing the header is necessary to get the correct text offset, too... Pratyush, Geoff, I understood you were loading the kernel vmlinux for kexec. Do you parse the Image header to figure out where to place things? > I'd suggest that the kernel either: > A. document this boot requirement for where not to load a DTB Do you have any particular suggestion? We already describe the Image footprint (including BSS and page tables) by the image_size in the Image header, which is sufficient. The size of the BSS and page tables is effectively unbound, so we can't define some upper bound that will always be true. The documentation is written on the assumption that an Image file is being used rather than a vmlinux. Perhaps that is something to consider. > B. update the vmlinux.lds.S such that these symbols (and _end) are > properly covered by a section in the ELF, and thus preventing this > issue. I'm worried that this only solves this one case, and it means that there are two (potentially conflicting) sources of information that a bootloader might be using -- the ELF or the Image header. I don't want to have to duplicate text_offset and so on, which implies that parsing the Image header is necessary anyway. That's something we can discuss if you send a patch (inline rather than attached). Thanks, Mark. ^ permalink raw reply [flat|nested] 10+ messages in thread
* arm64 boot requirements 2015-12-01 11:02 ` Mark Rutland @ 2015-12-01 11:52 ` Ard Biesheuvel 2015-12-01 22:09 ` Christopher Covington 2015-12-02 13:46 ` Carl van Schaik 2015-12-01 12:50 ` Pratyush Anand 2015-12-02 19:03 ` Geoff Levand 2 siblings, 2 replies; 10+ messages in thread From: Ard Biesheuvel @ 2015-12-01 11:52 UTC (permalink / raw) To: linux-arm-kernel On 1 December 2015 at 12:02, Mark Rutland <mark.rutland@arm.com> wrote: > Hi, > > On Mon, Nov 30, 2015 at 12:45:18PM +1100, Carl van Schaik wrote: >> In commit bd00cd5f8c8c3c282bb1e1eac6a6679a4f808091, the idmap_pg_dir >> and swapper_pg_dir where moved from before the kernel to after it. >> >> The problem is that these symbols fall outside the range covered by >> the ELF file - outside of any section. >> >> A bootloader which loads the kernel ELF file and dynamically >> determines where to place the DTB, may try place it after the >> kernel. We've just run into this problem and the DTB gets >> overwritten as soon as the pagetables are created. Could you explain why you are using the ELF file and not the binary image file? This is not future proof: currently, the Image is a straight binary objcopy of vmlinux, but that is not guaranteed to remain that way. Things like KASLR may require post build steps that mangle vmlinux or Image afterwards. > > We had similar issues with the BSS when booting Image files prior to > this and commit a2c1d73b94ed49f5 ("arm64: Update the Image header"). > Since then, the image_size field in the Image header tells you how much > memory the kernel may clobber (including the BSS and page tables). > > Prior to that, the page tables were below the kernel, and also not > described in any ELF section. > > Others booting the kernel vmlinux haven't reported similar issues, so I > assume that either they are parsing the Image header, or getting lucky. > Parsing the header is necessary to get the correct text offset, too... > > Pratyush, Geoff, I understood you were loading the kernel vmlinux for > kexec. Do you parse the Image header to figure out where to place > things? > >> I'd suggest that the kernel either: >> A. document this boot requirement for where not to load a DTB > > Do you have any particular suggestion? > > We already describe the Image footprint (including BSS and page tables) > by the image_size in the Image header, which is sufficient. The size of > the BSS and page tables is effectively unbound, so we can't define some > upper bound that will always be true. > > The documentation is written on the assumption that an Image file is > being used rather than a vmlinux. Perhaps that is something to consider. > >> B. update the vmlinux.lds.S such that these symbols (and _end) are >> properly covered by a section in the ELF, and thus preventing this >> issue. > > I'm worried that this only solves this one case, and it means that there > are two (potentially conflicting) sources of information that a > bootloader might be using -- the ELF or the Image header. I don't want > to have to duplicate text_offset and so on, which implies that parsing > the Image header is necessary anyway. > > That's something we can discuss if you send a patch (inline rather than > attached). > I think updating the linker script to put the page tables into a .pgdir section is reasonable, since it is part of the static footprint of the kernel. However, I strongly feel that the Image header should remain the authoritative source of information regarding the nature (big/little endian, page size) and the static footprint of the Image . ^ permalink raw reply [flat|nested] 10+ messages in thread
* arm64 boot requirements 2015-12-01 11:52 ` Ard Biesheuvel @ 2015-12-01 22:09 ` Christopher Covington 2015-12-02 10:26 ` Mark Rutland 2015-12-02 13:46 ` Carl van Schaik 1 sibling, 1 reply; 10+ messages in thread From: Christopher Covington @ 2015-12-01 22:09 UTC (permalink / raw) To: linux-arm-kernel On 12/01/2015 06:52 AM, Ard Biesheuvel wrote: > On 1 December 2015 at 12:02, Mark Rutland <mark.rutland@arm.com> wrote: >> Hi, >> >> On Mon, Nov 30, 2015 at 12:45:18PM +1100, Carl van Schaik wrote: >>> In commit bd00cd5f8c8c3c282bb1e1eac6a6679a4f808091, the idmap_pg_dir >>> and swapper_pg_dir where moved from before the kernel to after it. >>> >>> The problem is that these symbols fall outside the range covered by >>> the ELF file - outside of any section. >>> >>> A bootloader which loads the kernel ELF file and dynamically >>> determines where to place the DTB, may try place it after the >>> kernel. We've just run into this problem and the DTB gets >>> overwritten as soon as the pagetables are created. > > Could you explain why you are using the ELF file and not the binary image file? > This is not future proof: currently, the Image is a straight binary > objcopy of vmlinux, but that is not guaranteed to remain that way. > Things like KASLR may require post build steps that mangle vmlinux or > Image afterwards. > >> >> We had similar issues with the BSS when booting Image files prior to >> this and commit a2c1d73b94ed49f5 ("arm64: Update the Image header"). >> Since then, the image_size field in the Image header tells you how much >> memory the kernel may clobber (including the BSS and page tables). >> >> Prior to that, the page tables were below the kernel, and also not >> described in any ELF section. >> >> Others booting the kernel vmlinux haven't reported similar issues, so I >> assume that either they are parsing the Image header, or getting lucky. >> Parsing the header is necessary to get the correct text offset, too... >> >> Pratyush, Geoff, I understood you were loading the kernel vmlinux for >> kexec. Do you parse the Image header to figure out where to place >> things? >> >>> I'd suggest that the kernel either: >>> A. document this boot requirement for where not to load a DTB >> >> Do you have any particular suggestion? >> >> We already describe the Image footprint (including BSS and page tables) >> by the image_size in the Image header, which is sufficient. The size of >> the BSS and page tables is effectively unbound, so we can't define some >> upper bound that will always be true. >> >> The documentation is written on the assumption that an Image file is >> being used rather than a vmlinux. Perhaps that is something to consider. >> >>> B. update the vmlinux.lds.S such that these symbols (and _end) are >>> properly covered by a section in the ELF, and thus preventing this >>> issue. >> >> I'm worried that this only solves this one case, and it means that there >> are two (potentially conflicting) sources of information that a >> bootloader might be using -- the ELF or the Image header. I don't want >> to have to duplicate text_offset and so on, which implies that parsing >> the Image header is necessary anyway. >> >> That's something we can discuss if you send a patch (inline rather than >> attached). >> > > I think updating the linker script to put the page tables into a > .pgdir section is reasonable, since it is part of the static footprint > of the kernel. > > However, I strongly feel that the Image header should remain the > authoritative source of information regarding the nature (big/little > endian, page size) and the static footprint of the Image. I find `readelf -a | less` quite handy. Is there anything comparable for the AArch64 Image format? Please forgive my ignorance, but is the EFI stub another possible source for sort of information? Thanks, Christopher Covington -- Qualcomm Innovation Center, Inc. The Qualcomm Innovation Center, Inc. is a member of the Code Aurora Forum, a Linux Foundation Collaborative Project ^ permalink raw reply [flat|nested] 10+ messages in thread
* arm64 boot requirements 2015-12-01 22:09 ` Christopher Covington @ 2015-12-02 10:26 ` Mark Rutland 0 siblings, 0 replies; 10+ messages in thread From: Mark Rutland @ 2015-12-02 10:26 UTC (permalink / raw) To: linux-arm-kernel On Tue, Dec 01, 2015 at 05:09:18PM -0500, Christopher Covington wrote: > On 12/01/2015 06:52 AM, Ard Biesheuvel wrote: > > However, I strongly feel that the Image header should remain the > > authoritative source of information regarding the nature (big/little > > endian, page size) and the static footprint of the Image. > > I find `readelf -a | less` quite handy. Is there anything comparable for > the AArch64 Image format? Not that I am aware of. These days I just use od or hexedit, and parse the header manually. It's documented, so it would be possible to write one. > Please forgive my ignorance, but is the EFI stub another possible source > for sort of information? Not really. The PE/COFF header for the EFI stub realistically only tells you wether or not the kernel has an EFI stub. It shouldn't be used to derive information about the kernel itself. Thanks, Mark. ^ permalink raw reply [flat|nested] 10+ messages in thread
* arm64 boot requirements 2015-12-01 11:52 ` Ard Biesheuvel 2015-12-01 22:09 ` Christopher Covington @ 2015-12-02 13:46 ` Carl van Schaik 2015-12-03 12:24 ` Mark Rutland 1 sibling, 1 reply; 10+ messages in thread From: Carl van Schaik @ 2015-12-02 13:46 UTC (permalink / raw) To: linux-arm-kernel Hi, On 1 December 2015 Ard Biesheuvel <ard.biesheuvel@linaro.org> wote: > On 1 December 2015 at 12:02, Mark Rutland <mark.rutland@arm.com> > wrote: > > Hi, > > > > On Mon, Nov 30, 2015 at 12:45:18PM +1100, Carl van Schaik wrote: > >> In commit bd00cd5f8c8c3c282bb1e1eac6a6679a4f808091, the > idmap_pg_dir > >> and swapper_pg_dir where moved from before the kernel to after it. > >> > >> The problem is that these symbols fall outside the range covered by > >> the ELF file - outside of any section. > >> > >> A bootloader which loads the kernel ELF file and dynamically > >> determines where to place the DTB, may try place it after the > >> kernel. We've just run into this problem and the DTB gets > >> overwritten as soon as the pagetables are created. > > Could you explain why you are using the ELF file and not the binary image > file? > This is not future proof: currently, the Image is a straight binary > objcopy of vmlinux, but that is not guaranteed to remain that way. > Things like KASLR may require post build steps that mangle vmlinux or > Image afterwards. The reason we've been using ELF files is mostly to do with legacy virtualization related reasons in our systems, we used to patch symbols in the ELFs for example pre device-tree. However, since it hadn't caused problems until now we had continued to use it. We haven't yet added Aarch64 Linux boot image header parsing but it should be trivial. The other area we are looking into is optimized multi-VM static boot images by constructing hypervisor-bundle images containing de-duplicated Linux sections, allowing an ELF bootloader to populate multiple Linux VMs from a smaller boot image - resulting in faster boot. > > > > We had similar issues with the BSS when booting Image files prior to > > this and commit a2c1d73b94ed49f5 ("arm64: Update the Image header"). > > Since then, the image_size field in the Image header tells you how much > > memory the kernel may clobber (including the BSS and page tables). > > > > Prior to that, the page tables were below the kernel, and also not > > described in any ELF section. > > > > Others booting the kernel vmlinux haven't reported similar issues, so I > > assume that either they are parsing the Image header, or getting lucky. > > Parsing the header is necessary to get the correct text offset, too... > > > > Pratyush, Geoff, I understood you were loading the kernel vmlinux for > > kexec. Do you parse the Image header to figure out where to place > > things? > > > >> I'd suggest that the kernel either: > >> A. document this boot requirement for where not to load a DTB > > > > Do you have any particular suggestion? > > > > We already describe the Image footprint (including BSS and page tables) > > by the image_size in the Image header, which is sufficient. The size of > > the BSS and page tables is effectively unbound, so we can't define some > > upper bound that will always be true. > > > > The documentation is written on the assumption that an Image file is > > being used rather than a vmlinux. Perhaps that is something to consider. > > > >> B. update the vmlinux.lds.S such that these symbols (and _end) are > >> properly covered by a section in the ELF, and thus preventing this > >> issue. > > > > I'm worried that this only solves this one case, and it means that there > > are two (potentially conflicting) sources of information that a > > bootloader might be using -- the ELF or the Image header. I don't want > > to have to duplicate text_offset and so on, which implies that parsing > > the Image header is necessary anyway. > > > > That's something we can discuss if you send a patch (inline rather than > > attached). > > > > I think updating the linker script to put the page tables into a > .pgdir section is reasonable, since it is part of the static footprint > of the kernel. I agree > However, I strongly feel that the Image header should remain the > authoritative source of information regarding the nature (big/little > endian, page size) and the static footprint of the Image . Agreed, and there are other ways to de-duplicate which will still work with binary image inputs. ^ permalink raw reply [flat|nested] 10+ messages in thread
* arm64 boot requirements 2015-12-02 13:46 ` Carl van Schaik @ 2015-12-03 12:24 ` Mark Rutland 0 siblings, 0 replies; 10+ messages in thread From: Mark Rutland @ 2015-12-03 12:24 UTC (permalink / raw) To: linux-arm-kernel On Thu, Dec 03, 2015 at 12:46:35AM +1100, Carl van Schaik wrote: > Hi, > > On 1 December 2015 Ard Biesheuvel <ard.biesheuvel@linaro.org> wote: > > On 1 December 2015 at 12:02, Mark Rutland <mark.rutland@arm.com> > > wrote: > > > Hi, > > > > > > On Mon, Nov 30, 2015 at 12:45:18PM +1100, Carl van Schaik wrote: > > >> In commit bd00cd5f8c8c3c282bb1e1eac6a6679a4f808091, the > > idmap_pg_dir > > >> and swapper_pg_dir where moved from before the kernel to after it. > > >> > > >> The problem is that these symbols fall outside the range covered by > > >> the ELF file - outside of any section. > > >> > > >> A bootloader which loads the kernel ELF file and dynamically > > >> determines where to place the DTB, may try place it after the > > >> kernel. We've just run into this problem and the DTB gets > > >> overwritten as soon as the pagetables are created. > > > > Could you explain why you are using the ELF file and not the binary image > > file? > > This is not future proof: currently, the Image is a straight binary > > objcopy of vmlinux, but that is not guaranteed to remain that way. > > Things like KASLR may require post build steps that mangle vmlinux or > > Image afterwards. > > The reason we've been using ELF files is mostly to do with legacy virtualization > related reasons in our systems, we used to patch symbols in the ELFs for example > pre device-tree. However, since it hadn't caused problems until now we had > continued to use it. We haven't yet added Aarch64 Linux boot image header parsing > but it should be trivial. > > The other area we are looking into is optimized multi-VM static boot images by > constructing hypervisor-bundle images containing de-duplicated Linux sections, > allowing an ELF bootloader to populate multiple Linux VMs from a smaller boot > image - resulting in faster boot. Ok. Per Ard's comments, this may get broken in future by KASLR or similar; we cannot make strong guarantees as to the vmlinux being directly usable. That's a different discussion, though... > > >> I'd suggest that the kernel either: > > >> A. document this boot requirement for where not to load a DTB > > > > > > Do you have any particular suggestion? > > > > > > We already describe the Image footprint (including BSS and page tables) > > > by the image_size in the Image header, which is sufficient. The size of > > > the BSS and page tables is effectively unbound, so we can't define some > > > upper bound that will always be true. > > > > > > The documentation is written on the assumption that an Image file is > > > being used rather than a vmlinux. Perhaps that is something to consider. > > > > > >> B. update the vmlinux.lds.S such that these symbols (and _end) are > > >> properly covered by a section in the ELF, and thus preventing this > > >> issue. > > > > > > I'm worried that this only solves this one case, and it means that there > > > are two (potentially conflicting) sources of information that a > > > bootloader might be using -- the ELF or the Image header. I don't want > > > to have to duplicate text_offset and so on, which implies that parsing > > > the Image header is necessary anyway. > > > > > > That's something we can discuss if you send a patch (inline rather than > > > attached). > > > > > > > I think updating the linker script to put the page tables into a > > .pgdir section is reasonable, since it is part of the static footprint > > of the kernel. > > I agree Ok. As above, please send a standalone, inline patch for this. Please Cc at least myself, Ard, and Catalin. We can have any further discussion there. > > However, I strongly feel that the Image header should remain the > > authoritative source of information regarding the nature (big/little > > endian, page size) and the static footprint of the Image . > > Agreed, and there are other ways to de-duplicate which will still work > with binary image inputs. I completely agree that the Image is the canonical source of information. Thanks, Mark. ^ permalink raw reply [flat|nested] 10+ messages in thread
* arm64 boot requirements 2015-12-01 11:02 ` Mark Rutland 2015-12-01 11:52 ` Ard Biesheuvel @ 2015-12-01 12:50 ` Pratyush Anand 2015-12-02 19:03 ` Geoff Levand 2 siblings, 0 replies; 10+ messages in thread From: Pratyush Anand @ 2015-12-01 12:50 UTC (permalink / raw) To: linux-arm-kernel On 01/12/2015:11:02:55 AM, Mark Rutland wrote: > Pratyush, Geoff, I understood you were loading the kernel vmlinux for > kexec. Do you parse the Image header to figure out where to place > things? Yes, ARM64 kexec-tools supports both elf and binary image loading and in both the cases text_offset and image_size is calculated from image header. Location for other segments like initrd or DTB are calculatd accordingly [1] ~Pratyush [1] http://git.kernel.org/cgit/linux/kernel/git/geoff/kexec-tools.git/tree/kexec/arch/arm64/kexec-arm64.c#n585 ^ permalink raw reply [flat|nested] 10+ messages in thread
* arm64 boot requirements 2015-12-01 11:02 ` Mark Rutland 2015-12-01 11:52 ` Ard Biesheuvel 2015-12-01 12:50 ` Pratyush Anand @ 2015-12-02 19:03 ` Geoff Levand 2015-12-03 12:29 ` Mark Rutland 2 siblings, 1 reply; 10+ messages in thread From: Geoff Levand @ 2015-12-02 19:03 UTC (permalink / raw) To: linux-arm-kernel Hi, On Tue, 2015-12-01 at 11:02 +0000, Mark Rutland wrote: > Pratyush, Geoff, I understood you were loading the kernel vmlinux for > kexec. Do you parse the Image header to figure out where to place > things? Yes, in the kexec user tools we use text_offset to make enough room for the kernel, but there is also the need for page_offset. We need to know the page_offset to be able to do virtual to physical address conversions. We can calculate the page_offset for a vmlinux image as page_offset = phdr->p_vaddr - text_offset. The binary Image currently has no info about page_offset or virtual addressing. We have a kexec-tools option for the user to specify a page_offset. If that option is not provided we try to look at the running kernel's symbols, and if that fails, fall back to a default page_offset. This is less than ideal, and certainly makes the binary Image less appealing to use with kexec. -Geoff ^ permalink raw reply [flat|nested] 10+ messages in thread
* arm64 boot requirements 2015-12-02 19:03 ` Geoff Levand @ 2015-12-03 12:29 ` Mark Rutland 0 siblings, 0 replies; 10+ messages in thread From: Mark Rutland @ 2015-12-03 12:29 UTC (permalink / raw) To: linux-arm-kernel On Wed, Dec 02, 2015 at 11:03:48AM -0800, Geoff Levand wrote: > Hi, > > On Tue, 2015-12-01 at 11:02 +0000, Mark Rutland wrote: > > Pratyush, Geoff, I understood you were loading the kernel vmlinux for > > kexec. Do you parse the Image header to figure out where to place > > things? > > Yes, in the kexec user tools we use text_offset to make > enough room for the kernel, but there is also the need for > page_offset. > > We need to know the page_offset to be able to do virtual to > physical address conversions. We can calculate the page_offset > for a vmlinux image as page_offset = phdr->p_vaddr - text_offset. I don't understadn why you need to do that. Is that just just so you can figure out where to load the segments physically? > The binary Image currently has no info about page_offset or > virtual addressing. We have a kexec-tools option for the > user to specify a page_offset. If that option is not provided > we try to look at the running kernel's symbols, and if that > fails, fall back to a default page_offset. This is less than > ideal, and certainly makes the binary Image less appealing to > use with kexec. I don't follow at all why this complication is necessary. I don't think that the kexec tools should be looking at the kernel symbols in that manner, and I don't think that we need to expose the page offset via the Image header. The first loaded address in the vmlinux corresponds to PHYS_OFFSET + TEXT_OFFSET. If you know that, you can figure out an offset to apply to VAs to convert them to PAs when loading. What am I missing? Thanks, Mark. ^ permalink raw reply [flat|nested] 10+ messages in thread
end of thread, other threads:[~2015-12-03 12:29 UTC | newest] Thread overview: 10+ messages (download: mbox.gz follow: Atom feed -- links below jump to the message on this page -- 2015-11-30 1:45 arm64 boot requirements Carl van Schaik 2015-12-01 11:02 ` Mark Rutland 2015-12-01 11:52 ` Ard Biesheuvel 2015-12-01 22:09 ` Christopher Covington 2015-12-02 10:26 ` Mark Rutland 2015-12-02 13:46 ` Carl van Schaik 2015-12-03 12:24 ` Mark Rutland 2015-12-01 12:50 ` Pratyush Anand 2015-12-02 19:03 ` Geoff Levand 2015-12-03 12:29 ` Mark Rutland
This is a public inbox, see mirroring instructions for how to clone and mirror all data and code used for this inbox; as well as URLs for NNTP newsgroup(s).