From: "Michael S. Tsirkin" <mst@redhat.com>
To: Igor Mammedov <imammedo@redhat.com>
Cc: pbonzini@redhat.com, kevin@koconnor.net, seabios@seabios.org,
kraxel@redhat.com, qemu-devel@nongnu.org
Subject: Re: [Qemu-devel] [PATCH v2] map 64-bit PCI BARs at location provided by emulator
Date: Sun, 13 Oct 2013 23:28:47 +0300 [thread overview]
Message-ID: <20131013202847.GB12354@redhat.com> (raw)
In-Reply-To: <20131013193319.54578867@thinkpad>
On Sun, Oct 13, 2013 at 07:33:19PM +0200, Igor Mammedov wrote:
> On Sun, 13 Oct 2013 19:46:09 +0300
> "Michael S. Tsirkin" <mst@redhat.com> wrote:
>
> > On Sun, Oct 13, 2013 at 06:23:28PM +0200, Igor Mammedov wrote:
> > > On Sun, 13 Oct 2013 18:59:20 +0300
> > > "Michael S. Tsirkin" <mst@redhat.com> wrote:
> > >
> > > > On Sun, Oct 13, 2013 at 05:11:54PM +0200, Igor Mammedov wrote:
> > > > > On Sun, 13 Oct 2013 15:31:23 +0300
> > > > > "Michael S. Tsirkin" <mst@redhat.com> wrote:
> > > > >
> > > > > > On Sun, Oct 13, 2013 at 02:13:44PM +0200, Igor Mammedov wrote:
> > > > > > > Currently 64-bit PCI BARs are unconditionally mapped by BIOS right
> > > > > > > over 4G + RamSizeOver4G location, which doesn't allow to reserve
> > > > > > > extra space before 64-bit PCI window. For memory hotplug an extra
> > > > > > > RAM space might be reserved after present 64-bit RAM end and BIOS
> > > > > > > should map 64-bit PCI BARs after it.
> > > > > > >
> > > > > > > Introduce "etc/pcimem64-start" romfile to provide BIOS a hint
> > > > > > > where it should start mapping of 64-bit PCI BARs. If romfile is
> > > > > > > missing, BIOS reverts to legacy behavior and starts mapping right
> > > > > > > after high memory.
> > > > > > >
> > > > > > > Signed-off-by: Igor Mammedov <imammedo@redhat.com>
> > > > > > > ---
> > > > > > > v2:
> > > > > > > * place 64-bit window behind high RAM end if "etc/pcimem64-start"
> > > > > > > points below it.
> > > > > >
> > > > > > Hmm I had an alternative suggestion of passing smbios
> > > > > > tables in from QEMU (seabios already has a mechanism for this).
> > > > > > We could then simply increase the existing RamSize variable.
> > > > > >
> > > > > > What do you think about this idea?
> > > > > There is several issues with RamSizeOver4G I see:
> > > > > 1. As Gerd pointed out it is also used to setup a part of e820 tables.
> > > > > It possibly could be also changed see "[Qemu-devel] [RfC PATCH] e820: pass
> > > > > high memory too" but it will be more complicated to keep seabios
> > > > > backward/forward compatible if we use RamSizeOver4G for passing.
> > > >
> > > > What are the compatibility worries, exactly?
> > > >
> > > > The way I see it, if QEMU wants to be compatible it can't
> > > > use memory hotplug. In that case RamSizeOver4G will have
> > > > the old meaning which should work without issues.
> > > QEMU can't know what BIOS version is used and error out in wrong case, and that
> > > could cause hard to guess what going wrong bugs in the guest.
> > >
> > > >
> > > > > 2. It doesn't scale if we are going to use more than 1Tb future.
> > > > > Extending RamSizeOver4G value via additional CMOS port wasn't welcomed
> > > > > either, see
> > > > > http://www.seabios.org/pipermail/seabios/2010-September/000911.html
> > > > >
> > > > > It looks like fw_cfg interface is preferred one and "etc/pcimem64-start"
> > > > > approach doesn't have backward/forward compatibility issues.
> > > >
> > > > Hmm but you aren't passing in memory size - pcimem64-start
> > > > merely tells bios where to put PCI devices.
> > > > So how would it help?
> > > If a lot of memory is reserved for memory hotplug (terabytes), that could
> > > put window behind what RamSizeOver4G could address it.
> >
> > All we care about for a compatible bios is that it
> > keep working in old hardware configurations.
> > IMO there's no real need for old BIOS to work
> > with e.g. memory hotplug - it's reasonable
> > to ask users to update fitmware if they want
> > to use this feature.
> Fine with me if we don't care about strange hangs if something misused.
>
> But RamSizeOver4G still doesn't allow correctly place PCI window if large
> amount of memory present/reserved.
I'm fine with adding a new fw cfg really.
I am however not sure we should have a single new base64 flag.
Hardware has a set of ranges where guest can
put PCI devices - would we be better off with
an interface that lists all these ranges?
I'm not saying we should but this needs a bit of thought.
> >
> > > > > >
> > > > > >
> > > > > > > ---
> > > > > > > src/fw/pciinit.c | 13 ++++++++++++-
> > > > > > > 1 file changed, 12 insertions(+), 1 deletion(-)
> > > > > > >
> > > > > > > diff --git a/src/fw/pciinit.c b/src/fw/pciinit.c
> > > > > > > index b29db99..798309b 100644
> > > > > > > --- a/src/fw/pciinit.c
> > > > > > > +++ b/src/fw/pciinit.c
> > > > > > > @@ -18,6 +18,8 @@
> > > > > > > #include "paravirt.h" // RamSize
> > > > > > > #include "string.h" // memset
> > > > > > > #include "util.h" // pci_setup
> > > > > > > +#include "byteorder.h" // le64_to_cpu
> > > > > > > +#include "romfile.h" // romfile_loadint
> > > > > > >
> > > > > > > #define PCI_DEVICE_MEM_MIN 0x1000
> > > > > > > #define PCI_BRIDGE_IO_MIN 0x1000
> > > > > > > @@ -764,6 +766,15 @@ static void pci_bios_map_devices(struct pci_bus *busses)
> > > > > > > {
> > > > > > > if (pci_bios_init_root_regions(busses)) {
> > > > > > > struct pci_region r64_mem, r64_pref;
> > > > > > > + u64 ram64_end = 0x100000000ULL + RamSizeOver4G;
> > > > > > > + u64 base64 = le64_to_cpu(romfile_loadint("etc/pcimem64-start",
> > > > > > > + ram64_end));
> > > > > > > + if (base64 < ram64_end) {
> > > > > > > + dprintf(1, "ignorig etc/pcimem64-start [0x%llx] below present RAM, "
> > > > > > > + "placing 64-bit PCI window behind RAM end: %llx",
> > > > > > > + base64, ram64_end);
> > > > > > > + base64 = ram64_end;
> > > > > > > + }
> > > > > > > r64_mem.list.first = NULL;
> > > > > > > r64_pref.list.first = NULL;
> > > > > > > pci_region_migrate_64bit_entries(&busses[0].r[PCI_REGION_TYPE_MEM],
> > > > > > > @@ -779,7 +790,7 @@ static void pci_bios_map_devices(struct pci_bus *busses)
> > > > > > > u64 align_mem = pci_region_align(&r64_mem);
> > > > > > > u64 align_pref = pci_region_align(&r64_pref);
> > > > > > >
> > > > > > > - r64_mem.base = ALIGN(0x100000000LL + RamSizeOver4G, align_mem);
> > > > > > > + r64_mem.base = ALIGN(base64, align_mem);
> > > > > > > r64_pref.base = ALIGN(r64_mem.base + sum_mem, align_pref);
> > > > > > > pcimem64_start = r64_mem.base;
> > > > > > > pcimem64_end = r64_pref.base + sum_pref;
> > > > > > > --
> > > > > > > 1.8.3.1
> > > > > >
> > > > >
> > > > >
> > > > > --
> > > > > Regards,
> > > > > Igor
> > >
> > >
> > > --
> > > Regards,
> > > Igor
>
>
> --
> Regards,
> Igor
next prev parent reply other threads:[~2013-10-13 20:26 UTC|newest]
Thread overview: 30+ messages / expand[flat|nested] mbox.gz Atom feed top
2013-10-13 12:13 [Qemu-devel] [PATCH v2] map 64-bit PCI BARs at location provided by emulator Igor Mammedov
2013-10-13 12:31 ` Michael S. Tsirkin
2013-10-13 15:11 ` Igor Mammedov
2013-10-13 15:59 ` Michael S. Tsirkin
2013-10-13 16:23 ` Igor Mammedov
2013-10-13 16:46 ` Michael S. Tsirkin
2013-10-13 17:33 ` Igor Mammedov
2013-10-13 18:19 ` [Qemu-devel] [SeaBIOS] " Igor Mammedov
2013-10-13 19:53 ` Kevin O'Connor
2013-10-14 8:01 ` Gerd Hoffmann
2013-10-13 20:28 ` Michael S. Tsirkin [this message]
2013-10-14 10:27 ` [Qemu-devel] " Igor Mammedov
2013-10-14 11:00 ` Michael S. Tsirkin
2013-10-14 12:16 ` Gerd Hoffmann
2013-10-14 12:38 ` Michael S. Tsirkin
2013-10-14 13:04 ` Gerd Hoffmann
2013-10-14 14:00 ` Michael S. Tsirkin
2013-10-14 16:15 ` Igor Mammedov
2013-10-14 16:37 ` Michael S. Tsirkin
2013-10-15 8:01 ` Gerd Hoffmann
2013-10-15 9:05 ` Igor Mammedov
2013-10-15 9:14 ` [Qemu-devel] [SeaBIOS] " Gerd Hoffmann
2013-10-15 12:36 ` Igor Mammedov
2013-10-15 9:16 ` [Qemu-devel] " Michael S. Tsirkin
2013-10-15 9:24 ` Gerd Hoffmann
2013-10-15 9:53 ` Igor Mammedov
2013-10-15 9:47 ` Igor Mammedov
2013-10-15 9:08 ` Michael S. Tsirkin
2013-10-14 12:28 ` Igor Mammedov
2013-10-13 15:15 ` Kevin O'Connor
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=20131013202847.GB12354@redhat.com \
--to=mst@redhat.com \
--cc=imammedo@redhat.com \
--cc=kevin@koconnor.net \
--cc=kraxel@redhat.com \
--cc=pbonzini@redhat.com \
--cc=qemu-devel@nongnu.org \
--cc=seabios@seabios.org \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).