qemu-devel.nongnu.org archive mirror
 help / color / mirror / Atom feed
From: Xiao Guangrong <guangrong.xiao@linux.intel.com>
To: Igor Mammedov <imammedo@redhat.com>
Cc: vsementsov@virtuozzo.com, ehabkost@redhat.com,
	kvm@vger.kernel.org, mst@redhat.com, gleb@kernel.org,
	mtosatti@redhat.com, qemu-devel@nongnu.org, stefanha@redhat.com,
	pbonzini@redhat.com, dan.j.williams@intel.com, rth@twiddle.net
Subject: Re: [Qemu-devel] [PATCH v7 25/35] nvdimm acpi: init the resource used by NVDIMM ACPI
Date: Fri, 6 Nov 2015 16:31:43 +0800	[thread overview]
Message-ID: <563C656F.3030808@linux.intel.com> (raw)
In-Reply-To: <20151105154906.659779b5@nial.brq.redhat.com>



On 11/05/2015 10:49 PM, Igor Mammedov wrote:
> On Thu, 5 Nov 2015 21:33:39 +0800
> Xiao Guangrong <guangrong.xiao@linux.intel.com> wrote:
>
>>
>>
>> On 11/05/2015 09:03 PM, Igor Mammedov wrote:
>>> On Thu, 5 Nov 2015 18:15:31 +0800
>>> Xiao Guangrong <guangrong.xiao@linux.intel.com> wrote:
>>>
>>>>
>>>>
>>>> On 11/05/2015 05:58 PM, Igor Mammedov wrote:
>>>>> On Mon,  2 Nov 2015 17:13:27 +0800
>>>>> Xiao Guangrong <guangrong.xiao@linux.intel.com> wrote:
>>>>>
>>>>>> A page staring from 0xFF00000 and IO port 0x0a18 - 0xa1b in guest are
>>>>>                                   ^^ missing one 0???
>>>>>
>>>>>> reserved for NVDIMM ACPI emulation, refer to docs/specs/acpi_nvdimm.txt
>>>>>> for detailed design
>>>>>>
>>>>>> A parameter, 'nvdimm-support', is introduced for PIIX4_PM and ICH9-LPC
>>>>>> that controls if nvdimm support is enabled, it is true on default and
>>>>>> it is false on 2.4 and its earlier version to keep compatibility
>>>>>>
>>>>>> Signed-off-by: Xiao Guangrong <guangrong.xiao@linux.intel.com>
>>>>> [...]
>>>>>
>>>>>> @@ -33,6 +33,15 @@
>>>>>>      */
>>>>>>     #define MIN_NAMESPACE_LABEL_SIZE      (128UL << 10)
>>>>>>
>>>>>> +/*
>>>>>> + * A page staring from 0xFF00000 and IO port 0x0a18 - 0xa1b in guest are
>>>>>                                     ^^^ missing 0 or value in define below has an extra 0
>>>>>
>>>>>> + * reserved for NVDIMM ACPI emulation, refer to docs/specs/acpi_nvdimm.txt
>>>>>> + * for detailed design.
>>>>>> + */
>>>>>> +#define NVDIMM_ACPI_MEM_BASE          0xFF000000ULL
>>>>> it still maps RAM at arbitrary place,
>>>>> that's the reason why VMGenID patches hasn't been merged for
>>>>> more than several months.
>>>>> I'm not against of using (more exactly I'm for it) direct mapping
>>>>> but we should reach consensus when and how to use it first.
>>>>>
>>>>> I'd wouldn't use addresses below 4G as it may be used firmware or
>>>>> legacy hardware and I won't bet that 0xFF000000ULL won't conflict
>>>>> with anything.
>>>>> An alternative place to allocate reserve from could be high memory.
>>>>> For pc we have "reserved-memory-end" which currently makes sure
>>>>> that hotpluggable memory range isn't used by firmware.
>>>>>
>>>>> How about making API that allows to map additional memory
>>>>> ranges before reserved-memory-end and pushes it up as mappings are
>>>>> added.
>>>>
>>>> That what i did in the v1/v2 versions, however, as you noticed, using 64-bit
>>>> address in ACPI in QEMU is not a easy work - we can not simply make
>>>> SSDT.rev = 2 to apply 64 bit address, the reason i have documented in
>>>> v3's changelog:
>>>>
>>>>      3) we figure out a unused memory hole below 4G that is 0xFF00000 ~
>>>>         0xFFF00000, this range is large enough for NVDIMM ACPI as build 64-bit
>>>>         ACPI SSDT/DSDT table will break windows XP.
>>>>         BTW, only make SSDT.rev = 2 can not work since the width is only depended
>>>>         on DSDT.rev based on 19.6.28 DefinitionBlock (Declare Definition Block)
>>>>         in ACPI spec:
>>>> | Note: For compatibility with ACPI versions before ACPI 2.0, the bit
>>>> | width of Integer objects is dependent on the ComplianceRevision of the DSDT.
>>>> | If the ComplianceRevision is less than 2, all integers are restricted to 32
>>>> | bits. Otherwise, full 64-bit integers are used. The version of the DSDT sets
>>>> | the global integer width for all integers, including integers in SSDTs.
>>>>      4) use the lowest ACPI spec version to document AML terms.
>>>>
>>>> The only way introducing 64 bit address is adding XSDT support that what
>>>> Michael did before, however, it seems it need great efforts to do it as
>>>> it will break OVMF. It's a long term workload. :(
>>> to enable 64-bit integers in AML it's sufficient to change DSDT revision to 2,
>>> I already have a patch that switches DSDT/SSDT to rev2.
>>> Tests show it doesn't break WindowsXP (which is rev1) and uses 64-bit integers
>>> on linux & later Windows versions.
>>
>> Great, i remembered i did the similar test (directly change DSDT to rev2) and it
>> caused winXP blue screen. Could you please tell me where i can find your patch?
> https://github.com/imammedo/qemu/commits/mhpt_table_v2
> following changes revision:
>   pc: acpi: bump DSDT/SSDT compliance revision to v2
> and here is user:
>   acpi: memhp: simplify MCRS() using 64-bit math
>
> when writing ASL one shall make sure that only XP supported
> features are in global scope, which is evaluated when tables
> are loaded and features of rev2 and higher are inside methods.
> That way XP doesn't crash as far as it doesn't evaluate unsupported
> opcode and one can guard those opcodes checking _REV object if neccesary.
>

Really a good study case to me, i tried your patch and moved the 64 bit
staffs to the private method, it worked. :)

Igor, is this the API you want?

diff --git a/hw/i386/pc.c b/hw/i386/pc.c
index 6bf569a..aba29df 100644
--- a/hw/i386/pc.c
+++ b/hw/i386/pc.c
@@ -1291,6 +1291,38 @@ FWCfgState *xen_load_linux(PCMachineState *pcms,
      return fw_cfg;
  }

+static void pc_reserve_high_memory_init(PCMachineState *pcms,
+                                        uint64_t base, uint64_t align)
+{
+    pcms->reserve_high_memory.current_addr = ROUND_UP(base, align);
+}
+
+static uint64_t
+pc_reserve_high_memory_end(PCMachineState *pcms, int64_t align)
+{
+    return ROUND_UP(pcms->reserve_high_memory.current_addr, align);
+}
+
+uint64_t pc_reserve_high_memory(PCMachineState *pcms, uint64_t size,
+                                int64_t align, Error **errp)
+{
+    uint64_t end_addr, current_addr = pcms->reserve_high_memory.current_addr;
+
+    if (!current_addr) {
+        error_setg(errp, "reserved high memory is not initialized.");
+        return 0;
+    }
+
+    end_addr = pc_reserve_high_memory_end(pcms, align) + size;
+    if (current_addr > end_addr) {
+        error_setg(errp, "reserved high memory is not enough.");
+        return 0;
+    }
+
+    pcms->reserve_high_memory.current_addr = end_addr;
+    return end_addr;
+}
+
  FWCfgState *pc_memory_init(PCMachineState *pcms,
                             MemoryRegion *system_memory,
                             MemoryRegion *rom_memory,
@@ -1379,6 +1411,8 @@ FWCfgState *pc_memory_init(PCMachineState *pcms,
                             "hotplug-memory", hotplug_mem_size);
          memory_region_add_subregion(system_memory, pcms->hotplug_memory.base,
                                      &pcms->hotplug_memory.mr);
+        pc_reserve_high_memory_init(pcms, pcms->hotplug_memory.base +
+                                    hotplug_mem_size, 1ULL << 30);
      }

      /* Initialize PC system firmware */
@@ -1403,7 +1437,7 @@ FWCfgState *pc_memory_init(PCMachineState *pcms,
          uint64_t res_mem_end = pcms->hotplug_memory.base;

          if (!pcmc->broken_reserved_end) {
-            res_mem_end += memory_region_size(&pcms->hotplug_memory.mr);
+            res_mem_end = pc_reserve_high_memory_end(pcms, 0x1ULL << 30);
          }
          *val = cpu_to_le64(ROUND_UP(res_mem_end, 0x1ULL << 30));
          fw_cfg_add_file(fw_cfg, "etc/reserved-memory-end", val, sizeof(*val));
diff --git a/include/hw/i386/pc.h b/include/hw/i386/pc.h
index 47162cf..fae3fea 100644
--- a/include/hw/i386/pc.h
+++ b/include/hw/i386/pc.h
@@ -20,6 +20,11 @@

  #define HPET_INTCAP "hpet-intcap"

+struct PCReserveHighMemory {
+    uint64_t current_addr;
+};
+typedef struct PCReserveHighMemory PCReserveHighMemory;
+
  /**
   * PCMachineState:
   * @acpi_dev: link to ACPI PM device that performs ACPI hotplug handling
@@ -41,6 +46,7 @@ struct PCMachineState {
      OnOffAuto smm;
      bool enforce_aligned_dimm;
      ram_addr_t below_4g_mem_size, above_4g_mem_size;
+    PCReserveHighMemory reserve_high_memory;
  };

  #define PC_MACHINE_ACPI_DEVICE_PROP "acpi-device"
@@ -175,6 +181,9 @@ PcGuestInfo *pc_guest_info_init(PCMachineState *pcms);

  void pc_set_legacy_acpi_data_size(void);

+uint64_t pc_reserve_high_memory(PCMachineState *pcms, uint64_t size,
+                                int64_t align, Error **errp);
+
  #define PCI_HOST_PROP_PCI_HOLE_START   "pci-hole-start"
  #define PCI_HOST_PROP_PCI_HOLE_END     "pci-hole-end"
  #define PCI_HOST_PROP_PCI_HOLE64_START "pci-hole64-start"


>
>>>>
>>>> The luck thing is, the ACPI part is not ABI, we can move it to the high
>>>> memory if QEMU supports XSDT is ready in future development.
>>> But mapped control region is ABI and we can't change it if we find out later
>>> that it breaks something.
>>
>> But the ACPI code is completely built by QEMU, which is transparent to guest
>> and guest should not depend on it, no?
> unfortunately no, think about cross-version migration.

It makes sense. Stupid me. :(

  reply	other threads:[~2015-11-06  8:38 UTC|newest]

Thread overview: 101+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2015-11-02  9:13 [Qemu-devel] [PATCH v7 00/35] implement vNVDIMM Xiao Guangrong
2015-11-02  9:13 ` [Qemu-devel] [PATCH v7 01/35] acpi: add aml_derefof Xiao Guangrong
2015-11-02  9:13 ` [Qemu-devel] [PATCH v7 02/35] acpi: add aml_sizeof Xiao Guangrong
2015-11-02  9:13 ` [Qemu-devel] [PATCH v7 03/35] acpi: add aml_create_field Xiao Guangrong
2015-11-03  6:14   ` Shannon Zhao
2015-11-03 14:52     ` Xiao Guangrong
2015-11-02  9:13 ` [Qemu-devel] [PATCH v7 04/35] acpi: add aml_concatenate Xiao Guangrong
2015-11-02  9:13 ` [Qemu-devel] [PATCH v7 05/35] acpi: add aml_object_type Xiao Guangrong
2015-11-02  9:13 ` [Qemu-devel] [PATCH v7 06/35] acpi: add aml_method_serialized Xiao Guangrong
2015-11-03 12:30   ` Igor Mammedov
2015-11-03 13:27     ` Xiao Guangrong
2015-11-02  9:13 ` [Qemu-devel] [PATCH v7 07/35] util: introduce qemu_file_get_page_size() Xiao Guangrong
2015-11-02 13:56   ` Vladimir Sementsov-Ogievskiy
2015-11-06 15:36   ` Eduardo Habkost
2015-11-09  4:36     ` Xiao Guangrong
2015-11-09 18:34       ` Eduardo Habkost
2015-11-02  9:13 ` [Qemu-devel] [PATCH v7 08/35] exec: allow memory to be allocated from any kind of path Xiao Guangrong
2015-11-02 14:51   ` Vladimir Sementsov-Ogievskiy
2015-11-02 15:22     ` Xiao Guangrong
2015-11-02 15:52       ` Vladimir Sementsov-Ogievskiy
2015-11-03 23:00   ` Eduardo Habkost
2015-11-04  3:12     ` Xiao Guangrong
2015-11-04 12:40       ` Eduardo Habkost
2015-11-04 14:22         ` Xiao Guangrong
2015-11-02  9:13 ` [Qemu-devel] [PATCH v7 09/35] exec: allow file_ram_alloc to work on file Xiao Guangrong
2015-11-02 15:12   ` Vladimir Sementsov-Ogievskiy
2015-11-02 15:25     ` Xiao Guangrong
2015-11-02 15:58       ` Vladimir Sementsov-Ogievskiy
2015-11-02 21:12   ` Paolo Bonzini
2015-11-03  3:56     ` Xiao Guangrong
2015-11-03 13:55       ` Paolo Bonzini
2015-11-03 14:26         ` Xiao Guangrong
2015-11-03 12:34   ` Igor Mammedov
2015-11-03 13:32     ` Xiao Guangrong
2015-11-02  9:13 ` [Qemu-devel] [PATCH v7 10/35] hostmem-file: clean up memory allocation Xiao Guangrong
2015-11-02  9:13 ` [Qemu-devel] [PATCH v7 11/35] util: introduce qemu_file_getlength() Xiao Guangrong
2015-11-02 15:26   ` Vladimir Sementsov-Ogievskiy
2015-11-03 23:21   ` Eduardo Habkost
2015-11-04  3:17     ` Xiao Guangrong
2015-11-04 14:44       ` Eduardo Habkost
2015-11-04 14:44         ` Xiao Guangrong
2015-11-06 15:50   ` Eduardo Habkost
2015-11-09  4:44     ` Xiao Guangrong
2015-11-09 19:21       ` Eduardo Habkost
2015-11-02  9:13 ` [Qemu-devel] [PATCH v7 12/35] util: let qemu_fd_getlength support block device Xiao Guangrong
2015-11-02 16:11   ` Vladimir Sementsov-Ogievskiy
2015-11-02 16:21     ` Xiao Guangrong
2015-11-06 15:44       ` Eduardo Habkost
2015-11-06 15:48   ` Eduardo Habkost
2015-11-06 15:54   ` Eduardo Habkost
2015-11-09  5:58     ` Xiao Guangrong
2015-11-09 18:43       ` Eduardo Habkost
2015-11-02  9:13 ` [Qemu-devel] [PATCH v7 13/35] hostmem-file: use whole file size if possible Xiao Guangrong
2015-11-02 17:09   ` Vladimir Sementsov-Ogievskiy
2015-11-03 14:51     ` Xiao Guangrong
2015-11-02  9:13 ` [Qemu-devel] [PATCH v7 14/35] pc-dimm: remove DEFAULT_PC_DIMMSIZE Xiao Guangrong
2015-11-02  9:13 ` [Qemu-devel] [PATCH v7 15/35] pc-dimm: make pc_existing_dimms_capacity static and rename it Xiao Guangrong
2015-11-02  9:13 ` [Qemu-devel] [PATCH v7 16/35] pc-dimm: drop the prefix of pc-dimm Xiao Guangrong
2015-11-02  9:13 ` [Qemu-devel] [PATCH v7 17/35] stubs: rename qmp_pc_dimm_device_list.c Xiao Guangrong
2015-11-02  9:13 ` [Qemu-devel] [PATCH v7 18/35] pc-dimm: rename pc-dimm.c and pc-dimm.h Xiao Guangrong
2015-11-02  9:13 ` [Qemu-devel] [PATCH v7 19/35] dimm: abstract dimm device from pc-dimm Xiao Guangrong
2015-11-02  9:13 ` [Qemu-devel] [PATCH v7 20/35] dimm: get mapped memory region from DIMMDeviceClass->get_memory_region Xiao Guangrong
2015-11-02 12:19   ` Vladimir Sementsov-Ogievskiy
2015-11-02 13:08     ` Xiao Guangrong
2015-11-02 14:26       ` Vladimir Sementsov-Ogievskiy
2015-11-02 15:06         ` Xiao Guangrong
2015-11-02 16:16           ` Vladimir Sementsov-Ogievskiy
2015-11-03 14:47             ` Xiao Guangrong
2015-11-05  8:53               ` Vladimir Sementsov-Ogievskiy
2015-11-05 17:29   ` Eduardo Habkost
2015-11-06  2:50     ` Xiao Guangrong
2015-11-02  9:13 ` [Qemu-devel] [PATCH v7 21/35] dimm: keep the state of the whole backend memory Xiao Guangrong
2015-11-02  9:13 ` [Qemu-devel] [PATCH v7 22/35] dimm: introduce realize callback Xiao Guangrong
2015-11-02  9:13 ` [Qemu-devel] [PATCH v7 23/35] nvdimm: implement NVDIMM device abstract Xiao Guangrong
2015-11-13 16:53   ` Vladimir Sementsov-Ogievskiy
2015-11-02  9:13 ` [Qemu-devel] [PATCH v7 24/35] docs: add NVDIMM ACPI documentation Xiao Guangrong
2015-11-02  9:13 ` [Qemu-devel] [PATCH v7 25/35] nvdimm acpi: init the resource used by NVDIMM ACPI Xiao Guangrong
2015-11-05  9:58   ` Igor Mammedov
2015-11-05 10:15     ` Xiao Guangrong
2015-11-05 13:03       ` Igor Mammedov
2015-11-05 13:33         ` Xiao Guangrong
2015-11-05 14:49           ` Igor Mammedov
2015-11-06  8:31             ` Xiao Guangrong [this message]
2015-11-06  8:56               ` Xiao Guangrong
2015-11-09 11:13               ` Igor Mammedov
2015-11-11  3:01                 ` [Qemu-devel] Ask for ACK (was Re: [PATCH v7 25/35] nvdimm acpi: init the resource used by NVDIMM ACPI) Xiao Guangrong
2015-11-02  9:13 ` [Qemu-devel] [PATCH v7 26/35] nvdimm acpi: build ACPI NFIT table Xiao Guangrong
2015-11-02  9:13 ` [Qemu-devel] [PATCH v7 27/35] nvdimm acpi: build ACPI nvdimm devices Xiao Guangrong
2015-11-03 13:13   ` Igor Mammedov
2015-11-03 14:22     ` Xiao Guangrong
2015-11-04  8:56       ` Igor Mammedov
2015-11-04 14:11         ` Xiao Guangrong
2015-11-02  9:13 ` [Qemu-devel] [PATCH v7 28/35] nvdimm acpi: save arg3 for NVDIMM device _DSM method Xiao Guangrong
2015-11-02  9:13 ` [Qemu-devel] [PATCH v7 29/35] nvdimm acpi: support function 0 Xiao Guangrong
2015-11-02  9:13 ` [Qemu-devel] [PATCH v7 30/35] nvdimm acpi: support Get Namespace Label Size function Xiao Guangrong
2015-11-02  9:13 ` [Qemu-devel] [PATCH v7 31/35] nvdimm acpi: support Get Namespace Label Data function Xiao Guangrong
2015-11-02  9:13 ` [Qemu-devel] [PATCH v7 32/35] nvdimm acpi: support Set " Xiao Guangrong
2015-11-02  9:13 ` [Qemu-devel] [PATCH v7 33/35] nvdimm: allow using whole backend memory as pmem Xiao Guangrong
2015-11-02  9:13 ` [Qemu-devel] [PATCH v7 34/35] nvdimm acpi: support _FIT method Xiao Guangrong
2015-11-02  9:13 ` [Qemu-devel] [PATCH v7 35/35] nvdimm: add maintain info Xiao Guangrong
2015-11-02 11:51 ` [Qemu-devel] [PATCH v7 00/35] implement vNVDIMM Stefan Hajnoczi

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=563C656F.3030808@linux.intel.com \
    --to=guangrong.xiao@linux.intel.com \
    --cc=dan.j.williams@intel.com \
    --cc=ehabkost@redhat.com \
    --cc=gleb@kernel.org \
    --cc=imammedo@redhat.com \
    --cc=kvm@vger.kernel.org \
    --cc=mst@redhat.com \
    --cc=mtosatti@redhat.com \
    --cc=pbonzini@redhat.com \
    --cc=qemu-devel@nongnu.org \
    --cc=rth@twiddle.net \
    --cc=stefanha@redhat.com \
    --cc=vsementsov@virtuozzo.com \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).