From: Xiao Guangrong <guangrong.xiao@linux.intel.com>
To: Igor Mammedov <imammedo@redhat.com>
Cc: ehabkost@redhat.com, kvm@vger.kernel.org, mst@redhat.com,
gleb@kernel.org, mtosatti@redhat.com, qemu-devel@nongnu.org,
stefanha@redhat.com, pbonzini@redhat.com, rth@twiddle.net
Subject: Re: [Qemu-devel] [PATCH v2 08/18] nvdimm: init backend memory mapping and config data area
Date: Tue, 8 Sep 2015 21:38:17 +0800 [thread overview]
Message-ID: <55EEE4C9.4090400@linux.intel.com> (raw)
In-Reply-To: <20150907161155.154a02f8@nial.brq.redhat.com>
On 09/07/2015 10:11 PM, Igor Mammedov wrote:
> On Fri, 14 Aug 2015 22:52:01 +0800
> Xiao Guangrong <guangrong.xiao@linux.intel.com> wrote:
>
>> The parameter @file is used as backed memory for NVDIMM which is
>> divided into two parts if @dataconfig is true:
>> - first parts is (0, size - 128K], which is used as PMEM (Persistent
>> Memory)
>> - 128K at the end of the file, which is used as Config Data Area, it's
>> used to store Label namespace data
>>
>> The @file supports both regular file and block device, of course we
>> can assign any these two kinds of files for test and emulation, however,
>> in the real word for performance reason, we usually used these files as
>> NVDIMM backed file:
>> - the regular file in the filesystem with DAX enabled created on NVDIMM
>> device on host
>> - the raw PMEM device on host, e,g /dev/pmem0
>
> A lot of code in this series could reuse what QEMU already
> uses for implementing pc-dimm devices.
>
> here is common concepts that could be reused.
> - on physical system both DIMM and NVDIMM devices use
> the same slots. We could share QEMU's '-m slots' option between
> both devices. An alternative to not sharing would be to introduce
> '-machine nvdimm_slots' option.
> And yes, we need to know number of NVDIMMs to describe
> them all in ACPI table (taking in amount future hotplug
> include in this possible NVDIMM devices)
> I'd go the same way as on real hardware on make them share the same slots.
I'd prefer sharing slots for pc-dimm and nvdimm, it's easier to reuse the
logic of slot-assignment and plug/unplug.
> - they share the same physical address space and limits
> on how much memory system can handle. So I'd suggest sharing existing
> '-m maxmem' option and reuse hotplug_memory address space.
Sounds good to me.
>
> Essentially what I'm suggesting is to inherit NVDIMM's implementation
> from pc-dimm reusing all of its code/backends and
> just override parts that do memory mapping into guest's address space to
> accommodate NVDIMM's requirements.
Good idea!
We have to differentiate pc-dimm and nvdimm in the common code and nvdimm
has different points with pc-dimm (for example, its has reserved-region, and
need support live migration of label data). How about rename 'pc-nvdimm' to
'memory-device' and make it as a common device type, then build pc-dimm and
nvdimm on top of it?
Something like:
static TypeInfo memory_device_info = {
.name = TYPE_MEM_DEV,
.parent = TYPE_DEVICE,
};
static TypeInfo memory_device_info = {
.name = TYPE_PC_DIMM,
.parent = TYPE_MEM_DEV,
};
static TypeInfo memory_device_info = {
.name = TYPE_NVDIMM,
.parent = TYPE_MEM_DEV,
};
It also make CONIFG_NVDIMM and CONFIG_HOT_PLUG be independent.
>
>>
>> Signed-off-by: Xiao Guangrong <guangrong.xiao@linux.intel.com>
>> ---
>> hw/mem/nvdimm/pc-nvdimm.c | 109 ++++++++++++++++++++++++++++++++++++++++++++-
>> include/hw/mem/pc-nvdimm.h | 7 +++
>> 2 files changed, 115 insertions(+), 1 deletion(-)
>>
>> diff --git a/hw/mem/nvdimm/pc-nvdimm.c b/hw/mem/nvdimm/pc-nvdimm.c
>> index 7a270a8..97710d1 100644
>> --- a/hw/mem/nvdimm/pc-nvdimm.c
>> +++ b/hw/mem/nvdimm/pc-nvdimm.c
>> @@ -22,12 +22,20 @@
>> * License along with this library; if not, see <http://www.gnu.org/licenses/>
>> */
>>
>> +#include <sys/mman.h>
>> +#include <sys/ioctl.h>
>> +#include <linux/fs.h>
>> +
>> +#include "exec/address-spaces.h"
>> #include "hw/mem/pc-nvdimm.h"
>>
>> -#define PAGE_SIZE (1UL << 12)
>> +#define PAGE_SIZE (1UL << 12)
>> +
>> +#define MIN_CONFIG_DATA_SIZE (128 << 10)
>>
>> static struct nvdimms_info {
>> ram_addr_t current_addr;
>> + int device_index;
>> } nvdimms_info;
>>
>> /* the address range [offset, ~0ULL) is reserved for NVDIMM. */
>> @@ -37,6 +45,26 @@ void pc_nvdimm_reserve_range(ram_addr_t offset)
>> nvdimms_info.current_addr = offset;
>> }
>>
>> +static ram_addr_t reserved_range_push(uint64_t size)
>> +{
>> + uint64_t current;
>> +
>> + current = ROUND_UP(nvdimms_info.current_addr, PAGE_SIZE);
>> +
>> + /* do not have enough space? */
>> + if (current + size < current) {
>> + return 0;
>> + }
>> +
>> + nvdimms_info.current_addr = current + size;
>> + return current;
>> +}
> You can't use all memory above hotplug_memory area since
> we have to tell guest where 64-bit PCI window starts,
> and currently it should start at reserved-memory-end
> (but it isn't due to a bug: I've just posted fix to qemu-devel
> "[PATCH 0/2] pc: fix 64-bit PCI window clashing with memory hotplug region"
> )
Ah, got it, thanks for you pointing it out.
>
>> +
>> +static uint32_t new_device_index(void)
>> +{
>> + return nvdimms_info.device_index++;
>> +}
>> +
>> static char *get_file(Object *obj, Error **errp)
>> {
>> PCNVDIMMDevice *nvdimm = PC_NVDIMM(obj);
>> @@ -48,6 +76,11 @@ static void set_file(Object *obj, const char *str, Error **errp)
>> {
>> PCNVDIMMDevice *nvdimm = PC_NVDIMM(obj);
>>
>> + if (memory_region_size(&nvdimm->mr)) {
>> + error_setg(errp, "cannot change property value");
>> + return;
>> + }
>> +
>> if (nvdimm->file) {
>> g_free(nvdimm->file);
>> }
>> @@ -76,13 +109,87 @@ static void pc_nvdimm_init(Object *obj)
>> set_configdata, NULL);
>> }
>>
>> +static uint64_t get_file_size(int fd)
>> +{
>> + struct stat stat_buf;
>> + uint64_t size;
>> +
>> + if (fstat(fd, &stat_buf) < 0) {
>> + return 0;
>> + }
>> +
>> + if (S_ISREG(stat_buf.st_mode)) {
>> + return stat_buf.st_size;
>> + }
>> +
>> + if (S_ISBLK(stat_buf.st_mode) && !ioctl(fd, BLKGETSIZE64, &size)) {
>> + return size;
>> + }
>> +
>> + return 0;
>> +}
> All this file stuff I'd leave to already existing backends like
> memory-backend-file or even memory-backend-ram which already do
> above and more allowing to configure persistent and volatile
> NVDIMMs without changing NVDIMM fronted code.
>
The current memory backends use all memory size and map it to guest's
address space. However, nvdimm needs a reserved region for its label
data which is only accessed in Qemu.
How about introduce two parameters, "reserved_size" and "reserved_addr"
to TYPE_MEMORY_BACKEND, then only the memory region
[0, size - reserved_size) is mapped to guest and the remain part is
pointed by "reserved_addr"?
next prev parent reply other threads:[~2015-09-08 13:44 UTC|newest]
Thread overview: 164+ messages / expand[flat|nested] mbox.gz Atom feed top
2015-08-14 14:51 [PATCH v2 00/18] implement vNVDIMM Xiao Guangrong
2015-08-14 14:51 ` [Qemu-devel] " Xiao Guangrong
2015-08-14 14:51 ` [PATCH v2 01/18] acpi: allow aml_operation_region() working on 64 bit offset Xiao Guangrong
2015-08-14 14:51 ` [Qemu-devel] " Xiao Guangrong
2015-09-02 8:05 ` Igor Mammedov
2015-09-02 8:05 ` [Qemu-devel] " Igor Mammedov
2015-08-14 14:51 ` [PATCH v2 02/18] i386/acpi-build: allow SSDT to operate on 64 bit Xiao Guangrong
2015-08-14 14:51 ` [Qemu-devel] " Xiao Guangrong
2015-09-02 10:06 ` Igor Mammedov
2015-09-02 10:06 ` [Qemu-devel] " Igor Mammedov
2015-09-02 10:43 ` Xiao Guangrong
2015-09-02 11:42 ` Igor Mammedov
2015-09-06 7:01 ` Xiao Guangrong
2015-09-02 12:05 ` Michael S. Tsirkin
2015-09-02 12:05 ` [Qemu-devel] " Michael S. Tsirkin
2015-08-14 14:51 ` [PATCH v2 03/18] acpi: add aml_derefof Xiao Guangrong
2015-08-14 14:51 ` [Qemu-devel] " Xiao Guangrong
2015-09-02 10:16 ` Igor Mammedov
2015-09-02 10:16 ` [Qemu-devel] " Igor Mammedov
2015-09-02 10:38 ` Xiao Guangrong
2015-09-02 10:38 ` [Qemu-devel] " Xiao Guangrong
2015-08-14 14:51 ` [PATCH v2 04/18] acpi: add aml_sizeof Xiao Guangrong
2015-08-14 14:51 ` [Qemu-devel] " Xiao Guangrong
2015-09-02 10:18 ` Igor Mammedov
2015-09-02 10:18 ` [Qemu-devel] " Igor Mammedov
2015-09-02 10:39 ` Xiao Guangrong
2015-09-02 10:39 ` [Qemu-devel] " Xiao Guangrong
2015-08-14 14:51 ` [PATCH v2 05/18] acpi: add aml_create_field Xiao Guangrong
2015-08-14 14:51 ` [Qemu-devel] " Xiao Guangrong
2015-09-02 11:10 ` Igor Mammedov
2015-09-02 11:10 ` [Qemu-devel] " Igor Mammedov
2015-09-06 5:32 ` Xiao Guangrong
2015-09-06 5:32 ` [Qemu-devel] " Xiao Guangrong
2015-08-14 14:51 ` [PATCH v2 06/18] pc: implement NVDIMM device abstract Xiao Guangrong
2015-08-14 14:51 ` [Qemu-devel] " Xiao Guangrong
2015-08-25 14:57 ` Stefan Hajnoczi
2015-08-25 14:57 ` Stefan Hajnoczi
2015-08-26 9:37 ` Xiao Guangrong
2015-08-26 9:37 ` Xiao Guangrong
2015-09-02 9:58 ` Igor Mammedov
2015-09-02 9:58 ` Igor Mammedov
2015-09-02 10:36 ` Xiao Guangrong
2015-09-02 10:36 ` Xiao Guangrong
2015-09-02 11:31 ` Igor Mammedov
2015-09-02 11:31 ` Igor Mammedov
2015-09-06 6:07 ` Xiao Guangrong
2015-09-06 6:07 ` Xiao Guangrong
2015-09-07 13:40 ` Igor Mammedov
2015-09-07 13:40 ` Igor Mammedov
2015-09-08 14:03 ` Xiao Guangrong
2015-09-10 9:47 ` Igor Mammedov
2015-08-14 14:52 ` [PATCH v2 07/18] nvdimm: reserve address range for NVDIMM Xiao Guangrong
2015-08-14 14:52 ` [Qemu-devel] " Xiao Guangrong
2015-08-25 15:12 ` Stefan Hajnoczi
2015-08-25 15:12 ` [Qemu-devel] " Stefan Hajnoczi
2015-08-26 9:39 ` Xiao Guangrong
2015-08-26 9:40 ` Xiao Guangrong
2015-08-26 9:40 ` Xiao Guangrong
2015-08-25 15:39 ` Stefan Hajnoczi
2015-08-25 15:39 ` [Qemu-devel] " Stefan Hajnoczi
2015-08-28 17:25 ` Eduardo Habkost
2015-08-28 17:25 ` [Qemu-devel] " Eduardo Habkost
2015-08-31 7:01 ` Xiao Guangrong
2015-08-31 7:01 ` [Qemu-devel] " Xiao Guangrong
2015-09-04 12:02 ` Igor Mammedov
2015-09-04 12:02 ` [Qemu-devel] " Igor Mammedov
2015-09-06 7:22 ` Xiao Guangrong
2015-09-06 7:22 ` Xiao Guangrong
2015-08-14 14:52 ` [PATCH v2 08/18] nvdimm: init backend memory mapping and config data area Xiao Guangrong
2015-08-14 14:52 ` [Qemu-devel] " Xiao Guangrong
2015-08-25 16:03 ` Stefan Hajnoczi
2015-08-25 16:03 ` [Qemu-devel] " Stefan Hajnoczi
2015-08-26 10:40 ` Xiao Guangrong
2015-08-26 10:40 ` [Qemu-devel] " Xiao Guangrong
2015-08-28 11:58 ` Stefan Hajnoczi
2015-08-28 11:58 ` [Qemu-devel] " Stefan Hajnoczi
2015-08-31 6:23 ` Xiao Guangrong
2015-08-31 6:23 ` [Qemu-devel] " Xiao Guangrong
2015-09-01 9:14 ` Stefan Hajnoczi
2015-09-01 9:14 ` [Qemu-devel] " Stefan Hajnoczi
2015-09-15 16:10 ` Paolo Bonzini
2015-09-15 16:10 ` [Qemu-devel] " Paolo Bonzini
2015-09-17 8:39 ` Xiao Guangrong
2015-09-17 8:39 ` [Qemu-devel] " Xiao Guangrong
2015-09-17 9:04 ` Igor Mammedov
2015-09-17 9:04 ` [Qemu-devel] " Igor Mammedov
2015-09-17 9:14 ` Xiao Guangrong
2015-09-17 9:14 ` [Qemu-devel] " Xiao Guangrong
2015-09-17 9:34 ` Paolo Bonzini
2015-09-17 9:34 ` [Qemu-devel] " Paolo Bonzini
2015-09-17 12:43 ` Xiao Guangrong
2015-09-15 16:07 ` Paolo Bonzini
2015-09-15 16:07 ` [Qemu-devel] " Paolo Bonzini
2015-09-17 8:23 ` Xiao Guangrong
2015-09-17 8:23 ` [Qemu-devel] " Xiao Guangrong
2015-09-15 16:06 ` Paolo Bonzini
2015-09-15 16:06 ` [Qemu-devel] " Paolo Bonzini
2015-09-17 8:21 ` Xiao Guangrong
2015-09-17 8:21 ` [Qemu-devel] " Xiao Guangrong
2015-09-07 14:11 ` Igor Mammedov
2015-09-07 14:11 ` Igor Mammedov
2015-09-08 13:38 ` Xiao Guangrong [this message]
2015-09-10 10:35 ` Igor Mammedov
2015-09-15 16:11 ` Paolo Bonzini
2015-08-14 14:52 ` [PATCH v2 09/18] nvdimm: build ACPI NFIT table Xiao Guangrong
2015-08-14 14:52 ` [Qemu-devel] " Xiao Guangrong
2015-09-15 16:12 ` Paolo Bonzini
2015-09-15 16:12 ` [Qemu-devel] " Paolo Bonzini
2015-09-15 17:35 ` Igor Mammedov
2015-09-15 17:35 ` [Qemu-devel] " Igor Mammedov
2015-08-14 14:52 ` [PATCH v2 10/18] nvdimm: init the address region used by DSM method Xiao Guangrong
2015-08-14 14:52 ` [Qemu-devel] " Xiao Guangrong
2015-08-25 16:11 ` Stefan Hajnoczi
2015-08-25 16:11 ` [Qemu-devel] " Stefan Hajnoczi
2015-08-26 10:41 ` Xiao Guangrong
2015-08-26 10:41 ` [Qemu-devel] " Xiao Guangrong
2015-08-14 14:52 ` [PATCH v2 11/18] nvdimm: build ACPI nvdimm devices Xiao Guangrong
2015-08-14 14:52 ` [Qemu-devel] " Xiao Guangrong
2015-08-14 14:52 ` [PATCH v2 12/18] nvdimm: save arg3 for NVDIMM device _DSM method Xiao Guangrong
2015-08-14 14:52 ` [Qemu-devel] " Xiao Guangrong
2015-08-14 14:52 ` [PATCH v2 13/18] nvdimm: build namespace config data Xiao Guangrong
2015-08-14 14:52 ` [Qemu-devel] " Xiao Guangrong
2015-08-25 16:16 ` Stefan Hajnoczi
2015-08-25 16:16 ` Stefan Hajnoczi
2015-08-26 10:42 ` Xiao Guangrong
2015-08-26 10:42 ` Xiao Guangrong
2015-08-28 11:59 ` Stefan Hajnoczi
2015-08-28 11:59 ` Stefan Hajnoczi
2015-08-31 6:25 ` Xiao Guangrong
2015-08-31 6:25 ` Xiao Guangrong
2015-08-14 14:52 ` [PATCH v2 14/18] nvdimm: support NFIT_CMD_IMPLEMENTED function Xiao Guangrong
2015-08-14 14:52 ` [Qemu-devel] " Xiao Guangrong
2015-08-25 16:23 ` Stefan Hajnoczi
2015-08-25 16:23 ` [Qemu-devel] " Stefan Hajnoczi
2015-08-26 10:46 ` Xiao Guangrong
2015-08-26 10:46 ` [Qemu-devel] " Xiao Guangrong
2015-08-28 12:01 ` Stefan Hajnoczi
2015-08-28 12:01 ` [Qemu-devel] " Stefan Hajnoczi
2015-08-31 6:51 ` Xiao Guangrong
2015-08-31 6:51 ` Xiao Guangrong
2015-09-01 9:16 ` Stefan Hajnoczi
2015-09-01 9:16 ` Stefan Hajnoczi
2015-08-14 14:52 ` [PATCH v2 15/18] nvdimm: support NFIT_CMD_GET_CONFIG_SIZE function Xiao Guangrong
2015-08-14 14:52 ` [Qemu-devel] " Xiao Guangrong
2015-08-25 16:24 ` Stefan Hajnoczi
2015-08-25 16:24 ` [Qemu-devel] " Stefan Hajnoczi
2015-08-26 10:47 ` Xiao Guangrong
2015-08-26 10:47 ` [Qemu-devel] " Xiao Guangrong
2015-08-14 14:52 ` [PATCH v2 16/18] nvdimm: support NFIT_CMD_GET_CONFIG_DATA Xiao Guangrong
2015-08-14 14:52 ` [Qemu-devel] " Xiao Guangrong
2015-08-14 14:52 ` [PATCH v2 17/18] nvdimm: support NFIT_CMD_SET_CONFIG_DATA Xiao Guangrong
2015-08-14 14:52 ` [Qemu-devel] " Xiao Guangrong
2015-08-14 14:52 ` [PATCH v2 18/18] nvdimm: add maintain info Xiao Guangrong
2015-08-14 14:52 ` [Qemu-devel] " Xiao Guangrong
2015-08-25 16:26 ` [Qemu-devel] [PATCH v2 00/18] implement vNVDIMM Stefan Hajnoczi
2015-08-25 16:26 ` Stefan Hajnoczi
2015-08-26 10:49 ` Xiao Guangrong
2015-08-26 10:49 ` Xiao Guangrong
2015-10-07 14:02 ` Stefan Hajnoczi
2015-10-07 14:02 ` Stefan Hajnoczi
2015-10-07 14:43 ` Xiao Guangrong
2015-10-07 14:43 ` Xiao Guangrong
2015-10-09 10:38 ` Stefan Hajnoczi
2015-10-09 10:38 ` Stefan Hajnoczi
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=55EEE4C9.4090400@linux.intel.com \
--to=guangrong.xiao@linux.intel.com \
--cc=ehabkost@redhat.com \
--cc=gleb@kernel.org \
--cc=imammedo@redhat.com \
--cc=kvm@vger.kernel.org \
--cc=mst@redhat.com \
--cc=mtosatti@redhat.com \
--cc=pbonzini@redhat.com \
--cc=qemu-devel@nongnu.org \
--cc=rth@twiddle.net \
--cc=stefanha@redhat.com \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.