From: "Dr. David Alan Gilbert" <dgilbert@redhat.com>
To: qemu-devel@nongnu.org, Eduardo Habkost <ehabkost@redhat.com>,
Igor Mammedov <imammedo@redhat.com>,
Paolo Bonzini <pbonzini@redhat.com>,
mst@redhat.com, Xiao Guangrong <xiaoguangrong.eric@gmail.com>,
Juan Quintela <quintela@redhat.com>,
Stefan Hajnoczi <stefanha@redhat.com>,
Dan Williams <dan.j.williams@intel.com>
Subject: Re: [Qemu-devel] [PATCH v2 7/8] migration/ram: ensure write persistence on loading compressed pages to PMEM
Date: Wed, 7 Feb 2018 13:03:56 +0000
Message-ID: <20180207130355.GH2665@work-vm>
In-Reply-To: <20180207121525.5pyrld36k5xbm373@hz-desktop>
* Haozhong Zhang (haozhong.zhang@intel.com) wrote:
> On 02/07/18 11:54 +0000, Dr. David Alan Gilbert wrote:
> > * Haozhong Zhang (haozhong.zhang@intel.com) wrote:
> > > When loading a compressed page to persistent memory, flush CPU cache
> > > after the data is decompressed. Combined with a call to pmem_drain()
> > > at the end of memory loading, we can guarantee those compressed pages
> > > are persistently loaded to PMEM.
> >
> > Can you explain why this can use the flush and doesn't need the special
> > memset?
>
> The best way to ensure write persistence is to perform all pmem
> accesses through libpmem, e.g., pmem_memcpy_nodrain() + pmem_drain().
> However, the write to pmem in this case is performed by uncompress(),
> which is implemented outside both QEMU and libpmem. Whether it uses
> libpmem is not under QEMU's control. Therefore, we have to use the
> less optimal approach: flush the cache for all pmem addresses that
> uncompress() may have written, i.e., plain memcpy() and/or memset()
> inside uncompress(), followed by pmem_flush() + pmem_drain() in QEMU.
In what way is it less optimal?
If that's a legal thing to do, then why not just do a pmem_flush +
pmem_drain right at the end of the ram loading and leave all the rest of
the code untouched?
Dave
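
[Editorial sketch] The two orderings under discussion can be contrasted in
a few lines of C. This is only an illustration under stated assumptions:
stub_pmem_flush() and stub_pmem_drain() are hypothetical stand-ins that
count calls instead of invoking libpmem, external_write_page() is a
hypothetical placeholder for a writer such as uncompress() that QEMU does
not control, and the page size and page count are arbitrary. It does not
measure which ordering is faster; it only shows that both flush the same
bytes and drain once.

```c
#include <assert.h>
#include <stddef.h>
#include <string.h>

#define SKETCH_PAGE_SIZE 4096u   /* arbitrary page size for the sketch */
#define SKETCH_NPAGES    8u      /* arbitrary number of pages */

/* Hypothetical stand-ins for pmem_flush()/pmem_drain(): they only count
 * calls and bytes so the two strategies can be compared. */
static size_t flush_calls, flushed_bytes, drain_calls;

static void stub_pmem_flush(const void *addr, size_t len)
{
    (void)addr;
    flush_calls++;
    flushed_bytes += len;
}

static void stub_pmem_drain(void)
{
    drain_calls++;
}

static void reset_counters(void)
{
    flush_calls = flushed_bytes = drain_calls = 0;
}

/* Hypothetical writer outside our control (stands in for uncompress()). */
static void external_write_page(unsigned char *dst, unsigned char fill)
{
    memset(dst, fill, SKETCH_PAGE_SIZE);
}

/* Strategy A: flush each page right after it is written, drain once. */
static void load_flush_per_page(unsigned char *ram)
{
    assert(ram != NULL);
    for (size_t i = 0; i < SKETCH_NPAGES; i++) {
        unsigned char *page = ram + i * SKETCH_PAGE_SIZE;
        external_write_page(page, (unsigned char)i);
        stub_pmem_flush(page, SKETCH_PAGE_SIZE);
    }
    stub_pmem_drain();
}

/* Strategy B: write every page first, then issue one flush over the
 * whole region followed by one drain. */
static void load_flush_at_end(unsigned char *ram)
{
    assert(ram != NULL);
    for (size_t i = 0; i < SKETCH_NPAGES; i++) {
        external_write_page(ram + i * SKETCH_PAGE_SIZE, (unsigned char)i);
    }
    stub_pmem_flush(ram, (size_t)SKETCH_NPAGES * SKETCH_PAGE_SIZE);
    stub_pmem_drain();
}
```

Calling either function on a buffer of SKETCH_NPAGES * SKETCH_PAGE_SIZE
bytes leaves flushed_bytes identical; only the number of flush calls and
the point at which they happen differ.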
> Haozhong
>
> >
> > Dave
> >
> > > Signed-off-by: Haozhong Zhang <haozhong.zhang@intel.com>
> > > ---
> > > include/qemu/pmem.h | 4 ++++
> > > migration/ram.c | 16 +++++++++++-----
> > > 2 files changed, 15 insertions(+), 5 deletions(-)
> > >
> > > diff --git a/include/qemu/pmem.h b/include/qemu/pmem.h
> > > index 77ee1fc4eb..20e3f6e71d 100644
> > > --- a/include/qemu/pmem.h
> > > +++ b/include/qemu/pmem.h
> > > @@ -37,6 +37,10 @@ static inline void *pmem_memset_nodrain(void *pmemdest, int c, size_t len)
> > > return memset(pmemdest, c, len);
> > > }
> > >
> > > +static inline void pmem_flush(const void *addr, size_t len)
> > > +{
> > > +}
> > > +
> > > static inline void pmem_drain(void)
> > > {
> > > }
> > > diff --git a/migration/ram.c b/migration/ram.c
> > > index 5a79bbff64..924d2b9537 100644
> > > --- a/migration/ram.c
> > > +++ b/migration/ram.c
> > > @@ -274,6 +274,7 @@ struct DecompressParam {
> > > void *des;
> > > uint8_t *compbuf;
> > > int len;
> > > + bool is_pmem;
> > > };
> > > typedef struct DecompressParam DecompressParam;
> > >
> > > @@ -2502,7 +2503,7 @@ static void *do_data_decompress(void *opaque)
> > > DecompressParam *param = opaque;
> > > unsigned long pagesize;
> > > uint8_t *des;
> > > - int len;
> > > + int len, rc;
> > >
> > > qemu_mutex_lock(&param->mutex);
> > > while (!param->quit) {
> > > @@ -2518,8 +2519,11 @@ static void *do_data_decompress(void *opaque)
> > > * not a problem because the dirty page will be retransferred
> > > * and uncompress() won't break the data in other pages.
> > > */
> > > - uncompress((Bytef *)des, &pagesize,
> > > - (const Bytef *)param->compbuf, len);
> > > + rc = uncompress((Bytef *)des, &pagesize,
> > > + (const Bytef *)param->compbuf, len);
> > > + if (rc == Z_OK && param->is_pmem) {
> > > + pmem_flush(des, len);
> > > + }
> > >
> > > qemu_mutex_lock(&decomp_done_lock);
> > > param->done = true;
> > > @@ -2605,7 +2609,8 @@ static void compress_threads_load_cleanup(void)
> > > }
> > >
> > > static void decompress_data_with_multi_threads(QEMUFile *f,
> > > - void *host, int len)
> > > + void *host, int len,
> > > + bool is_pmem)
> > > {
> > > int idx, thread_count;
> > >
> > > @@ -2619,6 +2624,7 @@ static void decompress_data_with_multi_threads(QEMUFile *f,
> > > qemu_get_buffer(f, decomp_param[idx].compbuf, len);
> > > decomp_param[idx].des = host;
> > > decomp_param[idx].len = len;
> > > + decomp_param[idx].is_pmem = is_pmem;
> > > qemu_cond_signal(&decomp_param[idx].cond);
> > > qemu_mutex_unlock(&decomp_param[idx].mutex);
> > > break;
> > > @@ -2964,7 +2970,7 @@ static int ram_load(QEMUFile *f, void *opaque, int version_id)
> > > ret = -EINVAL;
> > > break;
> > > }
> > > - decompress_data_with_multi_threads(f, host, len);
> > > + decompress_data_with_multi_threads(f, host, len, is_pmem);
> > > break;
> > >
> > > case RAM_SAVE_FLAG_XBZRLE:
> > > --
> > > 2.14.1
> > >
> > --
> > Dr. David Alan Gilbert / dgilbert@redhat.com / Manchester, UK
--
Dr. David Alan Gilbert / dgilbert@redhat.com / Manchester, UK
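
[Editorial sketch] One detail of the quoted do_data_decompress() hunk is
worth illustrating: which length is passed to the flush. zlib's
uncompress() takes the destination capacity in its destLen argument and
overwrites it with the number of bytes actually written, while sourceLen
is the size of the compressed input. Assuming `len` in the hunk is that
compressed input length, the range that was written is described by the
destination length instead. The sketch below shows that pattern with a
hypothetical toy run-length decoder (toy_decode) in place of zlib and a
hypothetical stub_pmem_flush() that only records its arguments; none of
these names come from QEMU or libpmem.

```c
#include <assert.h>
#include <stddef.h>

/* Hypothetical stand-in for pmem_flush(): records the last range. */
static const void *last_flush_addr;
static size_t last_flush_len;

static void stub_pmem_flush(const void *addr, size_t len)
{
    last_flush_addr = addr;
    last_flush_len = len;
}

/* Toy run-length decoder; input is a sequence of (count, value) byte
 * pairs. Like uncompress(), *dst_len is the destination capacity on
 * entry and the number of bytes written on successful return.
 * Returns 0 on success, -1 on malformed input or insufficient space. */
static int toy_decode(unsigned char *dst, size_t *dst_len,
                      const unsigned char *src, size_t src_len)
{
    size_t out = 0;

    if (src_len % 2 != 0) {
        return -1;
    }
    for (size_t i = 0; i < src_len; i += 2) {
        size_t run = src[i];

        if (run > *dst_len - out) {
            return -1;
        }
        for (size_t j = 0; j < run; j++) {
            dst[out++] = src[i + 1];
        }
    }
    *dst_len = out;
    return 0;
}

/* Decode into dst and flush exactly the bytes the decoder reported
 * writing, not the (smaller) size of the encoded input. */
static int decode_and_flush(unsigned char *dst, size_t dst_cap,
                            const unsigned char *src, size_t src_len)
{
    size_t written = dst_cap;
    int rc;

    assert(dst != NULL && src != NULL);
    rc = toy_decode(dst, &written, src, src_len);
    if (rc == 0) {
        stub_pmem_flush(dst, written);
    }
    return rc;
}
```

With a 4-byte input that expands to 300 bytes, the flush covers 300
bytes; flushing only the 4 input bytes' worth would leave most of the
written range uncovered.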