From: Ross Zwisler <ross.zwisler@linux.intel.com>
To: Dan Williams <dan.j.williams@intel.com>
Cc: "linux-kernel@vger.kernel.org" <linux-kernel@vger.kernel.org>,
"H. Peter Anvin" <hpa@zytor.com>,
"Luis R. Rodriguez" <mcgrof@suse.com>,
"Rafael J. Wysocki" <rjw@rjwysocki.net>,
Andrew Morton <akpm@linux-foundation.org>,
Borislav Petkov <bp@suse.de>, Christoph Hellwig <hch@lst.de>,
Christoph Jaeger <cj@linux.com>,
Dan Streetman <ddstreet@ieee.org>, Ingo Molnar <mingo@redhat.com>,
Juergen Gross <jgross@suse.com>, Len Brown <lenb@kernel.org>,
Rusty Russell <rusty@rustcorp.com.au>,
Stephen Rothwell <sfr@canb.auug.org.au>,
Thierry Reding <treding@nvidia.com>,
Thomas Gleixner <tglx@linutronix.de>,
Toshi Kani <toshi.kani@hp.com>,
Vishal Verma <vishal.l.verma@intel.com>,
Will Deacon <will.deacon@arm.com>,
Linux ACPI <linux-acpi@vger.kernel.org>,
"linux-nvdimm@lists.01.org" <linux-nvdimm@lists.01.org>,
X86 ML <x86@kernel.org>
Subject: Re: [PATCH v2] nd_blk: add support for "read flush" DSM flag
Date: Thu, 20 Aug 2015 13:00:10 -0600
Message-ID: <1440097210.30197.9.camel@linux.intel.com>
In-Reply-To: <CAPcyv4iSJ8e3mgiMb3cG_NEW4VqmuWhHdL2v4Os_xiPSgGLmCg@mail.gmail.com>

On Thu, 2015-08-20 at 11:26 -0700, Dan Williams wrote:
> On Thu, Aug 20, 2015 at 11:17 AM, Ross Zwisler
> <ross.zwisler@linux.intel.com> wrote:
> > On Thu, 2015-08-20 at 10:59 -0700, Dan Williams wrote:
> [..]
> > Ah, I think we're getting confused about the deinterleave part.
> >
> > The aperture is a set of contiguous addresses from the perspective of the
> > DIMM, but when it's interleaved by the iMC it becomes a bunch of segments that
> > are not contiguous in the virtual address space of the kernel.
> >
> > Meaning, say you have an 8k aperture that is interleaved with one other DIMM
> > on a 256 byte granularity - this means that in SPA space you'll end up with a
> > big mesh of 256 byte chunks, half of which belong to you and half which don't:
> >
> > SPA space:
> > +--------------------+
> > |256 bytes (ours)    |
> > +--------------------+
> > |256 bytes (not ours)|
> > +--------------------+
> > |256 bytes (ours)    |
> > +--------------------+
> > |256 bytes (not ours)|
> > +--------------------+
> > ...
> >
> > To be able to flush the entire aperture unconditionally, we have to walk
> > through all the segments that belong to us and flush each one of them. I
> > don't think we want to blindly flush the entire interleaved space because a)
> > the other chunks are some other DIMMs' apertures, and b) we'd be flushing 2x
> > or more (depending on how many DIMMs are interleaved) the space we need, one
> > cache line at a time.
>
> I am indeed proposing flushing other DIMMs because those ranges are
> invalidated by the aperture moving. This is based on the assumption
> that the flushing is cheaper in the case when no dirty-lines are
> found. The performance gains of doing piecemeal flushes seem not
> worth the complexity.

Why are the segments belonging to other apertures invalidated because we have
moved our aperture? They are all independent cache lines (segments must be a
multiple of the cache line size), and the other apertures might be in the
middle of some other I/O operation on some other CPU that we know nothing
about.
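
To make the 8k / two-way / 256-byte example above concrete, here is a minimal
sketch of the address arithmetic. It is not the nd_blk code: the struct, the
function names, the `position` field and the 64-byte cache line size are all
assumptions made for illustration. It only shows how the segments that belong
to one DIMM are laid out in the interleaved SPA window, and how many cache
lines a piecemeal flush would touch compared with flushing the whole window.

```c
#include <stddef.h>

/* Assumed cache line size; the real value is CPU dependent. */
#define CACHE_LINE 64u

/* Hypothetical description of one interleaved aperture (illustration only). */
struct interleave {
	size_t aperture_size;	/* aperture size in bytes, as seen by the DIMM */
	size_t line_size;	/* interleave granularity in bytes */
	unsigned int ways;	/* number of DIMMs in the interleave set */
	unsigned int position;	/* index of our DIMM within the set */
};

/*
 * Translate a DIMM-relative aperture offset into an offset within the
 * interleaved SPA window. Segment n of our aperture lands in SPA slot
 * n * ways + position.
 */
static size_t aperture_to_spa_offset(const struct interleave *il, size_t off)
{
	size_t segment = off / il->line_size;
	size_t within = off % il->line_size;

	return (segment * il->ways + il->position) * il->line_size + within;
}

/*
 * Walk only the segments that belong to us and hand each cache line to
 * flush() (may be NULL). Returns the number of cache lines visited.
 * Assumes line_size is a multiple of CACHE_LINE.
 */
static size_t flush_our_segments(const struct interleave *il,
				 void (*flush)(size_t spa_off))
{
	size_t lines = 0;

	for (size_t seg = 0; seg < il->aperture_size; seg += il->line_size) {
		size_t spa = aperture_to_spa_offset(il, seg);

		for (size_t off = 0; off < il->line_size; off += CACHE_LINE) {
			if (flush)
				flush(spa + off);
			lines++;
		}
	}
	return lines;
}

/* Cache lines visited when the whole interleaved window is flushed blindly. */
static size_t blind_flush_lines(const struct interleave *il)
{
	return il->aperture_size * il->ways / CACHE_LINE;
}
```

With aperture_size = 8192, line_size = 256 and ways = 2, the piecemeal walk
visits 128 cache lines and the blind flush visits 256, which is the "2x or
more" cost from point b) above.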