* Xen Developer Summit Storage Performance BoF notes
@ 2013-10-30 11:03 David Vrabel
2013-10-30 11:33 ` Roger Pau Monné
2013-10-30 16:09 ` Felipe Franciosi
0 siblings, 2 replies; 4+ messages in thread
From: David Vrabel @ 2013-10-30 11:03 UTC (permalink / raw)
To: Xen-devel@lists.xen.org
[ I forgot to make notes so these are from memory, please respond with
any corrections or omissions. ]
Felipe introduced the session, highlighting the change in storage (i.e.,
low latency SSDs and fast SANs) were exposing bottlenecks in the current
architecture which is designed with slow disks. Refer to his
presentation from Friday for more details.
Felipe noted that persistent grants were causing performance regressions
when the backend did not support them and system where copy cost > map
cost (e.g., when dom0 has few VCPUs). Roger agreed on restoring the
zero-copy path in the frontends was a good idea. [He has now posted
patches for this.]
Felipe mentioned that persistent grants were most beneficial when using
user space backend. David pointed out that this is most likely caused
by a poor implementation of the gntdev device.
Matt mentioned contention on the m2p override lock as causing
performance problems and suggested making this a read/write lock.
David listed some of the key bottlenecks already identified and plans to
resolve them without any protocol changes.
1. Unmap TLB flushes can be eliminated if the mapping is not used.
Experiments by XenServer suggest grant mapped pages by blkback are never
accessed thus eliminating all TLB flushes.
2. Grant table lock contention can be reduced by finer grained locked.
e.g., by having buckets of map tracking structures and hashing
domid+grant ref to a bucket.
3. gntdev device does a double map/unmap (for userspace and kernel
mapping) and does the kernel mapping a page at a time. Userspace
mappings could be done at page fault time (in the expectation that
userspace doesn't touch them) and the kernel side should batch grant
table ops using a new GNTOP_unmap_and_duplicate hypercall sub-op for the
unmap. Roger said he'd posted patch for the sub-op, but received no
feedback.
Anil mentioned PV filesystems but this wasn't discussed in any depth.
^ permalink raw reply [flat|nested] 4+ messages in thread
* Re: Xen Developer Summit Storage Performance BoF notes
2013-10-30 11:03 Xen Developer Summit Storage Performance BoF notes David Vrabel
@ 2013-10-30 11:33 ` Roger Pau Monné
2013-10-30 18:20 ` Konrad Rzeszutek Wilk
2013-10-30 16:09 ` Felipe Franciosi
1 sibling, 1 reply; 4+ messages in thread
From: Roger Pau Monné @ 2013-10-30 11:33 UTC (permalink / raw)
To: David Vrabel, Xen-devel@lists.xen.org
On 30/10/13 12:03, David Vrabel wrote:
> [ I forgot to make notes so these are from memory, please respond with
> any corrections or omissions. ]
>
> Felipe introduced the session, highlighting the change in storage (i.e.,
> low latency SSDs and fast SANs) were exposing bottlenecks in the current
> architecture which is designed with slow disks. Refer to his
> presentation from Friday for more details.
>
> Felipe noted that persistent grants were causing performance regressions
> when the backend did not support them and system where copy cost > map
> cost (e.g., when dom0 has few VCPUs). Roger agreed on restoring the
> zero-copy path in the frontends was a good idea. [He has now posted
> patches for this.]
>
> Felipe mentioned that persistent grants were most beneficial when using
> user space backend. David pointed out that this is most likely caused
> by a poor implementation of the gntdev device.
>
> Matt mentioned contention on the m2p override lock as causing
> performance problems and suggested making this a read/write lock.
>
> David listed some of the key bottlenecks already identified and plans to
> resolve them without any protocol changes.
>
> 1. Unmap TLB flushes can be eliminated if the mapping is not used.
> Experiments by XenServer suggest grant mapped pages by blkback are never
> accessed thus eliminating all TLB flushes.
>
> 2. Grant table lock contention can be reduced by finer grained locked.
> e.g., by having buckets of map tracking structures and hashing
> domid+grant ref to a bucket.
>
> 3. gntdev device does a double map/unmap (for userspace and kernel
> mapping) and does the kernel mapping a page at a time. Userspace
> mappings could be done at page fault time (in the expectation that
> userspace doesn't touch them) and the kernel side should batch grant
> table ops using a new GNTOP_unmap_and_duplicate hypercall sub-op for the
> unmap. Roger said he'd posted patch for the sub-op, but received no
> feedback.
Here are the patches, the Linux side was reviewed by Stefano (if my
memory doesn't fail):
http://lists.xen.org/archives/html/xen-devel/2013-07/msg02825.html
https://lkml.org/lkml/2013/8/27/189
^ permalink raw reply [flat|nested] 4+ messages in thread
* Re: Xen Developer Summit Storage Performance BoF notes
2013-10-30 11:03 Xen Developer Summit Storage Performance BoF notes David Vrabel
2013-10-30 11:33 ` Roger Pau Monné
@ 2013-10-30 16:09 ` Felipe Franciosi
1 sibling, 0 replies; 4+ messages in thread
From: Felipe Franciosi @ 2013-10-30 16:09 UTC (permalink / raw)
To: David Vrabel, Xen-devel@lists.xen.org
Thanks for writing this up, David.
I also did not take any notes (argh), but from what I recall your summary is complete.
Do we want to raise any actions on who will follow each of the items?
Or at least prioritise them?
Thanks,
Felipe
-----Original Message-----
From: xen-devel-bounces@lists.xen.org [mailto:xen-devel-bounces@lists.xen.org] On Behalf Of David Vrabel
Sent: 30 October 2013 11:04
To: Xen-devel@lists.xen.org
Subject: [Xen-devel] Xen Developer Summit Storage Performance BoF notes
[ I forgot to make notes so these are from memory, please respond with any corrections or omissions. ]
Felipe introduced the session, highlighting the change in storage (i.e., low latency SSDs and fast SANs) were exposing bottlenecks in the current architecture which is designed with slow disks. Refer to his presentation from Friday for more details.
Felipe noted that persistent grants were causing performance regressions when the backend did not support them and system where copy cost > map cost (e.g., when dom0 has few VCPUs). Roger agreed on restoring the zero-copy path in the frontends was a good idea. [He has now posted patches for this.]
Felipe mentioned that persistent grants were most beneficial when using user space backend. David pointed out that this is most likely caused by a poor implementation of the gntdev device.
Matt mentioned contention on the m2p override lock as causing performance problems and suggested making this a read/write lock.
David listed some of the key bottlenecks already identified and plans to resolve them without any protocol changes.
1. Unmap TLB flushes can be eliminated if the mapping is not used.
Experiments by XenServer suggest grant mapped pages by blkback are never accessed thus eliminating all TLB flushes.
2. Grant table lock contention can be reduced by finer grained locked.
e.g., by having buckets of map tracking structures and hashing
domid+grant ref to a bucket.
3. gntdev device does a double map/unmap (for userspace and kernel
mapping) and does the kernel mapping a page at a time. Userspace mappings could be done at page fault time (in the expectation that userspace doesn't touch them) and the kernel side should batch grant table ops using a new GNTOP_unmap_and_duplicate hypercall sub-op for the unmap. Roger said he'd posted patch for the sub-op, but received no feedback.
Anil mentioned PV filesystems but this wasn't discussed in any depth.
_______________________________________________
Xen-devel mailing list
Xen-devel@lists.xen.org
http://lists.xen.org/xen-devel
^ permalink raw reply [flat|nested] 4+ messages in thread
* Re: Xen Developer Summit Storage Performance BoF notes
2013-10-30 11:33 ` Roger Pau Monné
@ 2013-10-30 18:20 ` Konrad Rzeszutek Wilk
0 siblings, 0 replies; 4+ messages in thread
From: Konrad Rzeszutek Wilk @ 2013-10-30 18:20 UTC (permalink / raw)
To: Roger Pau Monné; +Cc: David Vrabel, Xen-devel@lists.xen.org
On Wed, Oct 30, 2013 at 12:33:04PM +0100, Roger Pau Monné wrote:
> On 30/10/13 12:03, David Vrabel wrote:
> > [ I forgot to make notes so these are from memory, please respond with
> > any corrections or omissions. ]
> >
> > Felipe introduced the session, highlighting the change in storage (i.e.,
> > low latency SSDs and fast SANs) were exposing bottlenecks in the current
> > architecture which is designed with slow disks. Refer to his
> > presentation from Friday for more details.
> >
> > Felipe noted that persistent grants were causing performance regressions
> > when the backend did not support them and system where copy cost > map
> > cost (e.g., when dom0 has few VCPUs). Roger agreed on restoring the
> > zero-copy path in the frontends was a good idea. [He has now posted
> > patches for this.]
> >
> > Felipe mentioned that persistent grants were most beneficial when using
> > user space backend. David pointed out that this is most likely caused
> > by a poor implementation of the gntdev device.
> >
> > Matt mentioned contention on the m2p override lock as causing
> > performance problems and suggested making this a read/write lock.
> >
> > David listed some of the key bottlenecks already identified and plans to
> > resolve them without any protocol changes.
> >
> > 1. Unmap TLB flushes can be eliminated if the mapping is not used.
> > Experiments by XenServer suggest grant mapped pages by blkback are never
> > accessed thus eliminating all TLB flushes.
> >
> > 2. Grant table lock contention can be reduced by finer grained locked.
> > e.g., by having buckets of map tracking structures and hashing
> > domid+grant ref to a bucket.
> >
> > 3. gntdev device does a double map/unmap (for userspace and kernel
> > mapping) and does the kernel mapping a page at a time. Userspace
> > mappings could be done at page fault time (in the expectation that
> > userspace doesn't touch them) and the kernel side should batch grant
> > table ops using a new GNTOP_unmap_and_duplicate hypercall sub-op for the
> > unmap. Roger said he'd posted patch for the sub-op, but received no
> > feedback.
>
> Here are the patches, the Linux side was reviewed by Stefano (if my
> memory doesn't fail):
>
> http://lists.xen.org/archives/html/xen-devel/2013-07/msg02825.html
> https://lkml.org/lkml/2013/8/27/189
Could you repost them please based on the v3.12-rc7? Thanks!
^ permalink raw reply [flat|nested] 4+ messages in thread
end of thread, other threads:[~2013-10-30 18:20 UTC | newest]
Thread overview: 4+ messages (download: mbox.gz follow: Atom feed
-- links below jump to the message on this page --
2013-10-30 11:03 Xen Developer Summit Storage Performance BoF notes David Vrabel
2013-10-30 11:33 ` Roger Pau Monné
2013-10-30 18:20 ` Konrad Rzeszutek Wilk
2013-10-30 16:09 ` Felipe Franciosi
This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.