* [LSF/MM TOPIC] Filesystem-DAX, page-pinning, and RDMA
@ 2018-01-25 3:56 Dan Williams
[not found] ` <CAPcyv4gQNM9RbTbRWKnG6Vby_CW9CJ9EZTARsVNi=9cas7ZR2A-JsoAwUIsXosN+BqQ9rBEUg@public.gmane.org>
2018-01-25 7:23 ` Christoph Hellwig
0 siblings, 2 replies; 11+ messages in thread
From: Dan Williams @ 2018-01-25 3:56 UTC (permalink / raw)
To: lsf-pc
Cc: Christoph Hellwig, Michal Hocko, Linux MM, linux-rdma,
linux-nvdimm, jgg
The get_user_pages_longterm() api was recently added as a stop-gap
measure to prevent applications from growing dependencies on the
ability to to pin DAX-mapped filesystem blocks for RDMA indefinitely
with no ongoing coordination with the filesystem. This 'longterm'
pinning is also problematic for the non-DAX VMA case where the core-mm
needs a time bounded way to revoke a pin and manipulate the physical
pages. While existing RDMA applications have already grown the
assumption that they can pin page-cache pages indefinitely, the fact
that we are breaking this assumption for filesystem-dax presents an
opportunity to deprecate the 'indefinite pin' mechanisms and move to a
general interface that supports pin revocation.
While RDMA may grow an explicit Infiniband-verb for this 'memory
registration with lease' semantic, it seems that this problem is
bigger than just RDMA. At LSF/MM it would be useful to have a
discussion between fs, mm, dax, and RDMA folks about addressing this
problem at the core level.
Particular people that would be useful to have in attendance are
Michal Hocko, Christoph Hellwig, and Jason Gunthorpe (cc'd).
--
To unsubscribe, send a message with 'unsubscribe linux-mm' in
the body to majordomo@kvack.org. For more info on Linux MM,
see: http://www.linux-mm.org/ .
Don't email: <a href=mailto:"dont@kvack.org"> email@kvack.org </a>
^ permalink raw reply [flat|nested] 11+ messages in thread[parent not found: <CAPcyv4gQNM9RbTbRWKnG6Vby_CW9CJ9EZTARsVNi=9cas7ZR2A-JsoAwUIsXosN+BqQ9rBEUg@public.gmane.org>]
* Re: [LSF/MM TOPIC] Filesystem-DAX, page-pinning, and RDMA
[not found] ` <CAPcyv4gQNM9RbTbRWKnG6Vby_CW9CJ9EZTARsVNi=9cas7ZR2A-JsoAwUIsXosN+BqQ9rBEUg@public.gmane.org>
@ 2018-01-25 4:01 ` Bart Van Assche
[not found] ` <1516852902.3724.4.camel-Sjgp3cTcYWE@public.gmane.org>
2018-01-29 23:33 ` Jerome Glisse
1 sibling, 1 reply; 11+ messages in thread
From: Bart Van Assche @ 2018-01-25 4:01 UTC (permalink / raw)
To: dan.j.williams-ral2JQCrhuEAvxtiuMwx3w@public.gmane.org,
lsf-pc-cunTk1MwBs9QetFLy7KEm3xJsTq8ys+cHZ5vskTnxNA@public.gmane.org
Cc: linux-nvdimm-hn68Rpc1hR1g9hUCZPvPmw@public.gmane.org,
linux-rdma-u79uwXL29TY76Z2rM5mHXA@public.gmane.org,
mhocko-DgEjT+Ai2ygdnm+yROfE0A@public.gmane.org,
hch-wEGCiKHe2LqWVfeAwA7xHQ@public.gmane.org,
linux-mm-Bw31MaZKKs3YtjvyW6yDsg@public.gmane.org,
jgg-VPRAkNaXOzVWk0Htik3J/w@public.gmane.org
On Wed, 2018-01-24 at 19:56 -0800, Dan Williams wrote:
> The get_user_pages_longterm() api was recently added as a stop-gap
> measure to prevent applications from growing dependencies on the
> ability to to pin DAX-mapped filesystem blocks for RDMA indefinitely
> with no ongoing coordination with the filesystem. This 'longterm'
> pinning is also problematic for the non-DAX VMA case where the core-mm
> needs a time bounded way to revoke a pin and manipulate the physical
> pages. While existing RDMA applications have already grown the
> assumption that they can pin page-cache pages indefinitely, the fact
> that we are breaking this assumption for filesystem-dax presents an
> opportunity to deprecate the 'indefinite pin' mechanisms and move to a
> general interface that supports pin revocation.
>
> While RDMA may grow an explicit Infiniband-verb for this 'memory
> registration with lease' semantic, it seems that this problem is
> bigger than just RDMA. At LSF/MM it would be useful to have a
> discussion between fs, mm, dax, and RDMA folks about addressing this
> problem at the core level.
>
> Particular people that would be useful to have in attendance are
> Michal Hocko, Christoph Hellwig, and Jason Gunthorpe (cc'd).
Is on demand paging sufficient as a solution for your use case or do
you perhaps need something different? See also
https://www.openfabrics.org/images/eventpresos/workshops2013/2013_Workshop_Tues_0930_liss_odp.pdf
Thanks,
Bart.
^ permalink raw reply [flat|nested] 11+ messages in thread* Re: [LSF/MM TOPIC] Filesystem-DAX, page-pinning, and RDMA
[not found] ` <CAPcyv4gQNM9RbTbRWKnG6Vby_CW9CJ9EZTARsVNi=9cas7ZR2A-JsoAwUIsXosN+BqQ9rBEUg@public.gmane.org>
2018-01-25 4:01 ` Bart Van Assche
@ 2018-01-29 23:33 ` Jerome Glisse
2018-02-01 23:27 ` Jason Gunthorpe
1 sibling, 1 reply; 11+ messages in thread
From: Jerome Glisse @ 2018-01-29 23:33 UTC (permalink / raw)
To: Dan Williams
Cc: linux-nvdimm-hn68Rpc1hR1g9hUCZPvPmw, linux-rdma, Michal Hocko,
Christoph Hellwig, Linux MM, jgg-VPRAkNaXOzVWk0Htik3J/w,
lsf-pc-cunTk1MwBs9QetFLy7KEm3xJsTq8ys+cHZ5vskTnxNA
On Wed, Jan 24, 2018 at 07:56:02PM -0800, Dan Williams wrote:
> The get_user_pages_longterm() api was recently added as a stop-gap
> measure to prevent applications from growing dependencies on the
> ability to to pin DAX-mapped filesystem blocks for RDMA indefinitely
> with no ongoing coordination with the filesystem. This 'longterm'
> pinning is also problematic for the non-DAX VMA case where the core-mm
> needs a time bounded way to revoke a pin and manipulate the physical
> pages. While existing RDMA applications have already grown the
> assumption that they can pin page-cache pages indefinitely, the fact
> that we are breaking this assumption for filesystem-dax presents an
> opportunity to deprecate the 'indefinite pin' mechanisms and move to a
> general interface that supports pin revocation.
>
> While RDMA may grow an explicit Infiniband-verb for this 'memory
> registration with lease' semantic, it seems that this problem is
> bigger than just RDMA. At LSF/MM it would be useful to have a
> discussion between fs, mm, dax, and RDMA folks about addressing this
> problem at the core level.
>
> Particular people that would be useful to have in attendance are
> Michal Hocko, Christoph Hellwig, and Jason Gunthorpe (cc'd).
>
Between i would also like to participate, in my view the burden should
be on GUP users, so if hardware is not ODP capable then you should at
least be able to kill the mapping/GUP and force the hardware to redo a
GUP if it get any more transaction on affect umem. Can non ODP hardware
do that ? Or is it out of the question ?
Cheers,
Jérôme
^ permalink raw reply [flat|nested] 11+ messages in thread* Re: [LSF/MM TOPIC] Filesystem-DAX, page-pinning, and RDMA
2018-01-29 23:33 ` Jerome Glisse
@ 2018-02-01 23:27 ` Jason Gunthorpe
0 siblings, 0 replies; 11+ messages in thread
From: Jason Gunthorpe @ 2018-02-01 23:27 UTC (permalink / raw)
To: Jerome Glisse
Cc: Dan Williams, lsf-pc, Christoph Hellwig, Michal Hocko, Linux MM,
linux-rdma, linux-nvdimm
On Mon, Jan 29, 2018 at 06:33:25PM -0500, Jerome Glisse wrote:
> Between i would also like to participate, in my view the burden should
> be on GUP users, so if hardware is not ODP capable then you should at
> least be able to kill the mapping/GUP and force the hardware to redo a
> GUP if it get any more transaction on affect umem. Can non ODP hardware
> do that ? Or is it out of the question ?
For RDMA we can have the HW forcibly tear down the MR, but it is
incredibly disruptive and nobody running applications would be happy
with this outcome.
Jason
--
To unsubscribe, send a message with 'unsubscribe linux-mm' in
the body to majordomo@kvack.org. For more info on Linux MM,
see: http://www.linux-mm.org/ .
Don't email: <a href=mailto:"dont@kvack.org"> email@kvack.org </a>
^ permalink raw reply [flat|nested] 11+ messages in thread
* Re: [LSF/MM TOPIC] Filesystem-DAX, page-pinning, and RDMA
2018-01-25 3:56 [LSF/MM TOPIC] Filesystem-DAX, page-pinning, and RDMA Dan Williams
[not found] ` <CAPcyv4gQNM9RbTbRWKnG6Vby_CW9CJ9EZTARsVNi=9cas7ZR2A-JsoAwUIsXosN+BqQ9rBEUg@public.gmane.org>
@ 2018-01-25 7:23 ` Christoph Hellwig
[not found] ` <20180125072351.GA11093-wEGCiKHe2LqWVfeAwA7xHQ@public.gmane.org>
1 sibling, 1 reply; 11+ messages in thread
From: Christoph Hellwig @ 2018-01-25 7:23 UTC (permalink / raw)
To: Dan Williams
Cc: lsf-pc, Christoph Hellwig, Michal Hocko, Linux MM, linux-rdma,
linux-nvdimm, jgg
On Wed, Jan 24, 2018 at 07:56:02PM -0800, Dan Williams wrote:
> Particular people that would be useful to have in attendance are
> Michal Hocko, Christoph Hellwig, and Jason Gunthorpe (cc'd).
I won't be able to make it - I'll have to do election work and
count the ballots for our city council and mayor election.
--
To unsubscribe, send a message with 'unsubscribe linux-mm' in
the body to majordomo@kvack.org. For more info on Linux MM,
see: http://www.linux-mm.org/ .
Don't email: <a href=mailto:"dont@kvack.org"> email@kvack.org </a>
^ permalink raw reply [flat|nested] 11+ messages in thread
end of thread, other threads:[~2018-02-01 23:27 UTC | newest]
Thread overview: 11+ messages (download: mbox.gz follow: Atom feed
-- links below jump to the message on this page --
2018-01-25 3:56 [LSF/MM TOPIC] Filesystem-DAX, page-pinning, and RDMA Dan Williams
[not found] ` <CAPcyv4gQNM9RbTbRWKnG6Vby_CW9CJ9EZTARsVNi=9cas7ZR2A-JsoAwUIsXosN+BqQ9rBEUg@public.gmane.org>
2018-01-25 4:01 ` Bart Van Assche
[not found] ` <1516852902.3724.4.camel-Sjgp3cTcYWE@public.gmane.org>
2018-01-25 7:02 ` Dan Williams
[not found] ` <CAPcyv4iERedTChineSd-9fYR-xOc6E4L-okj7OnCMmoUkMf0tA-JsoAwUIsXosN+BqQ9rBEUg@public.gmane.org>
2018-01-25 16:08 ` Jason Gunthorpe
2018-01-25 16:47 ` hch
2018-01-29 23:33 ` Jerome Glisse
2018-02-01 23:27 ` Jason Gunthorpe
2018-01-25 7:23 ` Christoph Hellwig
[not found] ` <20180125072351.GA11093-wEGCiKHe2LqWVfeAwA7xHQ@public.gmane.org>
2018-01-25 16:08 ` Jason Gunthorpe
[not found] ` <20180125160802.GD10706-uk2M96/98Pc@public.gmane.org>
2018-01-25 16:47 ` Christoph Hellwig
2018-01-27 2:50 ` Dan Williams
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox