From mboxrd@z Thu Jan 1 00:00:00 1970 From: Ingo Molnar Subject: Re: [PATCH v2 00/10] evacuate struct page from the block layer, introduce __pfn_t Date: Thu, 7 May 2015 19:52:30 +0200 Message-ID: <20150507175230.GB21781@gmail.com> References: <20150506200219.40425.74411.stgit@dwillia2-desk3.amr.corp.intel.com> <20150507090217.GA4467@gmail.com> <20150507144225.GA20491@gmail.com> Mime-Version: 1.0 Content-Type: text/plain; charset=us-ascii Cc: Linus Torvalds , Linux Kernel Mailing List , Boaz Harrosh , Jan Kara , Mike Snitzer , Neil Brown , Benjamin Herrenschmidt , Dave Hansen , Heiko Carstens , Chris Mason , Paul Mackerras , "H. Peter Anvin" , Christoph Hellwig , Alasdair Kergon , "linux-nvdimm@lists.01.org" , Mel Gorman , Matthew Wilcox , Ross Zwisler , Rik van Riel , Martin Schwidefsky , Jens Axboe , Theodore Ts'o , "Martin K. Petersen" Return-path: Received: from mail-wg0-f46.google.com ([74.125.82.46]:35301 "EHLO mail-wg0-f46.google.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1751736AbbEGRwh (ORCPT ); Thu, 7 May 2015 13:52:37 -0400 Content-Disposition: inline In-Reply-To: Sender: linux-fsdevel-owner@vger.kernel.org List-ID: * Dan Williams wrote: > > That looks like a layering violation and a mistake to me. If we > > want to do direct (sector_t -> sector_t) IO, with no serialization > > worries, it should have its own (simple) API - which things like > > hierarchical RAID or RDMA APIs could use. > > I'm wrapped around the idea that __pfn_t *is* that simple api for > the tiered storage driver use case. [...] I agree. (see my previous mail) > [...] For RDMA I think we need struct page because I assume that > would be coordinated through a filesystem an truncate() is back in > play. So I don't think RDMA is necessarily special, it's just a weirdly programmed DMA request: - If it is used internally by an exclusively managed complex storage driver, then it can use low level block APIs and pfn_t. - If RDMA is exposed all the way to user-space (do we have such APIs?), allowing users to initiate RDMA IO into user buffers, then (the user visible) buffer needs struct page backing. (which in turn will then at some lower level convert to pfns.) That's true for both regular RAM pages and mmap()-ed persistent RAM pages as well. Thanks, Ingo