From mboxrd@z Thu Jan 1 00:00:00 1970 From: Mike Rapoport Subject: Re: [PATCH RFC] mm: add MAP_EXCLUSIVE to create exclusive user mappings Date: Tue, 29 Oct 2019 10:32:55 +0100 Message-ID: <20191029093254.GE18773@rapoport-lnx> References: <1572171452-7958-1-git-send-email-rppt@kernel.org> Mime-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: 8bit Return-path: Content-Disposition: inline In-Reply-To: Sender: linux-kernel-owner@vger.kernel.org To: Andy Lutomirski Cc: linux-kernel@vger.kernel.org, Alexey Dobriyan , Andrew Morton , Andy Lutomirski , Arnd Bergmann , Borislav Petkov , Dave Hansen , James Bottomley , Peter Zijlstra , Steven Rostedt , Thomas Gleixner , Ingo Molnar , "H. Peter Anvin" , linux-api@vger.kernel.org, linux-mm@kvack.org, x86@kernel.org, Mike Rapoport List-Id: linux-api@vger.kernel.org On Mon, Oct 28, 2019 at 02:44:23PM -0600, Andy Lutomirski wrote: > > > On Oct 27, 2019, at 4:17 AM, Mike Rapoport wrote: > > > > From: Mike Rapoport > > > > Hi, > > > > The patch below aims to allow applications to create mappins that have > > pages visible only to the owning process. Such mappings could be used to > > store secrets so that these secrets are not visible neither to other > > processes nor to the kernel. > > > > I've only tested the basic functionality, the changes should be verified > > against THP/migration/compaction. Yet, I'd appreciate early feedback. > > I’ve contemplated the concept a fair amount, and I think you should > consider a change to the API. In particular, rather than having it be a > MAP_ flag, make it a chardev. You can, at least at first, allow only > MAP_SHARED, and admins can decide who gets to use it. It might also play > better with the VM overall, and you won’t need a VM_ flag for it — you > can just wire up .fault to do the right thing. I think mmap()/mprotect()/madvise() are the natural APIs for such interface. Switching to a chardev doesn't solve the major problem of direct map fragmentation and defeats the ability to use exclusive memory mappings with the existing allocators, while mprotect() and madvise() do not. -- Sincerely yours, Mike.