linux-fsdevel.vger.kernel.org archive mirror
* [LSF/MM TOPIC] mmap_sem in ->fault and ->page_mkwrite
From: Jan Kara @ 2013-01-31 22:23 UTC
  To: lsf-pc; +Cc: linux-mm, linux-fsdevel

  Hi,

  I'm not sure if this is such a great topic, but it's a question I have
come across a few times already, and LSF/MM is a good place for
brainstorming somewhat crazy ideas ;).

So currently ->fault() and ->page_mkwrite() are called with mmap_sem held
for reading. This sometimes creates unpleasant locking dependencies for
filesystems (modern filesystems have to do an equivalent of ->write_begin
in ->page_mkwrite, and that is a non-trivial operation). Just to mention my
latest itch, I had to split the reader side of the filesystem freezing lock
into two locks: one which ranks above mmap_sem and one which ranks below it.
The writer side then has to wait for both locks. It works but ...

So I was wondering: would it be possible to drop mmap_sem in these two
callbacks (especially ->page_mkwrite())? I understand that the process's
mappings can change under us once we drop the semaphore, so we would have to
recheck that we still have the right page after retaking mmap_sem. For
example, we could protect VMAs with SRCU so that they don't disappear under
us once we drop mmap_sem, and after retaking mmap_sem we would recheck
whether the VMA still applies to our fault.

And I know there's VM_FAULT_RETRY, but that really seems like a special hack
for the x86 page fault code. Making it work for all architectures and for
callers such as get_user_pages() didn't really seem plausible to me.

								Honza
-- 
Jan Kara <jack@suse.cz>
SUSE Labs, CR


Thread overview: 4+ messages
2013-01-31 22:23 [LSF/MM TOPIC] mmap_sem in ->fault and ->page_mkwrite Jan Kara
2013-01-31 23:03 ` Al Viro
2013-02-04 18:04   ` Jan Kara
2013-02-09  5:51     ` Michel Lespinasse
