From: Lukas Senger <lukas@fridolin.com>
To: Dave Hansen <dave.hansen@linux.intel.com>
Cc: Andrew Morton <akpm@linux-foundation.org>,
Michal Hocko <mhocko@suse.cz>,
Matthias Wirth <matthias.wirth@gmail.com>,
Matthew Wilcox <matthew@wil.cx>, Jeff Layton <jlayton@redhat.com>,
"J. Bruce Fields" <bfields@fieldses.org>,
Johannes Weiner <hannes@cmpxchg.org>,
Rik van Riel <riel@redhat.com>, Lisa Du <cldu@marvell.com>,
Paul Mackerras <paulus@samba.org>,
Sasha Levin <sasha.levin@oracle.com>,
Benjamin Herrenschmidt <benh@kernel.crashing.org>,
Fengguang Wu <fengguang.wu@intel.com>,
Shaohua Li <shli@kernel.org>,
Alexey Kardashevskiy <aik@ozlabs.ru>,
Minchan Kim <minchan@kernel.org>,
"Kirill A. Shutemov" <kirill.shutemov@linux.intel.com>,
Al Viro <viro@zeniv.linux.org.uk>,
Steven Whitehouse <swhiteho@redhat.com>,
Mel Gorman <mgorman@suse.de>,
Cody P Schafer <cody@linux.vnet.ibm.com>,
Jiang Liu <liuj97@gmail.com>,
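David Rientjes <rientjes@google.com>,
"Srivatsa S. Bhat" <srivatsa.bhat@linux.vnet.ibm.com>,
Zhang Yanfei <zhangyanfei@cn.fujitsu.com>,
Raghavendra K T <raghavendra.kt@linux.vnet.ibm.com>,
Lukas Czerner <lczerner@redhat.com>,
Damien Ramonda <damien.ramonda@intel.com>,
Mark Rutland <mark.rutland@arm.com>,
linux-fsdevel@vger.kernel.org, linux-kernel@vger.kernel.org,
linux-mm@kvack.org, i4passt <i4passt@lists.cs.fau.de>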
Subject: Re: [PATCH] mm: implement POSIX_FADV_NOREUSE
Date: Thu, 13 Mar 2014 13:40:53 +0100
Message-ID: <1394714453.542.38.camel@dinghy>
In-Reply-To: <532085E3.5030904@linux.intel.com>
> But, why wouldn't this work there? Define a percpu variable, and assign
> it to the target page in readahead's read_pages() and in
> do_generic_file_read() which deal with pages one at a time and not in lists.
>
>     struct page *read_me_once;
>     void hint_page_read_once(struct page *page)
>     {
>         read_me_once = page;
>     }
>
> Then check for (read_me_once == page) in add_page_to_lru_list() instead
> of the page flag. Then, make read_me_once per-cpu. This won't be
> preempt safe, but we're talking about readahead and hints here, so we
> can probably just bail in the cases where we race.

Thanks for clarifying that. The problem now is that by the time we get
to add_page_to_lru_list() we're dealing with multiple pages again, because
of the buffering in pagevecs. We could instead do the (read_me_once == page)
check in __lru_cache_add() and then add the page to a (new)
lru_add_tail_pvec that adds its pages to the tail of the LRU lists.
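
To make the idea concrete, here is a minimal userspace model of it, not
kernel code. Only read_me_once, hint_page_read_once() and
lru_add_tail_pvec come from the discussion above; lru_cache_add_sketch(),
drain_pagevec(), drain_all(), the array-backed struct lru_list and the
sizes are made up for illustration. Locking, preemption and the real
per-CPU machinery are left out, and only a single "CPU" is modelled.

```c
#include <assert.h>
#include <stddef.h>
#include <string.h>

#define PVEC_SIZE 14  /* illustrative batch size */
#define LRU_MAX   64  /* capacity of the toy list */

struct page { int id; };

/* Toy LRU list: pages[0] is the tail (reclaimed first), pages[count-1] the head. */
struct lru_list {
    struct page *pages[LRU_MAX];
    size_t count;
};

struct pagevec {
    struct page *pages[PVEC_SIZE];
    size_t nr;
};

/* Stand-ins for per-CPU variables; only one "CPU" is modelled here. */
static struct page *read_me_once;
static struct pagevec lru_add_pvec;
static struct pagevec lru_add_tail_pvec;  /* the hypothetical new pagevec */

void hint_page_read_once(struct page *page)
{
    read_me_once = page;
}

/* Flush one pagevec into the list, either at the head or at the tail. */
void drain_pagevec(struct lru_list *lru, struct pagevec *pvec, int to_tail)
{
    for (size_t i = 0; i < pvec->nr; i++) {
        assert(lru->count < LRU_MAX);
        if (to_tail) {
            /* Shift everything up by one slot and insert at the tail. */
            memmove(&lru->pages[1], &lru->pages[0],
                    lru->count * sizeof(lru->pages[0]));
            lru->pages[0] = pvec->pages[i];
        } else {
            lru->pages[lru->count] = pvec->pages[i];
        }
        lru->count++;
    }
    pvec->nr = 0;
}

/* Flush both pagevecs: regular pages first, then the hinted ones. */
void drain_all(struct lru_list *lru)
{
    drain_pagevec(lru, &lru_add_pvec, 0);
    drain_pagevec(lru, &lru_add_tail_pvec, 1);
}

/* Models the check moved into __lru_cache_add(): choose the pagevec per page. */
void lru_cache_add_sketch(struct lru_list *lru, struct page *page)
{
    int to_tail = (read_me_once == page);
    struct pagevec *pvec = to_tail ? &lru_add_tail_pvec : &lru_add_pvec;

    if (to_tail)
        read_me_once = NULL;  /* the hint covers exactly one page */
    pvec->pages[pvec->nr++] = page;
    if (pvec->nr == PVEC_SIZE)
        drain_pagevec(lru, pvec, to_tail);
}
```

The point of the model is that the decision is made while we still see
a single page, and the batching afterwards no longer matters: a hinted
page ends up at the tail of the list regardless of which other pages
share its pagevec.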

If this way isn't feasible, we'll take a look at Andrew and Michal's
DONTNEED lite idea. However, with a DONTNEED lite implemented in
posix_fadvise(), the syscall would be more cumbersome for application
programmers to use: they would need to call it after every read.
The tail-pvec approach only needs a single syscall after open, as do
NORMAL, SEQUENTIAL and RANDOM. Furthermore, we suspect that implementing
it in a way that respects other processes (unlike DONTNEED) won't be
much simpler than the tail-pvec approach.
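
From the application side, the difference in usage would look roughly
like the sketch below. posix_fadvise(), POSIX_FADV_NOREUSE and
POSIX_FADV_DONTNEED are the existing POSIX interface; the helper names
read_all_noreuse() and read_all_dontneed() and the 4096-byte buffer are
made up for illustration. What the kernel actually does with the NOREUSE
hint is the subject of this thread, so the first helper only shows the
calling pattern, not any guaranteed caching behaviour.

```c
#ifndef _POSIX_C_SOURCE
#define _POSIX_C_SOURCE 200809L
#endif
#include <assert.h>
#include <fcntl.h>
#include <sys/types.h>
#include <unistd.h>

/* One hint right after open(), then plain reads. Returns bytes read or -1. */
ssize_t read_all_noreuse(const char *path)
{
    char buf[4096];
    ssize_t n, total = 0;
    int fd = open(path, O_RDONLY);

    if (fd < 0)
        return -1;
    /* offset 0, len 0 means "the whole file"; the hint is advisory only. */
    (void)posix_fadvise(fd, 0, 0, POSIX_FADV_NOREUSE);
    while ((n = read(fd, buf, sizeof buf)) > 0)
        total += n;
    close(fd);
    return n < 0 ? -1 : total;
}

/* DONTNEED-style usage: advise again after every read for the range just consumed. */
ssize_t read_all_dontneed(const char *path)
{
    char buf[4096];
    ssize_t n, total = 0;
    off_t off = 0;
    int fd = open(path, O_RDONLY);

    if (fd < 0)
        return -1;
    while ((n = read(fd, buf, sizeof buf)) > 0) {
        (void)posix_fadvise(fd, off, n, POSIX_FADV_DONTNEED);
        off += n;
        total += n;
    }
    close(fd);
    return n < 0 ? -1 : total;
}
```

The second helper has to track the offset itself and issues one extra
syscall per read, which is the overhead we would like to spare
application programmers.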