From: Daniel Jordan <daniel.m.jordan@oracle.com>
To: linux-mm@kvack.org, linux-kernel@vger.kernel.org
Cc: aaron.lu@intel.com, akpm@linux-foundation.org,
dave.hansen@linux.intel.com, mgorman@techsingularity.net,
mhocko@kernel.org, mike.kravetz@oracle.com,
pasha.tatashin@oracle.com, steven.sistare@oracle.com,
tim.c.chen@intel.com
Subject: [RFC PATCH v2 0/7] ktask: multithread cpu-intensive kernel work
Date: Thu, 24 Aug 2017 16:49:57 -0400 [thread overview]
Message-ID: <20170824205004.18502-1-daniel.m.jordan@oracle.com> (raw)
ktask is a generic framework for parallelizing cpu-intensive work in the
kernel. The intended use is for big machines that can use their cpu power
to speed up large tasks that can't otherwise be multithreaded in userland.
The API is generic enough to add concurrency to many different kinds of
tasks--for example, zeroing a range of pages or evicting a list of
inodes--and aims to save its clients the trouble of splitting up the work,
choosing the number of threads to use, starting these threads, and load
balancing the work between them.
Why do we need ktask when the kernel has other APIs for managing
concurrency? After all, kthread_workers and workqueues already provide ways
to start threads, and the kernel can handle large tasks with a single thread
by periodically yielding the cpu with cond_resched or doing the work in
fixed size batches.
Of the existing concurrency facilities, kthread_worker isn't suited for
providing parallelism because each comes with only a single thread.
Workqueues are a better fit for this, and in fact ktask is built on an
unbound workqueue, but workqueues aren't designed for splitting up a large
task. ktask instead uses unbound workqueue threads to run "chunks" of a
task.
More background is available in the documentation commit (first commit of the
series).
This patchset is based on 4.13-rc6 and contains three ktask users so far, with
more to come:
- clearing gigantic pages
- fallocate for HugeTLB pages
- deferred struct page initialization at boot time
The core ktask code is based on work by Pavel Tatashin, Steve Sistare, and
Jonathan Adams.
v1 -> v2:
- Added deferred struct page initialization use case.
- Explained the source of the performance improvement from parallelizing
clear_gigantic_page (comment from Dave Hansen).
- Fixed Documentation and build warnings from CONFIG_KTASK=n kernels.
link to v1: https://lkml.org/lkml/2017/7/14/666
Daniel Jordan (7):
ktask: add documentation
ktask: multithread cpu-intensive kernel work
ktask: add /proc/sys/debug/ktask_max_threads
mm: enlarge type of offset argument in mem_map_offset and mem_map_next
mm: parallelize clear_gigantic_page
hugetlbfs: parallelize hugetlbfs_fallocate with ktask
mm: parallelize deferred struct page initialization within each node
Documentation/core-api/index.rst | 1 +
Documentation/core-api/ktask.rst | 104 ++++++++++
fs/hugetlbfs/inode.c | 117 +++++++++---
include/linux/ktask.h | 235 +++++++++++++++++++++++
include/linux/ktask_internal.h | 19 ++
include/linux/mm.h | 6 +
init/Kconfig | 7 +
init/main.c | 2 +
kernel/Makefile | 2 +-
kernel/ktask.c | 396 +++++++++++++++++++++++++++++++++++++++
kernel/sysctl.c | 10 +
mm/internal.h | 7 +-
mm/memory.c | 35 +++-
mm/page_alloc.c | 174 ++++++++++-------
14 files changed, 1014 insertions(+), 101 deletions(-)
create mode 100644 Documentation/core-api/ktask.rst
create mode 100644 include/linux/ktask.h
create mode 100644 include/linux/ktask_internal.h
create mode 100644 kernel/ktask.c
--
2.12.2
--
To unsubscribe, send a message with 'unsubscribe linux-mm' in
the body to majordomo@kvack.org. For more info on Linux MM,
see: http://www.linux-mm.org/ .
Don't email: <a href=mailto:"dont@kvack.org"> email@kvack.org </a>
next reply other threads:[~2017-08-24 20:48 UTC|newest]
Thread overview: 10+ messages / expand[flat|nested] mbox.gz Atom feed top
2017-08-24 20:49 Daniel Jordan [this message]
2017-08-24 20:49 ` [RFC PATCH v2 1/7] ktask: add documentation Daniel Jordan
2017-08-24 23:07 ` Randy Dunlap
2017-08-25 14:12 ` Daniel Jordan
2017-08-24 20:49 ` [RFC PATCH v2 2/7] ktask: multithread cpu-intensive kernel work Daniel Jordan
2017-08-24 20:50 ` [RFC PATCH v2 3/7] ktask: add /proc/sys/debug/ktask_max_threads Daniel Jordan
2017-08-24 20:50 ` [RFC PATCH v2 4/7] mm: enlarge type of offset argument in mem_map_offset and mem_map_next Daniel Jordan
2017-08-24 20:50 ` [RFC PATCH v2 5/7] mm: parallelize clear_gigantic_page Daniel Jordan
2017-08-24 20:50 ` [RFC PATCH v2 6/7] hugetlbfs: parallelize hugetlbfs_fallocate with ktask Daniel Jordan
2017-08-24 20:50 ` [RFC PATCH v2 7/7] mm: parallelize deferred struct page initialization within each node Daniel Jordan
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=20170824205004.18502-1-daniel.m.jordan@oracle.com \
--to=daniel.m.jordan@oracle.com \
--cc=aaron.lu@intel.com \
--cc=akpm@linux-foundation.org \
--cc=dave.hansen@linux.intel.com \
--cc=linux-kernel@vger.kernel.org \
--cc=linux-mm@kvack.org \
--cc=mgorman@techsingularity.net \
--cc=mhocko@kernel.org \
--cc=mike.kravetz@oracle.com \
--cc=pasha.tatashin@oracle.com \
--cc=steven.sistare@oracle.com \
--cc=tim.c.chen@intel.com \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).