All of lore.kernel.org
 help / color / mirror / Atom feed
From: Avi Kivity <avi@redhat.com>
To: Christopher Yeoh <cyeoh@au1.ibm.com>
Cc: linux-kernel@vger.kernel.org,
	Linux Memory Management List <linux-mm@kvack.org>,
	Ingo Molnar <mingo@elte.hu>
Subject: Re: [RFC][PATCH] Cross Memory Attach
Date: Wed, 15 Sep 2010 12:58:15 +0200	[thread overview]
Message-ID: <4C90A6C7.9050607@redhat.com> (raw)
In-Reply-To: <20100915104855.41de3ebf@lilo>

  On 09/15/2010 03:18 AM, Christopher Yeoh wrote:
> The basic idea behind cross memory attach is to allow MPI programs doing
> intra-node communication to do a single copy of the message rather than
> a double copy of the message via shared memory.

If the host has a dma engine (many modern ones do) you can reduce this 
to zero copies (at least, zero processor copies).

> The following patch attempts to achieve this by allowing a
> destination process, given an address and size from a source process, to
> copy memory directly from the source process into its own address space
> via a system call. There is also a symmetrical ability to copy from
> the current process's address space into a destination process's
> address space.
>
>

Instead of those two syscalls, how about a vmfd(pid_t pid, ulong start, 
ulong len) system call which returns an file descriptor that represents 
a portion of the process address space.  You can then use preadv() and 
pwritev() to copy memory, and io_submit(IO_CMD_PREADV) and 
io_submit(IO_CMD_PWRITEV) for asynchronous variants (especially useful 
with a dma engine, since that adds latency).

With some care (and use of mmu_notifiers) you can even mmap() your vmfd 
and access remote process memory directly.

A nice property of file descriptors is that you can pass them around 
securely via SCM_RIGHTS.  So a process can create a window into its 
address space and pass it to other processes.

(or you could just use a shared memory object and pass it around)

-- 
I have a truly marvellous patch that fixes the bug which this
signature is too narrow to contain.


WARNING: multiple messages have this Message-ID (diff)
From: Avi Kivity <avi@redhat.com>
To: Christopher Yeoh <cyeoh@au1.ibm.com>
Cc: linux-kernel@vger.kernel.org,
	Linux Memory Management List <linux-mm@kvack.org>,
	Ingo Molnar <mingo@elte.hu>
Subject: Re: [RFC][PATCH] Cross Memory Attach
Date: Wed, 15 Sep 2010 12:58:15 +0200	[thread overview]
Message-ID: <4C90A6C7.9050607@redhat.com> (raw)
In-Reply-To: <20100915104855.41de3ebf@lilo>

  On 09/15/2010 03:18 AM, Christopher Yeoh wrote:
> The basic idea behind cross memory attach is to allow MPI programs doing
> intra-node communication to do a single copy of the message rather than
> a double copy of the message via shared memory.

If the host has a dma engine (many modern ones do) you can reduce this 
to zero copies (at least, zero processor copies).

> The following patch attempts to achieve this by allowing a
> destination process, given an address and size from a source process, to
> copy memory directly from the source process into its own address space
> via a system call. There is also a symmetrical ability to copy from
> the current process's address space into a destination process's
> address space.
>
>

Instead of those two syscalls, how about a vmfd(pid_t pid, ulong start, 
ulong len) system call which returns an file descriptor that represents 
a portion of the process address space.  You can then use preadv() and 
pwritev() to copy memory, and io_submit(IO_CMD_PREADV) and 
io_submit(IO_CMD_PWRITEV) for asynchronous variants (especially useful 
with a dma engine, since that adds latency).

With some care (and use of mmu_notifiers) you can even mmap() your vmfd 
and access remote process memory directly.

A nice property of file descriptors is that you can pass them around 
securely via SCM_RIGHTS.  So a process can create a window into its 
address space and pass it to other processes.

(or you could just use a shared memory object and pass it around)

-- 
I have a truly marvellous patch that fixes the bug which this
signature is too narrow to contain.

--
To unsubscribe, send a message with 'unsubscribe linux-mm' in
the body to majordomo@kvack.org.  For more info on Linux MM,
see: http://www.linux-mm.org/ .
Don't email: <a href=mailto:"dont@kvack.org"> email@kvack.org </a>

  parent reply	other threads:[~2010-09-15 10:58 UTC|newest]

Thread overview: 62+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2010-09-15  1:18 [RFC][PATCH] Cross Memory Attach Christopher Yeoh
2010-09-15  8:02 ` Ingo Molnar
2010-09-15  8:02   ` Ingo Molnar
2010-09-15  8:16   ` Ingo Molnar
2010-09-15  8:16     ` Ingo Molnar
2010-09-15 13:23     ` Christopher Yeoh
2010-09-15 13:23       ` Christopher Yeoh
2010-09-15 13:20   ` Christopher Yeoh
2010-09-15 13:20     ` Christopher Yeoh
2010-09-15 10:58 ` Avi Kivity [this message]
2010-09-15 10:58   ` Avi Kivity
2010-09-15 13:51   ` Ingo Molnar
2010-09-15 13:51     ` Ingo Molnar
2010-09-15 16:10     ` Avi Kivity
2010-09-15 16:10       ` Avi Kivity
2010-09-15 14:42   ` Christopher Yeoh
2010-09-15 14:42     ` Christopher Yeoh
2010-09-15 14:52     ` Linus Torvalds
2010-09-15 14:52       ` Linus Torvalds
2010-09-15 15:44       ` Robin Holt
2010-09-15 15:44         ` Robin Holt
2010-09-16  6:32     ` Brice Goglin
2010-09-16  6:32       ` Brice Goglin
2010-09-16  9:15       ` Brice Goglin
2010-09-16  9:15         ` Brice Goglin
2010-09-16 14:00         ` Christopher Yeoh
2010-09-16 14:00           ` Christopher Yeoh
2010-09-15 14:46   ` Bryan Donlan
2010-09-15 14:46     ` Bryan Donlan
2010-09-15 16:13     ` Avi Kivity
2010-09-15 16:13       ` Avi Kivity
2010-09-15 19:35       ` Eric W. Biederman
2010-09-15 19:35         ` Eric W. Biederman
2010-09-16  1:18     ` Christopher Yeoh
2010-09-16  1:18       ` Christopher Yeoh
2010-09-16  9:26       ` Avi Kivity
2010-09-16  9:26         ` Avi Kivity
2010-11-02  3:37         ` Christopher Yeoh
2010-11-02  3:37           ` Christopher Yeoh
2010-11-02 11:10           ` Avi Kivity
2010-11-02 11:10             ` Avi Kivity
2010-09-16  1:58     ` KOSAKI Motohiro
2010-09-16  1:58       ` KOSAKI Motohiro
2010-09-16  8:08       ` Ingo Molnar
2010-09-16  8:08         ` Ingo Molnar
2010-09-15 15:11 ` Linus Torvalds
2010-09-15 15:14   ` Linus Torvalds
2010-09-16  2:25     ` Christopher Yeoh
2010-09-16 16:27   ` Peter Zijlstra
2010-09-16 16:54     ` Linus Torvalds
2010-09-16 17:13       ` Peter Zijlstra
2010-09-16 17:34         ` Linus Torvalds
2010-09-16 17:47           ` Peter Zijlstra
2010-09-16 17:54             ` Linus Torvalds
2010-09-16 18:00               ` Linus Torvalds
2010-09-19  4:44                 ` Yuhong Bao
2010-09-19 19:20               ` Yuhong Bao
2010-09-19 21:48                 ` Russell King - ARM Linux
2010-09-19 22:47                   ` Yuhong Bao
2010-09-19  4:55           ` Yuhong Bao
2010-09-15 16:07 ` Valdis.Kletnieks
2010-09-16  2:17   ` Christopher Yeoh

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=4C90A6C7.9050607@redhat.com \
    --to=avi@redhat.com \
    --cc=cyeoh@au1.ibm.com \
    --cc=linux-kernel@vger.kernel.org \
    --cc=linux-mm@kvack.org \
    --cc=mingo@elte.hu \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.