public inbox for linux-kernel@vger.kernel.org
 help / color / mirror / Atom feed
From: William Lee Irwin III <wli@holomorphy.com>
To: Andrea Arcangeli <andrea@suse.de>, linux-kernel@vger.kernel.org
Subject: Re: 64GB NUMA-Q after pgcl
Date: Sun, 30 Mar 2003 21:22:14 -0800	[thread overview]
Message-ID: <20030331052214.GV13178@holomorphy.com> (raw)
In-Reply-To: <20030331042729.GQ30140@holomorphy.com>

On Sun, Mar 30, 2003 at 08:27:29PM -0800, William Lee Irwin III wrote:
> I can answer more questions about what goes on to make this happen if
> need be.

I'm just going to start explaining now.

-----------------------------------------      page clustering turns the
|             struct page               |      relationship between base
-----------------------------------------      pages and ptes into 1:N.
  ^   ^   ^   ^   ^   ^   ^   ^   ^   ^        struct pages remain of the
  |   |   |   |   |   |   |   |   |   |        same size, but track a
-----------------------------------------      larger area and are fewer
|PTE|PTE|PTE|PTE|PTE|PTE|PTE|PTE|PTE|PTE|      in number. ptes still point
-----------------------------------------      to the same size areas.


Anonymous pages want smaller than PAGE_SIZE pieces at a time, in fact
exactly 4KB (MMUPAGE_SIZE) to satisfy any particular fault, so we scan
around looking for PTE's to point at as many of the 4KB pieces as we can.

     -------------------------------
                    page
     -------------------------------
     piece | piece | piece | piece |
     -------------------------------
       \        \       \        \
        \        \        \         \
         \        \         \          \
          \        \          \           \
           \        \           \            \
            \        \            \             \
             \        \             \              \
     -------------------------------------------------------------
      PTE  |  PTE  |  PTE  |  PTE  |  PTE  |  PTE  |  PTE  |  PTE
     -------------------------------------------------------------


Miscellaneous side effects happen, like follow_page() and
get_user_pages() need to return pfn's instead of struct pages. Various
address calculations start needing unit conversions. Pagecache lookups
need to add in "subpfn offsets" relative to start of the base page. And
so on and so forth.

The net result should be (and was in Hugh's code) that there is zero
impact on binary compatibility. The smaller EXEC_PAGE_SIZE a.k.a.
MMUPAGE_SIZE is 100% faithfully emulated and the entire affair is fully
transparent to userspace. The maximum filesystem blocksize is increased.
And various O(pages) traversals get linear speedups, and various
O(pages) -sized data structures get linear size reductions.


-- wli

  reply	other threads:[~2003-03-31  5:11 UTC|newest]

Thread overview: 18+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2003-03-28  4:00 64GB NUMA-Q after pgcl William Lee Irwin III
2003-03-28  7:45 ` Zwane Mwaikambo
2003-03-28  7:57   ` William Lee Irwin III
2003-03-28  8:05     ` Zwane Mwaikambo
2003-03-28 10:14       ` William Lee Irwin III
2003-03-28 17:38         ` John Levon
2003-03-30 23:19 ` Andrea Arcangeli
2003-03-31  4:27   ` William Lee Irwin III
2003-03-31  5:22     ` William Lee Irwin III [this message]
2003-03-31 21:02       ` Ingo Oeser
2003-03-31 22:27         ` William Lee Irwin III
2003-04-01  1:25           ` Andrea Arcangeli
2003-03-31 18:35     ` Andrea Arcangeli
2003-03-31 18:41       ` Christoph Hellwig
2003-03-31 19:08         ` William Lee Irwin III
2003-04-01  0:47           ` Andrea Arcangeli
2003-04-01  0:44         ` Andrea Arcangeli
2003-03-31 18:55       ` William Lee Irwin III

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=20030331052214.GV13178@holomorphy.com \
    --to=wli@holomorphy.com \
    --cc=andrea@suse.de \
    --cc=linux-kernel@vger.kernel.org \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox