From mboxrd@z Thu Jan  1 00:00:00 1970
Return-Path:
Received: from psmtp.com (na3sys010amx169.postini.com [74.125.245.169])
 by kanga.kvack.org (Postfix) with SMTP id 867F06B0068
 for ; Thu, 12 Jul 2012 14:50:55 -0400 (EDT)
Date: Thu, 12 Jul 2012 20:50:31 +0200
From: Andrea Arcangeli
Subject: Re: [PATCH 28/40] autonuma: make khugepaged pte_numa aware
Message-ID: <20120712185031.GN20382@redhat.com>
References: <1340888180-15355-1-git-send-email-aarcange@redhat.com>
 <1340888180-15355-29-git-send-email-aarcange@redhat.com>
 <4FF12284.4040109@redhat.com>
MIME-Version: 1.0
Content-Type: text/plain; charset=us-ascii
Content-Disposition: inline
In-Reply-To: <4FF12284.4040109@redhat.com>
Sender: owner-linux-mm@kvack.org
List-ID:
To: Rik van Riel
Cc: linux-kernel@vger.kernel.org, linux-mm@kvack.org, Hillf Danton,
 Dan Smith, Peter Zijlstra, Linus Torvalds, Andrew Morton,
 Thomas Gleixner, Ingo Molnar, Paul Turner, Suresh Siddha,
 Mike Galbraith, "Paul E. McKenney", Lai Jiangshan, Bharata B Rao,
 Lee Schermerhorn, Johannes Weiner, Srivatsa Vaddagiri,
 Christoph Lameter, Alex Shi, Mauricio Faria de Oliveira,
 Konrad Rzeszutek Wilk, Don Morris, Benjamin Herrenschmidt

On Mon, Jul 02, 2012 at 12:24:36AM -0400, Rik van Riel wrote:
> On 06/28/2012 08:56 AM, Andrea Arcangeli wrote:
> > If any of the ptes that khugepaged is collapsing was a pte_numa, the
> > resulting trans huge pmd will be a pmd_numa too.
>
> Why?
>
> If some of the ptes already got faulted in and made really
> resident again, why do you want to incur a new NUMA fault
> on the newly collapsed hugepage?

If we don't set pmd_numa on the collapsed hugepage, we'll underestimate
the thread NUMA affinity to the node where the hugepage is located (mm
affinity is recorded independently by the NUMA hinting page faults).
Whether that's better or worse depends on luck, I guess; we just lose
information.
I guess overestimating the affinity to a node whose hugepages were just
collapsed is better than underestimating it, more often than not.

I doubt it matters much whether just one pte_numa or all pte_numa are
required to create a pmd_numa. With the pmd scan mode (enabled by
default) we fault in at pmd granularity regardless of THP, so either
way it's the same; this is only an issue when you set
knuma_scand/pmd = 0 at runtime.

> Is there something on we should know about?
>
> If so, could you document it?

I'll add a note.
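
The any-versus-all question discussed above can be modelled in a few
lines of plain C. This is only a toy sketch, not the khugepaged code
from the patch: struct toy_pte, enum collapse_policy and
toy_collapse_is_numa() are made-up names for illustration, and the
"all" policy is a hypothetical alternative, not something the series
implements.

```c
#include <assert.h>
#include <stdbool.h>
#include <stddef.h>

/* Toy stand-in for a pte; NOT the kernel's pte_t. */
struct toy_pte {
	bool present;	/* the pte maps a page */
	bool numa;	/* models pte_numa() returning true */
};

/* Hypothetical policies for marking the collapsed huge pmd. */
enum collapse_policy {
	COLLAPSE_NUMA_IF_ANY,	/* behaviour described in this thread */
	COLLAPSE_NUMA_IF_ALL,	/* assumed alternative, for comparison */
};

/*
 * Decide whether the huge pmd resulting from collapsing @nr ptes
 * should be treated as pmd_numa under the given policy.
 * Non-present ptes are ignored.
 */
bool toy_collapse_is_numa(const struct toy_pte *ptes, size_t nr,
			  enum collapse_policy policy)
{
	size_t present = 0, numa = 0;
	size_t i;

	assert(ptes != NULL && nr > 0);

	for (i = 0; i < nr; i++) {
		if (!ptes[i].present)
			continue;
		present++;
		if (ptes[i].numa)
			numa++;
	}

	if (policy == COLLAPSE_NUMA_IF_ANY)
		return numa > 0;

	/* IF_ALL: every present pte must have been a pte_numa. */
	return present > 0 && numa == present;
}
```

With a range where only one present pte is pte_numa, the "any" policy
marks the collapsed pmd as pmd_numa and the "all" policy does not; when
every present pte is pte_numa the two policies agree.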