From mboxrd@z Thu Jan 1 00:00:00 1970 Return-Path: Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1030857Ab2CSNlU (ORCPT ); Mon, 19 Mar 2012 09:41:20 -0400 Received: from mx1.redhat.com ([209.132.183.28]:63719 "EHLO mx1.redhat.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S932233Ab2CSNlT (ORCPT ); Mon, 19 Mar 2012 09:41:19 -0400 Date: Mon, 19 Mar 2012 14:40:29 +0100 From: Andrea Arcangeli To: Avi Kivity Cc: Peter Zijlstra , Linus Torvalds , Andrew Morton , Thomas Gleixner , Ingo Molnar , Paul Turner , Suresh Siddha , Mike Galbraith , "Paul E. McKenney" , Lai Jiangshan , Dan Smith , Bharata B Rao , Lee Schermerhorn , Rik van Riel , Johannes Weiner , linux-kernel@vger.kernel.org, linux-mm@kvack.org Subject: Re: [RFC][PATCH 00/26] sched/numa Message-ID: <20120319134029.GK24602@redhat.com> References: <20120316144028.036474157@chello.nl> <4F670325.7080700@redhat.com> <1332155527.18960.292.camel@twins> <4F671B90.3010209@redhat.com> MIME-Version: 1.0 Content-Type: text/plain; charset=us-ascii Content-Disposition: inline In-Reply-To: <4F671B90.3010209@redhat.com> Sender: linux-kernel-owner@vger.kernel.org List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On Mon, Mar 19, 2012 at 01:42:08PM +0200, Avi Kivity wrote: > Extra work, and more slowness until they get rebuilt. Why not migrate > entire large pages? The main problem is the double copy, first copy for migrate, second for khugepaged. This is why we want it native over time. So it also only stops the accesses to the pages for a shorter period of time. > I agree with this, but it's really widespread throughout the kernel, > from interrupts to work items to background threads. It needs to be > solved generically (IIRC vhost has some accouting fix for a similar issue). Exactly. > It's the standard space/time tradeoff. Once solution wants more > storage, the other wants more faults. I didn't grow it much more than memcg, and at least if you boot on NUMA hardware you'll be sure to use AutoNUMA. The fact it's in the struct page it's an implementation detail, it'll only be allocated if the kernel is booted on NUMA hardware later.