From mboxrd@z Thu Jan 1 00:00:00 1970 Return-Path: Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1758933Ab2CSN1L (ORCPT ); Mon, 19 Mar 2012 09:27:11 -0400 Received: from casper.infradead.org ([85.118.1.10]:42521 "EHLO casper.infradead.org" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1755121Ab2CSN1J convert rfc822-to-8bit (ORCPT ); Mon, 19 Mar 2012 09:27:09 -0400 Message-ID: <1332163591.18960.334.camel@twins> Subject: Re: [RFC][PATCH 00/26] sched/numa From: Peter Zijlstra To: Andrea Arcangeli Cc: Avi Kivity , Linus Torvalds , Andrew Morton , Thomas Gleixner , Ingo Molnar , Paul Turner , Suresh Siddha , Mike Galbraith , "Paul E. McKenney" , Lai Jiangshan , Dan Smith , Bharata B Rao , Lee Schermerhorn , Rik van Riel , Johannes Weiner , linux-kernel@vger.kernel.org, linux-mm@kvack.org Date: Mon, 19 Mar 2012 14:26:31 +0100 In-Reply-To: <20120319130401.GI24602@redhat.com> References: <20120316144028.036474157@chello.nl> <4F670325.7080700@redhat.com> <1332155527.18960.292.camel@twins> <20120319130401.GI24602@redhat.com> Content-Type: text/plain; charset=US-ASCII Content-Transfer-Encoding: 7BIT X-Mailer: Evolution 3.2.2- Mime-Version: 1.0 Sender: linux-kernel-owner@vger.kernel.org List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On Mon, 2012-03-19 at 14:04 +0100, Andrea Arcangeli wrote: > If you boot with memcg compiled in, that's taking an equivalent amount > of memory per-page. > > If you can bear the memory loss when memcg is compiled in even when > not enabled, you sure can bear it on NUMA systems that have lots of > memory, so it's perfectly ok to sacrifice a bit of it so that it > performs like not-NUMA but you still have more memory than not-NUMA. > I think the overhead of memcg is quite insane as well. And no I cannot bear that and have it disabled in all my kernels. NUMA systems having lots of memory is a false argument, that doesn't mean we can just waste tons of it, people pay good money for that memory, they want to use it. I fact, I know that HPC people want things like swap-over-nfs so they can push infrequently running system crap out into swap so they can get these few extra megabytes of memory. And you're proposing they give up ~100M just like that?