From mboxrd@z Thu Jan 1 00:00:00 1970 Return-Path: Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1756110Ab3BVEqe (ORCPT ); Thu, 21 Feb 2013 23:46:34 -0500 Received: from mga02.intel.com ([134.134.136.20]:10058 "EHLO mga02.intel.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1755730Ab3BVEqd (ORCPT ); Thu, 21 Feb 2013 23:46:33 -0500 X-ExtLoop1: 1 X-IronPort-AV: E=Sophos;i="4.84,713,1355126400"; d="scan'208";a="266040458" Message-ID: <5126F83E.9060206@intel.com> Date: Fri, 22 Feb 2013 12:46:54 +0800 From: Alex Shi User-Agent: Mozilla/5.0 (X11; Linux x86_64; rv:15.0) Gecko/20120912 Thunderbird/15.0.1 MIME-Version: 1.0 To: Michael Wang CC: Peter Zijlstra , LKML , Ingo Molnar , Paul Turner , Mike Galbraith , Andrew Morton , Ram Pai , "Nikunj A. Dadhania" , Namhyung Kim Subject: Re: [RFC PATCH v3 1/3] sched: schedule balance map foundation References: <51079178.3070002@linux.vnet.ibm.com> <510791B2.6090506@linux.vnet.ibm.com> <1361366720.10155.25.camel@laptop> <5125A966.6040601@linux.vnet.ibm.com> <1361446661.26780.15.camel@laptop> <5126DD98.7030202@linux.vnet.ibm.com> <5126E705.3040308@intel.com> <5126F1D4.5030308@linux.vnet.ibm.com> In-Reply-To: <5126F1D4.5030308@linux.vnet.ibm.com> Content-Type: text/plain; charset=UTF-8 Content-Transfer-Encoding: 7bit Sender: linux-kernel-owner@vger.kernel.org List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On 02/22/2013 12:19 PM, Michael Wang wrote: > >> > Why not seek other way to change O(n^2) to O(n)? >> > >> > Access 2G memory is unbelievable performance cost. > Not access 2G memory, but (2G / 16K) memory, the sbm size is O(N). > > And please notice that on 16k cpus system, topology will be deep if NUMA > enabled (O(log N) as Peter said), and that's really a good stage for > this idea to perform on, we could save lot's of recursed 'for' cycles. > CPU execute part is very very fast compare to the memory access, the 'for' cycles cost is most on the memory access for many domain/groups data, not instruction execution. In a hot patch, several KB memory access will cause clear cpu cache pollution then make kernel slowly. -- Thanks Alex