From mboxrd@z Thu Jan 1 00:00:00 1970 Return-Path: Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1751974Ab3CALaE (ORCPT ); Fri, 1 Mar 2013 06:30:04 -0500 Received: from cn.fujitsu.com ([222.73.24.84]:62407 "EHLO song.cn.fujitsu.com" rhost-flags-OK-FAIL-OK-OK) by vger.kernel.org with ESMTP id S1751588Ab3CALaC (ORCPT ); Fri, 1 Mar 2013 06:30:02 -0500 X-IronPort-AV: E=Sophos;i="4.84,761,1355068800"; d="scan'208";a="6794578" Message-ID: <513091C5.2010109@cn.fujitsu.com> Date: Fri, 01 Mar 2013 19:32:21 +0800 From: Tang Chen User-Agent: Mozilla/5.0 (X11; Linux x86_64; rv:12.0) Gecko/20120430 Thunderbird/12.0.1 MIME-Version: 1.0 To: Yinghai Lu CC: "H. Peter Anvin" , Linus Torvalds , Andrew Morton , Konrad Rzeszutek Wilk , Stefano Stabellini , Yasuaki Ishimatsu , Don Morris , Tim Gardner , Tejun Heo , Tony Luck , Thomas Renninger , Linux Kernel Mailing List , Thomas Gleixner , Ingo Molnar , Benjamin Herrenschmidt Subject: Re: sched: CPU #1's llc-sibling CPU #0 is not on the same node! References: <512B7D10.4060304@tpi.com> <512D58C2.1090403@jp.fujitsu.com> <512D7FAD.1040003@jp.fujitsu.com> <512D8EDA.3010602@jp.fujitsu.com> <512DBD24.7090302@cn.fujitsu.com> <20130227132612.14664a3a.akpm@linux-foundation.org> <51302481.9070005@cn.fujitsu.com> <513030AF.70208@zytor.com> In-Reply-To: X-MIMETrack: Itemize by SMTP Server on mailserver/fnst(Release 8.5.3|September 15, 2011) at 2013/03/01 19:29:03, Serialize by Router on mailserver/fnst(Release 8.5.3|September 15, 2011) at 2013/03/01 19:29:04, Serialize complete at 2013/03/01 19:29:04 Content-Transfer-Encoding: 7bit Content-Type: text/plain; charset=ISO-8859-1; format=flowed Sender: linux-kernel-owner@vger.kernel.org List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On 03/01/2013 03:43 PM, Yinghai Lu wrote: > Please check attached patches. > > Plan A. revert all 8 patches: > revert_movablemem_map.patch > > Plan B. fix movablemem_map: > kill_max_low_pfn_mapped.patch and fix_movablemem_map.patch > > fix_movablemem_map.patch is too risky, and need more test. > Hi Yinghai, In your Plan B, you allocated pgdat on local node, right ? - nd_pa = memblock_alloc_try_nid(nd_size, SMP_CACHE_BYTES, nid); + nd_pa = memblock_find_in_range_node(start, end, nd_size, + SMP_CACHE_BYTES, nid); ---------------- Here, right ? Without movablemem_map, pgdat will be allocated successfully on local node, right ? If so, this will prevent node hot-plug, because as mentioned by Kamezawa, there is no way to ensure pgdat is not used by others on stack. I do hope you can stop putting pgdat and zone on local node for now. And improve it in the future. And I also hope you can apply my revert SRAT patch first, and then do your work. It will seem more clean to me. Thanks. :)