From mboxrd@z Thu Jan 1 00:00:00 1970 Return-Path: Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1755201AbbI1BwF (ORCPT ); Sun, 27 Sep 2015 21:52:05 -0400 Received: from cn.fujitsu.com ([59.151.112.132]:21123 "EHLO heian.cn.fujitsu.com" rhost-flags-OK-FAIL-OK-FAIL) by vger.kernel.org with ESMTP id S1752898AbbI1BwD (ORCPT ); Sun, 27 Sep 2015 21:52:03 -0400 X-IronPort-AV: E=Sophos;i="5.15,520,1432569600"; d="scan'208";a="101152263" Message-ID: <56089CDA.3020309@cn.fujitsu.com> Date: Mon, 28 Sep 2015 09:50:18 +0800 From: Tang Chen User-Agent: Mozilla/5.0 (X11; Linux x86_64; rv:24.0) Gecko/20100101 Thunderbird/24.1.0 MIME-Version: 1.0 To: Tejun Heo CC: , , , , , , , , , , , , , , , Subject: Re: [PATCH v2 3/7] x86, gfp: Cache best near node for memory allocation. References: <1441859269-25831-1-git-send-email-tangchen@cn.fujitsu.com> <1441859269-25831-4-git-send-email-tangchen@cn.fujitsu.com> <20150910192935.GI8114@mtj.duckdns.org> <560665DB.7020301@cn.fujitsu.com> <20150926175337.GB3572@htj.duckdns.org> In-Reply-To: <20150926175337.GB3572@htj.duckdns.org> Content-Type: text/plain; charset="ISO-8859-1"; format=flowed Content-Transfer-Encoding: 7bit Sender: linux-kernel-owner@vger.kernel.org List-ID: X-Mailing-List: linux-kernel@vger.kernel.org Hi, tj, On 09/27/2015 01:53 AM, Tejun Heo wrote: > Hello, Tang. > > On Sat, Sep 26, 2015 at 05:31:07PM +0800, Tang Chen wrote: >>>> @@ -307,13 +307,19 @@ static inline struct page *alloc_pages_node(int nid, gfp_t gfp_mask, >>>> if (nid < 0) >>>> nid = numa_node_id(); >>>> + if (!node_online(nid)) >>>> + nid = get_near_online_node(nid); >>>> + >>>> return __alloc_pages(gfp_mask, order, node_zonelist(nid, gfp_mask)); >>>> } >>> Why not just update node_data[]->node_zonelist in the first place? >> zonelist will be rebuilt in __offline_pages() when the zone is not populated >> any more. >> >> Here, getting the best near online node is for those cpus on memory-less >> nodes. >> >> In the original code, if nid is NUMA_NO_NODE, the node the current cpu >> resides in >> will be chosen. And if the node is memory-less node, the cpu will be mapped >> to its >> best near online node. >> >> But this patch-set will map the cpu to its original node, so numa_node_id() >> may return >> a memory-less node to allocator. And then memory allocation may fail. > Correct me if I'm wrong but the zonelist dictates which memory areas > the page allocator is gonna try to from, right? What I'm wondering is > why we aren't handling memory-less nodes by simply updating their > zonelists. I mean, if, say, node 2 is memory-less, its zonelist can > simply point to zones from other nodes, right? What am I missing > here? Oh, yes, you are right. But I remember some time ago, Liu, Jiang has or was going to handle memory less node like this in his patch: https://lkml.org/lkml/2015/8/16/130 BTW, to Liu Jiang, how is your patches going on ? Thanks. > > Thanks. >