From mboxrd@z Thu Jan 1 00:00:00 1970 Return-Path: Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1759585AbcHEImS (ORCPT ); Fri, 5 Aug 2016 04:42:18 -0400 Received: from outbound-smtp11.blacknight.com ([46.22.139.16]:59726 "EHLO outbound-smtp11.blacknight.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1759414AbcHEIlm (ORCPT ); Fri, 5 Aug 2016 04:41:42 -0400 Date: Fri, 5 Aug 2016 09:41:15 +0100 From: Mel Gorman To: James Hogan Cc: Andrew Morton , Linux-MM , Rik van Riel , Vlastimil Babka , Johannes Weiner , Minchan Kim , Joonsoo Kim , LKML , metag Subject: Re: [PATCH 03/34] mm, vmscan: move LRU lists to node Message-ID: <20160805084115.GO2799@techsingularity.net> References: <1467970510-21195-1-git-send-email-mgorman@techsingularity.net> <1467970510-21195-4-git-send-email-mgorman@techsingularity.net> MIME-Version: 1.0 Content-Type: text/plain; charset=iso-8859-15 Content-Disposition: inline In-Reply-To: User-Agent: Mutt/1.5.23 (2014-03-12) Sender: linux-kernel-owner@vger.kernel.org List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On Thu, Aug 04, 2016 at 09:59:17PM +0100, James Hogan wrote: > > Signed-off-by: Mel Gorman > > Acked-by: Johannes Weiner > > Acked-by: Vlastimil Babka > > This breaks boot on metag architecture: > Oops: err 0007 (Data access general read/write fault) addr 00233008 [#1] > > It appears to be in node_page_state_snapshot() (via > pgdat_reclaimable()), and have come via mm_init. Here's the relevant > bit of the backtrace: > > node_page_state_snapshot@0x4009c884(enum node_stat_item item = > ???, struct pglist_data * pgdat = ???) + 0x48 > pgdat_reclaimable(struct pglist_data * pgdat = 0x402517a0) > show_free_areas(unsigned int filter = 0) + 0x2cc > show_mem(unsigned int filter = 0) + 0x18 > mm_init@0x4025c3d4() > start_kernel() + 0x204 > > __per_cpu_offset[0] == 0x233000 (close to bad addr), > pgdat->per_cpu_nodestats = NULL. and setup_per_cpu_pageset() > definitely hasn't been called yet (mm_init is called before > setup_per_cpu_pageset()). > > Any ideas what the correct solution is (and why presumably others > haven't seen the same issue on other architectures?). > metag calls show_mem in mem_init() before the pagesets are initialised. What's surprising is that it worked for the zone stats as it appears that calling zone_reclaimable() from that context should also have broken. Did anything change recently that would have avoided the zone->pageset dereference in zone_reclaimable() before? The easiest option would be to not call show_mem from arch code until after the pagesets are setup. -- Mel Gorman SUSE Labs