From mboxrd@z Thu Jan 1 00:00:00 1970 Return-Path: Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1756274AbcGZP6z (ORCPT ); Tue, 26 Jul 2016 11:58:55 -0400 Received: from mga11.intel.com ([192.55.52.93]:29990 "EHLO mga11.intel.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1753112AbcGZP6x (ORCPT ); Tue, 26 Jul 2016 11:58:53 -0400 X-ExtLoop1: 1 X-IronPort-AV: E=Sophos;i="5.28,425,1464678000"; d="scan'208";a="1024268944" Subject: Re: [RFC PATCH] mm/hugetlb: Avoid soft lockup in set_max_huge_pages() To: Jia He , linux-mm@kvack.org References: <1469547868-9814-1-git-send-email-hejianet@gmail.com> Cc: linux-kernel@vger.kernel.org, Andrew Morton , Naoya Horiguchi , Mike Kravetz , "Kirill A. Shutemov" , Michal Hocko , Paul Gortmaker From: Dave Hansen Message-ID: <579788BA.1040706@linux.intel.com> Date: Tue, 26 Jul 2016 08:58:50 -0700 User-Agent: Mozilla/5.0 (X11; Linux x86_64; rv:38.0) Gecko/20100101 Thunderbird/38.8.0 MIME-Version: 1.0 In-Reply-To: <1469547868-9814-1-git-send-email-hejianet@gmail.com> Content-Type: text/plain; charset=windows-1252 Content-Transfer-Encoding: 7bit Sender: linux-kernel-owner@vger.kernel.org List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On 07/26/2016 08:44 AM, Jia He wrote: > This patch is to fix such soft lockup. I thouhgt it is safe to call > cond_resched() because alloc_fresh_gigantic_page and alloc_fresh_huge_page > are out of spin_lock/unlock section. Yikes. So the call site for both the things you patch is this: > while (count > persistent_huge_pages(h)) { ... > spin_unlock(&hugetlb_lock); > if (hstate_is_gigantic(h)) > ret = alloc_fresh_gigantic_page(h, nodes_allowed); > else > ret = alloc_fresh_huge_page(h, nodes_allowed); > spin_lock(&hugetlb_lock); and you choose to patch both of the alloc_*() functions. Why not just fix it at the common call site? Seems like that spin_lock(&hugetlb_lock) could be a cond_resched_lock() which would fix both cases. Also, putting that cond_resched() inside the for_each_node*() loop is an odd choice. It seems to indicate that the loops can take a long time, which really isn't the case. The _loop_ isn't long, right?