Date: Mon, 18 Jul 2016 15:53:10 +0200
From: Oleg Nesterov
To: Shayan Pooya
Cc: Konstantin Khlebnikov, Michal Hocko, Konstantin Khlebnikov, cgroups mailinglist, LKML, linux-mm@kvack.org
Subject: Re: bug in memcg oom-killer results in a hung syscall in another process in the same cgroup
Message-ID: <20160718135309.GC25380@redhat.com>
References: <20160711064150.GB5284@dhcp22.suse.cz> <20160712071927.GD14586@dhcp22.suse.cz> <57851224.2020902@yandex-team.ru> <20160714132258.GA1333@redhat.com>

On 07/15, Shayan Pooya wrote:
>
> >> --- x/kernel/sched/core.c
> >> +++ x/kernel/sched/core.c
> >> @@ -2793,8 +2793,11 @@ asmlinkage __visible void schedule_tail(struct task_struct *prev)
> >>  	balance_callback(rq);
> >>  	preempt_enable();
> >>
> >> -	if (current->set_child_tid)
> >> +	if (current->set_child_tid) {
> >> +		mem_cgroup_oom_enable();
> >>  		put_user(task_pid_vnr(current), current->set_child_tid);
> >> +		mem_cgroup_oom_disable();
> >> +	}
> >>  }
> >>
> >>  /*
>
> I tried this patch and I still see the same stuck processes (assuming
> that's what you were curious about).

Of course. Because I am stupid. Firstly, I forgot to include another change
in fault.c. And now I see that change was wrong anyway.

I'll try to make another debugging patch later today, but let me repeat that
it won't fix the real problem anyway.

Thanks, and sorry for wasting your time.

Oleg.