From mboxrd@z Thu Jan 1 00:00:00 1970 Return-Path: Received: from mail-qc0-f169.google.com (mail-qc0-f169.google.com [209.85.216.169]) by kanga.kvack.org (Postfix) with ESMTP id ABBF16B0036 for ; Thu, 19 Jun 2014 16:42:05 -0400 (EDT) Received: by mail-qc0-f169.google.com with SMTP id c9so2723339qcz.14 for ; Thu, 19 Jun 2014 13:42:05 -0700 (PDT) Received: from e37.co.us.ibm.com (e37.co.us.ibm.com. [32.97.110.158]) by mx.google.com with ESMTPS id d7si8113986qar.50.2014.06.19.13.42.04 for (version=TLSv1 cipher=RC4-SHA bits=128/128); Thu, 19 Jun 2014 13:42:05 -0700 (PDT) Received: from /spool/local by e37.co.us.ibm.com with IBM ESMTP SMTP Gateway: Authorized Use Only! Violators will be prosecuted for from ; Thu, 19 Jun 2014 14:42:04 -0600 Received: from b03cxnp08026.gho.boulder.ibm.com (b03cxnp08026.gho.boulder.ibm.com [9.17.130.18]) by d03dlp02.boulder.ibm.com (Postfix) with ESMTP id 3881E3E40388 for ; Thu, 19 Jun 2014 14:39:17 -0600 (MDT) Received: from d03av06.boulder.ibm.com (d03av06.boulder.ibm.com [9.17.195.245]) by b03cxnp08026.gho.boulder.ibm.com (8.13.8/8.13.8/NCO v10.0) with ESMTP id s5JKc75F66388166 for ; Thu, 19 Jun 2014 22:38:07 +0200 Received: from d03av06.boulder.ibm.com (loopback [127.0.0.1]) by d03av06.boulder.ibm.com (8.14.4/8.13.1/NCO v10.0 AVout) with ESMTP id s5JKh6R7029865 for ; Thu, 19 Jun 2014 14:43:07 -0600 Date: Thu, 19 Jun 2014 13:39:09 -0700 From: "Paul E. McKenney" Subject: Re: slub/debugobjects: lockup when freeing memory Message-ID: <20140619203909.GI4904@linux.vnet.ibm.com> Reply-To: paulmck@linux.vnet.ibm.com References: <53A2F406.4010109@oracle.com> <20140619165247.GA4904@linux.vnet.ibm.com> <20140619202928.GG4904@linux.vnet.ibm.com> <53A348E6.3050404@oracle.com> MIME-Version: 1.0 Content-Type: text/plain; charset=us-ascii Content-Disposition: inline In-Reply-To: <53A348E6.3050404@oracle.com> Sender: owner-linux-mm@kvack.org List-ID: To: Sasha Levin Cc: Thomas Gleixner , Christoph Lameter , Pekka Enberg , Matt Mackall , Andrew Morton , Dave Jones , "linux-mm@kvack.org" , LKML On Thu, Jun 19, 2014 at 04:32:38PM -0400, Sasha Levin wrote: > On 06/19/2014 04:29 PM, Paul E. McKenney wrote: > > On Thu, Jun 19, 2014 at 09:29:08PM +0200, Thomas Gleixner wrote: > >> > On Thu, 19 Jun 2014, Paul E. McKenney wrote: > >> > > >>> > > On Thu, Jun 19, 2014 at 10:03:04AM -0500, Christoph Lameter wrote: > >>>> > > > On Thu, 19 Jun 2014, Sasha Levin wrote: > >>>> > > > > >>>>> > > > > [ 690.770137] ? __this_cpu_preempt_check (lib/smp_processor_id.c:63) > >>>>> > > > > [ 690.770137] __slab_alloc (mm/slub.c:1732 mm/slub.c:2205 mm/slub.c:2369) > >>>>> > > > > [ 690.770137] ? __lock_acquire (kernel/locking/lockdep.c:3189) > >>>>> > > > > [ 690.770137] ? __debug_object_init (lib/debugobjects.c:100 lib/debugobjects.c:312) > >>>>> > > > > [ 690.770137] kmem_cache_alloc (mm/slub.c:2442 mm/slub.c:2484 mm/slub.c:2489) > >>>>> > > > > [ 690.770137] ? __debug_object_init (lib/debugobjects.c:100 lib/debugobjects.c:312) > >>>>> > > > > [ 690.770137] ? debug_object_activate (lib/debugobjects.c:439) > >>>>> > > > > [ 690.770137] __debug_object_init (lib/debugobjects.c:100 lib/debugobjects.c:312) > >>>>> > > > > [ 690.770137] debug_object_init (lib/debugobjects.c:365) > >>>>> > > > > [ 690.770137] rcuhead_fixup_activate (kernel/rcu/update.c:231) > >>>>> > > > > [ 690.770137] debug_object_activate (lib/debugobjects.c:280 lib/debugobjects.c:439) > >>>>> > > > > [ 690.770137] ? discard_slab (mm/slub.c:1486) > >>>>> > > > > [ 690.770137] __call_rcu (kernel/rcu/rcu.h:76 (discriminator 2) kernel/rcu/tree.c:2585 (discriminator 2)) > >>>> > > > > >>>> > > > __call_rcu does a slab allocation? This means __call_rcu can no longer be > >>>> > > > used in slab allocators? What happened? > >>> > > > >>> > > My guess is that the root cause is a double call_rcu(), call_rcu_sched(), > >>> > > call_rcu_bh(), or call_srcu(). > >>> > > > >>> > > Perhaps the DEBUG_OBJECTS code now allocates memory to report errors? > >>> > > That would be unfortunate... > >> > > >> > Well, no. Look at the callchain: > >> > > >> > __call_rcu > >> > debug_object_activate > >> > rcuhead_fixup_activate > >> > debug_object_init > >> > kmem_cache_alloc > >> > > >> > So call rcu activates the object, but the object has no reference in > >> > the debug objects code so the fixup code is called which inits the > >> > object and allocates a reference .... > > OK, got it. And you are right, call_rcu() has done this for a very > > long time, so not sure what changed. > > It's probable my fault. I've introduced clone() and unshare() fuzzing. > > Those two are full with issues and I've been waiting with enabling those > until the rest of the kernel could survive trinity for more than an hour. Well, that might explain why I haven't seen it in my testing. ;-) Thanx, Paul -- To unsubscribe, send a message with 'unsubscribe linux-mm' in the body to majordomo@kvack.org. For more info on Linux MM, see: http://www.linux-mm.org/ . Don't email: email@kvack.org