From mboxrd@z Thu Jan 1 00:00:00 1970 From: Heiko Carstens Subject: Re: [GIT PULL] Queue free fix (was Re: [PATCH] block: Free queue resources at blk_release_queue()) Date: Mon, 7 Nov 2011 17:43:51 +0100 Message-ID: <20111107164351.GA2483@osiris.boeblingen.de.ibm.com> References: <20111018154542.GB3869@osiris.boeblingen.de.ibm.com> <1318955380.5169.15.camel@dabdike.int.hansenpartnership.com> <20111031100557.GA2621@osiris.boeblingen.de.ibm.com> <1320057746.2964.1.camel@dabdike> <4EAE8A7E.8000504@ce.jp.nec.com> <20111031130004.GB4768@osiris.boeblingen.de.ibm.com> <20111103182548.GA12131@redhat.com> <20111104091936.GB2397@osiris.boeblingen.de.ibm.com> <4EB7C159.8020009@ce.jp.nec.com> <20111107153649.GA9935@redhat.com> Mime-Version: 1.0 Content-Type: text/plain; charset=us-ascii Return-path: Content-Disposition: inline In-Reply-To: <20111107153649.GA9935@redhat.com> Sender: linux-kernel-owner@vger.kernel.org To: Mike Snitzer Cc: Jun'ichi Nomura , James Bottomley , Steffen Maier , "linux-scsi@vger.kernel.org" , Jens Axboe , Hannes Reinecke , Linux Kernel , Alan Stern , Thadeu Lima de Souza Cascardo , "Taraka R. Bodireddy" , "Seshagiri N. Ippili" , "Manvanthara B. Puttashankar" , Jeff Moyer , Shaohua Li , gmuelas@de.ibm.com List-Id: linux-scsi@vger.kernel.org On Mon, Nov 07, 2011 at 10:36:49AM -0500, Mike Snitzer wrote: > > struct pgpath is freed before dm_target when tearing down dm table. > > So if the problematic completion was being done after freeing pgpath > > but before freeing dm_target, crash would look like that > > and what's happening seems the same for these dm crashes: > > dm table was somehow destroyed while I/O was in-flight. > > Could be the block layer's onstack plugging changes are at the heart of > this. > > I voiced onstack plugging concerns relative to DM some time ago > (https://lkml.org/lkml/2011/3/9/450) but somehow convinced myself DM was > fine to no longer need dm_table_unplug_all() etc. Unfortunately I > cannot recall _why_ I felt that was the case. > > So DM needs further review relative to block's onstack plugging changes > and DM IO completion. > > > It's interesting that your test started to crash in dm with v3.0.7. > > Have you gotten these dm crashes with v3.0.6 or before? > > Have you hit the initially-reported scsi oops with v3.0.7? > > Are your v3.0.6 and v3.0.7 compiled with same config and the tests > > ran on same system? > > If all 3.0.x fail: it would be interesting to know if 2.6.39 (which > introduced the onstack plugging) also has these problems. At least 2.6.39.1 failed as well, as reported already in June. At least the crash looked similar back then: http://marc.info/?l=linux-scsi&m=130824013229933&w=2 > Testing with > 2.6.38 would be very insightful because it obviously doesn't have any of > the onstack plugging churn. Ok, we might try 2.6.38.8 (latest stable) and see what happens.