From mboxrd@z Thu Jan 1 00:00:00 1970
From: James Bottomley
Subject: Re: [PATCH] block: fix oops with block tag queueing
Date: Tue, 26 May 2009 15:58:51 +0000
Message-ID: <1243353531.2815.37.camel@localhost.localdomain>
References: <1242839186.2881.57.camel@localhost.localdomain> <4A14B4A1.5050303@gmail.com>
Mime-Version: 1.0
Content-Type: text/plain
Content-Transfer-Encoding: 7bit
Return-path:
Received: from bedivere.hansenpartnership.com ([66.63.167.143]:50220 "EHLO bedivere.hansenpartnership.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1751327AbZEZP6u (ORCPT ); Tue, 26 May 2009 11:58:50 -0400
In-Reply-To: <4A14B4A1.5050303@gmail.com>
Sender: linux-scsi-owner@vger.kernel.org
List-Id: linux-scsi@vger.kernel.org
To: Tejun Heo
Cc: Jens Axboe, linux-scsi

On Thu, 2009-05-21 at 10:55 +0900, Tejun Heo wrote:
> James Bottomley wrote:
> > commit e8939a50466fd963eb1ba9118c34b9ffb7ff6aa6
> > Author: Tejun Heo
> > Date: Fri May 8 11:54:16 2009 +0900
> >
> >     block: implement and enforce request peek/start/fetch
> >
> > Added a BUG_ON(blk_queued_rq(req)) to the top of blk_finish_req().
> > Unfortunately, this checks whether req->queuelist is empty. This list
> > is doing double duty both as the queue list and the tag list, so tagged
> > requests come in here with this not empty and boom (the tag list is
> > emptied by blk_queue_end_tag() lower down).
> >
> > Fix this by moving the BUG_ON to below the end tag we also seem
> > vulnerable to this in blk_requeue_request() as well. I think all uses
> > of blk_queued_rq() need auditing because the check is clearly wrong in
> > the tagged case.
> >
> > Signed-off-by: James Bottomley
>
> Oops,
>
> Acked-by: Tejun Heo
>
> There are also some drivers which use queuelist for internal purposes
> after dequeueing, which also screws up blk_queued_rq() test in
> addition to being questionable practice to begin with. Maybe we would
> be better off with a flag?

Either is fine by me ... could we get some fix in, please?
I'm currently carrying this below the merge-base on the SCSI postmerge
tree to prevent my main build server oopsing under SCSI testing ... I'm
a bit surprised we haven't had more reports from linux-oops ... but you
can bet that if Jens moves libata to generic tag use, that will change
...

James
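
For anyone following the thread, the failure described in the quoted commit
message can be sketched in user-space C. This is only an illustration under
stated assumptions, not kernel code: struct toy_request, its on_queue field,
and toy_check_fires() are hypothetical names, and the small list helpers
merely imitate the kernel's struct list_head. The sketch shows one link
serving as both queue link and tag-list link, why an "is the list empty?"
test fires for a tagged request before the tag is released, and how either
proposed fix (checking after the end tag, or using a separate flag) avoids it.

```c
#include <assert.h>
#include <stdbool.h>

/* Minimal circular doubly linked list, imitating the kernel's list_head. */
struct list_head {
	struct list_head *next, *prev;
};

static void init_list_head(struct list_head *h)
{
	h->next = h;
	h->prev = h;
}

static bool list_is_empty(const struct list_head *h)
{
	return h->next == h;
}

static void list_add_node(struct list_head *n, struct list_head *head)
{
	n->next = head->next;
	n->prev = head;
	head->next->prev = n;
	head->next = n;
}

/* Unlink the node and reinitialise it so it reads as "empty" again. */
static void list_del_node(struct list_head *n)
{
	n->prev->next = n->next;
	n->next->prev = n->prev;
	init_list_head(n);
}

/* Hypothetical stand-in for struct request; only the fields needed here. */
struct toy_request {
	struct list_head queuelist;	/* queue link AND tag-list link */
	bool on_queue;			/* hypothetical flag, not a kernel field */
};

/*
 * Walk one request from queueing to completion and report whether a
 * "still queued?" assertion at completion time would fire.
 *   tagged:        the driver uses block-layer tagging
 *   end_tag_first: the check runs after the tag has been released
 *   use_flag:      test the hypothetical flag instead of the list
 */
static bool toy_check_fires(bool tagged, bool end_tag_first, bool use_flag)
{
	struct list_head dispatch_queue, tag_busy_list;
	struct toy_request rq;

	init_list_head(&dispatch_queue);
	init_list_head(&tag_busy_list);
	init_list_head(&rq.queuelist);

	/* The request sits on the dispatch queue. */
	list_add_node(&rq.queuelist, &dispatch_queue);
	rq.on_queue = true;

	/* The driver dequeues it ... */
	list_del_node(&rq.queuelist);
	rq.on_queue = false;
	assert(list_is_empty(&dispatch_queue));

	/* ... and, if tagged, the same link is reused for the tag list. */
	if (tagged)
		list_add_node(&rq.queuelist, &tag_busy_list);

	/* Completion: optionally release the tag before checking. */
	if (tagged && end_tag_first)
		list_del_node(&rq.queuelist);

	return use_flag ? rq.on_queue : !list_is_empty(&rq.queuelist);
}
```

Calling toy_check_fires(true, false, false) models the reported oops: the
request is tagged, the list-based check runs before the tag is released, and
the check fires even though the request was dequeued long ago. With
end_tag_first or use_flag set, the same request passes.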