From mboxrd@z Thu Jan 1 00:00:00 1970 From: David Hildenbrand Subject: Re: blk-mq crash under KVM in multiqueue block code (with virtio-blk and ext4) Date: Wed, 17 Sep 2014 21:09:54 +0200 Message-ID: <20140917210954.6e622fb5@thinkpad-w530> References: <541178D6.6010303@de.ibm.com> <541352ED.7030800@de.ibm.com> <54193F4F.9060508@de.ibm.com> <20140917140034.10125d00@thinkpad-w530> <20140917215226.426f6ce7@tom-ThinkPad-T410> <54199923.9010201@kernel.dk> Mime-Version: 1.0 Content-Type: text/plain; charset="us-ascii" Content-Transfer-Encoding: 7bit Return-path: In-Reply-To: List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Sender: virtualization-bounces@lists.linux-foundation.org Errors-To: virtualization-bounces@lists.linux-foundation.org To: Ming Lei Cc: Jens Axboe , KVM list , "Michael S. Tsirkin" , "linux-kernel@vger.kernel.org >> Linux Kernel Mailing List" , Virtualization List , Christian Borntraeger List-Id: virtualization@lists.linuxfoundation.org > On Wed, Sep 17, 2014 at 10:22 PM, Jens Axboe wrote: > > > > Another way would be to ensure that the timeout handler doesn't touch hw_ctx > > or tag_sets that aren't fully initialized yet. But I think this is > > safer/cleaner. > > That may not be easy or enough to check if hw_ctx/tag_sets are > fully initialized if you mean all requests have been used one time. > > On Wed, Sep 17, 2014 at 10:11 PM, David Hildenbrand > > I was playing with a simple patch that just sets cmd_flags and action_flags to > > What is action_flags? atomic_flags, sorry :) Otherwise e.g. REQ_ATOM_STARTED could already be set due to the randomness. I am not sure if this is really necessary, or if it is completely shielded by the tag-handling code, but seemed to be clean for me to do it (and I remember it not being set within blk_mq_rq_ctx_init). > > > 0. That should already be sufficient to hinder blk_mq_tag_to_rq and the calling > > method to do the wrong thing. > > Yes, clearing rq->cmd_flags should be enough. > > And looks better to move rq initialization to __blk_mq_free_request() > too, otherwise timeout still may see old cmd_flags and rq->q before > rq's new initialization. Yes, __blk_mq_free_request() should also reset at least rq->cmd_flags, and I think we can remove the initialization from __blk_mq_alloc_request(). David > > > Thanks,