From mboxrd@z Thu Jan 1 00:00:00 1970
Return-Path:
Received: from dkim1.fusionio.com ([66.114.96.53]:36410 "EHLO dkim1.fusionio.com"
	rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP
	id S1756263Ab3JRQG3 (ORCPT ); Fri, 18 Oct 2013 12:06:29 -0400
Received: from mx2.fusionio.com (unknown [10.101.1.160])
	by dkim1.fusionio.com (Postfix) with ESMTP id CFF187C06B4
	for ; Fri, 18 Oct 2013 10:06:28 -0600 (MDT)
Date: Fri, 18 Oct 2013 12:06:23 -0400
From: Josef Bacik
To: Sage Weil
CC: Josef Bacik ,
Subject: Re: transaction commit deadlock on current rc
Message-ID: <20131018160623.GE6924@localhost.localdomain>
References: <20131018142554.GD6924@localhost.localdomain>
MIME-Version: 1.0
Content-Type: text/plain; charset="us-ascii"
In-Reply-To:
Sender: linux-btrfs-owner@vger.kernel.org
List-ID:

On Fri, Oct 18, 2013 at 08:42:28AM -0700, Sage Weil wrote:
> On Fri, 18 Oct 2013, Josef Bacik wrote:
> > On Thu, Oct 17, 2013 at 12:56:14PM -0700, Sage Weil wrote:
> > > Hey,
> > >
> > > I'm seeing the deadlock below under a ceph-osd workload. There may be a
> > > subtle problem with the async transaction sequence (since nobody but ceph
> > > uses that that I know of), but not obvious to me why
> > > create_pending_snapshots would get stuck on btrfs_tree_lock...
> > >
> >
> > Can you do sysrq+w when this happens so I can see everybody who's blocked?
> > Thanks,
>
> Oops, forgot to attach the bug link. It's at
>
> 	http://tracker.ceph.com/attachments/download/1035/a
> 	http://tracker.ceph.com/issues/6451
>
> The machine is still hung.. if there is additional info I can gather
> you can ping me on irc.
>

Oops, I'll fix that right up, sorry about that. Thanks,

Josef