From mboxrd@z Thu Jan 1 00:00:00 1970 Return-Path: Received: from relay.sgi.com (relay1.corp.sgi.com [137.38.102.111]) by oss.sgi.com (Postfix) with ESMTP id CA3D67F4E for ; Wed, 22 Oct 2014 09:16:59 -0500 (CDT) Received: from cuda.sgi.com (cuda2.sgi.com [192.48.176.25]) by relay1.corp.sgi.com (Postfix) with ESMTP id A93278F8035 for ; Wed, 22 Oct 2014 07:16:56 -0700 (PDT) Received: from mail-qg0-f41.google.com (mail-qg0-f41.google.com [209.85.192.41]) by cuda.sgi.com with ESMTP id cWovuZes0d2GwDpE (version=TLSv1 cipher=RC4-SHA bits=128 verify=NO) for ; Wed, 22 Oct 2014 07:16:52 -0700 (PDT) Received: by mail-qg0-f41.google.com with SMTP id a108so2626152qge.28 for ; Wed, 22 Oct 2014 07:16:50 -0700 (PDT) Message-ID: <5447BC4C.3050408@gmail.com> Date: Wed, 22 Oct 2014 10:16:44 -0400 From: "Michael L. Semon" MIME-Version: 1.0 Subject: Re: xfsdump not work in 3.17 References: <543F3C9A.4060603@sandeen.net> In-Reply-To: <543F3C9A.4060603@sandeen.net> List-Id: XFS Filesystem from SGI List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Content-Transfer-Encoding: 7bit Content-Type: text/plain; charset="us-ascii"; Format="flowed" Errors-To: xfs-bounces@oss.sgi.com Sender: xfs-bounces@oss.sgi.com To: Eric Sandeen , xfs On 10/15/14 23:33, Eric Sandeen wrote: > On 10/15/14 9:31 PM, Tommy Wu wrote: >> Hi! >> >> xfsdump 3.1.4 >> xfsprogs 3.2.1 >> linux kerenl 3.17/3.17.1 >> > > ... > >> It just create a small dump file, and if I run the same xfsdump again (or umount the filesystem), it will hang, like: >> >> fw1:/vol/backup/fw1# /sbin/xfsdump -v trace,drive=debug,media=debug -l 0 -o -J -F - /dev/vg/root | gzip > test.gz >> /sbin/xfsdump: using file dump (drive_simple) strategy >> /sbin/xfsdump: version 3.1.4 (dump format 3.0) >> /sbin/xfsdump: level 0 dump of fw1.teatime.com.tw:/ >> /sbin/xfsdump: dump date: Thu Oct 16 10:30:09 2014 >> /sbin/xfsdump: session id: b8354300-d54c-4131-b39c-7c0b63968208 >> /sbin/xfsdump: session label: "" >> /sbin/xfsdump: ino map phase 1: constructing initial dump list >> >> >> switch back to kernel 3.16, the same command work fine. >> > > http://git.kernel.org/cgit/linux/kernel/git/torvalds/linux.git/commit/fs/xfs?id=a8b1ee8bafc765ebf029d03c5479a69aebff9693 > > addresses the small backup file, and > > [PATCH] xfs: bulkstat doesn't release AGI buffer on error > > (on the list) most likely addresses the hang, I think. > > -Eric Thanks! I'm still looking for that one good recipe to fix xfsdump in my i686 Pentium 4 dungeon here, currently using yesterday morning's git kernel + xfs-oss/for-next. The test dataset is a basic slackware-current setup, regular kernel source, -stable kernel source, on v5-superblock/finobt XFS (mkfs.xfs -m crc=1,finobt=1 ...). The dataset uses about 7 GB of disk space. This letter is half-baked thoughts, only here to express the idea "don't think you're out of the woods yet!" in some primitive manner. The first patch seems to get rid of the earliest xfsdump premature SUCCESS, the one where xfsdump ran for less than 10 seconds and left a dump file of less than 1 MB. BTW, in the commit message and to `git log xfs-oss/for-next`, the commit message for the patch starts "caused a regression in xfs_inumbers" but does not mention which commit caused the regression. With the second patch applied, the dump size increases to about 1 decimal GB before exiting, same size in three different runs. I think I tried the patch "xfs: Check error during inode btree iteration in xfs_bulkstat()"--no other similar patches in my mailing list patchset download-- and xfsdump dumped up to 1.2 decimal GB, same size in two different runs. These patches are being run through xfstests as I work, so there's nothing there to report yet. It was only this morning that I got tar to complete a system backup without asserting in some way (hangcheck timer expires, stack varies), and the last oops got into that uncomfortable xfs_dir3_leaf area. Should this happen again, I'll either post some traces or the output of `xfsdump -v 3 ...` I was rushed into work today and couldn't grab the logs. Should `xfsdump -v 3 ...` report SUCCESS for one code and an error for the second return code, that second code has been "unknown error." I've never run xfsdump at -v 3 before and don't know if that is normal. The rest is still being fleshed out. tar seems to be OK, so long as xfsdump has not been invoked beforehand. tar has not been run enough times to get a true 1:1 correlation on it, though. The current goal is to reconstruct the filesystem and see if all problems magically go away. So far, xfs_repair has reported no errors on this filesystem. Thanks! Michael _______________________________________________ xfs mailing list xfs@oss.sgi.com http://oss.sgi.com/mailman/listinfo/xfs