From: Danny ter Haar <dth@dth.net>
To: Christoph Hellwig <hch@infradead.org>, xfs@oss.sgi.com
Subject: Re: problems showing up as XFS problems on kernels after 2.6.28-git2
Date: Fri, 9 Jan 2009 02:26:10 +0100 [thread overview]
Message-ID: <20090109012610.GA23075@dth.net> (raw)
In-Reply-To: <20090109004609.GM9448@disturbed>
Quoting Dave Chinner (david@fromorbit.com):
> Looking at this, I think there are two possibilities in terms of the
> problem being detected. We are modifying the inode BMBT here,
> so that means we have XFS_BTREE_ROOT_IN_INODE set. The corruption
> trigger has occurred because a xfs_btree_increment() call has
> returned a zero status. This means we failed here:
>
> 1324 /* Fail if we just went off the right edge of the tree. */
> 1325 xfs_btree_get_sibling(cur, block, &ptr, XFS_BB_RIGHTSIB);
> 1326 if (xfs_btree_ptr_is_null(cur, &ptr))
> 1327 goto out0;
>
> or here:
>
> 1351 /*
> 1352 * If we went off the root then we are either seriously
> 1353 * confused or have the tree root in an inode.
> 1354 */
> 1355 if (lev == cur->bc_nlevels) {
> 1356 if (cur->bc_flags & XFS_BTREE_ROOT_IN_INODE)
> 1357 goto out0;
> 1358 ASSERT(0);
>
> i.e. we either fell off the right edge of the tree or went over the top
> of it.
> I can't really see how we've done either of those things unless the
> tree has been corrupted by a prior operation.
sounds logical.
First time when it happened i moved the primairy hd to sec ide connector, connected
a seperate hard drive as new master, installed a fresh debian lenny on that
harddrive, ran xfs-repair on all xfs filesystems: no errors
> Given that each time it is aptitude that is causing the problem, can you
> prevent aptitude from running automatically on boot and run it manually?
> If you can reporduce the problem manually then we can move on to the
> next step....
I wasn't clear (obvioulsy)
This machine is besides my NAS also my apt-cacher-ng server for all my other
machines here at home. The easiest way to trigger the error is often by running
a simple "aptitude update; aptitude -d dist-upgrade"
So when it barfed i did the aptitude by hand.
And it checks everything from the cache at /var/cache/apt-cacher-ng
which is on sda6 (root filesystem on XFS)
So it doesn't "barf" right on boot, it takes a few minutes or even hours:
filer1:~# last -20 reboot
reboot system boot 2.6.28-git2-d Thu Jan 8 12:00 - current(05:18)
reboot system boot 2.6.28-git3-d Thu Jan 8 11:31 - 11:59 (00:27)
reboot system boot 2.6.28-git3-d Thu Jan 8 10:56 - 11:59 (01:02)
reboot system boot 2.6.28-git3-d Thu Jan 8 10:44 - 10:54 (00:10)
reboot system boot 2.6.28-git3-d Thu Jan 8 10:30 - 10:43 (00:12)
reboot system boot 2.6.28-git2 Wed Jan 7 15:08 - 10:28 (19:19)
reboot system boot 2.6.28-git9-d Wed Jan 7 12:29 - 14:58 (02:29)
reboot system boot 2.6.28-git2 Wed Jan 7 10:08 - 12:27 (02:19)
reboot system boot 2.6.28-git9 Wed Jan 7 09:21 - 10:06 (00:45)
reboot system boot 2.6.28-git9 Wed Jan 7 08:42 - 10:06 (01:24)
reboot system boot 2.6.28-git2 Tue Jan 6 21:45 - 08:40 (10:55)
reboot system boot 2.6.28-git4 Tue Jan 6 21:27 - 08:40 (11:13)
reboot system boot 2.6.28-git4 Tue Jan 6 21:22 - 08:40 (11:18)
Sometimes the kernel barfes while accessing /dev/sdb1 of /dev/sdc1
which is only accessed using samba.
I can once more install the "other" debian lenny harddrive, boot from there
and than manually do an xfs_repair on xfs filesystems.
I can than boot a kernel that is know to barf and try to get it to barf.
> > So (in my case) something while going from git2 -> git3 didn't go positive.
> That would have been when Linus did the XFS pull...
Do you want me to figure out what patch from git2->git3 is the cullprit ?
I'll have to compile/reboot for a while.
Tell me what else i can do to resolve this.
Danny
--
_______________________________________________
xfs mailing list
xfs@oss.sgi.com
http://oss.sgi.com/mailman/listinfo/xfs
next prev parent reply other threads:[~2009-01-09 1:26 UTC|newest]
Thread overview: 30+ messages / expand[flat|nested] mbox.gz Atom feed top
2009-01-07 16:52 problems showing up as XFS problems on kernels after 2.6.28-git2 dth
2009-01-07 18:02 ` Christoph Hellwig
2009-01-07 18:24 ` Danny ter Haar
2009-01-07 18:31 ` Christoph Hellwig
2009-01-07 18:44 ` Danny ter Haar
2009-01-07 18:52 ` Christoph Hellwig
2009-01-07 22:09 ` Danny ter Haar
2009-01-08 0:38 ` Danny ter Haar
2009-01-07 18:56 ` Christoph Hellwig
2009-01-07 19:01 ` Danny ter Haar
2009-01-08 21:56 ` Danny ter Haar
2009-01-09 0:46 ` Dave Chinner
2009-01-09 1:26 ` Danny ter Haar [this message]
2009-01-09 2:08 ` Dave Chinner
2009-01-09 6:10 ` Danny ter Haar
2009-01-09 19:44 ` Christoph Hellwig
2009-01-09 19:51 ` Danny ter Haar
2009-01-09 19:58 ` Christoph Hellwig
2009-01-09 21:42 ` Danny ter Haar
2009-01-09 22:01 ` Christoph Hellwig
2009-01-09 22:23 ` Danny ter Haar
2009-01-13 20:04 ` Danny ter Haar
2009-01-16 20:43 ` Danny ter Haar
2009-01-17 7:38 ` Dave Chinner
2009-01-17 23:25 ` Danny ter Haar
2009-01-18 2:50 ` Danny ter Haar
2009-01-19 3:17 ` Dave Chinner
2009-01-14 19:42 ` Tino Keitel
2009-01-14 19:44 ` Tino Keitel
2009-01-14 19:44 ` Tino Keitel
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=20090109012610.GA23075@dth.net \
--to=dth@dth.net \
--cc=hch@infradead.org \
--cc=xfs@oss.sgi.com \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.