From mboxrd@z Thu Jan  1 00:00:00 1970
Return-Path: <xfs-bounce@oss.sgi.com>
Received: with ECARTIS (v1.0.0; list xfs); Thu, 20 Dec 2007 22:37:17 -0800 (PST)
Received: from larry.melbourne.sgi.com (larry.melbourne.sgi.com [134.14.52.130])
	by oss.sgi.com (8.12.11.20060308/8.12.11/SuSE Linux 0.7) with SMTP id lBL6bBYj012560
	for <xfs@oss.sgi.com>; Thu, 20 Dec 2007 22:37:12 -0800
Message-ID: <476B5FE4.9030405@sgi.com>
Date: Fri, 21 Dec 2007 17:40:36 +1100
From: Timothy Shimmin <tes@sgi.com>
MIME-Version: 1.0
Subject: Re: mount prob: "log inconsistent or not a log"
References: <20071220000144.GQ19770@msoe.edu> <4769BD13.5040303@sgi.com> <20071220011848.GV19770@msoe.edu> <20071220015425.GL4612@sgi.com> <20071220024453.GX19770@msoe.edu> <20071220175512.GA6023@msoe.edu> <476B0D29.4050504@sgi.com> <20071221015533.GH6476@msoe.edu>
In-Reply-To: <20071221015533.GH6476@msoe.edu>
Content-Type: text/plain; charset=ISO-8859-1; format=flowed
Content-Transfer-Encoding: 7bit
Sender: xfs-bounce@oss.sgi.com
Errors-to: xfs-bounce@oss.sgi.com
List-Id: xfs
To: "Jonathan C. Detert" <Jonathan.Detert@msoe.edu>
Cc: xfs@oss.sgi.com

Jonathan C. Detert wrote:
> * Timothy Shimmin <tes@sgi.com> [071220 18:49]:
>> Jonathan C. Detert wrote:
>>> I just want to clarify - I mean to ask:
>>>
>>> 	Is it possible that an xfs filesystem which is in use by an o.s.,
>>> 	last mounted by the o.s. months ago, could have a corrupted log, and
>>> 	yet keep functioning until such time as a remount of it is attempted?
>>>
>> Yes.
>> The log is read on every mount. I don't believe we read it otherwise.
>> On a mount we look for the log head and an unmount record etc. to decide
>> whether to do a replay or not.
>> So it could be corrupted and we would not know and we don't really care
>> until the next mount.
>> However, if we are modifying metadata in the filesystem (with some 
>> exceptions)
>> then we will be continuing to write to the log and we will continue to
>> wrap the log. And hence if someone corrupted the log at a point in the past,
>> we may very well be able to overwrite the log and effectively lose that
>> corruption.
>>
>> Do you understand the basic idea?
> 
> I think so.
> 
>> We just write to the log while the file-system is mounted and we are writing
>> to the filesystem (particularly changing metadata); and
>> we read from the log during (just prior) mounting of the filesystem.
>>
>>> 	I.e. if a log gets corrupted while the f.s. is in use, will anyone
>>> 	notice, until such time as an attempt to remount the f.s. is made?
>> No I don't think anyone will notice.
>> No-one has cause to read it, so noone cares.
>>
>> If you unmount it cleanly. Run repair on it before mounting it again.
>> Then repair may find corruption when it tries to find the log head etc...
>> to work out if the log is clean or not.
>> i.e. in this case repair will try to read the log before you've tried
>> to do a mount.
> 
> So, how do you recover from a situation like i have, namely:
> 
> 1) the xfs file system is not currently mounted
> 2) an attempt to mount the xfs fs fails, complaining about log being
>    inconsistent or not a log
> 
> You say noone cares if the log is corrupt, but it seems to be a
> fatal problem if you want to mount the fs.
> 
Noone cares until you need to read it and it is still corrupt
and then you care when you want to mount it.
So yes, you care ;-)

> Bottom line: how do you recover when trying to remount an xfs fs whose
> log happens to be corrupt?

Without doing anything fancy, you are stuffed.
You need to clear the log and repair it using "xfs_repair -L"
(the -L will zero the log and for linux write a log record in there).

I was wondering that if only the end of the log was corrupted and that
was not between the tail and the head, then one could write
0x00000255 (597 dec) at the start of each sector (which is the previous
cycle#) where the corruption is.
Then the recovery code would likely think that this was old
data as it has the old cycle# (and hence ignore it).
However, it is xmas breakup day and I'm too tired to try it out
at the moment. :)
Was thinking of something like:
xfs_io -c 'pwrite -b 512 -S 0x00000255 38889s 31s' sdb-xfs-log.fixup
But that's not right. Sorry.
And then I'm still not convinced about the tail numbers in many
of the log records (particularly the head) anyway.

--Tim