public inbox for linux-xfs@vger.kernel.org
 help / color / mirror / Atom feed
* Corrupted file system
@ 2014-09-23 22:02 Diane Trout
  2014-09-23 22:12 ` Sean Caron
  2014-09-24 10:08 ` Emmanuel Florac
  0 siblings, 2 replies; 9+ messages in thread
From: Diane Trout @ 2014-09-23 22:02 UTC (permalink / raw)
  To: xfs

Hi,

I had a raid failure at work that ended up corrupting an xfs filesystem the 
tail of the xfs_repair command looks like the below. I was able to generate a 
metadata dump but is there a point to making it available? 

It does crash repeatedly at the same place

(I'm not subscribed to the list, so could you reply directly as well?)

disconnected inode 15276, moving to lost+found
disconnected inode 15277, moving to lost+found
disconnected inode 15278, moving to lost+found
disconnected inode 15279, moving to lost+found
disconnected inode 15280, moving to lost+found
disconnected inode 15281, moving to lost+found
disconnected inode 15282, moving to lost+found
disconnected inode 15283, moving to lost+found
disconnected inode 15284, moving to lost+found
disconnected inode 15286, moving to lost+found
disconnected inode 15287, moving to lost+found
disconnected inode 15288, moving to lost+found
disconnected inode 15289, moving to lost+found
disconnected inode 15290, moving to lost+found
disconnected inode 15291, moving to lost+found
disconnected inode 15292, moving to lost+found
disconnected inode 15293, moving to lost+found
disconnected inode 15294, moving to lost+found
disconnected inode 15295, moving to lost+found
disconnected inode 15360, moving to lost+found
corrupt dinode 15360, extent total = 1, nblocks = 0.  This is a bug.
Please capture the filesystem metadata with xfs_metadump and
report it to xfs@oss.sgi.com.
cache_node_purge: refcount was 1, not zero (node=0x7f369883e6f0)

fatal error -- 117 - couldn't iget disconnected inode


_______________________________________________
xfs mailing list
xfs@oss.sgi.com
http://oss.sgi.com/mailman/listinfo/xfs

^ permalink raw reply	[flat|nested] 9+ messages in thread

* Re: Corrupted file system
  2014-09-23 22:02 Corrupted file system Diane Trout
@ 2014-09-23 22:12 ` Sean Caron
  2014-09-23 22:29   ` Diane Trout
  2014-09-24 10:08 ` Emmanuel Florac
  1 sibling, 1 reply; 9+ messages in thread
From: Sean Caron @ 2014-09-23 22:12 UTC (permalink / raw)
  To: Diane Trout, Sean Caron; +Cc: xfs@oss.sgi.com


[-- Attachment #1.1: Type: text/plain, Size: 2361 bytes --]

Hi Diane,

Probably best to reformat and restore from a backup at this point. I'm
notorious for my views on xfs_repair but most folks here will agree that in
the case where there has been a major underlying array failure, there is
not much it can do to help. Better to just mount ro,noreplaylog and get
what you can in these sorts of scenarios, IMO.

"Did you try the latest copy of xfs_repair"? Sometimes it will get further
or not crash, but it likely will still maul whatever's left of your
filesystem.

Best of luck,

Sean


On Tue, Sep 23, 2014 at 6:02 PM, Diane Trout <diane@caltech.edu> wrote:

> Hi,
>
> I had a raid failure at work that ended up corrupting an xfs filesystem the
> tail of the xfs_repair command looks like the below. I was able to
> generate a
> metadata dump but is there a point to making it available?
>
> It does crash repeatedly at the same place
>
> (I'm not subscribed to the list, so could you reply directly as well?)
>
> disconnected inode 15276, moving to lost+found
> disconnected inode 15277, moving to lost+found
> disconnected inode 15278, moving to lost+found
> disconnected inode 15279, moving to lost+found
> disconnected inode 15280, moving to lost+found
> disconnected inode 15281, moving to lost+found
> disconnected inode 15282, moving to lost+found
> disconnected inode 15283, moving to lost+found
> disconnected inode 15284, moving to lost+found
> disconnected inode 15286, moving to lost+found
> disconnected inode 15287, moving to lost+found
> disconnected inode 15288, moving to lost+found
> disconnected inode 15289, moving to lost+found
> disconnected inode 15290, moving to lost+found
> disconnected inode 15291, moving to lost+found
> disconnected inode 15292, moving to lost+found
> disconnected inode 15293, moving to lost+found
> disconnected inode 15294, moving to lost+found
> disconnected inode 15295, moving to lost+found
> disconnected inode 15360, moving to lost+found
> corrupt dinode 15360, extent total = 1, nblocks = 0.  This is a bug.
> Please capture the filesystem metadata with xfs_metadump and
> report it to xfs@oss.sgi.com.
> cache_node_purge: refcount was 1, not zero (node=0x7f369883e6f0)
>
> fatal error -- 117 - couldn't iget disconnected inode
>
>
> _______________________________________________
> xfs mailing list
> xfs@oss.sgi.com
> http://oss.sgi.com/mailman/listinfo/xfs
>

[-- Attachment #1.2: Type: text/html, Size: 3063 bytes --]

[-- Attachment #2: Type: text/plain, Size: 121 bytes --]

_______________________________________________
xfs mailing list
xfs@oss.sgi.com
http://oss.sgi.com/mailman/listinfo/xfs

^ permalink raw reply	[flat|nested] 9+ messages in thread

* Re: Corrupted file system
  2014-09-23 22:12 ` Sean Caron
@ 2014-09-23 22:29   ` Diane Trout
  2014-09-23 23:12     ` Eric Sandeen
  0 siblings, 1 reply; 9+ messages in thread
From: Diane Trout @ 2014-09-23 22:29 UTC (permalink / raw)
  To: Sean Caron; +Cc: xfs@oss.sgi.com

Hello,

Yes, I had my doubts about that there's anything reasonable one could do after 
part of the file system was randomized. I'll trying updating to xfsprogs 3.2.1 
and see if that does any better.

Thank you for the advice.

Diane

On Tuesday, September 23, 2014 18:12:22 Sean Caron wrote:
> Hi Diane,
> 
> Probably best to reformat and restore from a backup at this point. I'm
> notorious for my views on xfs_repair but most folks here will agree that in
> the case where there has been a major underlying array failure, there is
> not much it can do to help. Better to just mount ro,noreplaylog and get
> what you can in these sorts of scenarios, IMO.
> 
> "Did you try the latest copy of xfs_repair"? Sometimes it will get further
> or not crash, but it likely will still maul whatever's left of your
> filesystem.
> 
> Best of luck,
> 
> Sean
> 
> On Tue, Sep 23, 2014 at 6:02 PM, Diane Trout <diane@caltech.edu> wrote:
> > Hi,
> > 
> > I had a raid failure at work that ended up corrupting an xfs filesystem
> > the
> > tail of the xfs_repair command looks like the below. I was able to
> > generate a
> > metadata dump but is there a point to making it available?
> > 
> > It does crash repeatedly at the same place
> > 
> > (I'm not subscribed to the list, so could you reply directly as well?)
> > 
> > disconnected inode 15276, moving to lost+found
> > disconnected inode 15277, moving to lost+found
> > disconnected inode 15278, moving to lost+found
> > disconnected inode 15279, moving to lost+found
> > disconnected inode 15280, moving to lost+found
> > disconnected inode 15281, moving to lost+found
> > disconnected inode 15282, moving to lost+found
> > disconnected inode 15283, moving to lost+found
> > disconnected inode 15284, moving to lost+found
> > disconnected inode 15286, moving to lost+found
> > disconnected inode 15287, moving to lost+found
> > disconnected inode 15288, moving to lost+found
> > disconnected inode 15289, moving to lost+found
> > disconnected inode 15290, moving to lost+found
> > disconnected inode 15291, moving to lost+found
> > disconnected inode 15292, moving to lost+found
> > disconnected inode 15293, moving to lost+found
> > disconnected inode 15294, moving to lost+found
> > disconnected inode 15295, moving to lost+found
> > disconnected inode 15360, moving to lost+found
> > corrupt dinode 15360, extent total = 1, nblocks = 0.  This is a bug.
> > Please capture the filesystem metadata with xfs_metadump and
> > report it to xfs@oss.sgi.com.
> > cache_node_purge: refcount was 1, not zero (node=0x7f369883e6f0)
> > 
> > fatal error -- 117 - couldn't iget disconnected inode
> > 
> > 
> > _______________________________________________
> > xfs mailing list
> > xfs@oss.sgi.com
> > http://oss.sgi.com/mailman/listinfo/xfs

_______________________________________________
xfs mailing list
xfs@oss.sgi.com
http://oss.sgi.com/mailman/listinfo/xfs

^ permalink raw reply	[flat|nested] 9+ messages in thread

* Re: Corrupted file system
  2014-09-23 22:29   ` Diane Trout
@ 2014-09-23 23:12     ` Eric Sandeen
  0 siblings, 0 replies; 9+ messages in thread
From: Eric Sandeen @ 2014-09-23 23:12 UTC (permalink / raw)
  To: Diane Trout, Sean Caron; +Cc: xfs@oss.sgi.com

On 9/23/14 5:29 PM, Diane Trout wrote:
> Hello,
> 
> Yes, I had my doubts about that there's anything reasonable one could do after 
> part of the file system was randomized. I'll trying updating to xfsprogs 3.2.1 
> and see if that does any better.

Yes - you didn't mention which version you used, but I'd definitely try something
current.  The bug rings a bell, maybe fixed now.

To be safe, you can always make an xfs_metadump image (use -o to not obfuscate
filenames for convenience), xfs_mdrestore that, and point repair at that fs image.
Then you'll see what it *would* do before you *do* do.  ;)

But yeah, depending on the damage done to the storage, you may be in trouble
that repair can't help with.  Depends on the failure, though.

-Eric

_______________________________________________
xfs mailing list
xfs@oss.sgi.com
http://oss.sgi.com/mailman/listinfo/xfs

^ permalink raw reply	[flat|nested] 9+ messages in thread

* Re: Corrupted file system
  2014-09-23 22:02 Corrupted file system Diane Trout
  2014-09-23 22:12 ` Sean Caron
@ 2014-09-24 10:08 ` Emmanuel Florac
  2014-09-24 20:38   ` Diane Trout
  1 sibling, 1 reply; 9+ messages in thread
From: Emmanuel Florac @ 2014-09-24 10:08 UTC (permalink / raw)
  To: Diane Trout; +Cc: xfs

Le Tue, 23 Sep 2014 15:02:57 -0700
Diane Trout <diane@caltech.edu> écrivait:

> I had a raid failure at work that ended up corrupting an xfs
> filesystem the tail of the xfs_repair command looks like the below. I
> was able to generate a metadata dump but is there a point to making
> it available? 
> 
> It does crash repeatedly at the same place

Did you try the very latest xfs_repair?

-- 
------------------------------------------------------------------------
Emmanuel Florac     |   Direction technique
                    |   Intellique
                    |	<eflorac@intellique.com>
                    |   +33 1 78 94 84 02
------------------------------------------------------------------------

_______________________________________________
xfs mailing list
xfs@oss.sgi.com
http://oss.sgi.com/mailman/listinfo/xfs

^ permalink raw reply	[flat|nested] 9+ messages in thread

* Re: Corrupted file system
  2014-09-24 10:08 ` Emmanuel Florac
@ 2014-09-24 20:38   ` Diane Trout
  2014-09-24 20:57     ` Eric Sandeen
  0 siblings, 1 reply; 9+ messages in thread
From: Diane Trout @ 2014-09-24 20:38 UTC (permalink / raw)
  To: Emmanuel Florac; +Cc: xfs

On Wednesday, September 24, 2014 12:08:40 Emmanuel Florac wrote:
> Le Tue, 23 Sep 2014 15:02:57 -0700
> 
> Diane Trout <diane@caltech.edu> écrivait:
> > I had a raid failure at work that ended up corrupting an xfs
> > filesystem the tail of the xfs_repair command looks like the below. I
> > was able to generate a metadata dump but is there a point to making
> > it available?
> > 
> > It does crash repeatedly at the same place
> 
> Did you try the very latest xfs_repair?


I grabbed the one out of debian unstable (3.2.1), verified that it was your 
latest release and used that. I didn't try building the version in git.

It did get much further than version 3.1.7+b1 (debian stable)

Diane

Phase 6 - check inode connectivity...
        - resetting contents of realtime bitmap and summary inodes
        - traversing filesystem ...
entry "lymphoblastoid" in dir ino 256 doesn't have a .. entry, will set it in 
ino 2051.
bad hash table for directory inode 256 (no data entry): rebuilding
rebuilding directory inode 256
7f70c79c8700: Badness in key lookup (length)
bp=(bno 0x3480, len 4096 bytes) key=(bno 0x3480, len 8192 bytes)
Metadata corruption detected at block 0x4000070/0x1000
xfs_da_do_buf(2): XFS_CORRUPTION_ERROR

fatal error -- can't read block 8388608 for directory inode 259, error 117


_______________________________________________
xfs mailing list
xfs@oss.sgi.com
http://oss.sgi.com/mailman/listinfo/xfs

^ permalink raw reply	[flat|nested] 9+ messages in thread

* Re: Corrupted file system
  2014-09-24 20:38   ` Diane Trout
@ 2014-09-24 20:57     ` Eric Sandeen
  2014-09-25  0:25       ` Diane Trout
  0 siblings, 1 reply; 9+ messages in thread
From: Eric Sandeen @ 2014-09-24 20:57 UTC (permalink / raw)
  To: Diane Trout; +Cc: xfs-oss

If you want to make a metadata image, compress it, and send it my way I'd be glad
to look at it - I've been doing some repair work lately.

-Eric

On 9/24/14 3:38 PM, Diane Trout wrote:
> On Wednesday, September 24, 2014 12:08:40 Emmanuel Florac wrote:
>> Le Tue, 23 Sep 2014 15:02:57 -0700
>>
>> Diane Trout <diane@caltech.edu> écrivait:
>>> I had a raid failure at work that ended up corrupting an xfs
>>> filesystem the tail of the xfs_repair command looks like the below. I
>>> was able to generate a metadata dump but is there a point to making
>>> it available?
>>>
>>> It does crash repeatedly at the same place
>>
>> Did you try the very latest xfs_repair?
> 
> 
> I grabbed the one out of debian unstable (3.2.1), verified that it was your 
> latest release and used that. I didn't try building the version in git.
> 
> It did get much further than version 3.1.7+b1 (debian stable)
> 
> Diane
> 
> Phase 6 - check inode connectivity...
>         - resetting contents of realtime bitmap and summary inodes
>         - traversing filesystem ...
> entry "lymphoblastoid" in dir ino 256 doesn't have a .. entry, will set it in 
> ino 2051.
> bad hash table for directory inode 256 (no data entry): rebuilding
> rebuilding directory inode 256
> 7f70c79c8700: Badness in key lookup (length)
> bp=(bno 0x3480, len 4096 bytes) key=(bno 0x3480, len 8192 bytes)
> Metadata corruption detected at block 0x4000070/0x1000
> xfs_da_do_buf(2): XFS_CORRUPTION_ERROR
> 
> fatal error -- can't read block 8388608 for directory inode 259, error 117
> 
> 
> _______________________________________________
> xfs mailing list
> xfs@oss.sgi.com
> http://oss.sgi.com/mailman/listinfo/xfs
> 

_______________________________________________
xfs mailing list
xfs@oss.sgi.com
http://oss.sgi.com/mailman/listinfo/xfs

^ permalink raw reply	[flat|nested] 9+ messages in thread

* Re: Corrupted file system
  2014-09-24 20:57     ` Eric Sandeen
@ 2014-09-25  0:25       ` Diane Trout
  2014-09-25  0:50         ` Eric Sandeen
  0 siblings, 1 reply; 9+ messages in thread
From: Diane Trout @ 2014-09-25  0:25 UTC (permalink / raw)
  To: Eric Sandeen; +Cc: xfs-oss

Huh.

I'd managed to make a metadump using the 3.1.7 version of the tools. Then I 
tried to run xfs_repair on the partition, and now xfs_metadump is segfaulting 
on the partition.

Would the older metadata dump be useful?

I'd need to rebuild xfsprogs before the stack trace would be useful... debian 
likes to strip the symbols and they didn't make a -dbg package.

Diane

On Wednesday, September 24, 2014 15:57:53 Eric Sandeen wrote:
> If you want to make a metadata image, compress it, and send it my way I'd be
> glad to look at it - I've been doing some repair work lately.
> 
> -Eric
> 
> On 9/24/14 3:38 PM, Diane Trout wrote:
> > On Wednesday, September 24, 2014 12:08:40 Emmanuel Florac wrote:
> >> Le Tue, 23 Sep 2014 15:02:57 -0700
> >> 
> >> Diane Trout <diane@caltech.edu> écrivait:
> >>> I had a raid failure at work that ended up corrupting an xfs
> >>> filesystem the tail of the xfs_repair command looks like the below. I
> >>> was able to generate a metadata dump but is there a point to making
> >>> it available?
> >>> 
> >>> It does crash repeatedly at the same place
> >> 
> >> Did you try the very latest xfs_repair?
> > 
> > I grabbed the one out of debian unstable (3.2.1), verified that it was
> > your
> > latest release and used that. I didn't try building the version in git.
> > 
> > It did get much further than version 3.1.7+b1 (debian stable)
> > 
> > Diane
> > 
> > Phase 6 - check inode connectivity...
> > 
> >         - resetting contents of realtime bitmap and summary inodes
> >         - traversing filesystem ...
> > 
> > entry "lymphoblastoid" in dir ino 256 doesn't have a .. entry, will set it
> > in ino 2051.
> > bad hash table for directory inode 256 (no data entry): rebuilding
> > rebuilding directory inode 256
> > 7f70c79c8700: Badness in key lookup (length)
> > bp=(bno 0x3480, len 4096 bytes) key=(bno 0x3480, len 8192 bytes)
> > Metadata corruption detected at block 0x4000070/0x1000
> > xfs_da_do_buf(2): XFS_CORRUPTION_ERROR
> > 
> > fatal error -- can't read block 8388608 for directory inode 259, error 117
> > 
> > 
> > _______________________________________________
> > xfs mailing list
> > xfs@oss.sgi.com
> > http://oss.sgi.com/mailman/listinfo/xfs

_______________________________________________
xfs mailing list
xfs@oss.sgi.com
http://oss.sgi.com/mailman/listinfo/xfs

^ permalink raw reply	[flat|nested] 9+ messages in thread

* Re: Corrupted file system
  2014-09-25  0:25       ` Diane Trout
@ 2014-09-25  0:50         ` Eric Sandeen
  0 siblings, 0 replies; 9+ messages in thread
From: Eric Sandeen @ 2014-09-25  0:50 UTC (permalink / raw)
  To: Diane Trout; +Cc: xfs-oss

On 9/24/14 7:25 PM, Diane Trout wrote:
> Huh.
> 
> I'd managed to make a metadump using the 3.1.7 version of the tools. Then I 
> tried to run xfs_repair on the partition, and now xfs_metadump is segfaulting 
> on the partition.
> 
> Would the older metadata dump be useful?

sure.  Compress it & send me a private note with it attached (or with a location
if it's too big to email)

-Eric

> I'd need to rebuild xfsprogs before the stack trace would be useful... debian 
> likes to strip the symbols and they didn't make a -dbg package.
> 
> Diane
> 
> On Wednesday, September 24, 2014 15:57:53 Eric Sandeen wrote:
>> If you want to make a metadata image, compress it, and send it my way I'd be
>> glad to look at it - I've been doing some repair work lately.
>>
>> -Eric
>>
>> On 9/24/14 3:38 PM, Diane Trout wrote:
>>> On Wednesday, September 24, 2014 12:08:40 Emmanuel Florac wrote:
>>>> Le Tue, 23 Sep 2014 15:02:57 -0700
>>>>
>>>> Diane Trout <diane@caltech.edu> écrivait:
>>>>> I had a raid failure at work that ended up corrupting an xfs
>>>>> filesystem the tail of the xfs_repair command looks like the below. I
>>>>> was able to generate a metadata dump but is there a point to making
>>>>> it available?
>>>>>
>>>>> It does crash repeatedly at the same place
>>>>
>>>> Did you try the very latest xfs_repair?
>>>
>>> I grabbed the one out of debian unstable (3.2.1), verified that it was
>>> your
>>> latest release and used that. I didn't try building the version in git.
>>>
>>> It did get much further than version 3.1.7+b1 (debian stable)
>>>
>>> Diane
>>>
>>> Phase 6 - check inode connectivity...
>>>
>>>         - resetting contents of realtime bitmap and summary inodes
>>>         - traversing filesystem ...
>>>
>>> entry "lymphoblastoid" in dir ino 256 doesn't have a .. entry, will set it
>>> in ino 2051.
>>> bad hash table for directory inode 256 (no data entry): rebuilding
>>> rebuilding directory inode 256
>>> 7f70c79c8700: Badness in key lookup (length)
>>> bp=(bno 0x3480, len 4096 bytes) key=(bno 0x3480, len 8192 bytes)
>>> Metadata corruption detected at block 0x4000070/0x1000
>>> xfs_da_do_buf(2): XFS_CORRUPTION_ERROR
>>>
>>> fatal error -- can't read block 8388608 for directory inode 259, error 117
>>>
>>>
>>> _______________________________________________
>>> xfs mailing list
>>> xfs@oss.sgi.com
>>> http://oss.sgi.com/mailman/listinfo/xfs
> 

_______________________________________________
xfs mailing list
xfs@oss.sgi.com
http://oss.sgi.com/mailman/listinfo/xfs

^ permalink raw reply	[flat|nested] 9+ messages in thread

end of thread, other threads:[~2014-09-25  0:50 UTC | newest]

Thread overview: 9+ messages (download: mbox.gz follow: Atom feed
-- links below jump to the message on this page --
2014-09-23 22:02 Corrupted file system Diane Trout
2014-09-23 22:12 ` Sean Caron
2014-09-23 22:29   ` Diane Trout
2014-09-23 23:12     ` Eric Sandeen
2014-09-24 10:08 ` Emmanuel Florac
2014-09-24 20:38   ` Diane Trout
2014-09-24 20:57     ` Eric Sandeen
2014-09-25  0:25       ` Diane Trout
2014-09-25  0:50         ` Eric Sandeen

This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox