e2fsck repeatedly asks to clear the same entry?

linux-ext4.vger.kernel.org archive mirror
 help / color / mirror / Atom feed

* e2fsck repeatedly asks to clear the same entry?
@ 2012-11-23 18:26 Mark Casey
  2012-11-23 19:18 ` Andreas Dilger
  0 siblings, 1 reply; 6+ messages in thread
From: Mark Casey @ 2012-11-23 18:26 UTC (permalink / raw)
  To: linux-ext4

Hello list,

I'm in a bit of a loop trying to fix my ext4 filesystem; it always goes 
like this even after several passes.

> root@host:/home/luser# /root/latest/sbin/e2fsck -f /dev/vgdalr6/lv1
> e2fsck 1.42.5 (29-Jul-2012)
> Pass 1: Checking inodes, blocks, and sizes
> Pass 2: Checking directory structure
> Entry 'A5 11-3' in /share/path/09/Brett/Pines/Flynt's Side Drive - Complete Archive Copy/SA Version Pines/Chris Pics 11-2-10 (268533857) has deleted/unused inode 15115.  Clear<y>? yes
> Pass 3: Checking directory connectivity
> Pass 4: Checking reference counts
> Pass 5: Checking group summary information
>
> /dev/vgdalr6/lv1: ***** FILE SYSTEM WAS MODIFIED *****
> /dev/vgdalr6/lv1: 8993801/268697600 files (0.8% non-contiguous), 2152234742/4299161600 blocks
> root@host:/home/luser#


Would anyone have any suggestions how to proceed?

The cause of this is that I did an unsupported resize (a shrink) by 
commenting out one of resize2fs' checks...as described here: (note I'm 
not claiming this as "permission"; I knew it might not work)

http://article.gmane.org/gmane.comp.file-systems.ext4/35375

resize2fs gave no indication of trouble but the check that followed 
didn't go perfectly. 6 or 7 directory entries needed to be cleared and 
I've restored ~10gb from backup, but otherwise this current issue with 
the directory 'A5 11-3' is the only symptom presenting. I have most of 
the e2fsck log that followed the resize in case that would be of use.

Thank you,
Mark


^ permalink raw reply	[flat|nested] 6+ messages in thread

* Re: e2fsck repeatedly asks to clear the same entry?
  2012-11-23 18:26 e2fsck repeatedly asks to clear the same entry? Mark Casey
@ 2012-11-23 19:18 ` Andreas Dilger
  2012-11-24  6:27   ` Mark Casey
  2012-11-24 17:34   ` Eric Sandeen
  0 siblings, 2 replies; 6+ messages in thread
From: Andreas Dilger @ 2012-11-23 19:18 UTC (permalink / raw)
  To: Mark Casey; +Cc: linux-ext4@vger.kernel.org

On 2012-11-23, at 11:26, Mark Casey <markc@unifiedgroup.com> wrote:
> I'm in a bit of a loop trying to fix my ext4 filesystem; it always goes like this even after several passes.
> 
>> root@host:/home/luser# /root/latest/sbin/e2fsck -f /dev/vgdalr6/lv1
>> e2fsck 1.42.5 (29-Jul-2012)
>> Pass 1: Checking inodes, blocks, and sizes
>> Pass 2: Checking directory structure
>> Entry 'A5 11-3' in /share/path/09/Brett/Pines/Flynt's Side Drive - Complete Archive Copy/SA Version Pines/Chris Pics 11-2-10 (268533857) has deleted/unused inode 15115.  Clear<y>? yes
>> Pass 3: Checking directory connectivity
>> Pass 4: Checking reference counts
>> Pass 5: Checking group summary information
>> 
>> /dev/vgdalr6/lv1: ***** FILE SYSTEM WAS MODIFIED *****
>> /dev/vgdalr6/lv1: 8993801/268697600 files (0.8% non-contiguous), 2152234742/4299161600 blocks
>> root@host:/home/luser#
> 
> 
> Would anyone have any suggestions how to proceed?
> 
> The cause of this is that I did an unsupported resize (a shrink) by commenting out one of resize2fs' checks...as described here: (note I'm not claiming this as "permission"; I knew it might not work)

E2fsck should be able to fix (i.e. get into some consistent state) anything regardless of how it got into that state. 

> http://article.gmane.org/gmane.comp.file-systems.ext4/35375
> 
>  this current issue with the directory 'A5 11-3' is the only symptom presenting. I have most of the e2fsck log that followed the resize in case that would be of use.

It would probably be useful to get information from debugfs for this directory and inode (stats, ls -l <268533857>, stat <15115>, and checki 15115). 

Normally I'd say that getting a e2image of the filesystem would be useful for debugging and to create a test case, but since the filesystem is 16TB in size that won't be practical. 

In the worst case, it should be possible to fix this manually on debugfs either by marking the inode in use in the bitmap (seti 15115) or clear the inode number in the directory entry (on my phone right now and can't check the command for this).

It would be nice to get a test case first, so that e2fsck could be fixed, so if this isn't causing you grief it would be nice to keep this around until there is a chance to understand the problem. 

Cheers, Andreas

^ permalink raw reply	[flat|nested] 6+ messages in thread

* Re: e2fsck repeatedly asks to clear the same entry?
  2012-11-23 19:18 ` Andreas Dilger
@ 2012-11-24  6:27   ` Mark Casey
  2012-11-24 19:17     ` Andreas Dilger
  2012-11-24 17:34   ` Eric Sandeen
  1 sibling, 1 reply; 6+ messages in thread
From: Mark Casey @ 2012-11-24  6:27 UTC (permalink / raw)
  To: linux-ext4

On 11/23/2012 1:18 PM, Andreas Dilger wrote:
> On 2012-11-23, at 11:26, Mark Casey <markc@unifiedgroup.com> wrote:
>> I'm in a bit of a loop trying to fix my ext4 filesystem; it always goes like this even after several passes.
>>
>>> root@host:/home/luser# /root/latest/sbin/e2fsck -f /dev/vgdalr6/lv1
>>> e2fsck 1.42.5 (29-Jul-2012)
>>> Pass 1: Checking inodes, blocks, and sizes
>>> Pass 2: Checking directory structure
>>> Entry 'A5 11-3' in /share/path/09/Brett/Pines/Flynt's Side Drive - Complete Archive Copy/SA Version Pines/Chris Pics 11-2-10 (268533857) has deleted/unused inode 15115.  Clear<y>? yes
>>> Pass 3: Checking directory connectivity
>>> Pass 4: Checking reference counts
>>> Pass 5: Checking group summary information
>>>
>>> /dev/vgdalr6/lv1: ***** FILE SYSTEM WAS MODIFIED *****
>>> /dev/vgdalr6/lv1: 8993801/268697600 files (0.8% non-contiguous), 2152234742/4299161600 blocks
>>> root@host:/home/luser#
>>
>>
>> Would anyone have any suggestions how to proceed?
>>
>> The cause of this is that I did an unsupported resize (a shrink) by commenting out one of resize2fs' checks...as described here: (note I'm not claiming this as "permission"; I knew it might not work)
>
> E2fsck should be able to fix (i.e. get into some consistent state) anything regardless of how it got into that state.
>
>> http://article.gmane.org/gmane.comp.file-systems.ext4/35375
>>
>>   this current issue with the directory 'A5 11-3' is the only symptom presenting. I have most of the e2fsck log that followed the resize in case that would be of use.
>
> It would probably be useful to get information from debugfs for this directory and inode (stats, ls -l <268533857>, stat <15115>, and checki 15115).

I've gathered that info as best I can. Last one didn't want to work. Let 
me know if there's more:

* stats *
Filesystem volume name:   <none>
Last mounted on:          /home
Filesystem UUID:          3652885c-e8c6-4f4d-86a0-a4c1d1784557
Filesystem magic number:  0xEF53
Filesystem revision #:    1 (dynamic)
Filesystem features:      has_journal ext_attr dir_index filetype extent 
64bit flex_bg sparse_super large_file huge_file uninit_bg dir_nlink 
extra_isize
Filesystem flags:         signed_directory_hash
Default mount options:    user_xattr acl
Filesystem state:         clean
Errors behavior:          Continue
Filesystem OS type:       Linux
Inode count:              268697600
Block count:              4299161600
Reserved block count:     10747903
Free blocks:              2146933332
Free inodes:              259703799
First block:              0
Block size:               4096
Fragment size:            4096
Blocks per group:         32768
Fragments per group:      32768
Inodes per group:         2048
Inode blocks per group:   128
RAID stride:              64
RAID stripe width:        576
Flex block group size:    16
Filesystem created:       Sun Sep  9 18:40:39 2012
Last mount time:          Fri Nov 23 11:12:16 2012
Last write time:          Fri Nov 23 12:27:56 2012
Mount count:              0
Maximum mount count:      -1
Last checked:             Fri Nov 23 12:27:56 2012
Check interval:           0 (<none>)
Lifetime writes:          12 TB
Reserved blocks uid:      0 (user root)
Reserved blocks gid:      0 (group root)
First inode:              11
Inode size:               256
Required extra isize:     28
Desired extra isize:      28
Journal inode:            8
Default directory hash:   half_md4
Directory Hash Seed:      94884f6d-8b2e-4830-a33b-02652aee727c
Journal backup:           inode blocks
Directories:              258340

* ls -l *
  268533857   42757 (2)   1000   1000    4096 22-Nov-2012 12:38 .
  268304391   42757 (2)   1000   1000    4096 11-Jan-2011 18:06 ..
   15111   42757 (2)   1000   1000   20480 11-Jan-2011 17:44 Group 1
   15112   42757 (2)   1000   1000    4096 11-Jan-2011 17:44 Group 2
   15113   42757 (2)   1000   1000   16384 11-Jan-2011 17:45 Group 3
   15114   42757 (2)   1000   1000   12288 11-Jan-2011 17:46 Group 4 11-2
   15115   42757 (2)   1000   1000   36864 11-Jan-2011 17:48 Group 5 11-3
   15116   42757 (2)   1000   1000   40960 11-Jan-2011 17:51 Group 6 11-4

* stat *
User:  1000   Group:  1000   Size: 36864
File ACL: 0    Directory ACL: 0
Links: 0   Blockcount: 80
Fragment:  Address: 0    Number: 0    Size: 0
  ctime: 0x50ae708d:cb6e6828 -- Thu Nov 22 12:35:57 2012
  atime: 0x504eda66:cb6e6828 -- Tue Sep 11 01:29:58 2012
  mtime: 0x4d2cec69:00000000 -- Tue Jan 11 17:48:57 2011
crtime: 0x504ed9f5:dab0e61c -- Tue Sep 11 01:28:05 2012
dtime: 0x50af000c -- Thu Nov 22 22:48:12 2012
Size of extra inode fields: 28

>
> Normally I'd say that getting a e2image of the filesystem would be useful for debugging and to create a test case, but since the filesystem is 16TB in size that won't be practical.
>
> In the worst case, it should be possible to fix this manually on debugfs either by marking the inode in use in the bitmap (seti 15115) or clear the inode number in the directory entry (on my phone right now and can't check the command for this).
>
> It would be nice to get a test case first, so that e2fsck could be fixed, so if this isn't causing you grief it would be nice to keep this around until there is a chance to understand the problem.

I'm assuming that with the rest of the fsck coming up clean it is safe 
to use the filesystem in production come Monday (Samba)?

These files are older so I wouldn't mind setting the permissions so that 
no one can get to them for a bit. What would I need to do to get a test 
case going?

>
> Cheers, Andreas--
> To unsubscribe from this list: send the line "unsubscribe linux-ext4" in
> the body of a message to majordomo@vger.kernel.org
> More majordomo info at  http://vger.kernel.org/majordomo-info.html
>

Thank you,
Mark


^ permalink raw reply	[flat|nested] 6+ messages in thread

* Re: e2fsck repeatedly asks to clear the same entry?
  2012-11-24  6:27   ` Mark Casey
@ 2012-11-24 19:17     ` Andreas Dilger
  2012-11-26  3:52       ` Mark Casey
  0 siblings, 1 reply; 6+ messages in thread
From: Andreas Dilger @ 2012-11-24 19:17 UTC (permalink / raw)
  To: Mark Casey; +Cc: linux-ext4

On 2012-11-23, at 23:27, Mark Casey <markc@unifiedgroup.com> wrote:

> On 11/23/2012 1:18 PM, Andreas Dilger wrote:
>> On 2012-11-23, at 11:26, Mark Casey <markc@unifiedgroup.com> wrote:
>>> I'm in a bit of a loop trying to fix my ext4 filesystem; it always goes like this even after several passes.
>>> 
>>>> root@host:/home/luser# /root/latest/sbin/e2fsck -f /dev/vgdalr6/lv1
>>>> e2fsck 1.42.5 (29-Jul-2012)
>>>> Pass 1: Checking inodes, blocks, and sizes
>>>> Pass 2: Checking directory structure
>>>> Entry 'A5 11-3' in /share/path/09/Brett/Pines/Flynt's Side Drive - Complete Archive Copy/SA Version Pines/Chris Pics 11-2-10 (268533857) has deleted/unused inode 15115.  Clear<y>? yes
>>>> Pass 3: Checking directory connectivity
>>>> Pass 4: Checking reference counts
>>>> Pass 5: Checking group summary information
>>>> 
>>>> /dev/vgdalr6/lv1: ***** FILE SYSTEM WAS MODIFIED *****
>>>> /dev/vgdalr6/lv1: 8993801/268697600 files (0.8% non-contiguous), 2152234742/4299161600 blocks
>>>> root@host:/home/luser#
>>> 
>>> 
>>> Would anyone have any suggestions how to proceed?
>>> 
>>> The cause of this is that I did an unsupported resize (a shrink) by commenting out one of resize2fs' checks...as described here: (note I'm not claiming this as "permission"; I knew it might not work)
>> 
>> E2fsck should be able to fix (i.e. get into some consistent state) anything regardless of how it got into that state.
>> 
>>> http://article.gmane.org/gmane.comp.file-systems.ext4/35375
>>> 
>>>  this current issue with the directory 'A5 11-3' is the only symptom presenting. I have most of the e2fsck log that followed the resize in case that would be of use.
>> 
>> It would probably be useful to get information from debugfs for this directory and inode (stats, ls -l <268533857>, stat <15115>, and checki 15115).
> 
> I've gathered that info as best I can. Last one didn't want to work. Let me know if there's more:
> 
> * stats *
> Filesystem volume name:   <none>
> Last mounted on:          /home
> Filesystem UUID:          3652885c-e8c6-4f4d-86a0-a4c1d1784557
> Filesystem magic number:  0xEF53
> Filesystem revision #:    1 (dynamic)
> Filesystem features:      has_journal ext_attr dir_index filetype extent 64bit flex_bg sparse_super large_file huge_file uninit_bg dir_nlink extra_isize
> Filesystem flags:         signed_directory_hash
> Default mount options:    user_xattr acl
> Filesystem state:         clean
> Errors behavior:          Continue
> Filesystem OS type:       Linux
> Inode count:              268697600
> Block count:              4299161600
> Reserved block count:     10747903
> Free blocks:              2146933332
> Free inodes:              259703799
> First block:              0
> Block size:               4096
> Fragment size:            4096
> Blocks per group:         32768
> Fragments per group:      32768
> Inodes per group:         2048
> Inode blocks per group:   128
> RAID stride:              64
> RAID stripe width:        576
> Flex block group size:    16
> Filesystem created:       Sun Sep  9 18:40:39 2012
> Last mount time:          Fri Nov 23 11:12:16 2012
> Last write time:          Fri Nov 23 12:27:56 2012
> Mount count:              0
> Maximum mount count:      -1
> Last checked:             Fri Nov 23 12:27:56 2012
> Check interval:           0 (<none>)
> Lifetime writes:          12 TB
> Reserved blocks uid:      0 (user root)
> Reserved blocks gid:      0 (group root)
> First inode:              11
> Inode size:               256
> Required extra isize:     28
> Desired extra isize:      28
> Journal inode:            8
> Default directory hash:   half_md4
> Directory Hash Seed:      94884f6d-8b2e-4830-a33b-02652aee727c
> Journal backup:           inode blocks
> Directories:              258340
> 
> * ls -l *
> 268533857   42757 (2)   1000   1000    4096 22-Nov-2012 12:38 .
> 268304391   42757 (2)   1000   1000    4096 11-Jan-2011 18:06 ..
>  15111   42757 (2)   1000   1000   20480 11-Jan-2011 17:44 Group 1
>  15112   42757 (2)   1000   1000    4096 11-Jan-2011 17:44 Group 2
>  15113   42757 (2)   1000   1000   16384 11-Jan-2011 17:45 Group 3
>  15114   42757 (2)   1000   1000   12288 11-Jan-2011 17:46 Group 4 11-2
>  15115   42757 (2)   1000   1000   36864 11-Jan-2011 17:48 Group 5 11-3

So this is the problematic entry. The directory entry looks ok, though it doesn't have the same name as e2fsck reports. It claims the entry is "A5 11-3", which is a bit bizarre. 

>  15116   42757 (2)   1000   1000   40960 11-Jan-2011 17:51 Group 6 11-4
> 
> * stat *
> User:  1000   Group:  1000   Size: 36864
> File ACL: 0    Directory ACL: 0
> Links: 0   Blockcount: 80
> Fragment:  Address: 0    Number: 0    Size: 0
> ctime: 0x50ae708d:cb6e6828 -- Thu Nov 22 12:35:57 2012
> atime: 0x504eda66:cb6e6828 -- Tue Sep 11 01:29:58 2012
> mtime: 0x4d2cec69:00000000 -- Tue Jan 11 17:48:57 2011
> crtime: 0x504ed9f5:dab0e61c -- Tue Sep 11 01:28:05 2012
> dtime: 0x50af000c -- Thu Nov 22 22:48:12 2012
> Size of extra inode fields: 28

Both the nlinks here and the dtime show that this inode is deleted, so e2fsck is right in reporting that the directory entry is wrong. 

To fix this manually, you would need to set the inode number for this entry to zero, but I don't recall off the top of my head how to do this. 

>> Normally I'd say that getting a e2image of the filesystem would be useful for debugging and to create a test case, but since the filesystem is 16TB in size that won't be practical.
>> 
>> In the worst case, it should be possible to fix this manually on debugfs either by marking the inode in use in the bitmap (seti 15115) or clear the inode number in the directory entry (on my phone right now and can't check the command for this).
>> 
>> It would be nice to get a test case first, so that e2fsck could be fixed, so if this isn't causing you grief it would be nice to keep this around until there is a chance to understand the problem.
> 
> I'm assuming that with the rest of the fsck coming up clean it is safe to use the filesystem in production come Monday (Samba)?

Yes, though if this directory is accessed it might turn the filesystem read-only. 

> These files are older so I wouldn't mind setting the permissions so that no one can get to them for a bit. What would I need to do to get a test case going?

Just mark the parent directory inaccessible:

# chmod 000 "/share/path/09/Brett/Pines/Flynt's Side Drive - Complete Archive Copy/SA Version Pines/Chris Pics 11-2-10/Group 5 11-3"

Cheers, Andreas

^ permalink raw reply	[flat|nested] 6+ messages in thread

* Re: e2fsck repeatedly asks to clear the same entry?
  2012-11-24 19:17     ` Andreas Dilger
@ 2012-11-26  3:52       ` Mark Casey
  0 siblings, 0 replies; 6+ messages in thread
From: Mark Casey @ 2012-11-26  3:52 UTC (permalink / raw)
  To: linux-ext4

On 11/24/2012 1:17 PM, Andreas Dilger wrote:
>
> So this is the problematic entry. The directory entry looks ok, though it doesn't have the same name as e2fsck reports. It claims the entry is "A5 11-3", which is a bit bizarre.

My fault. I was initially inconsistent in deciding what parts of our 
real tree I wanted online. So there is nothing named "A5 11-3". Should 
not be an issue again; sorry to add complication.

>
> ...
>
> Yes, though if this directory is accessed it might turn the filesystem read-only.

Yep, it does.

>
>> These files are older so I wouldn't mind setting the permissions so that no one can get to them for a bit. What would I need to do to get a test case going?
>
> Just mark the parent directory inaccessible:
>
> # chmod 000 "/share/path/09/Brett/Pines/Flynt's Side Drive - Complete Archive Copy/SA Version Pines/Chris Pics 11-2-10/Group 5 11-3"
>

No problem. Once I've got that done I'll see what I can do with e2image. 
I really appreciate the input so I'd like to do whatever I can if you 
still think it might lead to some sort of bugfix. So far the only 
changes made were to restore the other files that the post-resize fsck 
had to remove/free. Final total was a couple gigs across 7 dirs.

I've looked for any stat differences between the current file tree and 
the one from the backup just before the resize. The only issue found is 
that there appear to be ~200 directories that were not removed by fsck 
but appear to have had their modtimes reset by it instead. The actual 
files contained were untouched. If that is also no big concern then I 
think later tonight, after/if I can get an e2image done, I'll just 
restore their modtimes from the backup to make things pretty again.

Thank you,
Mark

^ permalink raw reply	[flat|nested] 6+ messages in thread

* Re: e2fsck repeatedly asks to clear the same entry?
  2012-11-23 19:18 ` Andreas Dilger
  2012-11-24  6:27   ` Mark Casey
@ 2012-11-24 17:34   ` Eric Sandeen
  1 sibling, 0 replies; 6+ messages in thread
From: Eric Sandeen @ 2012-11-24 17:34 UTC (permalink / raw)
  To: Andreas Dilger; +Cc: Mark Casey, linux-ext4@vger.kernel.org

On 11/23/12 1:18 PM, Andreas Dilger wrote:
> On 2012-11-23, at 11:26, Mark Casey <markc@unifiedgroup.com> wrote:
>> I'm in a bit of a loop trying to fix my ext4 filesystem; it always
>> goes like this even after several passes.

...

> Normally I'd say that getting a e2image of the filesystem would be
> useful for debugging and to create a test case, but since the
> filesystem is 16TB in size that won't be practical.

It might not be that bad.  You could make a raw e2image, mount it,
remove some of the non-affected dir trees, (maybe make another e2image -r
of that modified image), and it might compress pretty well.

Just a thought, if the email debugging doesn't yield results.

-Eric

^ permalink raw reply	[flat|nested] 6+ messages in thread

end of thread, other threads:[~2012-11-26  3:52 UTC | newest]

Thread overview: 6+ messages (download: mbox.gz follow: Atom feed
-- links below jump to the message on this page --
2012-11-23 18:26 e2fsck repeatedly asks to clear the same entry? Mark Casey
2012-11-23 19:18 ` Andreas Dilger
2012-11-24  6:27   ` Mark Casey
2012-11-24 19:17     ` Andreas Dilger
2012-11-26  3:52       ` Mark Casey
2012-11-24 17:34   ` Eric Sandeen

This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).