public inbox for linux-xfs@vger.kernel.org
* problem with latest xfsprogs progress code
@ 2007-01-16 16:30 James Braid
  2007-01-17 11:00 ` James Braid
  0 siblings, 1 reply; 5+ messages in thread
From: James Braid @ 2007-01-16 16:30 UTC
  To: xfs

Running 2.8.18 xfs_repair on a largeish (65TB, ~70M inodes) filesystem on
an x86_64 machine gives the following "progress" output:

12:15:36: process known inodes and inode discovery - 1461632 of 0 inodes done
12:15:36: Phase 3: elapsed time 14 minutes, 32 seconds - processed 100571 inodes per minute
12:15:36: Phase 3: 0% done - estimated remaining time 3364 weeks, 3 days, 7 hours, 30 minutes, 45 seconds

Is this a known bug?

Thanks, James
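
A quick sanity check on the figures above, as a minimal Python sketch. The function names are made up for illustration, and this is not how xfs_repair computes its estimate. It only shows that the reported rate matches the counters, and that a remaining-time estimate has nothing to work with when the reported total is 0. The 70,000,000 total is the approximate inode count mentioned in the message, used here as an assumption.

```python
from typing import Optional


def inodes_per_minute(done: int, elapsed_seconds: float) -> float:
    """Average processing rate over the elapsed time."""
    return done * 60.0 / elapsed_seconds


def eta_minutes(done: int, total: int, elapsed_seconds: float) -> Optional[float]:
    """Remaining minutes at the average rate, or None if the inputs make no sense.

    A total of 0 (or a total smaller than the work already done) means the
    total is unknown, so no estimate is returned instead of a bogus one.
    """
    if total <= 0 or done <= 0 or done > total or elapsed_seconds <= 0:
        return None
    return (total - done) / inodes_per_minute(done, elapsed_seconds)


elapsed = 14 * 60 + 32                       # "14 minutes, 32 seconds"
rate = inodes_per_minute(1461632, elapsed)   # counters from the progress line

print(round(rate))                               # rate implied by the counters
print(eta_minutes(1461632, 0, elapsed))          # total reported as 0
print(eta_minutes(1461632, 70_000_000, elapsed)) # assumed total of ~70M inodes
```

If the rate held steady and the total really were about 70M inodes, this phase would need a bit over 11 hours, not decades.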


* Re: problem with latest xfsprogs progress code
  2007-01-16 16:30 problem with latest xfsprogs progress code James Braid
@ 2007-01-17 11:00 ` James Braid
  2007-01-17 14:08   ` Klaus Strebel
  0 siblings, 1 reply; 5+ messages in thread
From: James Braid @ 2007-01-17 11:00 UTC
  To: xfs

I'm now seeing the following output - it's been sitting at this point for
over 13 hours now... earlier versions of xfs_repair would finish quite a
bit faster. Any ideas what's going on?

        - 03:00:37: traversing filesystem - 0 of 55 allocation groups done
        - 03:15:37: traversing filesystem - 0 of 55 allocation groups done
        - 03:30:37: traversing filesystem - 0 of 55 allocation groups done
        - 03:45:37: traversing filesystem - 0 of 55 allocation groups done
        - 04:00:37: traversing filesystem - 0 of 55 allocation groups done
        - 04:15:37: traversing filesystem - 0 of 55 allocation groups done
        - 04:30:37: traversing filesystem - 0 of 55 allocation groups done
        - 04:45:37: traversing filesystem - 0 of 55 allocation groups done
        - 05:00:37: traversing filesystem - 0 of 55 allocation groups done
        - 05:15:37: traversing filesystem - 0 of 55 allocation groups done
        - 05:30:37: traversing filesystem - 0 of 55 allocation groups done
        - 05:45:37: traversing filesystem - 0 of 55 allocation groups done
        - 06:00:37: traversing filesystem - 0 of 55 allocation groups done
        - 06:15:37: traversing filesystem - 0 of 55 allocation groups done
        - 06:30:37: traversing filesystem - 0 of 55 allocation groups done
        - 06:45:37: traversing filesystem - 0 of 55 allocation groups done
        - 07:00:37: traversing filesystem - 0 of 55 allocation groups done
        - 07:15:37: traversing filesystem - 0 of 55 allocation groups done
        - 07:30:37: traversing filesystem - 0 of 55 allocation groups done
        - 07:45:37: traversing filesystem - 0 of 55 allocation groups done
        - 08:00:37: traversing filesystem - 0 of 55 allocation groups done
        - 08:15:37: traversing filesystem - 0 of 55 allocation groups done
        - 08:30:37: traversing filesystem - 0 of 55 allocation groups done
        - 08:45:37: traversing filesystem - 0 of 55 allocation groups done
        - 09:00:37: traversing filesystem - 0 of 55 allocation groups done
        - 09:15:37: traversing filesystem - 0 of 55 allocation groups done
        - 09:30:37: traversing filesystem - 0 of 55 allocation groups done
        - 09:45:37: traversing filesystem - 0 of 55 allocation groups done
        - 10:00:37: traversing filesystem - 0 of 55 allocation groups done
        - 10:15:37: traversing filesystem - 0 of 55 allocation groups done
        - 10:30:37: traversing filesystem - 0 of 55 allocation groups done
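
A stall like this can be measured from the log itself. The following is a rough Python sketch, not part of xfsprogs: the regular expression is written against the line format shown above, the helper name `stalled_for` is hypothetical, and it assumes all timestamps fall on the same day (no wrap past midnight).

```python
import re
from datetime import datetime, timedelta

# Matches lines such as:
#   - 03:00:37: traversing filesystem - 0 of 55 allocation groups done
LINE_RE = re.compile(
    r"(\d{2}:\d{2}:\d{2}): traversing filesystem - (\d+) of (\d+) allocation groups done"
)


def stalled_for(lines):
    """Return how long the most recent 'done' count has gone unchanged."""
    samples = []
    for line in lines:
        m = LINE_RE.search(line)
        if m:
            stamp = datetime.strptime(m.group(1), "%H:%M:%S")
            samples.append((stamp, int(m.group(2))))
    if not samples:
        return timedelta(0)

    last_time, last_done = samples[-1]
    start = last_time
    # Walk backwards until the counter differs from the latest value.
    for stamp, done in reversed(samples):
        if done != last_done:
            break
        start = stamp
    return last_time - start


log = [
    "        - 03:00:37: traversing filesystem - 0 of 55 allocation groups done",
    "        - 03:15:37: traversing filesystem - 0 of 55 allocation groups done",
    "        - 10:30:37: traversing filesystem - 0 of 55 allocation groups done",
]
print(stalled_for(log))
```

Fed the first and last lines of the excerpt above, it reports the span over which the counter stayed at 0 of 55.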





* Re: problem with latest xfsprogs progress code
  2007-01-17 11:00 ` James Braid
@ 2007-01-17 14:08   ` Klaus Strebel
  2007-01-17 15:54     ` Eric Sandeen
  0 siblings, 1 reply; 5+ messages in thread
From: Klaus Strebel @ 2007-01-17 14:08 UTC
  To: jamesb; +Cc: xfs

James Braid wrote:
>> Running 2.8.18 xfs_repair on a largeish (65TB, ~70M inodes) filesystem on
>> an x86_64 machine gives the following "progress" output:
>>
>> 12:15:36: process known inodes and inode discovery - 1461632 of 0 inodes done
>> 12:15:36: Phase 3: elapsed time 14 minutes, 32 seconds - processed 100571 inodes per minute
>> 12:15:36: Phase 3: 0% done - estimated remaining time 3364 weeks, 3 days, 7 hours, 30 minutes, 45 seconds
>>
>> Is this a known bug?
Hi James,

why do you think that this is a bug? You have an almost infinitely large
filesystem, so the file-system check will also run for an almost
infinitely long time ;-).

You see, not all that's possible is really desirable.

Ciao
Klaus

Btw, I wouldn't expect this xfs_repair run to finish without running out
of memory :-(.

-- 
Best regards

Klaus Strebel, Dipl.-Inform. (FH), mailto:klaus.strebel@gmx.net

/"\
\ /     ASCII RIBBON CAMPAIGN
 X        AGAINST HTML MAIL
/ \


* Re: problem with latest xfsprogs progress code
  2007-01-17 14:08   ` Klaus Strebel
@ 2007-01-17 15:54     ` Eric Sandeen
  2007-01-17 22:51       ` David Chatterton
  0 siblings, 1 reply; 5+ messages in thread
From: Eric Sandeen @ 2007-01-17 15:54 UTC
  To: Klaus Strebel; +Cc: jamesb, xfs

Klaus Strebel wrote:

>>> Running 2.8.18 xfs_repair on a largeish (65TB, ~70M inodes) filesystem on
>>> an x86_64 machine gives the following "progress" output:
>>>
>>> 12:15:36: process known inodes and inode discovery - 1461632 of 0 inodes done
>>> 12:15:36: Phase 3: elapsed time 14 minutes, 32 seconds - processed 100571 inodes per minute
>>> 12:15:36: Phase 3: 0% done - estimated remaining time 3364 weeks, 3 days, 7 hours, 30 minutes, 45 seconds
>>>
>>> Is this a known bug?
> Hi James,
> 
> why do you think that this is a bug? You have an almost infinitely large
> filesystem, so the file-system check will also run for an almost
> infinitely long time ;-).
> 
> You see, not all that's possible is really desirable.

Well, while 65TB is impressive*, and repairing it quickly is indeed a
challenge, it probably still should not take 64+ years.  ;-)

Sounds like something is in fact going wrong.

-Eric

*it amuses me to see xfs users refer to nearly 100T as largeISH; clearly
 you all do not suffer from lowered expectations.  :)
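
The "64+ years" figure follows directly from the reported estimate. A small Python check, assuming an average year of 365.25 days (the constant is a convention chosen here, not something from the xfs_repair output):

```python
from datetime import timedelta

# "3364 weeks, 3 days, 7 hours, 30 minutes, 45 seconds"
remaining = timedelta(weeks=3364, days=3, hours=7, minutes=30, seconds=45)

SECONDS_PER_YEAR = 365.25 * 24 * 3600  # assumed average year length

years = remaining.total_seconds() / SECONDS_PER_YEAR
print(f"{years:.1f} years")
```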


* Re: problem with latest xfsprogs progress code
  2007-01-17 15:54     ` Eric Sandeen
@ 2007-01-17 22:51       ` David Chatterton
  0 siblings, 0 replies; 5+ messages in thread
From: David Chatterton @ 2007-01-17 22:51 UTC
  To: jamesb; +Cc: Eric Sandeen, Klaus Strebel, xfs




Barry is at linux.conf.au this week; he knows this code better than
anyone else.

Phase 3 is scanning the inodes in each allocation group, building up a
map of filesystem blocks that are marked as used.
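
To make the idea concrete, here is a toy Python model of building a used-block map for one allocation group. It is only an illustration under stated assumptions: the function name, the extent representation as (start, length) pairs, and the one-byte-per-block map are all invented here and do not reflect xfs_repair's actual data structures. Recording blocks claimed twice is likewise an addition for illustration.

```python
def mark_used_blocks(ag_block_count, inode_extents):
    """Build a used-block map for one allocation group (toy model).

    inode_extents maps an inode number to a list of (start, length) extents,
    with block numbers relative to the start of the allocation group.
    Returns the map and a list of (inode, block) pairs for blocks that were
    already marked used by an earlier extent.
    """
    used = bytearray(ag_block_count)  # 0 = free, 1 = used
    duplicates = []
    for ino, extents in inode_extents.items():
        for start, length in extents:
            if start < 0 or length < 0 or start + length > ag_block_count:
                raise ValueError(f"inode {ino}: extent outside allocation group")
            for blk in range(start, start + length):
                if used[blk]:
                    duplicates.append((ino, blk))
                used[blk] = 1
    return used, duplicates


# Two hypothetical inodes whose extents overlap at block 1.
used, dups = mark_used_blocks(8, {1: [(0, 2)], 2: [(1, 2)]})
print(list(used), dups)
```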

See http://oss.sgi.com/projects/xfs/training/xfs_slides_11_repair.pdf

Scanning an AG and its inodes should not be taking this long.
Are you under memory pressure, with the machine just swapping to death?
Are you seeing I/O errors on the storage?
Is the storage using AVT mode, with the LUNs flipping between controllers?
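
The first two questions can be checked roughly from a script. The sketch below is an assumption-laden example, not an SGI tool: it reads the `pswpin` and `pswpout` counters from Linux's /proc/vmstat twice to see whether pages are moving to and from swap, and counts lines containing "I/O error" in kernel log text that the caller supplies (for example, saved `dmesg` output). The helper names are hypothetical. Controller failover is array-specific and is not covered.

```python
import time


def parse_vmstat(text):
    """Parse 'name value' lines from /proc/vmstat into a dict of ints."""
    counters = {}
    for line in text.splitlines():
        parts = line.split()
        if len(parts) == 2 and parts[1].isdigit():
            counters[parts[0]] = int(parts[1])
    return counters


def swap_pages_moved(before, after):
    """Return (pages swapped in, pages swapped out) between two samples."""
    return (
        after.get("pswpin", 0) - before.get("pswpin", 0),
        after.get("pswpout", 0) - before.get("pswpout", 0),
    )


def count_io_errors(kernel_log_text):
    """Count kernel log lines that mention an I/O error."""
    return sum(1 for line in kernel_log_text.splitlines() if "I/O error" in line)


if __name__ == "__main__":
    # Sample swap counters ten seconds apart on a Linux machine.
    with open("/proc/vmstat") as f:
        first = parse_vmstat(f.read())
    time.sleep(10)
    with open("/proc/vmstat") as f:
        second = parse_vmstat(f.read())
    swapped_in, swapped_out = swap_pages_moved(first, second)
    print(f"pages swapped in: {swapped_in}, out: {swapped_out}")
```

Steadily growing swap counters while xfs_repair runs would point towards memory pressure; a nonzero I/O error count would point towards the storage.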


Thanks,

David

-- 
David Chatterton
XFS Engineering Manager
SGI Australia

