cluster-devel.redhat.com archive mirror
 help / color / mirror / Atom feed
From: Andrew Price <anprice@redhat.com>
To: cluster-devel.redhat.com
Subject: [Cluster-devel] [fsck.gfs2 v2 PATCH 00/40] fsck.gfs2: memory reduction patches
Date: Mon, 9 May 2016 15:55:36 +0100	[thread overview]
Message-ID: <5730A4E8.1030106@redhat.com> (raw)
In-Reply-To: <cover.1462555424.git.rpeterso@redhat.com>

Hi Bob,

On 06/05/16 18:38, Bob Peterson wrote:
> On 15 April, I posted a set of 30 patches for fsck.gfs2 to reduce its
> memory requirements. Since that time, I've been testing and fixing
> problems. Here is version 2.
> ---
> Right now, fsck.gfs2 is a memory hog with regard to very large file
> systems. A file system of 250TB would require an absurd about of memory.
> That's because it uses memory for (1) rgrp bitmaps, (2) its own internal
> blockmap, (3) a directory tree, and (4) a tree of ALL inodes in the
> file system, for reconciling link counts.
>
> This series of 40 patches is designed to reduce the memory footprint
> of fsck.gfs2. It does so by almost entirely eliminating the inode
> tree in favor of two much smaller bitmaps. In order to do that, I
> rearranged the order in which things are done.
>
> In order to save on memory, some things will be slower, so this will
> cause a performance hit to fsck.gfs2. However, I'm confident that
> we can fix them and make it fast again. The slowness is caused because
> most bitmap manipulation is now done on the bitmaps in the resource
> groups, which means the code needs to search out the rgrp for these
> operations. This can be optimized to pass in the rgrp, since it will
> be known in most cases (for example, pass1 operates on one rgrp at a
> time, and chances are quite good that the rgrp needed will be that one.)
> These further optimizations will be done in later patches.
>
> I'm still testing these changes against my collection of gfs2 and gfs1
> metadata, but the results are good so far. So there may be changes.

I've read through the patches and the code looks good to me. It's 
difficult to comment on the design of the solution but nothing scary 
jumped out at me, and the test results look promising. There's some 
really good refactoring work in this set too.

> Signed-off-by: Bob Peterson <rpeterso@redhat.com>
> ---
> Bob Peterson (40):
>    fsck.gfs2: Move pass5 to immediately follow pass1
>    fsck.gfs2: Convert block_type to bitmap_type after pass1 and 5
>    fsck.gfs2: Change bitmap_type variables to int
>    fsck.gfs2: Use di_entries to determine if lost+found was created
>    fsck.gfs2: pass1b shouldn't complain about non-bitmap blocks
>    fsck.gfs2: Change all fsck_blockmap_set to fsck_bitmap_set
>    fsck.gfs2: Move set_ip_blockmap to pass1
>    fsck.gfs2: Remove unneeded parameter instree from set_ip_blockmap
>    fsck.gfs2: Move leaf repair to pass2
>    fsck.gfs2: Eliminate astate code
>    fsck.gfs2: Move reprocess code to pass1
>    fsck.gfs2: Separate out functions that may only be done after pass1
>    fsck.gfs2: Divest check_metatree from fsck_blockmap_set
>    fsck.gfs2: eliminate fsck_blockmap_set from check_eattr_entries
>    fsck.gfs2: Move blockmap stuff to pass1.c
>    fsck: make pass1 call bitmap reconciliation AKA pass5
>    fsck.gfs2: make blockmap global variable only to pass1
>    fsck.gfs2: Add wrapper function pass1_check_metatree
>    fsck.gfs2: pass counted_links into fix_link_count in pass4
>    fsck.gfs2: refactor pass4 function scan_inode_list
>    fsck.gfs2: More refactoring of pass4 function scan_inode_list
>    fsck.gfs2: Fix white space problems
>    fsck.gfs2: move link count info for directories to directory tree
>    fsck.gfs2: Use bitmaps instead of linked list for inodes w/nlink == 1
>    fsck.gfs2: Refactor check_n_fix_bitmap to make it more readable
>    fsck.gfs2: adjust rgrp inode count when fixing bitmap
>    fsck.gfs2: blocks cannot be UNLINKED in pass1b or after that
>    fsck.gfs2: Add error checks to get_next_leaf
>    fsck.gfs2: re-add a non-allocating repair_leaf to pass1
>    libgfs2: Allocate new GFS1 metadata as type 3, not type 1
>    fsck.gfs2: Undo partially done metadata records
>    fsck.gfs2: Eliminate redundant code in _fsck_bitmap_set
>    fsck.gfs2: Fix inode counting bug
>    fsck.gfs2: Adjust bitmap for lost+found after adding to dirtree
>    GFS2: Add initialization checks for GFS1 used metadata

s/GFS2/fsck.gfs2/ :)

Thanks,
Andy

>    fsck.gfs2: Use BLKST constants to make pass5 more clear
>    fsck.gfs2: Fix GFS1 "used meta" accounting bug
>    fsck.gfs2: pass1b is too noisy wrt gfs1 non-dinode metadata
>    fsck.gfs2: Fix rgrp dinode accounting bug
>    fsck.gfs2: Fix rgrp accounting in check_n_fix_bitmap
>
>   gfs2/fsck/Makefile.am         |   1 +
>   gfs2/fsck/afterpass1_common.c | 319 ++++++++++++++++
>   gfs2/fsck/afterpass1_common.h |  31 ++
>   gfs2/fsck/fsck.h              |  26 +-
>   gfs2/fsck/initialize.c        |  63 +++-
>   gfs2/fsck/inode_hash.c        |   3 +-
>   gfs2/fsck/link.c              | 139 ++++++-
>   gfs2/fsck/link.h              |   6 +-
>   gfs2/fsck/lost_n_found.c      |  17 +-
>   gfs2/fsck/main.c              |  38 +-
>   gfs2/fsck/metawalk.c          | 847 ++++++++----------------------------------
>   gfs2/fsck/metawalk.h          |  59 +--
>   gfs2/fsck/pass1.c             | 479 ++++++++++++++++++++++--
>   gfs2/fsck/pass1b.c            | 148 +++++---
>   gfs2/fsck/pass2.c             | 276 ++++++++++----
>   gfs2/fsck/pass3.c             |  52 +--
>   gfs2/fsck/pass4.c             | 339 ++++++++++-------
>   gfs2/fsck/pass5.c             |  60 +--
>   gfs2/fsck/util.c              | 171 +++------
>   gfs2/fsck/util.h              |  70 ++--
>   gfs2/libgfs2/fs_ops.c         |  33 +-
>   21 files changed, 1849 insertions(+), 1328 deletions(-)
>   create mode 100644 gfs2/fsck/afterpass1_common.c
>   create mode 100644 gfs2/fsck/afterpass1_common.h
>



  parent reply	other threads:[~2016-05-09 14:55 UTC|newest]

Thread overview: 45+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2016-05-06 17:38 [Cluster-devel] [fsck.gfs2 v2 PATCH 00/40] fsck.gfs2: memory reduction patches Bob Peterson
2016-05-06 17:38 ` [Cluster-devel] [fsck.gfs2 v2 PATCH 01/40] fsck.gfs2: Move pass5 to immediately follow pass1 Bob Peterson
2016-05-06 17:38 ` [Cluster-devel] [fsck.gfs2 v2 PATCH 02/40] fsck.gfs2: Convert block_type to bitmap_type after pass1 and 5 Bob Peterson
2016-05-06 17:38 ` [Cluster-devel] [fsck.gfs2 v2 PATCH 03/40] fsck.gfs2: Change bitmap_type variables to int Bob Peterson
2016-05-06 17:38 ` [Cluster-devel] [fsck.gfs2 v2 PATCH 04/40] fsck.gfs2: Use di_entries to determine if lost+found was created Bob Peterson
2016-05-06 17:38 ` [Cluster-devel] [fsck.gfs2 v2 PATCH 05/40] fsck.gfs2: pass1b shouldn't complain about non-bitmap blocks Bob Peterson
2016-05-06 17:38 ` [Cluster-devel] [fsck.gfs2 v2 PATCH 06/40] fsck.gfs2: Change all fsck_blockmap_set to fsck_bitmap_set Bob Peterson
2016-05-06 17:39 ` [Cluster-devel] [fsck.gfs2 v2 PATCH 07/40] fsck.gfs2: Move set_ip_blockmap to pass1 Bob Peterson
2016-05-06 17:39 ` [Cluster-devel] [fsck.gfs2 v2 PATCH 08/40] fsck.gfs2: Remove unneeded parameter instree from set_ip_blockmap Bob Peterson
2016-05-06 17:39 ` [Cluster-devel] [fsck.gfs2 v2 PATCH 09/40] fsck.gfs2: Move leaf repair to pass2 Bob Peterson
2016-05-06 17:39 ` [Cluster-devel] [fsck.gfs2 v2 PATCH 10/40] fsck.gfs2: Eliminate astate code Bob Peterson
2016-05-06 17:39 ` [Cluster-devel] [fsck.gfs2 v2 PATCH 11/40] fsck.gfs2: Move reprocess code to pass1 Bob Peterson
2016-05-06 17:39 ` [Cluster-devel] [fsck.gfs2 v2 PATCH 12/40] fsck.gfs2: Separate out functions that may only be done after pass1 Bob Peterson
2016-05-09 14:55   ` Andrew Price
2016-05-09 16:35     ` Bob Peterson
2016-05-06 17:39 ` [Cluster-devel] [fsck.gfs2 v2 PATCH 13/40] fsck.gfs2: Divest check_metatree from fsck_blockmap_set Bob Peterson
2016-05-06 17:39 ` [Cluster-devel] [fsck.gfs2 v2 PATCH 14/40] fsck.gfs2: eliminate fsck_blockmap_set from check_eattr_entries Bob Peterson
2016-05-06 17:39 ` [Cluster-devel] [fsck.gfs2 v2 PATCH 15/40] fsck.gfs2: Move blockmap stuff to pass1.c Bob Peterson
2016-05-06 17:39 ` [Cluster-devel] [fsck.gfs2 v2 PATCH 16/40] fsck: make pass1 call bitmap reconciliation AKA pass5 Bob Peterson
2016-05-06 17:39 ` [Cluster-devel] [fsck.gfs2 v2 PATCH 17/40] fsck.gfs2: make blockmap global variable only to pass1 Bob Peterson
2016-05-06 17:39 ` [Cluster-devel] [fsck.gfs2 v2 PATCH 18/40] fsck.gfs2: Add wrapper function pass1_check_metatree Bob Peterson
2016-05-06 17:39 ` [Cluster-devel] [fsck.gfs2 v2 PATCH 19/40] fsck.gfs2: pass counted_links into fix_link_count in pass4 Bob Peterson
2016-05-06 17:39 ` [Cluster-devel] [fsck.gfs2 v2 PATCH 20/40] fsck.gfs2: refactor pass4 function scan_inode_list Bob Peterson
2016-05-06 17:39 ` [Cluster-devel] [fsck.gfs2 v2 PATCH 21/40] fsck.gfs2: More refactoring of " Bob Peterson
2016-05-06 17:39 ` [Cluster-devel] [fsck.gfs2 v2 PATCH 22/40] fsck.gfs2: Fix white space problems Bob Peterson
2016-05-06 17:39 ` [Cluster-devel] [fsck.gfs2 v2 PATCH 23/40] fsck.gfs2: move link count info for directories to directory tree Bob Peterson
2016-05-06 17:39 ` [Cluster-devel] [fsck.gfs2 v2 PATCH 24/40] fsck.gfs2: Use bitmaps instead of linked list for inodes w/nlink == 1 Bob Peterson
2016-05-06 17:39 ` [Cluster-devel] [fsck.gfs2 v2 PATCH 25/40] fsck.gfs2: Refactor check_n_fix_bitmap to make it more readable Bob Peterson
2016-05-06 17:39 ` [Cluster-devel] [fsck.gfs2 v2 PATCH 26/40] fsck.gfs2: adjust rgrp inode count when fixing bitmap Bob Peterson
2016-05-06 17:39 ` [Cluster-devel] [fsck.gfs2 v2 PATCH 27/40] fsck.gfs2: blocks cannot be UNLINKED in pass1b or after that Bob Peterson
2016-05-06 17:39 ` [Cluster-devel] [fsck.gfs2 v2 PATCH 28/40] fsck.gfs2: Add error checks to get_next_leaf Bob Peterson
2016-05-06 17:39 ` [Cluster-devel] [fsck.gfs2 v2 PATCH 29/40] fsck.gfs2: re-add a non-allocating repair_leaf to pass1 Bob Peterson
2016-05-06 17:39 ` [Cluster-devel] [fsck.gfs2 v2 PATCH 30/40] libgfs2: Allocate new GFS1 metadata as type 3, not type 1 Bob Peterson
2016-05-06 17:39 ` [Cluster-devel] [fsck.gfs2 v2 PATCH 31/40] fsck.gfs2: Undo partially done metadata records Bob Peterson
2016-05-06 17:39 ` [Cluster-devel] [fsck.gfs2 v2 PATCH 32/40] fsck.gfs2: Eliminate redundant code in _fsck_bitmap_set Bob Peterson
2016-05-06 17:39 ` [Cluster-devel] [fsck.gfs2 v2 PATCH 33/40] fsck.gfs2: Fix inode counting bug Bob Peterson
2016-05-06 17:39 ` [Cluster-devel] [fsck.gfs2 v2 PATCH 34/40] fsck.gfs2: Adjust bitmap for lost+found after adding to dirtree Bob Peterson
2016-05-06 17:39 ` [Cluster-devel] [fsck.gfs2 v2 PATCH 35/40] GFS2: Add initialization checks for GFS1 used metadata Bob Peterson
2016-05-06 17:39 ` [Cluster-devel] [fsck.gfs2 v2 PATCH 36/40] fsck.gfs2: Use BLKST constants to make pass5 more clear Bob Peterson
2016-05-06 17:39 ` [Cluster-devel] [fsck.gfs2 v2 PATCH 37/40] fsck.gfs2: Fix GFS1 "used meta" accounting bug Bob Peterson
2016-05-06 17:39 ` [Cluster-devel] [fsck.gfs2 v2 PATCH 38/40] fsck.gfs2: pass1b is too noisy wrt gfs1 non-dinode metadata Bob Peterson
2016-05-06 17:39 ` [Cluster-devel] [fsck.gfs2 v2 PATCH 39/40] fsck.gfs2: Fix rgrp dinode accounting bug Bob Peterson
2016-05-06 17:39 ` [Cluster-devel] [fsck.gfs2 v2 PATCH 40/40] fsck.gfs2: Fix rgrp accounting in check_n_fix_bitmap Bob Peterson
2016-05-09 14:55 ` Andrew Price [this message]
2016-05-09 17:48 ` [Cluster-devel] [fsck.gfs2 v2 PATCH 00/40] fsck.gfs2: memory reduction patches Abhijith Das

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=5730A4E8.1030106@redhat.com \
    --to=anprice@redhat.com \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).