From mboxrd@z Thu Jan 1 00:00:00 1970 Return-Path: X-Spam-Checker-Version: SpamAssassin 3.4.0 (2014-02-07) on aws-us-west-2-korg-lkml-1.web.codeaurora.org Received: from vger.kernel.org (vger.kernel.org [23.128.96.18]) by smtp.lore.kernel.org (Postfix) with ESMTP id 28428C433F5 for ; Thu, 24 Mar 2022 14:39:29 +0000 (UTC) Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1351068AbiCXOk7 (ORCPT ); Thu, 24 Mar 2022 10:40:59 -0400 Received: from lindbergh.monkeyblade.net ([23.128.96.19]:60366 "EHLO lindbergh.monkeyblade.net" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1347017AbiCXOkz (ORCPT ); Thu, 24 Mar 2022 10:40:55 -0400 Received: from ams.source.kernel.org (ams.source.kernel.org [145.40.68.75]) by lindbergh.monkeyblade.net (Postfix) with ESMTPS id 0149635DF7 for ; Thu, 24 Mar 2022 07:39:22 -0700 (PDT) Received: from smtp.kernel.org (relay.kernel.org [52.25.139.140]) (using TLSv1.2 with cipher ECDHE-RSA-AES256-GCM-SHA384 (256/256 bits)) (No client certificate requested) by ams.source.kernel.org (Postfix) with ESMTPS id 9D861B82409 for ; Thu, 24 Mar 2022 14:39:21 +0000 (UTC) Received: by smtp.kernel.org (Postfix) with ESMTPSA id D9FD8C340ED; Thu, 24 Mar 2022 14:39:16 +0000 (UTC) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/simple; d=kernel.org; s=k20201202; t=1648132760; bh=gAZpVXYOpw0ShBQWA1T2gNp/wWDlQwWNUJIrMqEi3U0=; h=From:To:Cc:Subject:Date:From; b=qgYPtZMLWQX0QgXYEBjS6DuGGYdRPu0LBNk4L9P5ndDDGGISzV3GC7h4lwlHtR2Oe taO4gE3CJMJM+JX1PxOE63ovV9wGAU+5FPVUnkSyrwv6GwanQ+qhfvPZzzQoBb3Dya vh3aiSmaVXq9cLgdzhkvY4gzqOt/Fv+58dZdyD7I7tSGZ1VnY6xTSWxVJyaycW//jl Q1bRTK1CMzV6os/TyVcXIhhKoMJCDQsk8mY7OFzkD10kwu5koMS/t2jfR/eVKs5u6G p4+ntBV+LWlvjOX4vRtp0HG4gpVYa8birwqkqnH23UYjiHii4QuUu1/ega04Rp4EaK akJrEXfIAqFTQ== From: Chao Yu To: jaegeuk@kernel.org Cc: linux-f2fs-devel@lists.sourceforge.net, linux-kernel@vger.kernel.org, Chao Yu , Chao Yu Subject: [PATCH v2] f2fs: give priority to select unpinned section for foreground GC Date: Thu, 24 Mar 2022 22:39:07 +0800 Message-Id: <20220324143907.3071-1-chao@kernel.org> X-Mailer: git-send-email 2.32.0 MIME-Version: 1.0 Content-Transfer-Encoding: 8bit Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org Previously, during foreground GC, if victims contain data of pinned file, it will fail migration of the data, and meanwhile i_gc_failures of that pinned file may increase, and when it exceeds threshold, GC will unpin the file, result in breaking pinfile's semantics. In order to mitigate such condition, let's record and skip section which has pinned file's data and give priority to select unpinned one. Signed-off-by: Chao Yu --- v2: - don't release .pinned_secmap memory in init_victim_secmap() fs/f2fs/gc.c | 53 ++++++++++++++++++++++++++++++++++++++++++++--- fs/f2fs/segment.c | 7 +++++++ fs/f2fs/segment.h | 2 ++ 3 files changed, 59 insertions(+), 3 deletions(-) diff --git a/fs/f2fs/gc.c b/fs/f2fs/gc.c index 6a7e4148ff9d..b4e559030d99 100644 --- a/fs/f2fs/gc.c +++ b/fs/f2fs/gc.c @@ -646,6 +646,34 @@ static void release_victim_entry(struct f2fs_sb_info *sbi) f2fs_bug_on(sbi, !list_empty(&am->victim_list)); } +static void pin_section(struct f2fs_sb_info *sbi, unsigned int segno) +{ + struct dirty_seglist_info *dirty_i = DIRTY_I(sbi); + + set_bit(GET_SEC_FROM_SEG(sbi, segno), dirty_i->pinned_secmap); + dirty_i->pinned_secmap_cnt++; +} + +static bool pinned_section_exists(struct dirty_seglist_info *dirty_i) +{ + return dirty_i->pinned_secmap_cnt; +} + +static bool section_is_pinned(struct dirty_seglist_info *dirty_i, + unsigned int secno) +{ + return pinned_section_exists(dirty_i) && + test_bit(secno, dirty_i->pinned_secmap); +} + +static void unpin_all_sections(struct f2fs_sb_info *sbi) +{ + unsigned int bitmap_size = f2fs_bitmap_size(MAIN_SECS(sbi)); + + memset(DIRTY_I(sbi)->pinned_secmap, 0, bitmap_size); + DIRTY_I(sbi)->pinned_secmap_cnt = 0; +} + /* * This function is called from two paths. * One is garbage collection and the other is SSR segment selection. @@ -787,6 +815,9 @@ static int get_victim_by_default(struct f2fs_sb_info *sbi, if (gc_type == BG_GC && test_bit(secno, dirty_i->victim_secmap)) goto next; + if (gc_type == FG_GC && section_is_pinned(dirty_i, secno)) + goto next; + if (is_atgc) { add_victim_entry(sbi, &p, segno); goto next; @@ -1202,8 +1233,10 @@ static int move_data_block(struct inode *inode, block_t bidx, } if (f2fs_is_pinned_file(inode)) { - if (gc_type == FG_GC) + if (gc_type == FG_GC) { f2fs_pin_file_control(inode, true); + pin_section(F2FS_I_SB(inode), segno); + } err = -EAGAIN; goto out; } @@ -1352,8 +1385,10 @@ static int move_data_page(struct inode *inode, block_t bidx, int gc_type, goto out; } if (f2fs_is_pinned_file(inode)) { - if (gc_type == FG_GC) + if (gc_type == FG_GC) { f2fs_pin_file_control(inode, true); + pin_section(F2FS_I_SB(inode), segno); + } err = -EAGAIN; goto out; } @@ -1485,6 +1520,7 @@ static int gc_data_segment(struct f2fs_sb_info *sbi, struct f2fs_summary *sum, gc_type == FG_GC) { f2fs_pin_file_control(inode, true); iput(inode); + pin_section(sbi, segno); return submitted; } @@ -1766,9 +1802,17 @@ int f2fs_gc(struct f2fs_sb_info *sbi, bool sync, ret = -EINVAL; goto stop; } +retry: ret = __get_victim(sbi, &segno, gc_type); - if (ret) + if (ret) { + /* allow to search victim from sections has pinned data */ + if (ret == -ENODATA && gc_type == FG_GC && + pinned_section_exists(DIRTY_I(sbi))) { + unpin_all_sections(sbi); + goto retry; + } goto stop; + } seg_freed = do_garbage_collect(sbi, segno, &gc_list, gc_type, force); if (gc_type == FG_GC && @@ -1811,6 +1855,9 @@ int f2fs_gc(struct f2fs_sb_info *sbi, bool sync, SIT_I(sbi)->last_victim[ALLOC_NEXT] = 0; SIT_I(sbi)->last_victim[FLUSH_DEVICE] = init_segno; + if (gc_type == FG_GC && pinned_section_exists(DIRTY_I(sbi))) + unpin_all_sections(sbi); + trace_f2fs_gc_end(sbi->sb, ret, total_freed, sec_freed, get_pages(sbi, F2FS_DIRTY_NODES), get_pages(sbi, F2FS_DIRTY_DENTS), diff --git a/fs/f2fs/segment.c b/fs/f2fs/segment.c index 012524db7437..1c20d7c9eca3 100644 --- a/fs/f2fs/segment.c +++ b/fs/f2fs/segment.c @@ -4736,6 +4736,12 @@ static int init_victim_secmap(struct f2fs_sb_info *sbi) dirty_i->victim_secmap = f2fs_kvzalloc(sbi, bitmap_size, GFP_KERNEL); if (!dirty_i->victim_secmap) return -ENOMEM; + + dirty_i->pinned_secmap = f2fs_kvzalloc(sbi, bitmap_size, GFP_KERNEL); + if (!dirty_i->pinned_secmap) + return -ENOMEM; + + dirty_i->pinned_secmap_cnt = 0; return 0; } @@ -5324,6 +5330,7 @@ static void destroy_victim_secmap(struct f2fs_sb_info *sbi) { struct dirty_seglist_info *dirty_i = DIRTY_I(sbi); + kvfree(dirty_i->pinned_secmap); kvfree(dirty_i->victim_secmap); } diff --git a/fs/f2fs/segment.h b/fs/f2fs/segment.h index 5c94caf0c0a1..fd6f246e649c 100644 --- a/fs/f2fs/segment.h +++ b/fs/f2fs/segment.h @@ -294,6 +294,8 @@ struct dirty_seglist_info { struct mutex seglist_lock; /* lock for segment bitmaps */ int nr_dirty[NR_DIRTY_TYPE]; /* # of dirty segments */ unsigned long *victim_secmap; /* background GC victims */ + unsigned long *pinned_secmap; /* pinned victims from foreground GC */ + unsigned int pinned_secmap_cnt; /* count of victims which has pinned data */ }; /* victim selection function for cleaning and SSR */ -- 2.32.0