From mboxrd@z Thu Jan 1 00:00:00 1970 Return-Path: X-Spam-Checker-Version: SpamAssassin 3.4.0 (2014-02-07) on aws-us-west-2-korg-lkml-1.web.codeaurora.org Received: from vger.kernel.org (vger.kernel.org [23.128.96.18]) by smtp.lore.kernel.org (Postfix) with ESMTP id D6BC2C6FD18 for ; Tue, 25 Apr 2023 15:04:24 +0000 (UTC) Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S234279AbjDYPEY (ORCPT ); Tue, 25 Apr 2023 11:04:24 -0400 Received: from lindbergh.monkeyblade.net ([23.128.96.19]:37368 "EHLO lindbergh.monkeyblade.net" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S233991AbjDYPEX (ORCPT ); Tue, 25 Apr 2023 11:04:23 -0400 Received: from dfw.source.kernel.org (dfw.source.kernel.org [139.178.84.217]) by lindbergh.monkeyblade.net (Postfix) with ESMTPS id 643237EF4; Tue, 25 Apr 2023 08:04:18 -0700 (PDT) Received: from smtp.kernel.org (relay.kernel.org [52.25.139.140]) (using TLSv1.2 with cipher ECDHE-RSA-AES256-GCM-SHA384 (256/256 bits)) (No client certificate requested) by dfw.source.kernel.org (Postfix) with ESMTPS id D68286136C; Tue, 25 Apr 2023 15:04:17 +0000 (UTC) Received: by smtp.kernel.org (Postfix) with ESMTPSA id 28A4DC433D2; Tue, 25 Apr 2023 15:04:13 +0000 (UTC) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/simple; d=kernel.org; s=k20201202; t=1682435057; bh=O3ezZk4F0u+EDnSCgMTllu+XmCUbpS7GFbM9vxhBf20=; h=Date:Subject:To:Cc:References:From:In-Reply-To:From; b=kwdCyVKB3PWhdDCUwgl4iLtMu2yDPZIL3XHVoB6depYYTASgegorif0quoy5FBcMD 7mvgCAArpzVHTpX0gnNdEptlYb3Souf04h25iz8elTKAncwXyTcUwAtk23oDWmjL6O 4hXr4Cibu0AEnfPJQHlKHU7deFfFp8EO+N7ILvtheGezIgHxUczrvisNBLdCHeom5z bZzmyMvitjABKAF4fjFvQzvXSVB6537zfInvLsMKMsTprp6OodqLKY+DHa+N5wGfuL 6Pk2mA5kUrA/yuz5KwkqUfKklZ4Zyusy5vTy90is0evCL3cyvYSn+GDnOmIOy4dWtG cLxHfNZNeAreg== Message-ID: Date: Tue, 25 Apr 2023 23:04:11 +0800 MIME-Version: 1.0 User-Agent: Mozilla/5.0 (Windows NT 10.0; Win64; x64; rv:102.0) Gecko/20100101 Thunderbird/102.10.0 Subject: Re: [PATCH v2] f2fs: add async reset zone command support To: daejun7.park@samsung.com, "jaegeuk@kernel.org" , "rostedt@goodmis.org" , "mhiramat@kernel.org" , "linux-f2fs-devel@lists.sourceforge.net" , "linux-kernel@vger.kernel.org" , "linux-trace-kernel@vger.kernel.org" Cc: Seokhwan Kim , Yonggil Song , beomsu kim References: <20230425081018epcms2p8fefa94810f0b341d2cbd70ba587e9e8c@epcms2p8> Content-Language: en-US From: Chao Yu In-Reply-To: <20230425081018epcms2p8fefa94810f0b341d2cbd70ba587e9e8c@epcms2p8> Content-Type: text/plain; charset=UTF-8; format=flowed Content-Transfer-Encoding: 7bit Precedence: bulk List-ID: X-Mailing-List: linux-trace-kernel@vger.kernel.org On 2023/4/25 16:10, Daejun Park wrote: > Changelog: > > v1 -> v2 > Changed to apply the optional async reset write pointer by default. > > > This patch enables submit reset zone command asynchornously. It helps > decrease average latency of write IOs in high utilization scenario by > faster checkpointing. > > Signed-off-by: Daejun Park > --- > fs/f2fs/segment.c | 95 ++++++++++++++++++++++++++++++++++--- > include/trace/events/f2fs.h | 18 ++++++- > 2 files changed, 105 insertions(+), 8 deletions(-) > > diff --git a/fs/f2fs/segment.c b/fs/f2fs/segment.c > index 82430f80c5da..27074a6eaf20 100644 > --- a/fs/f2fs/segment.c > +++ b/fs/f2fs/segment.c > @@ -1195,6 +1195,46 @@ static void __init_discard_policy(struct f2fs_sb_info *sbi, > static void __update_discard_tree_range(struct f2fs_sb_info *sbi, > struct block_device *bdev, block_t lstart, > block_t start, block_t len); > + > +#ifdef CONFIG_BLK_DEV_ZONED > +static int __submit_zone_reset_cmd(struct f2fs_sb_info *sbi, > + struct discard_cmd *dc, blk_opf_t flag, > + struct list_head *wait_list, > + unsigned int *issued) > +{ > + struct discard_cmd_control *dcc = SM_I(sbi)->dcc_info; > + struct block_device *bdev = dc->bdev; > + struct bio *bio = bio_alloc(bdev, 0, REQ_OP_ZONE_RESET | flag, GFP_NOFS); > + unsigned long flags; > + > + trace_f2fs_issue_reset_zone(bdev, SECTOR_FROM_BLOCK(dc->di.start)); trace_f2fs_issue_reset_zone(bdev, dc->di.start); ? > + > + spin_lock_irqsave(&dc->lock, flags); > + dc->state = D_SUBMIT; > + dc->bio_ref++; > + spin_unlock_irqrestore(&dc->lock, flags); > + > + if (issued) > + (*issued)++; > + > + atomic_inc(&dcc->queued_discard); > + dc->queued++; > + list_move_tail(&dc->list, wait_list); > + > + /* sanity check on discard range */ > + __check_sit_bitmap(sbi, dc->di.lstart, dc->di.lstart + dc->di.len); > + > + bio->bi_iter.bi_sector = SECTOR_FROM_BLOCK(dc->di.start); > + bio->bi_private = dc; > + bio->bi_end_io = f2fs_submit_discard_endio; > + submit_bio(bio); > + > + atomic_inc(&dcc->issued_discard); > + > + return 0; Unneeded return value? > +} > +#endif > + > /* this function is copied from blkdev_issue_discard from block/blk-lib.c */ > static int __submit_discard_cmd(struct f2fs_sb_info *sbi, > struct discard_policy *dpolicy, > @@ -1216,6 +1256,11 @@ static int __submit_discard_cmd(struct f2fs_sb_info *sbi, > if (is_sbi_flag_set(sbi, SBI_NEED_FSCK)) > return 0; > > +#ifdef CONFIG_BLK_DEV_ZONED > + if (f2fs_sb_has_blkzoned(sbi) && bdev_is_zoned(bdev)) > + return __submit_zone_reset_cmd(sbi, dc, flag, wait_list, issued); > +#endif > + > trace_f2fs_issue_discard(bdev, dc->di.start, dc->di.len); > > lstart = dc->di.lstart; > @@ -1460,21 +1505,42 @@ static void __update_discard_tree_range(struct f2fs_sb_info *sbi, > } > } > > +#ifdef CONFIG_BLK_DEV_ZONED > +static void __queue_zone_reset_cmd(struct f2fs_sb_info *sbi, > + struct block_device *bdev, block_t blkstart, block_t blklen) block_t lblkstart, > +{ > + block_t lblkstart = blkstart; block_t blkstart = lblkstart; > + > + if (f2fs_is_multi_device(sbi)) { > + int devi = f2fs_target_device_index(sbi, blkstart); > + > + blkstart -= FDEV(devi).start_blk; > + } > + > + trace_f2fs_queue_reset_zone(bdev, blkstart); > + > + mutex_lock(&SM_I(sbi)->dcc_info->cmd_lock); > + __insert_discard_cmd(sbi, bdev, lblkstart, blkstart, blklen); > + mutex_unlock(&SM_I(sbi)->dcc_info->cmd_lock); > +} > +#endif > + > static void __queue_discard_cmd(struct f2fs_sb_info *sbi, > struct block_device *bdev, block_t blkstart, block_t blklen) > { > block_t lblkstart = blkstart; > > + trace_f2fs_queue_discard(bdev, blkstart, blklen); If device doesn't support discard, it's better not show above tracepoint? > + > if (!f2fs_bdev_support_discard(bdev)) > return; > > - trace_f2fs_queue_discard(bdev, blkstart, blklen); > - > if (f2fs_is_multi_device(sbi)) { > int devi = f2fs_target_device_index(sbi, blkstart); > > blkstart -= FDEV(devi).start_blk; > } > + > mutex_lock(&SM_I(sbi)->dcc_info->cmd_lock); > __update_discard_tree_range(sbi, bdev, lblkstart, blkstart, blklen); > mutex_unlock(&SM_I(sbi)->dcc_info->cmd_lock); > @@ -1725,7 +1791,18 @@ static void f2fs_wait_discard_bio(struct f2fs_sb_info *sbi, block_t blkaddr) > dc = __lookup_discard_cmd(sbi, blkaddr); > if (dc) { > if (dc->state == D_PREP) { > - __punch_discard_cmd(sbi, dc, blkaddr); > + if (f2fs_sb_has_blkzoned(sbi) && > + bdev_is_zoned(dc->bdev)) { > + struct discard_policy dpolicy; > + /* force submit zone reset */ > + __init_discard_policy(sbi, &dpolicy, > + DPOLICY_FORCE, 1); > + __submit_discard_cmd(sbi, &dpolicy, dc, NULL); Can we call __submit_zone_reset_cmd() directly? Thanks, > + dc->ref++; > + need_wait = true; > + } else { > + __punch_discard_cmd(sbi, dc, blkaddr); > + } > } else { > dc->ref++; > need_wait = true; > @@ -1875,9 +1952,15 @@ static int __f2fs_issue_discard_zone(struct f2fs_sb_info *sbi, > blkstart, blklen); > return -EIO; > } > - trace_f2fs_issue_reset_zone(bdev, blkstart); > - return blkdev_zone_mgmt(bdev, REQ_OP_ZONE_RESET, > - sector, nr_sects, GFP_NOFS); > + > + if (unlikely(!is_sbi_flag_set(sbi, SBI_POR_DOING))) { > + trace_f2fs_issue_reset_zone(bdev, blkstart); > + return blkdev_zone_mgmt(bdev, REQ_OP_ZONE_RESET, > + sector, nr_sects, GFP_NOFS); > + } > + > + __queue_zone_reset_cmd(sbi, bdev, lblkstart, blklen); > + return 0; > } > > /* For conventional zones, use regular discard if supported */ > diff --git a/include/trace/events/f2fs.h b/include/trace/events/f2fs.h > index 99cbc5949e3c..ee1477de8324 100644 > --- a/include/trace/events/f2fs.h > +++ b/include/trace/events/f2fs.h > @@ -1512,7 +1512,7 @@ DEFINE_EVENT(f2fs_discard, f2fs_remove_discard, > TP_ARGS(dev, blkstart, blklen) > ); > > -TRACE_EVENT(f2fs_issue_reset_zone, > +DECLARE_EVENT_CLASS(f2fs_reset_zone, > > TP_PROTO(struct block_device *dev, block_t blkstart), > > @@ -1528,11 +1528,25 @@ TRACE_EVENT(f2fs_issue_reset_zone, > __entry->blkstart = blkstart; > ), > > - TP_printk("dev = (%d,%d), reset zone at block = 0x%llx", > + TP_printk("dev = (%d,%d), zone at block = 0x%llx", > show_dev(__entry->dev), > (unsigned long long)__entry->blkstart) > ); > > +DEFINE_EVENT(f2fs_reset_zone, f2fs_queue_reset_zone, > + > + TP_PROTO(struct block_device *dev, block_t blkstart), > + > + TP_ARGS(dev, blkstart) > +); > + > +DEFINE_EVENT(f2fs_reset_zone, f2fs_issue_reset_zone, > + > + TP_PROTO(struct block_device *dev, block_t blkstart), > + > + TP_ARGS(dev, blkstart) > +); > + > TRACE_EVENT(f2fs_issue_flush, > > TP_PROTO(struct block_device *dev, unsigned int nobarrier,