From mboxrd@z Thu Jan 1 00:00:00 1970 Return-Path: X-Spam-Checker-Version: SpamAssassin 3.4.0 (2014-02-07) on aws-us-west-2-korg-lkml-1.web.codeaurora.org Received: from bombadil.infradead.org (bombadil.infradead.org [198.137.202.133]) (using TLSv1.2 with cipher ECDHE-RSA-AES256-GCM-SHA384 (256/256 bits)) (No client certificate requested) by smtp.lore.kernel.org (Postfix) with ESMTPS id 0CDB9C47258 for ; Thu, 1 Feb 2024 02:32:20 +0000 (UTC) DKIM-Signature: v=1; a=rsa-sha256; q=dns/txt; c=relaxed/relaxed; d=lists.infradead.org; s=bombadil.20210309; h=Sender:List-Subscribe:List-Help :List-Post:List-Archive:List-Unsubscribe:List-Id:Content-Transfer-Encoding: MIME-Version:Message-ID:Date:Subject:Cc:To:From:Reply-To:Content-Type: Content-ID:Content-Description:Resent-Date:Resent-From:Resent-Sender: Resent-To:Resent-Cc:Resent-Message-ID:In-Reply-To:References:List-Owner; bh=Kd14LPOMT0i8196jrcW6/++I8e+PPUQtGw/KZTrP0X4=; b=k6S9zqCaFnnSxXm8Drt5phsHss YlSXQ+9tGyfeyTGRr0wH+4hv1DpWtJHEr5j69uZ0gyuGixSTZK4N0Rheu+NJAGD7LaSC9ti1g3Eyk siGn5WmljcV1c8kk2E7ujgJyIzAiyG78lvZzYmUMRnbdCX8plDghTjX+Huj6NZEGsm358WvEGgmWL yIWkIO4k83WCSP1b4Mvo/qYrUfmoczNTy0wfWe/rl+OZvgiUkUVXNmKAxz7OBXOqmAPvMIlb5yC5o sW/yyWtUNyuuBURQuOz3tTb8mVHWATohe3iqfm2R1u2T/B7miYQ+291HETylSe4hw6XFIvDA4mudM AKt+PllQ==; Received: from localhost ([::1] helo=bombadil.infradead.org) by bombadil.infradead.org with esmtp (Exim 4.97.1 #2 (Red Hat Linux)) id 1rVMsE-00000006J3I-1rDy; Thu, 01 Feb 2024 02:32:18 +0000 Received: from out30-112.freemail.mail.aliyun.com ([115.124.30.112]) by bombadil.infradead.org with esmtps (Exim 4.97.1 #2 (Red Hat Linux)) id 1rVMsC-00000006J2u-1Drf for linux-nvme@lists.infradead.org; Thu, 01 Feb 2024 02:32:17 +0000 DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=linux.alibaba.com; s=default; t=1706754733; h=From:To:Subject:Date:Message-ID:MIME-Version; bh=Kd14LPOMT0i8196jrcW6/++I8e+PPUQtGw/KZTrP0X4=; b=Q7GDETt2iAm34De7cEyltP0FCJIaW//5qWf0mCIbYRuNQr7v6qM7JpZ6Ifdycziou5iBLc++YOMl8jO/04SW1Horthqz0t9AEzMDNTETLO8mextFLVF62UmJZ6lytYZj9VOikbM7OfLi9/YunmvD3GxUq/3G5q9/U1VnUZ3dPQ0= X-Alimail-AntiSpam: AC=PASS;BC=-1|-1;BR=01201311R821e4;CH=green;DM=||false|;DS=||;FP=0|-1|-1|-1|0|-1|-1|-1;HT=ay29a033018046050;MF=kanie@linux.alibaba.com;NM=1;PH=DS;RN=4;SR=0;TI=SMTPD_---0W.lz.v7_1706754727; Received: from localhost(mailfrom:kanie@linux.alibaba.com fp:SMTPD_---0W.lz.v7_1706754727) by smtp.aliyun-inc.com; Thu, 01 Feb 2024 10:32:12 +0800 From: Guixin Liu To: sagi@grimberg.me, hch@lst.de, kch@nvidia.com Cc: linux-nvme@lists.infradead.org Subject: [PATCH v7 0/1] Implement the NVMe reservation feature Date: Thu, 1 Feb 2024 10:32:06 +0800 Message-ID: <20240201023207.112007-1-kanie@linux.alibaba.com> X-Mailer: git-send-email 2.43.0 MIME-Version: 1.0 Content-Transfer-Encoding: 8bit X-CRM114-Version: 20100106-BlameMichelson ( TRE 0.8.0 (BSD) ) MR-646709E3 X-CRM114-CacheID: sfid-20240131_183216_685172_26E977EB X-CRM114-Status: GOOD ( 20.09 ) X-BeenThere: linux-nvme@lists.infradead.org X-Mailman-Version: 2.1.34 Precedence: list List-Id: List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Sender: "Linux-nvme" Errors-To: linux-nvme-bounces+linux-nvme=archiver.kernel.org@lists.infradead.org Hi guys, I've implemented the NVMe reservation feature. Please review it, all comments are welcome as usual. Changes from v6 to v7: - Handle "reservation notification mask" feature command to mask reservation log. - Add all the registrants that need to be freed to a temporary list fist, and then after calling synchronize_rcu(), release all the registrants on the temporary list. - Fix the resv log page is random when there is no resv log page. - Change nvmet_is_host_still_connected() to nvmet_is_host_connected(). - Remove nvmet_pr_set_rtype_and_holder() and change nvmet_pr_create_new_resv() to nvmet_pr_create_new_reservation(). - Change nvmet_pr_find_registrant_by_hostid() to nvmet_pr_find_registrant(). - Change nvmet_pr_send_resv_released() to nvmet_pr_resv_released(). - Change __nvmet_pr_unregister_one() to nvmet_pr_unregister_one(). - In nvmet_pr_unreg_by_prkey(), nvmet_pr_unreg_by_prkey_except_hostid() and nvmet_pr_unreg_except_hostid(), first do unregistering and then do event sending. Changes from v5 to v6: - Use synchronize_rcu() and kfree() to free registrant instead of kfree_rcu(). - Remove nvmet_pr_register_check_rkey(), put the check into pr_lock warp. And refactor the nvmet_pr_register(). - Add the print fmt to the head. - Add lockdep_is_held(&pr->pr_lock) condition to list_for_each_entry_rcu. - Fix the bug in nvmet_pr_update_reg_attr(), when the change_attr hook return fail, we should not replace the holder. Changes from v4 to v5: - Use rculist macros to handle registration_list instead of list macros regardless of in mutex lock or not. - Use goto statement instead of return in nvmet_is_host_still_connected and __nvmet_pr_unregister_one. - Add lockdep_assert_held and rcu_read_lock_held assert to many functions, if it's necessary. - Add a comment to nvmet_execute_get_log_page_resv to explain how lost_count works. - In nvmet_pr_clear, we should set holder to NULL first, I fixed this. - Unify nvmet_pr_update_holder_rtype and __nvmet_pr_do_replace to nvmet_pr_update_reg_attr. - Fix wrong nr_pages in nvmet_execute_get_log_page_resv. - Fix the deadlock issue of nvmet_pr_exit_ns, put it out of the subsys lock. Changes from v3 to v4: - Use kfifo to handle resv log page instead of list, and also limit the resv log queue to 64. - Change the function calling alignment style to: nvmet_pr_send_event_by_hostid(pr, hostid, NVME_PR_LOG_RESERVATOPM_PREEMPTED); - Put kmalloc out of rcu_read_lock in nvmet_execute_pr_report(). - Remove the goto in __nvmet_pr_unregister_one(). - Change generation to atomic_t, and remove nvmet_pr_inc_generation(). - In addtion, the number2 patch "nvmet: unify aer type enum" is not relate with this patch, so I will send it separately. Changes from v2 to v3: - Use rcu instead of rwlock to make IO path run faster, and put the rtype into the struct nvmet_pr_registrant. - Limit the resv_log_list to 128. - Change generation to atomic64. - Put register rkey check to a warpper. - Change nr_avl_pages to nr_pages. - Use NVME_SC_SUCCESS instead of 0. - Change kmalloc param to let it not sleep in mutex lock. Changes from v1 to v2: - Implement the reservation notification report, includes registration preempted, reservation released and reservation preempted. And also handle the reservation log page available event and send get reservation log page command to clear log page at host. - Put the reservation check access after validate opcode. And remove opcodes which nvmet not implement yet check. Now there is no admin opcode nvmet implemented needs reservation check, so I dont add reservation check to admin command path. Next we need to do reservation check includes the situation of nsid is 0xffffffff at each admin command path, if it is needed. - Add reservation commands support in nvmet_get_cmd_effects_nvm(). - From Chaitanya, change the local variable tree style to make it cleaner, and add some comments about NVMe spec. And also change others advice from chaitanya. - Put the nvmet_pr_check_cmd_access and nvmet_parse_pr_cmd into reservation enable check warp. - Remove kmem_cache instead to use kmalloc and kfree. - Change others advice from Sagi. - Add a blktest test case, this patch will be sent before these series of patches. Guixin Liu (1): nvmet: support reservation feature drivers/nvme/target/Makefile | 2 +- drivers/nvme/target/admin-cmd.c | 20 +- drivers/nvme/target/configfs.c | 27 + drivers/nvme/target/core.c | 51 +- drivers/nvme/target/nvmet.h | 36 ++ drivers/nvme/target/pr.c | 1041 +++++++++++++++++++++++++++++++ include/linux/nvme.h | 54 ++ 7 files changed, 1224 insertions(+), 7 deletions(-) create mode 100644 drivers/nvme/target/pr.c -- 2.43.0