From mboxrd@z Thu Jan 1 00:00:00 1970 Return-Path: X-Spam-Checker-Version: SpamAssassin 3.4.0 (2014-02-07) on aws-us-west-2-korg-lkml-1.web.codeaurora.org Received: from lists.gnu.org (lists.gnu.org [209.51.188.17]) (using TLSv1.2 with cipher ECDHE-RSA-AES256-GCM-SHA384 (256/256 bits)) (No client certificate requested) by smtp.lore.kernel.org (Postfix) with ESMTPS id 26521CCF9E3 for ; Thu, 30 Oct 2025 15:23:27 +0000 (UTC) Received: from localhost ([::1] helo=lists1p.gnu.org) by lists.gnu.org with esmtp (Exim 4.90_1) (envelope-from ) id 1vEUUB-0004hi-GI; Thu, 30 Oct 2025 11:22:47 -0400 Received: from eggs.gnu.org ([2001:470:142:3::10]) by lists.gnu.org with esmtps (TLS1.2:ECDHE_RSA_AES_256_GCM_SHA384:256) (Exim 4.90_1) (envelope-from ) id 1vEUU8-0004gr-GI for qemu-devel@nongnu.org; Thu, 30 Oct 2025 11:22:44 -0400 Received: from us-smtp-delivery-124.mimecast.com ([170.10.129.124]) by eggs.gnu.org with esmtps (TLS1.2:ECDHE_RSA_AES_256_GCM_SHA384:256) (Exim 4.90_1) (envelope-from ) id 1vEUTk-0006XU-9w for qemu-devel@nongnu.org; Thu, 30 Oct 2025 11:22:44 -0400 DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=redhat.com; s=mimecast20190719; t=1761837737; h=from:from:reply-to:subject:subject:date:date:message-id:message-id: to:to:cc:cc:mime-version:mime-version: content-transfer-encoding:content-transfer-encoding: in-reply-to:in-reply-to:references:references; bh=6Lm/GnRFPdaTGn+n6ZnJEDhQJPDWu5LYL8zGrDK1Esk=; b=eI8zDAdksza/Xw7Ko8A/kI0x/qDRqDecaDDF+kMvU7pS6otchy03CbDSOn85cIp8+t8qDs PIWfXvODdGack+T1EZZC634QxAtAnjICnyqKF8VANPBHIWihRVFbIemNI0z00+ukD6B8nY 0vylGPNv/ETqpFQYM6eBnJqitg8AJF4= Received: from mx-prod-mc-05.mail-002.prod.us-west-2.aws.redhat.com (ec2-54-186-198-63.us-west-2.compute.amazonaws.com [54.186.198.63]) by relay.mimecast.com with ESMTP with STARTTLS (version=TLSv1.3, cipher=TLS_AES_256_GCM_SHA384) id us-mta-391-o0MS5bP7PAafbh0yB6aF3A-1; Thu, 30 Oct 2025 11:22:14 -0400 X-MC-Unique: o0MS5bP7PAafbh0yB6aF3A-1 X-Mimecast-MFC-AGG-ID: o0MS5bP7PAafbh0yB6aF3A_1761837733 Received: from mx-prod-int-01.mail-002.prod.us-west-2.aws.redhat.com (mx-prod-int-01.mail-002.prod.us-west-2.aws.redhat.com [10.30.177.4]) (using TLSv1.3 with cipher TLS_AES_256_GCM_SHA384 (256/256 bits) key-exchange X25519 server-signature RSA-PSS (2048 bits) server-digest SHA256) (No client certificate requested) by mx-prod-mc-05.mail-002.prod.us-west-2.aws.redhat.com (Postfix) with ESMTPS id 5883D1955DCD; Thu, 30 Oct 2025 15:22:13 +0000 (UTC) Received: from localhost (unknown [10.2.16.94]) by mx-prod-int-01.mail-002.prod.us-west-2.aws.redhat.com (Postfix) with ESMTP id EE45330001A1; Thu, 30 Oct 2025 15:22:12 +0000 (UTC) From: Stefan Hajnoczi To: qemu-devel@nongnu.org Cc: Hanna Czenczek , Paolo Bonzini , hibriansong@gmail.com, eblake@redhat.com, Stefan Hajnoczi , Kevin Wolf , qemu-block@nongnu.org Subject: [RESEND PATCH v5 13/13] block/io_uring: use non-vectored read/write when possible Date: Thu, 30 Oct 2025 11:21:49 -0400 Message-ID: <20251030152150.470170-14-stefanha@redhat.com> In-Reply-To: <20251030152150.470170-1-stefanha@redhat.com> References: <20251030152150.470170-1-stefanha@redhat.com> MIME-Version: 1.0 Content-Transfer-Encoding: 8bit X-Scanned-By: MIMEDefang 3.4.1 on 10.30.177.4 Received-SPF: pass client-ip=170.10.129.124; envelope-from=stefanha@redhat.com; helo=us-smtp-delivery-124.mimecast.com X-Spam_score_int: -20 X-Spam_score: -2.1 X-Spam_bar: -- X-Spam_report: (-2.1 / 5.0 requ) BAYES_00=-1.9, DKIMWL_WL_HIGH=-0.001, DKIM_SIGNED=0.1, DKIM_VALID=-0.1, DKIM_VALID_AU=-0.1, DKIM_VALID_EF=-0.1, RCVD_IN_DNSWL_NONE=-0.0001, RCVD_IN_MSPIKE_H3=0.001, RCVD_IN_MSPIKE_WL=0.001, RCVD_IN_VALIDITY_RPBL_BLOCKED=0.001, RCVD_IN_VALIDITY_SAFE_BLOCKED=0.001, SPF_HELO_PASS=-0.001, SPF_PASS=-0.001 autolearn=ham autolearn_force=no X-Spam_action: no action X-BeenThere: qemu-devel@nongnu.org X-Mailman-Version: 2.1.29 Precedence: list List-Id: List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: qemu-devel-bounces+qemu-devel=archiver.kernel.org@nongnu.org Sender: qemu-devel-bounces+qemu-devel=archiver.kernel.org@nongnu.org The io_uring_prep_readv2/writev2() man pages recommend using the non-vectored read/write operations when possible for performance reasons. I didn't measure a significant difference but it doesn't hurt to have this optimization in place. Suggested-by: Eric Blake Signed-off-by: Stefan Hajnoczi --- v5: - Reduce #ifdef HAVE_IO_URING_PREP_WRITEV2 code duplication [Kevin] --- block/io_uring.c | 34 ++++++++++++++++++++++++++-------- 1 file changed, 26 insertions(+), 8 deletions(-) diff --git a/block/io_uring.c b/block/io_uring.c index dd930ee57e..f1514cf024 100644 --- a/block/io_uring.c +++ b/block/io_uring.c @@ -46,17 +46,28 @@ static void luring_prep_sqe(struct io_uring_sqe *sqe, void *opaque) switch (req->type) { case QEMU_AIO_WRITE: -#ifdef HAVE_IO_URING_PREP_WRITEV2 { int luring_flags = (flags & BDRV_REQ_FUA) ? RWF_DSYNC : 0; - io_uring_prep_writev2(sqe, fd, qiov->iov, - qiov->niov, offset, luring_flags); - } + if (luring_flags != 0 || qiov->niov > 1) { +#ifdef HAVE_IO_URING_PREP_WRITEV2 + io_uring_prep_writev2(sqe, fd, qiov->iov, + qiov->niov, offset, luring_flags); #else - assert(flags == 0); - io_uring_prep_writev(sqe, fd, qiov->iov, qiov->niov, offset); + /* + * FUA should only be enabled with HAVE_IO_URING_PREP_WRITEV2, see + * luring_has_fua(). + */ + assert(luring_flags == 0); + + io_uring_prep_writev(sqe, fd, qiov->iov, qiov->niov, offset); #endif + } else { + /* The man page says non-vectored is faster than vectored */ + struct iovec *iov = qiov->iov; + io_uring_prep_write(sqe, fd, iov->iov_base, iov->iov_len, offset); + } break; + } case QEMU_AIO_ZONE_APPEND: io_uring_prep_writev(sqe, fd, qiov->iov, qiov->niov, offset); break; @@ -65,8 +76,15 @@ static void luring_prep_sqe(struct io_uring_sqe *sqe, void *opaque) if (req->resubmit_qiov.iov != NULL) { qiov = &req->resubmit_qiov; } - io_uring_prep_readv(sqe, fd, qiov->iov, qiov->niov, - offset + req->total_read); + if (qiov->niov > 1) { + io_uring_prep_readv(sqe, fd, qiov->iov, qiov->niov, + offset + req->total_read); + } else { + /* The man page says non-vectored is faster than vectored */ + struct iovec *iov = qiov->iov; + io_uring_prep_read(sqe, fd, iov->iov_base, iov->iov_len, + offset + req->total_read); + } break; } case QEMU_AIO_FLUSH: -- 2.51.0