From: Alexandra Winter
To: "D. Wythe", Dust Li, Sidraya Jayagond, Wenjia Zhang, David Miller,
	Jakub Kicinski, Paolo Abeni, Eric Dumazet, Andrew Lunn
Cc: Julian Ruess, Aswin Karuvally, Halil Pasic, Mahanta Jambigi, Tony Lu,
	Wen Gu, linux-rdma@vger.kernel.org, netdev@vger.kernel.org,
	linux-s390@vger.kernel.org, Heiko Carstens, Vasily Gorbik,
	Alexander Gordeev, Christian Borntraeger, Sven Schnelle, Simon Horman,
	Eric Biggers, Ard Biesheuvel, Herbert Xu, Harald Freudenberger,
	Konstantin Shkolnyy
Subject: [PATCH net-next v2 00/14] dibs - Direct Internal Buffer Sharing
Date: Thu, 11 Sep 2025 21:48:13 +0200
Message-ID: <20250911194827.844125-1-wintera@linux.ibm.com>
X-Mailer: git-send-email 2.48.1
X-Mailing-List: linux-s390@vger.kernel.org
MIME-Version: 1.0
Content-Transfer-Encoding: 8bit

This series introduces a generic abstraction of existing components like:
- the s390 specific ISM device (Internal Shared Memory),
- the SMC-D loopback mechanism (Shared Memory Communication - Direct)
- the client interface of the SMC-D module to the transport devices
This generic shim layer can be extended with more devices, more clients and
more features in the future.

This layer is called 'dibs' for Direct Internal Buffer Sharing based on the
common scheme that these mechanisms enable controlled sharing of memory
buffers within some containing entity such as a hypervisor or a Linux
instance.

Benefits:
- Cleaner separation of ISM and SMC-D functionality
- Simpler and fewer module dependencies
- Clear interface definition
- Extendable for future devices and clients

An overview was given at the Netdev 0x19 conference; recordings and slides
are available [1].

Background / Status quo:
------------------------
Currently s390 hardware provides virtual PCI ISM devices (Internal Shared
Memory). Their driver is in drivers/s390/net/ism_drv.c. The main user is
SMC-D (net/smc). The ism driver offers a client interface so other
users/protocols can also use them, but it is still heavily intermingled
with the smc code.
Namely, the ism module cannot be used without the smc module, which feels
artificial.

There is ongoing work to extend the ISM concept of shared buffers that can
be accessed directly by another instance on the same hardware:
[2] proposed a loopback interface (ism_lo) that can be used on non-s390
architectures (e.g. between containers or to test SMC-D). A minimal
implementation went upstream with [3]: ism_lo currently is a part of the
smc protocol and rather hidden.

[4] proposed a virtio definition of ism (ism_virtio) that can be used
between kvm guests.

We will shortly send an RFC for a dibs client that uses dibs as transport
for TTY.

Concept:
--------
Create a shim layer in drivers/dibs that contains common definitions and
code for all dibs devices and all dibs clients. Any device or client module
only needs to depend on this dibs layer module and any device or client
code only needs to include the definitions in include/linux/dibs.h.

The name dibs was chosen to clearly distinguish it from the existing s390
ism devices, and to emphasize that it is not about sharing whole memory
regions with anybody, but about dedicating single buffers to another
system.

Implementation:
---------------
The end result of this series is:
A dibs shim layer with
- One dibs client: smc-d
- Two dibs device drivers: ism and dibs-loopback
- Everything prepared to add more clients and more device drivers.

Patches 1-2 address some issues that were found along the way. They make
sense on their own, but also enable a better structured dibs series.

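To illustrate the shape of such a shim layer (one registry that device
drivers and clients both talk to, so that neither depends on the other),
here is a minimal user-space sketch. It is not the code from this series
and not the interface in include/linux/dibs.h: all shim_* and demo_* names,
the fixed-size tables and the callback signatures are assumptions made up
for illustration. Only the add_dev()/del_dev() callback names are borrowed
from this cover letter. Locking and error reporting are left out.

```c
#include <stddef.h>

#define SHIM_MAX 4	/* arbitrary table size for this sketch */

struct shim_dev;

/* Callbacks a client hands to the shim layer (hypothetical signatures). */
struct shim_client_ops {
	void (*add_dev)(struct shim_dev *dev);
	void (*del_dev)(struct shim_dev *dev);
};

struct shim_client {
	const char *name;
	const struct shim_client_ops *ops;
};

/* A device as the shim sees it; a real driver would also supply dev ops. */
struct shim_dev {
	const char *name;
};

static struct shim_client *shim_clients[SHIM_MAX];
static struct shim_dev *shim_devs[SHIM_MAX];

/* Register a client and tell it about every device that already exists. */
int shim_register_client(struct shim_client *client)
{
	if (!client || !client->ops)
		return -1;
	for (size_t i = 0; i < SHIM_MAX; i++) {
		if (shim_clients[i])
			continue;
		shim_clients[i] = client;
		for (size_t j = 0; j < SHIM_MAX; j++)
			if (shim_devs[j])
				client->ops->add_dev(shim_devs[j]);
		return 0;
	}
	return -1;	/* table full */
}

/* Add a device and announce it to every registered client. */
int shim_add_dev(struct shim_dev *dev)
{
	if (!dev)
		return -1;
	for (size_t i = 0; i < SHIM_MAX; i++) {
		if (shim_devs[i])
			continue;
		shim_devs[i] = dev;
		for (size_t j = 0; j < SHIM_MAX; j++)
			if (shim_clients[j])
				shim_clients[j]->ops->add_dev(dev);
		return 0;
	}
	return -1;	/* table full */
}

/* Remove a device after telling every client to let go of it. */
int shim_del_dev(struct shim_dev *dev)
{
	if (!dev)
		return -1;
	for (size_t i = 0; i < SHIM_MAX; i++) {
		if (shim_devs[i] != dev)
			continue;
		for (size_t j = 0; j < SHIM_MAX; j++)
			if (shim_clients[j])
				shim_clients[j]->ops->del_dev(dev);
		shim_devs[i] = NULL;
		return 0;
	}
	return -1;	/* unknown device */
}

/* Example client that only counts the devices it currently knows about. */
int demo_dev_count;

static void demo_add_dev(struct shim_dev *dev)
{
	(void)dev;
	demo_dev_count++;
}

static void demo_del_dev(struct shim_dev *dev)
{
	(void)dev;
	demo_dev_count--;
}

static const struct shim_client_ops demo_ops = {
	.add_dev = demo_add_dev,
	.del_dev = demo_del_dev,
};

struct shim_client demo_client = { .name = "demo", .ops = &demo_ops };
```

In this sketch the loops over the client table live in the shim, so a
device only calls shim_add_dev()/shim_del_dev() and a client only calls
shim_register_client(); the registration order of devices and clients does
not matter.
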
There are three components that exist today:
a) smc module (especially SMC-D functionality, which is an ism client today)
b) ism device driver (supports multiple ism clients today)
c) smc-loopback (integrated with smc today)
In order to preserve existing functionality at each step, these are not
moved to the dibs layer by component. Instead:
- the dibs layer is established in parallel to existing code [patches 3-6]
- then some service functions are moved to the dibs layer [patches 7-12]
- the actual data movement is moved to the dibs layer [patch 13]
- and last, event handling is moved to the dibs layer [patch 14]

Future:
-------
Items that are not part of this patchset but can be added later:
- dynamically add or remove dibs_loopback. That will allow for simple
  testing of add_dev()/del_dev()
- handle_irq(): Call clients without interrupt context, e.g. using
  threaded interrupts. I left this for a follow-on, because it includes
  conceptual changes for the smcd receive code.
- Any improvements of locking scopes. I mainly moved some of the existing
  locks to the dibs layer. I have the feeling there is room for
  improvements.
- The device drivers should not loop through the client array
- dibs_dev_ops.*_dmb() functions reveal unnecessary details of the
  internal dmb struct to the clients
- Check whether client calls to dibs_dev_ops should be replaced by
  interface functions that provide additional value
- Check whether device driver calls to dibs_client_ops should be replaced
  by interface functions that provide additional value.

Link: [1] https://netdevconf.info/0x19/sessions/talk/communication-via-internal-shared-memory-ism-time-to-open-up.html
Link: [2] https://lore.kernel.org/netdev/1695568613-125057-1-git-send-email-guwen@linux.alibaba.com/
Link: [3] https://lore.kernel.org/linux-kernel//20240428060738.60843-1-guwen@linux.alibaba.com/
Link: [4] https://groups.oasis-open.org/communities/community-home/digestviewer/viewthread?GroupId=3973&MessageKey=c060ecf9-ea1a-49a2-9827-c92f0e6447b2&CommunityKey=2f26be99-3aa1-48f6-93a5-018dce262226&hlmlt=VT
---
Changes in v2:
- More fixes of transient scope of IS_ENABLED(CONFIG_ISM) [7,13] (patchwork)
- Delete obsolete net/smc/smc_loopback.* files [13] (Dust Li)
- Fix transient usage of supports_v2() after rebase on top of 091d019adce0
  ("net/smc: remove unused function smc_lo_supports_v2") [7]
- Fix CC according to get_maintainer.pl (patchwork)
- Place dibs level code in drivers/dibs/ instead of net/dibs/
  (Dust Li, Julian Ruess)

Changes in v1:
- Don't change __init in smc_core_init() (was [1]) (Dust Li)
- Split off log message improvements from this series (was [2,5]) (Dust Li)
- Fix arch/s390/[debug_]defconfig [4,6,7,14]
- Helptext of dibs/Kconfig [3,6]
- include linux/slab.h to avoid -Wimplicit-function-declaration of
  kzalloc() and kfree() [5,6] (Simon Horman and 'kernel test robot')
- Fix transient use of undefined 'ism' pointer [7] (Simon Horman)
- Fix transient scope of IS_ENABLED(CONFIG_ISM) [7,13] (Simon Horman)
- Change position of is_attached in struct smc_buf_desc to reduce gaps [1]
  (Dust Li)
- Fix SW-pnetid handling for s390 ism devices [9]
- use dibs->dev instead of defining dibs_get_dev() [8]
- add const to uuid_t* parameters [12,14] (Julian Ruess)
- no log message for module load/unload [3] (Jakub Kicinski)
- Link: https://lore.kernel.org/netdev/20250905145428.1962105-1-wintera@linux.ibm.com/

RFC:
- Link: https://lore.kernel.org/netdev/20250806154122.3413330-1-wintera@linux.ibm.com/

Alexandra Winter (11):
  net/smc: Remove error handling of unregister_dmb()
  net/smc: Decouple sf and attached send_buf in smc_loopback
  dibs: Create drivers/dibs
  dibs: Register smc as dibs_client
  dibs: Register ism as dibs device
  dibs: Define dibs loopback
  dibs: Define dibs_client_ops and dibs_dev_ops
  dibs: Local gid for dibs devices
  dibs: Move vlan support to dibs_dev_ops
  dibs: Move query_remote_gid() to dibs_dev_ops
  dibs: Move data path to dibs layer

Julian Ruess (3):
  dibs: Move struct device to dibs_dev
  dibs: Create class dibs
  dibs: Move event handling to dibs layer

 MAINTAINERS                       |   9 +-
 arch/s390/configs/debug_defconfig |   4 +-
 arch/s390/configs/defconfig       |   4 +-
 drivers/Makefile                  |   1 +
 drivers/dibs/Kconfig              |  23 ++
 drivers/dibs/Makefile             |   8 +
 drivers/dibs/dibs_loopback.c      | 356 +++++++++++++++++++
 drivers/dibs/dibs_loopback.h      |  57 +++
 drivers/dibs/dibs_main.c          | 278 +++++++++++++++
 drivers/s390/net/Kconfig          |   3 +-
 drivers/s390/net/ism.h            |  53 ++-
 drivers/s390/net/ism_drv.c        | 573 +++++++++++------------------
 include/linux/dibs.h              | 464 ++++++++++++++++++++++++
 include/linux/ism.h               |  28 +-
 include/net/smc.h                 |  51 +--
 net/Kconfig                       |   1 +
 net/smc/Kconfig                   |  16 +-
 net/smc/Makefile                  |   1 -
 net/smc/af_smc.c                  |  12 +-
 net/smc/smc_clc.c                 |   6 +-
 net/smc/smc_core.c                |   6 +-
 net/smc/smc_core.h                |   5 +
 net/smc/smc_diag.c                |   2 +-
 net/smc/smc_ism.c                 | 224 ++++++------
 net/smc/smc_ism.h                 |  36 +-
 net/smc/smc_loopback.c            | 421 ----------------------
 net/smc/smc_loopback.h            |  60 ----
 net/smc/smc_pnet.c                |  25 +-
 net/smc/smc_tx.c                  |   3 +
 29 files changed, 1645 insertions(+), 1085 deletions(-)
 create mode 100644 drivers/dibs/Kconfig
 create mode 100644 drivers/dibs/Makefile
 create mode 100644 drivers/dibs/dibs_loopback.c
 create mode 100644 drivers/dibs/dibs_loopback.h
 create mode 100644 drivers/dibs/dibs_main.c
 create mode 100644 include/linux/dibs.h
 delete mode 100644 net/smc/smc_loopback.c
 delete mode 100644 net/smc/smc_loopback.h

-- 
2.48.1