From mboxrd@z Thu Jan 1 00:00:00 1970 Return-Path: X-Spam-Checker-Version: SpamAssassin 3.4.0 (2014-02-07) on aws-us-west-2-korg-lkml-1.web.codeaurora.org Received: from kanga.kvack.org (kanga.kvack.org [205.233.56.17]) by smtp.lore.kernel.org (Postfix) with ESMTP id 6179EC7EE2D for ; Fri, 3 Mar 2023 16:39:51 +0000 (UTC) Received: by kanga.kvack.org (Postfix) id B5F466B0074; Fri, 3 Mar 2023 11:39:50 -0500 (EST) Received: by kanga.kvack.org (Postfix, from userid 40) id B0B246B0078; Fri, 3 Mar 2023 11:39:50 -0500 (EST) X-Delivered-To: int-list-linux-mm@kvack.org Received: by kanga.kvack.org (Postfix, from userid 63042) id 984D86B0074; Fri, 3 Mar 2023 11:39:50 -0500 (EST) X-Delivered-To: linux-mm@kvack.org Received: from relay.hostedemail.com (smtprelay0015.hostedemail.com [216.40.44.15]) by kanga.kvack.org (Postfix) with ESMTP id 893946B0072 for ; Fri, 3 Mar 2023 11:39:50 -0500 (EST) Received: from smtpin02.hostedemail.com (a10.router.float.18 [10.200.18.1]) by unirelay09.hostedemail.com (Postfix) with ESMTP id 5D8BC80263 for ; Fri, 3 Mar 2023 16:39:50 +0000 (UTC) X-FDA: 80528148540.02.940A3E4 Received: from us-smtp-delivery-124.mimecast.com (us-smtp-delivery-124.mimecast.com [170.10.133.124]) by imf01.hostedemail.com (Postfix) with ESMTP id 8C6264000F for ; Fri, 3 Mar 2023 16:39:47 +0000 (UTC) Authentication-Results: imf01.hostedemail.com; dkim=pass header.d=redhat.com header.s=mimecast20190719 header.b=JcXLDBnL; spf=pass (imf01.hostedemail.com: domain of mtosatti@redhat.com designates 170.10.133.124 as permitted sender) smtp.mailfrom=mtosatti@redhat.com; dmarc=pass (policy=none) header.from=redhat.com ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=hostedemail.com; s=arc-20220608; t=1677861587; h=from:from:sender:reply-to:subject:subject:date:date: message-id:message-id:to:to:cc:cc:mime-version:mime-version: content-type:content-type:content-transfer-encoding: in-reply-to:in-reply-to:references:references:dkim-signature; bh=A7qH7z8qcUef6NyJXfR2PL9084/lxkL2GnTDPhF4AIk=; b=Iy7bydqFLpUvj7vQgJH9kVls2H/BJegIg2hMR/3/BZatmp4Ddl7HkmRv6BaWbnD8sccJ/M dK2u0KWOzufck93J4SDhh7U5jsKNmUooFrompNZ0l4b12iEfdOVIZVqh+cXU5eOyUeeDmI i29UMImB1V9+PkS8Wt3T0XBycysZgd4= ARC-Authentication-Results: i=1; imf01.hostedemail.com; dkim=pass header.d=redhat.com header.s=mimecast20190719 header.b=JcXLDBnL; spf=pass (imf01.hostedemail.com: domain of mtosatti@redhat.com designates 170.10.133.124 as permitted sender) smtp.mailfrom=mtosatti@redhat.com; dmarc=pass (policy=none) header.from=redhat.com ARC-Seal: i=1; s=arc-20220608; d=hostedemail.com; t=1677861587; a=rsa-sha256; cv=none; b=A3PmTwfDeyaY3LCOvc8LFT1Ysxkz6i64dT8j3m1LXBofXkUPjFigTx2TUhNG0CkGwaZSou bypvo3ryFszMkeNbQZ9oigD3r67ODkl8dbhSTFuX5niVa66MvhTGuDJa6TO2ysLFo0qSAP BzQPdtAdNI2jD26cQ6QT/SIc1gjuoAI= DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=redhat.com; s=mimecast20190719; t=1677861586; h=from:from:reply-to:subject:subject:date:date:message-id:message-id: to:to:cc:cc:mime-version:mime-version:content-type:content-type: in-reply-to:in-reply-to:references:references; bh=A7qH7z8qcUef6NyJXfR2PL9084/lxkL2GnTDPhF4AIk=; b=JcXLDBnLii4LIJBfvhNW/xY/ri9H/QRNEtTDJdc+BTtoPnmjDpP4DQSBO4ov9ts07u0YYg idIgaxMtMg58A8RVmvjVM44yhkaE+s7QO84lJanfrvEUkltki4GQonAiXKX6S71ydeRZHe UeG8IkHefgTQt/kZNK4u/ZbMbgmtgwE= Received: from mimecast-mx02.redhat.com (mimecast-mx02.redhat.com [66.187.233.88]) by relay.mimecast.com with ESMTP with STARTTLS (version=TLSv1.2, cipher=TLS_ECDHE_RSA_WITH_AES_256_GCM_SHA384) id us-mta-618-H4SqEBn-Otq_xZ4Ygilg-A-1; Fri, 03 Mar 2023 11:39:43 -0500 X-MC-Unique: H4SqEBn-Otq_xZ4Ygilg-A-1 Received: from smtp.corp.redhat.com (int-mx06.intmail.prod.int.rdu2.redhat.com [10.11.54.6]) (using TLSv1.2 with cipher AECDH-AES256-SHA (256/256 bits)) (No client certificate requested) by mimecast-mx02.redhat.com (Postfix) with ESMTPS id 3AE8185CCE4; Fri, 3 Mar 2023 16:39:43 +0000 (UTC) Received: from tpad.localdomain (ovpn-112-2.gru2.redhat.com [10.97.112.2]) by smtp.corp.redhat.com (Postfix) with ESMTPS id D454C2166B2B; Fri, 3 Mar 2023 16:39:42 +0000 (UTC) Received: by tpad.localdomain (Postfix, from userid 1000) id B6B29401A0A1D; Fri, 3 Mar 2023 12:39:11 -0300 (-03) Date: Fri, 3 Mar 2023 12:39:11 -0300 From: Marcelo Tosatti To: Peter Xu Cc: Christoph Lameter , Aaron Tomlin , Frederic Weisbecker , Andrew Morton , linux-kernel@vger.kernel.org, linux-mm@kvack.org Subject: Re: [PATCH v2 02/11] this_cpu_cmpxchg: ARM64: switch this_cpu_cmpxchg to locked, add _local function Message-ID: References: <20230209150150.380060673@redhat.com> <20230209153204.683821550@redhat.com> MIME-Version: 1.0 Content-Type: text/plain; charset=us-ascii Content-Disposition: inline In-Reply-To: X-Scanned-By: MIMEDefang 3.1 on 10.11.54.6 X-Rspam-User: X-Rspamd-Server: rspam03 X-Stat-Signature: okj6h7wftymxmfh5e7tq3hqshpgocsph X-Rspamd-Queue-Id: 8C6264000F X-HE-Tag: 1677861587-994083 X-HE-Meta: U2FsdGVkX1+YTmBKNNXMqGNuy9udMOfosk/5/j1zkzqCblIp0GWQlRZacSukc1WuQPxaHhX8bZNQknZTMWdFYkkFO3ocojjKvPc9rgkUdmLtcG3Lp7qSj+rJ8UbSkbZyw4qGykznLrUZR0gC8Jt24qtbram7I8HUaPHh/8tCeFcIykaEtHmi+QXNFsWOmnKyR6CJYppZ7ghp+YUd6/irzt92D5FY0jHEkjHEdM2cBcD47MNsS1d7ayv7jWbSBB34PA4JYf4QtBfmqplgeYBMR0nzJzlNclwvtZOxFlF4KNaZitvSRKPh2dbZtOT/yyTpOyLdzm9qcKdT14QE5kM1BJeyxEYeFjn6rWOkLKoVEcWM7UAN5C/Hk6gB305SuN4rPU9fVECTMnmuwAfR81FrS7nyWDaD0XiCI8Axp5SYmXUxb0F+H2krxfK/WMx8fzo4QLUM0oD4qdE3nEAGhye46c2/VmZB0XJ0mRosAFAlunaQ2DRAeeF6vAR3Oj4gM5ngBXr+946rY9pgoWyOQgDriWMvl6ceoMCicwWejq3gGPeu3dkljvpAQrjE///a6VFPahw8UBFqwxyxkEH2XM347FQRCwC9NRC2a1L73B7YEIWI+Tz2e++z8fC5o487xrQJAJoMdyxzoVnePu48s4Qqjk3j4Z9+6Q53Mz58P70KE5uwIRiNhV0q0DcHlKzWqaHusCs5x7GW1zvikgcmTD/Ke1OmYTX8hsQjUU9XL3CtTsF0JbLAG0q6i0R6N+VGKc8z8a2sMm4VYr66PQrtuk4rGLcXzLZsrQlr9m/SKIDZ+pruDSOksMYWxIo5wSDA8feVAXhuhQvThF+wItJ60iMNILxCLXoYzYT9kJaXWiL3m7HG0oye3+8v/4AnUjeMBjjOjvcpbxNmbMRpRq+FyZetVUCqhHLP1zCZJ8cH3IIbSGe4vzbBazUKS4pw82CETTUAhoDelldcrOaRsDedBsp HEjx+rLI fTMaQ+jSWzGzAbJgN7XhNbc/2+fxhjzlnp2dih/QmOxIdV1xgj/boTJ+EFDc5efk7GVY/la2RwzIpc/zxkCQta708TEgrKOgAWv2cqdu0HnvO5iAcaTcI5IOxZanr7FpA4k51Eim9I+iKsNmnRHIcx1pxQtvryLUo7GwcSM/8H0vYFeupP8uPMDEuppe2oMscRTs/kGJr9mC68pXpB+Nz3irua00zNBsyVfMQgfVoVQUT164kUiynMzfARJfudAAcDjg0dHNLjm86tDNFasXZQVeFqk5mP0ksJYgT/xKMJlKvOECm5aLNH3mVXGorcURXs6xbSJTZgtE9jUU= X-Bogosity: Ham, tests=bogofilter, spamicity=0.000000, version=1.2.4 Sender: owner-linux-mm@kvack.org Precedence: bulk X-Loop: owner-majordomo@kvack.org List-ID: On Thu, Mar 02, 2023 at 04:25:08PM -0500, Peter Xu wrote: > On Thu, Mar 02, 2023 at 06:04:25PM -0300, Marcelo Tosatti wrote: > > On Thu, Mar 02, 2023 at 03:53:12PM -0500, Peter Xu wrote: > > > On Thu, Feb 09, 2023 at 12:01:52PM -0300, Marcelo Tosatti wrote: > > > > Goal is to have vmstat_shepherd to transfer from > > > > per-CPU counters to global counters remotely. For this, > > > > an atomic this_cpu_cmpxchg is necessary. > > > > > > > > Following the kernel convention for cmpxchg/cmpxchg_local, > > > > change ARM's this_cpu_cmpxchg_ helpers to be atomic, > > > > and add this_cpu_cmpxchg_local_ helpers which are not atomic. > > > > > > I can follow on the necessity of having the _local version, however two > > > questions below. > > > > > > > > > > > Signed-off-by: Marcelo Tosatti > > > > > > > > Index: linux-vmstat-remote/arch/arm64/include/asm/percpu.h > > > > =================================================================== > > > > --- linux-vmstat-remote.orig/arch/arm64/include/asm/percpu.h > > > > +++ linux-vmstat-remote/arch/arm64/include/asm/percpu.h > > > > @@ -232,13 +232,23 @@ PERCPU_RET_OP(add, add, ldadd) > > > > _pcp_protect_return(xchg_relaxed, pcp, val) > > > > > > > > #define this_cpu_cmpxchg_1(pcp, o, n) \ > > > > - _pcp_protect_return(cmpxchg_relaxed, pcp, o, n) > > > > + _pcp_protect_return(cmpxchg, pcp, o, n) > > > > #define this_cpu_cmpxchg_2(pcp, o, n) \ > > > > - _pcp_protect_return(cmpxchg_relaxed, pcp, o, n) > > > > + _pcp_protect_return(cmpxchg, pcp, o, n) > > > > #define this_cpu_cmpxchg_4(pcp, o, n) \ > > > > - _pcp_protect_return(cmpxchg_relaxed, pcp, o, n) > > > > + _pcp_protect_return(cmpxchg, pcp, o, n) > > > > #define this_cpu_cmpxchg_8(pcp, o, n) \ > > > > + _pcp_protect_return(cmpxchg, pcp, o, n) > > > > > > This makes this_cpu_cmpxchg_*() not only non-local, but also (especially > > > for arm64) memory barrier implications since cmpxchg() has a strong memory > > > barrier, while the old this_cpu_cmpxchg*() doesn't have, afaiu. > > > > > > Maybe it's not a big deal if the audience of this helper is still limited > > > (e.g. we can add memory barriers if we don't want strict ordering > > > implication), but just to check with you on whether it's intended, and if > > > so whether it may worth some comments. > > > > It happens that on ARM-64 cmpxchg_local == cmpxchg_relaxed. > > > > See cf10b79a7d88edc689479af989b3a88e9adf07ff. > > This is more or less a comment in general, rather than for arm only. > > Fundamentally starting from this patch it's redefining this_cpu_cmpxchg(). > What I meant is whether we should define it properly then implement the > arch patches with what is defined. > > We're adding non-local semantics into it, which is obvious to me. Which match the cmpxchg() function semantics. > We're (silently, in this patch for aarch64) adding memory barrier semantics > too, this is not obvious to me on whether all archs should implement this > api the same way. Documentation/atomic_t.txt says that _relaxed means "no barriers". So i'd assume: cmpxchg_relaxed: no additional barriers cmpxchg_local: only guarantees atomicity to wrt local CPU. cmpxchg: atomic in SMP context. https://lore.kernel.org/linux-arm-kernel/20180505103550.s7xsnto7tgppkmle@gmail.com/#r There seems to be a lack of clarity in documentation. > It will make a difference IMHO when the helpers are used in any other code > clips, because IIUC proper definition of memory barrier implications will > decide whether the callers need explicit barriers when ordering is required. Trying to limit the scope of changes to solve the problem at hand. More specifically what this patch does is: 1) Add this_cpu_cmpxchg_local, uses arch cmpxchg_local implementation to back it. 2) Add this_cpu_cmpxchg, uses arch cmpxchg implementation to back it. Note that now becomes consistent with cmpxchg and cmpxchg_local semantics. > > This patchset maintains the current behaviour > > of this_cpu_cmpxch (for this_cpu_cmpxch_local), which was: > > > > #define this_cpu_cmpxchg_1(pcp, o, n) \ > > - _pcp_protect_return(cmpxchg_relaxed, pcp, o, n) > > + _pcp_protect_return(cmpxchg, pcp, o, n) > > #define this_cpu_cmpxchg_2(pcp, o, n) \ > > - _pcp_protect_return(cmpxchg_relaxed, pcp, o, n) > > + _pcp_protect_return(cmpxchg, pcp, o, n) > > #define this_cpu_cmpxchg_4(pcp, o, n) \ > > - _pcp_protect_return(cmpxchg_relaxed, pcp, o, n) > > + _pcp_protect_return(cmpxchg, pcp, o, n) > > #define this_cpu_cmpxchg_8(pcp, o, n) \ > > + _pcp_protect_return(cmpxchg, pcp, o, n) > > > > > > + > > > > +#define this_cpu_cmpxchg_local_1(pcp, o, n) \ > > > > _pcp_protect_return(cmpxchg_relaxed, pcp, o, n) > > > > +#define this_cpu_cmpxchg_local_2(pcp, o, n) \ > > > > + _pcp_protect_return(cmpxchg_relaxed, pcp, o, n) > > > > +#define this_cpu_cmpxchg_local_4(pcp, o, n) \ > > > > + _pcp_protect_return(cmpxchg_relaxed, pcp, o, n) > > > > +#define this_cpu_cmpxchg_local_8(pcp, o, n) \ > > > > + _pcp_protect_return(cmpxchg_relaxed, pcp, o, n) > > > > > > I think cmpxchg_relaxed()==cmpxchg_local() here for aarch64, however should > > > we still use cmpxchg_local() to pair with this_cpu_cmpxchg_local_*()? > > > > Since cmpxchg_local = cmpxchg_relaxed, seems like this is not necessary. > > > > > Nothing about your patch along since it was the same before, but I'm > > > wondering whether this is a good time to switchover. > > > > I would say that another patch is more appropriate to change this, > > if desired. > > Sure on this one. Thanks, > > -- > Peter Xu > >