From mboxrd@z Thu Jan 1 00:00:00 1970 Return-Path: X-Spam-Checker-Version: SpamAssassin 3.4.0 (2014-02-07) on aws-us-west-2-korg-lkml-1.web.codeaurora.org Received: from bombadil.infradead.org (bombadil.infradead.org [198.137.202.133]) (using TLSv1.2 with cipher ECDHE-RSA-AES256-GCM-SHA384 (256/256 bits)) (No client certificate requested) by smtp.lore.kernel.org (Postfix) with ESMTPS id 2EB6FC46467 for ; Tue, 3 Jan 2023 15:04:23 +0000 (UTC) DKIM-Signature: v=1; a=rsa-sha256; q=dns/txt; c=relaxed/relaxed; d=lists.infradead.org; s=bombadil.20210309; h=Sender: Content-Transfer-Encoding:Content-Type:List-Subscribe:List-Help:List-Post: List-Archive:List-Unsubscribe:List-Id:In-Reply-To:MIME-Version:References: Message-ID:Subject:Cc:To:From:Date:Reply-To:Content-ID:Content-Description: Resent-Date:Resent-From:Resent-Sender:Resent-To:Resent-Cc:Resent-Message-ID: List-Owner; bh=QwjEtW6Fo2Kpz0Pl54KQKJ0P0oxPBi4AVmduAUgsAEk=; b=l9nAF+DBZGeahF +xO4pRD9ovzeId3iZKkmpsGVnUFrVIjDOYwq4HmJNMVb/uXmN34k3r8FZ1ErQbGY4YdY04il82Q7R kXfjVVCWhaZU2j6IYMNBwp6sXFsAYETvyojtR2wwkNWFZ/FVR/+qvTRnP+lfFJIQYuOIiNvs4wRGp 4U2a1ik3tQ57kJbXrZVTBP1CDEM5yjLhsgsR7L5TL2ahu/ItufYY2v3DrvlRQnAZscqW0TsEMELR+ UC1XSRM+Fw9eLsystitoTV1raN9Le6+WIgIYhrsazwdHFPG1izNdceMyr/pUJ5Q7C1eqeRcrT+9JN KAFfp3VwB/QxUL2NyOUA==; Received: from localhost ([::1] helo=bombadil.infradead.org) by bombadil.infradead.org with esmtp (Exim 4.94.2 #2 (Red Hat Linux)) id 1pCioD-002EnI-VE; Tue, 03 Jan 2023 15:02:45 +0000 Received: from pandora.armlinux.org.uk ([2001:4d48:ad52:32c8:5054:ff:fe00:142]) by bombadil.infradead.org with esmtps (Exim 4.94.2 #2 (Red Hat Linux)) id 1pCeC0-000cNQ-PS for linux-arm-kernel@lists.infradead.org; Tue, 03 Jan 2023 10:06:58 +0000 DKIM-Signature: v=1; a=rsa-sha256; q=dns/txt; c=relaxed/relaxed; d=armlinux.org.uk; s=pandora-2019; h=Sender:In-Reply-To: Content-Transfer-Encoding:Content-Type:MIME-Version:References:Message-ID: Subject:Cc:To:From:Date:Reply-To:Content-ID:Content-Description:Resent-Date: 
Resent-From:Resent-Sender:Resent-To:Resent-Cc:Resent-Message-ID:List-Id: List-Help:List-Unsubscribe:List-Subscribe:List-Post:List-Owner:List-Archive; bh=4coSDhKU7mwRMnUBHyZf53rh/zHTShyJKZjrQJqxxZQ=; b=B12V0JeslL/XIkP4Z0lltsBBWc tZfAKB3JMY2U5lPpdVnMCkPvGczQgL1+b/2r2g+EqYedCqWcm8B0KryJnEicwXfHgzJY2masjoF5G Fkj2Sv1EeyWY8ckdQ5TCl7O8l3OP4iNffTqkUi6KUhBMCvW0FbKMQSnEDX27CEHgggGkBn/sFB4zc iBhIdqNwZ7QDp4Um4klM1db2XMzsCGv08GUDglexnqvEah9iKrNEu+TWfamME0iqX5IP9ARs9HRS+ n3CEWveF+U3eDsbkiovi2+ebb4u1rwhW1/okbMbLc9ywBwOLdm1Jyize/tnfpw16eebMsnaDUdpRS ZqBUObGQ==; Received: from shell.armlinux.org.uk ([fd8f:7570:feb6:1:5054:ff:fe00:4ec]:35908) by pandora.armlinux.org.uk with esmtpsa (TLS1.3) tls TLS_ECDHE_RSA_WITH_AES_256_GCM_SHA384 (Exim 4.94.2) (envelope-from ) id 1pCeBw-0005Bw-TV; Tue, 03 Jan 2023 10:06:44 +0000 Received: from linux by shell.armlinux.org.uk with local (Exim 4.94.2) (envelope-from ) id 1pCeBu-0001xj-1a; Tue, 03 Jan 2023 10:06:42 +0000 Date: Tue, 3 Jan 2023 10:06:42 +0000 From: "Russell King (Oracle)" To: Hui Tang Cc: linux-arm-kernel@lists.infradead.org, linux-kernel@vger.kernel.org, Jens Axboe Subject: Re: [bug-report] possible performance problem in ret_to_user_from_irq Message-ID: References: <7ecb8f3c-2aeb-a905-0d4a-aa768b9649b5@huawei.com> MIME-Version: 1.0 Content-Disposition: inline In-Reply-To: <7ecb8f3c-2aeb-a905-0d4a-aa768b9649b5@huawei.com> X-CRM114-Version: 20100106-BlameMichelson ( TRE 0.8.0 (BSD) ) MR-646709E3 X-CRM114-CacheID: sfid-20230103_020653_778556_ED8E4005 X-CRM114-Status: GOOD ( 16.03 ) X-BeenThere: linux-arm-kernel@lists.infradead.org X-Mailman-Version: 2.1.34 Precedence: list List-Id: List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Content-Type: text/plain; charset="utf-8" Content-Transfer-Encoding: base64 Sender: "linux-arm-kernel" Errors-To: linux-arm-kernel-bounces+linux-arm-kernel=archiver.kernel.org@lists.infradead.org T24gTW9uLCBEZWMgMjYsIDIwMjIgYXQgMDQ6NDU6MjBQTSArMDgwMCwgSHVpIFRhbmcgd3JvdGU6 
Cj4gaGkgZm9sa3MuCj4gCj4gSSBmb3VuZCBhIHBlcmZvcm1hbmNlIHByb2JsZW0gd2hpY2ggaXMg aW50cm9kdWNlZCBieSBjb21taXQKPiAzMmQ1OTc3M2RhMzggKCJhcm06IGFkZCBzdXBwb3J0IGZv ciBUSUZfTk9USUZZX1NJR05BTCIpLgo+IEFmdGVyIHRoZSBjb21taXQsICBhbnkgYml0IGluIHRo ZSByYW5nZSBvZiAwLi4xNSB3aWxsIGNhdXNlCj4gZG9fd29ya19wZW5kaW5nKCkgdG8gYmUgaW52 b2tlZC4gTW9yZSBmcmVxdWVudCBkb193b3JrX3BlbmRpbmcoKQo+IGludm9rZWQgcG9zc2libGUg cmVzdWx0IGluIHdvcnNlIHBlcmZvcm1hbmNlLgo+IAo+IFNvbWUgb2YgdGhlIHRlc3RzIEkndmUg ZG9uZe+8jCBhcyBmb2xsb3dzOgo+IGxtYmVuY2ggdGVzdAkJCWJhc2UJCXdpdGggcGF0Y2gKPiAu L2xhdF9jdHggLVAgMSAtcyAwICAyCQk3LjMxNjcJCTExLjA0Cj4gLi9sYXRfY3R4IC1QIDEgLXMg MTYgMiAgICAgICAgICA4LjA0NjcJCTE0LjUzNjcKPiAuL2xhdF9jdHggLVAgMSAtcyA2NCAyCQk3 Ljg2NjcJCTExLjQzCj4gLi9sYXRfY3R4IC1QIDEgLXMgMTYgMTYJCTE2LjQ3CQkxOC4zNjY3Cj4g Li9sYXRfcGlwZSAtUCAxCQkJMjguMTY3MQkJNDQuNzkwNAo+IAo+IGxpYk1pY3JvLTAuNC4xIHRl c3QJCWJhc2UJCXdpdGggcGF0Y2gKPiAuL2Nhc2NhZGVfY29uZCAtRSAtQyAyMDBcCj4gIC1MIC1T IC1XIC1OICJjX2NvbmRfMSIgLUkgMTAwCTI4Ni4zMzMzCTM1OAo+IAo+IFdoZW4gSSBhZGp1c3Qg dGVzdCBiaXQsIHRoZSBwZXJmb3JtYW5jZSBwcm9ibGVtIGdvbmUuCj4gLQltb3ZzCXIxLCByMSwg bHNsICMxNgo+ICsJbGRyCXIyLCA9I19USUZfV09SS19NQVNLCj4gKwl0c3QJcjEsIHIyCj4gCj4g RG9lcyBhbnlvbmUgaGF2ZSBhIGdvb2Qgc3VnZ2VzdGlvbiBmb3IgdGhpcyBwcm9ibGVtPwo+IHNo b3VsZCBqdXN0IHRlc3QgX1RJRl9XT1JLX01BU0ssIGFzIGJlZm9yZT8KCkkgdGhpbmsgaXQgc2hv dWxkIGJlIGZpbmUgLSBidXQgSSB3b3VsZCBzdWdnZXN0IHJlLW9yZ2FuaXNpbmcgdGhlClRJRiBk ZWZpbml0aW9ucyBzbyB0aGF0IHRob3NlIFRJRiBiaXRzIHRoYXQgc2hvdWxkbid0IHRyaWdnZXIK ZG9fd29ya19wZW5kaW5nIGFyZSBub3QgaW4gdGhlIGZpcnN0IDE2IGJpdHMuCgpOb3RlIHRoYXQg YWxsIGZvdXIgYml0cyBpbiBfVElGX1NZU0NBTExfV09SSyBuZWVkIHRvIHN0YXkgd2l0aGluCmFu IDgtYml0IGV2ZW4tYml0LWFsaWduZWQgcmFuZ2UsIHNvIHRoZSB2YWx1ZSBpcyBzdWl0YWJsZSBm b3IgYW4KaW1tZWRpYXRlIGFzc2VtYmx5IGNvbnN0YW50LgoKSSdkIHN1Z2dlc3QgbW92aW5nIHRo ZSBUSUYgZGVmaW5pdGlvbnMgZm9yIDIwIHRvIDE5LCBhbmQgNC4uNyB0bwoyMC4uMjMsIGFuZCB0 aGVuIDggdG8gNC4KClRoYW5rcy4KCi0tIApSTUsncyBQYXRjaCBzeXN0ZW06IGh0dHBzOi8vd3d3 
LmFybWxpbnV4Lm9yZy51ay9kZXZlbG9wZXIvcGF0Y2hlcy8KRlRUUCBpcyBoZXJlISA0ME1icHMg ZG93biAxME1icHMgdXAuIERlY2VudCBjb25uZWN0aXZpdHkgYXQgbGFzdCEKCl9fX19fX19fX19f X19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fCmxpbnV4LWFybS1rZXJuZWwgbWFp bGluZyBsaXN0CmxpbnV4LWFybS1rZXJuZWxAbGlzdHMuaW5mcmFkZWFkLm9yZwpodHRwOi8vbGlz dHMuaW5mcmFkZWFkLm9yZy9tYWlsbWFuL2xpc3RpbmZvL2xpbnV4LWFybS1rZXJuZWwK From mboxrd@z Thu Jan 1 00:00:00 1970 Return-Path: X-Spam-Checker-Version: SpamAssassin 3.4.0 (2014-02-07) on aws-us-west-2-korg-lkml-1.web.codeaurora.org Received: from vger.kernel.org (vger.kernel.org [23.128.96.18]) by smtp.lore.kernel.org (Postfix) with ESMTP id 436EAC4708E for ; Tue, 3 Jan 2023 10:07:07 +0000 (UTC) Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S236997AbjACKHE (ORCPT ); Tue, 3 Jan 2023 05:07:04 -0500 Received: from lindbergh.monkeyblade.net ([23.128.96.19]:41122 "EHLO lindbergh.monkeyblade.net" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S236812AbjACKGt (ORCPT ); Tue, 3 Jan 2023 05:06:49 -0500 Received: from pandora.armlinux.org.uk (pandora.armlinux.org.uk [IPv6:2001:4d48:ad52:32c8:5054:ff:fe00:142]) by lindbergh.monkeyblade.net (Postfix) with ESMTPS id 3096B12A for ; Tue, 3 Jan 2023 02:06:48 -0800 (PST) DKIM-Signature: v=1; a=rsa-sha256; q=dns/txt; c=relaxed/relaxed; d=armlinux.org.uk; s=pandora-2019; h=Sender:In-Reply-To: Content-Transfer-Encoding:Content-Type:MIME-Version:References:Message-ID: Subject:Cc:To:From:Date:Reply-To:Content-ID:Content-Description:Resent-Date: Resent-From:Resent-Sender:Resent-To:Resent-Cc:Resent-Message-ID:List-Id: List-Help:List-Unsubscribe:List-Subscribe:List-Post:List-Owner:List-Archive; bh=4coSDhKU7mwRMnUBHyZf53rh/zHTShyJKZjrQJqxxZQ=; b=B12V0JeslL/XIkP4Z0lltsBBWc tZfAKB3JMY2U5lPpdVnMCkPvGczQgL1+b/2r2g+EqYedCqWcm8B0KryJnEicwXfHgzJY2masjoF5G Fkj2Sv1EeyWY8ckdQ5TCl7O8l3OP4iNffTqkUi6KUhBMCvW0FbKMQSnEDX27CEHgggGkBn/sFB4zc iBhIdqNwZ7QDp4Um4klM1db2XMzsCGv08GUDglexnqvEah9iKrNEu+TWfamME0iqX5IP9ARs9HRS+ 
n3CEWveF+U3eDsbkiovi2+ebb4u1rwhW1/okbMbLc9ywBwOLdm1Jyize/tnfpw16eebMsnaDUdpRS ZqBUObGQ==; Received: from shell.armlinux.org.uk ([fd8f:7570:feb6:1:5054:ff:fe00:4ec]:35908) by pandora.armlinux.org.uk with esmtpsa (TLS1.3) tls TLS_ECDHE_RSA_WITH_AES_256_GCM_SHA384 (Exim 4.94.2) (envelope-from ) id 1pCeBw-0005Bw-TV; Tue, 03 Jan 2023 10:06:44 +0000 Received: from linux by shell.armlinux.org.uk with local (Exim 4.94.2) (envelope-from ) id 1pCeBu-0001xj-1a; Tue, 03 Jan 2023 10:06:42 +0000 Date: Tue, 3 Jan 2023 10:06:42 +0000 From: "Russell King (Oracle)" To: Hui Tang Cc: linux-arm-kernel@lists.infradead.org, linux-kernel@vger.kernel.org, Jens Axboe Subject: Re: [bug-report] possible performance problem in ret_to_user_from_irq Message-ID: References: <7ecb8f3c-2aeb-a905-0d4a-aa768b9649b5@huawei.com> MIME-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Disposition: inline Content-Transfer-Encoding: 8bit In-Reply-To: <7ecb8f3c-2aeb-a905-0d4a-aa768b9649b5@huawei.com> Sender: Russell King (Oracle) Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org

On Mon, Dec 26, 2022 at 04:45:20PM +0800, Hui Tang wrote:
> hi folks.
>
> I found a performance problem which is introduced by commit
> 32d59773da38 ("arm: add support for TIF_NOTIFY_SIGNAL").
> After the commit, any bit in the range of 0..15 will cause
> do_work_pending() to be invoked. More frequent do_work_pending()
> invoked possible result in worse performance.
>
> Some of the tests I've done, as follows:
> lmbench test                    base        with patch
> ./lat_ctx -P 1 -s 0 2           7.3167      11.04
> ./lat_ctx -P 1 -s 16 2          8.0467      14.5367
> ./lat_ctx -P 1 -s 64 2          7.8667      11.43
> ./lat_ctx -P 1 -s 16 16         16.47       18.3667
> ./lat_pipe -P 1                 28.1671     44.7904
>
> libMicro-0.4.1 test             base        with patch
> ./cascade_cond -E -C 200\
>  -L -S -W -N "c_cond_1" -I 100  286.3333    358
>
> When I adjust test bit, the performance problem gone.
> -       movs    r1, r1, lsl #16
> +       ldr     r2, =#_TIF_WORK_MASK
> +       tst     r1, r2
>
> Does anyone have a good suggestion for this problem?
> should just test _TIF_WORK_MASK, as before?

I think it should be fine - but I would suggest re-organising the
TIF definitions so that those TIF bits that shouldn't trigger
do_work_pending are not in the first 16 bits.

Note that all four bits in _TIF_SYSCALL_WORK need to stay within
an 8-bit even-bit-aligned range, so the value is suitable for an
immediate assembly constant.

I'd suggest moving the TIF definitions for 20 to 19, and 4..7 to
20..23, and then 8 to 4.

Thanks.

-- 
RMK's Patch system: https://www.armlinux.org.uk/developer/patches/
FTTP is here! 40Mbps down 10Mbps up. Decent connectivity at last!
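
For anyone reading this in the archive, here is a rough C sketch of the two
constraints discussed above: the "movs r1, r1, lsl #16" test fires on any of
bits 0..15, and a flag group is only usable as an immediate if it is an 8-bit
value rotated by an even amount. All function names (low16_test, mask_test,
arm_immediate_ok, bits, check_suggested_layout) are hypothetical and exist
only for illustration. The positions 4..7 and 20..23 come from the suggestion
above; the mask bits(0, 3) is an arbitrary illustrative mask, not the real
_TIF_WORK_MASK. This is a model of the bit arithmetic, not kernel code.

```c
#include <assert.h>
#include <stdbool.h>
#include <stdint.h>

/* Hypothetical helper: models "movs r1, r1, lsl #16". Shifting left by 16
 * discards bits 16..31, so the result is non-zero iff any of bits 0..15
 * is set, whatever those bits mean. */
static bool low16_test(uint32_t flags)
{
	return (uint32_t)(flags << 16) != 0;
}

/* Hypothetical helper: models "tst r1, r2" with the mask held in r2. */
static bool mask_test(uint32_t flags, uint32_t mask)
{
	return (flags & mask) != 0;
}

/* An A32 data-processing immediate is an 8-bit value rotated right by an
 * even amount. Rotating left by the same amount undoes the rotation. */
static bool arm_immediate_ok(uint32_t v)
{
	for (unsigned int rot = 0; rot < 32; rot += 2) {
		uint32_t undone = rot ? (v << rot) | (v >> (32 - rot)) : v;

		if (undone <= 0xff)
			return true;
	}
	return false;
}

/* Build a mask covering bits lo..hi inclusive (lo <= hi < 32). */
static uint32_t bits(unsigned int lo, unsigned int hi)
{
	return (UINT32_MAX >> (31 - hi)) & (UINT32_MAX << lo);
}

static void check_suggested_layout(void)
{
	uint32_t syscall_old = bits(4, 7);	/* position before the move */
	uint32_t syscall_new = bits(20, 23);	/* suggested position */

	/* A flag in bits 4..7 trips the shift test... */
	assert(low16_test(1u << 5));
	/* ...while an explicit mask that excludes it does not. */
	assert(!mask_test(1u << 5, bits(0, 3)));
	/* Once the flag lives in 20..23 the shift test ignores it too. */
	assert(!low16_test(1u << 21));
	/* Both positions remain encodable as an immediate. */
	assert(arm_immediate_ok(syscall_old));
	assert(arm_immediate_ok(syscall_new));
	/* An odd-aligned 8-bit field (bits 1..8) would not be. */
	assert(!arm_immediate_ok(bits(1, 8)));
}
```

Calling check_suggested_layout() from a small main() should pass silently if
the reasoning holds. The point of keeping the shift test is that it needs no
literal load, which is why moving the unrelated bits out of 0..15 may be
preferable to going back to a mask loaded with ldr.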