From mboxrd@z Thu Jan 1 00:00:00 1970 Return-Path: X-Spam-Checker-Version: SpamAssassin 3.4.0 (2014-02-07) on aws-us-west-2-korg-lkml-1.web.codeaurora.org Received: from vger.kernel.org (vger.kernel.org [23.128.96.18]) by smtp.lore.kernel.org (Postfix) with ESMTP id AD24AC7EE2E for ; Tue, 13 Jun 2023 02:28:32 +0000 (UTC) Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S237564AbjFMC2b (ORCPT ); Mon, 12 Jun 2023 22:28:31 -0400 Received: from lindbergh.monkeyblade.net ([23.128.96.19]:42744 "EHLO lindbergh.monkeyblade.net" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S239035AbjFMC20 (ORCPT ); Mon, 12 Jun 2023 22:28:26 -0400 Received: from szxga02-in.huawei.com (szxga02-in.huawei.com [45.249.212.188]) by lindbergh.monkeyblade.net (Postfix) with ESMTPS id 15C55199E; Mon, 12 Jun 2023 19:28:17 -0700 (PDT) Received: from kwepemm600003.china.huawei.com (unknown [172.30.72.56]) by szxga02-in.huawei.com (SkyGuard) with ESMTP id 4QgC7p48VDzLq8m; Tue, 13 Jun 2023 10:25:10 +0800 (CST) Received: from [10.67.111.205] (10.67.111.205) by kwepemm600003.china.huawei.com (7.193.23.202) with Microsoft SMTP Server (version=TLS1_2, cipher=TLS_ECDHE_RSA_WITH_AES_128_GCM_SHA256) id 15.1.2507.23; Tue, 13 Jun 2023 10:28:14 +0800 Subject: Re: [RFC] Adding support for setting the affinity of the recording process To: James Clark , Peter Zijlstra , Arnaldo Carvalho de Melo CC: Mark Rutland , Alexander Shishkin , Jiri Olsa , Namhyung Kim , Ian Rogers , Adrian Hunter , linux-perf-users , linux-kernel , Ingo Molnar References: <159de73b-fdd6-6df8-4f77-73c628fe641f@huawei.com> From: Yang Jihong Message-ID: Date: Tue, 13 Jun 2023 10:28:14 +0800 User-Agent: Mozilla/5.0 (Windows NT 10.0; Win64; x64; rv:78.0) Gecko/20100101 Thunderbird/78.6.1 MIME-Version: 1.0 In-Reply-To: Content-Type: text/plain; charset="utf-8"; format=flowed Content-Language: en-US Content-Transfer-Encoding: 8bit X-Originating-IP: [10.67.111.205] X-ClientProxiedBy: dggems701-chm.china.huawei.com (10.3.19.178) To kwepemm600003.china.huawei.com (7.193.23.202) X-CFilter-Loop: Reflected Precedence: bulk List-ID: X-Mailing-List: linux-perf-users@vger.kernel.org Hello, On 2023/6/12 23:27, James Clark wrote: > > > On 12/06/2023 11:26, Yang Jihong wrote: >> Hello everyone, >> >> Currently, perf-record supports profiling an existing process, thread, >> or a specified command. >> >> Sometimes we may need to set CPU affinity of the target process before >> recording: >> >>   # taskset -pc >>   # perf record -p -- sleep 10 >> >> or: >> >>   # perf record -- `taskset -c COMMAND` >> >> I'm thinking about getting perf to support setting the affinity of the >> recording process, for example: >> >> 1. set the CPU affinity of the process to , process >> to ,  and record: >> >>   # perf record -p /:/ -- sleep 10 >> > > I'm not sure if this is necessary. You can already do this with taskset > when you launch the processes or for existing ones. Yes, that's what we're doing now, and I'm thinking about whether perf can support this "taskset" feature. > >> and >> >> 2. set CPU affinity of the COMMAND and record: >> >>   # perf record --taskset-command COMMAND >> >> In doing so, perf, as an observer, actually changes some of the >> properties of the target process, which may be contrary to the purpose >> of perf tool. >> >> >> Will we consider accepting this approach? >> > > For #2 I do this sometimes, but I prefix the perf command with taskset > because otherwise there is a small time between when taskset does its > thing and launching the child process that it runs in the wrong place. > > Then one issue with the above method is that perf itself gets pinned to > those CPUs as well. I suppose that could influence your application but > I've never had an issue with it. > > If you really can't live with perf also being pinned to those CPUs I > would say it makes sense to add options for #2. Otherwise I would just > run everything under taskset and no changes are needed. If "perf" process and the target process are pinned to the same CPU, and the CPU usage of the target process is high, the perf data collection may be affected. Therefore, in this case, we may need to pin the target process and "perf" to different CPUs. > > I think you would still need to have perf itself pinned to the CPUs just > before it does the fork and exec, and then after that it can undo its > pinning. Otherwise you'd still get that small time running on the wrong > cores. > Thanks for your advice, or we can support setting different affinities for the "perf" process and the target process. Thanks, Yang.