From mboxrd@z Thu Jan 1 00:00:00 1970 Return-Path: Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1755848AbcA3DgN (ORCPT ); Fri, 29 Jan 2016 22:36:13 -0500 Received: from shelob.surriel.com ([74.92.59.67]:51553 "EHLO shelob.surriel.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1753742AbcA3DgL (ORCPT ); Fri, 29 Jan 2016 22:36:11 -0500 From: riel@redhat.com To: linux-kernel@vger.kernel.org Cc: tglx@linutronix.de, mingo@kernel.org, luto@amacapital.net, fweisbec@gmail.com, peterz@infradead.org, clark@redhat.com Subject: [PATCH 3/4] time,acct: drop irq save & restore from __acct_update_integrals Date: Fri, 29 Jan 2016 22:36:04 -0500 Message-Id: <1454124965-13974-4-git-send-email-riel@redhat.com> X-Mailer: git-send-email 2.5.0 In-Reply-To: <1454124965-13974-1-git-send-email-riel@redhat.com> References: <1454124965-13974-1-git-send-email-riel@redhat.com> Sender: linux-kernel-owner@vger.kernel.org List-ID: X-Mailing-List: linux-kernel@vger.kernel.org From: Rik van Riel It looks like all the call paths that lead to __acct_update_integrals already have irqs disabled, and __acct_update_integrals does not need to disable irqs itself. This is very convenient since about half the CPU time left in this function was spent in local_irq_save alone. Performance of a microbenchmark that calls an invalid syscall ten million times in a row on a nohz_full CPU improves 21% vs. 4.5-rc1 with both the removal of divisions from __acct_update_integrals and this patch, with runtime dropping from 3.7 to 2.9 seconds. With these patches applied, the highest remaining cpu user in the trace is native_sched_clock, which is addressed in the next patch. Suggested-by: Peter Zijlstra Signed-off-by: Rik van Riel --- kernel/tsacct.c | 6 +----- 1 file changed, 1 insertion(+), 5 deletions(-) diff --git a/kernel/tsacct.c b/kernel/tsacct.c index 8908f8b1d26e..b2663d699a72 100644 --- a/kernel/tsacct.c +++ b/kernel/tsacct.c @@ -124,26 +124,22 @@ static void __acct_update_integrals(struct task_struct *tsk, cputime_t utime, cputime_t stime) { cputime_t time, dtime; - unsigned long flags; u64 delta; if (unlikely(!tsk->mm)) return; - local_irq_save(flags); time = stime + utime; dtime = time - tsk->acct_timexpd; delta = cputime_to_nsecs(dtime); if (delta < TICK_NSEC) - goto out; + return; tsk->acct_timexpd = time; /* The final unit will be Mbyte-usecs, see xacct_add_tsk */ tsk->acct_rss_mem1 += delta * get_mm_rss(tsk->mm) / 1024; tsk->acct_vm_mem1 += delta * tsk->mm->total_vm / 1024; -out: - local_irq_restore(flags); } /** -- 2.5.0