From mboxrd@z Thu Jan  1 00:00:00 1970
Return-Path: <linux-kernel-owner@vger.kernel.org>
Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand
	id S1751836AbbJJJF4 (ORCPT <rfc822;w@1wt.eu>);
	Sat, 10 Oct 2015 05:05:56 -0400
Received: from mail-wi0-f177.google.com ([209.85.212.177]:35717 "EHLO
	mail-wi0-f177.google.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org
	with ESMTP id S1751406AbbJJJFu (ORCPT
	<rfc822;linux-kernel@vger.kernel.org>);
	Sat, 10 Oct 2015 05:05:50 -0400
Date: Sat, 10 Oct 2015 11:05:46 +0200
From: Ingo Molnar <mingo@kernel.org>
To: Andy Lutomirski <luto@amacapital.net>
Cc: Andy Lutomirski <luto@kernel.org>, X86 ML <x86@kernel.org>,
        "linux-kernel@vger.kernel.org" <linux-kernel@vger.kernel.org>,
        Brian Gerst <brgerst@gmail.com>, Denys Vlasenko <dvlasenk@redhat.com>,
        Linus Torvalds <torvalds@linux-foundation.org>,
        Borislav Petkov <bp@alien8.de>,
        Arnaldo Carvalho de Melo <acme@infradead.org>,
        Jiri Olsa <jolsa@redhat.com>
Subject: Re: [PATCH v2 32/36] x86/entry: Micro-optimize compat fast syscall
 arg fetch
Message-ID: <20151010090546.GA20819@gmail.com>
References: <cover.1444091584.git.luto@kernel.org>
 <bdff034e2f23c5eb974c760cf494cb5bddce8f29.1444091585.git.luto@kernel.org>
 <20151009073206.GB31859@gmail.com>
 <CALCETrUmNtXZQZQEe=pxqX0bGS2sf15YXfChNMLigmZgzytc+w@mail.gmail.com>
MIME-Version: 1.0
Content-Type: text/plain; charset=us-ascii
Content-Disposition: inline
In-Reply-To: <CALCETrUmNtXZQZQEe=pxqX0bGS2sf15YXfChNMLigmZgzytc+w@mail.gmail.com>
User-Agent: Mutt/1.5.23 (2014-03-12)
Sender: linux-kernel-owner@vger.kernel.org
List-ID: <linux-kernel.vger.kernel.org>
X-Mailing-List: linux-kernel@vger.kernel.org


* Andy Lutomirski <luto@amacapital.net> wrote:

> On Fri, Oct 9, 2015 at 12:32 AM, Ingo Molnar <mingo@kernel.org> wrote:
> >
> > * Andy Lutomirski <luto@kernel.org> wrote:
> >
> >> we're following a 32-bit pointer, and the uaccess code isn't smart
> >> enough to figure out that the access_ok check isn't needed.
> >>
> >> This saves about three cycles on a cache-hot fast syscall.
> >
> > Another request: could you please stick the benchmarking code of the various x86
> > system call variants into 'perf bench' - under tools/perf/bench/, so that
> > measurements can be done on more hardware and can be reproduced easily?
> >
> > I'd suggest we dedicate an entirely new benchmark family to it: 'perf bench x86'
> > and then have:
> >
> >    perf bench x86 syscall vdso
> >    perf bench x86 syscall int80
> >    perf bench x86 syscall vdso-compat
> 
> I'll play with this.  I'm not too familiar with the perf bench stuff.

So the perf bench stuff is meant to be a familiar home to kernel developers we'd 
like to slap a micro (or macro) benchmark into an easy to modify place.

Over the years it has gathered a number of benchmarks - but more are always 
welcome.

Just copy one of the existing benchmark modules (the tools/perf/bench/numa.c one 
is the most advanced one, tools/perf/bench/sched-pipe.c is the simplest one) and 
off you go.

Here's a commit that adds a new benchmark suite:

  a043971141f1 ("perf bench: Add futex-hash microbenchmark")

There are no big restrictions on the benchmarks: just put your existing code in 
that produces stdout output and it will be likely very close to upstream 
acceptable.

Can help should you get stuck anywhere.

Thanks,

	Ingo