From mboxrd@z Thu Jan 1 00:00:00 1970 From: Thomas Monjalon Subject: Re: [PATCH v8 1/3] eal/x86: run-time dispatch over memcpy Date: Wed, 25 Oct 2017 11:00:00 +0200 Message-ID: <4158692.mhjs8xbxgm@xps> References: <1507206794-79941-1-git-send-email-xiaoyun.li@intel.com> <2601191342CEEE43887BDE71AB9772585FAAEB8E@IRSMSX103.ger.corp.intel.com> Mime-Version: 1.0 Content-Type: text/plain; charset="us-ascii" Content-Transfer-Encoding: 7Bit Cc: "Ananyev, Konstantin" , "Richardson, Bruce" , dev@dpdk.org, "Lu, Wenzhuo" , "Zhang, Helin" , "ophirmu@mellanox.com" To: "Li, Xiaoyun" Return-path: Received: from out4-smtp.messagingengine.com (out4-smtp.messagingengine.com [66.111.4.28]) by dpdk.org (Postfix) with ESMTP id 0CEFF1B294 for ; Wed, 25 Oct 2017 11:00:01 +0200 (CEST) In-Reply-To: List-Id: DPDK patches and discussions List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: dev-bounces@dpdk.org Sender: "dev" 25/10/2017 10:54, Li, Xiaoyun: > > > > > > I think we should focus on micro-benchmark and find a > > > > > > reasonnable threshold for a reasonnable drop tradeoff. > > > > > > > > > > > Sadly, it may not be that simple. What shows best performance for > > > > > micro- benchmarks may not show the same effect in a real application. > > > > > > > > > > /Bruce > > > > > > > > Then how to measure the performance? > > > > > > > > And I cannot reproduce 15% drop on mellanox. > > > > Could the person who tested 15% drop help to do test again with 1024 > > > > threshold and see if there is any improvement? > > > > > > As Bruce said, best performance on micro-benchmark may not show the > > same effect in real applications. > > > And I cannot reproduce the 15% drop. > > > And I don't know if raising the threshold can improve the perf or not. > > > Could the person who tested 15% drop help to do test again with 1024 > > threshold and see if there is any improvement? > > > > As I already asked before - why not to make that threshold dynamic? > > Konstantin > > > I want to confirm that raising threshold is useful. Then can make it dynamic and set it very large as default. You can confirm it with micro-benchmarks.