From: "Venkateswararao Jujjuri (JV)"
Subject: Re: [V9fs-developer] [RFC] [PATCH 6/7] [net/9p] Read and Write side zerocopy changes for 9P2000.L protocol.
Date: Wed, 09 Feb 2011 13:39:53 -0800
Message-ID: <4D5309A9.3070508@linux.vnet.ibm.com>
References: <1297063283-2180-1-git-send-email-jvrao@linux.vnet.ibm.com> <1297063283-2180-7-git-send-email-jvrao@linux.vnet.ibm.com> <4D4F97ED.9070401@linux.vnet.ibm.com> <4D5302A3.7000000@linux.vnet.ibm.com>
Cc: linux-fsdevel@vger.kernel.org, v9fs-developer@lists.sourceforge.net
To: Eric Van Hensbergen

On 2/9/2011 1:18 PM, Eric Van Hensbergen wrote:
> On Wed, Feb 9, 2011 at 3:09 PM, Venkateswararao Jujjuri (JV) wrote:
>> WRITE
>>
>> IO SIZE (bytes)   TOTAL SIZE   No ZC        ZC
>> 1                 1MB          22.4 KB/s    19.8 KB/s
>> 32                32MB         711 KB/s     633 KB/s
>> 64                64MB         1.4 MB/s     1.3 MB/s
>> 128               128MB        2.8 MB/s     2.6 MB/s
>> 256               256MB        5.6 MB/s     5.1 MB/s
>> 512               512MB        10.4 MB/s    10.2 MB/s
>> 1024              1GB          19.7 MB/s    20.4 MB/s
>> 2048              2GB          40.1 MB/s    43.7 MB/s
>> 4096              4GB          71.4 MB/s    73.1 MB/s
>>
>> READ
>>
>> IO SIZE (bytes)   TOTAL SIZE   No ZC        ZC
>> 1                 1MB          26.6 KB/s    23.1 KB/s
>> 32                32MB         783 KB/s     734 KB/s
>> 64                64MB         1.7 MB/s     1.5 MB/s
>> 128               128MB        3.4 MB/s     3.0 MB/s
>> 256               256MB        4.2 MB/s     5.9 MB/s
>> 512               512MB        6.9 MB/s     11.6 MB/s
>> 1024              1GB          23.3 MB/s    23.4 MB/s
>> 2048              2GB          42.5 MB/s    45.4 MB/s
>> 4096              4GB          67.4 MB/s    73.9 MB/s
>>
>> As you can see, the difference is marginal... but ZC improves as the IO size
>> increases. In the past we have seen tremendous improvements with different
>> msizes, mostly because zero copy makes it possible to ship bigger chunks of
>> data. It could also be my setup/box; even on the host I am getting similar
>> numbers.
>
> So the break-even point for write is around 512 and for read it is
> somewhere between 128 and 256 -- but I think there may be some
> justification then for not doing ZC for payloads of 128 or less.
> Interesting number, it's the same as ERRMAX :) These numbers will be
> different from system to system of course, but I imagine on a
> server-class machine the tradeoff size moves higher rather than lower
> (since the processor and caches are likely to be faster). How
> characteristic is the machine you tested it on, JV?

It is an HS21 blade, a two-socket quad-core Xeon with 4 GB of memory, doing
IO to the local disk. As I said, throughput on the host is also in the same
range... we could very well be capped by disk performance.
But I agree that if iosize + header size < 4k we can just use non-zero-copy.
I don't think it is going to swing the performance/complexity pendulum
either way... but given that we are going to allocate at least 4k buffers
anyway, it makes sense to use them if we can accommodate everything in
there. Something like the sketch below.
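To make that concrete, a minimal sketch of the kind of check I mean
(P9_BUF_SZ and P9_HDR_SZ are illustrative names here, not the actual
net/9p constants):

#include <stddef.h>

#define P9_BUF_SZ	4096	/* request buffer we allocate anyway */
#define P9_HDR_SZ	24	/* assumed TREAD/TWRITE header overhead */

/*
 * Illustrative only -- not the actual net/9p implementation.
 * Returns nonzero when the payload will not fit in the preallocated
 * buffer, i.e. when pinning user pages for zero copy is worth it.
 */
static int p9_use_zero_copy(size_t iosize)
{
	return iosize + P9_HDR_SZ >= P9_BUF_SZ;
}

That way the zero-copy path only kicks in once the payload can no longer
ride in the buffer we would be allocating regardless, which also lines up
with the break-even region in the numbers above.

- JV

>
> -eric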