From mboxrd@z Thu Jan 1 00:00:00 1970 Received: from eggs.gnu.org ([2001:4830:134:3::10]:48732) by lists.gnu.org with esmtp (Exim 4.71) (envelope-from ) id 1XEbXj-00007G-Io for qemu-devel@nongnu.org; Tue, 05 Aug 2014 06:00:37 -0400 Received: from Debian-exim by eggs.gnu.org with spam-scanned (Exim 4.71) (envelope-from ) id 1XEbXc-0001Zk-71 for qemu-devel@nongnu.org; Tue, 05 Aug 2014 06:00:31 -0400 Received: from youngberry.canonical.com ([91.189.89.112]:45478) by eggs.gnu.org with esmtp (Exim 4.71) (envelope-from ) id 1XEbXc-0001Za-1S for qemu-devel@nongnu.org; Tue, 05 Aug 2014 06:00:24 -0400 Received: from mail-vc0-f177.google.com ([209.85.220.177]) by youngberry.canonical.com with esmtpsa (TLS1.0:RSA_ARCFOUR_SHA1:16) (Exim 4.71) (envelope-from ) id 1XEbXb-0005rE-5u for qemu-devel@nongnu.org; Tue, 05 Aug 2014 10:00:23 +0000 Received: by mail-vc0-f177.google.com with SMTP id hy4so1020568vcb.22 for ; Tue, 05 Aug 2014 03:00:22 -0700 (PDT) MIME-Version: 1.0 In-Reply-To: <20140805094844.GF4391@noname.str.redhat.com> References: <1407209598-2572-1-git-send-email-ming.lei@canonical.com> <20140805094844.GF4391@noname.str.redhat.com> Date: Tue, 5 Aug 2014 18:00:22 +0800 Message-ID: From: Ming Lei Content-Type: text/plain; charset=UTF-8 Subject: Re: [Qemu-devel] [PATCH v1 00/17] dataplane: optimization and multi virtqueue support List-Id: List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , To: Kevin Wolf Cc: Peter Maydell , Fam Zheng , "Michael S. Tsirkin" , qemu-devel , Stefan Hajnoczi , Paolo Bonzini On Tue, Aug 5, 2014 at 5:48 PM, Kevin Wolf wrote: > Am 05.08.2014 um 05:33 hat Ming Lei geschrieben: >> Hi, >> >> These patches bring up below 4 changes: >> - introduce object allocation pool and apply it to >> virtio-blk dataplane for improving its performance >> >> - introduce selective coroutine bypass mechanism >> for improving performance of virtio-blk dataplane with >> raw format image > > Before applying any bypassing patches, I think we should understand in > detail where we are losing performance with coroutines enabled. >>From the below profiling data, CPU becomes slow to run instructions with coroutine, and CPU dcache miss is increased so it is very likely caused by switching stack frequently. http://marc.info/?l=qemu-devel&m=140679721126306&w=2 http://pastebin.com/ae0vnQ6V > > I also think that the device emulation has no business in deciding > whether the bypass is used (it depends solely on conditions outside of > the device) and that leaking the fd number out of raw-posix is wrong. > Both of them are layering violations that shouldn't be reintroduced. Yes, that is right, and I have added comments that the bypass hint will be moved to block layer completely in future. Thanks,