From: Kevin Wolf
Date: Tue, 08 Jan 2013 11:51:43 +0100
Subject: Re: [Qemu-devel] [PATCH] sheepdog: implement direct write semantics
To: Liu Yuan
Cc: Stefan Hajnoczi, qemu-devel@nongnu.org, MORITA Kazutaka, Paolo Bonzini
Message-ID: <50EBFA3F.8030808@redhat.com>
In-Reply-To: <50EBF755.3050607@gmail.com>

On 08.01.2013 11:39, Liu Yuan wrote:
> On 01/08/2013 06:00 PM, Kevin Wolf wrote:
>> On 08.01.2013 10:45, Liu Yuan wrote:
>>> On 01/08/2013 05:40 PM, Stefan Hajnoczi wrote:
>>>> Otherwise use sheepdog writeback and let QEMU block.c decide when to
>>>> flush. Never use sheepdog writethrough because it's redundant here.
>>>
>>> I don't get it. What do you mean by 'redundant'? If we use virtio &
>>> sheepdog block driver, how can we specify writethrough mode for Sheepdog
>>> cache? Here 'writethrough' means use a pure read cache, which doesn't
>>> need flush at all.
>>
>> A writethrough cache is equivalent to a write-back cache where each
>> write is followed by a flush. qemu makes sure to send these flushes, so
>> there is no need to use Sheepdog's writethrough mode.
>
> Implementing writethrough as writeback + flush will cause considerable
> overhead for a network block device like Sheepdog: a single write request
> will be executed as two requests: write + flush

Yeah, maybe we should have some kind of a FUA flag with write requests
instead of sending a separate flush.

> This also explains why
> I saw a regression in write performance: Old QEMU can issue multiple
> write requests in one go, but now the requests are sent one by one (even
> with cache=writeback set), which makes Sheepdog write performance drop a
> lot. Is it possible to issue multiple requests in one go as old QEMU does?

Huh? We didn't change anything in that respect, or at least not that I'm
aware of. qemu always only had single-request bdrv_co_writev, so if
anything that batching must have happened inside Sheepdog code? Do you
know what makes it not batch requests any more?

> It seems it is hard to restore the old semantics of cache flags due to
> the new design of the QEMU block layer. So will you accept adding a
> 'flags' field to BlockDriverState which carries the 'cache flags' from
> the user to keep backward compatibility?

No, going back to the old behaviour would break guest-toggled WCE.

Kevin
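
To make the request-count argument in this thread concrete, here is a toy model (not QEMU or Sheepdog code) of the two ways to get writethrough behaviour on top of a write-back cache: a write followed by a separate flush, versus a single write carrying a FUA-style flag. Every name in it (ToyBackend, writethrough_via_flush, writethrough_via_fua, the fua parameter) is hypothetical and exists only for illustration. It assumes FUA means "this write reaches stable storage before it completes".

```python
from dataclasses import dataclass, field


@dataclass
class ToyBackend:
    """Hypothetical network block backend with a write-back cache."""
    requests: list = field(default_factory=list)  # every request sent over the wire
    cache: dict = field(default_factory=dict)     # dirty data, not yet stable
    stable: dict = field(default_factory=dict)    # data on stable storage

    def write(self, offset, data, fua=False):
        # One network request either way; with fua the data bypasses the
        # dirty cache and is stable when the request completes.
        self.requests.append("write+fua" if fua else "write")
        if fua:
            self.stable[offset] = data
            self.cache.pop(offset, None)  # drop any stale dirty copy
        else:
            self.cache[offset] = data

    def flush(self):
        # A second, separate network request that makes all dirty data stable.
        self.requests.append("flush")
        self.stable.update(self.cache)
        self.cache.clear()


def writethrough_via_flush(backend, offset, data):
    """Writethrough emulated as write-back plus a flush after the write."""
    backend.write(offset, data)
    backend.flush()


def writethrough_via_fua(backend, offset, data):
    """Writethrough emulated with a single FUA-flagged write (assumed semantics)."""
    backend.write(offset, data, fua=True)


if __name__ == "__main__":
    a, b = ToyBackend(), ToyBackend()
    writethrough_via_flush(a, 0, b"x")
    writethrough_via_fua(b, 0, b"x")
    print(a.requests, b.requests)
```

In this model both variants leave the data on stable storage with nothing dirty in the cache, but the flush-based one sends two requests per guest write and the FUA-based one sends one. That is the overhead Liu Yuan describes and the reason a FUA flag is floated as an alternative.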