qemu-devel.nongnu.org archive mirror
 help / color / mirror / Atom feed
From: Liu Yuan <namei.unix@gmail.com>
To: Jeff Cody <jcody@redhat.com>
Cc: Kevin Wolf <kwolf@redhat.com>,
	Teruaki Ishizaki <ishizaki.teruaki@lab.ntt.co.jp>,
	Hitoshi Mitake <mitake.hitoshi@gmail.com>,
	Hitoshi Mitake <mitake.hitoshi@lab.ntt.co.jp>,
	qemu-devel@nongnu.org, Vasiliy Tolstov <v.tolstov@selfip.ru>,
	sheepdog-ng@googlegroups.com, morita.kazutaka@gmail.com,
	Stefan Hajnoczi <stefanha@redhat.com>,
	sheepdog@lists.wpkg.org
Subject: Re: [Qemu-devel] [sheepdog] [PATCH] sheepdog: fix overlapping metadata update
Date: Mon, 3 Aug 2015 10:01:00 +0800	[thread overview]
Message-ID: <20150803020100.GA4733@ubuntu-trusty> (raw)
In-Reply-To: <20150730132744.GA11022@localhost.localdomain>

On Thu, Jul 30, 2015 at 09:27:44AM -0400, Jeff Cody wrote:
> On Thu, Jul 30, 2015 at 09:41:08AM +0300, Vasiliy Tolstov wrote:
> > 2015-07-29 12:31 GMT+03:00 Liu Yuan <namei.unix@gmail.com>:
> > > Technically, it won't affect the performance because index updates are not range
> > > but concrete in terms of underlying 4M block size. Only 2 or 3 indexes in a
> > > range will be updated and 90+% updates will be only 1. So if 2 updates stride a
> > > large range, it will actually worse the performance of sheepdog because many
> > > additional unref of object will be executed by sheep internally.
> > >
> > > It is not a performance problem but more the right fix. Even with your patch,
> > > updates of inode can overlap. You just don't allow overlapped requests go to
> > > sheepdog, which is a overkill approach. IMHO, we should only adjust to avoid
> > > the overlapped inode updates, which can be done easily and incrementally on top
> > > of old code, rather than take on a complete new untested overkill mechanism. So
> > > what we get from your patch? Covering the problem and lock every requests?
> > >
> > > Your patch actually fix nothing but just cover the problem by slowing down the
> > > request and even with your patch, the problem still exists because inode updates
> > > can overlap. Your commit log doesn't explain what is the real problem and why
> > > your fix works. This is not your toy project that can commit whatever you want.
> > >
> > >> BTW, sheepdog project was already forked, why don't you fork the block
> > >> driver, too?
> > >
> > > What makes you think you own the block driver?
> > >
> > > We forked the sheepdog project because it is low quality of code partly and mostly
> > > some company tries to make it a private project. It is not as open source friendly
> > > as before and that is the main reason Kazutaka and I chose to fork the sheepdog
> > > project. But this doesn't mean we need to fork the QEMU project, it is an
> > > open source project and not your home toy.
> > >
> > > Kazutaka and I are the biggest contributers of both sheepdog and QEMU sheepdog
> > > block driver for years, so I think I am eligible to review the patch and
> > > responsible to suggest the right fix. If you are pissed off when someone else
> > > have other opinions, you can just fork the code and play with it at home or you
> > > follow the rule of open source project.
> > 
> > 
> > Jeff Cody, please be the judge, patch from Hitoshi solved my problem
> > that i emailed in sheepdog list (i have test environment with 8 hosts
> > on each 6 SSD disks and infiniband interconnect between hosts) before
> > Hitoshi patch, massive writing to sheepdog storage breaks file system
> > and corrupt it.
> > After the patch i don't see issues.
> >
> 
> I'd rather see some sort consensus amongst Liu, Hitoshi, yourself, or
> others more intimately familiar with sheepdog.
> 
> Right now, we have Hitoshi's patch in the main git repo, slated for
> 2.4 release (which is Monday).  It sounds, from Liu's email, as this
> may not fix the root cause.
> 
> Vasiliy said he would test Liu's patch; if he can confirm this new
> patch fix, then I would be inclined to use Liu's patch, based on the
> detailed analysis of the issue in the commit message.
> 

This is my performance comparison on top of latest QEMU with my latop with SSD.

sheepdog cluster run with 3 nodes with '-n' to get best volume performance.
QEMU command:
qemu-system-x86_64 -m 1024 --enable-kvm \
	-drive file=debian_squeeze_amd64_standard.qcow2,cache=writeback,if=virtio \
	-drive file=sheepdog:test,if=virtio

sheepdog:test is created as 'dog vdi create test 80G'

I test both time for mkfs and iops for fio write.

fio.conf:
[global]
ioengine=libaio
direct=1
thread=1
norandommap=1
runtime=60
size=300M
directory=/mnt

[write4k-rand]
stonewall
group_reporting
bs=4k
rw=randwrite
numjobs=8
iodepth=32

Resualt:
================================================
sheep formated with -c 2:1 (erasure coding)
       mkfs      fio
Yuan   0.069     4578    
Hitosh 0.071     3722

sheep formarted with -c 2 (replication)
       mkfs      fio
Yuan   0.074     6873  
Hitosh 0.081     6174
================================================

Thanks,
Yuan

      parent reply	other threads:[~2015-08-03  2:01 UTC|newest]

Thread overview: 18+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2015-07-29  4:02 [Qemu-devel] [PATCH] sheepdog: fix overlapping metadata update Liu Yuan
2015-07-29  5:04 ` [Qemu-devel] [sheepdog] " Hitoshi Mitake
2015-07-29  9:31   ` Liu Yuan
2015-07-30  6:41     ` Vasiliy Tolstov
2015-07-30  9:13       ` Liu Yuan
2015-07-30  9:29         ` Vasiliy Tolstov
2015-07-30 13:27       ` Jeff Cody
2015-07-31 11:55         ` Vasiliy Tolstov
2015-07-31 12:08           ` Vasiliy Tolstov
2015-08-02  2:06             ` Liu Yuan
2015-08-02 11:52             ` Vasiliy Tolstov
2015-08-02 12:07               ` Vasiliy Tolstov
2015-08-03  0:41               ` Liu Yuan
2015-08-04  8:07                 ` Vasiliy Tolstov
2015-08-05 18:58                   ` Jeff Cody
2015-08-09 14:03                     ` Vasiliy Tolstov
2015-08-10 10:44                       ` Stefan Hajnoczi
2015-08-03  2:01         ` Liu Yuan [this message]

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=20150803020100.GA4733@ubuntu-trusty \
    --to=namei.unix@gmail.com \
    --cc=ishizaki.teruaki@lab.ntt.co.jp \
    --cc=jcody@redhat.com \
    --cc=kwolf@redhat.com \
    --cc=mitake.hitoshi@gmail.com \
    --cc=mitake.hitoshi@lab.ntt.co.jp \
    --cc=morita.kazutaka@gmail.com \
    --cc=qemu-devel@nongnu.org \
    --cc=sheepdog-ng@googlegroups.com \
    --cc=sheepdog@lists.wpkg.org \
    --cc=stefanha@redhat.com \
    --cc=v.tolstov@selfip.ru \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).