From: Liu Yuan <namei.unix@gmail.com>
To: Jeff Cody <jcody@redhat.com>
Cc: Kevin Wolf <kwolf@redhat.com>,
Teruaki Ishizaki <ishizaki.teruaki@lab.ntt.co.jp>,
Hitoshi Mitake <mitake.hitoshi@gmail.com>,
Hitoshi Mitake <mitake.hitoshi@lab.ntt.co.jp>,
qemu-devel@nongnu.org, Vasiliy Tolstov <v.tolstov@selfip.ru>,
sheepdog-ng@googlegroups.com, morita.kazutaka@gmail.com,
Stefan Hajnoczi <stefanha@redhat.com>,
sheepdog@lists.wpkg.org
Subject: Re: [Qemu-devel] [sheepdog] [PATCH] sheepdog: fix overlapping metadata update
Date: Mon, 3 Aug 2015 10:01:00 +0800
Message-ID: <20150803020100.GA4733@ubuntu-trusty>
In-Reply-To: <20150730132744.GA11022@localhost.localdomain>
On Thu, Jul 30, 2015 at 09:27:44AM -0400, Jeff Cody wrote:
> On Thu, Jul 30, 2015 at 09:41:08AM +0300, Vasiliy Tolstov wrote:
> > 2015-07-29 12:31 GMT+03:00 Liu Yuan <namei.unix@gmail.com>:
> > > Technically, it won't affect performance, because index updates are not
> > > ranges but discrete entries in terms of the underlying 4M block size. Only
> > > 2 or 3 indexes in a range will be updated, and 90+% of updates will touch
> > > only 1. So if 2 updates span a large range, it will actually worsen
> > > sheepdog's performance, because sheep will internally execute many
> > > additional object unrefs.
> > >
> > > It is not a performance problem but a question of the right fix. Even with
> > > your patch, inode updates can overlap. You just don't allow overlapping
> > > requests to go to sheepdog, which is an overkill approach. IMHO, we should
> > > only adjust the code to avoid overlapping inode updates, which can be done
> > > easily and incrementally on top of the old code, rather than take on a
> > > completely new, untested, overkill mechanism. So what do we get from your
> > > patch? Covering up the problem and locking every request?
> > >
> > > Your patch actually fixes nothing; it just covers up the problem by slowing
> > > down requests, and even with your patch the problem still exists because
> > > inode updates can overlap. Your commit log doesn't explain what the real
> > > problem is or why your fix works. This is not your toy project where you
> > > can commit whatever you want.
> > >
> > >> BTW, sheepdog project was already forked, why don't you fork the block
> > >> driver, too?
> > >
> > > What makes you think you own the block driver?
> > >
> > > We forked the sheepdog project partly because of its low code quality and
> > > mostly because some company is trying to make it a private project. It is
> > > not as open-source friendly as before, and that is the main reason Kazutaka
> > > and I chose to fork the sheepdog project. But this doesn't mean we need to
> > > fork the QEMU project; it is an open source project and not your home toy.
> > >
> > > Kazutaka and I have been the biggest contributors to both sheepdog and the
> > > QEMU sheepdog block driver for years, so I think I am eligible to review
> > > the patch and responsible for suggesting the right fix. If you are pissed
> > > off when someone else has a different opinion, you can just fork the code
> > > and play with it at home, or you can follow the rules of an open source
> > > project.
> >
> >
> > Jeff Cody, please be the judge. The patch from Hitoshi solved the problem
> > that I emailed to the sheepdog list about (I have a test environment with
> > 8 hosts, each with 6 SSD disks, and an InfiniBand interconnect between
> > hosts). Before Hitoshi's patch, massive writes to sheepdog storage broke
> > the file system and corrupted it.
> > After the patch I don't see issues.
> >
>
> I'd rather see some sort of consensus amongst Liu, Hitoshi, yourself, or
> others more intimately familiar with sheepdog.
>
> Right now, we have Hitoshi's patch in the main git repo, slated for the
> 2.4 release (which is Monday). It sounds, from Liu's email, as though this
> may not fix the root cause.
>
> Vasiliy said he would test Liu's patch; if he can confirm that this new
> patch fixes the issue, then I would be inclined to use Liu's patch, based
> on the detailed analysis of the issue in the commit message.
>
Here is my performance comparison on top of the latest QEMU, run on my laptop
with an SSD. The sheepdog cluster runs 3 nodes with '-n' to get the best volume
performance.
QEMU command:
qemu-system-x86_64 -m 1024 --enable-kvm \
-drive file=debian_squeeze_amd64_standard.qcow2,cache=writeback,if=virtio \
-drive file=sheepdog:test,if=virtio
sheepdog:test was created with 'dog vdi create test 80G'.
I measured both the time for mkfs and the IOPS for the fio write job.
fio.conf:
[global]
ioengine=libaio
direct=1
thread=1
norandommap=1
runtime=60
size=300M
directory=/mnt
[write4k-rand]
stonewall
group_reporting
bs=4k
rw=randwrite
numjobs=8
iodepth=32
Result:
================================================
sheep formatted with -c 2:1 (erasure coding)
           mkfs time    fio IOPS
Yuan       0.069        4578
Hitoshi    0.071        3722

sheep formatted with -c 2 (replication)
           mkfs time    fio IOPS
Yuan       0.074        6873
Hitoshi    0.081        6174
================================================
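
To make the gap between the two patches easier to read, the relative differences can be worked out from the figures above. This is only a rough sketch: the numbers are copied from the result table, while the dictionary layout and the helper name relative_gain are assumptions made for illustration and were not part of the measurement itself.

```python
# Figures copied from the result table: (mkfs time, fio write IOPS).
# The dictionary layout and the helper name are illustrative assumptions.
results = {
    "erasure coding (-c 2:1)": {"Yuan": (0.069, 4578), "Hitoshi": (0.071, 3722)},
    "replication (-c 2)": {"Yuan": (0.074, 6873), "Hitoshi": (0.081, 6174)},
}


def relative_gain(yuan, hitoshi):
    """Return (mkfs time reduction %, fio IOPS increase %) of Yuan vs. Hitoshi."""
    mkfs_reduction = (hitoshi[0] - yuan[0]) / hitoshi[0] * 100
    iops_increase = (yuan[1] - hitoshi[1]) / hitoshi[1] * 100
    return mkfs_reduction, iops_increase


for layout, rows in results.items():
    mkfs_pct, iops_pct = relative_gain(rows["Yuan"], rows["Hitoshi"])
    print(f"{layout}: mkfs time {mkfs_pct:.1f}% lower, "
          f"fio IOPS {iops_pct:.1f}% higher")
```

With these inputs the fio difference comes out to roughly 23% for erasure coding and roughly 11% for replication, while the mkfs times differ by a few percent.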
Thanks,
Yuan