From: Josh Durgin <josh.durgin@inktank.com>
To: 叶绍琛 <yeshaochen@foxmail.com>
Cc: qemu-devel <qemu-devel@nongnu.org>
Subject: Re: [Qemu-devel] Re: Re: Re: Re: qemu vm big network latency when met heavy io
Date: Mon, 13 Jan 2014 23:58:14 -0800
Message-ID: <52D4EE16.5060703@inktank.com>
In-Reply-To: <tencent_35E422F81535374D5992C5AB@qq.com>
On 01/13/2014 10:39 PM, 叶绍琛 wrote:
> Hi Josh
>
> Thanks for your reply.
>
> librbd version : 0.67.5-1
>
> /etc/ceph/ceph.conf config file:
> The IPs and hostnames have been hidden.
> [global]
> ;open auth.
> auth cluster required = cephx
> auth service required = cephx
> auth client required = cephx
> ;global pid & log setting.
> admin socket = /home/ceph/var/run/$cluster-$name.asok
>
> [mon]
> keyring = /home/ceph/var/lib/$type/$cluster-$id/keyring
> mon data = /home/ceph/var/lib/$type/$cluster-$id
> mon cluster log file = /home/ceph/log/$cluster.log
> [mon.a]
> host = cld-xx
> mon addr = x.x.x.x:6789
> user = ceph
> [mon.b]
> host = cld-xx
> mon addr = x.x.x.x:6789
> user = ceph
> [mon.c]
> host = cld-xx
> mon addr = x.x.x.x:6789
> user = ceph
> [mon.d]
> host = cld-xx
> mon addr = x.x.x.x:6789
> user = ceph
> [mon.e]
> host = cld-xx
> mon addr = x.x.x.x:6789
> user = ceph
>
> [osd]
> keyring = /home/ceph/var/lib/$type/$cluster-$id/keyring
> osd data = /home/ceph/var/lib/$type/$cluster-$id
> osd journal = /home/ceph/var/lib/$type/$cluster-$id/journal
> osd journal size = 1000
> osd mkfs type = xfs
> osd mount options xfs = rw,noatime,inode64
> [osd.0]
> host = cld-xx
> addr = x.x.x.x
> user = ceph
> devs = /dev/sdb1
> [osd.1]
> host = cld-xx
> addr = x.x.x.x
> user = ceph
> devs = /dev/sda1
> [osd.2]
> host = cld-xx
> addr = x.x.x.x
> user = ceph
> devs = /dev/sdb1
> [osd.3]
> host = cld-xx
> addr = x.x.x.x
> user = ceph
> devs = /dev/sda1
> [osd.4]
> host = cld-xx
> addr = x.x.x.x
> user = ceph
> devs = /dev/sdb1
> [osd.5]
> host = cld-xx
> addr = x.x.x.x
> user = ceph
> devs = /dev/sda1
>
> [client.libvirt]
> rbd cache = true
Turning on rbd caching in ceph.conf while telling QEMU, through
libvirt's XML, that the cache mode is not writeback, e.g.

    <driver name='qemu' type='raw' cache='none'/>

is not safe, since QEMU won't propagate flush requests from the guest
to librbd's cache. Since QEMU 1.3, setting the QEMU cache mode also
sets the librbd cache mode appropriately, so you don't need the
setting in your ceph.conf file at all.
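
To illustrate the mismatch described above, here is a minimal Python
sketch that flags it from a ceph.conf text and a libvirt disk (or
driver) XML snippet. The function names are hypothetical, and treating
every QEMU cache mode other than 'writeback' as a mismatch is an
assumption; the text above only names cache='none'. The section name
client.libvirt is taken from the quoted config.

```python
import configparser
import xml.etree.ElementTree as ET


def rbd_cache_enabled(ceph_conf_text, section="client.libvirt"):
    """Return True if 'rbd cache' is true in the given ceph.conf section."""
    parser = configparser.ConfigParser(interpolation=None, strict=False)
    parser.read_string(ceph_conf_text)
    if not parser.has_section(section):
        return False
    for key, value in parser.items(section):
        # Ceph accepts spaces or underscores in option names.
        if key.replace("_", " ").strip() == "rbd cache":
            return value.strip().lower() in ("true", "1")
    return False


def qemu_cache_mode(xml_text):
    """Return the cache attribute of the <driver> element, or None."""
    root = ET.fromstring(xml_text)
    driver = root if root.tag == "driver" else root.find("driver")
    return None if driver is None else driver.get("cache")


def is_mismatched(ceph_conf_text, xml_text):
    """True when librbd caching is on but QEMU is not in writeback mode."""
    return (rbd_cache_enabled(ceph_conf_text)
            and qemu_cache_mode(xml_text) != "writeback")
```

Calling is_mismatched() with the [client.libvirt] section quoted above
and the cache='none' driver line should report the unsafe combination.
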
Can you verify that your QEMU binary was built against librbd v0.60 or
later?

It would also help to see whether you can reproduce the problem when
QEMU is sending flushes (cache=writeback), and with the cache
disabled (cache=none, no 'rbd cache' setting in ceph.conf).

The next step would be gathering a log from librbd (debug rbd = 20,
debug objectcacher = 20, debug objecter = 20, debug ms = 1) while this
problem is occurring, and correlating it with a timestamped log of
the network latency.
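
One possible way to do that correlation is sketched below in Python.
It assumes the latency log comes from `ping -D` (lines prefixed with
an epoch timestamp in square brackets) and that each librbd log line
starts with a "YYYY-MM-DD HH:MM:SS.ffffff" local-time timestamp; both
formats, the function names, and the threshold and window defaults are
assumptions to adapt to the actual logs.

```python
import re
from datetime import datetime, timedelta

# Assumed `ping -D` line: "[1389686294.500000] 64 bytes from ...: ... time=250 ms"
PING_RE = re.compile(r"^\[(\d+\.\d+)\].*time=([\d.]+) ms")
# Assumed timestamp prefix of each librbd log line.
CEPH_TS_FORMAT = "%Y-%m-%d %H:%M:%S.%f"


def parse_ping(lines):
    """Yield (datetime, latency_ms) for each matching ping line."""
    for line in lines:
        m = PING_RE.match(line)
        if m:
            yield datetime.fromtimestamp(float(m.group(1))), float(m.group(2))


def parse_ceph_log(lines):
    """Yield (datetime, line) for lines that start with a timestamp."""
    for line in lines:
        try:
            ts = datetime.strptime(line[:26], CEPH_TS_FORMAT)
        except ValueError:
            continue  # skip lines without a leading timestamp
        yield ts, line.rstrip("\n")


def log_lines_near_spikes(ping_lines, ceph_lines,
                          threshold_ms=100.0, window_s=1.0):
    """Map each latency spike time to the log lines within the window."""
    spikes = [ts for ts, ms in parse_ping(ping_lines) if ms >= threshold_ms]
    entries = list(parse_ceph_log(ceph_lines))
    window = timedelta(seconds=window_s)
    return {
        spike: [line for ts, line in entries if abs(ts - spike) <= window]
        for spike in spikes
    }
```

Both logs need to come from hosts with synchronized clocks in the same
timezone for the window comparison to be meaningful.
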
Thanks,
Josh
> auth support = cephx none
> mon host =
> x.x.x.x:6789;x.x.x.x:6789;x.x.x.x:6789;x.x.x.x:6789;x.x.x.x:6789
>
> Regards
>
> Alan ye
>
> ------------------
> 叶绍琛
> Alan Ye
>
>
>
> ------------------ Original Message ------------------
> *From:* "Josh Durgin";<josh.durgin@inktank.com>;
> *Sent:* Tuesday, January 14, 2014, 2:24 PM
> *To:* "Stefan Hajnoczi"<stefanha@gmail.com>; "叶绍琛"
> <yeshaochen@foxmail.com>;
> *Cc:* "qemu-devel"<qemu-devel@nongnu.org>;
> *Subject:* Re: Re: Re: Re: [Qemu-devel] qemu vm big network
> latency when met heavy io
>
> On 01/12/2014 06:39 PM, Stefan Hajnoczi wrote:
> > On Fri, Jan 10, 2014 at 11:50 AM, 叶绍琛 <yeshaochen@foxmail.com> wrote:
> >
> > Please use Reply-all to keep the CC list intact. That way the
> > conversation stays on the mailing list and others can participate.
> >
> >>> Is the sum of guests' RAM less than the total physical RAM on the host?
> >> The host runs 3 VMs; each VM uses one vCPU core and 1 GB of RAM.
> >> # free -m
> >>              total       used       free     shared    buffers     cached
> >> Mem:         32242       4808      27434          0        278       2058
> >> -/+ buffers/cache:       2471      29771
> >> Swap:         4095          0       4095
> >>
> >> The host has 8 cores.
> >> # cat /proc/cpuinfo | grep processor
> >> processor : 0
> >> processor : 1
> >> processor : 2
> >> processor : 3
> >> processor : 4
> >> processor : 5
> >> processor : 6
> >> processor : 7
> >>
> >> So the answer to both questions is 'yes'.
> >> While I run the random write test, the host uses no swap.
> >
> > Great. That means the host is not overcommitted.
> >
> > It's likely that the problem is a bug in QEMU's rbd driver or librados.
> >
> > Josh: Perhaps something you're interested in looking into?
>
> Yes, thanks for bringing it to my attention. It does sound like a bug in
> QEMU's rbd driver or ceph's userspace libraries.
>
> Could you share what version of librbd you're using, and your
> /etc/ceph/ceph.conf?
>
> Thanks,
> Josh