From: sanbai <sanbai@taobao.com>
To: Vivek Goyal <vgoyal@redhat.com>
Cc: linux-kernel@vger.kernel.org, Zhu Yanhai <gaoyang.zyh@taobao.com>,
Tejun Heo <tj@kernel.org>, Jens Axboe <axboe@kernel.dk>,
Tao Ma <taoma.tm@gmail.com>
Subject: Re: [RFC v1] add new io-scheduler to use cgroup on high-speed device
Date: Fri, 07 Jun 2013 11:09:54 +0800 [thread overview]
Message-ID: <51B14F02.9090409@taobao.com> (raw)
In-Reply-To: <20130605133059.GA16339@redhat.com>
On 2013年06月05日 21:30, Vivek Goyal wrote:
> On Wed, Jun 05, 2013 at 10:09:31AM +0800, Robin Dong wrote:
>> We want to use blkio.cgroup on high-speed device (like fusionio) for our mysql clusters.
>> After testing different io-scheduler, we found that cfq is too slow and deadline can't run on cgroup.
> So why not enhance deadline to be able to be used with cgroups instead of
> coming up with a new scheduler?
I think if we add cgroups support into deadline, it will not be suitable
to call "deadline" anymore...so a new ioscheduler and a new name may not
confuse users.
>
>> So we developed a new io-scheduler: tpps (Tiny Parallel Proportion Scheduler).It dispatch requests
>> only by using their individual weight and total weight (proportion) therefore it's simply and efficient.
> Can you give more details. Do you idle? Idling kills performance. If not,
> then without idling how do you achieve performance differentiation.
We don't idle, when comes to .elevator_dispatch_fn,we just compute
quota for every group:
quota = nr_requests - rq_in_driver;
group_quota = quota * group_weight / total_weight;
and dispatch 'group_quota' requests for the coordinate group. Therefore
high-weight group
will dispatch more requests than low-weight group.
>
>> Test case: fusionio card, 4 cgroups, iodepth-512
>>
>> groupname weight
>> test1 1000
>> test2 800
>> test3 600
>> test4 400
>>
> What's the workload used for this?
>
>> Use tpps, the result is:
>>
>> groupname iops avg-rt(ms) max-rt(ms)
>> test1 30220 16 54
>> test2 28261 18 56
>> test3 26333 19 69
>> test4 20152 25 87
>>
>> Use cfq, the result is:
>>
>> groupname iops avg-rt(ms) max-rt(ms)
>> test1 16478 30 242
>> test2 13015 39 347
>> test3 9300 54 371
>> test4 5806 87 393
> How do results look like with cfq if this is run with slice_idle=0 and
> quatum=128 or higher.
>
> cfqq idles on 3 things. queue (cfqq), service tree and cfq group.
> slice_idle will disable idling on cfqq but not no service tree. If
> we provide a knob for that, then idling on service tree can be disabled
> too and then we will be left with group idling only and then it should
> be much better.
I do the test again for cfq (slice_idle=0, quatum=128) and tpps
cfq (slice_idle=0, quatum=128)
groupname iops avg-rt(ms) max-rt(ms)
test1 16148 15 188
test2 12756 20 117
test3 9778 26 268
test4 6198 41 209
tpps
groupname iops avg-rt(ms) max-rt(ms)
test1 17292 14 65
test2 15221 16 80
test3 12080 21 66
test4 7995 32 90
Looks cfq with is much better than before.
My fio script is :
[global]
direct=1
ioengine=libaio
#ioengine=psync
runtime=30
bs=4k
rw=randread
iodepth=256
filename=/dev/fioa
numjobs=2
#group_reporting
[read1]
cgroup=test1
cgroup_weight=1000
[read2]
cgroup=test2
cgroup_weight=800
[read3]
cgroup=test3
cgroup_weight=600
[read4]
cgroup=test4
cgroup_weight=400
>
> Thanks
> Vivek
--
Robin Dong
董昊(花名:三百)
阿里巴巴 集团 核心系统部 内核组
分机:72370
手机:13520865473
email:sanbai@taobao.com
next prev parent reply other threads:[~2013-06-07 3:10 UTC|newest]
Thread overview: 12+ messages / expand[flat|nested] mbox.gz Atom feed top
2013-06-05 2:09 [RFC v1] add new io-scheduler to use cgroup on high-speed device Robin Dong
2013-06-05 3:03 ` Tejun Heo
2013-06-05 3:26 ` sanbai
2013-06-05 13:55 ` Vivek Goyal
2013-06-05 17:36 ` Tejun Heo
2013-06-05 13:59 ` Vivek Goyal
2013-06-05 13:30 ` Vivek Goyal
2013-06-07 3:09 ` sanbai [this message]
2013-06-07 19:53 ` Vivek Goyal
2013-06-08 3:50 ` sanbai
2013-06-08 4:38 ` sanbai
-- strict thread matches above, loose matches on Subject: below --
2013-06-05 2:23 Robin Dong
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=51B14F02.9090409@taobao.com \
--to=sanbai@taobao.com \
--cc=axboe@kernel.dk \
--cc=gaoyang.zyh@taobao.com \
--cc=linux-kernel@vger.kernel.org \
--cc=taoma.tm@gmail.com \
--cc=tj@kernel.org \
--cc=vgoyal@redhat.com \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.