From: sanbai <sanbai@taobao.com>
To: Vivek Goyal <vgoyal@redhat.com>
Cc: linux-kernel@vger.kernel.org, Zhu Yanhai <gaoyang.zyh@taobao.com>,
Tejun Heo <tj@kernel.org>, Jens Axboe <axboe@kernel.dk>,
Tao Ma <taoma.tm@gmail.com>
Subject: Re: [RFC v1] add new io-scheduler to use cgroup on high-speed device
Date: Fri, 07 Jun 2013 11:09:54 +0800 [thread overview]
Message-ID: <51B14F02.9090409@taobao.com> (raw)
In-Reply-To: <20130605133059.GA16339@redhat.com>
On 2013年06月05日 21:30, Vivek Goyal wrote:
> On Wed, Jun 05, 2013 at 10:09:31AM +0800, Robin Dong wrote:
>> We want to use blkio.cgroup on high-speed device (like fusionio) for our mysql clusters.
>> After testing different io-scheduler, we found that cfq is too slow and deadline can't run on cgroup.
> So why not enhance deadline to be able to be used with cgroups instead of
> coming up with a new scheduler?
I think if we add cgroups support into deadline, it will not be suitable
to call "deadline" anymore...so a new ioscheduler and a new name may not
confuse users.
>
>> So we developed a new io-scheduler: tpps (Tiny Parallel Proportion Scheduler).It dispatch requests
>> only by using their individual weight and total weight (proportion) therefore it's simply and efficient.
> Can you give more details. Do you idle? Idling kills performance. If not,
> then without idling how do you achieve performance differentiation.
We don't idle, when comes to .elevator_dispatch_fn,we just compute
quota for every group:
quota = nr_requests - rq_in_driver;
group_quota = quota * group_weight / total_weight;
and dispatch 'group_quota' requests for the coordinate group. Therefore
high-weight group
will dispatch more requests than low-weight group.
>
>> Test case: fusionio card, 4 cgroups, iodepth-512
>>
>> groupname weight
>> test1 1000
>> test2 800
>> test3 600
>> test4 400
>>
> What's the workload used for this?
>
>> Use tpps, the result is:
>>
>> groupname iops avg-rt(ms) max-rt(ms)
>> test1 30220 16 54
>> test2 28261 18 56
>> test3 26333 19 69
>> test4 20152 25 87
>>
>> Use cfq, the result is:
>>
>> groupname iops avg-rt(ms) max-rt(ms)
>> test1 16478 30 242
>> test2 13015 39 347
>> test3 9300 54 371
>> test4 5806 87 393
> How do results look like with cfq if this is run with slice_idle=0 and
> quatum=128 or higher.
>
> cfqq idles on 3 things. queue (cfqq), service tree and cfq group.
> slice_idle will disable idling on cfqq but not no service tree. If
> we provide a knob for that, then idling on service tree can be disabled
> too and then we will be left with group idling only and then it should
> be much better.
I do the test again for cfq (slice_idle=0, quatum=128) and tpps
cfq (slice_idle=0, quatum=128)
groupname iops avg-rt(ms) max-rt(ms)
test1 16148 15 188
test2 12756 20 117
test3 9778 26 268
test4 6198 41 209
tpps
groupname iops avg-rt(ms) max-rt(ms)
test1 17292 14 65
test2 15221 16 80
test3 12080 21 66
test4 7995 32 90
Looks cfq with is much better than before.
My fio script is :
[global]
direct=1
ioengine=libaio
#ioengine=psync
runtime=30
bs=4k
rw=randread
iodepth=256
filename=/dev/fioa
numjobs=2
#group_reporting
[read1]
cgroup=test1
cgroup_weight=1000
[read2]
cgroup=test2
cgroup_weight=800
[read3]
cgroup=test3
cgroup_weight=600
[read4]
cgroup=test4
cgroup_weight=400
>
> Thanks
> Vivek
--
Robin Dong
董昊(花名:三百)
阿里巴巴 集团 核心系统部 内核组
分机:72370
手机:13520865473
email:sanbai@taobao.com
next prev parent reply other threads:[~2013-06-07 3:10 UTC|newest]
Thread overview: 12+ messages / expand[flat|nested] mbox.gz Atom feed top
2013-06-05 2:09 [RFC v1] add new io-scheduler to use cgroup on high-speed device Robin Dong
2013-06-05 3:03 ` Tejun Heo
2013-06-05 3:26 ` sanbai
2013-06-05 13:55 ` Vivek Goyal
2013-06-05 17:36 ` Tejun Heo
2013-06-05 13:59 ` Vivek Goyal
2013-06-05 13:30 ` Vivek Goyal
2013-06-07 3:09 ` sanbai [this message]
2013-06-07 19:53 ` Vivek Goyal
2013-06-08 3:50 ` sanbai
2013-06-08 4:38 ` sanbai
-- strict thread matches above, loose matches on Subject: below --
2013-06-05 2:23 Robin Dong
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=51B14F02.9090409@taobao.com \
--to=sanbai@taobao.com \
--cc=axboe@kernel.dk \
--cc=gaoyang.zyh@taobao.com \
--cc=linux-kernel@vger.kernel.org \
--cc=taoma.tm@gmail.com \
--cc=tj@kernel.org \
--cc=vgoyal@redhat.com \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox