From: Vivek Goyal <vgoyal@redhat.com>
To: linux kernel mailing list <linux-kernel@vger.kernel.org>
Cc: Jens Axboe <axboe@kernel.dk>, Nauman Rafique <nauman@google.com>,
Gui Jianfeng <guijianfeng@cn.fujitsu.com>,
Divyesh Shah <dpshah@google.com>,
Heinz Mauelshagen <heinzm@redhat.com>,
arighi@develer.com
Subject: Re: [RFC PATCH] Bio Throttling support for block IO controller
Date: Wed, 1 Sep 2010 16:07:57 -0400 [thread overview]
Message-ID: <20100901200756.GE22149@redhat.com> (raw)
In-Reply-To: <20100901175830.GC22149@redhat.com>
On Wed, Sep 01, 2010 at 01:58:30PM -0400, Vivek Goyal wrote:
> Hi,
>
> Currently CFQ provides the weight based proportional division of bandwidth.
> People also have been looking at extending block IO controller to provide
> throttling/max bandwidth control.
>
> I have started to write the support for throttling in block layer on
> request queue so that it can be used both for higher level logical
> devices as well as leaf nodes. This patch is still work in progress but
> I wanted to post it for early feedback.
>
> Basically currently I have hooked into __make_request() function to
> check which cgroup bio belongs to and if it is exceeding the specified
> BW rate. If no, thread can continue to dispatch bio as it is otherwise
> bio is queued internally and dispatched later with the help of a worker
> thread.
>
> HOWTO
> =====
> - Mount blkio controller
> mount -t cgroup -o blkio none /cgroup/blkio
>
> - Specify a bandwidth rate on particular device for root group. The format
> for policy is "<major>:<minor> <byes_per_second>".
>
> echo "8:16 1048576" > /cgroup/blkio/blkio.read_bps_device
>
> Above will put a limit of 1MB/second on reads happening for root group
> on device having major/minor number 8:16.
>
> - Run dd to read a file and see if rate is throttled to 1MB/s or not.
>
> # dd if=/mnt/common/zerofile of=/dev/null bs=4K count=1024 iflag=direct
> 1024+0 records in
> 1024+0 records out
> 4194304 bytes (4.2 MB) copied, 4.0001 s, 1.0 MB/s
>
> Limits for writes can be put using blkio.write_bps_device file.
>
> Open Issues
> ===========
> - Do we need to provide additional queue congestion semantics as we are
> throttling and queuing bios at request queue and probably we don't want
> a user space application to consume all the memory allocating bios
> and bombarding request queue with those bios.
>
> - How to handle the current blkio cgroup stats file and two policies
> in the background. If for some reason both throttling and proportional
> BW policies are operating on request queue, then stats will be very
> confusing.
>
> May be we can allow activating either throttling or proportional BW
> policy per request queue and we can create a /sys tunable to list and
> chose between policies (something like choosing IO scheduler). The
> only downside of this apporach is that user also need to be aware of
> the storage hierachy and activate right policy at each node/request
> queue.
Thinking more about it. The issue of stats from proportional bandwidth
controller and max bandwidth controller clobbering each other can
probably be solved by also specifying policy name with the stat. For
example, currently blkio.io_serviced, looks as follows.
# cat blkio.io_serviced
253:2 Read 61
253:2 Write 0
253:2 Sync 61
253:2 Async 0
253:2 Total 61
We can introduce one more field to specify policy for which this stats are as
follows.
# cat blkio.io_serviced
253:2 Read 61 throttle
253:2 Write 0 throttle
253:2 Sync 61 throttle
253:2 Async 0 throttle
253:2 Total 61 throttle
253:2 Read 61 proportional
253:2 Write 0 proportional
253:2 Sync 61 proportional
253:2 Async 0 proportional
253:2 Total 61 proportional
It will allow us following.
- Avoid two control policies overwritting each other's stats.
- Allow both policies (throttle, proportional) to be operational on
same request queue at the same time instead of forcing user to choose
one.
- We don't have to introduce another /sys variable per request queue and
that will make life easier in terms of configuration.
Thoughts?
Vivek
next prev parent reply other threads:[~2010-09-01 20:08 UTC|newest]
Thread overview: 11+ messages / expand[flat|nested] mbox.gz Atom feed top
2010-09-01 17:58 [RFC PATCH] Bio Throttling support for block IO controller Vivek Goyal
2010-09-01 20:07 ` Vivek Goyal [this message]
2010-09-02 15:18 ` Vivek Goyal
2010-09-02 16:22 ` Nauman Rafique
2010-09-02 17:22 ` Vivek Goyal
2010-09-02 17:32 ` Balbir Singh
2010-09-02 18:39 ` Paul E. McKenney
2010-09-03 1:57 ` Vivek Goyal
2010-09-03 23:36 ` Paul E. McKenney
2010-09-03 9:50 ` Gui Jianfeng
2010-09-03 12:48 ` Vivek Goyal
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=20100901200756.GE22149@redhat.com \
--to=vgoyal@redhat.com \
--cc=arighi@develer.com \
--cc=axboe@kernel.dk \
--cc=dpshah@google.com \
--cc=guijianfeng@cn.fujitsu.com \
--cc=heinzm@redhat.com \
--cc=linux-kernel@vger.kernel.org \
--cc=nauman@google.com \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox