All of lore.kernel.org
 help / color / mirror / Atom feed
From: Vivek Goyal <vgoyal@redhat.com>
To: lina <lulina_nuaa@foxmail.com>
Cc: linux kernel mailing list <linux-kernel@vger.kernel.org>
Subject: Re: blk-throttle.c : When limit is changed, must start a new slice
Date: Fri, 4 Mar 2011 14:30:06 -0500	[thread overview]
Message-ID: <20110304193006.GF5466@redhat.com> (raw)
In-Reply-To: <tencent_323C67E7636EA7895ACB367F@qq.com>

On Sat, Mar 05, 2011 at 12:30:55AM +0800, lina wrote:
> Hi Vivek,    
>      Take you a litter time to read this letter, I think there seens  to 
>  be a small bug in blk-throttle.c.
>      This might happen that initially cgroup limit was greater than the 
>  device's physic ability, but later limit was under physic ability. In this
>  case, all of the bio in the device was throttled for serveral minutes. 
>   
>      I did some analysis as the following:
>      First, setting a very large bps on device lead all of the bio go
>  through. The slice is begin when test begin, and only extend but 
>  can not start new one. 
>      Second, change the limit under physic ability to make one bio 
>  queued. Once one bio queued, blk_throtl_bio func will call 
>  tg_update_disptime to estimate the delay time for the throtl_work.
>      As the slice is very old, there is a very large value in 
>  tg->bytes_disp[rw], and the tg->disptime is a long time after jiffies. 
>  During this time, all of the bio is queued. And the work_queue can 
>  not start, so tg->slice_start[rw] still can not be reset.
>   
>      Although after serveral minutes everything will be ok, but it still
>  seens no-good for users.
>      
>      I think it should start a new slice when the limit is changed. 
>      Here is my patch, please conrrect it if there is something wrong
>  follow.

CCing to lkml. Lets keep all the testing and bug reports regarding
blkio throttling on mailing list.

thanks for the bug report lina. I think this is a bug. I am not too keen
on restarting slice all the time upon limit change as somebody can exploit
that to get higher BW by doing it frequently. Can you try attached patch
and see if it solves your problem.

>   
>  --- block/blk-throttle.c
>  +++ block/blk-throttle.c
>  @@ -1027,6 +1027,9 @@
>           *biop = NULL;  
>           if (update_disptime) {
>  +                 if (tg->limits_changed)
>  +                 {
>  +                         throtl_start_new_slice(td, tg, rw);
>  +                 }
>                   tg_update_disptime(td, tg);
>                   throtl_schedule_next_dispatch(td);
>           }
>   
>   
>       There is two other problems, can you explain them to me? 
>   
>  First:
>       The blkio dir always like 
>   
>       blkio //patent dir
>         ---blkio.weight
>         ---blkio.throtl_write_bps_device
>         ...
>         ---tst1  //child dir
>               ---blkio.weight              ---blkio.throtl_write_bps_device
>               ...
>   
>       The child dir's throtl files like tst1/blkio.throttl_write_bps_device 
>  seens never be use. It maybe better that do not create these files.
>  If they are useful, please tell me. Thank you!

Throttle creates a flag hierarchy of groups internally. So even if you
create child of a child group, child will be treated as if children 
of root and will be throttled as per throttling rules.
>   
>  Second:
>       When apply weight policy to some pid, I must mkdir tst1 like 
>  above. If all of pid in tst1 is end, the pid in tasks will disappear, but
>  the tst1 dir is still here. I think here will be better if tst1 can be 
>  removed automatically. 
>       Throttle policy has no auto-clean capacity. If I create many 
>  linear dm devices, and apply throttle policy on them, then use 
>  'dmsetup remove--all'. Now I must clean the entries one by one in blkio.throtl_write_bps_device manually. If the throttle policy can 
>  clean the entry automatically when the device is no longger in the 
>  system, It seens to be perfect!

i think it is hard to remove rules also when devie is going away. One
easy way is to remove whole cgroup. Other would be that provide a 
simple command to delete all the rules from a file. 

say echo "0" > blkio.throttle.read_bps_device will remove all the rules
from the file. If that helps you, you can write one patch for that.

Thanks
Vivek

---
 block/blk-throttle.c |   13 +++++++++++++
 1 file changed, 13 insertions(+)

Index: linux-2.6/block/blk-throttle.c
===================================================================
--- linux-2.6.orig/block/blk-throttle.c	2011-03-04 13:59:45.000000000 -0500
+++ linux-2.6/block/blk-throttle.c	2011-03-04 14:05:31.542332242 -0500
@@ -1023,6 +1023,19 @@ int blk_throtl_bio(struct request_queue 
 	/* Bio is with-in rate limit of group */
 	if (tg_may_dispatch(td, tg, bio, NULL)) {
 		throtl_charge_bio(tg, bio);
+
+		/*
+		 * We need to trim slice even when bios are not being queued
+		 * otherwise it might happen that a bio is not queued for
+		 * a long time and slice keeps on extending and trim is not
+		 * called for a long time. Now if limits are reduced suddenly
+		 * we take into account all the IO dispatched so far at new
+		 * low rate and * newly queued IO gets a really long dispatch
+		 * time.
+		 *
+		 * So keep on trimming slice even if bio is not queued.
+		 */
+		throtl_trim_slice(td, tg, rw);
 		goto out;
 	}
 


       reply	other threads:[~2011-03-04 19:30 UTC|newest]

Thread overview: 4+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
     [not found] <tencent_323C67E7636EA7895ACB367F@qq.com>
2011-03-04 19:30 ` Vivek Goyal [this message]
     [not found] <tencent_1FCD09C260CD8FFB3D3886AD@qq.com>
2011-03-07 16:07 ` blk-throttle.c : When limit is changed, must start a new slice Vivek Goyal
     [not found] <tencent_5B2F59AF17F4410D6A9CFA34@qq.com>
2011-03-08 14:51 ` Vivek Goyal
     [not found] <tencent_6A5F95FF2112DFE963C44E4E@qq.com>
2011-03-08 20:54 ` Vivek Goyal

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=20110304193006.GF5466@redhat.com \
    --to=vgoyal@redhat.com \
    --cc=linux-kernel@vger.kernel.org \
    --cc=lulina_nuaa@foxmail.com \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.