From: Lars Ellenberg <lars.ellenberg@linbit.com>
To: Ming Lei <ming.lei@canonical.com>
Cc: linux-block <linux-block@vger.kernel.org>,
Roland Kammerer <roland.kammerer@linbit.com>,
Jens Axboe <axboe@kernel.dk>, NeilBrown <neilb@suse.com>,
Kent Overstreet <kent.overstreet@gmail.com>,
Shaohua Li <shli@kernel.org>, Alasdair Kergon <agk@redhat.com>,
Mike Snitzer <snitzer@redhat.com>,
"open list:DEVICE-MAPPER (LVM)" <dm-devel@redhat.com>,
Ingo Molnar <mingo@redhat.com>,
Peter Zijlstra <peterz@infradead.org>,
Takashi Iwai <tiwai@suse.de>, Jiri Kosina <jkosina@suse.cz>,
Zheng Liu <gnehzuil.liu@gmail.com>,
Keith Busch <keith.busch@intel.com>,
"Martin K. Petersen" <martin.petersen@oracle.com>,
"Kirill A. Shutemov" <kirill.shutemov@linux.intel.com>,
Linux Kernel Mailing List <linux-kernel@vger.kernel.org>,
"open list:BCACHE (BLOCK LAYER CACHE)"
<linux-bcache@vger.kernel.org>,
"open list:SOFTWARE RAID (Multiple Disks) SUPPORT"
<linux-raid@vger.kernel.org>
Subject: Re: [RFC] block: fix blk_queue_split() resource exhaustion
Date: Wed, 6 Jul 2016 14:38:41 +0200 [thread overview]
Message-ID: <20160706123841.GA13335@soda.linbit> (raw)
In-Reply-To: <CACVXFVNTKsRQ0PQTL47-ceP-KqMR-zR4DO1v-ey5NTUYv8bUog@mail.gmail.com>
On Mon, Jul 04, 2016 at 06:47:29PM +0800, Ming Lei wrote:
> >> One clean solution may be to convert the loop of generic_make_request()
> >> into the following way:
> >>
> >> do {
> >> struct bio *splitted, *remainder = NULL;
> >> struct request_queue *q = bdev_get_queue(bio->bi_bdev);
> >>
> >> blk_queue_split(q, &bio, &remainder, q->bio_split);
> >>
> >> ret = q->make_request_fn(q, bio);
> >>
> >> if (remainder)
> >> bio_list_add(current->bio_list, remainder);
> >> bio = bio_list_pop(current->bio_list);
> >> } while (bio)
> >
> > Not good enough.
> > Goal was to first process all "deeper level" bios
> > before processing the remainder.
>
> For the reported bio splitting issue, I think the goal is to make sure all
> BIOs generated from 'bio' inside .make_request_fn(bio) are queued
> before the 'current' remainder. Cause it is the issue introduced by
> blk_split_bio().
Stacking:
qA (limitting layer)
-> qB (remapping)
-> qC (remapping)
-> qD (hardware)
[in fact you don't even need four layers,
this is just to clarify that the stack may be more complex than you
assume it would be]
Columns below:
1) argument to generic_make_request() and its target queue.
2) current->bio_list
3) "in-flight" counter of qA.
==== In your new picture, iiuc:
generic_make_request(bio_orig)
NULL in-flight=0
bio_orig empty
blk_queue_split()
result:
bio_s, bio_r
qA->make_request_fn(bio_s)
in-flight=1
bio_c = bio_clone(bio_s)
generic_make_request(bio_c to qB)
bio_c
<-return
bio_list_add(bio_r)
bio_c, bio_r
bio_list_pop()
bio_r
qB->make_request_fn(bio_c)
(Assume it does not clone, but only remap.
But it may also be a striping layer,
and queue more than one bio here.)
generic_make_request(bio_c to qC)
bio_r, bio_c
<-return
bio_list_pop()
bio_c
qA->make_request_fn(bio_r) in-flight still 1
BLOCKS, because bio_c has not been processed to its final
destination qD yet, and not dispatched to hardware.
==== my suggestion
generic_make_request(bio_orig)
NULL in-flight=0
bio_orig empty in-flight=0
qA->make_request_fn(bio_orig)
blk_queue_split()
result:
bio_s, and bio_r stuffed away to head of remainder list.
in-flight=1
bio_c = bio_clone(bio_s)
generic_make_request(bio_c to qB)
bio_c
<-return
bio_c
bio_list_pop()
empty
qB->make_request_fn(bio_c)
(Assume it does not clone, but only remap.
But it may also be a striping layer,
and queue more than one bio here.)
generic_make_request(bio_c to qC)
bio_c
<-return
bio_list_pop()
empty
qC->make_request_fn(bio_c)
generic_make_request(bio_c to qD)
bio_c
<-return
bio_list_pop()
empty
qD->make_request_fn(bio_c)
dispatches to hardware
<-return
empty
bio_list_pop()
NULL, great, lets pop from remainder list
qA->make_request_fn(bio_r) in-flight=?
May block, but only until completion of bio_c.
Which may already have happened.
*makes progress*
Thanks,
Lars
next prev parent reply other threads:[~2016-07-06 12:38 UTC|newest]
Thread overview: 36+ messages / expand[flat|nested] mbox.gz Atom feed top
2016-06-22 8:22 [RFC] block: fix blk_queue_split() resource exhaustion Lars Ellenberg
2016-06-22 8:22 ` Lars Ellenberg
2016-06-24 11:36 ` Ming Lei
2016-06-24 14:27 ` Lars Ellenberg
2016-06-24 15:15 ` Mike Snitzer
2016-06-28 8:24 ` Lars Ellenberg
2016-06-25 9:30 ` [RFC] " Ming Lei
2016-06-28 8:45 ` Lars Ellenberg
2016-07-02 10:03 ` Ming Lei
2016-07-02 10:28 ` Ming Lei
2016-07-04 8:20 ` Lars Ellenberg
2016-07-04 10:47 ` Ming Lei
2016-07-06 12:38 ` Lars Ellenberg [this message]
2016-07-06 15:57 ` Ming Lei
2016-07-07 8:03 ` Lars Ellenberg
2016-07-07 8:03 ` Lars Ellenberg
2016-07-07 13:14 ` Ming Lei
2016-07-07 5:35 ` [dm-devel] " NeilBrown
2016-07-07 5:35 ` NeilBrown
2016-07-07 8:16 ` Lars Ellenberg
2016-07-07 12:39 ` Lars Ellenberg
2016-07-07 12:39 ` Lars Ellenberg
2016-07-07 12:47 ` Mike Snitzer
2016-07-07 22:07 ` [dm-devel] [RFC] " NeilBrown
2016-07-08 8:02 ` Lars Ellenberg
2016-07-08 9:39 ` NeilBrown
2016-07-08 13:00 ` Lars Ellenberg
2016-07-08 13:24 ` Re[2]: " Pavel Goran
2016-07-08 13:59 ` Lars Ellenberg
2016-07-08 11:08 ` Ming Lei
2016-07-08 11:08 ` Ming Lei
2016-07-08 12:52 ` Lars Ellenberg
2016-07-08 13:05 ` Mike Snitzer
2016-07-07 12:45 ` Mike Snitzer
2016-07-07 22:40 ` NeilBrown
2016-07-07 14:36 ` Mike Snitzer
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=20160706123841.GA13335@soda.linbit \
--to=lars.ellenberg@linbit.com \
--cc=agk@redhat.com \
--cc=axboe@kernel.dk \
--cc=dm-devel@redhat.com \
--cc=gnehzuil.liu@gmail.com \
--cc=jkosina@suse.cz \
--cc=keith.busch@intel.com \
--cc=kent.overstreet@gmail.com \
--cc=kirill.shutemov@linux.intel.com \
--cc=linux-bcache@vger.kernel.org \
--cc=linux-block@vger.kernel.org \
--cc=linux-kernel@vger.kernel.org \
--cc=linux-raid@vger.kernel.org \
--cc=martin.petersen@oracle.com \
--cc=ming.lei@canonical.com \
--cc=mingo@redhat.com \
--cc=neilb@suse.com \
--cc=peterz@infradead.org \
--cc=roland.kammerer@linbit.com \
--cc=shli@kernel.org \
--cc=snitzer@redhat.com \
--cc=tiwai@suse.de \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.