From: ming.lei@redhat.com (Ming Lei)
Subject: [PATCH 2/2] blk-mq: simplify queue mapping & schedule with each possisble CPU
Date: Fri, 26 Jan 2018 17:31:38 +0800
Message-ID: <20180126093129.GA10926@ming.t460p>
In-Reply-To: <53da00dc-3d46-dcdb-2be4-277f79a9888b@oracle.com>
Hi Jianchao,
On Fri, Jan 19, 2018 at 11:05:35AM +0800, jianchao.wang wrote:
> Hi ming
>
> Sorry for delayed report this.
>
> On 01/17/2018 05:57 PM, Ming Lei wrote:
> > 2) hctx->next_cpu can become offline from online before __blk_mq_run_hw_queue
> > is run, there isn't warning, but once the IO is submitted to hardware,
> > after it is completed, how does the HBA/hw queue notify CPU since CPUs
> > assigned to this hw queue(irq vector) are offline? blk-mq's timeout
> > handler may cover that, but looks too tricky.
>
> In theory, the irq affinity will be migrated to other cpu. This is done by
Yes, but the other CPU should belong to this irq's affinity, and if all
CPUs in the irq's affinity are DEAD, this irq vector will be shut down.
If there is in-flight IO, or there will be, the completions for these
IOs won't be delivered to any CPU. For now it seems we depend on the
queue's timeout handler to handle them.
> fixup_irqs() in the context of stop_machine.
> However, in my test, I found this log:
>
> [ 267.161043] do_IRQ: 7.33 No irq handler for vector
>
> The 33 is the vector used by nvme cq.
> The irq seems to be missed and sometimes IO hang occurred.
As I mentioned above, it isn't surprising to see this in a CPU
offline/online stress test.
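To make the case above concrete, here is a toy model in Python, not
kernel code. The function names are hypothetical, and picking the
lowest-numbered online CPU is only an assumption made for the sketch.
It shows the one point that matters here: once no CPU in the vector's
affinity is online, no interrupt can be delivered and only the timeout
path is left to finish the request.

```python
# Toy model only: these names are hypothetical and are not kernel APIs.

def route_irq(affinity, online):
    """Return the CPU that would take the interrupt, or None when every
    CPU in the vector's affinity is offline (vector is shut down)."""
    candidates = sorted(set(affinity) & set(online))
    # Assumption for the sketch: pick the lowest-numbered online CPU.
    return candidates[0] if candidates else None


def complete_io(affinity, online):
    """Say which path finishes an in-flight request on this vector."""
    cpu = route_irq(affinity, online)
    if cpu is None:
        # No CPU can receive the completion interrupt any more.
        return "timeout handler"
    return "irq on cpu %d" % cpu
```

With affinity {2, 3}, taking CPU 2 offline still leaves CPU 3 to
receive the completion; taking both offline leaves only the timeout
handler.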
--
Ming