All of lore.kernel.org
 help / color / mirror / Atom feed
From: Jens Axboe <axboe@kernel.dk>
To: Kosuke Tatsukawa <tatsu@ab.jp.nec.com>
Cc: "linux-kernel@vger.kernel.org" <linux-kernel@vger.kernel.org>
Subject: Re: [PATCH] blk-mq: fix waitqueue_active without memory barrier in block/blk-mq-tag.c
Date: Fri, 9 Oct 2015 09:11:16 -0600	[thread overview]
Message-ID: <5617D914.4060700@kernel.dk> (raw)
In-Reply-To: <17EC94B0A072C34B8DCF0D30AD16044A02874753@BPXM09GP.gisp.nec.co.jp>

On 10/08/2015 06:35 PM, Kosuke Tatsukawa wrote:
> blk_mq_tag_update_depth() seems to be missing a memory barrier which
> might cause the waker to not notice the waiter and fail to send a
> wake_up as in the following figure.
> 
> 	blk_mq_tag_update_depth			bt_get
> ------------------------------------------------------------------------
> if (waitqueue_active(&bs->wait))
> /* The CPU might reorder the test for
>     the waitqueue up here, before
>     prior writes complete */
> 					prepare_to_wait(&bs->wait, &wait,
> 					  TASK_UNINTERRUPTIBLE);
> 					tag = __bt_get(hctx, bt, last_tag,
> 					  tags);
> 					/* Value set in bt_update_count not
> 					   visible yet */
> bt_update_count(&tags->bitmap_tags, tdepth);
> /* blk_mq_tag_wakeup_all(tags, false); */
>   bt = &tags->bitmap_tags;
>   wake_index = atomic_read(&bt->wake_index);
> 					...
> 					io_schedule();
> ------------------------------------------------------------------------
> 
> This patch adds the missing memory barrier.
> 
> I found this issue when I was looking through the linux source code
> for places calling waitqueue_active() before wake_up*(), but without
> preceding memory barriers, after sending a patch to fix a similar
> issue in drivers/tty/n_tty.c  (Details about the original issue can be
> found here: https://lkml.org/lkml/2015/9/28/849).
> 
> Signed-off-by: Kosuke Tatsukawa <tatsu@ab.jp.nec.com>
> ---
>   block/blk-mq-tag.c |    4 ++++
>   1 files changed, 4 insertions(+), 0 deletions(-)
> 
> diff --git a/block/blk-mq-tag.c b/block/blk-mq-tag.c
> index ed96474..7a6b6e2 100644
> --- a/block/blk-mq-tag.c
> +++ b/block/blk-mq-tag.c
> @@ -75,6 +75,10 @@ void blk_mq_tag_wakeup_all(struct blk_mq_tags *tags, bool include_reserve)
>   	struct blk_mq_bitmap_tags *bt;
>   	int i, wake_index;
>   
> +	/*
> +	 * Make sure all changes prior to this are visible from other CPUs.
> +	 */
> +	smp_mb();
>   	bt = &tags->bitmap_tags;
>   	wake_index = atomic_read(&bt->wake_index);
>   	for (i = 0; i < BT_WAIT_QUEUES; i++) {
> 

Thanks, after looking at this, I think this patch is fine. It's not a
super hot path, so not worth it to further optimize this or look into
ways to avoid the barrier. I do wonder if there are archs where
atomic_read() is a memory barrier, in that case we need not do it at
all. And perhaps we have some weird smp_before_bla variant that could be
used here instead fo improve upon that case.

-- 
Jens Axboe


  reply	other threads:[~2015-10-09 15:11 UTC|newest]

Thread overview: 3+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2015-10-09  0:35 [PATCH] blk-mq: fix waitqueue_active without memory barrier in block/blk-mq-tag.c Kosuke Tatsukawa
2015-10-09 15:11 ` Jens Axboe [this message]
2015-10-10  5:04   ` Kosuke Tatsukawa

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=5617D914.4060700@kernel.dk \
    --to=axboe@kernel.dk \
    --cc=linux-kernel@vger.kernel.org \
    --cc=tatsu@ab.jp.nec.com \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.