From: Pavel Begunkov <asml.silence@gmail.com>
To: Hao Xu <haoxu@linux.alibaba.com>, Jens Axboe <axboe@kernel.dk>
Cc: io-uring@vger.kernel.org, Joseph Qi <joseph.qi@linux.alibaba.com>
Subject: Re: [PATCH] io_uring: fix possible deadlock in io_uring_poll
Date: Wed, 3 Feb 2021 00:04:02 +0000 [thread overview]
Message-ID: <9d60270f-993b-ba83-29a0-ce6582c383e0@gmail.com> (raw)
In-Reply-To: <1612295573-221587-1-git-send-email-haoxu@linux.alibaba.com>
On 02/02/2021 19:52, Hao Xu wrote:
> This might happen if we do epoll_wait on a uring fd while reading/writing
> the former epoll fd in a sqe in the former uring instance.
> So let's don't flush cqring overflow list when we fail to get the uring
> lock. This leads to less accuracy, but is still ok.
if (io_cqring_events(ctx) || test_bit(0, &ctx->cq_check_overflow))
mask |= EPOLLIN | EPOLLRDNORM;
Instead of flushing. It'd make sense if we define poll as "there might
be something, go do your peek/wait with overflow checks". Jens, is that
documented anywhere?
>
> Reported-by: Abaci <abaci@linux.alibaba.com>
> Fixes: 6c503150ae33 ("io_uring: patch up IOPOLL overflow_flush sync")
> Signed-off-by: Hao Xu <haoxu@linux.alibaba.com>
> ---
>
> Here I use mutex_trylock() to fix this issue, but this causes loss of
> accuracy. I think doing cqring overflow flush in a task work maybe a
> better solution. I'm think of this. Any thoughts?
>
> fs/io_uring.c | 31 ++++++++++++++++++++++++++++++-
> 1 file changed, 30 insertions(+), 1 deletion(-)
>
> diff --git a/fs/io_uring.c b/fs/io_uring.c
> index 38c6cbe1ab38..866e45d42ac7 100644
> --- a/fs/io_uring.c
> +++ b/fs/io_uring.c
> @@ -8718,7 +8718,36 @@ static __poll_t io_uring_poll(struct file *file, poll_table *wait)
> smp_rmb();
> if (!io_sqring_full(ctx))
> mask |= EPOLLOUT | EPOLLWRNORM;
> - io_cqring_overflow_flush(ctx, false, NULL, NULL);
> +
> + if (test_bit(0, &ctx->cq_check_overflow)) {
> + bool should_flush = true;
> + /* iopoll syncs against uring_lock, not completion_lock */
> + if (ctx->flags & IORING_SETUP_IOPOLL) {
> + /*
> + * avoid ABBA deadlock.
> + * there could be contention like below:
> + * CPU0 CPU1
> + * ---- ----
> + * lock(&ctx->uring_lock);
> + * lock(&ep->mtx);
> + * lock(&ctx->uring_lock);
> + * lock(&ep->mtx);
> + *
> + * this might happen if we do epoll_wait on a uring fd while
> + * read/write the former epoll fd in a sqe in the former uring
> + * instance.
> + * We don't flush cqring overflow list when we fail to get the
> + * uring lock. This leads to less accuracy, but is still ok.
> + */
> + should_flush = mutex_trylock(&ctx->uring_lock);
> + }
> + if (should_flush) {
> + __io_cqring_overflow_flush(ctx, false, NULL, NULL);
> + if (ctx->flags & IORING_SETUP_IOPOLL)
> + mutex_unlock(&ctx->uring_lock);
> + }
> + }
> +
> if (io_cqring_events(ctx))
> mask |= EPOLLIN | EPOLLRDNORM;
>
>
--
Pavel Begunkov
next prev parent reply other threads:[~2021-02-03 0:08 UTC|newest]
Thread overview: 6+ messages / expand[flat|nested] mbox.gz Atom feed top
2021-02-02 19:52 [PATCH] io_uring: fix possible deadlock in io_uring_poll Hao Xu
2021-02-03 0:04 ` Pavel Begunkov [this message]
2021-02-03 1:48 ` Jens Axboe
2021-02-03 16:48 ` Pavel Begunkov
2021-02-04 16:48 ` Pavel Begunkov
2021-02-04 16:54 ` Hao Xu
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=9d60270f-993b-ba83-29a0-ce6582c383e0@gmail.com \
--to=asml.silence@gmail.com \
--cc=axboe@kernel.dk \
--cc=haoxu@linux.alibaba.com \
--cc=io-uring@vger.kernel.org \
--cc=joseph.qi@linux.alibaba.com \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox