From: Ming Lei <ming.lei@redhat.com>
To: Uday Shankar <ushankar@purestorage.com>
Cc: Jens Axboe <axboe@kernel.dk>,
linux-block@vger.kernel.org,
Caleb Sander Mateos <csander@purestorage.com>
Subject: Re: [PATCH V2 4/8] ublk: move device reset into ublk_ch_release()
Date: Thu, 17 Apr 2025 07:31:11 +0800 [thread overview]
Message-ID: <aAA9v03LmUzUnf3P@fedora> (raw)
In-Reply-To: <Z//+0vPyo1/4spfx@dev-ushankar.dev.purestorage.com>
On Wed, Apr 16, 2025 at 01:02:42PM -0600, Uday Shankar wrote:
> On Wed, Apr 16, 2025 at 11:54:38AM +0800, Ming Lei wrote:
> > ublk_ch_release() is called after ublk char device is closed, when all
> > uring_cmd are done, so it is perfect fine to move ublk device reset to
> > ublk_ch_release() from ublk_ctrl_start_recovery().
> >
> > This way can avoid to grab the exiting daemon task_struct too long.
> >
> > However, reset of the following ublk IO flags has to be moved until ublk
> > io_uring queues are ready:
> >
> > - ubq->canceling
> >
> > For requeuing IO in case of ublk_nosrv_dev_should_queue_io() before device
> > is recovered
> >
> > - ubq->fail_io
> >
> > For failing IO in case of UBLK_F_USER_RECOVERY_FAIL_IO before device is
> > recovered
> >
> > - ublk_io->flags
> >
> > For preventing using io->cmd
> >
> > With this way, recovery is simplified a lot.
> >
> > Signed-off-by: Ming Lei <ming.lei@redhat.com>
> > ---
> > drivers/block/ublk_drv.c | 121 +++++++++++++++++++++++----------------
> > 1 file changed, 72 insertions(+), 49 deletions(-)
> >
> > diff --git a/drivers/block/ublk_drv.c b/drivers/block/ublk_drv.c
> > index a479969fd77e..1fe39cf85b2f 100644
> > --- a/drivers/block/ublk_drv.c
> > +++ b/drivers/block/ublk_drv.c
> > @@ -1074,7 +1074,7 @@ static inline struct ublk_uring_cmd_pdu *ublk_get_uring_cmd_pdu(
> >
> > static inline bool ubq_daemon_is_dying(struct ublk_queue *ubq)
> > {
> > - return ubq->ubq_daemon->flags & PF_EXITING;
> > + return !ubq->ubq_daemon || ubq->ubq_daemon->flags & PF_EXITING;
> > }
> >
> > /* todo: handle partial completion */
> > @@ -1470,6 +1470,37 @@ static const struct blk_mq_ops ublk_mq_ops = {
> > .timeout = ublk_timeout,
> > };
> >
> > +static void ublk_queue_reinit(struct ublk_device *ub, struct ublk_queue *ubq)
> > +{
> > + int i;
> > +
> > + /* All old ioucmds have to be completed */
> > + ubq->nr_io_ready = 0;
> > +
> > + /*
> > + * old daemon is PF_EXITING, put it now
> > + *
> > + * It could be NULL in case of closing one quisced device.
> > + */
> > + if (ubq->ubq_daemon)
> > + put_task_struct(ubq->ubq_daemon);
> > + /* We have to reset it to NULL, otherwise ub won't accept new FETCH_REQ */
> > + ubq->ubq_daemon = NULL;
> > + ubq->timeout = false;
> > +
> > + for (i = 0; i < ubq->q_depth; i++) {
> > + struct ublk_io *io = &ubq->ios[i];
> > +
> > + /*
> > + * UBLK_IO_FLAG_CANCELED is kept for avoiding to touch
> > + * io->cmd
> > + */
> > + io->flags &= UBLK_IO_FLAG_CANCELED;
> > + io->cmd = NULL;
> > + io->addr = 0;
> > + }
> > +}
> > +
> > static int ublk_ch_open(struct inode *inode, struct file *filp)
> > {
> > struct ublk_device *ub = container_of(inode->i_cdev,
> > @@ -1481,10 +1512,26 @@ static int ublk_ch_open(struct inode *inode, struct file *filp)
> > return 0;
> > }
> >
> > +static void ublk_reset_ch_dev(struct ublk_device *ub)
> > +{
> > + int i;
> > +
> > + for (i = 0; i < ub->dev_info.nr_hw_queues; i++)
> > + ublk_queue_reinit(ub, ublk_get_queue(ub, i));
> > +
> > + /* set to NULL, otherwise new ubq_daemon cannot mmap the io_cmd_buf */
> > + ub->mm = NULL;
> > + ub->nr_queues_ready = 0;
> > + ub->nr_privileged_daemon = 0;
> > +}
> > +
> > static int ublk_ch_release(struct inode *inode, struct file *filp)
> > {
> > struct ublk_device *ub = filp->private_data;
> >
> > + /* all uring_cmd has been done now, reset device & ubq */
> > + ublk_reset_ch_dev(ub);
>
> Can we take the ub->mutex here? I can't see a concrete data race but I
> feel there is lots of potential for one since the device still may be
> concurrently accessed via ctrl commands.
Actually it isn't needed:
- only the two RECOVERY commands may touch this data, but START_USER_RECOVERY waits
until close is done.
- for other commands, this data won't be touched, and just the state update
need to be protected by ub->mutex, which is always true. ublk_cancel_dev() is one
exception, but ubq->cancel_lock provides the protection.
thanks,
Ming
next prev parent reply other threads:[~2025-04-16 23:31 UTC|newest]
Thread overview: 16+ messages / expand[flat|nested] mbox.gz Atom feed top
2025-04-16 3:54 [PATCH V2 0/8] ublk: simplify & improve IO canceling Ming Lei
2025-04-16 3:54 ` [PATCH V2 1/8] ublk: properly serialize all FETCH_REQs Ming Lei
2025-04-16 3:54 ` [PATCH V2 2/8] ublk: add ublk_force_abort_dev() Ming Lei
2025-04-16 3:54 ` [PATCH V2 3/8] ublk: rely on ->canceling for dealing with ublk_nosrv_dev_should_queue_io Ming Lei
2025-04-16 18:35 ` Uday Shankar
2025-04-16 3:54 ` [PATCH V2 4/8] ublk: move device reset into ublk_ch_release() Ming Lei
2025-04-16 19:02 ` Uday Shankar
2025-04-16 23:31 ` Ming Lei [this message]
2025-04-16 3:54 ` [PATCH V2 5/8] ublk: improve detection and handling of ublk server exit Ming Lei
2025-04-16 19:09 ` Uday Shankar
2025-04-16 3:54 ` [PATCH V2 6/8] ublk: remove __ublk_quiesce_dev() Ming Lei
2025-04-16 3:54 ` [PATCH V2 7/8] ublk: simplify aborting ublk request Ming Lei
2025-04-16 3:54 ` [PATCH V2 8/8] selftests: ublk: add generic_06 for covering fault inject Ming Lei
2025-04-16 18:45 ` [PATCH V2 0/8] ublk: simplify & improve IO canceling Jens Axboe
2025-04-17 0:03 ` Ming Lei
2025-04-17 1:33 ` Jens Axboe
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=aAA9v03LmUzUnf3P@fedora \
--to=ming.lei@redhat.com \
--cc=axboe@kernel.dk \
--cc=csander@purestorage.com \
--cc=linux-block@vger.kernel.org \
--cc=ushankar@purestorage.com \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.