From: Chris Leech <cleech-H+wXaHxf7aLQT0dZR+AlfA@public.gmane.org>
To: "Guilherme G. Piccoli"
<gpiccoli-23VcF4HTsmIX0ybBhKVfKdBPR1lH4CV8@public.gmane.org>
Cc: Or Gerlitz <gerlitz.or-Re5JQEeQqe8AvxtiuMwx3w@public.gmane.org>,
lduncan-IBi9RG/b67k@public.gmane.org,
Mike Christie <michaelc-hcNo3dDEHLuVc3sceRu5cw@public.gmane.org>,
open-iscsi-/JYPxA39Uh5TLH3MbocFFw@public.gmane.org,
Sagi Grimberg
<sagig-LDSdmyG8hGV8YrgS2mwiifqBs+8SCbDb@public.gmane.org>,
linux-scsi-u79uwXL29TY76Z2rM5mHXA@public.gmane.org,
Or Gerlitz <ogerlitz-VPRAkNaXOzVWk0Htik3J/w@public.gmane.org>,
Shlomo Pongratz <shlomop-VPRAkNaXOzVWk0Htik3J/w@public.gmane.org>
Subject: Re: [PATCH 1/1] iscsi: fix regression caused by session lock patch
Date: Mon, 6 Feb 2017 12:27:49 -0500 (EST) [thread overview]
Message-ID: <1976057129.23970152.1486402069647.JavaMail.zimbra@redhat.com> (raw)
In-Reply-To: <631008bd-1e05-2c88-b153-695c76128eb4-23VcF4HTsmIX0ybBhKVfKdBPR1lH4CV8@public.gmane.org>
----- Original Message -----
> On 09/11/2016 03:21, Chris Leech wrote:
> > On Mon, Nov 07, 2016 at 04:23:10PM -0200, Guilherme G. Piccoli wrote:
> >>
> >> Sure! Count on us to test any patches. I guess the first step is to
> >> reproduce on upstream right? We haven't tested specifically this
> >> scenario for long time. Will try to reproduce on 4.9-rc4 and update here.
> >
> > Great, I'm looking forward to hearing the result.
> >
> > Assuming it reproduces, I don't think this level of fine grained locking
> > is necessarily the best solution, but it may help confirm the problem.
> > Especially if the WARN_ONCE I slipped in here triggers.
>
> Chris, sorry for my huge delay.
> Finally I was able to perform tests and I have good news - seems your
> patch below fixed the issue.
Thanks for the testing, looks like you have the magic target to reproduce this.
I think this verifies what Mike's idea of what was going wrong, and we're way overdue to get this fixed upstream. Thanks to IBM for pushing this, I don't think any major distro is shipping this patch and we don't want to keep having to back it out.
The options look like
1) back out the session lock changes that split it into two locks
2) add in the additional locking from this test patch
3) some other fix for the issue of targets that complete tasks oddly
I'm leaning to #1, as I don't want to keep adding more locks for this.
Sagi, Or, Shlomo? You pushed to keep this from being backed out before. Here's your cause, any better ideas on fixing it? I also tried to go back in the mailing list archives, but I don't see any real numbers for the performance gains.
- Chris
> Firstly, I was able to reproduce with kernel 4.10-rc6. See the file
> repro.out - it's a dump from xmon, the kernel debugger from PowerPC.
> With this tool we can dump the exception details, registers, PACA
> (Processor Address Communication Area, ppc specific structure) and
> dmesg. It took me less than 15 minutes to reproduce.
>
> Then, I applied your patch on the top of this kernel and the benchmark
> was able to successfully complete, in about 3 hours. We can see the
> WARN() you added was reached, the attached file dmesg-cleech_patch shows
> the kernel log with your patch.
>
> The workload was FIO benchmark doing both reads and writes to the remote
> storage via iSCSI, connected over ethernet direct cabling (10Gb speed).
> Distro was Ubuntu 16.04.1 .
>
> Any more tests or info you need, please let me know!
> Cheers,
>
>
> Guilherme
>
>
> > - Chris
> >
> > ---
> >
> > diff --git a/drivers/scsi/libiscsi.c b/drivers/scsi/libiscsi.c
> > index f9b6fba..fbd18ab 100644
> > --- a/drivers/scsi/libiscsi.c
> > +++ b/drivers/scsi/libiscsi.c
> > @@ -560,8 +560,12 @@ static void iscsi_complete_task(struct iscsi_task
> > *task, int state)
> > WARN_ON_ONCE(task->state == ISCSI_TASK_FREE);
> > task->state = state;
> >
> > - if (!list_empty(&task->running))
> > + spin_lock_bh(&conn->taskqueuelock);
> > + if (!list_empty(&task->running)) {
> > + WARN_ONCE(1, "iscsi_complete_task while task on list");
> > list_del_init(&task->running);
> > + }
> > + spin_unlock_bh(&conn->taskqueuelock);
> >
> > if (conn->task == task)
> > conn->task = NULL;
> > @@ -783,7 +787,9 @@ __iscsi_conn_send_pdu(struct iscsi_conn *conn, struct
> > iscsi_hdr *hdr,
> > if (session->tt->xmit_task(task))
> > goto free_task;
> > } else {
> > + spin_lock_bh(&conn->taskqueuelock);
> > list_add_tail(&task->running, &conn->mgmtqueue);
> > + spin_unlock_bh(&conn->taskqueuelock);
> > iscsi_conn_queue_work(conn);
> > }
> >
> > @@ -1474,8 +1480,10 @@ void iscsi_requeue_task(struct iscsi_task *task)
> > * this may be on the requeue list already if the xmit_task callout
> > * is handling the r2ts while we are adding new ones
> > */
> > + spin_lock_bh(&conn->taskqueuelock);
> > if (list_empty(&task->running))
> > list_add_tail(&task->running, &conn->requeue);
> > + spin_unlock_bh(&conn->taskqueuelock);
> > iscsi_conn_queue_work(conn);
> > }
> > EXPORT_SYMBOL_GPL(iscsi_requeue_task);
> > @@ -1512,22 +1520,26 @@ static int iscsi_data_xmit(struct iscsi_conn *conn)
> > * only have one nop-out as a ping from us and targets should not
> > * overflow us with nop-ins
> > */
> > + spin_lock_bh(&conn->taskqueuelock);
> > check_mgmt:
> > while (!list_empty(&conn->mgmtqueue)) {
> > conn->task = list_entry(conn->mgmtqueue.next,
> > struct iscsi_task, running);
> > list_del_init(&conn->task->running);
> > + spin_unlock_bh(&conn->taskqueuelock);
> > if (iscsi_prep_mgmt_task(conn, conn->task)) {
> > /* regular RX path uses back_lock */
> > spin_lock_bh(&conn->session->back_lock);
> > __iscsi_put_task(conn->task);
> > spin_unlock_bh(&conn->session->back_lock);
> > conn->task = NULL;
> > + spin_lock_bh(&conn->taskqueuelock);
> > continue;
> > }
> > rc = iscsi_xmit_task(conn);
> > if (rc)
> > goto done;
> > + spin_lock_bh(&conn->taskqueuelock);
> > }
> >
> > /* process pending command queue */
> > @@ -1535,19 +1547,24 @@ static int iscsi_data_xmit(struct iscsi_conn *conn)
> > conn->task = list_entry(conn->cmdqueue.next, struct iscsi_task,
> > running);
> > list_del_init(&conn->task->running);
> > + spin_unlock_bh(&conn->taskqueuelock);
> > if (conn->session->state == ISCSI_STATE_LOGGING_OUT) {
> > fail_scsi_task(conn->task, DID_IMM_RETRY);
> > + spin_lock_bh(&conn->taskqueuelock);
> > continue;
> > }
> > rc = iscsi_prep_scsi_cmd_pdu(conn->task);
> > if (rc) {
> > if (rc == -ENOMEM || rc == -EACCES) {
> > + spin_lock_bh(&conn->taskqueuelock);
> > list_add_tail(&conn->task->running,
> > &conn->cmdqueue);
> > conn->task = NULL;
> > + spin_unlock_bh(&conn->taskqueuelock);
> > goto done;
> > } else
> > fail_scsi_task(conn->task, DID_ABORT);
> > + spin_lock_bh(&conn->taskqueuelock);
> > continue;
> > }
> > rc = iscsi_xmit_task(conn);
> > @@ -1558,6 +1575,7 @@ static int iscsi_data_xmit(struct iscsi_conn *conn)
> > * we need to check the mgmt queue for nops that need to
> > * be sent to aviod starvation
> > */
> > + spin_lock_bh(&conn->taskqueuelock);
> > if (!list_empty(&conn->mgmtqueue))
> > goto check_mgmt;
> > }
> > @@ -1577,12 +1595,15 @@ static int iscsi_data_xmit(struct iscsi_conn *conn)
> > conn->task = task;
> > list_del_init(&conn->task->running);
> > conn->task->state = ISCSI_TASK_RUNNING;
> > + spin_unlock_bh(&conn->taskqueuelock);
> > rc = iscsi_xmit_task(conn);
> > if (rc)
> > goto done;
> > + spin_lock_bh(&conn->taskqueuelock);
> > if (!list_empty(&conn->mgmtqueue))
> > goto check_mgmt;
> > }
> > + spin_unlock_bh(&conn->taskqueuelock);
> > spin_unlock_bh(&conn->session->frwd_lock);
> > return -ENODATA;
> >
> > @@ -1738,7 +1759,9 @@ int iscsi_queuecommand(struct Scsi_Host *host, struct
> > scsi_cmnd *sc)
> > goto prepd_reject;
> > }
> > } else {
> > + spin_lock_bh(&conn->taskqueuelock);
> > list_add_tail(&task->running, &conn->cmdqueue);
> > + spin_unlock_bh(&conn->taskqueuelock);
> > iscsi_conn_queue_work(conn);
> > }
> >
> > @@ -2897,6 +2920,7 @@ iscsi_conn_setup(struct iscsi_cls_session
> > *cls_session, int dd_size,
> > INIT_LIST_HEAD(&conn->mgmtqueue);
> > INIT_LIST_HEAD(&conn->cmdqueue);
> > INIT_LIST_HEAD(&conn->requeue);
> > + spin_lock_init(&conn->taskqueuelock);
> > INIT_WORK(&conn->xmitwork, iscsi_xmitworker);
> >
> > /* allocate login_task used for the login/text sequences */
> > diff --git a/include/scsi/libiscsi.h b/include/scsi/libiscsi.h
> > index 4d1c46a..c7b1dc7 100644
> > --- a/include/scsi/libiscsi.h
> > +++ b/include/scsi/libiscsi.h
> > @@ -196,6 +196,7 @@ struct iscsi_conn {
> > struct iscsi_task *task; /* xmit task in progress */
> >
> > /* xmit */
> > + spinlock_t taskqueuelock; /* protects the next three lists */
> > struct list_head mgmtqueue; /* mgmt (control) xmit queue */
> > struct list_head cmdqueue; /* data-path cmd queue */
> > struct list_head requeue; /* tasks needing another run */
> >
>
--
You received this message because you are subscribed to the Google Groups "open-iscsi" group.
To unsubscribe from this group and stop receiving emails from it, send an email to open-iscsi+unsubscribe-/JYPxA39Uh5TLH3MbocFF+G/Ez6ZCGd0@public.gmane.org
To post to this group, send email to open-iscsi-/JYPxA39Uh5TLH3MbocFF+G/Ez6ZCGd0@public.gmane.org
Visit this group at https://groups.google.com/group/open-iscsi.
For more options, visit https://groups.google.com/d/optout.
next prev parent reply other threads:[~2017-02-06 17:27 UTC|newest]
Thread overview: 21+ messages / expand[flat|nested] mbox.gz Atom feed top
2015-11-12 5:05 [PATCH 1/1] iscsi: fix regression caused by session lock patch mchristi
2015-11-12 12:03 ` Sagi Grimberg
2015-11-12 20:58 ` Mike Christie
2015-11-13 15:06 ` Or Gerlitz
2015-11-13 16:51 ` Mike Christie
2015-11-15 10:10 ` Or Gerlitz
[not found] ` <CAJ3xEMhQiywXo0=kRO7f=fW--1kc6mbNs_X7wLoYtXmRWeqBkg-JsoAwUIsXosN+BqQ9rBEUg@public.gmane.org>
2015-11-16 17:30 ` Michael Christie
2015-11-17 16:55 ` Or Gerlitz
2015-11-18 11:30 ` Or Gerlitz
[not found] ` <CAJ3xEMiu4XBO2d1oLnrgay1uLQmY871n9Kn-yp73PAkfKNnp9A-JsoAwUIsXosN+BqQ9rBEUg@public.gmane.org>
2015-11-18 18:39 ` Mike Christie
2016-11-07 18:15 ` Chris Leech
[not found] ` <20161107181556.cnhwst4nu63xtrqk-r8IHplWLGbA5tHQWs+pTeqPFFGjUI2lm2LY78lusg7I@public.gmane.org>
2016-11-07 18:23 ` Guilherme G. Piccoli
[not found] ` <5820C68E.6050206-23VcF4HTsmIX0ybBhKVfKdBPR1lH4CV8@public.gmane.org>
2016-11-09 5:21 ` Chris Leech
[not found] ` <20161109052142.j4psips7yvx7uohx-r8IHplWLGbA5tHQWs+pTeqPFFGjUI2lm2LY78lusg7I@public.gmane.org>
2016-11-12 1:51 ` Guilherme G. Piccoli
2017-02-06 13:19 ` Guilherme G. Piccoli
[not found] ` <631008bd-1e05-2c88-b153-695c76128eb4-23VcF4HTsmIX0ybBhKVfKdBPR1lH4CV8@public.gmane.org>
2017-02-06 17:27 ` Chris Leech [this message]
[not found] ` <1976057129.23970152.1486402069647.JavaMail.zimbra-H+wXaHxf7aLQT0dZR+AlfA@public.gmane.org>
2017-02-06 18:24 ` Guilherme G. Piccoli
2017-02-06 19:22 ` Sagi Grimberg
2015-11-12 21:33 ` Chris Leech
2016-01-22 16:50 ` Brian King
2016-01-22 19:11 ` Mike Christie
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=1976057129.23970152.1486402069647.JavaMail.zimbra@redhat.com \
--to=cleech-h+wxahxf7alqt0dzr+alfa@public.gmane.org \
--cc=gerlitz.or-Re5JQEeQqe8AvxtiuMwx3w@public.gmane.org \
--cc=gpiccoli-23VcF4HTsmIX0ybBhKVfKdBPR1lH4CV8@public.gmane.org \
--cc=lduncan-IBi9RG/b67k@public.gmane.org \
--cc=linux-scsi-u79uwXL29TY76Z2rM5mHXA@public.gmane.org \
--cc=michaelc-hcNo3dDEHLuVc3sceRu5cw@public.gmane.org \
--cc=ogerlitz-VPRAkNaXOzVWk0Htik3J/w@public.gmane.org \
--cc=open-iscsi-/JYPxA39Uh5TLH3MbocFFw@public.gmane.org \
--cc=sagig-LDSdmyG8hGV8YrgS2mwiifqBs+8SCbDb@public.gmane.org \
--cc=shlomop-VPRAkNaXOzVWk0Htik3J/w@public.gmane.org \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).