* ib_srpt status
@ 2011-10-12 9:23 Christoph Hellwig
2011-10-12 9:24 ` Christoph Hellwig
` (2 more replies)
0 siblings, 3 replies; 10+ messages in thread
From: Christoph Hellwig @ 2011-10-12 9:23 UTC (permalink / raw)
To: Roland Dreier, Nicholas A. Bellinger
Cc: Barn Van Assche, target-devel, linux-scsi
Hi Roland, hi Nick,
what is the status of queueing up ib_srpt for mainline? It's a very
clean looking driver that was submitted 8 month ago and after an initial
round of fixes only saw updates for changing core APIs.
Keeping it out of tree for so long is a very discuraging story for
contributors.
^ permalink raw reply [flat|nested] 10+ messages in thread* Re: ib_srpt status 2011-10-12 9:23 ib_srpt status Christoph Hellwig @ 2011-10-12 9:24 ` Christoph Hellwig 2011-10-12 16:17 ` Roland Dreier 2011-10-18 6:13 ` Bart Van Assche 2 siblings, 0 replies; 10+ messages in thread From: Christoph Hellwig @ 2011-10-12 9:24 UTC (permalink / raw) To: Roland Dreier, Nicholas A. Bellinger Cc: Barn Van Assche, target-devel, linux-scsi [addressing to a working address for Roland] On Wed, Oct 12, 2011 at 11:23:18AM +0200, Christoph Hellwig wrote: > Hi Roland, hi Nick, > > what is the status of queueing up ib_srpt for mainline? It's a very > clean looking driver that was submitted 8 month ago and after an initial > round of fixes only saw updates for changing core APIs. > > Keeping it out of tree for so long is a very discuraging story for > contributors. ---end quoted text--- ^ permalink raw reply [flat|nested] 10+ messages in thread
* Re: ib_srpt status 2011-10-12 9:23 ib_srpt status Christoph Hellwig 2011-10-12 9:24 ` Christoph Hellwig @ 2011-10-12 16:17 ` Roland Dreier 2011-10-13 22:07 ` Nicholas A. Bellinger 2011-10-18 6:13 ` Bart Van Assche 2 siblings, 1 reply; 10+ messages in thread From: Roland Dreier @ 2011-10-12 16:17 UTC (permalink / raw) To: Christoph Hellwig Cc: Roland Dreier, Nicholas A. Bellinger, Barn Van Assche, target-devel, linux-scsi On Wed, Oct 12, 2011 at 2:23 AM, Christoph Hellwig <hch@lst.de> wrote: > Hi Roland, hi Nick, > > what is the status of queueing up ib_srpt for mainline? It's a very > clean looking driver that was submitted 8 month ago and after an initial > round of fixes only saw updates for changing core APIs. > > Keeping it out of tree for so long is a very discuraging story for > contributors. Where is the latest patch? I would like to review it and at least get it into linux-next... I seem to recall having some concerns about the interface -- ie some things were done through module params and I don't want to be stuck supporting those module params forever. Other than that it is likely to be fine to merge. - R. ^ permalink raw reply [flat|nested] 10+ messages in thread
* Re: ib_srpt status 2011-10-12 16:17 ` Roland Dreier @ 2011-10-13 22:07 ` Nicholas A. Bellinger 0 siblings, 0 replies; 10+ messages in thread From: Nicholas A. Bellinger @ 2011-10-13 22:07 UTC (permalink / raw) To: Roland Dreier Cc: Christoph Hellwig, Barn Van Assche, target-devel, linux-scsi On Wed, 2011-10-12 at 09:17 -0700, Roland Dreier wrote: > On Wed, Oct 12, 2011 at 2:23 AM, Christoph Hellwig <hch@lst.de> wrote: > > Hi Roland, hi Nick, > > > > what is the status of queueing up ib_srpt for mainline? It's a very > > clean looking driver that was submitted 8 month ago and after an initial > > round of fixes only saw updates for changing core APIs. > > > > Keeping it out of tree for so long is a very discuraging story for > > contributors. > > Where is the latest patch? I would like to review it and at least > get it into linux-next... > > I seem to recall having some concerns about the interface -- ie > some things were done through module params and I don't want > to be stuck supporting those module params forever. > > Other than that it is likely to be fine to merge. > Hey guys, I'll post the latest version of ib_srpt in a bit after merging Christoph's remaining patches.. However, this will be against the v3.2 queued patches in target-pending.git/for-next, as the core cleanups in the se_cmd release path also effect current ib_srpt code. --nab ^ permalink raw reply [flat|nested] 10+ messages in thread
* Re: ib_srpt status 2011-10-12 9:23 ib_srpt status Christoph Hellwig 2011-10-12 9:24 ` Christoph Hellwig 2011-10-12 16:17 ` Roland Dreier @ 2011-10-18 6:13 ` Bart Van Assche 2011-10-18 16:41 ` Nicholas A. Bellinger 2 siblings, 1 reply; 10+ messages in thread From: Bart Van Assche @ 2011-10-18 6:13 UTC (permalink / raw) To: Christoph Hellwig Cc: Roland Dreier, Nicholas A. Bellinger, target-devel, linux-scsi On Wed, Oct 12, 2011 at 11:23 AM, Christoph Hellwig <hch@lst.de> wrote: > what is the status of queueing up ib_srpt for mainline? It's a very > clean looking driver that was submitted 8 month ago and after an initial > round of fixes only saw updates for changing core APIs. The ib_srpt driver was working fine last time I tested it. But with the current (3.1-rc9) LIO core performing I/O triggers grave memory corruption and random kernel crashes. Something must have changed in the LIO core that broke ib_srpt. Bart. ^ permalink raw reply [flat|nested] 10+ messages in thread
* Re: ib_srpt status 2011-10-18 6:13 ` Bart Van Assche @ 2011-10-18 16:41 ` Nicholas A. Bellinger 2011-10-18 16:43 ` Christoph Hellwig 2011-10-19 16:59 ` Bart Van Assche 0 siblings, 2 replies; 10+ messages in thread From: Nicholas A. Bellinger @ 2011-10-18 16:41 UTC (permalink / raw) To: Bart Van Assche Cc: Christoph Hellwig, target-devel, linux-scsi, Roland Dreier On Tue, 2011-10-18 at 08:13 +0200, Bart Van Assche wrote: > On Wed, Oct 12, 2011 at 11:23 AM, Christoph Hellwig <hch@lst.de> wrote: > > what is the status of queueing up ib_srpt for mainline? It's a very > > clean looking driver that was submitted 8 month ago and after an initial > > round of fixes only saw updates for changing core APIs. > > The ib_srpt driver was working fine last time I tested it. But with > the current (3.1-rc9) LIO core performing I/O triggers grave memory > corruption and random kernel crashes. Something must have changed in > the LIO core that broke ib_srpt. > Hi Bart, I'm having no problems with the patch posted for v3.2 review on MT26428 HCAs Can you verify you are actually using this patch, and not your private tree from github..? The reason is that your tree did not contain the bugfix below to ib_srpt code that was originally causing problems: ib_srpt: Fix bug with chainged SGLs in srpt_map_sg_to_ib_sge http://www.risingtidesystems.com/git/?p=lio-core-2.6.git;a=commitdiff;h=ea485147563b6555a97dbf811825fbb586519252 Thanks, --nab ^ permalink raw reply [flat|nested] 10+ messages in thread
* Re: ib_srpt status 2011-10-18 16:41 ` Nicholas A. Bellinger @ 2011-10-18 16:43 ` Christoph Hellwig 2011-10-18 16:48 ` Nicholas A. Bellinger 2011-10-19 16:59 ` Bart Van Assche 1 sibling, 1 reply; 10+ messages in thread From: Christoph Hellwig @ 2011-10-18 16:43 UTC (permalink / raw) To: Nicholas A. Bellinger Cc: Bart Van Assche, Christoph Hellwig, target-devel, linux-scsi, Roland Dreier On Tue, Oct 18, 2011 at 09:41:15AM -0700, Nicholas A. Bellinger wrote: > I'm having no problems with the patch posted for v3.2 review on MT26428 > HCAs > > Can you verify you are actually using this patch, and not your private > tree from github..? The reason is that your tree did not contain the > bugfix below to ib_srpt code that was originally causing problems: > > ib_srpt: Fix bug with chainged SGLs in srpt_map_sg_to_ib_sge > http://www.risingtidesystems.com/git/?p=lio-core-2.6.git;a=commitdiff;h=ea485147563b6555a97dbf811825fbb586519252 Also it would be useful to know the workload that causes the corruption. ^ permalink raw reply [flat|nested] 10+ messages in thread
* Re: ib_srpt status 2011-10-18 16:43 ` Christoph Hellwig @ 2011-10-18 16:48 ` Nicholas A. Bellinger 0 siblings, 0 replies; 10+ messages in thread From: Nicholas A. Bellinger @ 2011-10-18 16:48 UTC (permalink / raw) To: Christoph Hellwig Cc: Bart Van Assche, target-devel, linux-scsi, Roland Dreier On Tue, 2011-10-18 at 18:43 +0200, Christoph Hellwig wrote: > On Tue, Oct 18, 2011 at 09:41:15AM -0700, Nicholas A. Bellinger wrote: > > I'm having no problems with the patch posted for v3.2 review on MT26428 > > HCAs > > > > Can you verify you are actually using this patch, and not your private > > tree from github..? The reason is that your tree did not contain the > > bugfix below to ib_srpt code that was originally causing problems: > > > > ib_srpt: Fix bug with chainged SGLs in srpt_map_sg_to_ib_sge > > http://www.risingtidesystems.com/git/?p=lio-core-2.6.git;a=commitdiff;h=ea485147563b6555a97dbf811825fbb586519252 > > Also it would be useful to know the workload that causes the corruption. >From the above commit: This patch fixes a bug in srpt_map_sg_to_ib_sge() that assumed a single contigious array of struct scatterlist. This was causing a OOPs when se_cmd->t_task->t_tasks_no > 1 was using a set of sg_chained struct se_task->task_sg[] memory into srpt_map_sg_to_ib_sge() code. ^ permalink raw reply [flat|nested] 10+ messages in thread
* Re: ib_srpt status 2011-10-18 16:41 ` Nicholas A. Bellinger 2011-10-18 16:43 ` Christoph Hellwig @ 2011-10-19 16:59 ` Bart Van Assche 2011-10-19 17:38 ` Nicholas A. Bellinger 1 sibling, 1 reply; 10+ messages in thread From: Bart Van Assche @ 2011-10-19 16:59 UTC (permalink / raw) To: Nicholas A. Bellinger Cc: Christoph Hellwig, target-devel, linux-scsi, Roland Dreier On Tue, Oct 18, 2011 at 6:41 PM, Nicholas A. Bellinger <nab@linux-iscsi.org> wrote: > I'm having no problems with the patch posted for v3.2 review on MT26428 > HCAs > > Can you verify you are actually using this patch, and not your private > tree from github..? The reason is that your tree did not contain the > bugfix below to ib_srpt code that was originally causing problems: > > ib_srpt: Fix bug with chainged SGLs in srpt_map_sg_to_ib_sge > http://www.risingtidesystems.com/git/?p=lio-core-2.6.git;a=commitdiff;h=ea485147563b6555a97dbf811825fbb586519252 Yes, that patch was included - sorry, but I had made another mistake. 3.1-rc10 + ib_srpt + patch below is now working fine here. I noticed only one unusual issue so far (with kernel debugging enabled) and that is that the watchdog thread CPU time was unusually high: root 7 2.3 0.0 0 0 ? S 18:39 0:22 [watchdog/0] root 12 0.6 0.0 0 0 ? S 18:39 0:06 [watchdog/1] The patch I used to backport ib_srpt from LIO-4.1 to kernel 3.1-rc10 is: diff --git a/drivers/infiniband/ulp/srpt/ib_srpt.c b/drivers/infiniband/ulp/srpt/ib_srpt.c index 109b412..b3947fe 100644 --- a/drivers/infiniband/ulp/srpt/ib_srpt.c +++ b/drivers/infiniband/ulp/srpt/ib_srpt.c @@ -1340,7 +1340,7 @@ static void srpt_put_send_ioctx(struct srpt_send_ioctx *ioctx) WARN_ON(srpt_get_cmd_state(ioctx) != SRPT_STATE_DONE); srpt_unmap_sg_to_ib_sge(ioctx->ch, ioctx); - transport_generic_free_cmd(&ioctx->cmd, 0); + transport_generic_free_cmd(&ioctx->cmd, 0, 0); if (ioctx->n_rbuf > 1) { kfree(ioctx->rbufs); @@ -1899,7 +1899,7 @@ static void srpt_handle_tsk_mgmt(struct srpt_rdma_ch *ch, TMR_TASK_MGMT_FUNCTION_NOT_SUPPORTED; goto process_tmr; } - cmd->se_tmr_req = core_tmr_alloc_req(cmd, NULL, tcm_tmr, GFP_KERNEL); + cmd->se_tmr_req = core_tmr_alloc_req(cmd, NULL, tcm_tmr); if (!cmd->se_tmr_req) { send_ioctx->cmd.se_cmd_flags |= SCF_SCSI_CDB_EXCEPTION; send_ioctx->cmd.se_tmr_req->response = TMR_FUNCTION_REJECTED; Bart. ^ permalink raw reply related [flat|nested] 10+ messages in thread
* Re: ib_srpt status 2011-10-19 16:59 ` Bart Van Assche @ 2011-10-19 17:38 ` Nicholas A. Bellinger 0 siblings, 0 replies; 10+ messages in thread From: Nicholas A. Bellinger @ 2011-10-19 17:38 UTC (permalink / raw) To: Bart Van Assche Cc: Christoph Hellwig, target-devel, linux-scsi, Roland Dreier On Wed, 2011-10-19 at 18:59 +0200, Bart Van Assche wrote: > On Tue, Oct 18, 2011 at 6:41 PM, Nicholas A. Bellinger > <nab@linux-iscsi.org> wrote: > > I'm having no problems with the patch posted for v3.2 review on MT26428 > > HCAs > > > > Can you verify you are actually using this patch, and not your private > > tree from github..? The reason is that your tree did not contain the > > bugfix below to ib_srpt code that was originally causing problems: > > > > ib_srpt: Fix bug with chainged SGLs in srpt_map_sg_to_ib_sge > > http://www.risingtidesystems.com/git/?p=lio-core-2.6.git;a=commitdiff;h=ea485147563b6555a97dbf811825fbb586519252 > > Yes, that patch was included - sorry, but I had made another mistake. Thanks for the update here. > 3.1-rc10 + ib_srpt + patch below is now working fine here. I noticed > only one unusual issue so far (with kernel debugging enabled) and that > is that the watchdog thread CPU time was unusually high: > > root 7 2.3 0.0 0 0 ? S 18:39 0:22 [watchdog/0] > root 12 0.6 0.0 0 0 ? S 18:39 0:06 [watchdog/1] > > I'm not exactly sure what to make of this.. > The patch I used to backport ib_srpt from LIO-4.1 to kernel 3.1-rc10 is: > > diff --git a/drivers/infiniband/ulp/srpt/ib_srpt.c > b/drivers/infiniband/ulp/srpt/ib_srpt.c > index 109b412..b3947fe 100644 > --- a/drivers/infiniband/ulp/srpt/ib_srpt.c > +++ b/drivers/infiniband/ulp/srpt/ib_srpt.c > @@ -1340,7 +1340,7 @@ static void srpt_put_send_ioctx(struct > srpt_send_ioctx *ioctx) > WARN_ON(srpt_get_cmd_state(ioctx) != SRPT_STATE_DONE); > > srpt_unmap_sg_to_ib_sge(ioctx->ch, ioctx); > - transport_generic_free_cmd(&ioctx->cmd, 0); > + transport_generic_free_cmd(&ioctx->cmd, 0, 0); > > if (ioctx->n_rbuf > 1) { > kfree(ioctx->rbufs); > @@ -1899,7 +1899,7 @@ static void srpt_handle_tsk_mgmt(struct srpt_rdma_ch *ch, > TMR_TASK_MGMT_FUNCTION_NOT_SUPPORTED; > goto process_tmr; > } > - cmd->se_tmr_req = core_tmr_alloc_req(cmd, NULL, tcm_tmr, GFP_KERNEL); > + cmd->se_tmr_req = core_tmr_alloc_req(cmd, NULL, tcm_tmr); > if (!cmd->se_tmr_req) { > send_ioctx->cmd.se_cmd_flags |= SCF_SCSI_CDB_EXCEPTION; > send_ioctx->cmd.se_tmr_req->response = TMR_FUNCTION_REJECTED; > Yep, the ib_srpt patch I sent is against target-pending.git/for-next along with the other target-core updates for v3.2 that will be merged first. Btw, Roland asked me to take a look at the current set of module parameters for ib_srpt, and see which could be removed and/or converted to per SPR target endpoint configfs attributes.. So on that note, a few questions for you on a couple of the module parameters: *) use_port_guid_in_session_name: This appears to be a legacy compat item, can this be safetly removed for mainline..? *) srpt_service_guid: Should this really have global scope..? Aside from those, i'm currently having a look to see which of the module parameters are not used in the initial srpt_add_one() setup path, and that using per SRP target endpoint configfs attributes would make sense. Any input here would be appreciated. Thanks, --nab ^ permalink raw reply [flat|nested] 10+ messages in thread
end of thread, other threads:[~2011-10-19 17:38 UTC | newest] Thread overview: 10+ messages (download: mbox.gz follow: Atom feed -- links below jump to the message on this page -- 2011-10-12 9:23 ib_srpt status Christoph Hellwig 2011-10-12 9:24 ` Christoph Hellwig 2011-10-12 16:17 ` Roland Dreier 2011-10-13 22:07 ` Nicholas A. Bellinger 2011-10-18 6:13 ` Bart Van Assche 2011-10-18 16:41 ` Nicholas A. Bellinger 2011-10-18 16:43 ` Christoph Hellwig 2011-10-18 16:48 ` Nicholas A. Bellinger 2011-10-19 16:59 ` Bart Van Assche 2011-10-19 17:38 ` Nicholas A. Bellinger
This is a public inbox, see mirroring instructions for how to clone and mirror all data and code used for this inbox