public inbox for linux-kernel@vger.kernel.org
 help / color / mirror / Atom feed
* reply: reply: [RFC PATCH v3 00/19] scsi: scsi_error: Introduce new error handle mechanism
@ 2025-04-01  3:32 Jiangjianjun
  2025-04-01  9:24 ` John Garry
  0 siblings, 1 reply; 2+ messages in thread
From: Jiangjianjun @ 2025-04-01  3:32 UTC (permalink / raw)
  To: John Garry
  Cc: jejb@linux.ibm.com, martin.petersen@oracle.com,
	linux-scsi@vger.kernel.org, linux-kernel@vger.kernel.org,
	lixiaokeng, hewenliang (C), Yangkunlin(Poincare), yangxingui,
	liyihang (C)


On 31/03/2025 04:10, Jiangjianjun wrote:
> Sorry for late message! I'm working on fixing and testing these issues before re-emailing.

What are you actually working on?

It seems that Hannes' "scsi: EH rework, main part" series and maybe this one can help resolve this following issue:

https://lore.kernel.org/linux-block/eef1e927-c9b2-c61d-7f48-92e65d8b0418@huawei.com/

with fix attempted in:

https://lore.kernel.org/linux-ide/20241031140731.224589-4-cassel@kernel.org/

so that we don't see "fixes" like:
https://lore.kernel.org/linux-scsi/20250329073236.2300582-1-liyihang9@huawei.com/T/#m80bcb3f57fd176b7ce41b1f26e8560de6ad52c9d

> 
> -----邮件原件-----
> 发件人: Christoph Hellwig <hch@infradead.org>
> 发送时间: 2025年3月20日 14:06
> 收件人: Hannes Reinecke <hare@suse.de>
> 抄送: Jiangjianjun <jiangjianjun3@huawei.com>; jejb@linux.ibm.com; 
> martin.petersen@oracle.com; linux-scsi@vger.kernel.org; 
> linux-kernel@vger.kernel.org; lixiaokeng <lixiaokeng@huawei.com>; 
> hewenliang (C) <hewenliang4@huawei.com>; Yangkunlin(Poincare) 
> <yangkunlin7@huawei.com>
> 主题: Re: [RFC PATCH v3 00/19] scsi: scsi_error: Introduce new error 
> handle mechanism
> 
> On Fri, Mar 14, 2025 at 10:01:40AM +0100, Hannes Reinecke wrote:
>> 3. The current EH framework is designed around 'struct scsi_cmnd'.
>> Which means that the command _initiating_ the error handling can only 
>> be returned once the _entire_ error handling (with all
>> escalations) is finished. And more often than not, the application is 
>> waiting on that command to be completed before the next I/O is sent.
>> And that really limits the effectiveness of any improved error 
>> handler; the application ultimatively has to wait for a host reset 
>> before it can contine.
> 
> And someone needs to get your old series to fix that merged before we even start talking about any major EH change.
> 

Sorry, the previous engineer Wen Chao's work has changed. Now I will continue to complete this work. In the future.
I will analyze the details of the solution, improve and refine the above suggestions, and carefully submit the email.


^ permalink raw reply	[flat|nested] 2+ messages in thread

* Re: reply: reply: [RFC PATCH v3 00/19] scsi: scsi_error: Introduce new error handle mechanism
  2025-04-01  3:32 reply: reply: [RFC PATCH v3 00/19] scsi: scsi_error: Introduce new error handle mechanism Jiangjianjun
@ 2025-04-01  9:24 ` John Garry
  0 siblings, 0 replies; 2+ messages in thread
From: John Garry @ 2025-04-01  9:24 UTC (permalink / raw)
  To: Jiangjianjun
  Cc: jejb@linux.ibm.com, martin.petersen@oracle.com,
	linux-scsi@vger.kernel.org, linux-kernel@vger.kernel.org,
	lixiaokeng, hewenliang (C), Yangkunlin(Poincare), yangxingui,
	liyihang (C)

On 01/04/2025 04:32, Jiangjianjun wrote:

Please use standard mailing practice of inlining response.

> On 31/03/2025 04:10, Jiangjianjun wrote:
>> Sorry for late message! I'm working on fixing and testing these issues before re-emailing.
> What are you actually working on?
> 
> It seems that Hannes' "scsi: EH rework, main part" series and maybe this one can help resolve this following issue:
> 
> https://urldefense.com/v3/__https://lore.kernel.org/linux-block/ 
> eef1e927-c9b2-c61d-7f48-92e65d8b0418@huawei.com/__;!!ACWV5N9M2RV99hQ! 
> OO5I73jOVLARfumNZnn0L_cNWCWVmFHmNuzz74pUu12bSxsb7F1wQFuTJBCzEBZrdDE8cqBRf8e_Ddc4AmrbBgdRq9I$ 
> 
> with fix attempted in:
> 
> https://urldefense.com/v3/__https://lore.kernel.org/linux- 
> ide/20241031140731.224589-4-cassel@kernel.org/__;!!ACWV5N9M2RV99hQ! 
> OO5I73jOVLARfumNZnn0L_cNWCWVmFHmNuzz74pUu12bSxsb7F1wQFuTJBCzEBZrdDE8cqBRf8e_Ddc4AmrbVJtRc_g$ 
> 
> so that we don't see "fixes" like:
> https://urldefense.com/v3/__https://lore.kernel.org/linux- 
> scsi/20250329073236.2300582-1-liyihang9@huawei.com/T/ 
> *m80bcb3f57fd176b7ce41b1f26e8560de6ad52c9d__;Iw!!ACWV5N9M2RV99hQ! 
> OO5I73jOVLARfumNZnn0L_cNWCWVmFHmNuzz74pUu12bSxsb7F1wQFuTJBCzEBZrdDE8cqBRf8e_Ddc4AmrbNUtxBIA$ 
> 
>> -----邮件原件-----
>> 发件人: Christoph Hellwig<hch@infradead.org>
>> 发送时间: 2025年3月20日 14:06
>> 收件人: Hannes Reinecke<hare@suse.de>
>> 抄送: Jiangjianjun<jiangjianjun3@huawei.com>;jejb@linux.ibm.com;
>> martin.petersen@oracle.com;linux-scsi@vger.kernel.org;
>> linux-kernel@vger.kernel.org; lixiaokeng<lixiaokeng@huawei.com>;
>> hewenliang (C)<hewenliang4@huawei.com>; Yangkunlin(Poincare)
>> <yangkunlin7@huawei.com>
>> 主题: Re: [RFC PATCH v3 00/19] scsi: scsi_error: Introduce new error
>> handle mechanism
>>
>> On Fri, Mar 14, 2025 at 10:01:40AM +0100, Hannes Reinecke wrote:
>>> 3. The current EH framework is designed around 'struct scsi_cmnd'.
>>> Which means that the command_initiating_ the error handling can only
>>> be returned once the_entire_ error handling (with all
>>> escalations) is finished. And more often than not, the application is
>>> waiting on that command to be completed before the next I/O is sent.
>>> And that really limits the effectiveness of any improved error
>>> handler; the application ultimatively has to wait for a host reset
>>> before it can contine.
>> And someone needs to get your old series to fix that merged before we even start talking about any major EH change.
>>
> Sorry, the previous engineer Wen Chao's work has changed. Now I will continue to complete this work. In the future.
> I will analyze the details of the solution, improve and refine the above suggestions, and carefully submit the email.

JFYI, IIRC, that "scsi: EH rework, main part" or one of the prep series 
may require some form of SCSI reserved command support. Niklas raised 
that point here:
https://lore.kernel.org/linux-scsi/Zyo-E1PCvx_XULvg@ryzen/

I also remember commenting on this, but cannot find a reference.

The SCSI reserved commands series includes the following attempts:
https://lore.kernel.org/linux-scsi/20211125151048.103910-1-hare@suse.de/
https://lore.kernel.org/linux-scsi/1666693096-180008-1-git-send-email-john.garry@huawei.com/

Maybe to move format we can implement a basic solution for the concerned 
drivers, so that progress can be made.



^ permalink raw reply	[flat|nested] 2+ messages in thread

end of thread, other threads:[~2025-04-01  9:25 UTC | newest]

Thread overview: 2+ messages (download: mbox.gz follow: Atom feed
-- links below jump to the message on this page --
2025-04-01  3:32 reply: reply: [RFC PATCH v3 00/19] scsi: scsi_error: Introduce new error handle mechanism Jiangjianjun
2025-04-01  9:24 ` John Garry

This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox