* Re: [PATCH 2/2] ata: libata: Issue non-NCQ command via EH when NCQ commands in-flight
[not found] ` <baceec65-ad60-f8e5-f417-0316c19a0234@huawei.com>
@ 2024-11-05 9:33 ` Niklas Cassel
0 siblings, 0 replies; only message in thread
From: Niklas Cassel @ 2024-11-05 9:33 UTC (permalink / raw)
To: yangxingui
Cc: Damien Le Moal, Hannes Reinecke, Yu Kuai, linux-ide, Wenchao Hao,
linux-scsi
On Mon, Nov 04, 2024 at 12:01:19PM +0800, yangxingui wrote:
(snip)
> After testing, the issues we encountered were resolved.
That is good news :)
>
> But the kernel prints the following log:
>
> [246993.392832] sas: Enter sas_scsi_recover_host busy: 1 failed: 1
> [246993.392839] sas: ata5: end_device-4:0: cmd error handler
> [246993.392855] sas: ata5: end_device-4:0: dev error handler
> [246993.392860] sas: ata6: end_device-4:3: dev error handler
> [246993.392863] sas: ata7: end_device-4:4: dev error handler
> [246993.606491] sas: --- Exit sas_scsi_recover_host: busy: 0 failed:
> 1 tries: 1
>
> And because the current EH will set the host to the recovery state,
> when we test and execute the smartctl command, it will affect the
> performance of all other disks under the same host.
>
> Perhaps we can continue to improve the EH mechanism that Wenchao
> tried to do before, and implement EH for a single disk. After a
> single disk enters EH, it may not affect other disks under the same
> host.
>
> https://lore.kernel.org/linux-scsi/20230901094127.2010873-1-haowenchao2@huawei.com/
That is bad news :(
Considering that this series will currently stall all other disks under
the same host, this series is currently not a viable solution to the
problem that you have reported (NCQ commands can starve out non-NCQ
commands).
Looking at:
https://lore.kernel.org/linux-scsi/20230901094127.2010873-1-haowenchao2@huawei.com/
It appears that a requirement for Wenchao's series to land,
is that Hannes's EH rework series:
https://lore.kernel.org/linux-scsi/20231023092837.33786-1-hare@suse.de/
lands first.
Unless these two SCSI series get merged first, it's illogical to carry this
increased complexity in libata.
If these two SCSI series ever get merged, then the series in $subject would
be a viable solution to the problem, and the extra complexity would be
justified.
Kind regards,
Niklas
^ permalink raw reply [flat|nested] only message in thread
only message in thread, other threads:[~2024-11-05 9:33 UTC | newest]
Thread overview: (only message) (download: mbox.gz follow: Atom feed
-- links below jump to the message on this page --
[not found] <20241031140731.224589-4-cassel@kernel.org>
[not found] ` <20241031140731.224589-6-cassel@kernel.org>
[not found] ` <baceec65-ad60-f8e5-f417-0316c19a0234@huawei.com>
2024-11-05 9:33 ` [PATCH 2/2] ata: libata: Issue non-NCQ command via EH when NCQ commands in-flight Niklas Cassel
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).