public inbox for linux-ide@vger.kernel.org
 help / color / mirror / Atom feed
* [PATCH] scsi: ata: Fix a race condition between scsi error handler and ahci interrupt
@ 2023-08-10  1:48 linan666
  2023-08-10  2:49 ` Damien Le Moal
  2023-08-21 13:51 ` Niklas Cassel
  0 siblings, 2 replies; 13+ messages in thread
From: linan666 @ 2023-08-10  1:48 UTC (permalink / raw)
  To: dlemoal
  Cc: linux-ide, linux-kernel, linan122, yukuai3, yi.zhang, houtao1,
	yangerkun

From: Li Nan <linan122@huawei.com>

interrupt                            scsi_eh

ahci_error_intr
  =>ata_port_freeze
    =>__ata_port_freeze
      =>ahci_freeze (turn IRQ off)
    =>ata_port_abort
      =>ata_port_schedule_eh
        =>shost->host_eh_scheduled++;
        host_eh_scheduled = 1
                                     scsi_error_handler
                                       =>ata_scsi_error
                                         =>ata_scsi_port_error_handler
                                           =>ahci_error_handler
                                           . =>sata_pmp_error_handler
                                           .   =>ata_eh_thaw_port
                                           .     =>ahci_thaw (turn IRQ on)
ahci_error_intr                            .
  =>ata_port_freeze                        .
    =>__ata_port_freeze                    .
      =>ahci_freeze (turn IRQ off)         .
    =>ata_port_abort                       .
      =>ata_port_schedule_eh               .
        =>shost->host_eh_scheduled++;      .
        host_eh_scheduled = 2              .
                                           =>ata_std_end_eh
                                             =>host->host_eh_scheduled = 0;

'host_eh_scheduled' is 0 and scsi eh thread will not be scheduled again,
and the ata port remain freeze and will never be enabled.

If EH thread is already running, no need to freeze port and schedule
EH again.

Reported-by: luojian <luojian5@huawei.com>
Signed-off-by: Li Nan <linan122@huawei.com>
---
 drivers/ata/libahci.c | 12 ++++++++++--
 1 file changed, 10 insertions(+), 2 deletions(-)

diff --git a/drivers/ata/libahci.c b/drivers/ata/libahci.c
index e2bacedf28ef..0dfb0b807324 100644
--- a/drivers/ata/libahci.c
+++ b/drivers/ata/libahci.c
@@ -1840,9 +1840,17 @@ static void ahci_error_intr(struct ata_port *ap, u32 irq_stat)
 
 	/* okay, let's hand over to EH */
 
-	if (irq_stat & PORT_IRQ_FREEZE)
+	if (irq_stat & PORT_IRQ_FREEZE) {
+		/*
+		 * EH already running, this may happen if the port is
+		 * thawed in the EH. But we cannot freeze it again
+		 * otherwise the port will never be thawed.
+		 */
+		if (ap->pflags & (ATA_PFLAG_EH_PENDING |
+			ATA_PFLAG_EH_IN_PROGRESS))
+			return;
 		ata_port_freeze(ap);
-	else if (fbs_need_dec) {
+	} else if (fbs_need_dec) {
 		ata_link_abort(link);
 		ahci_fbs_dec_intr(ap);
 	} else
-- 
2.39.2


^ permalink raw reply related	[flat|nested] 13+ messages in thread

end of thread, other threads:[~2023-09-04 13:00 UTC | newest]

Thread overview: 13+ messages (download: mbox.gz follow: Atom feed
-- links below jump to the message on this page --
2023-08-10  1:48 [PATCH] scsi: ata: Fix a race condition between scsi error handler and ahci interrupt linan666
2023-08-10  2:49 ` Damien Le Moal
2023-08-14  6:41   ` Li Nan
2023-08-14  7:50     ` Damien Le Moal
2023-08-14 13:20       ` Li Nan
2023-08-15  2:41         ` Damien Le Moal
2023-08-17  7:41           ` Li Nan
2023-08-21 13:51 ` Niklas Cassel
2023-08-22  9:20   ` Li Nan
2023-08-22 10:30     ` Niklas Cassel
2023-09-04 11:45       ` Li Nan
2023-09-04 11:57         ` Niklas Cassel
2023-09-04 13:00           ` Li Nan

This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox