From: Michael Reed <mdr@sgi.com>
To: Frederic TEMPORELLI <frederic.temporelli@ext.bull.net>
Cc: linux-scsi@vger.kernel.org
Subject: Re: scan error after FC link recovery when device was in use
Date: Tue, 11 Jul 2006 14:34:47 -0500
Message-ID: <44B3FD57.6090900@sgi.com>
In-Reply-To: <44AA4F59.3040800@ext.bull.net>
This is a known side effect of removing a Fibre Channel target that
still has references (for example, a mounted filesystem). There is a
crude workaround available. Essentially, it modifies the FC transport
so that it does not delete the target when the remote port timer expires.
http://marc.theaimsgroup.com/?l=linux-scsi&m=115015423722568&w=2
Root cause is being investigated by various interested parties.
Be sure to read the entire thread. This patch is not going to be
accepted into the kernel.
More interesting threads:
http://marc.theaimsgroup.com/?l=linux-scsi&m=114979410315677&w=2
http://marc.theaimsgroup.com/?l=linux-scsi&m=115039021604020&w=2
http://marc.theaimsgroup.com/?l=linux-scsi&m=115039057504409&w=2
http://marc.theaimsgroup.com/?l=linux-scsi&m=115134812215106&w=2
Be sure to read the complete thread before applying any of the
patches.
Mike
Frederic TEMPORELLI wrote:
> Hi,
>
>
> With 2.6.17.2, if an FC device is in use (mounted FS), a scan error is
> reported when recovering after a 'long' link-down period (>30s, 'no
> device timeout'). After that, the devices aren't detected again (and oopses
> are generated when reloading the HBA drivers, but this may be a side effect).
>
> We have this issue with point-to-point configurations using:
> - Emulex + DDN
> - Emulex + NEC
> - QLogic + DDN
> - QLogic + NEC
>
> In all cases, we got the message "Unexpected response from lun 0 while
> scanning, scan aborted" in the scsi_report_lun_scan function
> (scsi_scan.c) and I'm thinking there's something wrong when calling
> scsi_probe_and_add_lun.
> We don't get this error if the devices aren't in use (not mounted) when the
> link failure occurs.
>
> Do you know whether this issue has already been encountered?
> Is there perhaps an existing patch? (I searched for scsi_probe_and_add_lun
> in the mailing list but didn't find any relevant message.)
>
> Here are the extracts from syslog about these errors:
>
>
> lpfc + DDN:
> ===========
> ...
> Jul 3 09:15:30 s_kernel@iotiger2 kernel: lpfc 0000:06:02.0: 0:1305 Link
> Down Event x2 received Data: x2 x20 x0
> Jul 3 09:16:00 s_kernel@iotiger2 kernel: lpfc 0000:06:02.0: 0:0203
> Nodev timeout on WWPN 23:0:0:1:ff:3:2:a8 NPort xef Data: x8 x7 x0
> Jul 3 09:16:05 s_kernel@iotiger2 kernel: rport-2:0-0: blocked FC remote
> port time out: removing target and saving binding
> Jul 3 09:17:01 s_kernel@iotiger2 kernel: lpfc 0000:06:02.0: 0:1303 Link
> Up Event x3 received Data: x3 x1 x8 x2
> Jul 3 09:17:01 s_kernel@iotiger2 kernel: Vendor: DDN Model: S2A
> 8500 Rev: 5.22
> Jul 3 09:17:01 s_kernel@iotiger2 kernel: Type: Direct-Access
> ANSI SCSI revision: 03
> Jul 3 09:17:01 s_kernel@iotiger2 kernel: SCSI device sdd: 571416576
> 512-byte hdwr sectors (292565 MB)
> Jul 3 09:17:01 s_kernel@iotiger2 kernel: sdd: Write Protect is off
> Jul 3 09:17:01 s_kernel@iotiger2 kernel: sdd: Mode Sense: a7 00 10 08
> Jul 3 09:17:01 s_kernel@iotiger2 kernel: SCSI device sdd: drive cache:
> write back w/ FUA
> Jul 3 09:17:01 s_kernel@iotiger2 kernel: SCSI device sdd: 571416576
> 512-byte hdwr sectors (292565 MB)
> Jul 3 09:17:01 s_kernel@iotiger2 kernel: sdd: Write Protect is off
> Jul 3 09:17:01 s_kernel@iotiger2 kernel: sdd: Mode Sense: a7 00 10 08
> Jul 3 09:17:01 s_kernel@iotiger2 kernel: SCSI device sdd: drive cache:
> write back w/ FUA
> Jul 3 09:17:01 s_kernel@iotiger2 kernel: sdd: unknown partition table
> Jul 3 09:17:01 s_kernel@iotiger2 kernel: sd 2:0:0:0: Attached scsi disk
> sdd
> Jul 3 09:17:01 s_kernel@iotiger2 kernel: sd 2:0:0:0: Attached scsi
> generic sg4 type 0
> Jul 3 09:17:01 s_kernel@iotiger2 kernel: Vendor: DDN Model: S2A
> 8500 Rev: 5.22
> Jul 3 09:17:01 s_kernel@iotiger2 kernel: Type: Direct-Access
> ANSI SCSI revision: 03
> Jul 3 09:17:01 s_kernel@iotiger2 kernel: error 1
> Jul 3 09:17:01 s_kernel@iotiger2 kernel: 2:0:0:0: Unexpected response
> from lun 0 while scanning, scan aborted
> ...
>
>
> lpfc + NEC:
> ===========
> ...
> Jul 4 06:11:44 s_kernel@iotiger2 kernel: EXT3 FS on sdg, internal journal
> Jul 4 06:11:44 s_kernel@iotiger2 kernel: EXT3-fs: recovery complete.
> Jul 4 06:11:44 s_kernel@iotiger2 kernel: EXT3-fs: mounted filesystem
> with ordered data mode.
> Jul 4 06:12:03 s_kernel@iotiger2 kernel: lpfc 0000:06:02.1: 1:1305 Link
> Down Event x2 received Data: x2 x20 x0
> Jul 4 06:12:33 s_kernel@iotiger2 kernel: lpfc 0000:06:02.1: 1:0203
> Nodev timeout on WWPN 20:6:0:0:13:84:0:35 NPort xdc Data: x8 x7 x0
> Jul 4 06:12:38 s_kernel@iotiger2 kernel: rport-3:0-0: blocked FC remote
> port time out: removing target and saving binding
> Jul 4 06:12:42 s_kernel@iotiger2 kernel: lpfc 0000:06:02.1: 1:1303 Link
> Up Event x3 received Data: x3 x1 x8 x2
> Jul 4 06:12:42 s_kernel@iotiger2 kernel: Vendor: NEC Model:
> iStorage 2000 Rev: 2800
> Jul 4 06:12:42 s_kernel@iotiger2 kernel: Type: Direct-Access
> ANSI SCSI revision: 04
> Jul 4 06:12:42 s_kernel@iotiger2 kernel: SCSI device sdh: 138412032
> 512-byte hdwr sectors (70867 MB)
> Jul 4 06:12:42 s_kernel@iotiger2 kernel: sdh: Write Protect is off
> Jul 4 06:12:42 s_kernel@iotiger2 kernel: sdh: Mode Sense: 97 00 00 08
> Jul 4 06:12:42 s_kernel@iotiger2 kernel: SCSI device sdh: drive cache:
> write back
> Jul 4 06:12:42 s_kernel@iotiger2 kernel: SCSI device sdh: 138412032
> 512-byte hdwr sectors (70867 MB)
> Jul 4 06:12:42 s_kernel@iotiger2 kernel: sdh: Write Protect is off
> Jul 4 06:12:42 s_kernel@iotiger2 kernel: sdh: Mode Sense: 97 00 00 08
> Jul 4 06:12:42 s_kernel@iotiger2 kernel: SCSI device sdh: drive cache:
> write back
> Jul 4 06:12:42 s_kernel@iotiger2 kernel: sdh: unknown partition table
> Jul 4 06:12:42 s_kernel@iotiger2 kernel: sd 3:0:0:0: Attached scsi disk
> sdh
> Jul 4 06:12:42 s_kernel@iotiger2 kernel: sd 3:0:0:0: Attached scsi
> generic sg12 type 0
> Jul 4 06:12:42 s_kernel@iotiger2 kernel: Vendor: NEC Model:
> iStorage 2000 Rev: 2800
> Jul 4 06:12:42 s_kernel@iotiger2 kernel: Type: Direct-Access
> ANSI SCSI revision: 04
> Jul 4 06:12:42 s_kernel@iotiger2 kernel: error 1
> Jul 4 06:12:42 s_kernel@iotiger2 kernel: 3:0:0:0: Unexpected response
> from lun 0 while scanning, scan aborted
> ...
>
> QLogic + NEC:
> =============
> ...
> Jul 4 09:47:10 s_kernel@iotiger2 kernel: qla2400 0000:07:01.1: LIP
> reset occured (f8ef).
> Jul 4 09:47:10 s_kernel@iotiger2 kernel: qla2400 0000:07:01.1: LOOP
> DOWN detected (2).
> Jul 4 09:47:14 s_kernel@iotiger2 kernel: qla2400 0000:07:01.0: LOOP
> DOWN detected (2).
> Jul 4 09:47:45 s_kernel@iotiger2 kernel: rport-9:0-0: blocked FC remote
> port time out: removing target and saving binding
> Jul 4 09:47:50 s_kernel@iotiger2 kernel: rport-8:0-0: blocked FC remote
> port time out: removing target and saving binding
> Jul 4 09:48:11 s_kernel@iotiger2 kernel: qla2400 0000:07:01.0: LIP
> reset occured (f7f7).
> Jul 4 09:48:11 s_kernel@iotiger2 kernel: qla2400 0000:07:01.0: LIP
> occured (f7f7).
> Jul 4 09:48:11 s_kernel@iotiger2 kernel: qla2400 0000:07:01.0: LOOP UP
> detected (2 Gbps).
> Jul 4 09:48:11 s_kernel@iotiger2 kernel: Vendor: NEC Model:
> iStorage 2000 Rev: 2800
> Jul 4 09:48:11 s_kernel@iotiger2 kernel: Type: Direct-Access
> ANSI SCSI revision: 04
> Jul 4 09:48:11 s_kernel@iotiger2 kernel: SCSI device sdi: 138412032
> 512-byte hdwr sectors (70867 MB)
> Jul 4 09:48:11 s_kernel@iotiger2 kernel: sdi: Write Protect is off
> Jul 4 09:48:11 s_kernel@iotiger2 kernel: sdi: Mode Sense: 97 00 00 08
> Jul 4 09:48:11 s_kernel@iotiger2 kernel: SCSI device sdi: drive cache:
> write back
> Jul 4 09:48:11 s_kernel@iotiger2 kernel: SCSI device sdi: 138412032
> 512-byte hdwr sectors (70867 MB)
> Jul 4 09:48:11 s_kernel@iotiger2 kernel: sdi: Write Protect is off
> Jul 4 09:48:11 s_kernel@iotiger2 kernel: sdi: Mode Sense: 97 00 00 08
> Jul 4 09:48:11 s_kernel@iotiger2 kernel: SCSI device sdi: drive cache:
> write back
> Jul 4 09:48:11 s_kernel@iotiger2 kernel: sdi: unknown partition table
> Jul 4 09:48:11 s_kernel@iotiger2 kernel: sd 8:0:0:0: Attached scsi disk
> sdi
> Jul 4 09:48:11 s_kernel@iotiger2 kernel: sd 8:0:0:0: Attached scsi
> generic sg14 type 0
> Jul 4 09:48:11 s_kernel@iotiger2 kernel: Vendor: NEC Model:
> iStorage 2000 Rev: 2800
> Jul 4 09:48:11 s_kernel@iotiger2 kernel: Type: Direct-Access
> ANSI SCSI revision: 04
> Jul 4 09:48:11 s_kernel@iotiger2 kernel: error 1
> Jul 4 09:48:11 s_kernel@iotiger2 kernel: 8:0:0:0: Unexpected response
> from lun 0 while scanning, scan aborted
> ...
>
> QLogic + DDN:
> =============
> ...
> Jul 4 10:01:31 s_kernel@iotiger2 kernel: qla2400 0000:07:01.1: LOOP
> DOWN detected (2).
> Jul 4 10:02:06 s_kernel@iotiger2 kernel: rport-9:0-0: blocked FC remote
> port time out: removing target and saving binding
> Jul 4 10:03:05 s_kernel@iotiger2 kernel: qla2400 0000:07:01.1: LIP
> reset occured (f7f7).
> Jul 4 10:03:05 s_kernel@iotiger2 kernel: qla2400 0000:07:01.1: LIP
> occured (f7f7).
> Jul 4 10:03:05 s_kernel@iotiger2 kernel: qla2400 0000:07:01.1: LOOP UP
> detected (2 Gbps).
> Jul 4 10:03:05 s_kernel@iotiger2 kernel: Vendor: DDN Model: S2A
> 8500 Rev: 5.22
> Jul 4 10:03:05 s_kernel@iotiger2 kernel: Type: Direct-Access
> ANSI SCSI revision: 03
> Jul 4 10:03:05 s_kernel@iotiger2 kernel: SCSI device sdl: 571416576
> 512-byte hdwr sectors (292565 MB)
> Jul 4 10:03:05 s_kernel@iotiger2 kernel: sdl: Write Protect is off
> Jul 4 10:03:05 s_kernel@iotiger2 kernel: sdl: Mode Sense: a7 00 10 08
> Jul 4 10:03:05 s_kernel@iotiger2 kernel: SCSI device sdl: drive cache:
> write back w/ FUA
> Jul 4 10:03:05 s_kernel@iotiger2 kernel: SCSI device sdl: 571416576
> 512-byte hdwr sectors (292565 MB)
> Jul 4 10:03:05 s_kernel@iotiger2 kernel: sdl: Write Protect is off
> Jul 4 10:03:05 s_kernel@iotiger2 kernel: sdl: Mode Sense: a7 00 10 08
> Jul 4 10:03:05 s_kernel@iotiger2 kernel: SCSI device sdl: drive cache:
> write back w/ FUA
> Jul 4 10:03:05 s_kernel@iotiger2 kernel: sdl: unknown partition table
> Jul 4 10:03:05 s_kernel@iotiger2 kernel: sd 9:0:0:0: Attached scsi disk
> sdl
> Jul 4 10:03:05 s_kernel@iotiger2 kernel: sd 9:0:0:0: Attached scsi
> generic sg15 type 0
> Jul 4 10:03:05 s_kernel@iotiger2 kernel: Vendor: DDN Model: S2A
> 8500 Rev: 5.22
> Jul 4 10:03:05 s_kernel@iotiger2 kernel: Type: Direct-Access
> ANSI SCSI revision: 03
> Jul 4 10:03:05 s_kernel@iotiger2 kernel: error 1
> Jul 4 10:03:05 s_kernel@iotiger2 kernel: 9:0:0:0: Unexpected response
> from lun 0 while scanning, scan aborted
> ...
>
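All four extracts above share one pattern: an rport on a given host hits
"blocked FC remote port time out", and after the link comes back the rescan
on that same host ends with "Unexpected response from lun 0 while scanning,
scan aborted". The following is a minimal Python sketch, not part of the
original report, for spotting that sequence in a syslog. It assumes the log
lines are in their original unwrapped form (the extracts above were wrapped
by the mail client), and the function name find_aborted_rescans and the
SAMPLE list are hypothetical names chosen for illustration.

```python
import re

# "rport-<host>:<channel>-<number>: blocked FC remote port time out ..."
RPORT_TIMEOUT = re.compile(
    r"rport-(\d+):\d+-\d+: blocked FC remote port time out"
)
# "<host>:<channel>:<target>:<lun>: Unexpected response from lun 0 while scanning ..."
SCAN_ABORTED = re.compile(
    r"((\d+):\d+:\d+:\d+): Unexpected response from lun 0 while scanning"
)


def find_aborted_rescans(lines):
    """Return SCSI addresses whose scan was aborted after an rport
    timeout was logged earlier for the same host number."""
    timed_out_hosts = set()
    aborted = []
    for line in lines:
        m = RPORT_TIMEOUT.search(line)
        if m:
            # Remember which host lost a remote port.
            timed_out_hosts.add(int(m.group(1)))
            continue
        m = SCAN_ABORTED.search(line)
        if m and int(m.group(2)) in timed_out_hosts:
            aborted.append(m.group(1))
    return aborted


# Lines rejoined from the "lpfc + DDN" extract above.
SAMPLE = [
    "Jul 3 09:16:05 s_kernel@iotiger2 kernel: rport-2:0-0: blocked FC remote "
    "port time out: removing target and saving binding",
    "Jul 3 09:17:01 s_kernel@iotiger2 kernel: lpfc 0000:06:02.0: 0:1303 Link "
    "Up Event x3 received Data: x3 x1 x8 x2",
    "Jul 3 09:17:01 s_kernel@iotiger2 kernel: 2:0:0:0: Unexpected response "
    "from lun 0 while scanning, scan aborted",
]

if __name__ == "__main__":
    print(find_aborted_rescans(SAMPLE))
```

Matching on the host number only is a deliberate simplification: in the
QLogic + NEC extract two rports (hosts 8 and 9) time out, and the aborted
scan is reported against host 8.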