* Re: A CD with errors (scratches etc.) blocks the whole system while reading damadged files @ 2002-04-30 21:18 Eric M 0 siblings, 0 replies; 37+ messages in thread From: Eric M @ 2002-04-30 21:18 UTC (permalink / raw) To: linux-kernel i have a plextor cd-rw drive which i use via scsi emulation, it is the only device on my second ide bus. often when i rip cds with grip that have scratches, grip completely stops responding to input and signals, even -9 wont go through. today i tried waiting for over half an hour for it to terminate, got bored and quit X. the grip process died after doing that but the kernel's scsi layer kept spewing error messages on my console and my machine "lagged" on about 10-15 second intervals a few seconds at a time. i waited over an hour for it to give up, but it didn't so i had to reboot. kernel version i'm using is 2.4.17. -- __ Ota itsellesi luotettava kotimainen email http://www.jippii.fi/ Tutustu samalla netin parhaaseen pelipaikkaan JIPPIIGAMESIIN. ^ permalink raw reply [flat|nested] 37+ messages in thread
* A CD with errors (scratches etc.) blocks the whole system while reading damadged files
@ 2002-04-18 13:13 Dr. Death
2002-04-19 14:14 ` Richard B. Johnson
2002-04-19 20:01 ` Erik Andersen
0 siblings, 2 replies; 37+ messages in thread
From: Dr. Death @ 2002-04-18 13:13 UTC (permalink / raw)
To: linux-kernel
Problem:
I use SuSE Linux 7.2 and when I create md5sums from damaged files on a
CD, the WHOLE system freezes or is ugly slow untill md5 has passed the
damaged part of the file !
My CDRom drive is a HP8100i IDE CD Writer attached as HDD at the onbord
IDE Controller
test case:
md5 "/cdrom/damaged file"
cat /proc/version output:
Linux version 2.4.4-4GB (root@Pentium.suse.de) (gcc version 2.95.3
20010315 (SuSE)) #1 Wed May 16 00:37:55 GMT 2001
linux_ver output:
Linux filez 2.4.4-4GB #1 Wed May 16 00:37:55 GMT 2001 i586 unknown
Gnu C 2.95.3
Gnu make 3.79.1
binutils 2.10.91.0.4
util-linux 2.11b
mount 2.11b
modutils 2.4.5
e2fsprogs 1.19
reiserfsprogs 3.x.0j
PPP 2.4.0
isdn4k-utils 3.1pre1a
Linux C Library x 1 root root 1343073 Mai 11 2001
/lib/libc.so.6
Dynamic linker (ldd) 2.2.2
Procps 2.0.7
Net-tools 1.60
Kbd 1.04
Sh-utils 2.0
Modules Loaded ppp_async ppp_generic nls_iso8859-1 snd-pcm-oss
snd-pcm-plugin snd-mixer-oss snd-synth-emu8000 snd-synth-emux
snd-seq-midi-emul snd-seq-virmidi snd-emux-mem snd-seq-midi
snd-seq-midi-event snd-seq snd-card-sbawe isa-pnp snd-sb16-csp
snd-sb16-dsp snd-pcm snd-mixer snd-opl3 snd-hwdep snd-timer
snd-mpu401-uart snd-rawmidi snd-seq-device snd soundcore af_packet nfsd
lp parport usbcore ne2k-pci 8390 hisax isdn loop_fish2 ide-scsi rtl8139
reiserfs
cat /proc/cpuinfo output:
processor : 0
vendor_id : AuthenticAMD
cpu family : 5
model : 7
model name : AMD-K6tm w/ multimedia extensions
stepping : 0
cpu MHz : 267.281
cache size : 64 KB
fdiv_bug : no
hlt_bug : no
f00f_bug : no
coma_bug : no
fpu : yes
fpu_exception : yes
cpuid level : 1
wp : yes
flags : fpu vme de pse tsc msr mce cx8 mmx
bogomips : 532.48
cat /proc/modules output:
ppp_async 6480 0 (autoclean) (unused)
ppp_generic 14416 0 (autoclean) [ppp_async]
nls_iso8859-1 2880 1 (autoclean)
snd-pcm-oss 18816 1 (autoclean)
snd-pcm-plugin 15024 0 (autoclean) [snd-pcm-oss]
snd-mixer-oss 5120 0 (autoclean) [snd-pcm-oss]
snd-synth-emu8000 16176 0 (unused)
snd-synth-emux 26592 0 [snd-synth-emu8000]
snd-seq-midi-emul 4480 0 [snd-synth-emux]
snd-seq-virmidi 8496 0 [snd-synth-emux]
snd-emux-mem 1616 0 [snd-synth-emu8000 snd-synth-emux]
snd-seq-midi 3568 0 (unused)
snd-seq-midi-event 2992 0 [snd-seq-virmidi snd-seq-midi]
snd-seq 42656 0 [snd-synth-emux snd-seq-virmidi
snd-seq-midi snd-seq-midi-event]
snd-card-sbawe 5536 1
isa-pnp 28176 0 [snd-card-sbawe]
snd-sb16-csp 15888 0 [snd-card-sbawe]
snd-sb16-dsp 15888 0 [snd-card-sbawe snd-sb16-csp]
snd-pcm 30560 0 [snd-pcm-oss snd-pcm-plugin snd-sb16-dsp]
snd-mixer 24224 0 [snd-mixer-oss snd-synth-emu8000
snd-sb16-csp snd-sb16-dsp]
snd-opl3 4848 0 [snd-card-sbawe]
snd-hwdep 3376 0 [snd-sb16-csp snd-opl3]
snd-timer 8560 0 [snd-seq snd-pcm snd-opl3]
snd-mpu401-uart 2512 0 [snd-card-sbawe]
snd-rawmidi 9664 0 [snd-seq-midi snd-mpu401-uart]
snd-seq-device 4032 0 [snd-synth-emu8000 snd-synth-emux
snd-seq-midi snd-seq snd-card-sbawe snd-rawmidi]
snd 34032 1 [snd-pcm-oss snd-pcm-plugin
snd-mixer-oss snd-synth-emu8000 snd-synth-emux snd-seq-virm
idi snd-emux-mem snd-seq-midi snd-seq-midi-event snd-seq snd-card-sbawe
snd-sb16-csp snd-sb16-dsp snd-pcm snd-mixer snd-
opl3 snd-hwdep snd-timer snd-mpu401-uart snd-rawmidi snd-seq-device]
soundcore 3632 7 [snd]
af_packet 11648 2 (autoclean)
nfsd 67280 4 (autoclean)
lp 5392 0 (autoclean)
parport 24352 0 (autoclean) [lp]
usbcore 47120 0 (autoclean)
ne2k-pci 4640 1 (autoclean)
8390 6240 0 (autoclean) [ne2k-pci]
hisax 496192 1
isdn 123056 2 [hisax]
loop_fish2 9280 0 (unused)
ide-scsi 7856 1
rtl8139 11520 1
reiserfs 156432 3
cat /proc/ioports output:
0000-001f : dma1
0020-003f : pic1
0040-005f : timer
0060-006f : keyboard
0070-007f : rtc
0080-008f : dma page reg
00a0-00bf : pic2
00c0-00df : dma2
00f0-00ff : fpu
0170-0177 : ide1
01f0-01f7 : ide0
0213-0213 : isapnp read
0220-022f : Sound Blaster AWE32/64
0330-0331 : Sound Blaster AWE32/64 - MPU-401
0376-0376 : ide1
0388-038b : Sound Blaster AWE32/64 - FM
03c0-03df : vga+
03f6-03f6 : ide0
0620-0623 : Sound Blaster AWE32/64 - WaveTable
0a20-0a23 : Sound Blaster AWE32/64 - WaveTable
0a79-0a79 : isapnp write
0cf8-0cff : PCI conf1
0e20-0e23 : Sound Blaster AWE32/64 - WaveTable
4000-40ff : PCI device 1106:3040
c000-cfff : PCI Bus #01
d000-d00f : PCI device 1106:0571
d000-d007 : ide0
d008-d00f : ide1
d800-d8ff : PCI device 10ec:8139
d800-d8ff : 8139too
dc00-dc1f : PCI device 1244:0a00
dc00-dc1f : avm PCI
e000-e01f : PCI device 10ec:8029
e000-e01f : ne2k-pci
cat /proc/iomem output:
00000000-0009fbff : System RAM
0009fc00-0009ffff : reserved
000a0000-000bffff : Video RAM area
000c0000-000c7fff : Video ROM
000f0000-000fffff : System ROM
00100000-0fffffff : System RAM
00100000-002327d1 : Kernel code
002327d2-0031bdcb : Kernel data
e0000000-e3ffffff : PCI device 1106:0598
e4000000-e40000ff : PCI device 10ec:8139
e4000000-e40000ff : 8139too
e4001000-e400101f : PCI device 1244:0a00
ffff0000-ffffffff : reserved
lspci -vvv output:
00:00.0 Host bridge: VIA Technologies, Inc. VT82C598 [Apollo MVP3] (rev 04)
Control: I/O- Mem+ BusMaster+ SpecCycle- MemWINV- VGASnoop-
ParErr- Stepping- SERR- FastB2B-
Status: Cap+ 66Mhz- UDF- FastB2B+ ParErr- DEVSEL=medium >TAbort-
<TAbort- <MAbort- >SERR- <PERR+
Latency: 0
Region 0: Memory at e0000000 (32-bit, prefetchable) [size=64M]
Capabilities: [a0] AGP version 1.0
Status: RQ=7 SBA+ 64bit- FW- Rate=x1,x2
Command: RQ=0 SBA- AGP- 64bit- FW- Rate=<none>
00:01.0 PCI bridge: VIA Technologies, Inc. VT82C598/694x [Apollo
MVP3/Pro133x AGP] (prog-if 00 [Normal decode])
Control: I/O+ Mem+ BusMaster+ SpecCycle- MemWINV- VGASnoop-
ParErr- Stepping- SERR- FastB2B-
Status: Cap- 66Mhz+ UDF- FastB2B- ParErr- DEVSEL=medium >TAbort-
<TAbort- <MAbort+ >SERR- <PERR-
Latency: 0
Bus: primary=00, secondary=01, subordinate=01, sec-latency=0
I/O behind bridge: 0000c000-0000cfff
Memory behind bridge: fff00000-000fffff
Prefetchable memory behind bridge: fff00000-000fffff
BridgeCtl: Parity- SERR- NoISA+ VGA- MAbort- >Reset- FastB2B-
00:07.0 ISA bridge: VIA Technologies, Inc. VT82C586/A/B PCI-to-ISA
[Apollo VP] (rev 47)
Subsystem: VIA Technologies, Inc. MVP3 ISA Bridge
Control: I/O+ Mem+ BusMaster+ SpecCycle+ MemWINV- VGASnoop-
ParErr- Stepping+ SERR- FastB2B-
Status: Cap- 66Mhz- UDF- FastB2B- ParErr- DEVSEL=medium >TAbort-
<TAbort- <MAbort- >SERR- <PERR-
Latency: 0
00:07.1 IDE interface: VIA Technologies, Inc. Bus Master IDE (rev 06)
(prog-if 8a [Master SecP PriP])
Control: I/O+ Mem+ BusMaster+ SpecCycle- MemWINV- VGASnoop-
ParErr- Stepping- SERR- FastB2B-
Status: Cap- 66Mhz- UDF- FastB2B+ ParErr- DEVSEL=medium >TAbort-
<TAbort- <MAbort- >SERR- <PERR-
Latency: 64
Region 4: I/O ports at d000 [size=16]
00:07.3 Host bridge: VIA Technologies, Inc. VT82C586B ACPI (rev 10)
Control: I/O- Mem- BusMaster- SpecCycle- MemWINV- VGASnoop-
ParErr- Stepping- SERR- FastB2B-
Status: Cap- 66Mhz- UDF- FastB2B+ ParErr- DEVSEL=medium >TAbort-
<TAbort- <MAbort- >SERR- <PERR-
00:12.0 Ethernet controller: Realtek Semiconductor Co., Ltd. RTL-8139
(rev 10)
Subsystem: Realtek Semiconductor Co., Ltd. RT8139
Control: I/O+ Mem+ BusMaster+ SpecCycle- MemWINV- VGASnoop-
ParErr- Stepping- SERR- FastB2B-
Status: Cap+ 66Mhz- UDF- FastB2B+ ParErr- DEVSEL=medium >TAbort-
<TAbort- <MAbort- >SERR- <PERR+
Latency: 64 (8000ns min, 16000ns max)
Interrupt: pin A routed to IRQ 11
Region 0: I/O ports at d800 [size=256]
Region 1: Memory at e4000000 (32-bit, non-prefetchable) [size=256]
Capabilities: [50] Power Management version 2
Flags: PMEClk- DSI- D1+ D2+ AuxCurrent=0mA
PME(D0-,D1+,D2+,D3hot+,D3cold-)
Status: D0 PME-Enable- DSel=0 DScale=0 PME-
00:13.0 Network controller: AVM Audiovisuelles MKTG & Computer System
GmbH A1 ISDN [Fritz] (rev 02)
Subsystem: AVM Audiovisuelles MKTG & Computer System GmbH
FRITZ!Card ISDN Controller
Control: I/O+ Mem+ BusMaster- SpecCycle- MemWINV- VGASnoop-
ParErr- Stepping- SERR- FastB2B-
Status: Cap- 66Mhz- UDF- FastB2B+ ParErr- DEVSEL=medium >TAbort-
<TAbort- <MAbort- >SERR- <PERR-
Interrupt: pin A routed to IRQ 5
Region 0: Memory at e4001000 (32-bit, non-prefetchable) [size=32]
Region 1: I/O ports at dc00 [size=32]
00:14.0 Ethernet controller: Realtek Semiconductor Co., Ltd. RTL-8029(AS)
Subsystem: Realtek Semiconductor Co., Ltd. RT8029(AS)
Control: I/O+ Mem+ BusMaster- SpecCycle- MemWINV- VGASnoop-
ParErr- Stepping- SERR- FastB2B-
Status: Cap- 66Mhz- UDF- FastB2B- ParErr- DEVSEL=medium >TAbort-
<TAbort- <MAbort- >SERR- <PERR-
Interrupt: pin A routed to IRQ 7
Region 0: I/O ports at e000 [size=32]
cat /proc/scsi/scsi output:
Attached devices:
Host: scsi0 Channel: 00 Id: 00 Lun: 00
Vendor: HP Model: CD-Writer+ 8100 Rev: 1.0g
Type: CD-ROM ANSI SCSI revision: 02
/var/log/messages
Apr 18 15:06:18 filez kernel: scsi : aborting command due to timeout :
pid 0, scsi0, channel 0, id 0, lun 0 0x03 00 00 00 40 00
Apr 18 15:06:20 filez kernel: hdd: irq timeout: status=0xd0 { Busy }
Apr 18 15:06:20 filez kernel: hdd: ATAPI reset complete
Apr 18 15:06:20 filez kernel: hdd: irq timeout: status=0x80 { Busy }
Apr 18 15:06:20 filez kernel: hdd: ATAPI reset complete
Apr 18 15:06:21 filez kernel: hdd: irq timeout: status=0x80 { Busy }
Apr 18 15:06:21 filez kernel: hdd: status error: status=0x08 { DataRequest }
Apr 18 15:06:21 filez kernel: hdd: drive not ready for command
Apr 18 15:06:21 filez kernel: SCSI cdrom error : host 0 channel 0 id 0
lun 0 return code = 27000002
Apr 18 15:06:21 filez kernel: I/O error: dev 0b:00, sector 5780
Apr 18 15:06:51 filez kernel: scsi : aborting command due to timeout :
pid 0, scsi0, channel 0, id 0, lun 0 0x28 00 00 00 05 24 00 00 39 00
Apr 18 15:06:51 filez kernel: SCSI host 0 abort (pid 0) timed out -
resetting
Apr 18 15:06:51 filez kernel: SCSI bus is being reset for host 0 channel 0.
Apr 18 15:06:51 filez kernel: scsi : aborting command due to timeout :
pid 0, scsi0, channel 0, id 0, lun 0 0x28 00 00 00 05 24 00 00 39 00
Apr 18 15:06:51 filez kernel: SCSI host 0 abort (pid 0) timed out -
resetting
Apr 18 15:06:51 filez kernel: SCSI bus is being reset for host 0 channel 0.
Apr 18 15:06:52 filez kernel: scsi : aborting command due to timeout :
pid 0, scsi0, channel 0, id 0, lun 0 0x28 00 00 00 05 24 00 00 39 00
Apr 18 15:06:52 filez kernel: SCSI host 0 abort (pid 0) timed out -
resetting
Apr 18 15:06:52 filez kernel: SCSI bus is being reset for host 0 channel 0.
Apr 18 15:06:52 filez kernel: scsi : aborting command due to timeout :
pid 0, scsi0, channel 0, id 0, lun 0 0x28 00 00 00 05 24 00 00 39 00
Apr 18 15:06:52 filez kernel: SCSI host 0 abort (pid 0) timed out -
resetting
Apr 18 15:06:52 filez kernel: SCSI bus is being reset for host 0 channel 0.
Apr 18 15:06:53 filez kernel: scsi : aborting command due to timeout :
pid 0, scsi0, channel 0, id 0, lun 0 0x28 00 00 00 05 24 00 00 39 00
Apr 18 15:06:53 filez kernel: SCSI host 0 abort (pid 0) timed out -
resetting
Apr 18 15:06:53 filez kernel: SCSI bus is being reset for host 0 channel 0.
Apr 18 15:06:53 filez kernel: scsi : aborting command due to timeout :
pid 0, scsi0, channel 0, id 0, lun 0 0x28 00 00 00 05 24 00 00 39 00
Apr 18 15:06:53 filez kernel: SCSI host 0 abort (pid 0) timed out -
resetting
Apr 18 15:06:53 filez kernel: SCSI bus is being reset for host 0 channel 0.
Apr 18 15:06:54 filez kernel: scsi : aborting command due to timeout :
pid 0, scsi0, channel 0, id 0, lun 0 0x28 00 00 00 05 24 00 00 39 00
Apr 18 15:06:54 filez kernel: SCSI host 0 abort (pid 0) timed out -
resetting
Apr 18 15:06:54 filez kernel: SCSI bus is being reset for host 0 channel 0.
Apr 18 15:06:54 filez kernel: scsi : aborting command due to timeout :
pid 0, scsi0, channel 0, id 0, lun 0 0x28 00 00 00 05 24 00 00 39 00
Apr 18 15:06:54 filez kernel: SCSI host 0 abort (pid 0) timed out -
resetting
Apr 18 15:06:54 filez kernel: SCSI bus is being reset for host 0 channel 0.
Apr 18 15:06:55 filez kernel: scsi : aborting command due to timeout :
pid 0, scsi0, channel 0, id 0, lun 0 0x28 00 00 00 05 24 00 00 39 00
Apr 18 15:06:55 filez kernel: SCSI host 0 abort (pid 0) timed out -
resetting
Apr 18 15:06:55 filez kernel: SCSI bus is being reset for host 0 channel 0.
Apr 18 15:06:55 filez kernel: scsi : aborting command due to timeout :
pid 0, scsi0, channel 0, id 0, lun 0 0x28 00 00 00 05 24 00 00 39 00
Apr 18 15:06:55 filez kernel: SCSI host 0 abort (pid 0) timed out -
resetting
Apr 18 15:06:55 filez kernel: SCSI bus is being reset for host 0 channel 0.
Apr 18 15:06:56 filez kernel: scsi : aborting command due to timeout :
pid 0, scsi0, channel 0, id 0, lun 0 0x28 00 00 00 05 24 00 00 39 00
Apr 18 15:06:56 filez kernel: SCSI host 0 abort (pid 0) timed out -
resetting
Apr 18 15:06:56 filez kernel: SCSI bus is being reset for host 0 channel 0.
Apr 18 15:06:56 filez kernel: scsi : aborting command due to timeout :
pid 0, scsi0, channel 0, id 0, lun 0 0x28 00 00 00 05 24 00 00 39 00
Apr 18 15:06:56 filez kernel: SCSI host 0 abort (pid 0) timed out -
resetting
Apr 18 15:06:56 filez kernel: SCSI bus is being reset for host 0 channel 0.
Apr 18 15:06:57 filez kernel: scsi : aborting command due to timeout :
pid 0, scsi0, channel 0, id 0, lun 0 0x28 00 00 00 05 24 00 00 39 00
Apr 18 15:06:57 filez kernel: SCSI host 0 abort (pid 0) timed out -
resetting
Apr 18 15:06:57 filez kernel: SCSI bus is being reset for host 0 channel 0.
Apr 18 15:06:59 filez kernel: scsi : aborting command due to timeout :
pid 0, scsi0, channel 0, id 0, lun 0 0x03 00 00 00 40 00
Apr 18 15:06:59 filez kernel: hdd: irq timeout: status=0xd0 { Busy }
Apr 18 15:06:59 filez kernel: hdd: ATAPI reset complete
Apr 18 15:06:59 filez kernel: hdd: irq timeout: status=0x80 { Busy }
Apr 18 15:06:59 filez kernel: hdd: ATAPI reset complete
Have i forget something ? If yes just Mail me
Frederik Reiss
^ permalink raw reply [flat|nested] 37+ messages in thread* Re: A CD with errors (scratches etc.) blocks the whole system while reading damadged files 2002-04-18 13:13 Dr. Death @ 2002-04-19 14:14 ` Richard B. Johnson 2002-04-19 14:22 ` Kent Borg ` (4 more replies) 2002-04-19 20:01 ` Erik Andersen 1 sibling, 5 replies; 37+ messages in thread From: Richard B. Johnson @ 2002-04-19 14:14 UTC (permalink / raw) To: Dr. Death; +Cc: linux-kernel On Thu, 18 Apr 2002, Dr. Death wrote: > Problem: > > I use SuSE Linux 7.2 and when I create md5sums from damaged files on a > CD, the WHOLE system freezes or is ugly slow untill md5 has passed the > damaged part of the file ! > So what do you suggest? You can see from the logs that the device is having difficulty reading your damaged CD. You can do what Windows-95 does (ignore the errors and pretend everything is fine), or what Windows-98 and Windows-2000/Prof does (blue-screen, and re-boot), or you can try like hell to read the files like Linux does. What do you suggest? Cheers, Dick Johnson Penguin : Linux version 2.4.18 on an i686 machine (797.90 BogoMips). Windows-2000/Professional isn't. ^ permalink raw reply [flat|nested] 37+ messages in thread
* Re: A CD with errors (scratches etc.) blocks the whole system while reading damadged files 2002-04-19 14:14 ` Richard B. Johnson @ 2002-04-19 14:22 ` Kent Borg 2002-04-19 14:46 ` Richard B. Johnson 2002-04-19 14:28 ` Darrell Wright ` (3 subsequent siblings) 4 siblings, 1 reply; 37+ messages in thread From: Kent Borg @ 2002-04-19 14:22 UTC (permalink / raw) To: Richard B. Johnson; +Cc: Dr. Death, linux-kernel On Fri, Apr 19, 2002 at 10:14:41AM -0400, Richard B. Johnson wrote: > On Thu, 18 Apr 2002, Dr. Death wrote: > > > Problem: > > > > I use SuSE Linux 7.2 and when I create md5sums from damaged files on a > > CD, the WHOLE system freezes or is ugly slow untill md5 has passed the > > damaged part of the file ! > > > > So what do you suggest? You can see from the logs that the device > is having difficulty reading your damaged CD. You can do what > Windows-95 does (ignore the errors and pretend everything is fine), > or what Windows-98 and Windows-2000/Prof does (blue-screen, and re-boot), > or you can try like hell to read the files like Linux does. What do you > suggest? You didn't ask me, but I would still suggest that it would be nice if the whole system didn't come to a near halt. -kb, the Kent who wonders of the preemption patch might help here. ^ permalink raw reply [flat|nested] 37+ messages in thread
* Re: A CD with errors (scratches etc.) blocks the whole system while reading damadged files 2002-04-19 14:22 ` Kent Borg @ 2002-04-19 14:46 ` Richard B. Johnson 0 siblings, 0 replies; 37+ messages in thread From: Richard B. Johnson @ 2002-04-19 14:46 UTC (permalink / raw) To: Kent Borg; +Cc: Dr. Death, linux-kernel On Fri, 19 Apr 2002, Kent Borg wrote: > On Fri, Apr 19, 2002 at 10:14:41AM -0400, Richard B. Johnson wrote: > > On Thu, 18 Apr 2002, Dr. Death wrote: > > > > > Problem: > > > > > > I use SuSE Linux 7.2 and when I create md5sums from damaged files on a > > > CD, the WHOLE system freezes or is ugly slow untill md5 has passed the > > > damaged part of the file ! > > > > > > > So what do you suggest? You can see from the logs that the device > > is having difficulty reading your damaged CD. You can do what > > Windows-95 does (ignore the errors and pretend everything is fine), > > or what Windows-98 and Windows-2000/Prof does (blue-screen, and re-boot), > > or you can try like hell to read the files like Linux does. What do you > > suggest? > > You didn't ask me, but I would still suggest that it would be nice if > the whole system didn't come to a near halt. > Some time-outs are enforced by hardware, some errors even result in a bus reset in which all the devices reload their firmware, etc. Therefore time-outs are made quite long so one has to wait for a relatively long to either retry or to give up upon an error. It is possible to dynamically determine the kind of time-out necessary for a particular error in a particular device. This could be determined, for instance, when a device is mounted or otherwise first accessed. However, then there would be complaints about the amount of time necessary to mount a device, etc. So, basically, the compromise seems to be that bad devices or bad media result in long-time retries. Note, if you have another task, that is not trying to use the errored device or its bus, it does not get starved for CPU time. The kernel still sleeps while waiting for pass/fail interrupts, therefore giving the CPU to somebody. Of course this doesn't help the task that's accessing the device, and to that task the system seems to, as you say, come to a near halt. FYI "Dr. Death" is phony and email to that node gets bounced. Cheers, Dick Johnson Penguin : Linux version 2.4.18 on an i686 machine (797.90 BogoMips). Windows-2000/Professional isn't. ^ permalink raw reply [flat|nested] 37+ messages in thread
* Re: A CD with errors (scratches etc.) blocks the whole system while reading damadged files 2002-04-19 14:14 ` Richard B. Johnson 2002-04-19 14:22 ` Kent Borg @ 2002-04-19 14:28 ` Darrell Wright 2002-04-19 14:36 ` dr john halewood ` (2 subsequent siblings) 4 siblings, 0 replies; 37+ messages in thread From: Darrell Wright @ 2002-04-19 14:28 UTC (permalink / raw) To: linux-kernel; +Cc: root I agree, I frequently find myself getting files off damages cd's that others running windows cannot access. It takes a while, but I can usually get everything but the files on the damaged parts and even then I can get parts of them usually. Darrell On Fri, 2002-04-19 at 10:14, Richard B. Johnson wrote: > On Thu, 18 Apr 2002, Dr. Death wrote: > > > Problem: > > > > I use SuSE Linux 7.2 and when I create md5sums from damaged files on a > > CD, the WHOLE system freezes or is ugly slow untill md5 has passed the > > damaged part of the file ! > > > > So what do you suggest? You can see from the logs that the device > is having difficulty reading your damaged CD. You can do what > Windows-95 does (ignore the errors and pretend everything is fine), > or what Windows-98 and Windows-2000/Prof does (blue-screen, and re-boot), > or you can try like hell to read the files like Linux does. What do you > suggest? > > Cheers, > Dick Johnson > > Penguin : Linux version 2.4.18 on an i686 machine (797.90 BogoMips). > > Windows-2000/Professional isn't. > > - > To unsubscribe from this list: send the line "unsubscribe linux-kernel" in > the body of a message to majordomo@vger.kernel.org > More majordomo info at http://vger.kernel.org/majordomo-info.html > Please read the FAQ at http://www.tux.org/lkml/ ^ permalink raw reply [flat|nested] 37+ messages in thread
* Re: A CD with errors (scratches etc.) blocks the whole system while reading damadged files 2002-04-19 14:14 ` Richard B. Johnson 2002-04-19 14:22 ` Kent Borg 2002-04-19 14:28 ` Darrell Wright @ 2002-04-19 14:36 ` dr john halewood 2002-04-19 18:00 ` Oliver Xymoron 2002-04-23 22:34 ` Bill Davidsen 4 siblings, 0 replies; 37+ messages in thread From: dr john halewood @ 2002-04-19 14:36 UTC (permalink / raw) To: root, Dr. Death; +Cc: linux-kernel On Friday 19 April 2002 3:14 pm, Richard B. Johnson wrote: > So what do you suggest? You can see from the logs that the device > is having difficulty reading your damaged CD. You can do what > Windows-95 does (ignore the errors and pretend everything is fine), > or what Windows-98 and Windows-2000/Prof does (blue-screen, and re-boot), > or you can try like hell to read the files like Linux does. What do you > suggest? > Don't put them in an Xbox in the first place? (see http://www.newscientist.com/news/news.jsp?id=ns99992000) cheers john ^ permalink raw reply [flat|nested] 37+ messages in thread
* Re: A CD with errors (scratches etc.) blocks the whole system while reading damadged files 2002-04-19 14:14 ` Richard B. Johnson ` (2 preceding siblings ...) 2002-04-19 14:36 ` dr john halewood @ 2002-04-19 18:00 ` Oliver Xymoron 2002-04-23 22:34 ` Bill Davidsen 4 siblings, 0 replies; 37+ messages in thread From: Oliver Xymoron @ 2002-04-19 18:00 UTC (permalink / raw) To: Richard B. Johnson; +Cc: Dr. Death, linux-kernel On Fri, 19 Apr 2002, Richard B. Johnson wrote: > On Thu, 18 Apr 2002, Dr. Death wrote: > > > Problem: > > > > I use SuSE Linux 7.2 and when I create md5sums from damaged files on a > > CD, the WHOLE system freezes or is ugly slow untill md5 has passed the > > damaged part of the file ! > > > > So what do you suggest? You can see from the logs that the device > is having difficulty reading your damaged CD. You can do what > Windows-95 does (ignore the errors and pretend everything is fine), > or what Windows-98 and Windows-2000/Prof does (blue-screen, and re-boot), > or you can try like hell to read the files like Linux does. What do you > suggest? The problem is not that reading the disk is slow, it's that it brings the system to its knees. There are many valid scenarios where non-root users should be able to put CDs in a machine and they shouldn't be able to DoS it by doing so. Fact is the SCSI layer's error handling has been on the list of things in dire need of replacement for years and this is one of the many symptoms. -- "Love the dolphins," she advised him. "Write by W.A.S.T.E.." ^ permalink raw reply [flat|nested] 37+ messages in thread
* Re: A CD with errors (scratches etc.) blocks the whole system while reading damadged files 2002-04-19 14:14 ` Richard B. Johnson ` (3 preceding siblings ...) 2002-04-19 18:00 ` Oliver Xymoron @ 2002-04-23 22:34 ` Bill Davidsen 2002-04-24 12:31 ` Richard B. Johnson 4 siblings, 1 reply; 37+ messages in thread From: Bill Davidsen @ 2002-04-23 22:34 UTC (permalink / raw) To: Richard B. Johnson; +Cc: Linux Kernel Mailing List On Fri, 19 Apr 2002, Richard B. Johnson wrote: > On Thu, 18 Apr 2002, Dr. Death wrote: > > > Problem: > > > > I use SuSE Linux 7.2 and when I create md5sums from damaged files on a > > CD, the WHOLE system freezes or is ugly slow untill md5 has passed the > > damaged part of the file ! > > > > So what do you suggest? You can see from the logs that the device > is having difficulty reading your damaged CD. You can do what > Windows-95 does (ignore the errors and pretend everything is fine), > or what Windows-98 and Windows-2000/Prof does (blue-screen, and re-boot), > or you can try like hell to read the files like Linux does. What do you > suggest? Several things come to mind: 1 - don't dedicate the entire machine to retrying the error such that everything else runs slowly if at all. 2 - if the hardware returns an uncorrectable sector error that should be passed back to the user process rather than retried. An unconditional deep retry on an error the hardware labels as uncorrectable is not desirable, and not better than the Windows in most cases. I took a bottle cap to one of the morning's AOL CDs and then tried to read it. It's really not just annoying, it's pretty much useless. If you were staging software off a CD on a running server, your clients would NOT be happy! -- bill davidsen <davidsen@tmr.com> CTO, TMR Associates, Inc Doing interesting things with little computers since 1979. ^ permalink raw reply [flat|nested] 37+ messages in thread
* Re: A CD with errors (scratches etc.) blocks the whole system while reading damadged files 2002-04-23 22:34 ` Bill Davidsen @ 2002-04-24 12:31 ` Richard B. Johnson 2002-04-24 19:16 ` Bill Davidsen 0 siblings, 1 reply; 37+ messages in thread From: Richard B. Johnson @ 2002-04-24 12:31 UTC (permalink / raw) To: Bill Davidsen; +Cc: Linux Kernel Mailing List On Tue, 23 Apr 2002, Bill Davidsen wrote: > On Fri, 19 Apr 2002, Richard B. Johnson wrote: > > > On Thu, 18 Apr 2002, Dr. Death wrote: > > > > > Problem: > > > > > > I use SuSE Linux 7.2 and when I create md5sums from damaged files on a > > > CD, the WHOLE system freezes or is ugly slow untill md5 has passed the > > > damaged part of the file ! > > > > > > > So what do you suggest? You can see from the logs that the device > > is having difficulty reading your damaged CD. You can do what > > Windows-95 does (ignore the errors and pretend everything is fine), > > or what Windows-98 and Windows-2000/Prof does (blue-screen, and re-boot), > > or you can try like hell to read the files like Linux does. What do you > > suggest? > > Several things come to mind: > 1 - don't dedicate the entire machine to retrying the error such that > everything else runs slowly if at all. But it doesn't! As previously stated, if you have a device on a common 'channel' (like IDE), that everybody else is trying to use, then everybody else ends up waiting. However, if your errored devices don't take over a common I/O channel, everybody else gets the CPU while the errors are being retried. For instance, I have SCSI for my disks, and I use IDE for a R/W CD because it's cheap. I can "try forever" reading dorked CDs and the only process affected at all is the one trying to read the CD. I can do full-speed compiles while the CD is being retried. It's all about configuration. The kernel drivers sleep while waiting for interrupts that will determine the success or failure of the disk operation. The 'sleep' means that the CPU gets given to somebody who could use it. > 2 - if the hardware returns an uncorrectable sector error that should be > passed back to the user process rather than retried. An unconditional > deep retry on an error the hardware labels as uncorrectable is not > desirable, and not better than the Windows in most cases. > The problem is that the hardware usually waits for the worse-case time (disk spin-up time) to even report an error. It's not like you could somehow wait for one rev of the disk to determine if a sector could be read. The disk, itself, retries for a long time, then it reports a rather general-purpose error (media error on SCSI, bad sector on IDE, record not found, etc). > I took a bottle cap to one of the morning's AOL CDs and then tried to read > it. It's really not just annoying, it's pretty much useless. If you were > staging software off a CD on a running server, your clients would NOT be > happy! > Put your CDs on a different controller and you can do anything you want without affecting other tasks. Cheers, Dick Johnson Penguin : Linux version 2.4.18 on an i686 machine (797.90 BogoMips). Windows-2000/Professional isn't. ^ permalink raw reply [flat|nested] 37+ messages in thread
* Re: A CD with errors (scratches etc.) blocks the whole system while reading damadged files 2002-04-24 12:31 ` Richard B. Johnson @ 2002-04-24 19:16 ` Bill Davidsen 2002-04-24 19:52 ` Richard B. Johnson ` (2 more replies) 0 siblings, 3 replies; 37+ messages in thread From: Bill Davidsen @ 2002-04-24 19:16 UTC (permalink / raw) To: Richard B. Johnson; +Cc: Linux Kernel Mailing List On Wed, 24 Apr 2002, Richard B. Johnson wrote: > On Tue, 23 Apr 2002, Bill Davidsen wrote: > > Several things come to mind: > > 1 - don't dedicate the entire machine to retrying the error such that > > everything else runs slowly if at all. > > But it doesn't! As previously stated, if you have a device on a common > 'channel' (like IDE), that everybody else is trying to use, then > everybody else ends up waiting. However, if your errored devices don't > take over a common I/O channel, everybody else gets the CPU while the > errors are being retried. > > For instance, I have SCSI for my disks, and I use IDE for a R/W CD > because it's cheap. I can "try forever" reading dorked CDs and the > only process affected at all is the one trying to read the CD. I > can do full-speed compiles while the CD is being retried. That's very nice for a system where cost is no object, but ATAPI/IDE is where the bulk of Linux system are running. Putting the CD on another cable is realistic (the system I hung does that) but putting the CD on IDE and the disk on SCSI is not cost effective compared to fixing the hang in software. > It's all about configuration. The kernel drivers sleep while waiting > for interrupts that will determine the success or failure of the > disk operation. The 'sleep' means that the CPU gets given to somebody > who could use it. It would also be nice if the other IDE channels were given to "somebody who could use it," but that would appear in some cases not to happen. > > I took a bottle cap to one of the morning's AOL CDs and then tried to read > > it. It's really not just annoying, it's pretty much useless. If you were > > staging software off a CD on a running server, your clients would NOT be > > happy! > > > > Put your CDs on a different controller and you can do anything you > want without affecting other tasks. As above, another type of bus is not cost effective, another IDE cable doesn't solve the problem, no matter what theory says. -- bill davidsen <davidsen@tmr.com> CTO, TMR Associates, Inc Doing interesting things with little computers since 1979. ^ permalink raw reply [flat|nested] 37+ messages in thread
* Re: A CD with errors (scratches etc.) blocks the whole system while reading damadged files 2002-04-24 19:16 ` Bill Davidsen @ 2002-04-24 19:52 ` Richard B. Johnson 2002-04-24 22:58 ` Stephen Samuel 2002-04-26 10:16 ` Zwane Mwaikambo 2 siblings, 0 replies; 37+ messages in thread From: Richard B. Johnson @ 2002-04-24 19:52 UTC (permalink / raw) To: Bill Davidsen; +Cc: Linux Kernel Mailing List On Wed, 24 Apr 2002, Bill Davidsen wrote: > On Wed, 24 Apr 2002, Richard B. Johnson wrote: > > > On Tue, 23 Apr 2002, Bill Davidsen wrote: > > > > Several things come to mind: > > > 1 - don't dedicate the entire machine to retrying the error such that > > > everything else runs slowly if at all. > > > > But it doesn't! As previously stated, if you have a device on a common > > 'channel' (like IDE), that everybody else is trying to use, then > > everybody else ends up waiting. However, if your errored devices don't > > take over a common I/O channel, everybody else gets the CPU while the > > errors are being retried. > > > > For instance, I have SCSI for my disks, and I use IDE for a R/W CD > > because it's cheap. I can "try forever" reading dorked CDs and the > > only process affected at all is the one trying to read the CD. I > > can do full-speed compiles while the CD is being retried. > > That's very nice for a system where cost is no object, but ATAPI/IDE is > where the bulk of Linux system are running. Putting the CD on another > cable is realistic (the system I hung does that) but putting the CD on IDE > and the disk on SCSI is not cost effective compared to fixing the hang in > software. > It is NOT a hang in the software. IDE means Integrated Drive Electronics. The ONLY thing the CPU has to work with is the electronics IN the drive. It communicates with the drive from a port on the mother-board. There is no controller on the motherboard. There is absolutely nothing to isolate one drive from another. A single drive, doing its retries will "own" that drive-cable until it has either succeded or given up. The CPU software MUST NOT do anything on that port until the previous request was answered via interrupt. This means that if the CD is using the bus, no task can access any hard-disk drive that is on that same port. You can see that there is plenty of CPU time available by having one task do; while true; do echo "Hello World!"; done (before you access you damaged CD). Then, using another VT, access your damaged CD. When you switch to the 'Hello world' terminal, it's merrily spinning along, getting all the CPU time your other task would have gotten if the drive was readable. > > It's all about configuration. The kernel drivers sleep while waiting > > for interrupts that will determine the success or failure of the > > disk operation. The 'sleep' means that the CPU gets given to somebody > > who could use it. > > It would also be nice if the other IDE channels were given to "somebody > who could use it," but that would appear in some cases not to happen. > There are no other IDE channels to give up. The 'master/save' configuration should be labled 'cheaper' and the two/channel configuration should be labeled 'cheapest'. It's what you pay for. It has nothing to do with the kernel or its drivers. Recent work on IDE was an attempt to get DMA operations and other 'speed-up' operations to work better. They will never work well because IDE is not designed to work well. It's designed to simply exist. [SNIPPED...] Cheers, Dick Johnson Penguin : Linux version 2.4.1 on an i686 machine (797.90 BogoMips). Windows-2000/Professional isn't. ^ permalink raw reply [flat|nested] 37+ messages in thread
* Re: A CD with errors (scratches etc.) blocks the whole system while reading damadged files 2002-04-24 19:16 ` Bill Davidsen 2002-04-24 19:52 ` Richard B. Johnson @ 2002-04-24 22:58 ` Stephen Samuel 2002-04-25 3:33 ` Bill Davidsen 2002-04-26 10:16 ` Zwane Mwaikambo 2 siblings, 1 reply; 37+ messages in thread From: Stephen Samuel @ 2002-04-24 22:58 UTC (permalink / raw) To: linux-kernel I have a system with 2 hard disks (on separate controllers) and a CD-ROM on the second controller (shared with the disk that, among other things) handles /usr. I took an old data CD, scratched took a fork to it and mounted it. I then started up MMX playing 'Hotel California' and tried to wc(1) a 700K file on the CD.not too bad not too bad not too bad Hotel California played fine, but trying to do an 'ls' of /usr (same controller) took a LONG time..... (had to wait fnot too bad or the CD to release the controller). I could wc larg files in my /tmp directory, play music etc before that WC came back -- I could do anything I wanted, as long as I didn't need any data off of that second controller (e.g. loading programs in /usr would die, since that HD shares controller with the CD). Given that I rarely use my CD ROM, it's fine having / and /usr separated... On the other hand, if I was trying to read damaged CDs with any regularity, I'd be making sure that the CD ROM drive was sitting on it's own controller -- even if it meant putting all the other IO on the system onto one IDE drive/controller. > where the bulk of Linux system are running. Putting the CD on another > cable is realistic (the system I hung does that) but putting the CD on IDE > and the disk on SCSI is not cost effective compared to fixing the hang in > software. Note that this problem is a HARDWARE one -- not a software one. It's kinda like trying to cross a Singapore highway... You can do it faster, if you don't mind dealing with the nasty side of a (data) bus. (read: SPLAT) Bus error: car dumped. (and if you think Linux is bad, try doing the same thing in Windows!). -- Stephen Samuel +1(604)876-0426 samuel@bcgreen.com http://www.bcgreen.com/~samuel/ Powerful committed communication, reaching through fear, uncertainty and doubt to touch the jewel within each person and bring it to life. ^ permalink raw reply [flat|nested] 37+ messages in thread
* Re: A CD with errors (scratches etc.) blocks the whole system while reading damadged files 2002-04-24 22:58 ` Stephen Samuel @ 2002-04-25 3:33 ` Bill Davidsen 2002-04-26 4:04 ` Mike Fedyk 0 siblings, 1 reply; 37+ messages in thread From: Bill Davidsen @ 2002-04-25 3:33 UTC (permalink / raw) To: Stephen Samuel; +Cc: linux-kernel On Wed, 24 Apr 2002, Stephen Samuel wrote: > I said: > > where the bulk of Linux system are running. Putting the CD on another > > cable is realistic (the system I hung does that) but putting the CD on IDE > > and the disk on SCSI is not cost effective compared to fixing the hang in > > software. > > Note that this problem is a HARDWARE one -- not a software one. > It's kinda like trying to cross a Singapore highway... You can > do it faster, if you don't mind dealing with the nasty side of > a (data) bus. (read: SPLAT) I don't know how to say this any other way, reread my second sentence again. I have the disk and CD on separate cables, it still hangs. I NEVER mix a CD with anything and expect good response. The IDE devices are hanging, not just the one sharing the cable, nothing shares the cable but a ZIP drive I use a few times a year when someone sends me a ZIP, otherwise it's unused. The disk drive is all by itself, as I said the first time I mentioned this. I suspect (without having a good way to check) that all IDE devices sharing the IRQ with the error device *may* be affected. That's the only thing which comes to mind, I'll add a Promise controller and disk on a totally separate board and see if that changes anything. Hopefully it will not share the IRQ :-( -- bill davidsen <davidsen@tmr.com> CTO, TMR Associates, Inc Doing interesting things with little computers since 1979. ^ permalink raw reply [flat|nested] 37+ messages in thread
* Re: A CD with errors (scratches etc.) blocks the whole system while reading damadged files 2002-04-25 3:33 ` Bill Davidsen @ 2002-04-26 4:04 ` Mike Fedyk 2002-04-26 5:42 ` Erik Andersen ` (2 more replies) 0 siblings, 3 replies; 37+ messages in thread From: Mike Fedyk @ 2002-04-26 4:04 UTC (permalink / raw) To: Bill Davidsen; +Cc: Stephen Samuel, linux-kernel, Andre Hedrick On Wed, Apr 24, 2002 at 11:33:23PM -0400, Bill Davidsen wrote: > I suspect (without having a good way to check) that all IDE devices > sharing the IRQ with the error device *may* be affected. That's the only > thing which comes to mind, I'll add a Promise controller and disk on a > totally separate board and see if that changes anything. Hopefully it will > not share the IRQ :-( I don't think it has to do with the IRQs, but it sounds like the entire ide chipset (think two cables one one chipset...) has stopped responding when ONE device (out of a possible four (with two cables)) has failed media. Let's use an example to help shine the light on exactly what I'm saying (I'm trying to summarize what's been said in the threads, and I haven't tested this... though I will be working on such a system in the next few weeks): 1) Two drives each on a seperate cable, but on the same chipset: /dev/hda (hard drive) (chipset1) /dev/hdc (cd-rom) (chipset1) Put broken CD into /dev/hdc, and read somehow (dd, cat, whatever), now try to read from /dev/hda. This (according to this thread) should be damn slow and you will have a very hard time to use this system while it is trying to read the CD. 2) Two drives, each on a seperate cable and on different chipsets: /dev/hda (hard drive) (chipset1) /dev/hde (cd-rom) (chipset2) Put broken CD into /dev/hde, read it again, and try to read from /dev/hda. All should be good, with blue skies, and a responsive system. Can someone verify that the above is true, and acurately expresses what they've experienced? Also, can someone say for sure (Andre) that this is a hardware limitation, not a Linux IDE locking problem, and with no possibility of a software work-around? Thanks, Mike ^ permalink raw reply [flat|nested] 37+ messages in thread
* Re: A CD with errors (scratches etc.) blocks the whole system while reading damadged files 2002-04-26 4:04 ` Mike Fedyk @ 2002-04-26 5:42 ` Erik Andersen 2002-04-26 7:35 ` Andre Hedrick 2002-04-26 15:23 ` Stephen Samuel 2 siblings, 0 replies; 37+ messages in thread From: Erik Andersen @ 2002-04-26 5:42 UTC (permalink / raw) To: Bill Davidsen, Stephen Samuel, linux-kernel, Andre Hedrick On Thu Apr 25, 2002 at 09:04:57PM -0700, Mike Fedyk wrote: > 1) > Two drives each on a seperate cable, but on the same chipset: > /dev/hda (hard drive) (chipset1) > /dev/hdc (cd-rom) (chipset1) > > Put broken CD into /dev/hdc, and read somehow (dd, cat, whatever), now try > to read from /dev/hda. This (according to this thread) should be damn slow > and you will have a very hard time to use this system while it is trying to > read the CD. This has not been my experience. Reading from hda continues to work as expected. But the process reading from hdc stays stuck in D state for a _long_ time.... A kill -9 takes like 10 minutes before it gets around to actually killing anything. > 2) > Two drives, each on a seperate cable and on different chipsets: > /dev/hda (hard drive) (chipset1) > /dev/hde (cd-rom) (chipset2) > > Put broken CD into /dev/hde, read it again, and try to read from /dev/hda. > All should be good, with blue skies, and a responsive system. Sure. Same as above. > Also, can someone say for sure (Andre) that this is a hardware limitation, > not a Linux IDE locking problem, and with no possibility of a software > work-around? There is a certain amount of delay when a drive hits a bad sector. But Linux handles things pretty badly IMHO, and could do a much better job. -Erik -- Erik B. Andersen http://codepoet-consulting.com/ --This message was written using 73% post-consumer electrons-- ^ permalink raw reply [flat|nested] 37+ messages in thread
* Re: A CD with errors (scratches etc.) blocks the whole system while reading damadged files 2002-04-26 4:04 ` Mike Fedyk 2002-04-26 5:42 ` Erik Andersen @ 2002-04-26 7:35 ` Andre Hedrick 2002-04-25 20:50 ` Pavel Machek ` (2 more replies) 2002-04-26 15:23 ` Stephen Samuel 2 siblings, 3 replies; 37+ messages in thread From: Andre Hedrick @ 2002-04-26 7:35 UTC (permalink / raw) To: Mike Fedyk; +Cc: Bill Davidsen, Stephen Samuel, linux-kernel Basically it is a global design flaw from the beginning, and since I have only 2.4 to address it is a real nasty! Short version, each subdriver personally does not do unique error handling. Thus a the simple good/bad approach to a darwin world has come to bite hard now. There is a failure to address error/sense decoding based on the operations requested to perform. Second the mainloop is ATA/IDE centered for all events and this is in proccess to be fixed for 2.4 soon. Third requires all ATAPI to decode wrt to primary opcode executed and sense of the preferred event tables and not the generic catch all. It is a blood mess, and difficult to describe over email :-/ (for me). Cheers, Andre Hedrick LAD Storage Consulting Group PS Mike, "Mr. Hedrick" was my genetic donor, "Andre" is what I answer too. On Thu, 25 Apr 2002, Mike Fedyk wrote: > On Wed, Apr 24, 2002 at 11:33:23PM -0400, Bill Davidsen wrote: > > I suspect (without having a good way to check) that all IDE devices > > sharing the IRQ with the error device *may* be affected. That's the only > > thing which comes to mind, I'll add a Promise controller and disk on a > > totally separate board and see if that changes anything. Hopefully it will > > not share the IRQ :-( > > I don't think it has to do with the IRQs, but it sounds like the entire ide > chipset (think two cables one one chipset...) has stopped responding when > ONE device (out of a possible four (with two cables)) has failed media. > > Let's use an example to help shine the light on exactly what I'm saying (I'm > trying to summarize what's been said in the threads, and I haven't tested > this... though I will be working on such a system in the next few weeks): > > 1) > Two drives each on a seperate cable, but on the same chipset: > /dev/hda (hard drive) (chipset1) > /dev/hdc (cd-rom) (chipset1) > > Put broken CD into /dev/hdc, and read somehow (dd, cat, whatever), now try > to read from /dev/hda. This (according to this thread) should be damn slow > and you will have a very hard time to use this system while it is trying to > read the CD. > > 2) > Two drives, each on a seperate cable and on different chipsets: > /dev/hda (hard drive) (chipset1) > /dev/hde (cd-rom) (chipset2) > > Put broken CD into /dev/hde, read it again, and try to read from /dev/hda. > All should be good, with blue skies, and a responsive system. > > Can someone verify that the above is true, and acurately expresses > what they've experienced? > > Also, can someone say for sure (Andre) that this is a hardware limitation, > not a Linux IDE locking problem, and with no possibility of a software > work-around? > > Thanks, > > Mike > ^ permalink raw reply [flat|nested] 37+ messages in thread
* Re: A CD with errors (scratches etc.) blocks the whole system while reading damadged files 2002-04-26 7:35 ` Andre Hedrick @ 2002-04-25 20:50 ` Pavel Machek 2002-04-28 2:18 ` Andre Hedrick 2002-04-26 15:48 ` Mike Fedyk 2002-04-29 21:46 ` Bill Davidsen 2 siblings, 1 reply; 37+ messages in thread From: Pavel Machek @ 2002-04-25 20:50 UTC (permalink / raw) To: Andre Hedrick; +Cc: Mike Fedyk, Bill Davidsen, Stephen Samuel, linux-kernel Hi! > Basically it is a global design flaw from the beginning, and since I have > only 2.4 to address it is a real nasty! Short version, each subdriver Well, noone prevents you from submitting 2.5 patches to Martin.... Actually, his cleanups maybe made that job easier. Pavel -- Philips Velo 1: 1"x4"x8", 300gram, 60, 12MB, 40bogomips, linux, mutt, details at http://atrey.karlin.mff.cuni.cz/~pavel/velo/index.html. ^ permalink raw reply [flat|nested] 37+ messages in thread
* Re: A CD with errors (scratches etc.) blocks the whole system while reading damadged files 2002-04-25 20:50 ` Pavel Machek @ 2002-04-28 2:18 ` Andre Hedrick 0 siblings, 0 replies; 37+ messages in thread From: Andre Hedrick @ 2002-04-28 2:18 UTC (permalink / raw) To: linux-kernel Well I really do not have the time to follow the changes, but when I fix it in 2.4, please copy a and credit would be nice but not expected. Also it made resulted in a more complex mess to address. Regards, Andre Hedrick LAD Storage Consulting Group On Thu, 25 Apr 2002, Pavel Machek wrote: > Hi! > > > Basically it is a global design flaw from the beginning, and since I have > > only 2.4 to address it is a real nasty! Short version, each subdriver > > Well, noone prevents you from submitting 2.5 patches to Martin.... Actually, > his cleanups maybe made that job easier. > > Pavel > -- > Philips Velo 1: 1"x4"x8", 300gram, 60, 12MB, 40bogomips, linux, mutt, > details at http://atrey.karlin.mff.cuni.cz/~pavel/velo/index.html. > ^ permalink raw reply [flat|nested] 37+ messages in thread
* Re: A CD with errors (scratches etc.) blocks the whole system while reading damadged files 2002-04-26 7:35 ` Andre Hedrick 2002-04-25 20:50 ` Pavel Machek @ 2002-04-26 15:48 ` Mike Fedyk 2002-04-29 21:46 ` Bill Davidsen 2 siblings, 0 replies; 37+ messages in thread From: Mike Fedyk @ 2002-04-26 15:48 UTC (permalink / raw) To: Andre Hedrick; +Cc: Bill Davidsen, Stephen Samuel, linux-kernel On Fri, Apr 26, 2002 at 12:35:55AM -0700, Andre Hedrick wrote: > > Basically it is a global design flaw from the beginning, and since I have > only 2.4 to address it is a real nasty! Short version, each subdriver > personally does not do unique error handling. Thus a the simple good/bad > approach to a darwin world has come to bite hard now. There is a failure > to address error/sense decoding based on the operations requested to > perform. Second the mainloop is ATA/IDE centered for all events and this > is in proccess to be fixed for 2.4 soon. Third requires all ATAPI to > decode wrt to primary opcode executed and sense of the preferred event > tables and not the generic catch all. > > It is a blood mess, and difficult to describe over email :-/ (for me). > Ok, so there is hope for a fix. Andre, when you have the patches available, I'm sure meny people from this thread would be willing to help test, just announce. Is there a place where you keep your latest patches with a little documentation on the purpose of the changes? > Cheers, > > Andre Hedrick > LAD Storage Consulting Group > > PS Mike, "Mr. Hedrick" was my genetic donor, "Andre" is what I answer too. > ?? I think you're thinking of someone else. Read below, I addressed you as "Andre", and IIRC always have. I understand personally the "genetic donor" problem though. Mike > > Also, can someone say for sure (Andre) that this is a hardware limitation, > > not a Linux IDE locking problem, and with no possibility of a software > > work-around? > > > > Thanks, > > > > Mike ^ permalink raw reply [flat|nested] 37+ messages in thread
* Re: A CD with errors (scratches etc.) blocks the whole system while reading damadged files 2002-04-26 7:35 ` Andre Hedrick 2002-04-25 20:50 ` Pavel Machek 2002-04-26 15:48 ` Mike Fedyk @ 2002-04-29 21:46 ` Bill Davidsen 2002-05-02 3:45 ` Mike Fedyk 2 siblings, 1 reply; 37+ messages in thread From: Bill Davidsen @ 2002-04-29 21:46 UTC (permalink / raw) To: Andre Hedrick; +Cc: Mike Fedyk, Stephen Samuel, linux-kernel On Fri, 26 Apr 2002, Andre Hedrick wrote: > > Basically it is a global design flaw from the beginning, and since I have > only 2.4 to address it is a real nasty! Short version, each subdriver > personally does not do unique error handling. I'm snipping because obviously lots of folks are reading the previous posts. Since you clearly can see the issue, I will let this thread rest in hopes that we will have a patch to try and that it will make the whole problem shrink if not vanish. It seems the casual assumption that anyone who has a problem must be putting both devices on the same cable has been laid to rest, time to wait for new data and/or patches. I will try again next weekend to test the same problem using a totally separate (Promise) controller for the CD. -- bill davidsen <davidsen@tmr.com> CTO, TMR Associates, Inc Doing interesting things with little computers since 1979. ^ permalink raw reply [flat|nested] 37+ messages in thread
* Re: A CD with errors (scratches etc.) blocks the whole system while reading damadged files 2002-04-29 21:46 ` Bill Davidsen @ 2002-05-02 3:45 ` Mike Fedyk 2002-05-02 8:26 ` Stephen Samuel 0 siblings, 1 reply; 37+ messages in thread From: Mike Fedyk @ 2002-05-02 3:45 UTC (permalink / raw) To: Bill Davidsen; +Cc: Andre Hedrick, Stephen Samuel, linux-kernel On Mon, Apr 29, 2002 at 05:46:28PM -0400, Bill Davidsen wrote: > On Fri, 26 Apr 2002, Andre Hedrick wrote: > > > > > Basically it is a global design flaw from the beginning, and since I have > > only 2.4 to address it is a real nasty! Short version, each subdriver > > personally does not do unique error handling. > > I'm snipping because obviously lots of folks are reading the previous > posts. Since you clearly can see the issue, I will let this thread rest in > hopes that we will have a patch to try and that it will make the whole > problem shrink if not vanish. > > It seems the casual assumption that anyone who has a problem must be > putting both devices on the same cable has been laid to rest, time to wait > for new data and/or patches. I will try again next weekend to test the > same problem using a totally separate (Promise) controller for the CD. I found a CD that was having trouble reading, and put it on a 12x cd-rom drive at /dev/hdc (hda - d = PIIX3). The hard drive is on the Promise controller (hde - h = Promise 20262). Currently there are only two drives in this system. I ran: while true; do find / -xdev -type f -print0 |xargs -0 cat > /dev/null; done vmstat 1 > /dev/tty9 Then I ran `dd conv=noerror < /dev/hdc > /dev/null` and saw many resets and errors. Now while this is happening and I'm listening to the hard drive tick away, and hear the cd-rom drive trying to read the CD, and watching the vmstat output. All is working great and smooth. Now, I moved the cd-rom drive over to the promise controller (/dev/hdg, that's on the *same* controller, but *different* cables (remember, there's no hope if it's on the same cable as that's an IDE hardware design issue)) and reran the above test, and it was smooth too. This is with 2.4.19-pre6-vm33, I haven't tested with other kernels. I'll be adding 3 zip100s, 1 zip250, 1 cd-rw, and 1 ls-120 drive to this system over the next few days as time permits, so I'll do more testing and report. As you can see, it'll be a big issue on this system if one drive dominates the entire system. Can someone else test some other kernel versions/trees? I'd like this to see more systematic testing so we can see what exactly triggers the problem. Oh BTW, I have a cmd649 ide controller I can test too, and in a month or three I'll be able to test on a via chipset (but with an intel processor though). Mike lspci: 00:07.1 IDE interface: Intel Corp. 82371SB PIIX3 IDE [Natoma/Triton II] 00:0b.0 Unknown mass storage controller: Promise Technology, Inc. 20262 (rev 01) hde: IBM-DPTA-372730, ATA DISK drive hdg: TOSHIBA CD-ROM XM-5702B, ATAPI CD/DVD-ROM drive ^ permalink raw reply [flat|nested] 37+ messages in thread
* Re: A CD with errors (scratches etc.) blocks the whole system while reading damadged files 2002-05-02 3:45 ` Mike Fedyk @ 2002-05-02 8:26 ` Stephen Samuel 2002-05-02 8:49 ` Xavier Bestel 2002-05-02 15:40 ` Mike Fedyk 0 siblings, 2 replies; 37+ messages in thread From: Stephen Samuel @ 2002-05-02 8:26 UTC (permalink / raw) To: Mike Fedyk; +Cc: Bill Davidsen, Andre Hedrick, linux-kernel I ran a similar type of test on a 2.4.9.31 (redhat 7.1 ) kernel. With the CD on HDD, I could read off of HDA just peachy while the system was choking on a scratched (aol) cd. I did a WC of a 300MB file (only 256MB of ram on the system, so that's guaranteed to not fit in any cache). Times to read the file were statistically equivalent whether the system was choking on the CD or not. -- Stephen Samuel +1(604)876-0426 samuel@bcgreen.com http://www.bcgreen.com/~samuel/ Powerful committed communication, reaching through fear, uncertainty and doubt to touch the jewel within each person and bring it to life. ^ permalink raw reply [flat|nested] 37+ messages in thread
* Re: A CD with errors (scratches etc.) blocks the whole system while reading damadged files 2002-05-02 8:26 ` Stephen Samuel @ 2002-05-02 8:49 ` Xavier Bestel 2002-05-02 8:43 ` Zwane Mwaikambo 2002-05-02 15:48 ` Mike Fedyk 2002-05-02 15:40 ` Mike Fedyk 1 sibling, 2 replies; 37+ messages in thread From: Xavier Bestel @ 2002-05-02 8:49 UTC (permalink / raw) To: Stephen Samuel Cc: Mike Fedyk, Bill Davidsen, Andre Hedrick, Linux Kernel Mailing List Le jeu 02/05/2002 à 10:26, Stephen Samuel a écrit : > I ran a similar type of test on a 2.4.9.31 (redhat 7.1 ) kernel. > With the CD on HDD, I could read off of HDA just peachy while > the system was choking on a scratched (aol) cd. The "system grinding to a halt" happens to me too, when *ripping* scratched cds. Note that it's when using *userspace* access to the block device, with e.g. cdparanoia or grip (or dvd ripping tools). My DVD drive is on a VT82C693A/694x (ABit VP6). ^ permalink raw reply [flat|nested] 37+ messages in thread
* Re: A CD with errors (scratches etc.) blocks the whole system while reading damadged files 2002-05-02 8:49 ` Xavier Bestel @ 2002-05-02 8:43 ` Zwane Mwaikambo 2002-05-02 9:12 ` Xavier Bestel 2002-05-02 15:48 ` Mike Fedyk 1 sibling, 1 reply; 37+ messages in thread From: Zwane Mwaikambo @ 2002-05-02 8:43 UTC (permalink / raw) To: Xavier Bestel Cc: Stephen Samuel, Mike Fedyk, Bill Davidsen, Andre Hedrick, Linux Kernel Mailing List On 2 May 2002, Xavier Bestel wrote: > The "system grinding to a halt" happens to me too, when *ripping* > scratched cds. Note that it's when using *userspace* access to the block > device, with e.g. cdparanoia or grip (or dvd ripping tools). What does your system time usage look like? -- http://function.linuxpower.ca ^ permalink raw reply [flat|nested] 37+ messages in thread
* Re: A CD with errors (scratches etc.) blocks the whole system while reading damadged files 2002-05-02 8:43 ` Zwane Mwaikambo @ 2002-05-02 9:12 ` Xavier Bestel 0 siblings, 0 replies; 37+ messages in thread From: Xavier Bestel @ 2002-05-02 9:12 UTC (permalink / raw) To: Zwane Mwaikambo Cc: Stephen Samuel, Mike Fedyk, Bill Davidsen, Andre Hedrick, Linux Kernel Mailing List Le jeu 02/05/2002 à 10:43, Zwane Mwaikambo a écrit : > On 2 May 2002, Xavier Bestel wrote: > > > The "system grinding to a halt" happens to me too, when *ripping* > > scratched cds. Note that it's when using *userspace* access to the block > > device, with e.g. cdparanoia or grip (or dvd ripping tools). > > What does your system time usage look like? IIRC (don't have a scratched cd at hand) nearly 100% system time (nearly 50% in fact cause it's a dual-proc) Xav ^ permalink raw reply [flat|nested] 37+ messages in thread
* Re: A CD with errors (scratches etc.) blocks the whole system while reading damadged files 2002-05-02 8:49 ` Xavier Bestel 2002-05-02 8:43 ` Zwane Mwaikambo @ 2002-05-02 15:48 ` Mike Fedyk 2002-05-03 7:14 ` Xavier Bestel 1 sibling, 1 reply; 37+ messages in thread From: Mike Fedyk @ 2002-05-02 15:48 UTC (permalink / raw) To: Xavier Bestel Cc: Stephen Samuel, Bill Davidsen, Andre Hedrick, Linux Kernel Mailing List On Thu, May 02, 2002 at 10:49:50AM +0200, Xavier Bestel wrote: > Le jeu 02/05/2002 ? 10:26, Stephen Samuel a ?crit : > > I ran a similar type of test on a 2.4.9.31 (redhat 7.1 ) kernel. > > With the CD on HDD, I could read off of HDA just peachy while > > the system was choking on a scratched (aol) cd. > > The "system grinding to a halt" happens to me too, when *ripping* > scratched cds. Note that it's when using *userspace* access to the block > device, with e.g. cdparanoia or grip (or dvd ripping tools). > > My DVD drive is on a VT82C693A/694x (ABit VP6). Ahh, that's a good distinction to make. I'll be testing CD ripping on this machine eventually too. Are you burning CDs at the same time as ripping? If so, try doing that in a two step process and see where the problem occours. Can you post the full output of lspci? Be sure to mention what drives are on which controller (if you have two or more). Can you post the kernel output from dmesg when it detects your drives? Do you have any ISA devices in this system? Can you post the commands it takes to reproduce the condition? Also, `vmstat 1` output while the system is unresponsive would be good too. Thanks, Mike ^ permalink raw reply [flat|nested] 37+ messages in thread
* Re: A CD with errors (scratches etc.) blocks the whole system while reading damadged files 2002-05-02 15:48 ` Mike Fedyk @ 2002-05-03 7:14 ` Xavier Bestel 0 siblings, 0 replies; 37+ messages in thread From: Xavier Bestel @ 2002-05-03 7:14 UTC (permalink / raw) To: Mike Fedyk Cc: Stephen Samuel, Bill Davidsen, Andre Hedrick, Linux Kernel Mailing List Le jeu 02/05/2002 à 17:48, Mike Fedyk a écrit : > > The "system grinding to a halt" happens to me too, when *ripping* > > scratched cds. Note that it's when using *userspace* access to the block > > device, with e.g. cdparanoia or grip (or dvd ripping tools). > > > > My DVD drive is on a VT82C693A/694x (ABit VP6). > > Ahh, that's a good distinction to make. I'll be testing CD ripping on this > machine eventually too. > > Are you burning CDs at the same time as ripping? If so, try doing that in a > two step process and see where the problem occours. Well, I tried once and lost the CDR ... > Can you post the full output of lspci? Be sure to mention what drives are > on which controller (if you have two or more). > > Can you post the kernel output from dmesg when it detects your drives? > > Do you have any ISA devices in this system? A sound card and the legacy motherboard devices. > Can you post the commands it takes to reproduce the condition? Well, I use Grip (a gtk+ ripping proggy) in cdparanoia mode. Basically it's like cdparanoia + gui. > Also, `vmstat 1` output while the system is unresponsive would be good too. I'll post that when I'll do it again with a scratched CD. For now I need my machine to be responsive. Cheers, Xav 00:00.0 Host bridge: VIA Technologies, Inc. VT82C693A/694x [Apollo PRO133x] (rev c4) Subsystem: ABIT Computer Corp.: Unknown device a204 Flags: bus master, medium devsel, latency 8 Memory at d0000000 (32-bit, prefetchable) [size=64M] Capabilities: [a0] AGP version 2.0 Capabilities: [c0] Power Management version 2 00:01.0 PCI bridge: VIA Technologies, Inc. VT82C598/694x [Apollo MVP3/Pro133x AGP] (prog-if 00 [Normal decode]) Flags: bus master, 66Mhz, medium devsel, latency 0 Bus: primary=00, secondary=01, subordinate=01, sec-latency=0 I/O behind bridge: 00009000-00009fff Memory behind bridge: dd000000-deffffff Prefetchable memory behind bridge: d4000000-d7ffffff Capabilities: [80] Power Management version 2 00:07.0 ISA bridge: VIA Technologies, Inc. VT82C686 [Apollo Super South] (rev 40) Subsystem: ABIT Computer Corp.: Unknown device 0000 Flags: bus master, stepping, medium devsel, latency 0 Capabilities: [c0] Power Management version 2 00:07.1 IDE interface: VIA Technologies, Inc. Bus Master IDE (rev 06) (prog-if 8a [Master SecP PriP]) Subsystem: VIA Technologies, Inc. Bus Master IDE Flags: bus master, medium devsel, latency 32 I/O ports at a000 [size=16] Capabilities: [c0] Power Management version 2 00:07.2 USB Controller: VIA Technologies, Inc. UHCI USB (rev 16) (prog-if 00 [UHCI]) Subsystem: Unknown device 0925:1234 Flags: bus master, medium devsel, latency 32, IRQ 11 I/O ports at a400 [size=32] Capabilities: [80] Power Management version 2 00:07.3 USB Controller: VIA Technologies, Inc. UHCI USB (rev 16) (prog-if 00 [UHCI]) Subsystem: Unknown device 0925:1234 Flags: bus master, medium devsel, latency 32, IRQ 11 I/O ports at a800 [size=32] Capabilities: [80] Power Management version 2 00:07.4 Bridge: VIA Technologies, Inc. VT82C686 [Apollo Super ACPI] (rev 40) Flags: medium devsel, IRQ 11 Capabilities: [68] Power Management version 2 00:09.0 FireWire (IEEE 1394): Texas Instruments: Unknown device 8020 (prog-if 10 [OHCI]) Subsystem: Ads Technologies Inc: Unknown device 0000 Flags: bus master, medium devsel, latency 32, IRQ 9 Memory at df004000 (32-bit, non-prefetchable) [size=2K] Memory at df000000 (32-bit, non-prefetchable) [size=16K] Capabilities: [44] Power Management version 1 00:0a.0 Ethernet controller: Realtek Semiconductor Co., Ltd. RTL-8139 (rev 10) Subsystem: Realtek Semiconductor Co., Ltd. RT8139 Flags: bus master, medium devsel, latency 32, IRQ 10 I/O ports at ac00 [size=256] Memory at df005000 (32-bit, non-prefetchable) [size=256] Expansion ROM at <unassigned> [disabled] [size=64K] 00:0b.0 VGA compatible controller: S3 Inc. ViRGE/DX or /GX (rev 01) (prog-if 00 [VGA]) Subsystem: S3 Inc. ViRGE/DX Flags: medium devsel, IRQ 10 Memory at d8000000 (32-bit, non-prefetchable) [size=64M] Expansion ROM at <unassigned> [disabled] [size=64K] 00:0c.0 Multimedia audio controller: Ensoniq ES1371 [AudioPCI-97] (rev 09) Subsystem: Ensoniq Creative Sound Blaster AudioPCI64V, AudioPCI128 Flags: bus master, slow devsel, latency 32, IRQ 11 I/O ports at b000 [size=64] Capabilities: [dc] Power Management version 2 00:0e.0 Unknown mass storage controller: Triones Technologies, Inc. HPT366 / HPT370 (rev 03) Subsystem: Triones Technologies, Inc.: Unknown device 0001 Flags: bus master, 66Mhz, medium devsel, latency 120, IRQ 11 I/O ports at b400 [size=8] I/O ports at b800 [size=4] I/O ports at bc00 [size=8] I/O ports at c000 [size=4] I/O ports at c400 [size=256] Expansion ROM at <unassigned> [disabled] [size=128K] Capabilities: [60] Power Management version 2 01:00.0 VGA compatible controller: ATI Technologies Inc Rage 128 PF (prog-if 00 [VGA]) Subsystem: ATI Technologies Inc: Unknown device 0028 Flags: bus master, stepping, 66Mhz, medium devsel, latency 32, IRQ 9 Memory at d4000000 (32-bit, prefetchable) [size=64M] I/O ports at 9000 [size=256] Memory at de000000 (32-bit, non-prefetchable) [size=16K] Expansion ROM at <unassigned> [disabled] [size=128K] Capabilities: [50] AGP version 2.0 Capabilities: [5c] Power Management version 2 Uniform Multi-Platform E-IDE driver Revision: 6.31 ide: Assuming 33MHz system bus speed for PIO modes; override with idebus=xx VP_IDE: IDE controller on PCI bus 00 dev 39 VP_IDE: chipset revision 6 VP_IDE: not 100% native mode: will probe irqs later ide: Assuming 33MHz system bus speed for PIO modes; override with idebus=xx VP_IDE: VIA vt82c686b (rev 40) IDE UDMA100 controller on pci00:07.1 ide0: BM-DMA at 0xa000-0xa007, BIOS settings: hda:DMA, hdb:pio ide1: BM-DMA at 0xa008-0xa00f, BIOS settings: hdc:DMA, hdd:DMA HPT370: IDE controller on PCI bus 00 dev 70 HPT370: chipset revision 3 HPT370: not 100% native mode: will probe irqs later HPT370: using 33MHz PCI clock ide2: BM-DMA at 0xc400-0xc407, BIOS settings: hde:DMA, hdf:pio ide3: BM-DMA at 0xc408-0xc40f, BIOS settings: hdg:pio, hdh:pio hdb: C/H/S=0/0/0 from BIOS ignored hda: WDC WD400BB-32BSA0, ATA DISK drive hdb: IOMEGA ZIP 100 ATAPI, ATAPI FLOPPY drive hdc: HITACHI DVD-ROM GD-7500, ATAPI CD/DVD-ROM drive hdd: LG CD-RW CED-8080B, ATAPI CD/DVD-ROM drive hde: ST340824A, ATA DISK drive ide0 at 0x1f0-0x1f7,0x3f6 on irq 14 ide1 at 0x170-0x177,0x376 on irq 15 ide2 at 0xb400-0xb407,0xb802 on irq 11 hda: 78165360 sectors (40021 MB) w/2048KiB Cache, CHS=38166/64/32, UDMA(100) hde: 78165360 sectors (40021 MB) w/2048KiB Cache, CHS=77545/16/63, UDMA(100) Partition check: hda: [PTBL] [5169/240/63] hda1 < hda5 hda6 hda7 > hdb:<3>ide-scsi: hdb: unsupported command in request queue (0) end_request: I/O error, dev 03:40 (hdb), sector 0 ide-scsi: hdb: unsupported command in request queue (0) end_request: I/O error, dev 03:40 (hdb), sector 2 ide-scsi: hdb: unsupported command in request queue (0) end_request: I/O error, dev 03:40 (hdb), sector 4 ide-scsi: hdb: unsupported command in request queue (0) end_request: I/O error, dev 03:40 (hdb), sector 6 ide-scsi: hdb: unsupported command in request queue (0) end_request: I/O error, dev 03:40 (hdb), sector 0 ide-scsi: hdb: unsupported command in request queue (0) end_request: I/O error, dev 03:40 (hdb), sector 2 ide-scsi: hdb: unsupported command in request queue (0) end_request: I/O error, dev 03:40 (hdb), sector 4 ide-scsi: hdb: unsupported command in request queue (0) end_request: I/O error, dev 03:40 (hdb), sector 6 unable to read partition table hde: hde1 hde2 [ That's Ok, it's the Zip drive ] SCSI subsystem driver Revision: 1.00 scsi0 : SCSI host adapter emulation for IDE ATAPI devices Vendor: IOMEGA Model: ZIP 100 Rev: 23.D Type: Direct-Access ANSI SCSI revision: 00 Vendor: HITACHI Model: DVD-ROM GD-7500 Rev: 0005 Type: CD-ROM ANSI SCSI revision: 02 Vendor: LG Model: CD-RW CED-8080B Rev: 1.06 Type: CD-ROM ANSI SCSI revision: 02 Attached scsi removable disk sda at scsi0, channel 0, id 0, lun 0 sda : READ CAPACITY failed. sda : status = 0, message = 00, host = 0, driver = 28 Current sd00:00: sense key Not Ready Additional sense indicates Medium not present sda : block size assumed to be 512 bytes, disk size 1GB. sda: I/O error: dev 08:00, sector 0 I/O error: dev 08:00, sector 0 unable to read partition table Attached scsi CD-ROM sr0 at scsi0, channel 0, id 1, lun 0 Attached scsi CD-ROM sr1 at scsi0, channel 0, id 2, lun 0 sr0: scsi3-mmc drive: 14x/40x cd/rw xa/form2 cdda tray Uniform CD-ROM driver Revision: 3.12 sr1: scsi3-mmc drive: 32x/32x writer cd/rw xa/form2 cdda tray ^ permalink raw reply [flat|nested] 37+ messages in thread
* Re: A CD with errors (scratches etc.) blocks the whole system while reading damadged files 2002-05-02 8:26 ` Stephen Samuel 2002-05-02 8:49 ` Xavier Bestel @ 2002-05-02 15:40 ` Mike Fedyk 2002-05-02 17:15 ` Stephen Samuel 1 sibling, 1 reply; 37+ messages in thread From: Mike Fedyk @ 2002-05-02 15:40 UTC (permalink / raw) To: Stephen Samuel; +Cc: Bill Davidsen, Andre Hedrick, linux-kernel On Thu, May 02, 2002 at 01:26:46AM -0700, Stephen Samuel wrote: > I ran a similar type of test on a 2.4.9.31 (redhat 7.1 ) kernel. > With the CD on HDD, I could read off of HDA just peachy while > the system was choking on a scratched (aol) cd. > > I did a WC of a 300MB file (only 256MB of ram on the system, > so that's guaranteed to not fit in any cache). > > Times to read the file were statistically equivalent whether > the system was choking on the CD or not. Great, can you post the lspci output for your IDE chipset(s)? ^ permalink raw reply [flat|nested] 37+ messages in thread
* Re: A CD with errors (scratches etc.) blocks the whole system while reading damadged files 2002-05-02 15:40 ` Mike Fedyk @ 2002-05-02 17:15 ` Stephen Samuel 0 siblings, 0 replies; 37+ messages in thread From: Stephen Samuel @ 2002-05-02 17:15 UTC (permalink / raw) To: Mike Fedyk; +Cc: Bill Davidsen, Andre Hedrick, linux-kernel 00:00.0 Host bridge: Intel Corporation 440BX/ZX - 82443BX/ZX Host bridge (rev 03) 00:01.0 PCI bridge: Intel Corporation 440BX/ZX - 82443BX/ZX AGP bridge (rev 03) 00:04.0 ISA bridge: Intel Corporation 82371AB PIIX4 ISA (rev 02) 00:04.1 IDE interface: Intel Corporation 82371AB PIIX4 IDE (rev 01) 00:04.2 USB Controller: Intel Corporation 82371AB PIIX4 USB (rev 01) 00:04.3 Bridge: Intel Corporation 82371AB PIIX4 ACPI (rev 02) Mike Fedyk wrote: > On Thu, May 02, 2002 at 01:26:46AM -0700, Stephen Samuel wrote: > >>I ran a similar type of test on a 2.4.9.31 (redhat 7.1 ) kernel. >>With the CD on HDD, I could read off of HDA just peachy while >>the system was choking on a scratched (aol) cd. >> >>I did a WC of a 300MB file (only 256MB of ram on the system, >>so that's guaranteed to not fit in any cache). >> >>Times to read the file were statistically equivalent whether >>the system was choking on the CD or not. > > > Great, can you post the lspci output for your IDE chipset(s)? -- Stephen Samuel +1(604)876-0426 samuel@bcgreen.com http://www.bcgreen.com/~samuel/ Powerful committed communication, reaching through fear, uncertainty and doubt to touch the jewel within each person and bring it to life. ^ permalink raw reply [flat|nested] 37+ messages in thread
* Re: A CD with errors (scratches etc.) blocks the whole system while reading damadged files 2002-04-26 4:04 ` Mike Fedyk 2002-04-26 5:42 ` Erik Andersen 2002-04-26 7:35 ` Andre Hedrick @ 2002-04-26 15:23 ` Stephen Samuel 2 siblings, 0 replies; 37+ messages in thread From: Stephen Samuel @ 2002-04-26 15:23 UTC (permalink / raw) To: Mike Fedyk; +Cc: Bill Davidsen, linux-kernel, Andre Hedrick > I don't think it has to do with the IRQs, but it sounds like the entire ide > chipset (think two cables one one chipset...) has stopped responding when > ONE device (out of a possible four (with two cables)) has failed media. In my case, I was able to read data from hda while the cdrom on hdd was trying to recover data from a scratched disk. Reading data from hdc (shared cable with the CDROM), on the other hand, was VERY slow. I have an 82371AB PIIX4 ide interface (dual IDE chipset), It's part of a 440BX chipset board. If accessing on one IDE interface (C/D) causes the other (A/B) to lockup, I'd guess you've got especially cheap hardware on your box. -- Stephen Samuel +1(604)876-0426 samuel@bcgreen.com http://www.bcgreen.com/~samuel/ Powerful committed communication, reaching through fear, uncertainty and doubt to touch the jewel within each person and bring it to life. ^ permalink raw reply [flat|nested] 37+ messages in thread
* Re: A CD with errors (scratches etc.) blocks the whole system while reading damadged files 2002-04-24 19:16 ` Bill Davidsen 2002-04-24 19:52 ` Richard B. Johnson 2002-04-24 22:58 ` Stephen Samuel @ 2002-04-26 10:16 ` Zwane Mwaikambo 2 siblings, 0 replies; 37+ messages in thread From: Zwane Mwaikambo @ 2002-04-26 10:16 UTC (permalink / raw) To: Bill Davidsen; +Cc: Richard B. Johnson, Linux Kernel Mailing List On Wed, 24 Apr 2002, Bill Davidsen wrote: > > Put your CDs on a different controller and you can do anything you > > want without affecting other tasks. > > As above, another type of bus is not cost effective, another IDE cable > doesn't solve the problem, no matter what theory says. Hmm, i have my cdrw on a different IDE controller in an all IDE system and never experience "hangs" even for completely borked cds. The disks are on the onboard IDE controller. This is also true for when burning CDs, i can thrash the harddisks with no noticeable slowdown. Zwane -- http://function.linuxpower.ca ^ permalink raw reply [flat|nested] 37+ messages in thread
* Re: A CD with errors (scratches etc.) blocks the whole system while reading damadged files 2002-04-18 13:13 Dr. Death 2002-04-19 14:14 ` Richard B. Johnson @ 2002-04-19 20:01 ` Erik Andersen 2002-04-21 15:18 ` Denis Vlasenko ` (2 more replies) 1 sibling, 3 replies; 37+ messages in thread From: Erik Andersen @ 2002-04-19 20:01 UTC (permalink / raw) To: Dr. Death; +Cc: linux-kernel On Thu Apr 18, 2002 at 03:13:35PM +0200, Dr. Death wrote: > Problem: > > I use SuSE Linux 7.2 and when I create md5sums from damaged files on a > CD, the WHOLE system freezes or is ugly slow untill md5 has passed the > damaged part of the file ! This should help somewhat. Currently, ide-cd.c retries ERROR_MAX (8) times when it sees an error. But ide.c is also retrying ERROR_MAX times when _it_ sees an error, and does a bus reset after evey 4 failures. So for each bad sector, you get 64 retries (with typical timouts of 7 seconds each) plus 16 bus resets per bad sector. The funny thing is though, we knew after the first read that we had an uncorrectable medium error. Try this patch vs 2.4.19-pre7 --- linux/drivers/ide/ide-cd.c.orig Tue Apr 9 06:59:56 2002 +++ linux/drivers/ide/ide-cd.c Tue Apr 9 07:04:59 2002 @@ -657,6 +657,11 @@ request or data protect error.*/ ide_dump_status (drive, "command error", stat); cdrom_end_request (0, drive); + } else if (sense_key == MEDIUM_ERROR) { + /* No point in re-trying a zillion times on a bad + * sector... If we got here the error is not correctable */ + ide_dump_status (drive, "media error (bad sector)", stat); + cdrom_end_request (0, drive); } else if ((err & ~ABRT_ERR) != 0) { /* Go to the default handler for other errors. */ -Erik -- Erik B. Andersen http://codepoet-consulting.com/ --This message was written using 73% post-consumer electrons-- ^ permalink raw reply [flat|nested] 37+ messages in thread
* Re: A CD with errors (scratches etc.) blocks the whole system while reading damadged files 2002-04-19 20:01 ` Erik Andersen @ 2002-04-21 15:18 ` Denis Vlasenko 2002-04-21 22:54 ` Hans-Peter Jansen 2002-04-22 14:49 ` Roger Larsson 2 siblings, 0 replies; 37+ messages in thread From: Denis Vlasenko @ 2002-04-21 15:18 UTC (permalink / raw) To: andersen, Dr. Death; +Cc: linux-kernel On 19 April 2002 18:01, Erik Andersen wrote: > On Thu Apr 18, 2002 at 03:13:35PM +0200, Dr. Death wrote: > > Problem: > > > > I use SuSE Linux 7.2 and when I create md5sums from damaged files on a > > CD, the WHOLE system freezes or is ugly slow untill md5 has passed the > > damaged part of the file ! > > This should help somewhat. Currently, ide-cd.c retries ERROR_MAX > (8) times when it sees an error. But ide.c is also retrying > ERROR_MAX times when _it_ sees an error, and does a bus reset > after evey 4 failures. So for each bad sector, you get 64 > retries (with typical timouts of 7 seconds each) plus 16 bus > resets per bad sector. And nobody knows how many tries is in hardware... so we get 8x8x?? retries, and *this* is slow. -- vda ^ permalink raw reply [flat|nested] 37+ messages in thread
* Re: A CD with errors (scratches etc.) blocks the whole system while reading damadged files 2002-04-19 20:01 ` Erik Andersen 2002-04-21 15:18 ` Denis Vlasenko @ 2002-04-21 22:54 ` Hans-Peter Jansen 2002-04-22 5:38 ` Andre Hedrick 2002-04-22 14:49 ` Roger Larsson 2 siblings, 1 reply; 37+ messages in thread From: Hans-Peter Jansen @ 2002-04-21 22:54 UTC (permalink / raw) To: andersen; +Cc: drd, linux-kernel On Fri, 19 Apr 2002 14:01:13 -0600 "Erik Andersen" <andersen@codepoet.org> wrote: > This should help somewhat. Currently, ide-cd.c retries ERROR_MAX > (8) times when it sees an error. But ide.c is also retrying > ERROR_MAX times when _it_ sees an error, and does a bus reset > after evey 4 failures. So for each bad sector, you get 64 > retries (with typical timouts of 7 seconds each) plus 16 bus > resets per bad sector. Thanks for investigation. BTW: Does this cover the ide-scsi case, too? > The funny thing is though, we knew after the first read that we > had an uncorrectable medium error. Try this patch vs 2.4.19-pre7 > > --- linux/drivers/ide/ide-cd.c.orig Tue Apr 9 06:59:56 2002 > +++ linux/drivers/ide/ide-cd.c Tue Apr 9 07:04:59 2002 > @@ -657,6 +657,11 @@ > request or data protect error.*/ > ide_dump_status (drive, "command error", stat); > cdrom_end_request (0, drive); > + } else if (sense_key == MEDIUM_ERROR) { > + /* No point in re-trying a zillion times on a bad > + * sector... If we got here the error is not correctable */ > + ide_dump_status (drive, "media error (bad sector)", stat); .. and some curious will want to know which sector has thrown the error [which would save me to patch this some day myself...] > + cdrom_end_request (0, drive); > } else if ((err & ~ABRT_ERR) != 0) { > /* Go to the default handler > for other errors. */ > -Erik > > -- > Erik B. Andersen http://codepoet-consulting.com/ > --This message was written using 73% post-consumer electrons-- Cheers, Hans-Peter ^ permalink raw reply [flat|nested] 37+ messages in thread
* Re: A CD with errors (scratches etc.) blocks the whole system while reading damadged files 2002-04-21 22:54 ` Hans-Peter Jansen @ 2002-04-22 5:38 ` Andre Hedrick 0 siblings, 0 replies; 37+ messages in thread From: Andre Hedrick @ 2002-04-22 5:38 UTC (permalink / raw) To: Hans-Peter Jansen; +Cc: andersen, drd, linux-kernel You have addressed the core problem I have been working on quietly. The export of the sense data and end request per subdriver is required to make the personalities proper. This is messy for two of the four current subdrivers, and teh fifth new one will be clean from the start. Cheers, Andre Hedrick LAD Storage Consulting Group On Mon, 22 Apr 2002, Hans-Peter Jansen wrote: > On Fri, 19 Apr 2002 14:01:13 -0600 > "Erik Andersen" <andersen@codepoet.org> wrote: > > > This should help somewhat. Currently, ide-cd.c retries ERROR_MAX > > (8) times when it sees an error. But ide.c is also retrying > > ERROR_MAX times when _it_ sees an error, and does a bus reset > > after evey 4 failures. So for each bad sector, you get 64 > > retries (with typical timouts of 7 seconds each) plus 16 bus > > resets per bad sector. > > Thanks for investigation. BTW: Does this cover the ide-scsi case, too? > > > The funny thing is though, we knew after the first read that we > > had an uncorrectable medium error. Try this patch vs 2.4.19-pre7 > > > > --- linux/drivers/ide/ide-cd.c.orig Tue Apr 9 06:59:56 2002 > > +++ linux/drivers/ide/ide-cd.c Tue Apr 9 07:04:59 2002 > > @@ -657,6 +657,11 @@ > > request or data protect error.*/ > > ide_dump_status (drive, "command error", stat); > > cdrom_end_request (0, drive); > > + } else if (sense_key == MEDIUM_ERROR) { > > + /* No point in re-trying a zillion times on a bad > > + * sector... If we got here the error is not correctable */ > > + ide_dump_status (drive, "media error (bad sector)", stat); > > .. and some curious will want to know which sector has thrown the error > [which would save me to patch this some day myself...] > > > + cdrom_end_request (0, drive); > > } else if ((err & ~ABRT_ERR) != 0) { > > /* Go to the default handler > > for other errors. */ > > -Erik > > > > -- > > Erik B. Andersen http://codepoet-consulting.com/ > > --This message was written using 73% post-consumer electrons-- > > Cheers, > Hans-Peter > - > To unsubscribe from this list: send the line "unsubscribe linux-kernel" in > the body of a message to majordomo@vger.kernel.org > More majordomo info at http://vger.kernel.org/majordomo-info.html > Please read the FAQ at http://www.tux.org/lkml/ > ^ permalink raw reply [flat|nested] 37+ messages in thread
* Re: A CD with errors (scratches etc.) blocks the whole system while reading damadged files 2002-04-19 20:01 ` Erik Andersen 2002-04-21 15:18 ` Denis Vlasenko 2002-04-21 22:54 ` Hans-Peter Jansen @ 2002-04-22 14:49 ` Roger Larsson 2 siblings, 0 replies; 37+ messages in thread From: Roger Larsson @ 2002-04-22 14:49 UTC (permalink / raw) To: andersen, Dr. Death; +Cc: linux-kernel Hi, Byt why does it look up the whole system during these retries? Does it busy wait? (on timer or HW status) In that case the preemtive kernel could help too... (assuming that it does not busy wait under a lock - but it is not unlikely...) /RogerL On Friday 19 April 2002 22.01, Erik Andersen wrote: > On Thu Apr 18, 2002 at 03:13:35PM +0200, Dr. Death wrote: > > Problem: > > > > I use SuSE Linux 7.2 and when I create md5sums from damaged files on a > > CD, the WHOLE system freezes or is ugly slow untill md5 has passed the > > damaged part of the file ! > > This should help somewhat. Currently, ide-cd.c retries ERROR_MAX > (8) times when it sees an error. But ide.c is also retrying > ERROR_MAX times when _it_ sees an error, and does a bus reset > after evey 4 failures. So for each bad sector, you get 64 > retries (with typical timouts of 7 seconds each) plus 16 bus > resets per bad sector. > > The funny thing is though, we knew after the first read that we > had an uncorrectable medium error. Try this patch vs 2.4.19-pre7 > > --- linux/drivers/ide/ide-cd.c.orig Tue Apr 9 06:59:56 2002 > +++ linux/drivers/ide/ide-cd.c Tue Apr 9 07:04:59 2002 > @@ -657,6 +657,11 @@ > request or data protect error.*/ > ide_dump_status (drive, "command error", stat); > cdrom_end_request (0, drive); > + } else if (sense_key == MEDIUM_ERROR) { > + /* No point in re-trying a zillion times on a bad > + * sector... If we got here the error is not correctable */ > + ide_dump_status (drive, "media error (bad sector)", stat); > + cdrom_end_request (0, drive); > } else if ((err & ~ABRT_ERR) != 0) { > /* Go to the default handler > for other errors. */ > -Erik > > -- > Erik B. Andersen http://codepoet-consulting.com/ > --This message was written using 73% post-consumer electrons-- > - > To unsubscribe from this list: send the line "unsubscribe linux-kernel" in > the body of a message to majordomo@vger.kernel.org > More majordomo info at http://vger.kernel.org/majordomo-info.html > Please read the FAQ at http://www.tux.org/lkml/ > > -- Roger Larsson Skellefteå Sweden ^ permalink raw reply [flat|nested] 37+ messages in thread
end of thread, other threads:[~2002-05-03 7:18 UTC | newest] Thread overview: 37+ messages (download: mbox.gz follow: Atom feed -- links below jump to the message on this page -- 2002-04-30 21:18 A CD with errors (scratches etc.) blocks the whole system while reading damadged files Eric M -- strict thread matches above, loose matches on Subject: below -- 2002-04-18 13:13 Dr. Death 2002-04-19 14:14 ` Richard B. Johnson 2002-04-19 14:22 ` Kent Borg 2002-04-19 14:46 ` Richard B. Johnson 2002-04-19 14:28 ` Darrell Wright 2002-04-19 14:36 ` dr john halewood 2002-04-19 18:00 ` Oliver Xymoron 2002-04-23 22:34 ` Bill Davidsen 2002-04-24 12:31 ` Richard B. Johnson 2002-04-24 19:16 ` Bill Davidsen 2002-04-24 19:52 ` Richard B. Johnson 2002-04-24 22:58 ` Stephen Samuel 2002-04-25 3:33 ` Bill Davidsen 2002-04-26 4:04 ` Mike Fedyk 2002-04-26 5:42 ` Erik Andersen 2002-04-26 7:35 ` Andre Hedrick 2002-04-25 20:50 ` Pavel Machek 2002-04-28 2:18 ` Andre Hedrick 2002-04-26 15:48 ` Mike Fedyk 2002-04-29 21:46 ` Bill Davidsen 2002-05-02 3:45 ` Mike Fedyk 2002-05-02 8:26 ` Stephen Samuel 2002-05-02 8:49 ` Xavier Bestel 2002-05-02 8:43 ` Zwane Mwaikambo 2002-05-02 9:12 ` Xavier Bestel 2002-05-02 15:48 ` Mike Fedyk 2002-05-03 7:14 ` Xavier Bestel 2002-05-02 15:40 ` Mike Fedyk 2002-05-02 17:15 ` Stephen Samuel 2002-04-26 15:23 ` Stephen Samuel 2002-04-26 10:16 ` Zwane Mwaikambo 2002-04-19 20:01 ` Erik Andersen 2002-04-21 15:18 ` Denis Vlasenko 2002-04-21 22:54 ` Hans-Peter Jansen 2002-04-22 5:38 ` Andre Hedrick 2002-04-22 14:49 ` Roger Larsson
This is a public inbox, see mirroring instructions for how to clone and mirror all data and code used for this inbox