* Re: SIL_QUIRK_MOD15WRITE workaround problem on 2.6.14
[not found] <438BD351.60902@ucla.edu>
@ 2005-11-30 4:16 ` Tejun Heo
2005-11-30 4:42 ` Jeff Garzik
2005-11-30 4:48 ` Ethan Chen
0 siblings, 2 replies; 7+ messages in thread
From: Tejun Heo @ 2005-11-30 4:16 UTC (permalink / raw)
To: Ethan Chen; +Cc: linux-kernel, Jeff Garzik, Carlos Pardo, Linux-ide
[CC'ing Jeff, Carlos & linux-ide]
Ethan Chen wrote:
> I've got a dual Opteron 242 machine here with 2x Seagate ST3200822AS
> SATA drives attached to a Silicon Image SI3114 controller, and after
> upgrading to 2.6.14 from 2.6.13, it seems the SIL_QUIRK_MOD15WRITE
> workaround for the sata_sil driver isn't being applied anymore. This
> caused me trouble in the past before my drive was added to the
> blacklist, and this message that comes up when writing (~4GBfiles to
> test) files, right before the computer locks up, is the same as before:
> kernel: ata1: command 0x35 timeout, stat 0xd8 host_stat 0x61
> In the dmesg, the 'Applying Seagate errata fix' message doesn't appear
> anymore as well.
> Finally, without the fix, write speeds are much higher as well, before
> it locks up.
Hello, Ethan.
Sometime ago, Silicon Image has confirmed that 3114's and 3512's are not
affected by the m15w problem - only 3112's are affected. So, a patch
has made into the tree before 2.6.14 to apply the m15w quirk selectively.
Can you post 'lspci -nv' result?
--
tejun
^ permalink raw reply [flat|nested] 7+ messages in thread
* Re: SIL_QUIRK_MOD15WRITE workaround problem on 2.6.14
2005-11-30 4:16 ` SIL_QUIRK_MOD15WRITE workaround problem on 2.6.14 Tejun Heo
@ 2005-11-30 4:42 ` Jeff Garzik
2005-11-30 5:37 ` Tejun Heo
2005-11-30 4:48 ` Ethan Chen
1 sibling, 1 reply; 7+ messages in thread
From: Jeff Garzik @ 2005-11-30 4:42 UTC (permalink / raw)
To: Tejun Heo; +Cc: Ethan Chen, linux-kernel, Carlos Pardo, Linux-ide
Tejun Heo wrote:
> [CC'ing Jeff, Carlos & linux-ide]
>
> Ethan Chen wrote:
>
>> I've got a dual Opteron 242 machine here with 2x Seagate ST3200822AS
>> SATA drives attached to a Silicon Image SI3114 controller, and after
>> upgrading to 2.6.14 from 2.6.13, it seems the SIL_QUIRK_MOD15WRITE
>> workaround for the sata_sil driver isn't being applied anymore. This
>> caused me trouble in the past before my drive was added to the
>> blacklist, and this message that comes up when writing (~4GBfiles to
>> test) files, right before the computer locks up, is the same as before:
>> kernel: ata1: command 0x35 timeout, stat 0xd8 host_stat 0x61
>> In the dmesg, the 'Applying Seagate errata fix' message doesn't appear
>> anymore as well.
>> Finally, without the fix, write speeds are much higher as well, before
>> it locks up.
>
>
> Hello, Ethan.
>
> Sometime ago, Silicon Image has confirmed that 3114's and 3512's are not
> affected by the m15w problem - only 3112's are affected. So, a patch
> has made into the tree before 2.6.14 to apply the m15w quirk selectively.
Most likely, mod15write quirk was just hiding an unrelated problem.
mod15write often hid BIOS problems in the past which led to corruption.
Until sata_sil properly handles SATA phy / DMA errors by resetting the
controller and retrying the command, we won't know if its a driver
problem or not.
Jeff
^ permalink raw reply [flat|nested] 7+ messages in thread
* Re: SIL_QUIRK_MOD15WRITE workaround problem on 2.6.14
2005-11-30 4:16 ` SIL_QUIRK_MOD15WRITE workaround problem on 2.6.14 Tejun Heo
2005-11-30 4:42 ` Jeff Garzik
@ 2005-11-30 4:48 ` Ethan Chen
1 sibling, 0 replies; 7+ messages in thread
From: Ethan Chen @ 2005-11-30 4:48 UTC (permalink / raw)
To: Tejun Heo; +Cc: linux-kernel, thanatoz, Carlos.Pardo, jgarzik, linux-ide
Tejun,
The motherboard is an Iwill DK8X utilizing the AMD 8151+8131+8111 chips.
lspci -nv
00:00.0 Class 0600: 1022:7454 (rev 13)
Subsystem: 1022:7454
Flags: bus master, medium devsel, latency 0
Memory at f8000000 (32-bit, prefetchable) [size=64M]
Capabilities: [a0] AGP version 3.0
Capabilities: [c0] HyperTransport: Slave or Primary Interface
00:01.0 Class 0604: 1022:7455 (rev 13)
Flags: bus master, 66Mhz, medium devsel, latency 32
Bus: primary=00, secondary=01, subordinate=01, sec-latency=32
Memory behind bridge: fc700000-fe7fffff
Prefetchable memory behind bridge: e4600000-f45fffff
00:06.0 Class 0604: 1022:7460 (rev 07)
Flags: bus master, 66Mhz, medium devsel, latency 32
Bus: primary=00, secondary=02, subordinate=02, sec-latency=32
I/O behind bridge: 0000a000-0000afff
Memory behind bridge: fe800000-fe8fffff
Capabilities: [c0] HyperTransport: Slave or Primary Interface
Capabilities: [f0] HyperTransport: Interrupt Discovery and Configuration
00:07.0 Class 0601: 1022:7468 (rev 05)
Subsystem: 1022:7468
Flags: bus master, 66Mhz, medium devsel, latency 0
00:07.1 Class 0101: 1022:7469 (rev 03) (prog-if 8a [Master SecP PriP])
Subsystem: 1022:7469
Flags: bus master, medium devsel, latency 32
I/O ports at ffa0 [size=16]
00:07.2 Class 0c05: 1022:746a (rev 02)
Subsystem: 1022:746a
Flags: medium devsel, IRQ 9
I/O ports at cc00 [size=32]
00:07.3 Class 0680: 1022:746b (rev 05)
Subsystem: 1022:746b
Flags: medium devsel
00:0a.0 Class 0604: 1022:7450 (rev 12)
Flags: bus master, 66Mhz, medium devsel, latency 32
Bus: primary=00, secondary=03, subordinate=03, sec-latency=32
I/O behind bridge: 0000b000-0000bfff
Memory behind bridge: fe900000-feafffff
Capabilities: [a0] PCI-X bridge device.
Capabilities: [b8] HyperTransport: Interrupt Discovery and Configuration
Capabilities: [c0] HyperTransport: Slave or Primary Interface
00:0a.1 Class 0800: 1022:7451 (rev 01) (prog-if 10)
Subsystem: 1022:7451
Flags: bus master, medium devsel, latency 0
Memory at febff000 (64-bit, non-prefetchable) [size=4K]
00:0b.0 Class 0604: 1022:7450 (rev 12)
Flags: bus master, 66Mhz, medium devsel, latency 32
Bus: primary=00, secondary=04, subordinate=04, sec-latency=32
Capabilities: [a0] PCI-X bridge device.
Capabilities: [b8] HyperTransport: Interrupt Discovery and Configuration
00:0b.1 Class 0800: 1022:7451 (rev 01) (prog-if 10)
Subsystem: 1022:7451
Flags: bus master, medium devsel, latency 0
Memory at febfe000 (64-bit, non-prefetchable) [size=4K]
00:18.0 Class 0600: 1022:1100
Flags: fast devsel
Capabilities: [80] HyperTransport: Host or Secondary Interface
Capabilities: [a0] HyperTransport: Host or Secondary Interface
Capabilities: [c0] HyperTransport: Host or Secondary Interface
00:18.1 Class 0600: 1022:1101
Flags: fast devsel
00:18.2 Class 0600: 1022:1102
Flags: fast devsel
00:18.3 Class 0600: 1022:1103
Flags: fast devsel
00:19.0 Class 0600: 1022:1100
Flags: fast devsel
Capabilities: [80] HyperTransport: Host or Secondary Interface
Capabilities: [a0] HyperTransport: Host or Secondary Interface
Capabilities: [c0] HyperTransport: Host or Secondary Interface
00:19.1 Class 0600: 1022:1101
Flags: fast devsel
00:19.2 Class 0600: 1022:1102
Flags: fast devsel
00:19.3 Class 0600: 1022:1103
Flags: fast devsel
01:00.0 Class 0300: 10de:0281 (rev a1)
Flags: bus master, 66Mhz, medium devsel, latency 248, IRQ 18
Memory at fd000000 (32-bit, non-prefetchable) [size=16M]
Memory at e8000000 (32-bit, prefetchable) [size=128M]
Expansion ROM at fe7e0000 [disabled] [size=128K]
Capabilities: [60] Power Management version 2
Capabilities: [44] AGP version 3.0
02:00.0 Class 0c03: 1022:7464 (rev 0b) (prog-if 10)
Subsystem: 1022:7464
Flags: bus master, medium devsel, latency 32, IRQ 17
Memory at fe8fe000 (32-bit, non-prefetchable) [size=4K]
02:00.1 Class 0c03: 1022:7464 (rev 0b) (prog-if 10)
Subsystem: 1022:7464
Flags: bus master, medium devsel, latency 32, IRQ 17
Memory at fe8ff000 (32-bit, non-prefetchable) [size=4K]
02:04.0 Class 0401: 1102:0002 (rev 07)
Subsystem: 1102:8064
Flags: bus master, medium devsel, latency 32, IRQ 18
I/O ports at a880 [size=32]
Capabilities: [dc] Power Management version 1
02:04.1 Class 0980: 1102:7002 (rev 07)
Subsystem: 1102:0020
Flags: bus master, medium devsel, latency 32
I/O ports at ac00 [size=8]
Capabilities: [dc] Power Management version 1
02:06.0 Class 0c00: 104c:8023 (prog-if 10)
Subsystem: 104c:8023
Flags: bus master, medium devsel, latency 32, IRQ 19
Memory at fe8fd800 (32-bit, non-prefetchable) [size=2K]
Memory at fe8f8000 (32-bit, non-prefetchable) [size=16K]
Capabilities: [44] Power Management version 2
02:07.0 Class 0c03: 1033:0035 (rev 43) (prog-if 10)
Subsystem: 1033:0035
Flags: bus master, medium devsel, latency 32, IRQ 17
Memory at fe8f7000 (32-bit, non-prefetchable) [size=4K]
Capabilities: [40] Power Management version 2
02:07.1 Class 0c03: 1033:0035 (rev 43) (prog-if 10)
Subsystem: 1033:0035
Flags: bus master, medium devsel, latency 32, IRQ 18
Memory at fe8fc000 (32-bit, non-prefetchable) [size=4K]
Capabilities: [40] Power Management version 2
02:07.2 Class 0c03: 1033:00e0 (rev 04) (prog-if 20)
Subsystem: 1033:00e0
Flags: bus master, medium devsel, latency 32, IRQ 16
Memory at fe8fd400 (32-bit, non-prefetchable) [size=256]
Capabilities: [40] Power Management version 2
03:03.0 Class 0200: 11ab:4320 (rev 13)
Subsystem: 15d4:0047
Flags: bus master, 66Mhz, medium devsel, latency 32, IRQ 17
Memory at feaf8000 (32-bit, non-prefetchable) [size=16K]
I/O ports at b000 [size=256]
Expansion ROM at feac0000 [disabled] [size=128K]
Capabilities: [48] Power Management version 2
Capabilities: [50] Vital Product Data
03:05.0 Class 0180: 1095:3114 (rev 02)
Subsystem: 1095:3114
Flags: bus master, 66Mhz, medium devsel, latency 32, IRQ 16
I/O ports at bc00 [size=8]
I/O ports at b880 [size=4]
I/O ports at b800 [size=8]
I/O ports at b480 [size=4]
I/O ports at b400 [size=16]
Memory at feaffc00 (32-bit, non-prefetchable) [size=1K]
Expansion ROM at fea00000 [disabled] [size=512K]
Capabilities: [60] Power Management version 2
Thanks,
-Ethan
Tejun Heo wrote:
> [CC'ing Jeff, Carlos & linux-ide]
>
> Ethan Chen wrote:
>
>> I've got a dual Opteron 242 machine here with 2x Seagate ST3200822AS
>> SATA drives attached to a Silicon Image SI3114 controller, and after
>> upgrading to 2.6.14 from 2.6.13, it seems the SIL_QUIRK_MOD15WRITE
>> workaround for the sata_sil driver isn't being applied anymore. This
>> caused me trouble in the past before my drive was added to the
>> blacklist, and this message that comes up when writing (~4GBfiles to
>> test) files, right before the computer locks up, is the same as before:
>> kernel: ata1: command 0x35 timeout, stat 0xd8 host_stat 0x61
>> In the dmesg, the 'Applying Seagate errata fix' message doesn't
>> appear anymore as well.
>> Finally, without the fix, write speeds are much higher as well,
>> before it locks up.
>
>
> Hello, Ethan.
>
> Sometime ago, Silicon Image has confirmed that 3114's and 3512's are
> not affected by the m15w problem - only 3112's are affected. So, a
> patch has made into the tree before 2.6.14 to apply the m15w quirk
> selectively.
>
> Can you post 'lspci -nv' result?
>
^ permalink raw reply [flat|nested] 7+ messages in thread
* Re: SIL_QUIRK_MOD15WRITE workaround problem on 2.6.14
2005-11-30 4:42 ` Jeff Garzik
@ 2005-11-30 5:37 ` Tejun Heo
2005-12-02 2:01 ` Jeff Garzik
0 siblings, 1 reply; 7+ messages in thread
From: Tejun Heo @ 2005-11-30 5:37 UTC (permalink / raw)
To: Jeff Garzik; +Cc: Ethan Chen, linux-kernel, Carlos Pardo, Linux-ide
Jeff Garzik wrote:
> Tejun Heo wrote:
>
>> [CC'ing Jeff, Carlos & linux-ide]
>>
>> Ethan Chen wrote:
>>
>>> I've got a dual Opteron 242 machine here with 2x Seagate ST3200822AS
>>> SATA drives attached to a Silicon Image SI3114 controller, and after
>>> upgrading to 2.6.14 from 2.6.13, it seems the SIL_QUIRK_MOD15WRITE
>>> workaround for the sata_sil driver isn't being applied anymore. This
>>> caused me trouble in the past before my drive was added to the
>>> blacklist, and this message that comes up when writing (~4GBfiles to
>>> test) files, right before the computer locks up, is the same as before:
>>> kernel: ata1: command 0x35 timeout, stat 0xd8 host_stat 0x61
>>> In the dmesg, the 'Applying Seagate errata fix' message doesn't
>>> appear anymore as well.
>>> Finally, without the fix, write speeds are much higher as well,
>>> before it locks up.
>>
>>
>>
>> Hello, Ethan.
>>
>> Sometime ago, Silicon Image has confirmed that 3114's and 3512's are
>> not affected by the m15w problem - only 3112's are affected. So, a
>> patch has made into the tree before 2.6.14 to apply the m15w quirk
>> selectively.
>
>
> Most likely, mod15write quirk was just hiding an unrelated problem.
> mod15write often hid BIOS problems in the past which led to corruption.
>
> Until sata_sil properly handles SATA phy / DMA errors by resetting the
> controller and retrying the command, we won't know if its a driver
> problem or not.
>
Ethan confirmed that it's 1095:3114. Arghhh.... Maybe we should keep
m15w quirk for 3114's for the time being? Better be slow than hang.
Whatever bug the m15w quirk was hiding.
Carlos, can you double check that 3114's don't have the m15w issue? It
just seems too similar to the usual m15w lockup.
--
tejun
^ permalink raw reply [flat|nested] 7+ messages in thread
* Re: SIL_QUIRK_MOD15WRITE workaround problem on 2.6.14
2005-11-30 5:37 ` Tejun Heo
@ 2005-12-02 2:01 ` Jeff Garzik
2005-12-04 16:44 ` Tejun Heo
0 siblings, 1 reply; 7+ messages in thread
From: Jeff Garzik @ 2005-12-02 2:01 UTC (permalink / raw)
To: Tejun Heo; +Cc: Ethan Chen, linux-kernel, Carlos Pardo, Linux-ide
Tejun Heo wrote:
> Ethan confirmed that it's 1095:3114. Arghhh.... Maybe we should keep
> m15w quirk for 3114's for the time being? Better be slow than hang.
> Whatever bug the m15w quirk was hiding.
A generic 'slow_down_io' module option is reasonable.
It is not appropriate to apply mod15write quirk to hardware that isn't
affected by the chip bug.
A better solution is to write a 311x-specific interrupt handler.
Jeff
^ permalink raw reply [flat|nested] 7+ messages in thread
* Re: SIL_QUIRK_MOD15WRITE workaround problem on 2.6.14
2005-12-02 2:01 ` Jeff Garzik
@ 2005-12-04 16:44 ` Tejun Heo
2005-12-05 18:41 ` Jeff Garzik
0 siblings, 1 reply; 7+ messages in thread
From: Tejun Heo @ 2005-12-04 16:44 UTC (permalink / raw)
To: Jeff Garzik; +Cc: Ethan Chen, linux-kernel, Carlos Pardo, Linux-ide
Jeff Garzik wrote:
> Tejun Heo wrote:
>
>> Ethan confirmed that it's 1095:3114. Arghhh.... Maybe we should keep
>> m15w quirk for 3114's for the time being? Better be slow than hang.
>> Whatever bug the m15w quirk was hiding.
>
>
> A generic 'slow_down_io' module option is reasonable.
>
> It is not appropriate to apply mod15write quirk to hardware that isn't
> affected by the chip bug.
>
> A better solution is to write a 311x-specific interrupt handler.
>
Hello, Jeff. Hello, Carlos.
I bought a sii3114 controller yesterday and took out my ST3120026AS for
testing. The drive times out during a WRITE_EXT, and locks up.
* The ST3120026AS works perfectly on a VIA controller.
* The sii3114 controller works perfectly with Maxtor 6B080M0 drives.
I don't know. It acts and smells like m15w problem. What are the odds
of having the same symptom on the same combination?
Also, I've asked one of my friends who has a sii3512 onboard controller
and an affected seagate drive to test. The harddisk works on 2.6.13
with the quirk, but it freezes on 2.6.14. m15w affected seagate drives
does _NOT_ work on 3512 and 3114 on 2.6.14, be it m15w or something else.
--
tejun
^ permalink raw reply [flat|nested] 7+ messages in thread
* Re: SIL_QUIRK_MOD15WRITE workaround problem on 2.6.14
2005-12-04 16:44 ` Tejun Heo
@ 2005-12-05 18:41 ` Jeff Garzik
0 siblings, 0 replies; 7+ messages in thread
From: Jeff Garzik @ 2005-12-05 18:41 UTC (permalink / raw)
To: Tejun Heo; +Cc: Ethan Chen, linux-kernel, Carlos Pardo, Linux-ide
Tejun Heo wrote:
> Jeff Garzik wrote:
>
>> Tejun Heo wrote:
>>
>>> Ethan confirmed that it's 1095:3114. Arghhh.... Maybe we should
>>> keep m15w quirk for 3114's for the time being? Better be slow than
>>> hang. Whatever bug the m15w quirk was hiding.
>>
>>
>>
>> A generic 'slow_down_io' module option is reasonable.
>>
>> It is not appropriate to apply mod15write quirk to hardware that isn't
>> affected by the chip bug.
>>
>> A better solution is to write a 311x-specific interrupt handler.
>>
>
> Hello, Jeff. Hello, Carlos.
>
> I bought a sii3114 controller yesterday and took out my ST3120026AS for
> testing. The drive times out during a WRITE_EXT, and locks up.
>
> * The ST3120026AS works perfectly on a VIA controller.
> * The sii3114 controller works perfectly with Maxtor 6B080M0 drives.
>
> I don't know. It acts and smells like m15w problem. What are the odds
> of having the same symptom on the same combination?
A lock-up is very generic, and could be anything. Tons of problems are
hidden by slowing things down, so I just feel that presuming this is
_the_ mod15write problem may lead us down the wrong path.
Key example: On error, we need to do a channel-reset, and possibly a
FIFO-reset. See the FreeBSD code. sata_sil doesn't do this at all, and
probably should. The reset may cure lockups.
Jeff
^ permalink raw reply [flat|nested] 7+ messages in thread
end of thread, other threads:[~2005-12-05 18:41 UTC | newest]
Thread overview: 7+ messages (download: mbox.gz follow: Atom feed
-- links below jump to the message on this page --
[not found] <438BD351.60902@ucla.edu>
2005-11-30 4:16 ` SIL_QUIRK_MOD15WRITE workaround problem on 2.6.14 Tejun Heo
2005-11-30 4:42 ` Jeff Garzik
2005-11-30 5:37 ` Tejun Heo
2005-12-02 2:01 ` Jeff Garzik
2005-12-04 16:44 ` Tejun Heo
2005-12-05 18:41 ` Jeff Garzik
2005-11-30 4:48 ` Ethan Chen
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).