* ATA errors corrupt other I/O & set / readonly @ 2012-05-02 8:59 Felix Miata 2012-05-19 23:28 ` Robert Hancock 0 siblings, 1 reply; 4+ messages in thread From: Felix Miata @ 2012-05-02 8:59 UTC (permalink / raw) To: linux-ide Two weeks ago on recommendation of downstream bug owner I filed https://bugzilla.kernel.org/show_bug.cgi?id=43128 expecting to see some kind of non-delayed reaction from someone either in the bug or here, and am quite disappointed that nothing has yet happened. Is the bug's owner sick, on holiday, or at a convention? I originally resisted filing because I believe that actual problem is the device's and not kernel or drivers. Yet, that it is possible for any such device failure to cause system corruption doesn't seem right either. So, can someone please comment on the likelihood that this bug can or will be fixed? I would like someone to respond here or in the bug before proceeding to try again to convince the manufacturer its product is defective and get a refund instead of yet another replacement of a defective product with another defective product. Once I return it for refund I will no longer be able to follow up on the bugs I filed, yet I need a working device that doesn't require I figure out a correct cmdline workaround for every one of the several systems it ever needs to be connected to. Please, what should I do? -- "The wise are known for their understanding, and pleasant words are persuasive." Proverbs 16:21 (New Living Translation) Team OS/2 ** Reg. Linux User #211409 ** a11y rocks! Felix Miata *** http://fm.no-ip.com/ ^ permalink raw reply [flat|nested] 4+ messages in thread
* Re: ATA errors corrupt other I/O & set / readonly 2012-05-02 8:59 ATA errors corrupt other I/O & set / readonly Felix Miata @ 2012-05-19 23:28 ` Robert Hancock 2012-05-20 4:56 ` Felix Miata 0 siblings, 1 reply; 4+ messages in thread From: Robert Hancock @ 2012-05-19 23:28 UTC (permalink / raw) To: Felix Miata; +Cc: linux-ide On 05/02/2012 02:59 AM, Felix Miata wrote: > Two weeks ago on recommendation of downstream bug owner I filed > https://bugzilla.kernel.org/show_bug.cgi?id=43128 expecting to see some > kind of non-delayed reaction from someone either in the bug or here, and > am quite disappointed that nothing has yet happened. Is the bug's owner > sick, on holiday, or at a convention? > > I originally resisted filing because I believe that actual problem is > the device's and not kernel or drivers. Yet, that it is possible for any > such device failure to cause system corruption doesn't seem right > either. So, can someone please comment on the likelihood that this bug > can or will be fixed? I would like someone to respond here or in the bug > before proceeding to try again to convince the manufacturer its product > is defective and get a refund instead of yet another replacement of a > defective product with another defective product. > > Once I return it for refund I will no longer be able to follow up on the > bugs I filed, yet I need a working device that doesn't require I figure > out a correct cmdline workaround for every one of the several systems it > ever needs to be connected to. > > Please, what should I do? It's hard to follow the discussion in the openSUSE bug report. I'm not sure if there was any dmesg posted from the case where other IO was being interfered with by the problems with the external device. In AHCI mode that really shouldn't happen, but in IDE mode on Intel controllers it may be more possible because of the PATA emulation that's effectively being done by the controller. It's not possible to put in a blacklist entry for a specific SATA enclosure because they're essentially a passive device and there's no way to identify them through software (unless they have a custom ID string like the WD MyBook drives). If it only works properly at 1.5 Gbps then a module/boot parameter may be the best solution. ^ permalink raw reply [flat|nested] 4+ messages in thread
* Re: ATA errors corrupt other I/O & set / readonly 2012-05-19 23:28 ` Robert Hancock @ 2012-05-20 4:56 ` Felix Miata 2012-05-25 1:03 ` Robert Hancock 0 siblings, 1 reply; 4+ messages in thread From: Felix Miata @ 2012-05-20 4:56 UTC (permalink / raw) To: linux-ide On 2012/05/19 17:28 (GMT-0600) Robert Hancock composed: > On 2012/05/02 Felix Miata wrote: >> https://bugzilla.kernel.org/show_bug.cgi?id=43128 ... >> Please, what should I do? > It's hard to follow the discussion in the openSUSE bug report. I'm not > sure if there was any dmesg posted from the case where other IO was > being interfered with by the problems with the external device. In AHCI > mode that really shouldn't happen, but in IDE mode on Intel controllers > it may be more possible because of the PATA emulation that's effectively > being done by the controller. The "other" I/O being done while I'm attempting to use these is usually limited to tty I/O, either bash or MC running in bash running on tty[1-6] in runlevel 3 to copy files between one HD and another, usually both source and target in individual RX-358 units, often just from the NTFS partition to the EXT2 partition in the same RX-358 unit. The STB[1] that I bought the V2 to use with only writes to Windows partitions via USB connection, which complicates life by forcing me to move files off the NTFS partition to free up space. The V2 also produces trouble in USB mode, and when connected via SATA to WinXP, for which the only workaround I know is rebooting until it gets recognized. Connected via USB to the STB the NTFS partition would sometimes become marked so as to become unavailable to the STB. The only way to reenable access to it is to have WinXP run CHKDSK on it, which is how I would discover the unreliability of its connectibility to WinXP. I use these devices with multiple machines, many of which are Dells whose BIOS contain the string "AHCI" nowhere in setup. I never set any machine to use a legacy mode except temporarily for troubleshooting purposes, but this doesn't necessarily mean AHCI is used. IIRC, I was only ever asked for /var/log/messages, never dmesg. I can never remember what distinguishes the content provided by the two. If you want a dmesg attached to either bug, I need to know when and how you want me to try and get it. I say "try" because once corruption begins, getting dmesg is unlikely via keyboard on a tty, and potential methods of fetching it any other way are not known to me. It might be sitting on tty11 or elsewhere that I could copy manually from if I knew well enough what to copy or omit. > It's not possible to put in a blacklist entry for a specific SATA > enclosure because they're essentially a passive device and there's no > way to identify them through software (unless they have a custom ID > string like the WD MyBook drives). If it only works properly at 1.5 Gbps > then a module/boot parameter may be the best solution. It would be helpful if I knew a blanket cmdline parameter to use to cap everything at 1.5 instead of for each machine having to discover all the bus particulars and remember which go where on reboot to place them into effect only for the port the V2 is or might be connected to. I can't picture myself noticing on any of my machines whether I/O was capped at 1.5 or not. The maximum number of HDs used at once in any of them would be two in most, and three in two machines with RAID1 configured. A couple of weeks after filing the kernel.org bug I contacted the manufacturer again to provide the new information contained in the bugs. Again it offered to replace. Only yesterday I finally answered the offer that I'd probably prefer to keep it in lieu of wasting more money on shipping only to be left with no opportunity for further testing and bug follow-up, and yet another replacement no better than the original, or a store credit making me shop for something else with equivalent functionality. If there was a kernel dev interested in pursuing this it might be possible to get the manufacturer to provide a V2 device to test with. Considering the current delivered price of the V2 is roughly half that of the V1 it appears remaining stock is trying to be dumped. [1] http://www.manhattan-digital.net/rs1933.htm -- "The wise are known for their understanding, and pleasant words are persuasive." Proverbs 16:21 (New Living Translation) Team OS/2 ** Reg. Linux User #211409 ** a11y rocks! Felix Miata *** http://fm.no-ip.com/ ^ permalink raw reply [flat|nested] 4+ messages in thread
* Re: ATA errors corrupt other I/O & set / readonly 2012-05-20 4:56 ` Felix Miata @ 2012-05-25 1:03 ` Robert Hancock 0 siblings, 0 replies; 4+ messages in thread From: Robert Hancock @ 2012-05-25 1:03 UTC (permalink / raw) To: Felix Miata; +Cc: linux-ide Sorry I missed your response initially. Please use reply-to-all. On 05/19/2012 10:56 PM, Felix Miata wrote: > On 2012/05/19 17:28 (GMT-0600) Robert Hancock composed: > >> On 2012/05/02 Felix Miata wrote: > >>> https://bugzilla.kernel.org/show_bug.cgi?id=43128 > ... >>> Please, what should I do? > >> It's hard to follow the discussion in the openSUSE bug report. I'm not >> sure if there was any dmesg posted from the case where other IO was >> being interfered with by the problems with the external device. In AHCI >> mode that really shouldn't happen, but in IDE mode on Intel controllers >> it may be more possible because of the PATA emulation that's effectively >> being done by the controller. > > The "other" I/O being done while I'm attempting to use these is usually > limited to tty I/O, either bash or MC running in bash running on > tty[1-6] in runlevel 3 to copy files between one HD and another, usually > both source and target in individual RX-358 units, often just from the > NTFS partition to the EXT2 partition in the same RX-358 unit. The STB[1] > that I bought the V2 to use with only writes to Windows partitions via > USB connection, which complicates life by forcing me to move files off > the NTFS partition to free up space. > > The V2 also produces trouble in USB mode, and when connected via SATA to > WinXP, for which the only workaround I know is rebooting until it gets > recognized. Connected via USB to the STB the NTFS partition would > sometimes become marked so as to become unavailable to the STB. The only > way to reenable access to it is to have WinXP run CHKDSK on it, which is > how I would discover the unreliability of its connectibility to WinXP. > > I use these devices with multiple machines, many of which are Dells > whose BIOS contain the string "AHCI" nowhere in setup. I never set any > machine to use a legacy mode except temporarily for troubleshooting > purposes, but this doesn't necessarily mean AHCI is used. > > IIRC, I was only ever asked for /var/log/messages, never dmesg. I can > never remember what distinguishes the content provided by the two. /var/log/messages contains non-kernel logging as well, but it sometimes doesn't show all log messages output in dmesg. > > If you want a dmesg attached to either bug, I need to know when and how > you want me to try and get it. I say "try" because once corruption > begins, getting dmesg is unlikely via keyboard on a tty, and potential > methods of fetching it any other way are not known to me. It might be > sitting on tty11 or elsewhere that I could copy manually from if I knew > well enough what to copy or omit. It's possible you may be able to use a previously-opened SSH login from another machine to see dmesg output even if the process running on the console is stalled due to an I/O problem. Also, you could try booting from a USB live image so that any ATA issues won't prevent the system from reading executables. > >> It's not possible to put in a blacklist entry for a specific SATA >> enclosure because they're essentially a passive device and there's no >> way to identify them through software (unless they have a custom ID >> string like the WD MyBook drives). If it only works properly at 1.5 Gbps >> then a module/boot parameter may be the best solution. > > It would be helpful if I knew a blanket cmdline parameter to use to cap > everything at 1.5 instead of for each machine having to discover all the > bus particulars and remember which go where on reboot to place them into > effect only for the port the V2 is or might be connected to. I can't > picture myself noticing on any of my machines whether I/O was capped at > 1.5 or not. The maximum number of HDs used at once in any of them would > be two in most, and three in two machines with RAID1 configured. I haven't tested but AFAICS, using libata.force=1.5Gbps should force 1.5 Gbps on all ports. > > A couple of weeks after filing the kernel.org bug I contacted the > manufacturer again to provide the new information contained in the bugs. > Again it offered to replace. Only yesterday I finally answered the offer > that I'd probably prefer to keep it in lieu of wasting more money on > shipping only to be left with no opportunity for further testing and bug > follow-up, and yet another replacement no better than the original, or a > store credit making me shop for something else with equivalent > functionality. > > If there was a kernel dev interested in pursuing this it might be > possible to get the manufacturer to provide a V2 device to test with. > Considering the current delivered price of the V2 is roughly half that > of the V1 it appears remaining stock is trying to be dumped. > > [1] http://www.manhattan-digital.net/rs1933.htm ^ permalink raw reply [flat|nested] 4+ messages in thread
end of thread, other threads:[~2012-05-25 1:04 UTC | newest] Thread overview: 4+ messages (download: mbox.gz follow: Atom feed -- links below jump to the message on this page -- 2012-05-02 8:59 ATA errors corrupt other I/O & set / readonly Felix Miata 2012-05-19 23:28 ` Robert Hancock 2012-05-20 4:56 ` Felix Miata 2012-05-25 1:03 ` Robert Hancock
This is a public inbox, see mirroring instructions for how to clone and mirror all data and code used for this inbox; as well as URLs for NNTP newsgroup(s).