* sata_sil 0.9 experience
@ 2005-06-01 18:51 Bob Ramstad
2005-06-01 22:32 ` Kanniball
2005-06-03 5:31 ` Tim Moore
0 siblings, 2 replies; 4+ messages in thread
From: Bob Ramstad @ 2005-06-01 18:51 UTC (permalink / raw)
To: linux-ide
This is anecdotal, but may be helpful. If you require more details,
please let me know.
I recently converted a WBEL3 (RHEL3 clone) system from using IDE
drives to using Serial ATA. I found the performance using the built
in driver abysmal, and switched over to libata + sata_sil. The
sata_sil version shipping with the RHEL3 kernel is 0.54 (ancient) but
gave me much better performance (45x as fast).
Things seemed fine, but I noticed when doing massive copies -- say 20
GB of data in 2000 files -- that it was not uncommon to have one or
two files corrupted in the copy. Recopying the affected files would
fix the problem. I started keeping md5 checksums of anything I was
copying back and forth.
The problem finally drove me nuts, and I did some research online and
found the 0.8 -> 0.9 sata_sil driver. Up until then I didn't know I
was using an ancient version of sata_sil.
Unfortunately, I was unable to just upgrade to 2.4.31 due to
threads... when I compiled 2.4.31-rc1 and booted with it, BIND aka
named would dump core, and upon investigation I discovered that the
vanilla 2.4 kernel would not work right with RHEL3. (I'm sure if I
spent a bunch of time that there has to be a way to compile 2.4 so
that it works with RHEL3, but I'm not really all that interested in
that particular learning curve.)
I then tried to use the sata_sil.c 0.9 with the kernel-source for
RHEL3 and that failed miserably with many compile time errors... I
also tried to replace the entire drivers/scsi directory, and that
failed too.
Finally, I patched by hand my 0.54 sata_sil.c source by looking at the
0.8 -> 0.9 changes and making the same changes.
The upshot of all of this is that my modified 0.54 with manual 0.8 ->
0.9 sata_sil.c seems to work much better. I've done some testing,
with three or four large copies, and had no data corruption. My
testing is not extensive, however.
My hardware is ASUS A7N8X-E Deluxe (nForce2 chipset) with onboard
Silicon Image SATA RAID. Drives are two new Seagates ST3250823AS. I
use them with Linux LVM with some partitions as ext3 on top of RAID1
and some straight ext3 partitions. I do not use the SI RAID
functionality, the drives are used as plain old /dev/sdX with software
/dev/mdX RAID.
I hope this information helps. Of course, I'm feeling a little bit
painted into a corner, I can't keep doing weird hand patches to the
kernel-source for RHEL3 when new versions of the kernel come out...
and given that RedHat seems to have no urgency to upgrade sata_sil
given the shipping version is 0.54 (archaic) I'm hopeful that the 0.9
will finally make it into the next official RHEL3 kernel given the
data corruption issues. The other alternative from my perspective is
to consider going to RHEL4 as at least the learning curve there is
probably something I'll have to deal with eventually anyway.
Thank you for all your hard work on sata_sil -- it is much appreciated
by a LOT of people!
Do let me know if there is anything specific I can test or details I
can provide... or if any of you have specific thoughts on how I can
best maintain my system, given the weird hand patched mess that it is
right now!
-- Bob
^ permalink raw reply [flat|nested] 4+ messages in thread
* Re: sata_sil 0.9 experience
2005-06-01 18:51 sata_sil 0.9 experience Bob Ramstad
@ 2005-06-01 22:32 ` Kanniball
2005-06-01 23:19 ` Bob Ramstad
2005-06-03 5:31 ` Tim Moore
1 sibling, 1 reply; 4+ messages in thread
From: Kanniball @ 2005-06-01 22:32 UTC (permalink / raw)
To: rramstad; +Cc: linux-ide
Bob Ramstad:
this is a well known issue. Check the source code for sata_sil.c where you
will find a lot of seagate drives blacklisted.
check this pages:
http://marc.theaimsgroup.com/?l=linux-ide&m=111079212116799&w=2
http://marc.theaimsgroup.com/?l=linux-ide&m=111260146423451&w=2
http://marc.theaimsgroup.com/?l=linux-ide&m=111173492412748&w=2
although with my experience on same hardware, it that only the write speed is
higher... (after applying both patches).
Regards,
Paulo Fidalgo, aka Kanniball
On Wednesday 01 June 2005 19:51, Bob Ramstad wrote:
> This is anecdotal, but may be helpful. If you require more details,
> please let me know.
>
> I recently converted a WBEL3 (RHEL3 clone) system from using IDE
> drives to using Serial ATA. I found the performance using the built
> in driver abysmal, and switched over to libata + sata_sil. The
> sata_sil version shipping with the RHEL3 kernel is 0.54 (ancient) but
> gave me much better performance (45x as fast).
>
> Things seemed fine, but I noticed when doing massive copies -- say 20
> GB of data in 2000 files -- that it was not uncommon to have one or
> two files corrupted in the copy. Recopying the affected files would
> fix the problem. I started keeping md5 checksums of anything I was
> copying back and forth.
>
> The problem finally drove me nuts, and I did some research online and
> found the 0.8 -> 0.9 sata_sil driver. Up until then I didn't know I
> was using an ancient version of sata_sil.
>
> Unfortunately, I was unable to just upgrade to 2.4.31 due to
> threads... when I compiled 2.4.31-rc1 and booted with it, BIND aka
> named would dump core, and upon investigation I discovered that the
> vanilla 2.4 kernel would not work right with RHEL3. (I'm sure if I
> spent a bunch of time that there has to be a way to compile 2.4 so
> that it works with RHEL3, but I'm not really all that interested in
> that particular learning curve.)
>
> I then tried to use the sata_sil.c 0.9 with the kernel-source for
> RHEL3 and that failed miserably with many compile time errors... I
> also tried to replace the entire drivers/scsi directory, and that
> failed too.
>
> Finally, I patched by hand my 0.54 sata_sil.c source by looking at the
> 0.8 -> 0.9 changes and making the same changes.
>
> The upshot of all of this is that my modified 0.54 with manual 0.8 ->
> 0.9 sata_sil.c seems to work much better. I've done some testing,
> with three or four large copies, and had no data corruption. My
> testing is not extensive, however.
>
> My hardware is ASUS A7N8X-E Deluxe (nForce2 chipset) with onboard
> Silicon Image SATA RAID. Drives are two new Seagates ST3250823AS. I
> use them with Linux LVM with some partitions as ext3 on top of RAID1
> and some straight ext3 partitions. I do not use the SI RAID
> functionality, the drives are used as plain old /dev/sdX with software
> /dev/mdX RAID.
>
> I hope this information helps. Of course, I'm feeling a little bit
> painted into a corner, I can't keep doing weird hand patches to the
> kernel-source for RHEL3 when new versions of the kernel come out...
> and given that RedHat seems to have no urgency to upgrade sata_sil
> given the shipping version is 0.54 (archaic) I'm hopeful that the 0.9
> will finally make it into the next official RHEL3 kernel given the
> data corruption issues. The other alternative from my perspective is
> to consider going to RHEL4 as at least the learning curve there is
> probably something I'll have to deal with eventually anyway.
>
> Thank you for all your hard work on sata_sil -- it is much appreciated
> by a LOT of people!
>
> Do let me know if there is anything specific I can test or details I
> can provide... or if any of you have specific thoughts on how I can
> best maintain my system, given the weird hand patched mess that it is
> right now!
>
> -- Bob
^ permalink raw reply [flat|nested] 4+ messages in thread
* Re: sata_sil 0.9 experience
2005-06-01 22:32 ` Kanniball
@ 2005-06-01 23:19 ` Bob Ramstad
0 siblings, 0 replies; 4+ messages in thread
From: Bob Ramstad @ 2005-06-01 23:19 UTC (permalink / raw)
To: kanniball; +Cc: linux-ide, jgarzik
Thanks for the response. I was actually writing to indicate that I
believe the sata_sil 0.9 patch fixes the data corruption problem I was
experiencing with no measurable performance penalty, not to complain
about a problem.
I'm sorry if my first post wasn't clear. Here's the summary. Using
an older kernel with sata_sil 0.54 I was experiencing data corruption
when copying 20+ GB in 2000+ files at a time. After applying the
diffs for 0.8 to 0.9 I am no longer experiencing the problem. ASUS
A7N8X-E Deluxe (nForce2 chipset) with onboard Silicon Image. Drives
are two new Seagates ST3250823AS.
FWIW, according to Seagate only a couple of drives have the mod15
problem, definitely not including the newer drives. Jeff Garzik
recently indicated in the 0.9 sata_sil patch that he's been wondering
if the increasingly large number of drives on the blacklist isn't
indicative of some other problem, like a timing issue, that only comes
up under heavy load, and that the blacklist serves as a way to slow
the drives down enough that the timing problem doesn't happen, as a
side effect, and the net result is that people think "hey, I need to
add my drive to the black list" when that isn't the best solution. I
think Jeff might be on to something, given my very limited personal
experience on this.
BTW, my drive model has never been blacklisted, to my knowledge.
They're pretty new though.
Also BTW, I ended up finding a RHEL3 clone with a much newer kernel
that had 0.8 sata_sil builtin, and was able to patch it to 0.9 and
recompile with it, and I'm running that now on my systems, rather than
the older kernel that I had to hand patch. So, as is usual in this
kind of situation, instead of making it harder than necessary I should
have looked around a bit more...
-- Bob
On 6/1/05, Kanniball <kanniball@zmail.pt> wrote:
> Bob Ramstad:
> this is a well known issue. Check the source code for sata_sil.c where you
> will find a lot of seagate drives blacklisted.
> check this pages:
> http://marc.theaimsgroup.com/?l=linux-ide&m=111079212116799&w=2
> http://marc.theaimsgroup.com/?l=linux-ide&m=111260146423451&w=2
> http://marc.theaimsgroup.com/?l=linux-ide&m=111173492412748&w=2
>
> although with my experience on same hardware, it that only the write speed is
> higher... (after applying both patches).
>
> Regards,
>
> Paulo Fidalgo, aka Kanniball
>
>
> On Wednesday 01 June 2005 19:51, Bob Ramstad wrote:
> > This is anecdotal, but may be helpful. If you require more details,
> > please let me know.
> >
> > I recently converted a WBEL3 (RHEL3 clone) system from using IDE
> > drives to using Serial ATA. I found the performance using the built
> > in driver abysmal, and switched over to libata + sata_sil. The
> > sata_sil version shipping with the RHEL3 kernel is 0.54 (ancient) but
> > gave me much better performance (45x as fast).
> >
> > Things seemed fine, but I noticed when doing massive copies -- say 20
> > GB of data in 2000 files -- that it was not uncommon to have one or
> > two files corrupted in the copy. Recopying the affected files would
> > fix the problem. I started keeping md5 checksums of anything I was
> > copying back and forth.
> >
> > The problem finally drove me nuts, and I did some research online and
> > found the 0.8 -> 0.9 sata_sil driver. Up until then I didn't know I
> > was using an ancient version of sata_sil.
> >
> > Unfortunately, I was unable to just upgrade to 2.4.31 due to
> > threads... when I compiled 2.4.31-rc1 and booted with it, BIND aka
> > named would dump core, and upon investigation I discovered that the
> > vanilla 2.4 kernel would not work right with RHEL3. (I'm sure if I
> > spent a bunch of time that there has to be a way to compile 2.4 so
> > that it works with RHEL3, but I'm not really all that interested in
> > that particular learning curve.)
> >
> > I then tried to use the sata_sil.c 0.9 with the kernel-source for
> > RHEL3 and that failed miserably with many compile time errors... I
> > also tried to replace the entire drivers/scsi directory, and that
> > failed too.
> >
> > Finally, I patched by hand my 0.54 sata_sil.c source by looking at the
> > 0.8 -> 0.9 changes and making the same changes.
> >
> > The upshot of all of this is that my modified 0.54 with manual 0.8 ->
> > 0.9 sata_sil.c seems to work much better. I've done some testing,
> > with three or four large copies, and had no data corruption. My
> > testing is not extensive, however.
> >
> > My hardware is ASUS A7N8X-E Deluxe (nForce2 chipset) with onboard
> > Silicon Image SATA RAID. Drives are two new Seagates ST3250823AS. I
> > use them with Linux LVM with some partitions as ext3 on top of RAID1
> > and some straight ext3 partitions. I do not use the SI RAID
> > functionality, the drives are used as plain old /dev/sdX with software
> > /dev/mdX RAID.
> >
> > I hope this information helps. Of course, I'm feeling a little bit
> > painted into a corner, I can't keep doing weird hand patches to the
> > kernel-source for RHEL3 when new versions of the kernel come out...
> > and given that RedHat seems to have no urgency to upgrade sata_sil
> > given the shipping version is 0.54 (archaic) I'm hopeful that the 0.9
> > will finally make it into the next official RHEL3 kernel given the
> > data corruption issues. The other alternative from my perspective is
> > to consider going to RHEL4 as at least the learning curve there is
> > probably something I'll have to deal with eventually anyway.
> >
> > Thank you for all your hard work on sata_sil -- it is much appreciated
> > by a LOT of people!
> >
> > Do let me know if there is anything specific I can test or details I
> > can provide... or if any of you have specific thoughts on how I can
> > best maintain my system, given the weird hand patched mess that it is
> > right now!
> >
> > -- Bob
>
>
^ permalink raw reply [flat|nested] 4+ messages in thread
* Re: sata_sil 0.9 experience
2005-06-01 18:51 sata_sil 0.9 experience Bob Ramstad
2005-06-01 22:32 ` Kanniball
@ 2005-06-03 5:31 ` Tim Moore
1 sibling, 0 replies; 4+ messages in thread
From: Tim Moore @ 2005-06-03 5:31 UTC (permalink / raw)
To: linux-ide
> ...
> Unfortunately, I was unable to just upgrade to 2.4.31 due to
> threads... when I compiled 2.4.31-rc1 and booted with it, BIND aka
> named would dump core, and upon investigation I discovered that the
> vanilla 2.4 kernel would not work right with RHEL3. (I'm sure if I
> spent a bunch of time that there has to be a way to compile 2.4 so
> that it works with RHEL3, but I'm not really all that interested in
> that particular learning curve.)
search for LD_ASSUME_KERNEL=2.4.19; export LD_ASSUME_KERNEL
I run 2.4.30 on a RHEL3 base at work without a problem, including bluecurve.
Try /etc/csh.*, /etc/profile /etc/init.d/{nscd,ypbind,named} for starters.
sata_sil should compile too.
t.
^ permalink raw reply [flat|nested] 4+ messages in thread
end of thread, other threads:[~2005-06-03 5:31 UTC | newest]
Thread overview: 4+ messages (download: mbox.gz follow: Atom feed
-- links below jump to the message on this page --
2005-06-01 18:51 sata_sil 0.9 experience Bob Ramstad
2005-06-01 22:32 ` Kanniball
2005-06-01 23:19 ` Bob Ramstad
2005-06-03 5:31 ` Tim Moore
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).