* Compatibility of current 2.4.19.pending patchset with old data-logging?? @ 2002-09-12 0:24 Manuel Krause 2002-09-12 13:28 ` newbie how to darren 2002-09-12 21:30 ` Copy time comparison 2.4.20-pre6 <-> 2.4.19+data-logging (was:Compatibility of current 2.4.19.pending ...) Manuel Krause 0 siblings, 2 replies; 19+ messages in thread From: Manuel Krause @ 2002-09-12 0:24 UTC (permalink / raw) To: reiserfs-list Hi! Originally I wanted to prepare a new kernel based on 2.4.20-pre6. That didn't work due to a gcc compiler failure I have to investigate on here right now. Very dubious, the script even told me to send a detailed bug report - definitely too much for tonight. A 2.4.19 compiles well with my old patchset. Previously I followed the 2.4.19.pending patch directory @ftp.namesys.com (by downloading and particularily reading, but not applying since Chris stopped his data-logging series -- That was when patch "05" changed its name from "05-hash_on_empty_fs-1.diff" to "05-reiserfs_make_bad_inode.diff", on 02-08-13, and maybe I even used some older patches from the latest rc to keep the data-logging working). BTW, it's an irritating thing to change the patches' names. That adds just more confusion to namesys' release management that could be considered "non-official" to our list or is lacking more public relation output. Now I'm completely confused with my private file-naming-convention, the newest reiserfs 2.4.19.pending directory and the following question: Is the current 2.4.19.pending compatible with data-logging any more??? When I see the discussion about the new block-allocator, the testing directory etc. I assume a definite "No." (Please, please, correct me, if I'm wrong...) So, for the overnight change to pre-kernel 7, where are the new data-logging patches Hans even announced to lkml? Thanks for your hints ;-) Manuel ^ permalink raw reply [flat|nested] 19+ messages in thread
* newbie how to 2002-09-12 0:24 Compatibility of current 2.4.19.pending patchset with old data-logging?? Manuel Krause @ 2002-09-12 13:28 ` darren 2002-09-12 14:37 ` Hans Reiser 2002-09-12 21:30 ` Copy time comparison 2.4.20-pre6 <-> 2.4.19+data-logging (was:Compatibility of current 2.4.19.pending ...) Manuel Krause 1 sibling, 1 reply; 19+ messages in thread From: darren @ 2002-09-12 13:28 UTC (permalink / raw) To: 'reiserfs-list' Hi all, I have a brand new RedHat computer just setup. It currently have two partitions...both EXT3. How do I make one of my drives reiserfs? ^ permalink raw reply [flat|nested] 19+ messages in thread
* Re: newbie how to 2002-09-12 13:28 ` newbie how to darren @ 2002-09-12 14:37 ` Hans Reiser 2002-09-12 15:12 ` Guillermo Torreiro 0 siblings, 1 reply; 19+ messages in thread From: Hans Reiser @ 2002-09-12 14:37 UTC (permalink / raw) To: darren; +Cc: 'reiserfs-list' darren wrote: >Hi all, > >I have a brand new RedHat computer just setup. It currently have two >partitions...both EXT3. > >How do I make one of my drives reiserfs? > > > > > > use tar. tar does a good job, and nfs to a file server (or some borrowed hard drive) plus tar will work well. Remember to run lilo after untarring. ^ permalink raw reply [flat|nested] 19+ messages in thread
* Re: newbie how to 2002-09-12 14:37 ` Hans Reiser @ 2002-09-12 15:12 ` Guillermo Torreiro 0 siblings, 0 replies; 19+ messages in thread From: Guillermo Torreiro @ 2002-09-12 15:12 UTC (permalink / raw) To: darren; +Cc: reiserfs-list > darren wrote: > > >Hi all, > > > >I have a brand new RedHat computer just setup. It > currently have two > >partitions...both EXT3. > > > >How do I make one of my drives reiserfs? > > > > > > > > > > > > --- Hans Reiser <reiser@namesys.com> wrote: > use tar. tar does a good job, and nfs to a file > server (or some > borrowed hard drive) plus tar will work well. > > Remember to run lilo after untarring. > > a bit more extensive approach: 1) Like Hans said, use tar to backup your partition, and put it in some place over a NFS drive or another local drive. 2) umount your partition to be converted to ReiserFS. 3) format this partition using ReiserFS tools. 4) edit /etc/fstab and change the line having your partition to the following: /dev/hdxy /xx/yy reiserfs defaults 1 2 where /dev/hdxy is your partition (I guess it's ide) and /xx/yy is the mount point. 5) mount it again. 6) untar your backup into your new ReiserFS partition __________________________________________________ Do you Yahoo!? Yahoo! News - Today's headlines http://news.yahoo.com ^ permalink raw reply [flat|nested] 19+ messages in thread
* Copy time comparison 2.4.20-pre6 <-> 2.4.19+data-logging (was:Compatibility of current 2.4.19.pending ...) 2002-09-12 0:24 Compatibility of current 2.4.19.pending patchset with old data-logging?? Manuel Krause 2002-09-12 13:28 ` newbie how to darren @ 2002-09-12 21:30 ` Manuel Krause 2002-09-13 7:37 ` Oleg Drokin 1 sibling, 1 reply; 19+ messages in thread From: Manuel Krause @ 2002-09-12 21:30 UTC (permalink / raw) To: reiserfs-list On 09/12/2002 02:24 AM, Manuel Krause wrote: > Hi! > > Originally I wanted to prepare a new kernel based on 2.4.20-pre6. That > didn't work due to a gcc compiler failure I have to investigate on here [...] Solved by upgrading from gcc-2.95.2 to the recommended version 2.95.3 [...] > Now I'm completely confused with my private file-naming-convention, the > newest reiserfs 2.4.19.pending directory and the following question: > > Is the current 2.4.19.pending compatible with data-logging any more??? > > When I see the discussion about the new block-allocator, the testing > directory etc. I assume a definite "No." (Please, please, correct me, if > I'm wrong...) ;-) You really didn't correct me so far. Mmh. :-( > > So, for the overnight change to pre-kernel 7, where are the new > data-logging patches Hans even announced to lkml? So I really got a 2.4.20-pre6 compiled with the preempt-kernel-rml-2.4.20-pre5-1.patch and Olegs fix he sent to the list yesterday. BTW, have the testing patches from -pre5 gone into -pre6? It isn't clear to me. You see the timing values of my backup session (cp -a A/. B/) from this afternoon as follows: partition 1: /dev/hdd7 2722896 2043700 679196 76% /mnt/beta (is my / filesystem) used FS: noatime,notail real 11m2.265s user 0m1.480s sys 0m47.550s ~3,014MB/s 32% fresh FS: noatime,notail real 5m19.686s user 0m1.210s sys 0m46.980s ~6,243MB/s 66% fresh FS: noatime,notail,data=ordered -- kernel 2.4.19+rml-preempt+reiserfs+data-logging-patches (real 3m19.895s user 0m1.400s sys 0m49.890s) ~9,409MB/s =100% partition 2: /dev/hdd11 5550248 3801504 1748744 69% /mnt/beta (my software "repository", accessed not really often) used FS: noatime,notail real 12m30.172s user 0m1.360s sys 1m10.350s ~4,949MB/s 61% fresh FS: noatime,notail real 10m11.303s user 0m1.240s sys 1m10.400s ~6,073MB/s 74% fresh FS: noatime,notail,data=ordered -- kernel 2.4.19+rml-preempt+reiserfs+data-logging-patches (real 8m56.040s user 0m1.410s sys 1m27.920s) ~8,180MB/s =100% (...) = timings with slightly different disk content one month ago Application startup timings of NS-7.0 and OOo-1.0.1 didn't increase. :-)) I hope I don't need to comment on the values... Thanks, Manuel ^ permalink raw reply [flat|nested] 19+ messages in thread
* Re: Copy time comparison 2.4.20-pre6 <-> 2.4.19+data-logging (was:Compatibility of current 2.4.19.pending ...) 2002-09-12 21:30 ` Copy time comparison 2.4.20-pre6 <-> 2.4.19+data-logging (was:Compatibility of current 2.4.19.pending ...) Manuel Krause @ 2002-09-13 7:37 ` Oleg Drokin 2002-09-13 23:07 ` Manuel Krause 0 siblings, 1 reply; 19+ messages in thread From: Oleg Drokin @ 2002-09-13 7:37 UTC (permalink / raw) To: Manuel Krause; +Cc: reiserfs-list Hello! On Thu, Sep 12, 2002 at 11:30:22PM +0200, Manuel Krause wrote: > >When I see the discussion about the new block-allocator, the testing > >directory etc. I assume a definite "No." (Please, please, correct me, if > ;-) You really didn't correct me so far. Mmh. :-( Datalogging is incompatible with stuff in testing directory. It won't apply cleanly to new block allocatior, but that's can be easily fixed. > >So, for the overnight change to pre-kernel 7, where are the new > >data-logging patches Hans even announced to lkml? > So I really got a 2.4.20-pre6 compiled with the > preempt-kernel-rml-2.4.20-pre5-1.patch and Olegs fix he sent to the list > yesterday. BTW, have the testing patches from -pre5 gone into -pre6? It > isn't clear to me. Testing patches were merged into -pre6 by Marcelo's mistake and -pre7 have these patches removed already. > Application startup timings of NS-7.0 and OOo-1.0.1 didn't increase. :-)) Sure, read code path did not changed at all. > I hope I don't need to comment on the values... Sure. datalogging and new file_write code are not solutions to same problem. These are solutions for different problems and can be used together (after merging of course) for even better performance, I hope ;) Bye, Oleg ^ permalink raw reply [flat|nested] 19+ messages in thread
* Re: Copy time comparison 2.4.20-pre6 <-> 2.4.19+data-logging (was:Compatibility of current 2.4.19.pending ...) 2002-09-13 7:37 ` Oleg Drokin @ 2002-09-13 23:07 ` Manuel Krause 2002-09-14 9:02 ` Oleg Drokin 0 siblings, 1 reply; 19+ messages in thread From: Manuel Krause @ 2002-09-13 23:07 UTC (permalink / raw) To: Oleg Drokin; +Cc: reiserfs-list On 09/13/2002 09:37 AM, Oleg Drokin wrote: > Hello! > > On Thu, Sep 12, 2002 at 11:30:22PM +0200, Manuel Krause wrote: > >>>When I see the discussion about the new block-allocator, the testing >>>directory etc. I assume a definite "No." (Please, please, correct me, if >> >>;-) You really didn't correct me so far. Mmh. :-( > > Datalogging is incompatible with stuff in testing directory. It won't apply > cleanly to new block allocatior, but that's can be easily fixed. > >>>So, for the overnight change to pre-kernel 7, where are the new >>>data-logging patches Hans even announced to lkml? ??? Hello, Chris! >>So I really got a 2.4.20-pre6 compiled with the >>preempt-kernel-rml-2.4.20-pre5-1.patch and Olegs fix he sent to the list >>yesterday. BTW, have the testing patches from -pre5 gone into -pre6? It >>isn't clear to me. > > Testing patches were merged into -pre6 by Marcelo's mistake and -pre7 > have these patches removed already. Yes. I pre-read this @ lkml. >>Application startup timings of NS-7.0 and OOo-1.0.1 didn't increase. :-)) > > > Sure, read code path did not changed at all. Seems so, even AntiVir takes the same time as with data-logging ordered-mode... :-) >>I hope I don't need to comment on the values... > > Sure. datalogging and new file_write code are not solutions to same problem. > These are solutions for different problems and can be used together (after > merging of course) for even better performance, I hope ;) Thanks a lot for your informative reply, Oleg! Did I follow the thread "Re: [reiserfs-list] [BK] ReiserFS file write bug fix for 2.4" correctly that going to 2.4.20-pre7 that is without file_write (+) code should speed up (in my case copy actions) in comparison to 2.4.20-pre6? I retested my backup session (see last mail) with 2.4.20-pre7 but have _NO_ ( <0,2% !!!) difference at all. Have I missed something? Thanks, Manuel ^ permalink raw reply [flat|nested] 19+ messages in thread
* Re: Copy time comparison 2.4.20-pre6 <-> 2.4.19+data-logging (was:Compatibility of current 2.4.19.pending ...) 2002-09-13 23:07 ` Manuel Krause @ 2002-09-14 9:02 ` Oleg Drokin 2002-09-17 0:53 ` Manuel Krause [not found] ` <3D867C14.5060404@mb.tu-ilmenau.de> 0 siblings, 2 replies; 19+ messages in thread From: Oleg Drokin @ 2002-09-14 9:02 UTC (permalink / raw) To: Manuel Krause; +Cc: reiserfs-list Hello! On Sat, Sep 14, 2002 at 01:07:37AM +0200, Manuel Krause wrote: > >>>So, for the overnight change to pre-kernel 7, where are the new > >>>data-logging patches Hans even announced to lkml? > ??? Hello, Chris! Chris said he'll either announce something on tuesday or pass the code to me to finish it. > Did I follow the thread "Re: [reiserfs-list] [BK] ReiserFS file write > bug fix for 2.4" correctly that going to 2.4.20-pre7 that is without > file_write (+) code should speed up (in my case copy actions) in > comparison to 2.4.20-pre6? No. file_write code should speed up writes that are done in big chunks (more than 4k). > I retested my backup session (see last mail) with 2.4.20-pre7 but have > _NO_ ( <0,2% !!!) difference at all. What tool do you use for backup? I certainly see the difference when doing dd if=/dev/zero of=somefile bs=1024k count=100 also when doing cp bigfile /another_disk/anotherfile Also very huge difference is seen when creating holes ;) e.g compare time of seeking to 128Gb and writing one byte between 2.4.20-pre7 and 2.4.20-pre6 ;) Code for that was once posted on our mailing list. When you will feel that 2.4.20-pre7 hanged, don't trust that feeling and wait till the end of operation. ;) > Have I missed something? To see the difference you need to fed FS with large chuncks of data. Bye, Oleg ^ permalink raw reply [flat|nested] 19+ messages in thread
* Re: Copy time comparison 2.4.20-pre6 <-> 2.4.19+data-logging (was:Compatibility of current 2.4.19.pending ...) 2002-09-14 9:02 ` Oleg Drokin @ 2002-09-17 0:53 ` Manuel Krause [not found] ` <3D867C14.5060404@mb.tu-ilmenau.de> 1 sibling, 0 replies; 19+ messages in thread From: Manuel Krause @ 2002-09-17 0:53 UTC (permalink / raw) Cc: reiserfs-list On 09/14/2002 11:02 AM, Oleg Drokin wrote: > Hello! > > On Sat, Sep 14, 2002 at 01:07:37AM +0200, Manuel Krause wrote: > >>>>>So, for the overnight change to pre-kernel 7, where are the new >>>>>data-logging patches Hans even announced to lkml? >>>> >>??? Hello, Chris! > > > Chris said he'll either announce something on tuesday or pass the code to me to > finish it. > I included this into my prayers. > >>Did I follow the thread "Re: [reiserfs-list] [BK] ReiserFS file write >>bug fix for 2.4" correctly that going to 2.4.20-pre7 that is without >>file_write (+) code should speed up (in my case copy actions) in >>comparison to 2.4.20-pre6? > > > No. file_write code should speed up writes that are done in big chunks (more > than 4k). > > >>I retested my backup session (see last mail) with 2.4.20-pre7 but have >>_NO_ ( <0,2% !!!) difference at all. > > > What tool do you use for backup? Hey, Oleg, you should know this; I stayed with my simple but useful scripts using "cp -ax A/. B/" for testing reiserfs performance on this command related to my reiserfs partitions' content. That content is not theoretical, it's living for about one month, then newly recreated -- and these backups showed up some issues in the past. ;-)) > I certainly see the difference when doing > dd if=/dev/zero of=somefile bs=1024k count=100 I didn't do dd related time comparisons though having the values for the last months. Many months ago I found "bs=1M" to be more useful ;-) > > also when doing cp bigfile /another_disk/anotherfile > > Also very huge difference is seen when creating holes ;) > e.g compare time of seeking to 128Gb and writing one byte between > 2.4.20-pre7 and 2.4.20-pre6 ;) > Code for that was once posted on our mailing list. > When you will feel that 2.4.20-pre7 hanged, don't trust that feeling > and wait till the end of operation. ;) > You're kind of joking?! "time <command>" should show the total lack of time related performance, when [throughput] is divided by [time] or not?! I calculated these values manually based upon the timings & df values after copying. Yes, I want to show my personal engagement for the reiserfs community, too. > >>Have I missed something? > > > To see the difference you need to fed FS with large chuncks of data. > Thanks to you, Oleg, Manuel ^ permalink raw reply [flat|nested] 19+ messages in thread
[parent not found: <3D867C14.5060404@mb.tu-ilmenau.de>]
* Re: Copy time comparison 2.4.20-pre6 <-> 2.4.19+data-logging (was:Compatibility of current 2.4.19.pending ...) [not found] ` <3D867C14.5060404@mb.tu-ilmenau.de> @ 2002-09-17 6:33 ` Oleg Drokin 2002-09-17 13:56 ` Manuel Krause 0 siblings, 1 reply; 19+ messages in thread From: Oleg Drokin @ 2002-09-17 6:33 UTC (permalink / raw) To: Manuel Krause; +Cc: Andrea Arcangeli, reiserfs-list Hello! On Tue, Sep 17, 2002 at 02:49:24AM +0200, Manuel Krause wrote: > >What tool do you use for backup? > Hey, Oleg, you should know this; I stayed with my simple but useful > scripts using "cp -ax A/. B/" for testing reiserfs performance on this > command related to my reiserfs partitions' content. That content is not I am just making sure. > >Also very huge difference is seen when creating holes ;) > >e.g compare time of seeking to 128Gb and writing one byte between > >2.4.20-pre7 and 2.4.20-pre6 ;) > >Code for that was once posted on our mailing list. > >When you will feel that 2.4.20-pre7 hanged, don't trust that feeling > >and wait till the end of operation. ;) > You're kind of joking?! "time <command>" should show the total lack of > time related performance, when [throughput] is divided by [time] or I must be missing something, but why do you want to divide throughput by time? throughput is amount of data divided by time. > not?! I calculated these values manually based upon the timings & df > values after copying. Yes, I want to show my personal engagement for the > reiserfs community, too. Hm. Are you sure you are hitting write-speed limit? May be you are read-speed bounded? Can you please check that? Thank you. Bye, Oleg ^ permalink raw reply [flat|nested] 19+ messages in thread
* Re: Copy time comparison 2.4.20-pre6 <-> 2.4.19+data-logging (was:Compatibility of current 2.4.19.pending ...) 2002-09-17 6:33 ` Oleg Drokin @ 2002-09-17 13:56 ` Manuel Krause 2002-09-17 14:00 ` Oleg Drokin 0 siblings, 1 reply; 19+ messages in thread From: Manuel Krause @ 2002-09-17 13:56 UTC (permalink / raw) To: Oleg Drokin; +Cc: reiserfs-list On 09/17/2002 08:33 AM, Oleg Drokin wrote: > Hello! > > On Tue, Sep 17, 2002 at 02:49:24AM +0200, Manuel Krause wrote: > > >>>What tool do you use for backup? >> >>Hey, Oleg, you should know this; I stayed with my simple but useful >>scripts using "cp -ax A/. B/" for testing reiserfs performance on this >>command related to my reiserfs partitions' content. That content is not > > > I am just making sure. > > >>>Also very huge difference is seen when creating holes ;) >>>e.g compare time of seeking to 128Gb and writing one byte between >>>2.4.20-pre7 and 2.4.20-pre6 ;) >>>Code for that was once posted on our mailing list. >>>When you will feel that 2.4.20-pre7 hanged, don't trust that feeling >>>and wait till the end of operation. ;) >> >>You're kind of joking?! "time <command>" should show the total lack of >>time related performance, when [throughput] is divided by [time] or > > > I must be missing something, but why do you want to divide throughput by time? > throughput is amount of data divided by time. Sorry, my wrong words. I meant the amount of data devided by time. > > >>not?! I calculated these values manually based upon the timings & df >>values after copying. Yes, I want to show my personal engagement for the >>reiserfs community, too. > > > Hm. Are you sure you are hitting write-speed limit? > May be you are read-speed bounded? Can you please check that? > How can I check this? E.g. with some dd process? I just don't know. Thanks, Manuel ^ permalink raw reply [flat|nested] 19+ messages in thread
* Re: Copy time comparison 2.4.20-pre6 <-> 2.4.19+data-logging (was:Compatibility of current 2.4.19.pending ...) 2002-09-17 13:56 ` Manuel Krause @ 2002-09-17 14:00 ` Oleg Drokin 2002-09-17 17:39 ` Manuel Krause 0 siblings, 1 reply; 19+ messages in thread From: Oleg Drokin @ 2002-09-17 14:00 UTC (permalink / raw) To: Manuel Krause; +Cc: reiserfs-list Hello! On Tue, Sep 17, 2002 at 03:56:13PM +0200, Manuel Krause wrote: > >>not?! I calculated these values manually based upon the timings & df > >>values after copying. Yes, I want to show my personal engagement for the > >>reiserfs community, too. > >Hm. Are you sure you are hitting write-speed limit? > >May be you are read-speed bounded? Can you please check that? > How can I check this? E.g. with some dd process? I just don't know. Copy same amount of data from RAM/nowhere to FS. E.g. make a file with file names and sizes and write a script that writes this amount of data from /dev/zero with these same names and needed sizes into FS. (or just use RAMFS as your source if you have not much data and huge RAM) Compare 2.4.20-pre[67] if you see any difference. Ah, also copy your data from original disk location to /dev/null and measure time of that operation to know how much of total time is occupied by reads. Also you can calculate read and write throughput separately this way. And if reads are slower than writes - ... Bye, Oleg ^ permalink raw reply [flat|nested] 19+ messages in thread
* Re: Copy time comparison 2.4.20-pre6 <-> 2.4.19+data-logging (was:Compatibility of current 2.4.19.pending ...) 2002-09-17 14:00 ` Oleg Drokin @ 2002-09-17 17:39 ` Manuel Krause 2002-09-18 5:20 ` Oleg Drokin 0 siblings, 1 reply; 19+ messages in thread From: Manuel Krause @ 2002-09-17 17:39 UTC (permalink / raw) To: Oleg Drokin; +Cc: reiserfs-list On 09/17/2002 04:00 PM, Oleg Drokin wrote: > Hello! > > On Tue, Sep 17, 2002 at 03:56:13PM +0200, Manuel Krause wrote: > >>>>not?! I calculated these values manually based upon the timings & df >>>>values after copying. Yes, I want to show my personal engagement for the >>>>reiserfs community, too. >>> >>>Hm. Are you sure you are hitting write-speed limit? >>>May be you are read-speed bounded? Can you please check that? >> >>How can I check this? E.g. with some dd process? I just don't know. > > > Copy same amount of data from RAM/nowhere to FS. > E.g. make a file with file names and sizes and write a script that > writes this amount of data from /dev/zero with these same names and needed sizes > into FS. (or just use RAMFS as your source if you have not much data and huge > RAM) To be honest, this already exceeds my linux knowledge... I was fiddling with some test directories containing 195.8MB I copied to and from /dev/shm with swap turned off. # time cp -a /dev/shm/. /mnt/beta/z.Backup.3/ kernel 2.4.20-pre7 | kernel 2.4.20-pre6 real 0m9.006s | real 0m6.740s user 0m0.190s | user 0m0.230s sys 0m5.250s | sys 0m4.780s # rm -r /dev/shm/* # time cp -a /mnt/beta/z.Backup.3/. /dev/shm/ kernel 2.4.20-pre7 | kernel 2.4.20-pre6 real 0m6.349s | real 0m6.180s user 0m0.210s | user 0m0.220s sys 0m2.450s | sys 0m2.510s and # time dd if=/dev/zero bs=1M count=1000 of=/mnt/beta/testfile.zero kernel 2.4.20-pre7 | kernel 2.4.20-pre6 real 1m11.390s | real 1m42.011s user 0m0.010s | user 0m0.000s sys 0m11.230s | sys 0m5.620s # time dd of=/dev/null bs=1M if=/mnt/beta/testfile.zero kernel 2.4.20-pre7 | kernel 2.4.20-pre6 real 1m16.738s | real 1m39.094s user 0m0.000s | user 0m0.000s sys 0m5.460s | sys 0m5.930s > Compare 2.4.20-pre[67] if you see any difference. > Ah, also copy your data from original disk location to /dev/null and measure > time of that operation to know how much of total time is occupied by reads. > > Also you can calculate read and write throughput separately this way. > And if reads are slower than writes - ... > I'm definitely not sure if my lines above are something you meant. Bye, Manuel ^ permalink raw reply [flat|nested] 19+ messages in thread
* Re: Copy time comparison 2.4.20-pre6 <-> 2.4.19+data-logging (was:Compatibility of current 2.4.19.pending ...) 2002-09-17 17:39 ` Manuel Krause @ 2002-09-18 5:20 ` Oleg Drokin 2002-09-19 1:14 ` Manuel Krause 0 siblings, 1 reply; 19+ messages in thread From: Oleg Drokin @ 2002-09-18 5:20 UTC (permalink / raw) To: Manuel Krause; +Cc: reiserfs-list Hello! On Tue, Sep 17, 2002 at 07:39:39PM +0200, Manuel Krause wrote: > >Copy same amount of data from RAM/nowhere to FS. > >E.g. make a file with file names and sizes and write a script that > >writes this amount of data from /dev/zero with these same names and needed > >sizes > >into FS. (or just use RAMFS as your source if you have not much data and > >huge > >RAM) > To be honest, this already exceeds my linux knowledge... I meant something to this extent: You run a script that runs over your filesystem and creates shell script that first creates whole dir structure of source dir and then for each file creates necessary command to recreate file of the same size: e.g for this directory contents: green@angband:~/z> ls -lR .: total 1 drwxr-xr-x 2 green green 114 Sep 18 09:08 t ./t: total 148 -rw-rw-r-- 1 green green 69570 Aug 10 16:34 inode.c -rw-rw-r-- 1 green green 66478 Aug 10 16:33 stree.c -rw-rw-r-- 1 green green 10256 Aug 10 16:32 tail_conversion.c Result of the work of the script would be: mkdir t mkdir t/z dd if=/dev/zero of=t/z/inode.c bs=69570 count=1 dd if=/dev/zero of=t/z/stree.c bs=66478 count=1 dd if=/dev/zero of=t/z/tail_conversion.c bs=10256 count=1 And you can run resulting script in target dir. > I was fiddling with some test directories containing 195.8MB I copied to > and from /dev/shm with swap turned off. > > # time cp -a /dev/shm/. /mnt/beta/z.Backup.3/ > kernel 2.4.20-pre7 | kernel 2.4.20-pre6 > real 0m9.006s | real 0m6.740s > user 0m0.190s | user 0m0.230s > sys 0m5.250s | sys 0m4.780s > # rm -r /dev/shm/* > # time cp -a /mnt/beta/z.Backup.3/. /dev/shm/ > kernel 2.4.20-pre7 | kernel 2.4.20-pre6 > real 0m6.349s | real 0m6.180s > user 0m0.210s | user 0m0.220s > sys 0m2.450s | sys 0m2.510s This dataset is way too small and entirely fits into your RAM I presume. So to avoid any distortion or results you'd better have all periodic stuff disabled. (though kupdated is still there) so it's better to run it several times. Also since it its into RAM, it must be flushed out, so I usually do this using such command: time sh -c "cp -a /testfs0/linux-2.4.18 /mnt/ ; umount /mnt" > # time dd if=/dev/zero bs=1M count=1000 of=/mnt/beta/testfile.zero > kernel 2.4.20-pre7 | kernel 2.4.20-pre6 > real 1m11.390s | real 1m42.011s > sys 0m11.230s | sys 0m5.620s Hm. While system time is less as expected, real time increased, that's strange. > # time dd of=/dev/null bs=1M if=/mnt/beta/testfile.zero > kernel 2.4.20-pre7 | kernel 2.4.20-pre6 > real 1m16.738s | real 1m39.094s > sys 0m5.460s | sys 0m5.930s And real time is bigger for reads too, so it seems data layout is different. That's really strange. If you can reproduce this behaviour, I am interested in getting debugreiserfs -d output for each case after you umount this volume (I assume that /mnt/beta/ filesystems contains nothing but this testfile.zero file). > >Compare 2.4.20-pre[67] if you see any difference. > >Ah, also copy your data from original disk location to /dev/null and > >measure > >time of that operation to know how much of total time is occupied by reads. > >Also you can calculate read and write throughput separately this way. > >And if reads are slower than writes - ... > I'm definitely not sure if my lines above are something you meant. Yes, kind of, though you have omitted timings of copying original data to /dev/shm/ that will give us read speed from original media. In fact instead of turning of swap you can do mount none /mnt/ramfs -t ramfs command (if you have ramfs compiled in of course) and /mnt/ramfs is now kind of ram filesystem with very low overhead. It also cannot be swapped out so if you fill all of your RAM, your box will OOM ;) Byt the test itself is very small. Probably you need to run something like time find /source/that/needs/to/be/backed/up -type f -exec cat {} >/dev/null \; to get read performance and implement a script like I mentioned in the beginning to measure writes. This way you do not need tons of RAM. Bye, Oleg ^ permalink raw reply [flat|nested] 19+ messages in thread
* Re: Copy time comparison 2.4.20-pre6 <-> 2.4.19+data-logging (was:Compatibility of current 2.4.19.pending ...) 2002-09-18 5:20 ` Oleg Drokin @ 2002-09-19 1:14 ` Manuel Krause 2002-09-19 6:34 ` Oleg Drokin 0 siblings, 1 reply; 19+ messages in thread From: Manuel Krause @ 2002-09-19 1:14 UTC (permalink / raw) To: Oleg Drokin; +Cc: reiserfs-list Hi! Even facing the problem to make this mail unreadable I want to answer all inline. Please, don't complain. I feel the urgent need to correct at least my last testing results as your last mail revealed heavy errors of my timing tests and kind-of opened my eyes. On 09/18/2002 07:20 AM, Oleg Drokin wrote: > Hello! > > On Tue, Sep 17, 2002 at 07:39:39PM +0200, Manuel Krause wrote: > >>>Copy same amount of data from RAM/nowhere to FS. >>>E.g. make a file with file names and sizes and write a script that >>>writes this amount of data from /dev/zero with these same names and needed >>>sizes into FS. (or just use RAMFS as your source if you have not much data >>>and huge RAM) >> >>To be honest, this already exceeds my linux knowledge... > > I meant something to this extent: > You run a script that runs over your filesystem and creates shell script > that first creates whole dir structure of source dir and then for each file > creates necessary command to recreate file of the same size: > e.g for this directory contents: > green@angband:~/z> ls -lR [...] > Result of the work of the script would be: > mkdir t > mkdir t/z > dd if=/dev/zero of=t/z/inode.c bs=69570 count=1 > dd if=/dev/zero of=t/z/stree.c bs=66478 count=1 > dd if=/dev/zero of=t/z/tail_conversion.c bs=10256 count=1 > > And you can run resulting script in target dir. Yes, I saw this work in a nightmare last night. Scheduled for some dark moonless cold snow flurry winter night, sorry. Except for someone experienced likes to provide me with a basic script for that... ;-)) >>I was fiddling with some test directories containing 195.8MB I copied to >>and from /dev/shm with swap turned off. >> >># time cp -a /dev/shm/. /mnt/beta/z.Backup.3/ >>kernel 2.4.20-pre7 | kernel 2.4.20-pre6 >>real 0m9.006s | real 0m6.740s >>user 0m0.190s | user 0m0.230s >>sys 0m5.250s | sys 0m4.780s >># rm -r /dev/shm/* >># time cp -a /mnt/beta/z.Backup.3/. /dev/shm/ >>kernel 2.4.20-pre7 | kernel 2.4.20-pre6 >>real 0m6.349s | real 0m6.180s >>user 0m0.210s | user 0m0.220s >>sys 0m2.450s | sys 0m2.510s > > This dataset is way too small and entirely fits into your RAM I presume. Yes, it fits. I know that problem with this RAM based test. Though I may increase the testing directory a bit closer to the OOM limit, having 512MB available. > So to avoid any distortion or results you'd better have all periodic stuff > disabled. (though kupdated is still there) so it's better to run it several > times. > Also since it its into RAM, it must be flushed out, so I usually do this > using such command: > time sh -c "cp -a /testfs0/linux-2.4.18 /mnt/ ; umount /mnt" Couldn't you have written these words to me some years earlier?! The effect is measurable and on almost any so far discussed fs interaction huge or at least relevant. So, after reviewing my partition-backup-scripts, forget _all_ results I posted to the list. They're all lacking the umount=flush component. Now, you caught me as the "fool of reiserfs-list"... Quite embarrassing. Mmmh. Painful. >># time dd if=/dev/zero bs=1M count=1000 of=/mnt/beta/testfile.zero >>kernel 2.4.20-pre7 | kernel 2.4.20-pre6 >>real 1m11.390s | real 1m42.011s >>sys 0m11.230s | sys 0m5.620s > > Hm. While system time is less as expected, real time increased, that's strange. > >># time dd of=/dev/null bs=1M if=/mnt/beta/testfile.zero >>kernel 2.4.20-pre7 | kernel 2.4.20-pre6 >>real 1m16.738s | real 1m39.094s >>sys 0m5.460s | sys 0m5.930s > > And real time is bigger for reads too, so it seems data layout is different. > > That's really strange. If you can reproduce this behaviour, I am interested > in getting debugreiserfs -d output for each case after you umount this volume > (I assume that2 /mnt/beta/ filesystems contains nothing but this testfile.zero > file). No. /mnt/beta/ is my software storage partition and contains this: /dev/hda11 5550248 4089088 1461160 74% /mnt/beta . I have no means to provide this complete "debugreiserfs -d /dev/hda11" output set, 42MB one of four, if I read your wording correctly (-pre6 without 1G file, -pre6 with 1GB file, -pre7 without 1GB file, -pre7 with 1GB file) on my web-space. As .tar.gz it's 4MB one of four, and even that set doesn't fit on my private t-online website. Maybe, it would work if sent sequentially by mail. Oh. O.k. you get a definite "No" on this, sorry. I just reviewed the debugreiserfs' output file content and I would not send or publish this in any way. It is simply too sensitive as it contains direct file and directory names. Is it possible to provide the needed info without clear directory or file names in future?! (These names replaced by sequentionally taken numbers?) //// O.k., too many words about unneeded things. I've remade the testing I posted and kept to umount the interacting related partition inbetween in order to force the needed flush and captured that time, too, now. My previously posted values have not been reproducible, if I review the new values correctly. But, you need to read and interprete it yourself! I took 3 timings each test and calculated the mean value. Comparison of "dd" actions: --------------------------- reading command: time sh -c "dd if=/mnt/beta/testfile.zero bs=1M count=1000 of=/dev/null ; umount /mnt/beta" writing command: time sh -c "dd if=/dev/zero bs=1M count=1000 of=/mnt/beta/testfile.zero ; umount /mnt/beta" mean-pre7 writing dd 1G | mean-pre6 writing dd 1G real 1m32.288s | real 1m30.531s user 0m0.013s | user 0m0.016s sys 0m11.207s | sys 0m5.036s mean-pre7 reading dd 1G | mean-pre6 reading dd 1G real 1m24.002s | real 1m22.039s user 0m0.010s | user 0m0.013s sys 0m6.470s | sys 0m6.083s related df values: /dev/hda11 5550248 4089088 1461160 74% /mnt/beta /dev/hda11 5550248 5114104 436144 93% /mnt/beta Yes, that's going over the "senfseful" filesystem content value. //// Comparison of "cp -a" actions: ------------------------------ reading command: time sh -c "cp -a /mnt/beta/z.Backup.3/. /mnt/ramfs/ ; umount /mnt/beta ; umount /mnt/ramfs" writing command: time sh -c "cp -a /mnt/ramfs/. /mnt/beta/z.Backup.3/ ; umount /mnt/beta ; umount /mnt/ramfs" mean-pre7 reading files | mean-pre6 reading files real 0m38.641s | real 0m39.110s user 0m0.200s | user 0m0.220s sys 0m3.400s | sys 0m3.200s mean-pre7 writing files | mean-pre6 writing files real 0m25.128s | real 0m27.689s user 0m0.200s | user 0m0.160s sys 0m4.860s | sys 0m5.217s directory content: 171.3MB //// >>>Compare 2.4.20-pre[67] if you see any difference. >>>Ah, also copy your data from original disk location to /dev/null and >>>measure >>>time of that operation to know how much of total time is occupied by reads. >>>Also you can calculate read and write throughput separately this way. >>>And if reads are slower than writes - ... >> >>I'm definitely not sure if my lines above are something you meant. > > Yes, kind of, though you have omitted timings of copying original data to > /dev/shm/ that will give us read speed from original media. I thought my posted set "# time cp -a /mnt/beta/z.Backup.3/. /dev/shm/" represented copying the original data from reiserfs partition /mnt/beta to /dev/shm ?! > In fact instead of turning of swap you can do > mount none /mnt/ramfs -t ramfs > command (if you have ramfs compiled in of course) and /mnt/ramfs is now > kind of ram filesystem with very low overhead. It also cannot be swapped out > so if you fill all of your RAM, your box will OOM ;) Even more, after finding that RAMFS was now internally set enabled in the related Config.in but searching it over and over before (Me: Am I really blind, now?!) I saw huge throughput diffs between shm and RAMFS. > Byt the test itself is very small. > Probably you need to run something like > time find /source/that/needs/to/be/backed/up -type f -exec cat {} >/dev/null \; > > to get read performance and implement a script like I mentioned in the beginning > to measure writes. > This way you do not need tons of RAM. I see. :-) Hope that mail brings some clarification and I haven't forgotten too much for now. Thanks, good night, Manuel ^ permalink raw reply [flat|nested] 19+ messages in thread
* Re: Copy time comparison 2.4.20-pre6 <-> 2.4.19+data-logging (was:Compatibility of current 2.4.19.pending ...) 2002-09-19 1:14 ` Manuel Krause @ 2002-09-19 6:34 ` Oleg Drokin 2002-09-19 15:52 ` Manuel Krause 0 siblings, 1 reply; 19+ messages in thread From: Oleg Drokin @ 2002-09-19 6:34 UTC (permalink / raw) To: Manuel Krause; +Cc: reiserfs-list Hello! On Thu, Sep 19, 2002 at 03:14:56AM +0200, Manuel Krause wrote: > >And you can run resulting script in target dir. > Yes, I saw this work in a nightmare last night. Scheduled for some dark > moonless cold snow flurry winter night, sorry. Except for someone > experienced likes to provide me with a basic script for that... ;-)) > >This dataset is way too small and entirely fits into your RAM I presume. > Yes, it fits. I know that problem with this RAM based test. Though I may > increase the testing directory a bit closer to the OOM limit, having > 512MB available. No, this is not enough of course since some data will remain unflushed and amount of such data is relatively big compared to total amount of data. > >So to avoid any distortion or results you'd better have all periodic stuff > >disabled. (though kupdated is still there) so it's better to run it several > >times. > >Also since it its into RAM, it must be flushed out, so I usually do this > >using such command: > >time sh -c "cp -a /testfs0/linux-2.4.18 /mnt/ ; umount /mnt" > Couldn't you have written these words to me some years earlier?! The > effect is measurable and on almost any so far discussed fs interaction > huge or at least relevant. So, after reviewing my > partition-backup-scripts, forget _all_ results I posted to the list. > They're all lacking the umount=flush component. It is only needed if data to be cached is big enough to be noticed when compared to total amount of data to be copied. > No. /mnt/beta/ is my software storage partition and contains this: > /dev/hda11 5550248 4089088 1461160 74% /mnt/beta . Ah! > Oh. O.k. you get a definite "No" on this, sorry. I just reviewed the > debugreiserfs' output file content and I would not send or publish this > in any way. It is simply too sensitive as it contains direct file and > directory names. No, then I do not need that debugreiserfs dump anyway. But here is another warning: I presume before each copy test is done, /mnt/beta/z.Backup.3 is removed completely and /mnt/beta is unmounted and mounted back, also between sevetral writing attempts (And during these attempts of course) no other processes can write to to this FS. If those two above clauses are not true, then results are also meaningless, as lots of unnecessary tree reads are issued for overwrite and new blocks are not allocated, but existing ones are reused. If somebody can write to FS, then with every next test blocks chosen for files are different (old ones may be already occupied). > Is it possible to provide the needed info without clear directory or > file names in future?! (These names replaced by sequentionally taken > numbers?) In such a case you can determine object id of big file (shown to userspace as inode number) and only provide it's SD and indirect items: | 9|4 357 0x0 SD (0), len 44, location 1572 entry count 65535, ... | 10|4 357 0x1 IND (1), len 504, location 1068 entr.... 126 pointers [ 9948(126)] This is a example of file with objectid 357, that have 126 blocks in size. 9948-10074 blocks (all continuous) are used. If file is very big, there would be several IND (indirect) items in other nodes, number in brackets will changes to show offset that this INDIRECT item starts with. > Comparison of "dd" actions: > --------------------------- > reading command: time sh -c "dd if=/mnt/beta/testfile.zero bs=1M > count=1000 of=/dev/null ; umount /mnt/beta" > writing command: time sh -c "dd if=/dev/zero bs=1M count=1000 > of=/mnt/beta/testfile.zero ; umount /mnt/beta" I presume you earased /mnt/beta/testfile.zero between tests and executed sync. Ah, until I forgot - in reiserfs if you erased something blocks that were freed are only get back to you on next journal flush or after sync. So if you do something like this: rm -f /mnt/beta/testfile.zero ; time sh -c "dd ...", then second file will get different blocknumbers. > related df values: > /dev/hda11 5550248 4089088 1461160 74% /mnt/beta > /dev/hda11 5550248 5114104 436144 93% /mnt/beta > Yes, that's going over the "senfseful" filesystem content value. Hm. This is before and after dd command or what? > Comparison of "cp -a" actions: > ------------------------------ > reading command: time sh -c "cp -a /mnt/beta/z.Backup.3/. /mnt/ramfs/ ; > umount /mnt/beta ; umount /mnt/ramfs" > writing command: time sh -c "cp -a /mnt/ramfs/. /mnt/beta/z.Backup.3/ ; > umount /mnt/beta ; umount /mnt/ramfs" You mean you executed your commands in this same order? I.e. first reading files from partition and then writing same files back in place of already existing ones? Then above thing about overwriting files applies directly in here. I thought you were reading files from one filesystem and writing these files to another one. > >Yes, kind of, though you have omitted timings of copying original data to > >/dev/shm/ that will give us read speed from original media. > I thought my posted set "# time cp -a /mnt/beta/z.Backup.3/. /dev/shm/" > represented copying the original data from reiserfs partition /mnt/beta > to /dev/shm ?! I thought that original data is residing on another filesystem on another disk. If originally you vere copying data from one disk to the same disk, just another partition, then thisis just lots of seeks. Bye, Oleg ^ permalink raw reply [flat|nested] 19+ messages in thread
* Re: Copy time comparison 2.4.20-pre6 <-> 2.4.19+data-logging (was:Compatibility of current 2.4.19.pending ...) 2002-09-19 6:34 ` Oleg Drokin @ 2002-09-19 15:52 ` Manuel Krause 2002-09-19 16:01 ` Oleg Drokin 0 siblings, 1 reply; 19+ messages in thread From: Manuel Krause @ 2002-09-19 15:52 UTC (permalink / raw) To: Oleg Drokin; +Cc: reiserfs-list On 09/19/2002 08:34 AM, Oleg Drokin wrote: > Hello! > > On Thu, Sep 19, 2002 at 03:14:56AM +0200, Manuel Krause wrote: > > >>>And you can run resulting script in target dir. >> >>Yes, I saw this work in a nightmare last night. Scheduled for some dark >>moonless cold snow flurry winter night, sorry. Except for someone >>experienced likes to provide me with a basic script for that... ;-)) > >>>This dataset is way too small and entirely fits into your RAM I presume. >> >>Yes, it fits. I know that problem with this RAM based test. Though I may >>increase the testing directory a bit closer to the OOM limit, having >>512MB available. > > No, this is not enough of course since some data will remain unflushed and > amount of such data is relatively big compared to total amount of data. And if the participating filesystems are umounted after writing the data? >>>So to avoid any distortion or results you'd better have all periodic stuff >>>disabled. (though kupdated is still there) so it's better to run it several >>>times. >>>Also since it its into RAM, it must be flushed out, so I usually do this >>>using such command: >>>time sh -c "cp -a /testfs0/linux-2.4.18 /mnt/ ; umount /mnt" >> >>Couldn't you have written these words to me some years earlier?! The >>effect is measurable and on almost any so far discussed fs interaction >>huge or at least relevant. So, after reviewing my >>partition-backup-scripts, forget _all_ results I posted to the list. >>They're all lacking the umount=flush component. > > It is only needed if data to be cached is big enough to be noticed when compared > to total amount of data to be copied. > >>No. /mnt/beta/ is my software storage partition and contains this: >> /dev/hda11 5550248 4089088 1461160 74% /mnt/beta . > > Ah! > >>Oh. O.k. you get a definite "No" on this, sorry. I just reviewed the >>debugreiserfs' output file content and I would not send or publish this >>in any way. It is simply too sensitive as it contains direct file and >>directory names. > > No, then I do not need that debugreiserfs dump anyway. > > But here is another warning: > I presume before each copy test is done, /mnt/beta/z.Backup.3 is removed > completely and /mnt/beta is unmounted and mounted back, also > between sevetral writing attempts (And during these attempts of course) > no other processes can write to to this FS. These clauses are both true and apply to the dd tests, too. Mmh, except for the case when /mnt/beta/z.Backup.3 or testfile.zero were the source to be copied/dd'ed... There haven't been overwrites or other writes to the disk during the testsets. > If those two above clauses are not true, then results are also meaningless, > as lots of unnecessary tree reads are issued for overwrite and new blocks are > not allocated, but existing ones are reused. > If somebody can write to FS, then with every next test blocks chosen for files > are different (old ones may be already occupied). > >>Is it possible to provide the needed info without clear directory or >>file names in future?! (These names replaced by sequentionally taken >>numbers?) > > In such a case you can determine object id of big file (shown to userspace > as inode number) and only provide it's SD and indirect items: > | 9|4 357 0x0 SD (0), len 44, location 1572 entry count 65535, ... > | 10|4 357 0x1 IND (1), len 504, location 1068 entr.... > 126 pointers > [ 9948(126)] > > This is a example of file with objectid 357, that have 126 blocks in size. > 9948-10074 blocks (all continuous) are used. > > If file is very big, there would be several IND (indirect) items in other nodes, > number in brackets will changes to show offset that this INDIRECT item starts > with. > >>Comparison of "dd" actions: >>--------------------------- >>reading command: time sh -c "dd if=/mnt/beta/testfile.zero bs=1M >> count=1000 of=/dev/null ; umount /mnt/beta" >>writing command: time sh -c "dd if=/dev/zero bs=1M count=1000 >> of=/mnt/beta/testfile.zero ; umount /mnt/beta" > > I presume you earased /mnt/beta/testfile.zero between tests and executed > sync. I umounted the partition and mounted it back. I thought that would be the right action to avoid what you describe in the following: > Ah, until I forgot - in reiserfs if you erased something blocks that were > freed are only get back to you on next journal flush or after sync. > So if you do something like this: > rm -f /mnt/beta/testfile.zero ; time sh -c "dd ...", > then second file will get different blocknumbers. > >>related df values: >>/dev/hda11 5550248 4089088 1461160 74% /mnt/beta >>/dev/hda11 5550248 5114104 436144 93% /mnt/beta >>Yes, that's going over the "senfseful" filesystem content value. > > Hm. This is before and after dd command or what? Yes, without and with the 1G file. >>Comparison of "cp -a" actions: >>------------------------------ >>reading command: time sh -c "cp -a /mnt/beta/z.Backup.3/. /mnt/ramfs/ ; >> umount /mnt/beta ; umount /mnt/ramfs" >>writing command: time sh -c "cp -a /mnt/ramfs/. /mnt/beta/z.Backup.3/ ; >> umount /mnt/beta ; umount /mnt/ramfs" > > You mean you executed your commands in this same order? The order was: mount /mnt/beta, mount ramfs, executing the reading command with the umounts -- and this order 2 more times -- for the read values. Then : mount ramfs, mount /mnt/beta, move data to ramfs, umount and mount back /mnt/beta, executing the writing command with the umounts -- and this 2 more times -- for the write values. So I didn't execute the above commands in direct order. > I.e. first reading files from partition and then writing same files back > in place of already existing ones? Then above thing about overwriting files > applies directly in here. > I thought you were reading files from one filesystem and writing these files > to another one. >>>Yes, kind of, though you have omitted timings of copying original data to >>>/dev/shm/ that will give us read speed from original media. >> >>I thought my posted set "# time cp -a /mnt/beta/z.Backup.3/. /dev/shm/" >>represented copying the original data from reiserfs partition /mnt/beta >>to /dev/shm ?! > > I thought that original data is residing on another filesystem on another disk. > If originally you vere copying data from one disk to the same disk, just > another partition, then thisis just lots of seeks. Thanks for these reminders. Originally I wanted to copy data from one disk to another and on the way capture the timings to compare the throughput... Then you pointed me to re-check pure reads and pure writes mainly to make sure the writes were not read-speed bound and to compare this behaviour on -pre6 and -pre7. Bye, Manuel ^ permalink raw reply [flat|nested] 19+ messages in thread
* Re: Copy time comparison 2.4.20-pre6 <-> 2.4.19+data-logging (was:Compatibility of current 2.4.19.pending ...) 2002-09-19 15:52 ` Manuel Krause @ 2002-09-19 16:01 ` Oleg Drokin 2002-09-23 0:36 ` Manuel Krause 0 siblings, 1 reply; 19+ messages in thread From: Oleg Drokin @ 2002-09-19 16:01 UTC (permalink / raw) To: Manuel Krause; +Cc: reiserfs-list Hello! On Thu, Sep 19, 2002 at 05:52:15PM +0200, Manuel Krause wrote: > >>Yes, it fits. I know that problem with this RAM based test. Though I may > >>increase the testing directory a bit closer to the OOM limit, having > >>512MB available. > >No, this is not enough of course since some data will remain unflushed and > >amount of such data is relatively big compared to total amount of data. > And if the participating filesystems are umounted after writing the data? Then it is ok of course. All fs' buffers are flushed on umount. > >But here is another warning: > >I presume before each copy test is done, /mnt/beta/z.Backup.3 is removed > >completely and /mnt/beta is unmounted and mounted back, also > >between sevetral writing attempts (And during these attempts of course) > >no other processes can write to to this FS. > These clauses are both true and apply to the dd tests, too. Mmh, except > for the case when /mnt/beta/z.Backup.3 or testfile.zero were the source > to be copied/dd'ed... There haven't been overwrites or other writes to > the disk during the testsets. Ok, good. > >I presume you earased /mnt/beta/testfile.zero between tests and executed > >sync. > I umounted the partition and mounted it back. I thought that would be > the right action to avoid what you describe in the following: Yes. > >I thought that original data is residing on another filesystem on another > >disk. > >If originally you vere copying data from one disk to the same disk, just > >another partition, then thisis just lots of seeks. > Thanks for these reminders. Originally I wanted to copy data from one > disk to another and on the way capture the timings to compare the > throughput... Then you pointed me to re-check pure reads and pure writes > mainly to make sure the writes were not read-speed bound and to compare > this behaviour on -pre6 and -pre7. Originally I was mostly interested in reads from source fs, not the one where you have copied the data (though that one is also might be useful). Ok, thank you for lots of testing. Bye, Oleg ^ permalink raw reply [flat|nested] 19+ messages in thread
* Re: Copy time comparison 2.4.20-pre6 <-> 2.4.19+data-logging (was:Compatibility of current 2.4.19.pending ...) 2002-09-19 16:01 ` Oleg Drokin @ 2002-09-23 0:36 ` Manuel Krause 0 siblings, 0 replies; 19+ messages in thread From: Manuel Krause @ 2002-09-23 0:36 UTC (permalink / raw) To: Oleg Drokin; +Cc: reiserfs-list On 09/19/2002 06:01 PM, Oleg Drokin wrote: > Hello! > > On Thu, Sep 19, 2002 at 05:52:15PM +0200, Manuel Krause wrote: > [...] >> >>Thanks for these reminders. Originally I wanted to copy data from one >>disk to another and on the way capture the timings to compare the >>throughput... Then you pointed me to re-check pure reads and pure writes >>mainly to make sure the writes were not read-speed bound and to compare >>this behaviour on -pre6 and -pre7. > > Originally I was mostly interested in reads from source fs, not the one > where you have copied the data (though that one is also might be useful). > > Ok, thank you for lots of testing. > > Bye, > Oleg > Hi Oleg & others! O.k. somehow I managed to "find" the "script to make the scripts" to measure writes from nowhere to the target disk. I decided to make up 2 scripts - one for the dirs and one for the files, originally for the comparison of "pure" file wites to the reads (read below) including the umounts. They're containing just the lists of commands Oleg posted recently (e.g. mkdir "/mnt/beta/dir1" /// dd if=/dev/zero count=1 bs=8466760 of="/mnt/beta/dir1/.../filename1" ). And I re-used Olegs posted command to measure reads from the source disk for the most recent reiserfs kernel versions. The results are shown below. I'm not very glad with comparing the "pure" reads against writes especially when watching the copy timings as different commands are included and their (relative) overhead is quite unknown, am I right? (So e.g. I didn't want to take these values' relation to tweak the disks read & write latency settings for now.) CPU usage during the dd... and the find...cat commands is very high. Also the disk access when watched in ksysguardd is different for all these kinds of tests. So, read and compare it yourself and I would be glad to get comments on how I needed to refine it. Good night, Manuel ------------------------------------------------------------------ /dev/hda11 5550248 3927572 1622676 71% /mnt/beta /dev/hdd11 5550248 3927572 1622676 71% /mnt/gamma containing 58015 files, 3481 files kernel 2.4.19- 2.4.20-pre6 2.4.20-pre7 data-logging # time sh -c "cp -ax /mnt/gamma/. /mnt/beta/ ; umount /mnt/gamma ; umount /mnt/beta " (representing copies from source to target disk) real 7m46.970s 10m57.716s 10m04.328s user 0m01.710s 0m01.390s 0m01.540s sys 1m25.440s 1m11.100s 1m18.840s # time /tmp/script.dirs.sh (representing directory writes to target disk) real 0m09.972s 0m10.055s 0m10.316s user 0m04.960s 0m04.770s 0m04.880s sys 0m04.370s 0m04.590s 0m04.550s # time sh -c "/tmp/script.files.sh ; umount /dev/hda11 ; umount /dev/hdd11 " (representing file writes to target disk) real 7m35.992s 8m24.499s 8m20.830s user 1m31.100s 1m30.120s 1m32.280s sys 2m21.480s 1m42.230s 2m02.660s # cd /mnt/ ; time sh -c "find ./gamma/* -type f -exec cat {} >/dev/null \; ; umount /dev/hda11 ; umount /dev/hdd11 " (representing reads from source disk) real 8m34.665s 9m54.103s 9m50.796s user 1m11.650s 1m10.760s 1m11.100s sys 1m15.000s 1m29.960s 1m17.370s I took care to freshly mount the participating partitions each test, recreate and mount+umount the counterpart reiserfs partition before when appropriate, had no overwrites and no other accesses to the source partition during this test. Going back from 2.4.20-pre7 to 2.4.19-data-logging and beeing in doubt of the effects of the new block allocator I recreated the source filesystem completely (copy to new target and copy back) to measure the above data-logging values. The first copy from the former 2.4.20-pre7 reiserfs to 2.4.19-data-logging had this timings: real 7m38.715s user 0m2.030s sys 1m29.560s Command to take the "/tmp/script.[dirs,files].sh" : sh -c 'cd /mnt/gamma ; find * -type d -fprintf /tmp/script.dirs.sh "mkdir \"/mnt/beta/%p\"\n" ; find * -type f -fprintf /tmp/script.files.sh "dd if=/dev/zero count=1 bs=%s of=\"/mnt/beta/%p\"\n" ; chmod u+x /tmp/script.*.sh' ------------------------------------------------------------------ ^ permalink raw reply [flat|nested] 19+ messages in thread
end of thread, other threads:[~2002-09-23 0:36 UTC | newest]
Thread overview: 19+ messages (download: mbox.gz follow: Atom feed
-- links below jump to the message on this page --
2002-09-12 0:24 Compatibility of current 2.4.19.pending patchset with old data-logging?? Manuel Krause
2002-09-12 13:28 ` newbie how to darren
2002-09-12 14:37 ` Hans Reiser
2002-09-12 15:12 ` Guillermo Torreiro
2002-09-12 21:30 ` Copy time comparison 2.4.20-pre6 <-> 2.4.19+data-logging (was:Compatibility of current 2.4.19.pending ...) Manuel Krause
2002-09-13 7:37 ` Oleg Drokin
2002-09-13 23:07 ` Manuel Krause
2002-09-14 9:02 ` Oleg Drokin
2002-09-17 0:53 ` Manuel Krause
[not found] ` <3D867C14.5060404@mb.tu-ilmenau.de>
2002-09-17 6:33 ` Oleg Drokin
2002-09-17 13:56 ` Manuel Krause
2002-09-17 14:00 ` Oleg Drokin
2002-09-17 17:39 ` Manuel Krause
2002-09-18 5:20 ` Oleg Drokin
2002-09-19 1:14 ` Manuel Krause
2002-09-19 6:34 ` Oleg Drokin
2002-09-19 15:52 ` Manuel Krause
2002-09-19 16:01 ` Oleg Drokin
2002-09-23 0:36 ` Manuel Krause
This is an external index of several public inboxes, see mirroring instructions on how to clone and mirror all data and code used by this external index.