Compatibility of current 2.4.19.pending patchset with old data-logging??

All of lore.kernel.org
 help / color / mirror / Atom feed

* Compatibility of  current 2.4.19.pending patchset with old data-logging??
@ 2002-09-12  0:24 Manuel Krause
  2002-09-12 13:28 ` newbie how to darren
  2002-09-12 21:30 ` Copy time comparison 2.4.20-pre6 <-> 2.4.19+data-logging (was:Compatibility of current 2.4.19.pending ...) Manuel Krause
  0 siblings, 2 replies; 19+ messages in thread
From: Manuel Krause @ 2002-09-12  0:24 UTC (permalink / raw)
  To: reiserfs-list

Hi!

Originally I wanted to prepare a new kernel based on 2.4.20-pre6. That 
didn't work due to a gcc compiler failure I have to investigate on here 
right now. Very dubious, the script even told me to send a detailed bug 
report - definitely too much for tonight. A 2.4.19 compiles well with my 
old patchset.

Previously I followed the 2.4.19.pending patch directory 
@ftp.namesys.com (by downloading and particularily reading, but not 
applying since Chris stopped his data-logging series -- That was when 
patch "05" changed its name from "05-hash_on_empty_fs-1.diff" to 
"05-reiserfs_make_bad_inode.diff", on 02-08-13, and maybe I even used 
some older patches from the latest rc to keep the data-logging working). 
BTW, it's an irritating thing to change the patches' names. That adds 
just more confusion to namesys' release management that could be 
considered "non-official" to our list or is lacking more public relation 
output.

Now I'm completely confused with my private file-naming-convention, the 
newest reiserfs 2.4.19.pending directory and the following question:

  Is the current 2.4.19.pending compatible with data-logging any more???

When I see the discussion about the new block-allocator, the testing 
directory etc. I assume a definite "No." (Please, please, correct me, if 
I'm wrong...)

So, for the overnight change to pre-kernel 7, where are the new 
data-logging patches Hans even announced to lkml?

Thanks for your hints ;-)

Manuel

^ permalink raw reply	[flat|nested] 19+ messages in thread

* newbie how to
  2002-09-12  0:24 Compatibility of current 2.4.19.pending patchset with old data-logging?? Manuel Krause
@ 2002-09-12 13:28 ` darren
  2002-09-12 14:37   ` Hans Reiser
  2002-09-12 21:30 ` Copy time comparison 2.4.20-pre6 <-> 2.4.19+data-logging (was:Compatibility of current 2.4.19.pending ...) Manuel Krause
  1 sibling, 1 reply; 19+ messages in thread
From: darren @ 2002-09-12 13:28 UTC (permalink / raw)
  To: 'reiserfs-list'

Hi all,

I have a brand new RedHat computer just setup. It currently have two
partitions...both EXT3.

How do I make one of my drives reiserfs?



^ permalink raw reply	[flat|nested] 19+ messages in thread

* Re: newbie how to
  2002-09-12 13:28 ` newbie how to darren
@ 2002-09-12 14:37   ` Hans Reiser
  2002-09-12 15:12     ` Guillermo Torreiro
  0 siblings, 1 reply; 19+ messages in thread
From: Hans Reiser @ 2002-09-12 14:37 UTC (permalink / raw)
  To: darren; +Cc: 'reiserfs-list'

darren wrote:

>Hi all,
>
>I have a brand new RedHat computer just setup. It currently have two
>partitions...both EXT3.
>
>How do I make one of my drives reiserfs?
>
>
>
>
>  
>
use tar.  tar does a good job, and nfs to a file server (or some 
borrowed hard drive) plus tar will work well.

Remember to run lilo after untarring.



^ permalink raw reply	[flat|nested] 19+ messages in thread

* Re: newbie how to
  2002-09-12 14:37   ` Hans Reiser
@ 2002-09-12 15:12     ` Guillermo Torreiro
  0 siblings, 0 replies; 19+ messages in thread
From: Guillermo Torreiro @ 2002-09-12 15:12 UTC (permalink / raw)
  To: darren; +Cc: reiserfs-list

> darren wrote:
> 
> >Hi all,
> >
> >I have a brand new RedHat computer just setup. It
> currently have two
> >partitions...both EXT3.
> >
> >How do I make one of my drives reiserfs?
> >
> >
> >
> >
> >  
> >
--- Hans Reiser <reiser@namesys.com> wrote:
> use tar.  tar does a good job, and nfs to a file
> server (or some 
> borrowed hard drive) plus tar will work well.
> 
> Remember to run lilo after untarring.
> 
> 

a bit more extensive approach:

1) Like Hans said, use tar to backup your partition,
and put it in some place over a NFS drive or another
local drive.
2) umount your partition to be converted to ReiserFS.
3) format this partition using ReiserFS tools.
4) edit /etc/fstab and change the line having your
partition to the following:
/dev/hdxy /xx/yy  reiserfs  defaults    1   2
where /dev/hdxy is your partition (I guess it's ide)
and /xx/yy is the mount point.
5) mount it again.
6) untar your backup into your new ReiserFS partition

__________________________________________________
Do you Yahoo!?
Yahoo! News - Today's headlines
http://news.yahoo.com

^ permalink raw reply	[flat|nested] 19+ messages in thread

* Copy time comparison 2.4.20-pre6 <-> 2.4.19+data-logging (was:Compatibility of  current 2.4.19.pending ...)
  2002-09-12  0:24 Compatibility of current 2.4.19.pending patchset with old data-logging?? Manuel Krause
  2002-09-12 13:28 ` newbie how to darren
@ 2002-09-12 21:30 ` Manuel Krause
  2002-09-13  7:37   ` Oleg Drokin
  1 sibling, 1 reply; 19+ messages in thread
From: Manuel Krause @ 2002-09-12 21:30 UTC (permalink / raw)
  To: reiserfs-list

On 09/12/2002 02:24 AM, Manuel Krause wrote:
> Hi!
> 
> Originally I wanted to prepare a new kernel based on 2.4.20-pre6. That 
> didn't work due to a gcc compiler failure I have to investigate on here 
[...]
Solved by upgrading from gcc-2.95.2 to the recommended version 2.95.3
[...]
> Now I'm completely confused with my private file-naming-convention, the 
> newest reiserfs 2.4.19.pending directory and the following question:
> 
>  Is the current 2.4.19.pending compatible with data-logging any more???
> 
> When I see the discussion about the new block-allocator, the testing 
> directory etc. I assume a definite "No." (Please, please, correct me, if 
> I'm wrong...)

;-) You really didn't correct me so far. Mmh. :-(

> 
> So, for the overnight change to pre-kernel 7, where are the new 
> data-logging patches Hans even announced to lkml?

So I really got a 2.4.20-pre6 compiled with the 
preempt-kernel-rml-2.4.20-pre5-1.patch and Olegs fix he sent to the list 
yesterday. BTW, have the testing patches from -pre5 gone into -pre6? It 
isn't clear to me.

You see the timing values of my backup session (cp -a A/. B/) from this 
afternoon as follows:


partition 1: /dev/hdd7       2722896   2043700    679196  76% /mnt/beta
  (is my / filesystem)

used FS:  noatime,notail
  real  11m2.265s    user  0m1.480s    sys  0m47.550s  ~3,014MB/s    32%
fresh FS: noatime,notail
  real   5m19.686s   user  0m1.210s    sys  0m46.980s  ~6,243MB/s    66%
fresh FS: noatime,notail,data=ordered -- kernel
2.4.19+rml-preempt+reiserfs+data-logging-patches
  (real  3m19.895s   user  0m1.400s   sys  0m49.890s)  ~9,409MB/s  =100%


partition 2: /dev/hdd11      5550248   3801504   1748744  69% /mnt/beta
  (my software "repository", accessed not really often)

used FS:  noatime,notail
  real 12m30.172s   user  0m1.360s   sys  1m10.350s    ~4,949MB/s    61%
fresh FS: noatime,notail
  real 10m11.303s   user  0m1.240s   sys  1m10.400s    ~6,073MB/s    74%
fresh FS: noatime,notail,data=ordered -- kernel
2.4.19+rml-preempt+reiserfs+data-logging-patches
  (real  8m56.040s  user  0m1.410s  sys  1m27.920s)    ~8,180MB/s  =100%


(...) = timings with slightly different disk content one month ago


Application startup timings of NS-7.0 and OOo-1.0.1 didn't increase. :-))

I hope I don't need to comment on the values...


Thanks,

Manuel


^ permalink raw reply	[flat|nested] 19+ messages in thread

* Re: Copy time comparison 2.4.20-pre6 <-> 2.4.19+data-logging (was:Compatibility of  current 2.4.19.pending ...)
  2002-09-12 21:30 ` Copy time comparison 2.4.20-pre6 <-> 2.4.19+data-logging (was:Compatibility of current 2.4.19.pending ...) Manuel Krause
@ 2002-09-13  7:37   ` Oleg Drokin
  2002-09-13 23:07     ` Manuel Krause
  0 siblings, 1 reply; 19+ messages in thread
From: Oleg Drokin @ 2002-09-13  7:37 UTC (permalink / raw)
  To: Manuel Krause; +Cc: reiserfs-list

Hello!

On Thu, Sep 12, 2002 at 11:30:22PM +0200, Manuel Krause wrote:

> >When I see the discussion about the new block-allocator, the testing 
> >directory etc. I assume a definite "No." (Please, please, correct me, if 
> ;-) You really didn't correct me so far. Mmh. :-(

Datalogging is incompatible with stuff in testing directory. It won't apply
cleanly to new block allocatior, but that's can be easily fixed.

> >So, for the overnight change to pre-kernel 7, where are the new 
> >data-logging patches Hans even announced to lkml?
> So I really got a 2.4.20-pre6 compiled with the 
> preempt-kernel-rml-2.4.20-pre5-1.patch and Olegs fix he sent to the list 
> yesterday. BTW, have the testing patches from -pre5 gone into -pre6? It 
> isn't clear to me.

Testing patches were merged into -pre6 by Marcelo's mistake and -pre7
have these patches removed already.

> Application startup timings of NS-7.0 and OOo-1.0.1 didn't increase. :-))

Sure, read code path did not changed at all.

> I hope I don't need to comment on the values...

Sure. datalogging and new file_write code are not solutions to same problem.
These are solutions for different problems and can be used together (after
merging of course) for even better performance, I hope ;)

Bye,
    Oleg

^ permalink raw reply	[flat|nested] 19+ messages in thread

* Re: Copy time comparison 2.4.20-pre6 <-> 2.4.19+data-logging (was:Compatibility of  current 2.4.19.pending ...)
  2002-09-13  7:37   ` Oleg Drokin
@ 2002-09-13 23:07     ` Manuel Krause
  2002-09-14  9:02       ` Oleg Drokin
  0 siblings, 1 reply; 19+ messages in thread
From: Manuel Krause @ 2002-09-13 23:07 UTC (permalink / raw)
  To: Oleg Drokin; +Cc: reiserfs-list

On 09/13/2002 09:37 AM, Oleg Drokin wrote:
> Hello!
> 
> On Thu, Sep 12, 2002 at 11:30:22PM +0200, Manuel Krause wrote:
> 
>>>When I see the discussion about the new block-allocator, the testing 
>>>directory etc. I assume a definite "No." (Please, please, correct me, if 
>>
>>;-) You really didn't correct me so far. Mmh. :-(
> 
> Datalogging is incompatible with stuff in testing directory. It won't apply
> cleanly to new block allocatior, but that's can be easily fixed.
> 
>>>So, for the overnight change to pre-kernel 7, where are the new 
>>>data-logging patches Hans even announced to lkml?

??? Hello, Chris!

>>So I really got a 2.4.20-pre6 compiled with the 
>>preempt-kernel-rml-2.4.20-pre5-1.patch and Olegs fix he sent to the list 
>>yesterday. BTW, have the testing patches from -pre5 gone into -pre6? It 
>>isn't clear to me.
> 
> Testing patches were merged into -pre6 by Marcelo's mistake and -pre7
> have these patches removed already.

Yes. I pre-read this @ lkml.

>>Application startup timings of NS-7.0 and OOo-1.0.1 didn't increase. :-))
> 
> 
> Sure, read code path did not changed at all.

Seems so, even AntiVir takes the same time as with data-logging 
ordered-mode... :-)

>>I hope I don't need to comment on the values...
> 
> Sure. datalogging and new file_write code are not solutions to same problem.
> These are solutions for different problems and can be used together (after
> merging of course) for even better performance, I hope ;)


Thanks a lot for your informative reply, Oleg!

Did I follow the thread "Re: [reiserfs-list] [BK] ReiserFS file write 
bug fix for 2.4" correctly that going to 2.4.20-pre7 that is without 
file_write (+) code should speed up (in my case copy actions) in 
comparison to 2.4.20-pre6?

I retested my backup session (see last mail) with 2.4.20-pre7 but have 
_NO_ ( <0,2% !!!) difference at all.

Have I missed something?

Thanks,

Manuel


^ permalink raw reply	[flat|nested] 19+ messages in thread

* Re: Copy time comparison 2.4.20-pre6 <-> 2.4.19+data-logging (was:Compatibility of  current 2.4.19.pending ...)
  2002-09-13 23:07     ` Manuel Krause
@ 2002-09-14  9:02       ` Oleg Drokin
  2002-09-17  0:53         ` Manuel Krause
       [not found]         ` <3D867C14.5060404@mb.tu-ilmenau.de>
  0 siblings, 2 replies; 19+ messages in thread
From: Oleg Drokin @ 2002-09-14  9:02 UTC (permalink / raw)
  To: Manuel Krause; +Cc: reiserfs-list

Hello!

On Sat, Sep 14, 2002 at 01:07:37AM +0200, Manuel Krause wrote:
> >>>So, for the overnight change to pre-kernel 7, where are the new 
> >>>data-logging patches Hans even announced to lkml?
> ??? Hello, Chris!

Chris said he'll either announce something on tuesday or pass the code to me to
finish it.

> Did I follow the thread "Re: [reiserfs-list] [BK] ReiserFS file write 
> bug fix for 2.4" correctly that going to 2.4.20-pre7 that is without 
> file_write (+) code should speed up (in my case copy actions) in 
> comparison to 2.4.20-pre6?

No. file_write code should speed up writes that are done in big chunks (more
than 4k).

> I retested my backup session (see last mail) with 2.4.20-pre7 but have 
> _NO_ ( <0,2% !!!) difference at all.

What tool do you use for backup?
I certainly see the difference when doing
 dd if=/dev/zero of=somefile bs=1024k count=100

also when doing cp bigfile /another_disk/anotherfile

Also very huge difference is seen when creating holes ;)
e.g compare time of seeking to 128Gb and writing one byte between
2.4.20-pre7 and 2.4.20-pre6 ;)
Code for that was once posted on our mailing list.
When you will feel that 2.4.20-pre7 hanged, don't trust that feeling
and wait till the end of operation. ;)

> Have I missed something?

To see the difference you need to fed FS with large chuncks of data.

Bye,
    Oleg

^ permalink raw reply	[flat|nested] 19+ messages in thread

* Re: Copy time comparison 2.4.20-pre6 <-> 2.4.19+data-logging (was:Compatibility of  current 2.4.19.pending ...)
  2002-09-14  9:02       ` Oleg Drokin
@ 2002-09-17  0:53         ` Manuel Krause
       [not found]         ` <3D867C14.5060404@mb.tu-ilmenau.de>
  1 sibling, 0 replies; 19+ messages in thread
From: Manuel Krause @ 2002-09-17  0:53 UTC (permalink / raw)
  Cc: reiserfs-list

On 09/14/2002 11:02 AM, Oleg Drokin wrote:
 > Hello!
 >
 > On Sat, Sep 14, 2002 at 01:07:37AM +0200, Manuel Krause wrote:
 >
 >>>>>So, for the overnight change to pre-kernel 7, where are the new
 >>>>>data-logging patches Hans even announced to lkml?
 >>>>
 >>??? Hello, Chris!
 >
 >
 > Chris said he'll either announce something on tuesday or pass the 
code to me to
 > finish it.
 >

I included this into my prayers.

 >
 >>Did I follow the thread "Re: [reiserfs-list] [BK] ReiserFS file write
 >>bug fix for 2.4" correctly that going to 2.4.20-pre7 that is without
 >>file_write (+) code should speed up (in my case copy actions) in
 >>comparison to 2.4.20-pre6?
 >
 >
 > No. file_write code should speed up writes that are done in big 
chunks (more
 > than 4k).
 >
 >
 >>I retested my backup session (see last mail) with 2.4.20-pre7 but have
 >>_NO_ ( <0,2% !!!) difference at all.
 >
 >
 > What tool do you use for backup?

Hey, Oleg, you should know this; I stayed with my simple but useful
scripts using "cp -ax A/. B/" for testing reiserfs performance on this
command related to my reiserfs partitions' content. That content is not
theoretical, it's living for about one month, then newly recreated --
and these backups showed up some issues in the past. ;-))

 > I certainly see the difference when doing
 >  dd if=/dev/zero of=somefile bs=1024k count=100

I didn't do dd related time comparisons though having the values for the
last months. Many months ago I found "bs=1M" to be more useful ;-)

 >
 > also when doing cp bigfile /another_disk/anotherfile
 >
 > Also very huge difference is seen when creating holes ;)
 > e.g compare time of seeking to 128Gb and writing one byte between
 > 2.4.20-pre7 and 2.4.20-pre6 ;)
 > Code for that was once posted on our mailing list.
 > When you will feel that 2.4.20-pre7 hanged, don't trust that feeling
 > and wait till the end of operation. ;)
 >

You're kind of joking?! "time <command>" should show the total lack of
time related performance, when [throughput] is divided by [time] or
not?! I calculated these values manually based upon the timings & df
values after copying. Yes, I want to show my personal engagement for the
reiserfs community, too.

 >
 >>Have I missed something?
 >
 >
 > To see the difference you need to fed FS with large chuncks of data.
 >

Thanks to you, Oleg,

Manuel



^ permalink raw reply	[flat|nested] 19+ messages in thread

[parent not found: <3D867C14.5060404@mb.tu-ilmenau.de>]

* Re: Copy time comparison 2.4.20-pre6 <-> 2.4.19+data-logging (was:Compatibility of  current 2.4.19.pending ...)
       [not found]         ` <3D867C14.5060404@mb.tu-ilmenau.de>
@ 2002-09-17  6:33           ` Oleg Drokin
  2002-09-17 13:56             ` Manuel Krause
  0 siblings, 1 reply; 19+ messages in thread
From: Oleg Drokin @ 2002-09-17  6:33 UTC (permalink / raw)
  To: Manuel Krause; +Cc: Andrea Arcangeli, reiserfs-list

Hello!

On Tue, Sep 17, 2002 at 02:49:24AM +0200, Manuel Krause wrote:

> >What tool do you use for backup?
> Hey, Oleg, you should know this; I stayed with my simple but useful 
> scripts using "cp -ax A/. B/" for testing reiserfs performance on this 
> command related to my reiserfs partitions' content. That content is not 

I am just making sure.

> >Also very huge difference is seen when creating holes ;)
> >e.g compare time of seeking to 128Gb and writing one byte between
> >2.4.20-pre7 and 2.4.20-pre6 ;)
> >Code for that was once posted on our mailing list.
> >When you will feel that 2.4.20-pre7 hanged, don't trust that feeling
> >and wait till the end of operation. ;)
> You're kind of joking?! "time <command>" should show the total lack of 
> time related performance, when [throughput] is divided by [time] or 

I must be missing something, but why do you want to divide throughput by time?
throughput is amount of data divided by time.

> not?! I calculated these values manually based upon the timings & df 
> values after copying. Yes, I want to show my personal engagement for the 
> reiserfs community, too.

Hm. Are you sure you are hitting write-speed limit?
May be you are read-speed bounded? Can you please check that?

Thank you.

Bye,
    Oleg

^ permalink raw reply	[flat|nested] 19+ messages in thread

* Re: Copy time comparison 2.4.20-pre6 <-> 2.4.19+data-logging (was:Compatibility of  current 2.4.19.pending ...)
  2002-09-17  6:33           ` Oleg Drokin
@ 2002-09-17 13:56             ` Manuel Krause
  2002-09-17 14:00               ` Oleg Drokin
  0 siblings, 1 reply; 19+ messages in thread
From: Manuel Krause @ 2002-09-17 13:56 UTC (permalink / raw)
  To: Oleg Drokin; +Cc: reiserfs-list

On 09/17/2002 08:33 AM, Oleg Drokin wrote:
> Hello!
> 
> On Tue, Sep 17, 2002 at 02:49:24AM +0200, Manuel Krause wrote:
> 
> 
>>>What tool do you use for backup?
>>
>>Hey, Oleg, you should know this; I stayed with my simple but useful 
>>scripts using "cp -ax A/. B/" for testing reiserfs performance on this 
>>command related to my reiserfs partitions' content. That content is not 
> 
> 
> I am just making sure.
> 
> 
>>>Also very huge difference is seen when creating holes ;)
>>>e.g compare time of seeking to 128Gb and writing one byte between
>>>2.4.20-pre7 and 2.4.20-pre6 ;)
>>>Code for that was once posted on our mailing list.
>>>When you will feel that 2.4.20-pre7 hanged, don't trust that feeling
>>>and wait till the end of operation. ;)
>>
>>You're kind of joking?! "time <command>" should show the total lack of 
>>time related performance, when [throughput] is divided by [time] or 
> 
> 
> I must be missing something, but why do you want to divide throughput by time?
> throughput is amount of data divided by time.

Sorry, my wrong words. I meant the amount of data devided by time.

> 
> 
>>not?! I calculated these values manually based upon the timings & df 
>>values after copying. Yes, I want to show my personal engagement for the 
>>reiserfs community, too.
> 
> 
> Hm. Are you sure you are hitting write-speed limit?
> May be you are read-speed bounded? Can you please check that?
> 

How can I check this? E.g. with some dd process? I just don't know.

Thanks,

Manuel


^ permalink raw reply	[flat|nested] 19+ messages in thread

* Re: Copy time comparison 2.4.20-pre6 <-> 2.4.19+data-logging (was:Compatibility of  current 2.4.19.pending ...)
  2002-09-17 13:56             ` Manuel Krause
@ 2002-09-17 14:00               ` Oleg Drokin
  2002-09-17 17:39                 ` Manuel Krause
  0 siblings, 1 reply; 19+ messages in thread
From: Oleg Drokin @ 2002-09-17 14:00 UTC (permalink / raw)
  To: Manuel Krause; +Cc: reiserfs-list

Hello!

On Tue, Sep 17, 2002 at 03:56:13PM +0200, Manuel Krause wrote:
> >>not?! I calculated these values manually based upon the timings & df 
> >>values after copying. Yes, I want to show my personal engagement for the 
> >>reiserfs community, too.
> >Hm. Are you sure you are hitting write-speed limit?
> >May be you are read-speed bounded? Can you please check that?
> How can I check this? E.g. with some dd process? I just don't know.

Copy same amount of data from RAM/nowhere to FS.
E.g. make a file with file names and sizes and write a script that
writes this amount of data from /dev/zero with these same names and needed sizes
into FS. (or just use RAMFS as your source if you have not much data and huge
RAM)
Compare 2.4.20-pre[67] if you see any difference.
Ah, also copy your data from original disk location to /dev/null and measure
time of that operation to know how much of total time is occupied by reads.

Also you can calculate read and write throughput separately this way.
And if reads are slower than writes - ...

Bye,
    Oleg

^ permalink raw reply	[flat|nested] 19+ messages in thread

* Re: Copy time comparison 2.4.20-pre6 <-> 2.4.19+data-logging (was:Compatibility of  current 2.4.19.pending ...)
  2002-09-17 14:00               ` Oleg Drokin
@ 2002-09-17 17:39                 ` Manuel Krause
  2002-09-18  5:20                   ` Oleg Drokin
  0 siblings, 1 reply; 19+ messages in thread
From: Manuel Krause @ 2002-09-17 17:39 UTC (permalink / raw)
  To: Oleg Drokin; +Cc: reiserfs-list

On 09/17/2002 04:00 PM, Oleg Drokin wrote:
> Hello!
> 
> On Tue, Sep 17, 2002 at 03:56:13PM +0200, Manuel Krause wrote:
> 
>>>>not?! I calculated these values manually based upon the timings & df 
>>>>values after copying. Yes, I want to show my personal engagement for the 
>>>>reiserfs community, too.
>>>
>>>Hm. Are you sure you are hitting write-speed limit?
>>>May be you are read-speed bounded? Can you please check that?
>>
>>How can I check this? E.g. with some dd process? I just don't know.
> 
> 
> Copy same amount of data from RAM/nowhere to FS.
> E.g. make a file with file names and sizes and write a script that
> writes this amount of data from /dev/zero with these same names and needed sizes
> into FS. (or just use RAMFS as your source if you have not much data and huge
> RAM)

To be honest, this already exceeds my linux knowledge...

I was fiddling with some test directories containing 195.8MB I copied to 
and from /dev/shm with swap turned off.

# time cp -a /dev/shm/. /mnt/beta/z.Backup.3/
kernel 2.4.20-pre7  | kernel 2.4.20-pre6
real    0m9.006s    | real    0m6.740s
user    0m0.190s    | user    0m0.230s
sys     0m5.250s    | sys     0m4.780s
# rm -r /dev/shm/*
# time cp -a /mnt/beta/z.Backup.3/. /dev/shm/
kernel 2.4.20-pre7  | kernel 2.4.20-pre6
real    0m6.349s    | real    0m6.180s
user    0m0.210s    | user    0m0.220s
sys     0m2.450s    | sys     0m2.510s

and

# time dd if=/dev/zero bs=1M count=1000 of=/mnt/beta/testfile.zero
kernel 2.4.20-pre7  | kernel 2.4.20-pre6
real    1m11.390s   | real    1m42.011s
user    0m0.010s    | user    0m0.000s
sys     0m11.230s   | sys     0m5.620s

# time dd of=/dev/null bs=1M if=/mnt/beta/testfile.zero
kernel 2.4.20-pre7  | kernel 2.4.20-pre6
real    1m16.738s   | real    1m39.094s
user    0m0.000s    | user    0m0.000s
sys     0m5.460s    | sys     0m5.930s


> Compare 2.4.20-pre[67] if you see any difference.
> Ah, also copy your data from original disk location to /dev/null and measure
> time of that operation to know how much of total time is occupied by reads.
> 
> Also you can calculate read and write throughput separately this way.
> And if reads are slower than writes - ...
> 

I'm definitely not sure if my lines above are something you meant.

Bye,

Manuel


^ permalink raw reply	[flat|nested] 19+ messages in thread

* Re: Copy time comparison 2.4.20-pre6 <-> 2.4.19+data-logging (was:Compatibility of  current 2.4.19.pending ...)
  2002-09-17 17:39                 ` Manuel Krause
@ 2002-09-18  5:20                   ` Oleg Drokin
  2002-09-19  1:14                     ` Manuel Krause
  0 siblings, 1 reply; 19+ messages in thread
From: Oleg Drokin @ 2002-09-18  5:20 UTC (permalink / raw)
  To: Manuel Krause; +Cc: reiserfs-list

Hello!

On Tue, Sep 17, 2002 at 07:39:39PM +0200, Manuel Krause wrote:

> >Copy same amount of data from RAM/nowhere to FS.
> >E.g. make a file with file names and sizes and write a script that
> >writes this amount of data from /dev/zero with these same names and needed 
> >sizes
> >into FS. (or just use RAMFS as your source if you have not much data and 
> >huge
> >RAM)
> To be honest, this already exceeds my linux knowledge...

I meant something to this extent:
You run a script that runs over your filesystem and creates shell script
that first creates whole dir structure of source dir and then for each file
creates necessary command to recreate file of the same size:
e.g for this directory contents:
green@angband:~/z> ls -lR
.:
total 1
drwxr-xr-x    2 green    green         114 Sep 18 09:08 t

./t:
total 148
-rw-rw-r--    1 green    green       69570 Aug 10 16:34 inode.c
-rw-rw-r--    1 green    green       66478 Aug 10 16:33 stree.c
-rw-rw-r--    1 green    green       10256 Aug 10 16:32 tail_conversion.c

Result of the work of the script would be:
mkdir t
mkdir t/z
dd if=/dev/zero of=t/z/inode.c bs=69570 count=1
dd if=/dev/zero of=t/z/stree.c bs=66478 count=1
dd if=/dev/zero of=t/z/tail_conversion.c bs=10256 count=1

And you can run resulting script in target dir.

> I was fiddling with some test directories containing 195.8MB I copied to 
> and from /dev/shm with swap turned off.
> 
> # time cp -a /dev/shm/. /mnt/beta/z.Backup.3/
> kernel 2.4.20-pre7  | kernel 2.4.20-pre6
> real    0m9.006s    | real    0m6.740s
> user    0m0.190s    | user    0m0.230s
> sys     0m5.250s    | sys     0m4.780s
> # rm -r /dev/shm/*
> # time cp -a /mnt/beta/z.Backup.3/. /dev/shm/
> kernel 2.4.20-pre7  | kernel 2.4.20-pre6
> real    0m6.349s    | real    0m6.180s
> user    0m0.210s    | user    0m0.220s
> sys     0m2.450s    | sys     0m2.510s

This dataset is way too small and entirely fits into your RAM I presume.
So to avoid any distortion or results you'd better have all periodic stuff
disabled. (though kupdated is still there) so it's better to run it several
times.
Also since it its into RAM, it must be flushed out, so I usually do this
using such command:
time sh -c "cp -a /testfs0/linux-2.4.18 /mnt/ ; umount /mnt"

> # time dd if=/dev/zero bs=1M count=1000 of=/mnt/beta/testfile.zero
> kernel 2.4.20-pre7  | kernel 2.4.20-pre6
> real    1m11.390s   | real    1m42.011s
> sys     0m11.230s   | sys     0m5.620s

Hm. While system time is less as expected, real time increased, that's strange.

> # time dd of=/dev/null bs=1M if=/mnt/beta/testfile.zero
> kernel 2.4.20-pre7  | kernel 2.4.20-pre6
> real    1m16.738s   | real    1m39.094s
> sys     0m5.460s    | sys     0m5.930s

And real time is bigger for reads too, so it seems data layout is different.

That's really strange. If you can reproduce this behaviour, I am interested
in getting debugreiserfs -d output for each case after you umount this volume
(I assume that /mnt/beta/ filesystems contains nothing but this testfile.zero
file).

> >Compare 2.4.20-pre[67] if you see any difference.
> >Ah, also copy your data from original disk location to /dev/null and 
> >measure
> >time of that operation to know how much of total time is occupied by reads.
> >Also you can calculate read and write throughput separately this way.
> >And if reads are slower than writes - ...
> I'm definitely not sure if my lines above are something you meant.

Yes, kind of, though you have omitted timings of copying original data to
/dev/shm/ that will give us read speed from original media.

In fact instead of turning of swap you can do
mount none /mnt/ramfs -t ramfs
command (if you have ramfs compiled in of course) and /mnt/ramfs is now
kind of ram filesystem with very low overhead. It also cannot be swapped out
so if you fill all of your RAM, your box will OOM ;)
Byt the test itself is very small.
Probably you need to run something like
time find /source/that/needs/to/be/backed/up -type f -exec cat {} >/dev/null \;

to get read performance and implement a script like I mentioned in the beginning
to measure writes.
This way you do not need tons of RAM.

Bye,
    Oleg

^ permalink raw reply	[flat|nested] 19+ messages in thread

* Re: Copy time comparison 2.4.20-pre6 <-> 2.4.19+data-logging (was:Compatibility of  current 2.4.19.pending ...)
  2002-09-18  5:20                   ` Oleg Drokin
@ 2002-09-19  1:14                     ` Manuel Krause
  2002-09-19  6:34                       ` Oleg Drokin
  0 siblings, 1 reply; 19+ messages in thread
From: Manuel Krause @ 2002-09-19  1:14 UTC (permalink / raw)
  To: Oleg Drokin; +Cc: reiserfs-list

Hi!

Even facing the problem to make this mail unreadable I want to answer 
all inline. Please, don't complain.

I feel the urgent need to correct at least my last testing results as 
your last mail revealed heavy errors of my timing tests and kind-of 
opened my eyes.

On 09/18/2002 07:20 AM, Oleg Drokin wrote:
> Hello!
> 
> On Tue, Sep 17, 2002 at 07:39:39PM +0200, Manuel Krause wrote:
> 
>>>Copy same amount of data from RAM/nowhere to FS.
>>>E.g. make a file with file names and sizes and write a script that
>>>writes this amount of data from /dev/zero with these same names and needed 
>>>sizes into FS. (or just use RAMFS as your source if you have not much data 
>>>and huge RAM)
>>
>>To be honest, this already exceeds my linux knowledge...
> 
> I meant something to this extent:
> You run a script that runs over your filesystem and creates shell script
> that first creates whole dir structure of source dir and then for each file
> creates necessary command to recreate file of the same size:
> e.g for this directory contents:
> green@angband:~/z> ls -lR 
[...]
> Result of the work of the script would be:
> mkdir t
> mkdir t/z
> dd if=/dev/zero of=t/z/inode.c bs=69570 count=1
> dd if=/dev/zero of=t/z/stree.c bs=66478 count=1
> dd if=/dev/zero of=t/z/tail_conversion.c bs=10256 count=1
> 
> And you can run resulting script in target dir.

Yes, I saw this work in a nightmare last night. Scheduled for some dark 
moonless cold snow flurry winter night, sorry. Except for someone 
experienced likes to provide me with a basic script for that... ;-))

>>I was fiddling with some test directories containing 195.8MB I copied to 
>>and from /dev/shm with swap turned off.
>>
>># time cp -a /dev/shm/. /mnt/beta/z.Backup.3/
>>kernel 2.4.20-pre7  | kernel 2.4.20-pre6
>>real    0m9.006s    | real    0m6.740s
>>user    0m0.190s    | user    0m0.230s
>>sys     0m5.250s    | sys     0m4.780s
>># rm -r /dev/shm/*
>># time cp -a /mnt/beta/z.Backup.3/. /dev/shm/
>>kernel 2.4.20-pre7  | kernel 2.4.20-pre6
>>real    0m6.349s    | real    0m6.180s
>>user    0m0.210s    | user    0m0.220s
>>sys     0m2.450s    | sys     0m2.510s
> 
> This dataset is way too small and entirely fits into your RAM I presume.

Yes, it fits. I know that problem with this RAM based test. Though I may 
increase the testing directory a bit closer to the OOM limit, having 
512MB available.

> So to avoid any distortion or results you'd better have all periodic stuff
> disabled. (though kupdated is still there) so it's better to run it several
> times.
> Also since it its into RAM, it must be flushed out, so I usually do this
> using such command:
> time sh -c "cp -a /testfs0/linux-2.4.18 /mnt/ ; umount /mnt"

Couldn't you have written these words to me some years earlier?! The 
effect is measurable and on almost any so far discussed fs interaction 
huge or at least relevant. So, after reviewing my 
partition-backup-scripts, forget _all_ results I posted to the list. 
They're all lacking the umount=flush component.

Now, you caught me as the "fool of reiserfs-list"... Quite embarrassing. 
Mmmh. Painful.

>># time dd if=/dev/zero bs=1M count=1000 of=/mnt/beta/testfile.zero
>>kernel 2.4.20-pre7  | kernel 2.4.20-pre6
>>real    1m11.390s   | real    1m42.011s
>>sys     0m11.230s   | sys     0m5.620s
> 
> Hm. While system time is less as expected, real time increased, that's strange.
> 
>># time dd of=/dev/null bs=1M if=/mnt/beta/testfile.zero
>>kernel 2.4.20-pre7  | kernel 2.4.20-pre6
>>real    1m16.738s   | real    1m39.094s
>>sys     0m5.460s    | sys     0m5.930s
> 
> And real time is bigger for reads too, so it seems data layout is different.
> 
> That's really strange. If you can reproduce this behaviour, I am interested
> in getting debugreiserfs -d output for each case after you umount this volume
> (I assume that2 /mnt/beta/ filesystems contains nothing but this testfile.zero
> file).

No. /mnt/beta/ is my software storage partition and contains this:
  /dev/hda11   5550248   4089088   1461160  74% /mnt/beta .

I have no means to provide this complete "debugreiserfs -d /dev/hda11" 
output set, 42MB one of four, if I read your wording correctly (-pre6 
without 1G file, -pre6 with 1GB file, -pre7 without 1GB file, -pre7 with 
1GB file) on my web-space. As .tar.gz it's 4MB one of four, and even 
that set doesn't fit on my private t-online  website. Maybe, it would 
work if sent sequentially by mail.
Oh. O.k. you get a definite "No" on this, sorry. I just reviewed the 
debugreiserfs' output file content and I would not send or publish this 
in any way. It is simply too sensitive as it contains direct file and 
directory names.
Is it possible to provide the needed info without clear directory or 
file names in future?! (These names replaced by sequentionally taken 
numbers?)

////

O.k., too many words about unneeded things. I've remade the testing I 
posted and kept to umount the interacting related partition inbetween in 
order to force the needed flush and captured that time, too, now. My 
previously posted values have not been reproducible, if I review the new 
values correctly. But, you need to read and interprete it yourself!

I took 3 timings each test and calculated the mean value.

Comparison of "dd" actions:
---------------------------
reading command: time sh -c "dd if=/mnt/beta/testfile.zero bs=1M
  count=1000 of=/dev/null ; umount /mnt/beta"
writing command: time sh -c "dd if=/dev/zero bs=1M count=1000
  of=/mnt/beta/testfile.zero ; umount /mnt/beta"

mean-pre7 writing dd 1G | mean-pre6 writing dd 1G
real   1m32.288s        | real   1m30.531s
user   0m0.013s         | user   0m0.016s
sys    0m11.207s        | sys    0m5.036s

mean-pre7 reading dd 1G | mean-pre6 reading dd 1G
real    1m24.002s       | real   1m22.039s
user    0m0.010s        | user   0m0.013s
sys     0m6.470s        | sys    0m6.083s

related df values:
/dev/hda11   5550248   4089088   1461160  74% /mnt/beta
/dev/hda11   5550248   5114104    436144  93% /mnt/beta

Yes, that's going over the "senfseful" filesystem content value.

////

Comparison of "cp -a" actions:
------------------------------
reading command: time sh -c "cp -a /mnt/beta/z.Backup.3/. /mnt/ramfs/ ;
  umount /mnt/beta ; umount /mnt/ramfs"
writing command: time sh -c "cp -a /mnt/ramfs/. /mnt/beta/z.Backup.3/ ;
  umount /mnt/beta ; umount /mnt/ramfs"

mean-pre7 reading files | mean-pre6 reading files
real    0m38.641s       | real    0m39.110s
user    0m0.200s        | user    0m0.220s
sys     0m3.400s        | sys     0m3.200s

mean-pre7 writing files | mean-pre6 writing files
real    0m25.128s       | real    0m27.689s
user    0m0.200s        | user    0m0.160s
sys     0m4.860s        | sys     0m5.217s

directory content: 171.3MB

////

>>>Compare 2.4.20-pre[67] if you see any difference.
>>>Ah, also copy your data from original disk location to /dev/null and 
>>>measure
>>>time of that operation to know how much of total time is occupied by reads.
>>>Also you can calculate read and write throughput separately this way.
>>>And if reads are slower than writes - ...
>>
>>I'm definitely not sure if my lines above are something you meant.
> 
> Yes, kind of, though you have omitted timings of copying original data to
> /dev/shm/ that will give us read speed from original media.

I thought my posted set "# time cp -a /mnt/beta/z.Backup.3/. /dev/shm/" 
represented copying the original data from reiserfs partition /mnt/beta 
to /dev/shm ?!

> In fact instead of turning of swap you can do
> mount none /mnt/ramfs -t ramfs
> command (if you have ramfs compiled in of course) and /mnt/ramfs is now
> kind of ram filesystem with very low overhead. It also cannot be swapped out
> so if you fill all of your RAM, your box will OOM ;)

Even more, after finding that RAMFS was now internally set enabled in 
the related Config.in but searching it over and over before (Me: Am I 
really blind, now?!) I saw huge throughput diffs between shm and RAMFS.

> Byt the test itself is very small.
> Probably you need to run something like
> time find /source/that/needs/to/be/backed/up -type f -exec cat {} >/dev/null \;
> 
> to get read performance and implement a script like I mentioned in the beginning
> to measure writes.
> This way you do not need tons of RAM.

I see. :-)
Hope that mail brings some clarification and I haven't forgotten too 
much for now.

Thanks, good night,

Manuel

^ permalink raw reply	[flat|nested] 19+ messages in thread

* Re: Copy time comparison 2.4.20-pre6 <-> 2.4.19+data-logging (was:Compatibility of  current 2.4.19.pending ...)
  2002-09-19  1:14                     ` Manuel Krause
@ 2002-09-19  6:34                       ` Oleg Drokin
  2002-09-19 15:52                         ` Manuel Krause
  0 siblings, 1 reply; 19+ messages in thread
From: Oleg Drokin @ 2002-09-19  6:34 UTC (permalink / raw)
  To: Manuel Krause; +Cc: reiserfs-list

Hello!

On Thu, Sep 19, 2002 at 03:14:56AM +0200, Manuel Krause wrote:

> >And you can run resulting script in target dir.
> Yes, I saw this work in a nightmare last night. Scheduled for some dark 
> moonless cold snow flurry winter night, sorry. Except for someone 
> experienced likes to provide me with a basic script for that... ;-))

> >This dataset is way too small and entirely fits into your RAM I presume.
> Yes, it fits. I know that problem with this RAM based test. Though I may 
> increase the testing directory a bit closer to the OOM limit, having 
> 512MB available.

No, this is not enough of course since some data will remain unflushed and
amount of such data is relatively big compared to total amount of data.

> >So to avoid any distortion or results you'd better have all periodic stuff
> >disabled. (though kupdated is still there) so it's better to run it several
> >times.
> >Also since it its into RAM, it must be flushed out, so I usually do this
> >using such command:
> >time sh -c "cp -a /testfs0/linux-2.4.18 /mnt/ ; umount /mnt"
> Couldn't you have written these words to me some years earlier?! The 
> effect is measurable and on almost any so far discussed fs interaction 
> huge or at least relevant. So, after reviewing my 
> partition-backup-scripts, forget _all_ results I posted to the list. 
> They're all lacking the umount=flush component.

It is only needed if data to be cached is big enough to be noticed when compared
to total amount of data to be copied.

> No. /mnt/beta/ is my software storage partition and contains this:
>  /dev/hda11   5550248   4089088   1461160  74% /mnt/beta .

Ah!

> Oh. O.k. you get a definite "No" on this, sorry. I just reviewed the 
> debugreiserfs' output file content and I would not send or publish this 
> in any way. It is simply too sensitive as it contains direct file and 
> directory names.
No, then I do not need that debugreiserfs dump anyway.

But here is another warning:
I presume before each copy test is done, /mnt/beta/z.Backup.3 is removed
completely and /mnt/beta is unmounted and mounted back, also
between sevetral writing attempts (And during these attempts of course)
no other processes can write to to this FS.
If those two above clauses are not true, then results are also meaningless,
as lots of unnecessary tree reads are issued for overwrite and new blocks are
not allocated, but existing ones are reused.
If somebody can write to FS, then with every next test blocks chosen for files
are different (old ones may be already occupied).

> Is it possible to provide the needed info without clear directory or 
> file names in future?! (These names replaced by sequentionally taken 
> numbers?)

In such a case you can determine object id of big file (shown to userspace
as inode number) and only provide it's SD and indirect items:
|  9|4 357 0x0 SD (0), len 44, location 1572 entry count 65535, ...
| 10|4 357 0x1 IND (1), len 504, location 1068 entr....
126 pointers
[ 9948(126)]

This is a example of file with objectid 357, that have 126 blocks in size.
9948-10074 blocks (all continuous) are used.

If file is very big, there would be several IND (indirect) items in other nodes,
number in brackets will changes to show offset that this INDIRECT item starts
with.

> Comparison of "dd" actions:
> ---------------------------
> reading command: time sh -c "dd if=/mnt/beta/testfile.zero bs=1M
>  count=1000 of=/dev/null ; umount /mnt/beta"
> writing command: time sh -c "dd if=/dev/zero bs=1M count=1000
>  of=/mnt/beta/testfile.zero ; umount /mnt/beta"

I presume you earased /mnt/beta/testfile.zero between tests and executed
sync.

Ah, until I forgot - in reiserfs if you erased something blocks that were
freed are only get back to you on next journal flush or after sync.
So if you do something like this:
rm -f /mnt/beta/testfile.zero ; time sh -c "dd ...",
then second file will get different blocknumbers.

> related df values:
> /dev/hda11   5550248   4089088   1461160  74% /mnt/beta
> /dev/hda11   5550248   5114104    436144  93% /mnt/beta
> Yes, that's going over the "senfseful" filesystem content value.

Hm. This is before and after dd command or what?

> Comparison of "cp -a" actions:
> ------------------------------
> reading command: time sh -c "cp -a /mnt/beta/z.Backup.3/. /mnt/ramfs/ ;
>  umount /mnt/beta ; umount /mnt/ramfs"
> writing command: time sh -c "cp -a /mnt/ramfs/. /mnt/beta/z.Backup.3/ ;
>  umount /mnt/beta ; umount /mnt/ramfs"

You mean you executed your commands in this same order?
I.e. first reading files from partition and then writing same files back
in place of already existing ones? Then above thing about overwriting files
applies directly in here.
I thought you were reading files from one filesystem and writing these files
to another one.

> >Yes, kind of, though you have omitted timings of copying original data to
> >/dev/shm/ that will give us read speed from original media.
> I thought my posted set "# time cp -a /mnt/beta/z.Backup.3/. /dev/shm/" 
> represented copying the original data from reiserfs partition /mnt/beta 
> to /dev/shm ?!

I thought that original data is residing on another filesystem on another disk.
If originally you vere copying data from one disk to the same disk, just
another partition, then thisis just lots of seeks.

Bye,
    Oleg

^ permalink raw reply	[flat|nested] 19+ messages in thread

* Re: Copy time comparison 2.4.20-pre6 <-> 2.4.19+data-logging (was:Compatibility of  current 2.4.19.pending ...)
  2002-09-19  6:34                       ` Oleg Drokin
@ 2002-09-19 15:52                         ` Manuel Krause
  2002-09-19 16:01                           ` Oleg Drokin
  0 siblings, 1 reply; 19+ messages in thread
From: Manuel Krause @ 2002-09-19 15:52 UTC (permalink / raw)
  To: Oleg Drokin; +Cc: reiserfs-list

On 09/19/2002 08:34 AM, Oleg Drokin wrote:
> Hello!
> 
> On Thu, Sep 19, 2002 at 03:14:56AM +0200, Manuel Krause wrote:
> 
> 
>>>And you can run resulting script in target dir.
>>
>>Yes, I saw this work in a nightmare last night. Scheduled for some dark 
>>moonless cold snow flurry winter night, sorry. Except for someone 
>>experienced likes to provide me with a basic script for that... ;-))
> 
>>>This dataset is way too small and entirely fits into your RAM I presume.
>>
>>Yes, it fits. I know that problem with this RAM based test. Though I may 
>>increase the testing directory a bit closer to the OOM limit, having 
>>512MB available.
> 
> No, this is not enough of course since some data will remain unflushed and
> amount of such data is relatively big compared to total amount of data.

And if the participating filesystems are umounted after writing the data?

>>>So to avoid any distortion or results you'd better have all periodic stuff
>>>disabled. (though kupdated is still there) so it's better to run it several
>>>times.
>>>Also since it its into RAM, it must be flushed out, so I usually do this
>>>using such command:
>>>time sh -c "cp -a /testfs0/linux-2.4.18 /mnt/ ; umount /mnt"
>>
>>Couldn't you have written these words to me some years earlier?! The 
>>effect is measurable and on almost any so far discussed fs interaction 
>>huge or at least relevant. So, after reviewing my 
>>partition-backup-scripts, forget _all_ results I posted to the list. 
>>They're all lacking the umount=flush component.
> 
> It is only needed if data to be cached is big enough to be noticed when compared
> to total amount of data to be copied.
> 
>>No. /mnt/beta/ is my software storage partition and contains this:
>> /dev/hda11   5550248   4089088   1461160  74% /mnt/beta .
> 
> Ah!
> 
>>Oh. O.k. you get a definite "No" on this, sorry. I just reviewed the 
>>debugreiserfs' output file content and I would not send or publish this 
>>in any way. It is simply too sensitive as it contains direct file and 
>>directory names.
> 
> No, then I do not need that debugreiserfs dump anyway.
> 
> But here is another warning:
> I presume before each copy test is done, /mnt/beta/z.Backup.3 is removed
> completely and /mnt/beta is unmounted and mounted back, also
> between sevetral writing attempts (And during these attempts of course)
> no other processes can write to to this FS.

These clauses are both true and apply to the dd tests, too. Mmh, except 
for the case when /mnt/beta/z.Backup.3 or testfile.zero were the source 
to be copied/dd'ed... There haven't been overwrites or other writes to 
the disk during the testsets.

> If those two above clauses are not true, then results are also meaningless,
> as lots of unnecessary tree reads are issued for overwrite and new blocks are
> not allocated, but existing ones are reused.
> If somebody can write to FS, then with every next test blocks chosen for files
> are different (old ones may be already occupied).
> 
>>Is it possible to provide the needed info without clear directory or 
>>file names in future?! (These names replaced by sequentionally taken 
>>numbers?)
> 
> In such a case you can determine object id of big file (shown to userspace
> as inode number) and only provide it's SD and indirect items:
> |  9|4 357 0x0 SD (0), len 44, location 1572 entry count 65535, ...
> | 10|4 357 0x1 IND (1), len 504, location 1068 entr....
> 126 pointers
> [ 9948(126)]
> 
> This is a example of file with objectid 357, that have 126 blocks in size.
> 9948-10074 blocks (all continuous) are used.
> 
> If file is very big, there would be several IND (indirect) items in other nodes,
> number in brackets will changes to show offset that this INDIRECT item starts
> with.
> 
>>Comparison of "dd" actions:
>>---------------------------
>>reading command: time sh -c "dd if=/mnt/beta/testfile.zero bs=1M
>> count=1000 of=/dev/null ; umount /mnt/beta"
>>writing command: time sh -c "dd if=/dev/zero bs=1M count=1000
>> of=/mnt/beta/testfile.zero ; umount /mnt/beta"
> 
> I presume you earased /mnt/beta/testfile.zero between tests and executed
> sync.

I umounted the partition and mounted it back. I thought that would be 
the right action to avoid what you describe in the following:

> Ah, until I forgot - in reiserfs if you erased something blocks that were
> freed are only get back to you on next journal flush or after sync.
> So if you do something like this:
> rm -f /mnt/beta/testfile.zero ; time sh -c "dd ...",
> then second file will get different blocknumbers.
> 
>>related df values:
>>/dev/hda11   5550248   4089088   1461160  74% /mnt/beta
>>/dev/hda11   5550248   5114104    436144  93% /mnt/beta
>>Yes, that's going over the "senfseful" filesystem content value.
> 
> Hm. This is before and after dd command or what?

Yes, without and with the 1G file.

>>Comparison of "cp -a" actions:
>>------------------------------
>>reading command: time sh -c "cp -a /mnt/beta/z.Backup.3/. /mnt/ramfs/ ;
>> umount /mnt/beta ; umount /mnt/ramfs"
>>writing command: time sh -c "cp -a /mnt/ramfs/. /mnt/beta/z.Backup.3/ ;
>> umount /mnt/beta ; umount /mnt/ramfs"
> 
> You mean you executed your commands in this same order?

The order was:
mount /mnt/beta, mount ramfs, executing the reading command with the 
umounts -- and this order 2 more times -- for the read values.
Then :
mount ramfs, mount /mnt/beta, move data to ramfs, umount and mount back 
/mnt/beta, executing the writing command with the umounts -- and this 2 
more times -- for the write values.

So I didn't execute the above commands in direct order.

> I.e. first reading files from partition and then writing same files back
> in place of already existing ones? Then above thing about overwriting files
> applies directly in here.
> I thought you were reading files from one filesystem and writing these files
> to another one.

>>>Yes, kind of, though you have omitted timings of copying original data to
>>>/dev/shm/ that will give us read speed from original media.
>>
>>I thought my posted set "# time cp -a /mnt/beta/z.Backup.3/. /dev/shm/" 
>>represented copying the original data from reiserfs partition /mnt/beta 
>>to /dev/shm ?!
> 
> I thought that original data is residing on another filesystem on another disk.
> If originally you vere copying data from one disk to the same disk, just
> another partition, then thisis just lots of seeks.

Thanks for these reminders. Originally I wanted to copy data from one 
disk to another and on the way capture the timings to compare the 
throughput... Then you pointed me to re-check pure reads and pure writes 
mainly to make sure the writes were not read-speed bound and to compare 
this behaviour on -pre6 and -pre7.

Bye,

Manuel


^ permalink raw reply	[flat|nested] 19+ messages in thread

* Re: Copy time comparison 2.4.20-pre6 <-> 2.4.19+data-logging (was:Compatibility of  current 2.4.19.pending ...)
  2002-09-19 15:52                         ` Manuel Krause
@ 2002-09-19 16:01                           ` Oleg Drokin
  2002-09-23  0:36                             ` Manuel Krause
  0 siblings, 1 reply; 19+ messages in thread
From: Oleg Drokin @ 2002-09-19 16:01 UTC (permalink / raw)
  To: Manuel Krause; +Cc: reiserfs-list

Hello!

On Thu, Sep 19, 2002 at 05:52:15PM +0200, Manuel Krause wrote:

> >>Yes, it fits. I know that problem with this RAM based test. Though I may 
> >>increase the testing directory a bit closer to the OOM limit, having 
> >>512MB available.
> >No, this is not enough of course since some data will remain unflushed and
> >amount of such data is relatively big compared to total amount of data.
> And if the participating filesystems are umounted after writing the data?

Then it is ok of course. All fs' buffers are flushed on umount.

> >But here is another warning:
> >I presume before each copy test is done, /mnt/beta/z.Backup.3 is removed
> >completely and /mnt/beta is unmounted and mounted back, also
> >between sevetral writing attempts (And during these attempts of course)
> >no other processes can write to to this FS.
> These clauses are both true and apply to the dd tests, too. Mmh, except 
> for the case when /mnt/beta/z.Backup.3 or testfile.zero were the source 
> to be copied/dd'ed... There haven't been overwrites or other writes to 
> the disk during the testsets.
Ok, good.

> >I presume you earased /mnt/beta/testfile.zero between tests and executed
> >sync.
> I umounted the partition and mounted it back. I thought that would be 
> the right action to avoid what you describe in the following:

Yes.

> >I thought that original data is residing on another filesystem on another 
> >disk.
> >If originally you vere copying data from one disk to the same disk, just
> >another partition, then thisis just lots of seeks.
> Thanks for these reminders. Originally I wanted to copy data from one 
> disk to another and on the way capture the timings to compare the 
> throughput... Then you pointed me to re-check pure reads and pure writes 
> mainly to make sure the writes were not read-speed bound and to compare 
> this behaviour on -pre6 and -pre7.

Originally I was mostly interested in reads from source fs, not the one
where you have copied the data (though that one is also might be useful).

Ok, thank you for lots of testing.

Bye,
    Oleg

^ permalink raw reply	[flat|nested] 19+ messages in thread

* Re: Copy time comparison 2.4.20-pre6 <-> 2.4.19+data-logging (was:Compatibility of  current 2.4.19.pending ...)
  2002-09-19 16:01                           ` Oleg Drokin
@ 2002-09-23  0:36                             ` Manuel Krause
  0 siblings, 0 replies; 19+ messages in thread
From: Manuel Krause @ 2002-09-23  0:36 UTC (permalink / raw)
  To: Oleg Drokin; +Cc: reiserfs-list

On 09/19/2002 06:01 PM, Oleg Drokin wrote:
> Hello!
> 
> On Thu, Sep 19, 2002 at 05:52:15PM +0200, Manuel Krause wrote:
> 
[...]
>>
>>Thanks for these reminders. Originally I wanted to copy data from one 
>>disk to another and on the way capture the timings to compare the 
>>throughput... Then you pointed me to re-check pure reads and pure writes 
>>mainly to make sure the writes were not read-speed bound and to compare 
>>this behaviour on -pre6 and -pre7.
> 
> Originally I was mostly interested in reads from source fs, not the one
> where you have copied the data (though that one is also might be useful).
> 
> Ok, thank you for lots of testing.
> 
> Bye,
>     Oleg
> 

Hi Oleg & others!

O.k. somehow I managed to "find" the "script to make the scripts" to 
measure writes from nowhere to the target disk. I decided to make up 2 
scripts - one for the dirs and one for the files, originally for the 
comparison of "pure" file wites to the reads (read below) including the 
umounts. They're containing just the lists of commands Oleg posted 
recently (e.g. mkdir "/mnt/beta/dir1" /// dd if=/dev/zero count=1 
bs=8466760 of="/mnt/beta/dir1/.../filename1" ). And I re-used Olegs 
posted command to measure reads from the source disk for the most recent 
reiserfs kernel versions.

The results are shown below.

I'm not very glad with comparing the "pure" reads against writes 
especially when watching the copy timings as different commands are 
included and their (relative) overhead is quite unknown, am I right? (So 
e.g. I didn't want to take these values' relation to tweak the disks 
read & write latency settings for now.)  CPU usage during the dd... and 
the find...cat commands is very high. Also the disk access when watched 
in ksysguardd is different for all these kinds of tests.

So, read and compare it yourself and I would be glad to get comments on 
how I needed to refine it.

Good night,

Manuel

------------------------------------------------------------------

/dev/hda11             5550248   3927572   1622676  71% /mnt/beta
/dev/hdd11             5550248   3927572   1622676  71% /mnt/gamma
containing 58015 files, 3481 files

kernel 2.4.19-      2.4.20-pre6  2.4.20-pre7
        data-logging

# time sh -c "cp -ax /mnt/gamma/. /mnt/beta/ ; umount /mnt/gamma ; 
umount /mnt/beta "
(representing copies from source to target disk)

real   7m46.970s    10m57.716s   10m04.328s
user   0m01.710s     0m01.390s    0m01.540s
sys    1m25.440s     1m11.100s    1m18.840s

# time /tmp/script.dirs.sh
(representing directory writes to target disk)

real   0m09.972s     0m10.055s    0m10.316s
user   0m04.960s     0m04.770s    0m04.880s
sys    0m04.370s     0m04.590s    0m04.550s

# time sh -c "/tmp/script.files.sh ; umount /dev/hda11 ; umount /dev/hdd11 "
(representing file writes to target disk)

real   7m35.992s     8m24.499s    8m20.830s
user   1m31.100s     1m30.120s    1m32.280s
sys    2m21.480s     1m42.230s    2m02.660s

# cd /mnt/ ; time sh -c "find ./gamma/* -type f -exec cat {} >/dev/null 
\; ; umount /dev/hda11 ; umount /dev/hdd11 "
(representing reads from source disk)
real   8m34.665s     9m54.103s    9m50.796s
user   1m11.650s     1m10.760s    1m11.100s
sys    1m15.000s     1m29.960s    1m17.370s

I took care to freshly mount the participating partitions each
test, recreate and mount+umount the counterpart reiserfs
partition before when appropriate, had no overwrites and no
other accesses to the source partition during this test.

Going back from 2.4.20-pre7 to 2.4.19-data-logging and
beeing in doubt of the effects of the new block allocator I
recreated the source filesystem completely (copy to new
target and copy back) to measure the above data-logging
values. The first copy from the former 2.4.20-pre7 reiserfs
to 2.4.19-data-logging had this timings:
   real    7m38.715s
   user    0m2.030s
   sys     1m29.560s

Command to take the "/tmp/script.[dirs,files].sh" :
sh -c 'cd /mnt/gamma ; find * -type d -fprintf /tmp/script.dirs.sh 
"mkdir \"/mnt/beta/%p\"\n" ; find * -type f -fprintf 
/tmp/script.files.sh "dd if=/dev/zero count=1 bs=%s 
of=\"/mnt/beta/%p\"\n" ; chmod u+x /tmp/script.*.sh'

------------------------------------------------------------------

^ permalink raw reply	[flat|nested] 19+ messages in thread

end of thread, other threads:[~2002-09-23  0:36 UTC | newest]

Thread overview: 19+ messages (download: mbox.gz follow: Atom feed
-- links below jump to the message on this page --
2002-09-12  0:24 Compatibility of current 2.4.19.pending patchset with old data-logging?? Manuel Krause
2002-09-12 13:28 ` newbie how to darren
2002-09-12 14:37   ` Hans Reiser
2002-09-12 15:12     ` Guillermo Torreiro
2002-09-12 21:30 ` Copy time comparison 2.4.20-pre6 <-> 2.4.19+data-logging (was:Compatibility of current 2.4.19.pending ...) Manuel Krause
2002-09-13  7:37   ` Oleg Drokin
2002-09-13 23:07     ` Manuel Krause
2002-09-14  9:02       ` Oleg Drokin
2002-09-17  0:53         ` Manuel Krause
     [not found]         ` <3D867C14.5060404@mb.tu-ilmenau.de>
2002-09-17  6:33           ` Oleg Drokin
2002-09-17 13:56             ` Manuel Krause
2002-09-17 14:00               ` Oleg Drokin
2002-09-17 17:39                 ` Manuel Krause
2002-09-18  5:20                   ` Oleg Drokin
2002-09-19  1:14                     ` Manuel Krause
2002-09-19  6:34                       ` Oleg Drokin
2002-09-19 15:52                         ` Manuel Krause
2002-09-19 16:01                           ` Oleg Drokin
2002-09-23  0:36                             ` Manuel Krause

This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.