* File System Performance results
@ 2008-10-22 20:06 Steven Pratt
2008-10-25 9:15 ` Andreas Dilger
2008-10-29 19:26 ` Chris Mason
0 siblings, 2 replies; 10+ messages in thread
From: Steven Pratt @ 2008-10-22 20:06 UTC (permalink / raw)
To: linux-fsdevel
We have set up a new page which is intended mainly for tracking the
performance of BTRFS, but in doing so we are testing other filesystems
as well (ext3, ext4, xfs and jfs). Thought some people here might find
the results useful.
The main page is here:
http://btrfs.boxacle.net/
Information about the machine configuration, tests run, how to reproduce
the run and link to graphs of all the results are provided off of this
page. When looking at any individual test, links are provided to the
detail output from the tests including iostat, mpstat, oprofile data and
more.
Feedback and request for additional tests are always welcomed.
Steve
^ permalink raw reply [flat|nested] 10+ messages in thread
* Re: File System Performance results
2008-10-22 20:06 File System Performance results Steven Pratt
@ 2008-10-25 9:15 ` Andreas Dilger
2008-10-27 14:28 ` Steven Pratt
2008-10-29 19:26 ` Chris Mason
1 sibling, 1 reply; 10+ messages in thread
From: Andreas Dilger @ 2008-10-25 9:15 UTC (permalink / raw)
To: Steven Pratt; +Cc: linux-fsdevel
On Oct 22, 2008 15:06 -0500, Steven Pratt wrote:
> We have set up a new page which is intended mainly for tracking the
> performance of BTRFS, but in doing so we are testing other filesystems
> as well (ext3, ext4, xfs and jfs). Thought some people here might find
> the results useful.
>
>
> The main page is here:
>
> http://btrfs.boxacle.net/
>
> Information about the machine configuration, tests run, how to reproduce
> the run and link to graphs of all the results are provided off of this
> page. When looking at any individual test, links are provided to the
> detail output from the tests including iostat, mpstat, oprofile data and
> more.
Steve,
thanks for posting the numbers. They are definitely interesting. On
the surface, ext4 is doing quite well overall (yay!), but the important
point to realize is that btrfs is also providing a lot of extra function
under the covers so it isn't necessarily a clear-cut answer on which one
to pick.
The extra CPU cost of btrfs will become increasingly irrelevant in the
future I think.
Cheers, Andreas
--
Andreas Dilger
Sr. Staff Engineer, Lustre Group
Sun Microsystems of Canada, Inc.
^ permalink raw reply [flat|nested] 10+ messages in thread
* Re: File System Performance results
2008-10-25 9:15 ` Andreas Dilger
@ 2008-10-27 14:28 ` Steven Pratt
2008-10-27 14:37 ` Chris Mason
2008-10-27 16:02 ` Theodore Tso
0 siblings, 2 replies; 10+ messages in thread
From: Steven Pratt @ 2008-10-27 14:28 UTC (permalink / raw)
To: Andreas Dilger; +Cc: linux-fsdevel
Andreas Dilger wrote:
> On Oct 22, 2008 15:06 -0500, Steven Pratt wrote:
>
>> We have set up a new page which is intended mainly for tracking the
>> performance of BTRFS, but in doing so we are testing other filesystems
>> as well (ext3, ext4, xfs and jfs). Thought some people here might find
>> the results useful.
>>
>>
>> The main page is here:
>>
>> http://btrfs.boxacle.net/
>>
>> Information about the machine configuration, tests run, how to reproduce
>> the run and link to graphs of all the results are provided off of this
>> page. When looking at any individual test, links are provided to the
>> detail output from the tests including iostat, mpstat, oprofile data and
>> more.
>>
>
> Steve,
> thanks for posting the numbers. They are definitely interesting. On
> the surface, ext4 is doing quite well overall (yay!),
Yes, that was good news. Along these lines if there is anything else we
can do to help out ext4, just let us know.
> but the important
> point to realize is that btrfs is also providing a lot of extra function
> under the covers so it isn't necessarily a clear-cut answer on which one
> to pick.
>
> The extra CPU cost of btrfs will become increasingly irrelevant in the
> future I think.
>
While I agree that CPU usage is becoming less and less of an issue, I
think that at this point in the development cycle of btrfs, we still
need to take a hard look at any areas where cpu usage is excessive, and
see if we can keep that to a minimum. This is the main reason we did
runs without checksumming, so we could see a better apple to apple
comparison, not because it is not a useful feature. It will be very
interesting to see how much HW checksumming changes this with Nehelam.
Steve
> Cheers, Andreas
> --
> Andreas Dilger
> Sr. Staff Engineer, Lustre Group
> Sun Microsystems of Canada, Inc.
>
^ permalink raw reply [flat|nested] 10+ messages in thread
* Re: File System Performance results
2008-10-27 14:28 ` Steven Pratt
@ 2008-10-27 14:37 ` Chris Mason
2008-10-27 15:12 ` Steven Pratt
2008-10-27 16:02 ` Theodore Tso
1 sibling, 1 reply; 10+ messages in thread
From: Chris Mason @ 2008-10-27 14:37 UTC (permalink / raw)
To: Steven Pratt; +Cc: Andreas Dilger, linux-fsdevel
On Mon, 2008-10-27 at 09:28 -0500, Steven Pratt wrote:
> Andreas Dilger wrote:
> > On Oct 22, 2008 15:06 -0500, Steven Pratt wrote:
> >
> >> We have set up a new page which is intended mainly for tracking the
> >> performance of BTRFS, but in doing so we are testing other filesystems
> >> as well (ext3, ext4, xfs and jfs). Thought some people here might find
> >> the results useful.
> >>
> >>
> >> The main page is here:
> >>
> >> http://btrfs.boxacle.net/
I meant to ask if this is a permanent site for the results? It might
make sense to add a more generic fsperf.boxacle.net page.
> >>
> >> Information about the machine configuration, tests run, how to reproduce
> >> the run and link to graphs of all the results are provided off of this
> >> page. When looking at any individual test, links are provided to the
> >> detail output from the tests including iostat, mpstat, oprofile data and
> >> more.
> >>
> >
> > Steve,
> > thanks for posting the numbers. They are definitely interesting. On
> > the surface, ext4 is doing quite well overall (yay!),
> Yes, that was good news. Along these lines if there is anything else we
> can do to help out ext4, just let us know.
>
> > but the important
> > point to realize is that btrfs is also providing a lot of extra function
> > under the covers so it isn't necessarily a clear-cut answer on which one
> > to pick.
> >
> > The extra CPU cost of btrfs will become increasingly irrelevant in the
> > future I think.
> >
> While I agree that CPU usage is becoming less and less of an issue, I
> think that at this point in the development cycle of btrfs, we still
> need to take a hard look at any areas where cpu usage is excessive, and
> see if we can keep that to a minimum.
Very true, especially performance results with checksumming off (like
the nodatacow results for random writes).
> This is the main reason we did
> runs without checksumming, so we could see a better apple to apple
> comparison, not because it is not a useful feature. It will be very
> interesting to see how much HW checksumming changes this with Nehelam.
See the btrfs crc header file for the #define to enable the hw assist
mode if you're got the hardware that can do it. In my runs here, it
makes the checksumming free aside from the time spent storing the extra
metadata.
-chris
^ permalink raw reply [flat|nested] 10+ messages in thread
* Re: File System Performance results
2008-10-27 14:37 ` Chris Mason
@ 2008-10-27 15:12 ` Steven Pratt
0 siblings, 0 replies; 10+ messages in thread
From: Steven Pratt @ 2008-10-27 15:12 UTC (permalink / raw)
To: Chris Mason; +Cc: Andreas Dilger, linux-fsdevel
Chris Mason wrote:
> On Mon, 2008-10-27 at 09:28 -0500, Steven Pratt wrote:
>
>> Andreas Dilger wrote:
>>
>>> On Oct 22, 2008 15:06 -0500, Steven Pratt wrote:
>>>
>>>
>>>> We have set up a new page which is intended mainly for tracking the
>>>> performance of BTRFS, but in doing so we are testing other filesystems
>>>> as well (ext3, ext4, xfs and jfs). Thought some people here might find
>>>> the results useful.
>>>>
>>>>
>>>> The main page is here:
>>>>
>>>> http://btrfs.boxacle.net/
>>>>
>
> I meant to ask if this is a permanent site for the results? It might
> make sense to add a more generic fsperf.boxacle.net page.
>
We hope the site is permanent, as long as traffic and volume does not
get too high. I'll see what I can do about the new page.
>
>>>> Information about the machine configuration, tests run, how to reproduce
>>>> the run and link to graphs of all the results are provided off of this
>>>> page. When looking at any individual test, links are provided to the
>>>> detail output from the tests including iostat, mpstat, oprofile data and
>>>> more.
>>>>
>>>>
>>> Steve,
>>> thanks for posting the numbers. They are definitely interesting. On
>>> the surface, ext4 is doing quite well overall (yay!),
>>>
>> Yes, that was good news. Along these lines if there is anything else we
>> can do to help out ext4, just let us know.
>>
>>
>>> but the important
>>> point to realize is that btrfs is also providing a lot of extra function
>>> under the covers so it isn't necessarily a clear-cut answer on which one
>>> to pick.
>>>
>>> The extra CPU cost of btrfs will become increasingly irrelevant in the
>>> future I think.
>>>
>>>
>> While I agree that CPU usage is becoming less and less of an issue, I
>> think that at this point in the development cycle of btrfs, we still
>> need to take a hard look at any areas where cpu usage is excessive, and
>> see if we can keep that to a minimum.
>>
>
> Very true, especially performance results with checksumming off (like
> the nodatacow results for random writes).
>
>
>> This is the main reason we did
>> runs without checksumming, so we could see a better apple to apple
>> comparison, not because it is not a useful feature. It will be very
>> interesting to see how much HW checksumming changes this with Nehelam.
>>
>
> See the btrfs crc header file for the #define to enable the hw assist
> mode if you're got the hardware that can do it. In my runs here, it
> makes the checksumming free aside from the time spent storing the extra
> metadata.
>
>
Great, but you are making me jealous. I've been waiting for my Nehelam
box for months!
Steve
> -chris
>
>
^ permalink raw reply [flat|nested] 10+ messages in thread
* Re: File System Performance results
2008-10-27 14:28 ` Steven Pratt
2008-10-27 14:37 ` Chris Mason
@ 2008-10-27 16:02 ` Theodore Tso
2008-10-27 16:17 ` Steven Pratt
1 sibling, 1 reply; 10+ messages in thread
From: Theodore Tso @ 2008-10-27 16:02 UTC (permalink / raw)
To: Steven Pratt; +Cc: Andreas Dilger, linux-fsdevel
On Mon, Oct 27, 2008 at 09:28:55AM -0500, Steven Pratt wrote:
>> thanks for posting the numbers. They are definitely interesting. On
>> the surface, ext4 is doing quite well overall (yay!),
>
> Yes, that was good news. Along these lines if there is anything else we
> can do to help out ext4, just let us know.
Indeed, thanks Steven, for doing this work; it's much appreciated!
Couple of quick questions/suggestions
*) What version of e2fsprogs did you use for your ext4 tests, and what
if any options did you give to mke2fs when creating the filesystem?
*) For all of the filesystems except for btrfs, where you mentioned
different mount options in use, is it fair to assume that no mount
options were specified so the filesystem defaults were used?
*) With the full knowledge that ext4 will probably not do terribly
well with this suggested change, it would be interesting if you did a
variant of the mail server benchmark where the benchmark code
performed an fsync() before closing the new file, and an fsync on the
containing directory after deleting a file, to simulate what real
MTA's tend to do in order to assure correct SMTP semantics. (Some
won't do the fsync on delete, on the assumption that sending an
e-mail twice is harmless, where as losing an e-mail is completely
unacceptable. In a world where 97% of the mail messages transiting
the backbone is spam, some might disagree with those sentiments, but
the SMTP protocol requires, and most MTA's do perform, an fsync() an
incoming mail message before they acknowledge that they mail message
has been accepted and they are now taking responsibility for sending
the mail message on to its final destination. :-)
In a 2-3 days after I get the ext4 patch queue back into shape after
2.6.28-rc2, it would interesting to get a benchmark run of 2.6.28-rc2
plus the ext4 patch queue with and without the akpm_lock_hack mount
option. I'm not sure how much time/effort it takes to do a complete
set of benchmark runs for a different kernel version and/or mount
option, but there are definitely a number of experiments where it
would be very useful to crank through your benchmark systems, and as
long as it doesn't burden your primary benchmarking objectings overly
much, it would be great to see what those experiments run on your
benchmarking setup would turn up.
Thanks, regards,
- Ted
^ permalink raw reply [flat|nested] 10+ messages in thread
* Re: File System Performance results
2008-10-27 16:02 ` Theodore Tso
@ 2008-10-27 16:17 ` Steven Pratt
0 siblings, 0 replies; 10+ messages in thread
From: Steven Pratt @ 2008-10-27 16:17 UTC (permalink / raw)
To: Theodore Tso; +Cc: Andreas Dilger, linux-fsdevel
Theodore Tso wrote:
> On Mon, Oct 27, 2008 at 09:28:55AM -0500, Steven Pratt wrote:
>
>>> thanks for posting the numbers. They are definitely interesting. On
>>> the surface, ext4 is doing quite well overall (yay!),
>>>
>> Yes, that was good news. Along these lines if there is anything else we
>> can do to help out ext4, just let us know.
>>
>
> Indeed, thanks Steven, for doing this work; it's much appreciated!
>
> Couple of quick questions/suggestions
>
> *) What version of e2fsprogs did you use for your ext4 tests,
[root@btrfs1 ~]# mkfs.ext4dev -V
mke2fs 1.41.2 (02-Oct-2008)
Using EXT2FS Library version 1.41.2
[root@btrfs1
~]#
> and what
> if any options did you give to mke2fs when creating the filesystem?
>
Basically none.
'mkfs.ext4dev -F /dev/ffsbdev1'
> *) For all of the filesystems except for btrfs, where you mentioned
> different mount options in use, is it fair to assume that no mount
> options were specified so the filesystem defaults were used?
>
Right, except where noted on btrfs, no mount options were used on any
filesystem.
> *) With the full knowledge that ext4 will probably not do terribly
> well with this suggested change, it would be interesting if you did a
> variant of the mail server benchmark where the benchmark code
> performed an fsync() before closing the new file, and an fsync on the
> containing directory after deleting a file, to simulate what real
> MTA's tend to do in order to assure correct SMTP semantics. (Some
> won't do the fsync on delete, on the assumption that sending an
> e-mail twice is harmless, where as losing an e-mail is completely
> unacceptable. In a world where 97% of the mail messages transiting
> the backbone is spam, some might disagree with those sentiments, but
> the SMTP protocol requires, and most MTA's do perform, an fsync() an
> incoming mail message before they acknowledge that they mail message
> has been accepted and they are now taking responsibility for sending
> the mail message on to its final destination. :-)
>
Yes, Chris Mason had already requested this. Should have it this week.
> In a 2-3 days after I get the ext4 patch queue back into shape after
> 2.6.28-rc2, it would interesting to get a benchmark run of 2.6.28-rc2
> plus the ext4 patch queue with and without the akpm_lock_hack mount
> option. I'm not sure how much time/effort it takes to do a complete
> set of benchmark runs for a different kernel version and/or mount
> option, but there are definitely a number of experiments where it
> would be very useful to crank through your benchmark systems, and as
> long as it doesn't burden your primary benchmarking objectings overly
> much, it would be great to see what those experiments run on your
> benchmarking setup would turn up.
>
It is not much effort at all. Runs take about 3 hours per filesystem
(for currently defined set of benchmarks) so we can complete full set in
about 15hours. It is all automated, so assuming no breakage, about 30
minutes to set up for a new set of runs, and 30 minutes after all
complete to verify and graph it all.
If there are specific runs you would like to see, just let me know
specifics.
Steve
> Thanks, regards,
>
> - Ted
>
^ permalink raw reply [flat|nested] 10+ messages in thread
* Re: File System Performance results
2008-10-22 20:06 File System Performance results Steven Pratt
2008-10-25 9:15 ` Andreas Dilger
@ 2008-10-29 19:26 ` Chris Mason
2008-10-29 20:05 ` Steven Pratt
1 sibling, 1 reply; 10+ messages in thread
From: Chris Mason @ 2008-10-29 19:26 UTC (permalink / raw)
To: Steven Pratt; +Cc: linux-fsdevel
On Wed, 2008-10-22 at 15:06 -0500, Steven Pratt wrote:
> We have set up a new page which is intended mainly for tracking the
> performance of BTRFS, but in doing so we are testing other filesystems
> as well (ext3, ext4, xfs and jfs). Thought some people here might find
> the results useful.
I think I understand the bad read performance in btrfs. I was forcing a
tiny max readahead size.
The current git tree has fixes for it, along with a ton of new code.
-chris
^ permalink raw reply [flat|nested] 10+ messages in thread
* Re: File System Performance results
2008-10-29 19:26 ` Chris Mason
@ 2008-10-29 20:05 ` Steven Pratt
2008-10-29 21:49 ` Steven Pratt
0 siblings, 1 reply; 10+ messages in thread
From: Steven Pratt @ 2008-10-29 20:05 UTC (permalink / raw)
To: Chris Mason; +Cc: linux-fsdevel
Chris Mason wrote:
> On Wed, 2008-10-22 at 15:06 -0500, Steven Pratt wrote:
>
>> We have set up a new page which is intended mainly for tracking the
>> performance of BTRFS, but in doing so we are testing other filesystems
>> as well (ext3, ext4, xfs and jfs). Thought some people here might find
>> the results useful.
>>
>
> I think I understand the bad read performance in btrfs. I was forcing a
> tiny max readahead size.
>
> The current git tree has fixes for it, along with a ton of new code.
>
Ok, we'll kick off some new runs.
Also, just need to push out the data fro odirect random write and mail
server with fsync.
Steve
> -chris
>
>
^ permalink raw reply [flat|nested] 10+ messages in thread
* Re: File System Performance results
2008-10-29 20:05 ` Steven Pratt
@ 2008-10-29 21:49 ` Steven Pratt
0 siblings, 0 replies; 10+ messages in thread
From: Steven Pratt @ 2008-10-29 21:49 UTC (permalink / raw)
To: Chris Mason; +Cc: linux-fsdevel, linux-btrfs
Steven Pratt wrote:
> Chris Mason wrote:
>> On Wed, 2008-10-22 at 15:06 -0500, Steven Pratt wrote:
>>
>>> We have set up a new page which is intended mainly for tracking the
>>> performance of BTRFS, but in doing so we are testing other
>>> filesystems as well (ext3, ext4, xfs and jfs). Thought some people
>>> here might find the results useful.
>>>
>>
>> I think I understand the bad read performance in btrfs. I was forcing a
>> tiny max readahead size.
>>
>> The current git tree has fixes for it, along with a ton of new code.
>>
> Ok, we'll kick off some new runs.
>
> Also, just need to push out the data fro odirect random write and mail
> server with fsync.
> Steve
We have completed testing on the random write tests with odirect.
Results can be found at:
Single disk:
http://btrfs.boxacle.net/repository/single-disk/Oct28-odirect-random-writes/Oct28-odirect-random-writes.html
Raid:
http://btrfs.boxacle.net/repository/raid/Oct28-odirect-random-writes/Oct28-odirect-random-writes.html
Also, the changes to the mail server emulation to use fsync on the
creates is also complete:
Single disk:
http://btrfs.boxacle.net/repository/single-disk/Oct28-mail-server-fsync-creates/Oct28-mail-server-fsync-creates.html
Raid:
http://btrfs.boxacle.net/repository/raid/Oct28-mail-server-fsync-creates/mail-server-fsync-creates.html
Steve
>
>> -chris
>>
>>
>
> --
> To unsubscribe from this list: send the line "unsubscribe
> linux-fsdevel" in
> the body of a message to majordomo@vger.kernel.org
> More majordomo info at http://vger.kernel.org/majordomo-info.html
^ permalink raw reply [flat|nested] 10+ messages in thread
end of thread, other threads:[~2008-10-29 21:49 UTC | newest]
Thread overview: 10+ messages (download: mbox.gz follow: Atom feed
-- links below jump to the message on this page --
2008-10-22 20:06 File System Performance results Steven Pratt
2008-10-25 9:15 ` Andreas Dilger
2008-10-27 14:28 ` Steven Pratt
2008-10-27 14:37 ` Chris Mason
2008-10-27 15:12 ` Steven Pratt
2008-10-27 16:02 ` Theodore Tso
2008-10-27 16:17 ` Steven Pratt
2008-10-29 19:26 ` Chris Mason
2008-10-29 20:05 ` Steven Pratt
2008-10-29 21:49 ` Steven Pratt
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).