From: Gordan Bobic <gordan-UpbECiGlrmGsTnJN9+BGXg@public.gmane.org>
To: Vincent Diepeveen <diep-qWit8jRvyhVmR6Xm/wNWPw@public.gmane.org>
Cc: linux-nilfs-u79uwXL29TY76Z2rM5mHXA@public.gmane.org
Subject: Re: SSD and non-SSD Suitability
Date: Fri, 28 May 2010 11:44:04 +0100 [thread overview]
Message-ID: <4BFF9E74.6040900@bobich.net> (raw)
In-Reply-To: <72C0FCE6-CE1A-4262-B89F-A1C3CBA99EAD-qWit8jRvyhVmR6Xm/wNWPw@public.gmane.org>
Vincent Diepeveen wrote:
>>>> 1) Modern SSDs (e.g. Intel) do this logical/physical mapping
>>>> internally, so that the writes happen sequentially anyway.
>>> Could you explain that, as far as i know modern SSD's have 8
>>> independant channels to do read and writes, which is why they are
>>> having that big read and write speed and can in theory therefore
>>> support 8 threads doing reads and writes. Each channel say using
>>> blocks of 4KB, so it's 64KB in total.
>>
>> I'm talking about something else. I'm talking about the fact that you
>> can turn logical random writes into physical sequential writes by
>> re-mapping logical blocks to sequential physical blocks.
>
> That's doing 2 steps back in history isn't it?
Sorry, I don't see what you mean. Can you elaborate?
> The big speedup that SSD's deliver for average usage is ESPECIALLY
> because of the faster random access to the hardware.
Sure - on reads. Writes are a different beast. Look at some reviews of
SSDs of various types and generations. Until relatively recently, random
write performance (and to a large extent, any write performance) on them
has been very poor. Cheap flash media (e.g. USB sticks) still suffers
from this.
Don't confuse fast random reads with fast random writes.
> if you have some petabytes of storage, i guess the bigger bandwidth that
> SSD's deliver is not relevant, as the limitation
> is the network bandwidth anyway, so some raid5 with extra spare will
> deliver more than sufficient bandwidth.
RAID3/4/5/6 is inherently unsuitable for fast random writes because if a
write-read-write cycle required to update the parity.
>>> So a file system should benefit from the special properties of a SSD
>>> to be suited for this modern hardware.
>>
>> The only actual benefit is decreased latency.
>
> Which is mighty important; so the ONLY interesting type of filesystem
> for a SSD is a filesystem
> that is optimized for read and write latency rather than bandwidth IMHO.
Indeed, I agree (up to a point). Random IOPS has long been the defining
measure of disk performance for a reason.
> Especially read latency i consider most important.
Depends on your application. Remember that reads can be sped up by caching.
I look after a number of systems running applications that are
write-bound because the vast majority of reads can be satisfied from
page cache, but writes are unavoidable because transactions have to be
committed to persistent storage.
You cannot limit your performance assessment to the use-case of an
average desktop user running Firefox, Thunderbird and OpenOffice 99% of
the time. Those are not the users that file systems advances of the past
30 years are aimed at.
>>> Of course i understand you skip ext4 as that obviously still has to
>>> get bugfixed.
>>
>> It seems to be deemed stable enough for several distros, and will be
>> the default in RHEL6 in a few months' time, so that's less of a concern.
>>
>
> I ran into severe problems with ext4 and i just used it at 1 harddrive,
> same experiences with other linux users.
How recently have you tried it? RHEL6b has only been out for a month.
> Note i used ubuntu.
I guess that explains some of your desktop-centric views.
> Stuff like RHEL is more expensive a copy than i have at my bank account.
RHEL6b is a public beta, freely downloadable.
CentOS is a community recompile of RHEL, 100% binary compatible, just
with different artwork/logos. Freely available. As is Scientific Linux
(a very similar project to CentOS, also a free recompile of RHEL). If
you haven't found them, you can't have looked very hard.
>> I am more interested in metrics for how much writing is required
>> relative to the amount of data being transferred. For example, if I am
>> restoring a full running system (call it 5GB) from a tar ball onto
>> nilfs2, ext2, ext3, btrfs, etc., I am interested in how many blocks
>> worth of writes actually hit the disk, and to a lesser extent how many
>> of those end up being merged together (since merged operations, in
>> theory, can cause less wear on an SSD because bigger blocks can be
>> handle more efficiently if erasing is required.
>
> The most efficient blocksize for SSD's is 8 channels of 4KB blocks.
I'm not going to bite and get involved in debating the correctness of
this (somewhat limited) view. I'll just point out that it bears very
little relevant to the paragraph that it appears to be responding to.
Gordan
--
To unsubscribe from this list: send the line "unsubscribe linux-nilfs" in
the body of a message to majordomo-u79uwXL29TY76Z2rM5mHXA@public.gmane.org
More majordomo info at http://vger.kernel.org/majordomo-info.html
next prev parent reply other threads:[~2010-05-28 10:44 UTC|newest]
Thread overview: 19+ messages / expand[flat|nested] mbox.gz Atom feed top
2010-05-26 10:18 SSD and non-SSD Suitability Gordan Bobic
[not found] ` <4BFCF55A.80205-UpbECiGlrmGsTnJN9+BGXg@public.gmane.org>
2010-05-28 6:29 ` Jiro SEKIBA
[not found] ` <87typspmiq.wl%jir-27yqGEOhnJbQT0dZR+AlfA@public.gmane.org>
2010-05-28 9:50 ` Gordan Bobic
[not found] ` <4BFF91E7.9000102-UpbECiGlrmGsTnJN9+BGXg@public.gmane.org>
2010-05-29 7:31 ` Jiro SEKIBA
[not found] ` <87d3wf17vj.wl%jir-27yqGEOhnJbQT0dZR+AlfA@public.gmane.org>
2010-05-29 7:50 ` David Arendt
[not found] ` <4C00C745.6050903-/LHdS3kC8BfYtjvyW6yDsg@public.gmane.org>
2010-05-29 8:45 ` Gordan Bobic
[not found] ` <4C00D433.2010406-UpbECiGlrmGsTnJN9+BGXg@public.gmane.org>
2010-05-29 8:56 ` David Arendt
2010-05-29 8:43 ` Gordan Bobic
[not found] ` <4C00D3B7.8060904-UpbECiGlrmGsTnJN9+BGXg@public.gmane.org>
2010-06-01 13:05 ` Jiro SEKIBA
2010-05-28 8:17 ` Vincent Diepeveen
[not found] ` <927E6E4B-B072-42EE-915A-FD34A88D478A-qWit8jRvyhVmR6Xm/wNWPw@public.gmane.org>
2010-05-28 9:24 ` Gordan Bobic
[not found] ` <4BFF8BD6.7080802-UpbECiGlrmGsTnJN9+BGXg@public.gmane.org>
2010-05-28 10:15 ` Vincent Diepeveen
[not found] ` <72C0FCE6-CE1A-4262-B89F-A1C3CBA99EAD-qWit8jRvyhVmR6Xm/wNWPw@public.gmane.org>
2010-05-28 10:44 ` Gordan Bobic [this message]
[not found] ` <4BFF9E74.6040900-UpbECiGlrmGsTnJN9+BGXg@public.gmane.org>
2010-05-28 12:33 ` Vincent Diepeveen
[not found] ` <BF3C6199-02BC-415A-B028-E856312FB2DD-qWit8jRvyhVmR6Xm/wNWPw@public.gmane.org>
2010-05-28 13:36 ` Gordan Bobic
[not found] ` <4BFFC6FA.8010208-UpbECiGlrmGsTnJN9+BGXg@public.gmane.org>
2010-05-28 14:31 ` Vincent Diepeveen
[not found] ` <20C856F0-0CEB-45B9-A668-C07C89A7D338-qWit8jRvyhVmR6Xm/wNWPw@public.gmane.org>
2010-05-28 15:36 ` Gordan Bobic
2010-05-28 12:45 ` Vincent Diepeveen
[not found] ` <A3BB0C84-D2BD-4119-9296-0A4D9FC02F19-qWit8jRvyhVmR6Xm/wNWPw@public.gmane.org>
2010-05-28 13:39 ` Gordan Bobic
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=4BFF9E74.6040900@bobich.net \
--to=gordan-upbeciglrmgstnjn9+bgxg@public.gmane.org \
--cc=diep-qWit8jRvyhVmR6Xm/wNWPw@public.gmane.org \
--cc=linux-nilfs-u79uwXL29TY76Z2rM5mHXA@public.gmane.org \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox