From: Willy Tarreau <w@1wt.eu>
To: Daniel Phillips <phillips@phunq.net>
Cc: Alan Cox <alan@lxorguk.ukuu.org.uk>,
David Newall <davidn@davidnewall.com>,
linux-kernel@vger.kernel.org
Subject: Re: [ANNOUNCE] Ramback: faster than a speeding bullet
Date: Sat, 15 Mar 2008 22:54:27 +0100
Message-ID: <20080315215427.GC13012@1wt.eu>
In-Reply-To: <200803151417.13899.phillips@phunq.net>
On Sat, Mar 15, 2008 at 01:17:13PM -0800, Daniel Phillips wrote:
> On Saturday 15 March 2008 13:59, Willy Tarreau wrote:
> > On Thu, Mar 13, 2008 at 11:14:39AM -0800, Daniel Phillips wrote:
> > > On Thursday 13 March 2008 06:22, Alan Cox wrote:
> > > > ...Ext3 cannot recover well from massive loss of intermediate
> > > > writes. It isn't a normal failure mode and there isn't sufficient fs
> > > > metadata robustness for this. A log structured backing store would deal
> > > > with that but all you apparently want to do is scream FUD at anyone who
> > > > doesn't agree with you.
> > >
> > > Scream is an exaggeration, and FUD only applies to somebody who
> > > consistently overlooks the primary proposition in this design: that the
> > > battery backed power supply, computer hardware and Linux are reliable
> > > enough to entrust your data to them. I say this is practical, you say
> > > it is impossible, I say FUD.
> > >
> > > All you are proposing is that nobody can entrust their data to any
> > > hardware. Good point. There is no absolute reliability, only degrees
> > > of it.
> > >
> > > Many raid controllers now have battery backed writeback cache, which
> > > is exactly the same reliability proposition as ramback, on a smaller
> > > scale. Do you refuse to entrust your corporate data to such
> > > controllers?
> >
> > RAID controllers do not have half a terabyte of RAM.
>
> And? Either you have battery backed ram with critical data in it or
> you do not. Exactly how much makes little difference to the question.
It completely changes the method used to power it and the time the data
may remain in RAM. The Smart 3200 I have right here simply has lithium
batteries directly connected to the static RAM chips. Very low risk of
power failure. The way you presented your work shows that it relies on
a UPS to sustain the PC's power supply, which in turn keeps the PC
alive, which in turn tries not to reboot to keep its RAM consistent.
That leaves a lot of places for a failure to occur.

Don't get me wrong, I still think your project has a lot of uses. But
you have to admit that there are huge differences between using it in
an appliance with battery-backed RAM which is able to recover data after
a system crash, power outage or anything, and the average Joe's PC set up
as an NFS server for the company with a cheap UPS to try not to lose the
data should a power outage occur.

I think it could get major adoption with ordered writes.
> > Also, you are always
> > invited to choose between speed (write back) and reliability (write through).
>
> As is the case with ramback. Just echo 1 >/proc/driver/ramback/<name>.
>
> > Also, please note that the problem here is not related to the number of
> > nines of availability. This number only counts the ratio between uptime
> > and downtime. We're more facing a problem of MTBF, where the consequences
> > of a failure are hard to predict.
>
> That is why I keep recommending that a ramback setup be replicated or
> mirrored, which people in this thread keep glossing over. When
> replicated or mirrored, you still get the microsecond-level transaction
> times, and you get the safety too.
I agree, but in this case, you should present it this way. You have been
insisting too much on the average PC's reliability, the fact that no kernel
ever crashed for you, etc. So you are demonstrating that your product is
good provided that everything goes perfectly. All the people who have
experienced software or hardware problems in the past (i.e. almost everyone
here) will not trust your code because it relies on prerequisites they know
they do not have.
> Then there is a big class of applications where the data on the ramdisk
> can be reconstructed, it is just a pain and reduces uptime. These are
> potential ramback users, and in fact I will be one of those, using it
> on my kernel hacking partition.
>
> > What I'm thinking about is that considering the fact that storage
> > technologies are moving towards SSD (and I think 2008 will be the
> > year of SSD), you should implement ordered writes (I've not said
> > write through) since there's no seek time on those devices. Thus
> > you will have the speed of RAM with the reliability of a properly
> > synced FS. If your system crashes once a week, it will not be a
> > problem anymore.
>
> There will be a whole bunch of patches from me that are SSD oriented,
> over time. The fact is, enterprise scale ramdisks are here now, while
> enterprise scale flash is not. Getting close, but not here. And flash
> does not approach the write performance of RAM, not now and probably
> not ever.
My goal is not to replace RAM with flash, but disk with flash. You are
against ordered writes for performance reasons. Use SSDs instead of
hard drives and ordered writes will be as fast as sequential writes.
Also, when you say that enterprise scale flash is not there, I don't
agree. You can already afford hundreds of gigs of flash in a 3.5" form
factor. A 1.6 TB SSD has even been presented at CES 2008, with sales
announced for Q3. So clearly this will replace your hard drives soon,
very soon. Even if it costs $5k, that's a very acceptable solution to
replace a disk in a RAM-speed appliance.
Willy