All of lore.kernel.org
 help / color / mirror / Atom feed
From: Adrian Bunk <bunk@stusta.de>
To: "Tomasz Kłoczko" <kloczek@rudy.mif.pg.gda.pl>
Cc: Mike Snitzer <snitzer@gmail.com>, Neil Brown <neilb@suse.de>,
	"David R. Litwin" <presently42@gmail.com>,
	linux-kernel@vger.kernel.org
Subject: Re: ZFS with Linux: An Open Plea
Date: Mon, 16 Apr 2007 19:21:55 +0200	[thread overview]
Message-ID: <20070416172155.GF10338@stusta.de> (raw)
In-Reply-To: <Pine.BSO.4.63.0704161633320.11088@rudy.mif.pg.gda.pl>

On Mon, Apr 16, 2007 at 05:27:51PM +0200, Tomasz Kłoczko wrote:
> On Mon, 16 Apr 2007, Adrian Bunk wrote:
>
>> On Mon, Apr 16, 2007 at 04:01:23PM +0200, Tomasz Kłoczko wrote:
>>> ...
>>> Few days ago I'm swich two backup servers with few TB storage from Linux 
>>> to
>>> Solaris .. only because client want use ZFS .. because ZFS is EXCELENT 
>>> for
>>> this kind tasks (only because it allow save many thousands of
>>> <put_your_currency_name> because it allow better utilize the same 
>>> storage)
>>> and trust me .. cases like this will be more .. much more and changing
>>> licensing Linux code will be MUCH more easier than reinventing wheel 
>>> (wich
>>> will be reimplemnting ZFS under GPL).
>>
>> This is a technical mailing list, so let's start with technical
>> arguments:
>>
>> Why did this client want to use ZFS?
>
> Because switching to Solaris was chipper than buying next faster FC/SCSI 
> storage. Simple ?
>
>> Because his boss was convinced by a marketing guy that ZFS was the best
>> invention since sliced bread?
>
> In this scenario ther is no place for "marketing guy" .. try again .. 
> (maybe it can occure in US or Germany but trust me .. not in Poland 8-)
>
>> Or due to technical limitations in what Linux currently offers
>> resulting in ZFS bringing him direct advantages on these servers?
>
> Yes .. it is Linux limitiations because it is very hard to provide 
> simultanouse streams of backup data with threaded compression (using in 
> this case pbzip2) with good CPUs utilization because most streams waits on 
> I/Os and most of CPUs are not fully utilized. All this becase single stram 
> of compressed data can't be easy dinamically switched to another (not busy) 
> disk in JBOD. ZFS by two level allocation (on device and block level) will 
> not wait for finish I/O but will try use another/not busy device in ZFS 
> pool. This is *main* reason integrate in one layer VFS and LVM in case ZFS. 
> By integrate this two layers you can make deciion where data will be 
> written depending on *current* devices utilization. In all other "classic" 
> ways you will break layered OS model .. so in ZFS case conclution like "we 
> must integrate this two layers in one" it is not bug but feacture and was 
> FUNDAMENTAL.

You are having IO problems doing bzip2 ???

This sounds as if your application is doing something silly like e.g. 
using O_DIRECT or you are mounting your filesystems with "-o sync".

On my 1.8 GHz Athlon, it takes 2.5 minutes to compress a 250 MB kernel 
tar to 40 MB - and this is with the data cached in RAM, so no IO 
involved. That's 1.7 MB/s resp. 0.3 MB/s.

I'm even having problems to imagine IO problems with 4 or 8 CPU 
machines doing parallel bzip2 to a single disk - with any filesystem.

> This is not all .. backup data must be safe in best possible form .. in 
> time .. and it mean in this case that checksumming is NECCESSARY. Look .. 
> ext4 for now only have plans for implemting checksuming 
> (http://www.bullopensource.org/ext4/files/ext4.txt) and ATM on Linux there 
> is no FS with this kind abilities .. so yes again: this was Linux technical 
> limitation.

bzip2 already checksums all data, so why do you need a second checksum 
at the filesystem level for your backups?

>>> Problem is not on technical area but on licensing and it is plain Linux
>>> word problem because neccessary in this case changes on CDDL side will 
>>> make
>>> this code less oppended than now .. so you can (probably ?) forget about
>>> GPLing ZFS code (and ZFS it is not all what will good to have from Sol in
>>> Linux).
>>> IMO current Linux licensing less is importand than bringing in possible
>>> simpler way things like ZFS to Linux. So best/simpler way will be start
>>> change Linux licensing for save all GPL goodies and allow interract with
>>> code on license like CDDL.
>>>
>>> Licensing is for allow keep in best possible form Linux. If it can't do
>>> this in best possible way it must be change (must evolve .. like many 
>>> othes
>>> things around).
>>
>> There are at about 10.000 people who contributed to the Linux kernel,
>> some of them unreachable or even dead.
>
> Do you know who was Paracelsus ? He was medic hundriet years ago. They 
> discover and verbalise some kind fundamental (?) law for medicine which can 
> be used not only on medicine area. He sayd "kills not subtance but dose of 
> substance". So anything can kill you/animal/project .. you can kill someon 
> also using oxigen (not only low level of oxigen kills but also to much can 
> kill). Try to think on how this law on how many diffret ways can be 
> trasformed/appied to this kind of arguments. Look on how many developers 
> migrate to another unices in last few years (count only two for simplicity 
> like Solaris and MOX). Try looking for public forums statistics for example 
> Linux vs. Solaris and after this try to answer on "is it 10k is it realy 
> big number in this case or not ?" (IIRC google provides very good tools for 
> anyone who want this kind answers).

I don't need Google for this.

It's easy to extract from git that patches that were applied to the 
Linux kernel during the last 2 years contained 3196 unique 
Signed-off-by: lines.

Some people might have lines with different email addresses, but this 
still makes > 2000 contributors during the last 2 years alone. Plus all 
the other people who contributed during the first 14 years of Linux 
kernel development but didn't during the last 2 years.

You could argue whether there were really 10.000 people or "only"
5.000 people who contributed code to the Linux kernel, but 
that doesn't make a real difference.

> kloczek

cu
Adrian

-- 

       "Is there not promise of rain?" Ling Tan asked suddenly out
        of the darkness. There had been need of rain for many days.
       "Only a promise," Lao Er said.
                                       Pearl S. Buck - Dragon Seed


  reply	other threads:[~2007-04-16 17:21 UTC|newest]

Thread overview: 83+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2007-04-13 23:18 ZFS with Linux: An Open Plea David R. Litwin
2007-04-13 23:43 ` Neil Brown
2007-04-14 12:16   ` Christoph Hellwig
2007-04-14 14:04   ` Mike Snitzer
2007-04-14 20:53     ` Jan Engelhardt
2007-04-16  9:40       ` Tomasz Kłoczko
2007-04-16 11:19         ` John Anthony Kazos Jr.
2007-04-16 14:02         ` Stefan Richter
2007-04-16 14:20           ` Tomasz Kłoczko
2007-04-16 14:55             ` Christoph Hellwig
2007-04-16 15:46               ` Tomasz Kłoczko
2007-04-16 15:59                 ` Christoph Hellwig
2007-04-16 19:02                 ` Diego Calleja
2007-04-16 20:18                   ` Tomasz Kłoczko
2007-04-18 17:25                     ` Lennart Sorensen
2007-04-18 17:39                       ` Jeff Garzik
2007-04-27  5:21                       ` Valerie Henson
2007-04-27 21:57                         ` Matt Mackall
2007-04-16 19:46                 ` Jörn Engel
2007-04-16 18:19             ` Stefan Richter
2007-04-16 19:21               ` Bernd Eckenfels
2007-04-16 19:26                 ` Lee Revell
2007-04-16 20:20                   ` Bernd Eckenfels
2007-04-16 20:15                 ` Stefan Richter
2007-04-14 21:13     ` Bill Huey
2007-04-16  9:58     ` Tomasz Kłoczko
     [not found]       ` <170fa0d20704160507w4af4cb92ua259a55789f95c3e@mail.gmail.com>
2007-04-16 14:01         ` Tomasz Kłoczko
2007-04-16 14:30           ` Adrian Bunk
2007-04-16 15:27             ` Tomasz Kłoczko
2007-04-16 17:21               ` Adrian Bunk [this message]
2007-04-14 18:56   ` Krzysztof Halasa
2007-04-16  3:00     ` David Chinner
2007-04-15  4:16 ` Kasper Sandberg
2007-04-15 21:58 ` Jesper Juhl
2007-05-02 15:03 ` Tomasz Kłoczko
2007-05-02 15:42   ` Alan Cox
2007-05-02 20:53     ` Theodore Tso
  -- strict thread matches above, loose matches on Subject: below --
2007-04-14 17:40 Ignatich
2007-04-15 12:44 ` Nikita Danilov
2007-04-17 14:14   ` Alan Cox
2007-04-15  8:54 David R. Litwin
2007-04-16  0:50 ` Rik van Riel
2007-04-16  3:07   ` David Chinner
2007-04-15  8:57 David R. Litwin
2007-04-15 17:34 ` Kasper Sandberg
2007-04-17  6:54 David R. Litwin
2007-04-17  8:18 ` Miklos Szeredi
2007-04-17 13:10 ` Theodore Tso
2007-04-17 13:47   ` Tomasz Kłoczko
2007-04-17 13:59     ` Matthew Garrett
2007-04-17 15:46       ` Tomasz Kłoczko
2007-04-17 15:59         ` Alan Cox
2007-04-17 16:29         ` Daniel Hazelton
2007-04-17 19:58           ` Tomasz Kłoczko
2007-04-17 22:19             ` Daniel Hazelton
2007-04-17 22:12               ` David Lang
2007-04-17 22:52                 ` Daniel Hazelton
2007-04-17 22:38               ` Roland Dreier
2007-04-17 14:06     ` Erik Mouw
2007-04-17 14:32     ` John Anthony Kazos Jr.
2007-04-17 15:41       ` Tomasz Kłoczko
2007-04-17 16:02         ` John Anthony Kazos Jr.
2007-04-17 14:37     ` Diego Calleja
2007-04-17 14:48     ` Alan Cox
2007-04-17 15:06       ` Ricardo Correia
2007-04-17 15:23         ` Xavier Bestel
2007-04-17 15:30           ` Ricardo Correia
2007-04-17 15:36             ` Alan Cox
2007-04-17 16:02       ` Mike Snitzer
2007-04-17 16:57       ` Alistair John Strachan
2007-04-18 11:10       ` Manoj Joseph
2007-04-18 11:23         ` Alan Cox
2007-04-18 11:32           ` Manoj Joseph
2007-04-17 16:22     ` Daniel Hazelton
2007-04-17 17:50       ` Theodore Tso
2007-04-17 19:24         ` Florian Weimer
2007-04-17 19:56           ` Ricardo Correia
2007-04-17 20:05             ` Ricardo Correia
2007-04-17 14:59   ` linux-os (Dick Johnson)
2007-04-17 15:08     ` Xavier Bestel
2007-04-17 15:12       ` linux-os (Dick Johnson)
2007-04-17 15:29     ` Michal Schmidt
2007-04-17  8:42 David R. Litwin

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=20070416172155.GF10338@stusta.de \
    --to=bunk@stusta.de \
    --cc=kloczek@rudy.mif.pg.gda.pl \
    --cc=linux-kernel@vger.kernel.org \
    --cc=neilb@suse.de \
    --cc=presently42@gmail.com \
    --cc=snitzer@gmail.com \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.