public inbox for linux-kernel@vger.kernel.org
 help / color / mirror / Atom feed
From: Adrian Bunk <bunk@stusta.de>
To: "Tomasz Kłoczko" <kloczek@rudy.mif.pg.gda.pl>
Cc: Mike Snitzer <snitzer@gmail.com>, Neil Brown <neilb@suse.de>,
	"David R. Litwin" <presently42@gmail.com>,
	linux-kernel@vger.kernel.org
Subject: Re: ZFS with Linux: An Open Plea
Date: Mon, 16 Apr 2007 19:21:55 +0200	[thread overview]
Message-ID: <20070416172155.GF10338@stusta.de> (raw)
In-Reply-To: <Pine.BSO.4.63.0704161633320.11088@rudy.mif.pg.gda.pl>

On Mon, Apr 16, 2007 at 05:27:51PM +0200, Tomasz Kłoczko wrote:
> On Mon, 16 Apr 2007, Adrian Bunk wrote:
>
>> On Mon, Apr 16, 2007 at 04:01:23PM +0200, Tomasz Kłoczko wrote:
>>> ...
>>> Few days ago I'm swich two backup servers with few TB storage from Linux 
>>> to
>>> Solaris .. only because client want use ZFS .. because ZFS is EXCELENT 
>>> for
>>> this kind tasks (only because it allow save many thousands of
>>> <put_your_currency_name> because it allow better utilize the same 
>>> storage)
>>> and trust me .. cases like this will be more .. much more and changing
>>> licensing Linux code will be MUCH more easier than reinventing wheel 
>>> (wich
>>> will be reimplemnting ZFS under GPL).
>>
>> This is a technical mailing list, so let's start with technical
>> arguments:
>>
>> Why did this client want to use ZFS?
>
> Because switching to Solaris was chipper than buying next faster FC/SCSI 
> storage. Simple ?
>
>> Because his boss was convinced by a marketing guy that ZFS was the best
>> invention since sliced bread?
>
> In this scenario ther is no place for "marketing guy" .. try again .. 
> (maybe it can occure in US or Germany but trust me .. not in Poland 8-)
>
>> Or due to technical limitations in what Linux currently offers
>> resulting in ZFS bringing him direct advantages on these servers?
>
> Yes .. it is Linux limitiations because it is very hard to provide 
> simultanouse streams of backup data with threaded compression (using in 
> this case pbzip2) with good CPUs utilization because most streams waits on 
> I/Os and most of CPUs are not fully utilized. All this becase single stram 
> of compressed data can't be easy dinamically switched to another (not busy) 
> disk in JBOD. ZFS by two level allocation (on device and block level) will 
> not wait for finish I/O but will try use another/not busy device in ZFS 
> pool. This is *main* reason integrate in one layer VFS and LVM in case ZFS. 
> By integrate this two layers you can make deciion where data will be 
> written depending on *current* devices utilization. In all other "classic" 
> ways you will break layered OS model .. so in ZFS case conclution like "we 
> must integrate this two layers in one" it is not bug but feacture and was 
> FUNDAMENTAL.

You are having IO problems doing bzip2 ???

This sounds as if your application is doing something silly like e.g. 
using O_DIRECT or you are mounting your filesystems with "-o sync".

On my 1.8 GHz Athlon, it takes 2.5 minutes to compress a 250 MB kernel 
tar to 40 MB - and this is with the data cached in RAM, so no IO 
involved. That's 1.7 MB/s resp. 0.3 MB/s.

I'm even having problems to imagine IO problems with 4 or 8 CPU 
machines doing parallel bzip2 to a single disk - with any filesystem.

> This is not all .. backup data must be safe in best possible form .. in 
> time .. and it mean in this case that checksumming is NECCESSARY. Look .. 
> ext4 for now only have plans for implemting checksuming 
> (http://www.bullopensource.org/ext4/files/ext4.txt) and ATM on Linux there 
> is no FS with this kind abilities .. so yes again: this was Linux technical 
> limitation.

bzip2 already checksums all data, so why do you need a second checksum 
at the filesystem level for your backups?

>>> Problem is not on technical area but on licensing and it is plain Linux
>>> word problem because neccessary in this case changes on CDDL side will 
>>> make
>>> this code less oppended than now .. so you can (probably ?) forget about
>>> GPLing ZFS code (and ZFS it is not all what will good to have from Sol in
>>> Linux).
>>> IMO current Linux licensing less is importand than bringing in possible
>>> simpler way things like ZFS to Linux. So best/simpler way will be start
>>> change Linux licensing for save all GPL goodies and allow interract with
>>> code on license like CDDL.
>>>
>>> Licensing is for allow keep in best possible form Linux. If it can't do
>>> this in best possible way it must be change (must evolve .. like many 
>>> othes
>>> things around).
>>
>> There are at about 10.000 people who contributed to the Linux kernel,
>> some of them unreachable or even dead.
>
> Do you know who was Paracelsus ? He was medic hundriet years ago. They 
> discover and verbalise some kind fundamental (?) law for medicine which can 
> be used not only on medicine area. He sayd "kills not subtance but dose of 
> substance". So anything can kill you/animal/project .. you can kill someon 
> also using oxigen (not only low level of oxigen kills but also to much can 
> kill). Try to think on how this law on how many diffret ways can be 
> trasformed/appied to this kind of arguments. Look on how many developers 
> migrate to another unices in last few years (count only two for simplicity 
> like Solaris and MOX). Try looking for public forums statistics for example 
> Linux vs. Solaris and after this try to answer on "is it 10k is it realy 
> big number in this case or not ?" (IIRC google provides very good tools for 
> anyone who want this kind answers).

I don't need Google for this.

It's easy to extract from git that patches that were applied to the 
Linux kernel during the last 2 years contained 3196 unique 
Signed-off-by: lines.

Some people might have lines with different email addresses, but this 
still makes > 2000 contributors during the last 2 years alone. Plus all 
the other people who contributed during the first 14 years of Linux 
kernel development but didn't during the last 2 years.

You could argue whether there were really 10.000 people or "only"
5.000 people who contributed code to the Linux kernel, but 
that doesn't make a real difference.

> kloczek

cu
Adrian

-- 

       "Is there not promise of rain?" Ling Tan asked suddenly out
        of the darkness. There had been need of rain for many days.
       "Only a promise," Lao Er said.
                                       Pearl S. Buck - Dragon Seed


  reply	other threads:[~2007-04-16 17:21 UTC|newest]

Thread overview: 83+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2007-04-13 23:18 ZFS with Linux: An Open Plea David R. Litwin
2007-04-13 23:43 ` Neil Brown
2007-04-14 12:16   ` Christoph Hellwig
2007-04-14 14:04   ` Mike Snitzer
2007-04-14 20:53     ` Jan Engelhardt
2007-04-16  9:40       ` Tomasz Kłoczko
2007-04-16 11:19         ` John Anthony Kazos Jr.
2007-04-16 14:02         ` Stefan Richter
2007-04-16 14:20           ` Tomasz Kłoczko
2007-04-16 14:55             ` Christoph Hellwig
2007-04-16 15:46               ` Tomasz Kłoczko
2007-04-16 15:59                 ` Christoph Hellwig
2007-04-16 19:02                 ` Diego Calleja
2007-04-16 20:18                   ` Tomasz Kłoczko
2007-04-18 17:25                     ` Lennart Sorensen
2007-04-18 17:39                       ` Jeff Garzik
2007-04-27  5:21                       ` Valerie Henson
2007-04-27 21:57                         ` Matt Mackall
2007-04-16 19:46                 ` Jörn Engel
2007-04-16 18:19             ` Stefan Richter
2007-04-16 19:21               ` Bernd Eckenfels
2007-04-16 19:26                 ` Lee Revell
2007-04-16 20:20                   ` Bernd Eckenfels
2007-04-16 20:15                 ` Stefan Richter
2007-04-14 21:13     ` Bill Huey
2007-04-16  9:58     ` Tomasz Kłoczko
     [not found]       ` <170fa0d20704160507w4af4cb92ua259a55789f95c3e@mail.gmail.com>
2007-04-16 14:01         ` Tomasz Kłoczko
2007-04-16 14:30           ` Adrian Bunk
2007-04-16 15:27             ` Tomasz Kłoczko
2007-04-16 17:21               ` Adrian Bunk [this message]
2007-04-14 18:56   ` Krzysztof Halasa
2007-04-16  3:00     ` David Chinner
2007-04-15  4:16 ` Kasper Sandberg
2007-04-15 21:58 ` Jesper Juhl
2007-05-02 15:03 ` Tomasz Kłoczko
2007-05-02 15:42   ` Alan Cox
2007-05-02 20:53     ` Theodore Tso
  -- strict thread matches above, loose matches on Subject: below --
2007-04-14 17:40 Ignatich
2007-04-15 12:44 ` Nikita Danilov
2007-04-17 14:14   ` Alan Cox
2007-04-15  8:54 David R. Litwin
2007-04-16  0:50 ` Rik van Riel
2007-04-16  3:07   ` David Chinner
2007-04-15  8:57 David R. Litwin
2007-04-15 17:34 ` Kasper Sandberg
2007-04-17  6:54 David R. Litwin
2007-04-17  8:18 ` Miklos Szeredi
2007-04-17 13:10 ` Theodore Tso
2007-04-17 13:47   ` Tomasz Kłoczko
2007-04-17 13:59     ` Matthew Garrett
2007-04-17 15:46       ` Tomasz Kłoczko
2007-04-17 15:59         ` Alan Cox
2007-04-17 16:29         ` Daniel Hazelton
2007-04-17 19:58           ` Tomasz Kłoczko
2007-04-17 22:19             ` Daniel Hazelton
2007-04-17 22:12               ` David Lang
2007-04-17 22:52                 ` Daniel Hazelton
2007-04-17 22:38               ` Roland Dreier
2007-04-17 14:06     ` Erik Mouw
2007-04-17 14:32     ` John Anthony Kazos Jr.
2007-04-17 15:41       ` Tomasz Kłoczko
2007-04-17 16:02         ` John Anthony Kazos Jr.
2007-04-17 14:37     ` Diego Calleja
2007-04-17 14:48     ` Alan Cox
2007-04-17 15:06       ` Ricardo Correia
2007-04-17 15:23         ` Xavier Bestel
2007-04-17 15:30           ` Ricardo Correia
2007-04-17 15:36             ` Alan Cox
2007-04-17 16:02       ` Mike Snitzer
2007-04-17 16:57       ` Alistair John Strachan
2007-04-18 11:10       ` Manoj Joseph
2007-04-18 11:23         ` Alan Cox
2007-04-18 11:32           ` Manoj Joseph
2007-04-17 16:22     ` Daniel Hazelton
2007-04-17 17:50       ` Theodore Tso
2007-04-17 19:24         ` Florian Weimer
2007-04-17 19:56           ` Ricardo Correia
2007-04-17 20:05             ` Ricardo Correia
2007-04-17 14:59   ` linux-os (Dick Johnson)
2007-04-17 15:08     ` Xavier Bestel
2007-04-17 15:12       ` linux-os (Dick Johnson)
2007-04-17 15:29     ` Michal Schmidt
2007-04-17  8:42 David R. Litwin

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=20070416172155.GF10338@stusta.de \
    --to=bunk@stusta.de \
    --cc=kloczek@rudy.mif.pg.gda.pl \
    --cc=linux-kernel@vger.kernel.org \
    --cc=neilb@suse.de \
    --cc=presently42@gmail.com \
    --cc=snitzer@gmail.com \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox