From mboxrd@z Thu Jan 1 00:00:00 1970 From: Jeff Garzik Subject: Re: [osd-dev] [PATCH 7/9] exofs: mkexofs Date: Tue, 13 Jan 2009 08:24:39 -0500 Message-ID: <496C9617.80701@garzik.org> References: <4947BFAA.4030208@panasas.com> <4947CA5C.50104@panasas.com> <20081229121423.efde9d06.akpm@linux-foundation.org> <495B8D90.1090004@panasas.com> <1230739053.3408.74.camel@localhost.localdomain> <4960D3CA.2000202@panasas.com> <1231783926.3256.29.camel@localhost.localdomain> <496B989F.7050907@garzik.org> <1231790190.15161.29.camel@localhost.localdomain> <496BA671.3070900@garzik.org> <1231802758.27151.18.camel@localhost.localdomain> <496C9109.5040803@panasas.com> Mime-Version: 1.0 Content-Type: text/plain; charset=ISO-8859-1; format=flowed Content-Transfer-Encoding: 7bit Return-path: In-Reply-To: <496C9109.5040803@panasas.com> Sender: linux-fsdevel-owner@vger.kernel.org To: Benny Halevy Cc: James Bottomley , open-osd development , linux-scsi , Matthew Wilcox , linux-kernel , Avishay Traeger , Al Viro , linux-fsdevel , Andrew Morton List-Id: linux-scsi@vger.kernel.org Benny Halevy wrote: > IMO the main advantage of moving block allocation down to the OSD target > is more apparent with distributed file systems a-la pNFS over objects > where paralleling that task is a key for scalable performance. > > The thing is that the target needs to implement its own mapping from > object logical offsets into disk blocks and this is usually done > using some kind of a (possibly trimmed down) local file system. > Therefore the I/O performance of a single OSD is likely to be similar > to a single file server's. Well, modern SATA devices are already mini-filesystems internally, when you consider logical block remapping etc. And the claim by drive research guys at the filesystem/storage summit was that OSD offered the potential to better optimize storage based on access/usage patterns. (of course, whether or not reality bears out this guess is another question) > I can understand representing a single object as a block device (although I > think that using a file for that should be good enough and easier) but > why representing the whole OSD as a block device? The OSD holds partitions > and objects each with attributes and OSD security related support. Hence > representing that in a namespace using a filesystem seems straight forward. I am actually considering writing a simple "osdblk" driver, that would represent a single object as a block device. This would NOT replace exofs or other OSD filesystems, but it would be nice to have, and it will give me more experience with OSDs. Jeff