From mboxrd@z Thu Jan 1 00:00:00 1970 From: Grant Grundler Subject: Re: Enterprise workload testing for storage and filesystems Date: Thu, 20 Nov 2008 20:42:29 -0800 Message-ID: References: <1226962063.3403.13.camel@localhost.localdomain> Mime-Version: 1.0 Content-Type: text/plain; charset=UTF-8 Content-Transfer-Encoding: 7bit Cc: linux-scsi , linux-kernel , linux-fsdevel@vger.kernel.org, "K.S. Bhaskar" To: James Bottomley Return-path: In-Reply-To: <1226962063.3403.13.camel@localhost.localdomain> Sender: linux-scsi-owner@vger.kernel.org List-Id: linux-fsdevel.vger.kernel.org On Mon, Nov 17, 2008 at 2:47 PM, James Bottomley wrote: > Hi all, > > High on our list at the recent Linux Foundation end user summit was > obtaining a method of obtaining enterprise workloads (or simulators) we > can run in our own testing environments. The main problem being that > the data sets used by the systems are usually secret or under regulatory > embargo and thus unobtainable. However, several participants noted that > regulatory prohibitions also extended to their own in-house IT team, > thus they had had to develop simulators for the workloads which, since > they contained no customer data, might be more widely distributable. Google has the same concerns. Hoping to work through those, last year I arranged funding for UNSW (Joshua Root) to develop a GPL linux block layer replay tool: http://www.gelato.unsw.edu.au/IA64wiki/JoshuaRoot/MarkovChains Unfortunately, I've not been able to address all concerns with this and thus can't offer any google specific markov chains. :( Still I hope this tool can be of use to others. > Fidelity National Information Service were the first to try this. > They've kicked off a sourceforge site for their stress testing tool > (which is the same tool they use in their own qualification labs). The > source for the tool is available here: > > http://sourceforge.net/project/showfiles.php?group_id=11026&package_id=298597 Awesome! Kudos! > And it comes with a fairly detailed readme explaining what it's trying > to simulate and why. Hopefully this will give us all a much better > insight into both enterprise workloads and the way enterprise IT > departments conduct testing. > > Let's see how our storage and filesystem tuning measures up to this. *nod* thanks, grant