From mboxrd@z Thu Jan 1 00:00:00 1970 From: Nic Henke Date: Tue, 29 Sep 2009 13:02:25 -0500 Subject: [Lustre-devel] using LST for performance testing In-Reply-To: <1254245568.5827.5.camel@lap75545.ornl.gov> References: <4ABBD78E.50506@cray.com> <20090928173554.GA4911@sun.com> <4AC23B21.2030207@cray.com> <1254245568.5827.5.camel@lap75545.ornl.gov> Message-ID: <4AC24BB1.8040502@cray.com> List-Id: MIME-Version: 1.0 Content-Type: text/plain; charset="us-ascii" Content-Transfer-Encoding: 7bit To: lustre-devel@lists.lustre.org David Dillow wrote: > On Tue, 2009-09-29 at 11:51 -0500, Nic Henke wrote: > >> I'm wondering if we couldn't add a new 'batch_stat' command. The idea is >> that the client code will fill in the start/stop times for each test and >> then after the test is done, 'batch_stat' would collect this data. The >> collection would still be passive and a new command should minimize the >> protocol changes. The per-test data would allow us to get accurate perf >> numbers and also provide some data into how parallel the tests were, if >> there are any unfairness issues, etc. >> > > Along these lines, it would be nice if we could specify a run time for > each test rather than an amount of data to be transferred -- it makes it > easier to get aggregate bandwidth numbers, and often shows imbalances > nicely -- the node getting starved is the one that transfers less data. > > It may also make sense to add a 'delay' parameter that causes each test > to wait a specified amount of time from the 'go' signal. This allows the > signal to propagate without running into congestion from the test, > helping to cause all of the clients to start the test closer to > simultaneously. > Interesting - can you elaborate, perhaps in the form of a patch ? :-) I like both ideas, but not signing up to code them just yet. Nic