From mboxrd@z Thu Jan 1 00:00:00 1970 From: Marc Lehmann Subject: Re: SMR drive test 2; 128GB partition; no obvious corruption, much more sane behaviour, weird overprovisioning Date: Thu, 24 Sep 2015 01:24:14 +0200 Message-ID: <20150923232414.GC3463@schmorp.de> References: <20150808205003.GA6546@schmorp.de> <20150810203106.GA4575@jaegeuk-mac02> <20150920235901.GA7017@schmorp.de> <20150921081748.GA5637@schmorp.de> <20150921081937.GA5718@schmorp.de> <20150921095806.GA6809@schmorp.de> <20150923011239.GA32520@jaegeuk-mac02.mot.com> <20150923041523.GB4946@schmorp.de> <20150923212931.GC36564@jaegeuk-mac02.mot.com> Mime-Version: 1.0 Content-Type: text/plain; charset="us-ascii" Content-Transfer-Encoding: 7bit Return-path: Received: from sog-mx-1.v43.ch3.sourceforge.com ([172.29.43.191] helo=mx.sourceforge.net) by sfs-ml-3.v29.ch3.sourceforge.com with esmtp (Exim 4.76) (envelope-from ) id 1ZetOh-0003ny-06 for linux-f2fs-devel@lists.sourceforge.net; Wed, 23 Sep 2015 23:24:23 +0000 Received: from mail.nethype.de ([5.9.56.24]) by sog-mx-1.v43.ch3.sourceforge.com with esmtps (TLSv1:AES128-SHA:128) (Exim 4.76) id 1ZetOf-0004AB-HB for linux-f2fs-devel@lists.sourceforge.net; Wed, 23 Sep 2015 23:24:22 +0000 Content-Disposition: inline In-Reply-To: <20150923212931.GC36564@jaegeuk-mac02.mot.com> List-Id: List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: linux-f2fs-devel-bounces@lists.sourceforge.net To: Jaegeuk Kim Cc: linux-f2fs-devel@lists.sourceforge.net On Wed, Sep 23, 2015 at 02:29:31PM -0700, Jaegeuk Kim wrote: > > Can you elaborate? I do get a speed improvement with only two logs, but of > > course, GC time is an impoprtant factor, so maybe more logs would be a > > necessary trade-off. > > This will help you to understand more precisely. Thanks, will read more thoroughly, but that means I probably do want two logs. Regarding your elaboration: > One GC needs to move whole valid blocks inside a section, so if the section > size is too large, every GC is likely to show very long latency. > In addion, we need more overprovision space too. That wouldn't increase the overhead in general though, because the overhead depends on how much space is free in each section. > And, if the number of logs is small, GC can suffer from moving hot and cold > data blocks which represents somewhat temporal locality. I am somewhat skeptical of this for (on of my) my usage(s) (archival), because there is absolutely no way to know in advance what is hot and what is cold. Example: a file might be deleted, but there is no way in advance to know which it will be. The only thing I know is that files never get modified after written once (but often replaced). In another of of my usages, files do get modified, but there is no way to know in advance which it will be, and they will only ever be modified once (after initial allocation). So I am very suspicious of both static and dynamic attempts to seperate data into hot/cold. You can't know from file extensions, and you can't know from past modification history. The only applicability of hot/cold I can see is filesystem metadata and directories (files get moved/renamed/added), and afaics, f2fs already does that. > Of course, these numbers highly depend on storage speed and workloads, so > it needs to be tuned up. >>From your original comment, I assumed that the gc somehow needs more logs to be more efficient for some internal reason, but it seems since it is mostly a matter of section size (which I want to have "unreasonably" large), which means potentially a lot of valid data has to be moved, and hot/cold data, which I am very skeptical about. (I think hot/cold works absolutely splendid for normal desktop uses and most forms of /home, though). -- The choice of a Deliantra, the free code+content MORPG -----==- _GNU_ http://www.deliantra.net ----==-- _ generation ---==---(_)__ __ ____ __ Marc Lehmann --==---/ / _ \/ // /\ \/ / schmorp@schmorp.de -=====/_/_//_/\_,_/ /_/\_\ ------------------------------------------------------------------------------ Monitor Your Dynamic Infrastructure at Any Scale With Datadog! Get real-time metrics from all of your servers, apps and tools in one place. SourceForge users - Click here to start your Free Trial of Datadog now! http://pubads.g.doubleclick.net/gampad/clk?id=241902991&iu=/4140