From: Martin Křížek
To: Chris Mason, Alexander Block, Martin Křížek, linux-btrfs@vger.kernel.org, lczerner@redhat.com, jbacik@fusionio.com
Date: Sun, 17 Mar 2013 23:57:09 +0100
Subject: Re: Online Deduplication for Btrfs (Master's thesis)
References: <20121218013111.GB22912@shiny>

Hi,

I am looking for the place in the source code where the actual
online (immediate) dedup would happen:
 -- splitting an extent that is about to be written into new extents
    that are unique (and would actually be written) and blocks of the
    original extent that are duplicates (and would be thrown away),
    with a file_extent_item pointing to the matching portion of an
    existing extent created in their place.

Could anyone please give me some pointers to that place and/or to
functions that could be used for this? Would delalloc time be
appropriate?

Thanks!
Martin
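
For illustration only, the split described above can be modelled in user space roughly as follows. This is not btrfs code: split_extent, Run, the 4 KiB block size, the SHA-256 lookup and the in-memory index are all assumptions made for the example. It says nothing about where in the kernel (delalloc time or elsewhere) such a step would belong. A real implementation would presumably also compare block contents rather than trust the hash alone.

```python
from dataclasses import dataclass
from hashlib import sha256
from typing import Dict, List, Optional, Tuple

BLOCK_SIZE = 4096  # assumed block size, illustrative only


@dataclass
class Run:
    """A contiguous run of blocks inside the extent about to be written."""
    offset: int      # byte offset of the run inside the new extent
    length: int      # length of the run in bytes
    duplicate: bool  # True if the run already exists in another extent
    # For duplicate runs: (existing extent id, byte offset inside it).
    # This is what a file extent item would point at instead of new data.
    target: Optional[Tuple[int, int]] = None


def split_extent(data: bytes,
                 index: Dict[bytes, Tuple[int, int]]) -> List[Run]:
    """Split `data` into unique runs (to be written) and duplicate runs
    (to be dropped and replaced by references to existing extents).

    `index` is a hypothetical map from block hash to the location of an
    identical block that is already on disk.
    """
    runs: List[Run] = []
    for off in range(0, len(data), BLOCK_SIZE):
        block = data[off:off + BLOCK_SIZE]
        hit = index.get(sha256(block).digest())
        last = runs[-1] if runs else None
        if hit is None:
            # Unique block: extend the current unique run or start one.
            if last is not None and not last.duplicate:
                last.length += len(block)
            else:
                runs.append(Run(off, len(block), False))
        else:
            # Duplicate block: merge only if it continues the same
            # existing extent at the next contiguous offset.
            if (last is not None and last.duplicate
                    and last.target[0] == hit[0]
                    and last.target[1] + last.length == hit[1]):
                last.length += len(block)
            else:
                runs.append(Run(off, len(block), True, hit))
    return runs
```

In this model, each unique run would become a new extent that is actually written, and each duplicate run would become a file extent item referencing (extent id, offset, length) in an existing extent.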