All of lore.kernel.org
 help / color / mirror / Atom feed
From: Li Zefan <lizf@cn.fujitsu.com>
To: "Yan, Zheng " <yanzheng@21cn.com>
Cc: "linux-btrfs@vger.kernel.org" <linux-btrfs@vger.kernel.org>,
	Zheng <zheng.z.yan@intel.com>
Subject: Re: [RFC] Btrfs design defect in extent backref ?
Date: Fri, 26 Aug 2011 10:00:43 +0800	[thread overview]
Message-ID: <4E56FE4B.80606@cn.fujitsu.com> (raw)
In-Reply-To: <CAAM7YAkxNcC_dk4uUnYFUoynkzb2Mx2KmJrNT+GJ8oZGKp3JqA@mail.gmail.com>

Yan, Zheng wrote:
> On Thu, Aug 25, 2011 at 3:56 PM, Li Zefan <lizf@cn.fujitsu.com> wrote:
>> We have an offset in file extent to indicate its position in the
>> corresponding extent item in extent tree. We also have an offset in
>> extent item to indicate the start position of the file extent that
>> uses this item.
>>
>> The math is:
>>
>>    extent_item.extent_data_ref.offset = file_pos - file_extent.extent_offset.
>>
>>                       e1
>> disk extents:    |--------------|
>>                 ^
>>                 |                  e2
>>                 |          |-----------------|
>>                 |          |   ^
>>                 |          |   |
>>                 v          v   |
>> file extents:    |----- f1 -----|----- f2 -----|
>>
>> So it looks like e2.offset points to f1 not f2. Therefore given an extent item,
>> we'll have to search through all the file extents in an inode to find the
>> relative file extent in the worst case, which makes this field somewhat useless.
>>
> 
> The reason for this is reducing number of file extent backref itmes.

It seems to me a rare case, which isn't worth the complexity and inconvenience
it brings, and it requires an extra field (.count).

> we don't have to search all the file extents because the file extent size
> is limited and we have extent_data_ref.count.

Yes we have to, and for a big file with many small file extents, the extent
number is not trivial.

> 
>> What makes things worse is the above fomula can make the offset a negative
>> value (cast to u64):
>>
>>    # touch /mnt/dst
>>    # clone_range -s 8192 -d 4096 /mnt/src /mnt/dst
>>    # umount /mnt
>>    # btrfs-debug-tree /dev/sda7
>>    ...
>>        item 2 key (12582912 EXTENT_ITEM 49152) itemoff 3865 itemsize 82
>>                extent refs 2 gen 8 flags 1
>>                extent data backref root 5 objectid 258 offset 18446744073709543424 count 1
>>                extent data backref root 5 objectid 257 offset 0 count 1
>>    ...
>>
>> and relocation won't work in this case:
>>
>>    # mount /dev/sda7 /mnt
>>    # rm /mnt/src
>>    # sync
>>    # btrfs fi bal /mnt
>>    (kernel warning !!)
>>    (hung up !!)
>>
>> I don't see the necessity or benefit of the substraction in the fomula,
>> and I think the correct one is:
>>
>>    extent_item.extent_data_ref.offset = file_pos
>>
>> (As a side effect thereafter we don't need extent_data_ref.count)
>>
>> That's what this patch does. Unfornately it is an incompatable change
>> in disk format.
>>
>> So I think we have to live with this defect, just fix relocation for
>> the negative offset case ?
> 
> I prefer fixing relocation.
> 

Sure, though I would prefer the alternative if not for the stablity of
disk format.

  reply	other threads:[~2011-08-26  2:00 UTC|newest]

Thread overview: 5+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2011-08-25  7:56 [RFC] Btrfs design defect in extent backref ? Li Zefan
2011-08-25  8:47 ` Yan, Zheng 
2011-08-26  2:00   ` Li Zefan [this message]
2011-08-26  2:38     ` Yan, Zheng 
2011-08-26  3:04       ` Li Zefan

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=4E56FE4B.80606@cn.fujitsu.com \
    --to=lizf@cn.fujitsu.com \
    --cc=linux-btrfs@vger.kernel.org \
    --cc=yanzheng@21cn.com \
    --cc=zheng.z.yan@intel.com \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.