* How btrfs-find-root knows that the block is actually a root? @ 2014-12-22 14:57 Alex Lyakas 2014-12-23 5:27 ` Qu Wenruo 0 siblings, 1 reply; 4+ messages in thread From: Alex Lyakas @ 2014-12-22 14:57 UTC (permalink / raw) To: linux-btrfs Greetings, I am looking at the code of search_iobuf() in btrfs-find-root.c.(3.17.3) I see that we probe nodesize blocks one by one, and for each block we check: - its owner is what we are looking for - its header->bytenr is what we are looking at currently - its level is not too small - it has valid checksum - it has the desired generation If all those conditions are true, we declare this block as a root and end the program. How do we actually know that it's a root and not a leaf or an intermediate node? What if we are searching for a root of the root tree, which has one node and two leafs (all have the same highest transid), and one of the leafs has "logical" lower than the actual root, i.e., it comes first in our scan. Then we will declare this leaf as a root, won't we? Or somehow the root always has the lowest "logical"? Also, I am confused by this line: level = h_level; This means that if we encounter a block that "seems good", we will skip all other blocks that have lower level. Is this intended? Thanks, Alex. ^ permalink raw reply [flat|nested] 4+ messages in thread
* Re: How btrfs-find-root knows that the block is actually a root? 2014-12-22 14:57 How btrfs-find-root knows that the block is actually a root? Alex Lyakas @ 2014-12-23 5:27 ` Qu Wenruo 2014-12-23 10:08 ` Alex Lyakas 0 siblings, 1 reply; 4+ messages in thread From: Qu Wenruo @ 2014-12-23 5:27 UTC (permalink / raw) To: Alex Lyakas; +Cc: linux-btrfs@vger.kernel.org >> linux-btrfs -------- Original Message -------- Subject: How btrfs-find-root knows that the block is actually a root? From: Alex Lyakas <alex.btrfs@zadarastorage.com> To: linux-btrfs <linux-btrfs@vger.kernel.org> Date: 2014年12月22日 22:57 > Greetings, > > I am looking at the code of search_iobuf() in > btrfs-find-root.c.(3.17.3) I see that we probe nodesize blocks one by > one, and for each block we check: > - its owner is what we are looking for > - its header->bytenr is what we are looking at currently > - its level is not too small > - it has valid checksum > - it has the desired generation > > If all those conditions are true, we declare this block as a root and > end the program. > > How do we actually know that it's a root and not a leaf or an > intermediate node? What if we are searching for a root of the root > tree, which has one node and two leafs (all have the same highest > transid), and one of the leafs has "logical" lower than the actual > root, i.e., it comes first in our scan. Then we will declare this leaf > as a root, won't we? Or somehow the root always has the lowest > "logical"? You can refer to this patch: https://patchwork.kernel.org/patch/5285521/ Your questions are mostly right. The best method should be search through all the metadata, and only the highest level header for a given generation may be the root for that generation. But that method still has some problems. 1) Overwritten old node/leaf As btrfs metadata cow happens, old node/leaf may be overwritten and become incompletely, so above method won't always work as expected. 2) Corrupted fs That will makes everything not work as expected. But sadly, when someone needs to use btrfs-find-root, there is a high possibility the fs is already corrupted. 3) Slow speed It needs to scan over all the sectors of metadata chunks, it may var from megabytese to tegabytes, which makes the complete scan impractical. So current find-root uses a trade-off, if find a header at the position superblock points to, and generation matches, then just consider it as the desired root and exit. > > Also, I am confused by this line: > level = h_level; > This means that if we encounter a block that "seems good", we will > skip all other blocks that have lower level. Is this intended? This is intended, for case user already know the root's level, so it will skip any header whose level is below it. Thanks, Qu > > Thanks, > Alex. > -- > To unsubscribe from this list: send the line "unsubscribe linux-btrfs" in > the body of a message to majordomo@vger.kernel.org > More majordomo info at http://vger.kernel.org/majordomo-info.html ^ permalink raw reply [flat|nested] 4+ messages in thread
* Re: How btrfs-find-root knows that the block is actually a root? 2014-12-23 5:27 ` Qu Wenruo @ 2014-12-23 10:08 ` Alex Lyakas 2014-12-24 0:41 ` Qu Wenruo 0 siblings, 1 reply; 4+ messages in thread From: Alex Lyakas @ 2014-12-23 10:08 UTC (permalink / raw) To: Qu Wenruo; +Cc: linux-btrfs@vger.kernel.org >> linux-btrfs Hi Qu, On Tue, Dec 23, 2014 at 7:27 AM, Qu Wenruo <quwenruo@cn.fujitsu.com> wrote: > > -------- Original Message -------- > Subject: How btrfs-find-root knows that the block is actually a root? > From: Alex Lyakas <alex.btrfs@zadarastorage.com> > To: linux-btrfs <linux-btrfs@vger.kernel.org> > Date: 2014年12月22日 22:57 >> >> Greetings, >> >> I am looking at the code of search_iobuf() in >> btrfs-find-root.c.(3.17.3) I see that we probe nodesize blocks one by >> one, and for each block we check: >> - its owner is what we are looking for >> - its header->bytenr is what we are looking at currently >> - its level is not too small >> - it has valid checksum >> - it has the desired generation >> >> If all those conditions are true, we declare this block as a root and >> end the program. >> >> How do we actually know that it's a root and not a leaf or an >> intermediate node? What if we are searching for a root of the root >> tree, which has one node and two leafs (all have the same highest >> transid), and one of the leafs has "logical" lower than the actual >> root, i.e., it comes first in our scan. Then we will declare this leaf >> as a root, won't we? Or somehow the root always has the lowest >> "logical"? > > You can refer to this patch: > https://patchwork.kernel.org/patch/5285521/ I see that this has not been applied to any of David's branches. Do you have a repo to look at this code in its entirety? > > Your questions are mostly right. > The best method should be search through all the metadata, and only the > highest level header for > a given generation may be the root for that generation. > > But that method still has some problems. > 1) Overwritten old node/leaf > As btrfs metadata cow happens, old node/leaf may be overwritten and become > incompletely, > so above method won't always work as expected. > > 2) Corrupted fs > That will makes everything not work as expected. > But sadly, when someone needs to use btrfs-find-root, there is a high > possibility the fs is already corrupted. > > 3) Slow speed > It needs to scan over all the sectors of metadata chunks, it may var from > megabytese to tegabytes, > which makes the complete scan impractical. > So current find-root uses a trade-off, if find a header at the position > superblock points to, and generation > matches, then just consider it as the desired root and exit. I think this is a bit optimistic. What if the root tree has several leaves having the same generation as the root? Then we might declare a leaf as a root and exit. But further recovery based on that output will get us into trouble. > >> >> Also, I am confused by this line: >> level = h_level; >> This means that if we encounter a block that "seems good", we will >> skip all other blocks that have lower level. Is this intended? > > This is intended, for case user already know the root's level, so it will > skip any header whose level is below it. But this line is performed before the generation check. Let's say that user did not specify any level (so search_level==0). Then assume we encounter a block, which has lower generation than what we need, but higher level. At this point, we do level = h_level; and we will skip any blocks lower than this level from now on. What if the root tree got shirnked (due to subvolume deletion, for example), and the "good" root has lower level? We will skip it then, and will not find the root. Thanks for your comments, Alex. > > Thanks, > Qu >> >> >> Thanks, >> Alex. >> -- >> To unsubscribe from this list: send the line "unsubscribe linux-btrfs" in >> the body of a message to majordomo@vger.kernel.org >> More majordomo info at http://vger.kernel.org/majordomo-info.html > > ^ permalink raw reply [flat|nested] 4+ messages in thread
* Re: How btrfs-find-root knows that the block is actually a root? 2014-12-23 10:08 ` Alex Lyakas @ 2014-12-24 0:41 ` Qu Wenruo 0 siblings, 0 replies; 4+ messages in thread From: Qu Wenruo @ 2014-12-24 0:41 UTC (permalink / raw) To: Alex Lyakas; +Cc: linux-btrfs@vger.kernel.org >> linux-btrfs -------- Original Message -------- Subject: Re: How btrfs-find-root knows that the block is actually a root? From: Alex Lyakas <alex.btrfs@zadarastorage.com> To: Qu Wenruo <quwenruo@cn.fujitsu.com> Date: 2014年12月23日 18:08 > Hi Qu, > > On Tue, Dec 23, 2014 at 7:27 AM, Qu Wenruo <quwenruo@cn.fujitsu.com> wrote: >> -------- Original Message -------- >> Subject: How btrfs-find-root knows that the block is actually a root? >> From: Alex Lyakas <alex.btrfs@zadarastorage.com> >> To: linux-btrfs <linux-btrfs@vger.kernel.org> >> Date: 2014年12月22日 22:57 >>> Greetings, >>> >>> I am looking at the code of search_iobuf() in >>> btrfs-find-root.c.(3.17.3) I see that we probe nodesize blocks one by >>> one, and for each block we check: >>> - its owner is what we are looking for >>> - its header->bytenr is what we are looking at currently >>> - its level is not too small >>> - it has valid checksum >>> - it has the desired generation >>> >>> If all those conditions are true, we declare this block as a root and >>> end the program. >>> >>> How do we actually know that it's a root and not a leaf or an >>> intermediate node? What if we are searching for a root of the root >>> tree, which has one node and two leafs (all have the same highest >>> transid), and one of the leafs has "logical" lower than the actual >>> root, i.e., it comes first in our scan. Then we will declare this leaf >>> as a root, won't we? Or somehow the root always has the lowest >>> "logical"? >> You can refer to this patch: >> https://patchwork.kernel.org/patch/5285521/ > I see that this has not been applied to any of David's branches. Do > you have a repo to look at this code in its entirety? Saddly, no personal repo... > >> Your questions are mostly right. >> The best method should be search through all the metadata, and only the >> highest level header for >> a given generation may be the root for that generation. >> >> But that method still has some problems. >> 1) Overwritten old node/leaf >> As btrfs metadata cow happens, old node/leaf may be overwritten and become >> incompletely, >> so above method won't always work as expected. >> >> 2) Corrupted fs >> That will makes everything not work as expected. >> But sadly, when someone needs to use btrfs-find-root, there is a high >> possibility the fs is already corrupted. >> >> 3) Slow speed >> It needs to scan over all the sectors of metadata chunks, it may var from >> megabytese to tegabytes, >> which makes the complete scan impractical. >> So current find-root uses a trade-off, if find a header at the position >> superblock points to, and generation >> matches, then just consider it as the desired root and exit. > I think this is a bit optimistic. What if the root tree has several > leaves having the same generation as the root? Then we might declare a > leaf as a root and exit. But further recovery based on that output > will get us into trouble. That's right, but if btrfs-find-root takes several hours each time to do full scan, most user will be crazy... Especially when it is executed on a clean btrfs. > >>> Also, I am confused by this line: >>> level = h_level; >>> This means that if we encounter a block that "seems good", we will >>> skip all other blocks that have lower level. Is this intended? >> This is intended, for case user already know the root's level, so it will >> skip any header whose level is below it. > But this line is performed before the generation check. Let's say that > user did not specify any level (so search_level==0). Then assume we > encounter a block, which has lower generation than what we need, but > higher level. At this point, we do > level = h_level; > and we will skip any blocks lower than this level from now on. What if > the root tree got shirnked (due to subvolume deletion, for example), > and the "good" root has lower level? We will skip it then, and will > not find the root. > > Thanks for your comments, > Alex. Oh, I misunderstand your meaning, you're completely right, the original codes will skip any level lower than current found. That's deadly for root with higher generation but with lower level, as you mentioned, item delete. So the best method would be only skip leaf/node whose level is lower than current found highest level *among the same generation*. Oh, that's what I do in my patch. :-) Thanks, Qu > > >> Thanks, >> Qu >>> >>> Thanks, >>> Alex. >>> -- >>> To unsubscribe from this list: send the line "unsubscribe linux-btrfs" in >>> the body of a message to majordomo@vger.kernel.org >>> More majordomo info at http://vger.kernel.org/majordomo-info.html >> ^ permalink raw reply [flat|nested] 4+ messages in thread
end of thread, other threads:[~2014-12-24 0:41 UTC | newest] Thread overview: 4+ messages (download: mbox.gz follow: Atom feed -- links below jump to the message on this page -- 2014-12-22 14:57 How btrfs-find-root knows that the block is actually a root? Alex Lyakas 2014-12-23 5:27 ` Qu Wenruo 2014-12-23 10:08 ` Alex Lyakas 2014-12-24 0:41 ` Qu Wenruo
This is a public inbox, see mirroring instructions for how to clone and mirror all data and code used for this inbox; as well as URLs for NNTP newsgroup(s).