* [PATCH] btrfs: balance: fix null-ptr-deref in btrfs_may_alloc_data_chunk
@ 2026-03-16 13:46 ZhengYuan Huang
2026-03-16 20:46 ` Qu Wenruo
0 siblings, 1 reply; 3+ messages in thread
From: ZhengYuan Huang @ 2026-03-16 13:46 UTC (permalink / raw)
To: dsterba, clm, bo.li.liu
Cc: linux-btrfs, linux-kernel, baijiaju1990, r33s3n6, zzzccc427,
ZhengYuan Huang, stable
[BUG]
Running btrfs balance can trigger a null-ptr-deref before relocating a
data chunk when metadata corruption leaves a chunk in the chunk tree
without a corresponding block group in the in-memory cache:
KASAN: null-ptr-deref in range [0x0000000000000088-0x000000000000008f]
RIP: 0010:btrfs_may_alloc_data_chunk+0x40/0x1c0 fs/btrfs/volumes.c:3601
Call Trace:
__btrfs_balance fs/btrfs/volumes.c:4217 [inline]
btrfs_balance+0x2516/0x42b0 fs/btrfs/volumes.c:4604
btrfs_ioctl_balance fs/btrfs/ioctl.c:3577 [inline]
btrfs_ioctl+0x25cf/0x5b90 fs/btrfs/ioctl.c:5313
...
[CAUSE]
__btrfs_balance() iterates the on-disk chunk tree and passes the chunk
logical bytenr to btrfs_may_alloc_data_chunk() before relocating a data
chunk. That helper then queries the in-memory block group cache:
cache = btrfs_lookup_block_group(fs_info, chunk_offset);
chunk_type = cache->flags; /* cache may be NULL */
On a corrupt image can contain a chunk item whose matching block group
item is missing, so no block group is ever inserted into the cache. In
that case btrfs_lookup_block_group() returns NULL.
The code only guards this with ASSERT(cache), which becomes a no-op when
CONFIG_BTRFS_ASSERT is disabled. The subsequent dereference of
cache->flags therefore crashes the kernel.
[FIX]
Add a NULL check after btrfs_lookup_block_group() in
btrfs_may_alloc_data_chunk(). If the lookup fails, emit a btrfs_err()
message identifying the affected bytenr and return -EUCLEAN to report
filesystem corruption instead of dereferencing NULL.
The caller already treats negative returns from
btrfs_may_alloc_data_chunk() as fatal errors, so balance aborts cleanly
and reports the corruption to userspace.
Fixes: a6f93c71d412 ("Btrfs: avoid losing data raid profile when deleting a device")
Cc: stable@vger.kernel.org
Signed-off-by: ZhengYuan Huang <gality369@gmail.com>
---
fs/btrfs/volumes.c | 6 +++++-
1 file changed, 5 insertions(+), 1 deletion(-)
diff --git a/fs/btrfs/volumes.c b/fs/btrfs/volumes.c
index 4958e074d420..4657b826b48b 100644
--- a/fs/btrfs/volumes.c
+++ b/fs/btrfs/volumes.c
@@ -3597,7 +3597,12 @@ static int btrfs_may_alloc_data_chunk(struct btrfs_fs_info *fs_info,
u64 bytes_used;
u64 chunk_type;
cache = btrfs_lookup_block_group(fs_info, chunk_offset);
- ASSERT(cache);
+ if (!cache) {
+ btrfs_err(fs_info,
+ "balance: chunk at bytenr %llu has no corresponding block group",
+ chunk_offset);
+ return -EUCLEAN;
+ }
chunk_type = cache->flags;
btrfs_put_block_group(cache);
--
2.43.0
^ permalink raw reply related [flat|nested] 3+ messages in thread
* Re: [PATCH] btrfs: balance: fix null-ptr-deref in btrfs_may_alloc_data_chunk
2026-03-16 13:46 [PATCH] btrfs: balance: fix null-ptr-deref in btrfs_may_alloc_data_chunk ZhengYuan Huang
@ 2026-03-16 20:46 ` Qu Wenruo
2026-03-17 0:09 ` ZhengYuan Huang
0 siblings, 1 reply; 3+ messages in thread
From: Qu Wenruo @ 2026-03-16 20:46 UTC (permalink / raw)
To: ZhengYuan Huang, dsterba, clm, bo.li.liu
Cc: linux-btrfs, linux-kernel, baijiaju1990, r33s3n6, zzzccc427,
stable
在 2026/3/17 00:16, ZhengYuan Huang 写道:
> [BUG]
> Running btrfs balance can trigger a null-ptr-deref before relocating a
> data chunk when metadata corruption leaves a chunk in the chunk tree
> without a corresponding block group in the in-memory cache:
>
> KASAN: null-ptr-deref in range [0x0000000000000088-0x000000000000008f]
> RIP: 0010:btrfs_may_alloc_data_chunk+0x40/0x1c0 fs/btrfs/volumes.c:3601
> Call Trace:
> __btrfs_balance fs/btrfs/volumes.c:4217 [inline]
> btrfs_balance+0x2516/0x42b0 fs/btrfs/volumes.c:4604
> btrfs_ioctl_balance fs/btrfs/ioctl.c:3577 [inline]
> btrfs_ioctl+0x25cf/0x5b90 fs/btrfs/ioctl.c:5313
> ...
>
> [CAUSE]
> __btrfs_balance() iterates the on-disk chunk tree and passes the chunk
> logical bytenr to btrfs_may_alloc_data_chunk() before relocating a data
> chunk. That helper then queries the in-memory block group cache:
>
> cache = btrfs_lookup_block_group(fs_info, chunk_offset);
> chunk_type = cache->flags; /* cache may be NULL */
>
> On a corrupt image can contain a chunk item whose matching block group
> item is missing, so no block group is ever inserted into the cache. In
> that case btrfs_lookup_block_group() returns NULL.
I'd say adding a proper chunk/bg mapping check is the root fix.
Or you'll need to adhoc a lot of null pointer checks.
>
> The code only guards this with ASSERT(cache), which becomes a no-op when
> CONFIG_BTRFS_ASSERT is disabled. The subsequent dereference of
> cache->flags therefore crashes the kernel.
>
> [FIX]
> Add a NULL check after btrfs_lookup_block_group() in
> btrfs_may_alloc_data_chunk(). If the lookup fails, emit a btrfs_err()
> message identifying the affected bytenr and return -EUCLEAN to report
> filesystem corruption instead of dereferencing NULL.
>
> The caller already treats negative returns from
> btrfs_may_alloc_data_chunk() as fatal errors, so balance aborts cleanly
> and reports the corruption to userspace.
>
> Fixes: a6f93c71d412 ("Btrfs: avoid losing data raid profile when deleting a device")
> Cc: stable@vger.kernel.org
> Signed-off-by: ZhengYuan Huang <gality369@gmail.com>
> ---
> fs/btrfs/volumes.c | 6 +++++-
> 1 file changed, 5 insertions(+), 1 deletion(-)
>
> diff --git a/fs/btrfs/volumes.c b/fs/btrfs/volumes.c
> index 4958e074d420..4657b826b48b 100644
> --- a/fs/btrfs/volumes.c
> +++ b/fs/btrfs/volumes.c
> @@ -3597,7 +3597,12 @@ static int btrfs_may_alloc_data_chunk(struct btrfs_fs_info *fs_info,
> u64 bytes_used;
> u64 chunk_type;
>
> cache = btrfs_lookup_block_group(fs_info, chunk_offset);
> - ASSERT(cache);
> + if (!cache) {
> + btrfs_err(fs_info,
> + "balance: chunk at bytenr %llu has no corresponding block group",
> + chunk_offset);
> + return -EUCLEAN;
> + }
> chunk_type = cache->flags;
> btrfs_put_block_group(cache);
>
^ permalink raw reply [flat|nested] 3+ messages in thread
* Re: [PATCH] btrfs: balance: fix null-ptr-deref in btrfs_may_alloc_data_chunk
2026-03-16 20:46 ` Qu Wenruo
@ 2026-03-17 0:09 ` ZhengYuan Huang
0 siblings, 0 replies; 3+ messages in thread
From: ZhengYuan Huang @ 2026-03-17 0:09 UTC (permalink / raw)
To: Qu Wenruo
Cc: dsterba, clm, bo.li.liu, linux-btrfs, linux-kernel, baijiaju1990,
r33s3n6, zzzccc427, stable
On Tue, Mar 17, 2026 at 4:46 AM Qu Wenruo <wqu@suse.com> wrote:
> I'd say adding a proper chunk/bg mapping check is the root fix.
>
> Or you'll need to adhoc a lot of null pointer checks.
Thank you for the reminder. A patch addressing the root cause of this
issue has already been submitted, but I am unsure if the solution is
fully appropriate and would appreciate further guidance. The relevant
patch series can be found here:
https://lore.kernel.org/all/20260314123741.1439792-1-gality369@gmail.com/.
Please take a look at the series.
Since you are not directly responsible for this, I didn't CC you on
the email to avoid causing any disruption. Apologies for any
inconvenience this may have caused.
Looking forward to your feedback,
ZhengYuan Huang
^ permalink raw reply [flat|nested] 3+ messages in thread
end of thread, other threads:[~2026-03-17 0:09 UTC | newest]
Thread overview: 3+ messages (download: mbox.gz follow: Atom feed
-- links below jump to the message on this page --
2026-03-16 13:46 [PATCH] btrfs: balance: fix null-ptr-deref in btrfs_may_alloc_data_chunk ZhengYuan Huang
2026-03-16 20:46 ` Qu Wenruo
2026-03-17 0:09 ` ZhengYuan Huang
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox