* Rebuild mode for tail-pack layouts
@ 2026-04-30 12:09 xtex
2026-05-04 15:13 ` Gao Xiang
0 siblings, 1 reply; 5+ messages in thread
From: xtex @ 2026-04-30 12:09 UTC (permalink / raw)
To: linux-erofs
Hi!
In erofs_rebuild_write_blob_index (rebuild.c), only EROFS_INODE_CHUNK_BASED
and FLAT_PLAIN are implemented, so when generating a metadata index with
rebuild mode, the sources cannot use tail-pack nor inline data layout.
However, disabling tail-packing can lead to great disk-space waste in many
cases, especially when the file-system consists of a lot of small files.
Thus I attempted to implement FLAT_INLINE for it, only to realize that the
current chunk entry formats can only represent physical addresses that are
block-aligned while tail-pack extent is not.
I wonder what do you think about adding a new chunk entry format? And how
should it be named?
I would suggest the following structure:
struct erofs_inode_chunk_index_tp {
__le16 startblk_hi; /* starting block number MSB */
__le16 device_id; /* back-end storage id (with bits masked)
*/
__le32 startblk_lo; /* starting block number of this chunk */
/* new fields below */
__le16 startblk_off; /* starting block offset */
__le16 reserved;
} __packed;
The 16b offset should be enough unless we are to support block size > 64K.
The reserved field is added for alignment.
Best wishes.
--
xtex (Bingwu Zhang, a.k.a Zerogawa Chiaki) @ Thu, 30 Apr 2026 11:45:17 +0000
^ permalink raw reply [flat|nested] 5+ messages in thread
* Re: Rebuild mode for tail-pack layouts
2026-04-30 12:09 Rebuild mode for tail-pack layouts xtex
@ 2026-05-04 15:13 ` Gao Xiang
2026-05-09 8:37 ` Bingwu Zhang
0 siblings, 1 reply; 5+ messages in thread
From: Gao Xiang @ 2026-05-04 15:13 UTC (permalink / raw)
To: xtex, linux-erofs
Hi xtex,
On 2026/4/30 20:09, xtex wrote:
> Hi!
>
> In erofs_rebuild_write_blob_index (rebuild.c), only EROFS_INODE_CHUNK_BASED
> and FLAT_PLAIN are implemented, so when generating a metadata index with
> rebuild mode, the sources cannot use tail-pack nor inline data layout.
> However, disabling tail-packing can lead to great disk-space waste in many
> cases, especially when the file-system consists of a lot of small files.
>
> Thus I attempted to implement FLAT_INLINE for it, only to realize that the
> current chunk entry formats can only represent physical addresses that are
> block-aligned while tail-pack extent is not.
>
> I wonder what do you think about adding a new chunk entry format? And how
> should it be named?
>
> I would suggest the following structure:
> struct erofs_inode_chunk_index_tp {
> __le16 startblk_hi; /* starting block number MSB */
> __le16 device_id; /* back-end storage id (with bits masked)
> */
> __le32 startblk_lo; /* starting block number of this chunk */
> /* new fields below */
> __le16 startblk_off; /* starting block offset */
> __le16 reserved;
> } __packed;
> The 16b offset should be enough unless we are to support block size > 64K.
> The reserved field is added for alignment.
Sorry about the late response.
Thanks for the question.
FLAT_INLINE can be used for index rebuilding, which can work with
uniaddr (or mapped_blkaddr) since the blkaddr will be mapped
into the relative address based on the blob starting with
mapped_blkaddr:
https://erofs.docs.kernel.org/en/latest/ondisk/chunked_format.html#device-table
But I agree the expression in the page above is a bit
ambigious through.
Thanks,
Gao Xiang
>
> Best wishes.
>
^ permalink raw reply [flat|nested] 5+ messages in thread
* Re: Rebuild mode for tail-pack layouts
2026-05-04 15:13 ` Gao Xiang
@ 2026-05-09 8:37 ` Bingwu Zhang
2026-05-09 10:03 ` Gao Xiang
0 siblings, 1 reply; 5+ messages in thread
From: Bingwu Zhang @ 2026-05-09 8:37 UTC (permalink / raw)
To: linux-erofs; +Cc: Gao Xiang
Hi,
On Monday, May 4, 2026 11:13:23 PM China Standard Time Gao Xiang wrote:
> Hi xtex,
>
> On 2026/4/30 20:09, xtex wrote:
> > Hi!
> >
> > In erofs_rebuild_write_blob_index (rebuild.c), only
> > EROFS_INODE_CHUNK_BASED
> > and FLAT_PLAIN are implemented, so when generating a metadata index with
> > rebuild mode, the sources cannot use tail-pack nor inline data layout.
> > However, disabling tail-packing can lead to great disk-space waste in many
> > cases, especially when the file-system consists of a lot of small files.
> >
> > Thus I attempted to implement FLAT_INLINE for it, only to realize that the
> > current chunk entry formats can only represent physical addresses that are
> > block-aligned while tail-pack extent is not.
> >
> > I wonder what do you think about adding a new chunk entry format? And how
> > should it be named?
> >
> > I would suggest the following structure:
> > struct erofs_inode_chunk_index_tp {
> >
> > __le16 startblk_hi; /* starting block number MSB */
> > __le16 device_id; /* back-end storage id (with bits masked)
> >
> > */
> >
> > __le32 startblk_lo; /* starting block number of this chunk */
> > /* new fields below */
> > __le16 startblk_off; /* starting block offset */
> > __le16 reserved;
> >
> > } __packed;
> > The 16b offset should be enough unless we are to support block size > 64K.
> > The reserved field is added for alignment.
>
> Sorry about the late response.
>
> Thanks for the question.
>
> FLAT_INLINE can be used for index rebuilding, which can work with
> uniaddr (or mapped_blkaddr) since the blkaddr will be mapped
> into the relative address based on the blob starting with
> mapped_blkaddr:
>
> https://erofs.docs.kernel.org/en/latest/ondisk/chunked_format.html#device-ta
> ble
>
> But I agree the expression in the page above is a bit
> ambigious through.
>
> Thanks,
> Gao Xiang
>
> > Best wishes.
Sorry for the late response and thanks for your answer.
I am sorry about that I didn't get it.
In __erofs_map_blocks:
> map->m_pa = erofs_pos(sbi, startblk);
and
> #define erofs_pos(sbi, nr) ((erofs_off_t)(nr) << (sbi)->blkszbits)
It seems like that the mapped PA is always block-aligned? However, the last
chunk of inline data is not?
Best wishes.
--
xtex (a.k.a. Bingwu Zhang) @ Sat, 09 May 2026 08:24:53 +0000
^ permalink raw reply [flat|nested] 5+ messages in thread
* Re: Rebuild mode for tail-pack layouts
2026-05-09 8:37 ` Bingwu Zhang
@ 2026-05-09 10:03 ` Gao Xiang
2026-05-09 14:29 ` Bingwu Zhang
0 siblings, 1 reply; 5+ messages in thread
From: Gao Xiang @ 2026-05-09 10:03 UTC (permalink / raw)
To: Bingwu Zhang; +Cc: linux-erofs, Gao Xiang
On Sat, May 09, 2026 at 04:37:43PM +0800, Bingwu Zhang wrote:
> Hi,
>
> On Monday, May 4, 2026 11:13:23 PM China Standard Time Gao Xiang wrote:
> > Hi xtex,
> >
> > On 2026/4/30 20:09, xtex wrote:
> > > Hi!
> > >
> > > In erofs_rebuild_write_blob_index (rebuild.c), only
> > > EROFS_INODE_CHUNK_BASED
> > > and FLAT_PLAIN are implemented, so when generating a metadata index with
> > > rebuild mode, the sources cannot use tail-pack nor inline data layout.
> > > However, disabling tail-packing can lead to great disk-space waste in many
> > > cases, especially when the file-system consists of a lot of small files.
> > >
> > > Thus I attempted to implement FLAT_INLINE for it, only to realize that the
> > > current chunk entry formats can only represent physical addresses that are
> > > block-aligned while tail-pack extent is not.
> > >
> > > I wonder what do you think about adding a new chunk entry format? And how
> > > should it be named?
> > >
> > > I would suggest the following structure:
> > > struct erofs_inode_chunk_index_tp {
> > >
> > > __le16 startblk_hi; /* starting block number MSB */
> > > __le16 device_id; /* back-end storage id (with bits masked)
> > >
> > > */
> > >
> > > __le32 startblk_lo; /* starting block number of this chunk */
> > > /* new fields below */
> > > __le16 startblk_off; /* starting block offset */
> > > __le16 reserved;
> > >
> > > } __packed;
> > > The 16b offset should be enough unless we are to support block size > 64K.
> > > The reserved field is added for alignment.
> >
> > Sorry about the late response.
> >
> > Thanks for the question.
> >
> > FLAT_INLINE can be used for index rebuilding, which can work with
> > uniaddr (or mapped_blkaddr) since the blkaddr will be mapped
> > into the relative address based on the blob starting with
> > mapped_blkaddr:
> >
> > https://erofs.docs.kernel.org/en/latest/ondisk/chunked_format.html#device-ta
> > ble
> >
> > But I agree the expression in the page above is a bit
> > ambigious through.
> >
> > Thanks,
> > Gao Xiang
> >
> > > Best wishes.
>
> Sorry for the late response and thanks for your answer.
>
> I am sorry about that I didn't get it.
> In __erofs_map_blocks:
> > map->m_pa = erofs_pos(sbi, startblk);
> and
> > #define erofs_pos(sbi, nr) ((erofs_off_t)(nr) << (sbi)->blkszbits)
> It seems like that the mapped PA is always block-aligned? However, the last
> chunk of inline data is not?
Yes, sorry I didn't express explicitly.
I mean the main data except the trailing inline data part
can be remapped into another external blob.
But inline data should be kept with the metadata; otherwise it
won't be called _inline data_ anymore.
Or do you have a case we have to redirect the inline data?
Thanks,
Gao Xiang
>
> Best wishes.
>
> --
> xtex (a.k.a. Bingwu Zhang) @ Sat, 09 May 2026 08:24:53 +0000
>
>
>
>
^ permalink raw reply [flat|nested] 5+ messages in thread
* Re: Rebuild mode for tail-pack layouts
2026-05-09 10:03 ` Gao Xiang
@ 2026-05-09 14:29 ` Bingwu Zhang
0 siblings, 0 replies; 5+ messages in thread
From: Bingwu Zhang @ 2026-05-09 14:29 UTC (permalink / raw)
To: linux-erofs, Gao Xiang
On Saturday, May 9, 2026 6:03:15 PM China Standard Time Gao Xiang wrote:
> On Sat, May 09, 2026 at 04:37:43PM +0800, Bingwu Zhang wrote:
> > Hi,
> >
> > On Monday, May 4, 2026 11:13:23 PM China Standard Time Gao Xiang wrote:
> > > Hi xtex,
> > >
> > > On 2026/4/30 20:09, xtex wrote:
> > > > Hi!
> > > >
> > > > In erofs_rebuild_write_blob_index (rebuild.c), only
> > > > EROFS_INODE_CHUNK_BASED
> > > > and FLAT_PLAIN are implemented, so when generating a metadata index
> > > > with
> > > > rebuild mode, the sources cannot use tail-pack nor inline data layout.
> > > > However, disabling tail-packing can lead to great disk-space waste in
> > > > many
> > > > cases, especially when the file-system consists of a lot of small
> > > > files.
> > > >
> > > > Thus I attempted to implement FLAT_INLINE for it, only to realize that
> > > > the
> > > > current chunk entry formats can only represent physical addresses that
> > > > are
> > > > block-aligned while tail-pack extent is not.
> > > >
> > > > I wonder what do you think about adding a new chunk entry format? And
> > > > how
> > > > should it be named?
> > > >
> > > > I would suggest the following structure:
> > > > struct erofs_inode_chunk_index_tp {
> > > >
> > > > __le16 startblk_hi; /* starting block number MSB */
> > > > __le16 device_id; /* back-end storage id (with bits masked)
> > > >
> > > > */
> > > >
> > > > __le32 startblk_lo; /* starting block number of this chunk */
> > > > /* new fields below */
> > > > __le16 startblk_off; /* starting block offset */
> > > > __le16 reserved;
> > > >
> > > > } __packed;
> > > > The 16b offset should be enough unless we are to support block size >
> > > > 64K.
> > > > The reserved field is added for alignment.
> > >
> > > Sorry about the late response.
> > >
> > > Thanks for the question.
> > >
> > > FLAT_INLINE can be used for index rebuilding, which can work with
> > > uniaddr (or mapped_blkaddr) since the blkaddr will be mapped
> > > into the relative address based on the blob starting with
> > > mapped_blkaddr:
> > >
> > > https://erofs.docs.kernel.org/en/latest/ondisk/chunked_format.html#devic
> > > e-ta ble
> > >
> > > But I agree the expression in the page above is a bit
> > > ambigious through.
> > >
> > > Thanks,
> > > Gao Xiang
> > >
> > > > Best wishes.
> >
> > Sorry for the late response and thanks for your answer.
> >
> > I am sorry about that I didn't get it.
> >
> > In __erofs_map_blocks:
> > > map->m_pa = erofs_pos(sbi, startblk);
> >
> > and
> >
> > > #define erofs_pos(sbi, nr) ((erofs_off_t)(nr) << (sbi)->blkszbits)
> >
> > It seems like that the mapped PA is always block-aligned? However, the
> > last
> > chunk of inline data is not?
>
> Yes, sorry I didn't express explicitly.
>
> I mean the main data except the trailing inline data part
> can be remapped into another external blob.
>
> But inline data should be kept with the metadata; otherwise it
> won't be called _inline data_ anymore.
>
> Or do you have a case we have to redirect the inline data?
>
> Thanks,
> Gao Xiang
>
> > Best wishes.
Ah! Thanks for your explanation. I initially thought that copying inline data
would increase the size of the metadata. Thanks!
Best wishes.
--
xtex (a.k.a. Bingwu Zhang) @ Sat, 09 May 2026 14:27:41 +0000
^ permalink raw reply [flat|nested] 5+ messages in thread
end of thread, other threads:[~2026-05-09 14:29 UTC | newest]
Thread overview: 5+ messages (download: mbox.gz follow: Atom feed
-- links below jump to the message on this page --
2026-04-30 12:09 Rebuild mode for tail-pack layouts xtex
2026-05-04 15:13 ` Gao Xiang
2026-05-09 8:37 ` Bingwu Zhang
2026-05-09 10:03 ` Gao Xiang
2026-05-09 14:29 ` Bingwu Zhang
This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.