From: "Darrick J. Wong" <djwong@kernel.org>
To: Jeremy Bingham <jbingham@gmail.com>
Cc: linux-fsdevel@vger.kernel.org, linux-kernel@vger.kernel.org,
brauner@kernel.org, jkoolstra@xs4all.nl, jack@suse.cz,
hch@infradead.org, viro@zeniv.linux.org.uk,
syzkaller@googlegroups.com
Subject: Re: [PATCH v2 1/4] minix: add iomap infrastructure
Date: Wed, 1 Jul 2026 11:06:56 -0700 [thread overview]
Message-ID: <20260701180656.GB6507@frogsfrogsfrogs> (raw)
In-Reply-To: <041284c8aa6d5e73e834bac9842b91885d05b982.1782619718.git.jbingham@gmail.com>
On Sat, Jun 27, 2026 at 10:15:53PM -0700, Jeremy Bingham wrote:
> Add an iomap implementation for minix, adapted from the out-of-tree
> xiafs module's iomap code. The xiafs module, in turn, borrowed heavily
> from minix's own code.
>
> The iomap_begin function replaces get_block for the iomap path:
> it walks the indirect block tree, allocates blocks when needed, and
> returns the mapping in a struct iomap. The iomap_end function is a
> nop since minix has no extents or transactions to finalize.
>
> Because minix has two filesystem versions (V1 with 16-bit block
> numbers, V2/V3 with 32-bit) that share itree_common.c via #include,
> the iomap.c file follows the same pattern. It is #included into
> itree_v1.c and itree_v2.c so it can use the version-specific block_t,
> block_to_cpu, and cpu_to_block definitions directly.
>
> Each version gets its own iomap_ops struct (V1_minix_iomap_ops,
> V2_minix_iomap_ops) with thin wrappers around the shared
> minix_iomap_begin/minix_iomap_end. A minix_iomap_ops_ver() helper
> selects the correct ops based on the filesystem version.
>
> minix_get_block is exported from inode.c for use by the iomap
> writeback path and the directory aops that still use buffer_heads.
>
> Signed-off-by: Jeremy Bingham <jbingham@gmail.com>
> ---
> fs/minix/iomap.c | 114 ++++++++++++++++++++++++++++++++++++++++++++
> fs/minix/itree_v1.c | 25 +++++++++-
> fs/minix/itree_v2.c | 17 ++++++-
> fs/minix/minix.h | 23 ++++++++-
> 4 files changed, 175 insertions(+), 4 deletions(-)
> create mode 100644 fs/minix/iomap.c
>
> diff --git a/fs/minix/iomap.c b/fs/minix/iomap.c
> new file mode 100644
> index 000000000000..7bb0439e3669
> --- /dev/null
> +++ b/fs/minix/iomap.c
> @@ -0,0 +1,114 @@
> +// SPDX-License-Identifier: GPL-2.0-only
> +/*
> + * iomap functions for minix. At least the first pass of this file was taken
> + * from the xiafs iomap.c, which is fitting since the xiafs module in turn
> + * borrowed heavily from the modernized minix fs kernel module.
Is this the "modernized" minix fs kernel module??
xiafs hasn't been in the kernel for years; I see little point in
mentioning an externally maintained driver inside the kernel source.
> + */
> +
> +/*
> + * minix_iomap_begin - map a file range to disk blocks. It acts as a replacement
> + * for get_block in itree_common.c, at least in the important ways, and is
> + * adapted from it, but it uses iomap instead of buffer_head. This is taken
> + * directly from the out-of-tree xiafs iomap changes, and the exfat iomap
> + * changes were an inspiration for that.
> + */
> +static int minix_iomap_begin(struct inode *inode, loff_t offset, loff_t length,
> + unsigned int flags, struct iomap *iomap, struct iomap *srcmap)
> +{
> + struct super_block *sb = inode->i_sb;
> + unsigned int blkbits = sb->s_blocksize_bits;
> + sector_t iblock = offset >> blkbits;
> + int create = flags & IOMAP_WRITE;
> +
> + /* Mostly taken from modern-xiafs itree.c get_block with elements from
> + * similar exfat operations.
> + */
> + int offsets[DEPTH];
> + Indirect chain[DEPTH];
> + Indirect *partial;
> + int depth = block_to_path(inode, iblock, offsets);
> + int left;
> + int err = -EIO;
> +
> + sector_t phys;
> +
> + /* block is beyond max file size */
> + if (depth == 0)
> + goto out;
> +
> + iomap->bdev = inode->i_sb->s_bdev;
> +
> +reread:
> + partial = get_branch(inode, depth, offsets, chain, &err);
> +
> + /* Simplest case - block found, no allocation needed */
> + if (!partial) {
> + /* Bit of a weird order, but it'll make sense when you get to
> + * the bottom.
> + */
> + iomap->flags = IOMAP_F_MERGED;
> +got_it:
> + phys = block_to_cpu(chain[depth - 1].key);
> + partial = chain+depth-1;
> + /* Set up the iomap struct before cleaning up */
> + iomap->type = IOMAP_MAPPED;
> + iomap->addr = (u64)phys << blkbits;
> + iomap->length = 1 << blkbits;
> + iomap->offset = (u64)iblock << blkbits;
> + goto cleanup;
> + }
> +
> + /* Next simple case - plain lookup or failed read of indirect block */
> + if (!create || err == -EIO) {
> + iomap->type = IOMAP_HOLE;
> + iomap->addr = IOMAP_NULL_ADDR;
> + iomap->length = 1 << blkbits;
> + iomap->offset = (u64)iblock << blkbits;
> + iomap->flags = 0;
> +cleanup:
> + while (partial > chain) {
> + brelse(partial->bh);
> + partial--;
> + }
Uhhh ... these three ought to be factored out into separate helpers to
(a) clean up @chain, (b) set up a IOMAP_MAPPED mapping, and (c) set up a
IOMAP_HOLE mapping. Don't write a pile of spaghetti goes.
Also kinda surprised that you set IOMAP_F_MERGED but don't try to look
forward to find adjacent blocks?
> +out:
> + return err;
> + }
> +
> + /*
> + * Indirect block might be removed by truncate while we were
> + * reading it. Handling of that case (forget what we've got and
> + * reread) is taken out of the main path.
/me wonders how we'd be racing with setattr?
--D
> + */
> + if (err == -EAGAIN)
> + goto changed;
> +
> + left = (chain + depth) - partial;
> + err = alloc_branch(inode, left, offsets + (partial - chain), partial);
> + if (err)
> + goto cleanup;
> +
> + if (splice_branch(inode, chain, partial, left) < 0)
> + goto changed;
> +
> + /* Successful allocation, mapping it. */
> + iomap->flags = IOMAP_F_NEW;
> + goto got_it;
> +
> +changed:
> + while (partial > chain) {
> + brelse(partial->bh);
> + partial--;
> + }
> + goto reread;
> +}
> +
> +/*
> + * minix_iomap_end ends up being a nop; since minix doesn't have any extents or
> + * transactions to worry about, there isn't anything to update here. The on-disk
> + * indirect blocks get dirtied in minix_iomap_begin.
> + */
> +static int minix_iomap_end(struct inode *inode, loff_t offset, loff_t length,
> + ssize_t written, unsigned int flags, struct iomap *iomap)
> +{
> + return 0;
> +}
> diff --git a/fs/minix/itree_v1.c b/fs/minix/itree_v1.c
> index 1fed906042aa..58c29f4443d3 100644
> --- a/fs/minix/itree_v1.c
> +++ b/fs/minix/itree_v1.c
> @@ -49,6 +49,18 @@ static int block_to_path(struct inode * inode, long block, int offsets[DEPTH])
> }
>
> #include "itree_common.c"
> +/* NOTA BENE:
> + *
> + * This is icky to me, but at the same time having it be a standalone C file
> + * that's compiled to object form and linked separately like it is in xiafs is
> + * much nastier in minix because of the different versions of the minix fs that
> + * have some very, very different aspects, like the size of block_t. I don't
> + * like it, but since minix already has this pattern where a common itree file
> + * is included in the itree_v1 and itree_v2(and v3) files, I'm including iomap.c
> + * in these files as well. It does at least avoid exporting some currently
> + * static functions that aren't needed anywhere but itree_common.c and iomap.c.
> + */
> +#include "iomap.c"
>
> int V1_minix_get_block(struct inode * inode, long block,
> struct buffer_head *bh_result, int create)
> @@ -61,7 +73,18 @@ void V1_minix_truncate(struct inode * inode)
> truncate(inode);
> }
>
> -unsigned V1_minix_blocks(loff_t size, struct super_block *sb)
> +unsigned int V1_minix_blocks(loff_t size, struct super_block *sb)
> {
> return nblocks(size, sb);
> }
> +
> +int V1_minix_iomap_begin(struct inode *inode, loff_t offset, loff_t length,
> + unsigned int flags, struct iomap *iomap, struct iomap *srcmap)
> +{
> + return minix_iomap_begin(inode, offset, length, flags, iomap, srcmap);
> +}
> +
> +const struct iomap_ops V1_minix_iomap_ops = {
> + .iomap_begin = V1_minix_iomap_begin,
> + .iomap_end = minix_iomap_end,
> +};
> diff --git a/fs/minix/itree_v2.c b/fs/minix/itree_v2.c
> index 9d00f31a2d9d..fc7a5ae8fa1c 100644
> --- a/fs/minix/itree_v2.c
> +++ b/fs/minix/itree_v2.c
> @@ -57,6 +57,10 @@ static int block_to_path(struct inode * inode, long block, int offsets[DEPTH])
> }
>
> #include "itree_common.c"
> +/* See the note in itree_v1 in a comment that starts "NOTA BENE" for an
> + * explanation for why iomap.c is included here.
> + */
> +#include "iomap.c"
>
> int V2_minix_get_block(struct inode * inode, long block,
> struct buffer_head *bh_result, int create)
> @@ -69,7 +73,18 @@ void V2_minix_truncate(struct inode * inode)
> truncate(inode);
> }
>
> -unsigned V2_minix_blocks(loff_t size, struct super_block *sb)
> +unsigned int V2_minix_blocks(loff_t size, struct super_block *sb)
> {
> return nblocks(size, sb);
> }
> +
> +int V2_minix_iomap_begin(struct inode *inode, loff_t offset, loff_t length,
> + unsigned int flags, struct iomap *iomap, struct iomap *srcmap)
> +{
> + return minix_iomap_begin(inode, offset, length, flags, iomap, srcmap);
> +}
> +
> +const struct iomap_ops V2_minix_iomap_ops = {
> + .iomap_begin = V2_minix_iomap_begin,
> + .iomap_end = minix_iomap_end,
> +};
> diff --git a/fs/minix/minix.h b/fs/minix/minix.h
> index f2025c9b5825..77e503cca97f 100644
> --- a/fs/minix/minix.h
> +++ b/fs/minix/minix.h
> @@ -5,6 +5,7 @@
> #include <linux/fs.h>
> #include <linux/pagemap.h>
> #include <linux/minix_fs.h>
> +#include <linux/iomap.h>
>
> #define INODE_VERSION(inode) minix_sb(inode->i_sb)->s_version
> #define MINIX_V1 0x0001 /* original minix fs */
> @@ -69,6 +70,8 @@ extern int V1_minix_get_block(struct inode *, long, struct buffer_head *, int);
> extern int V2_minix_get_block(struct inode *, long, struct buffer_head *, int);
> extern unsigned V1_minix_blocks(loff_t, struct super_block *);
> extern unsigned V2_minix_blocks(loff_t, struct super_block *);
> +extern int minix_get_block(struct inode *inode, sector_t block,
> + struct buffer_head *bh_result, int create);
>
> struct minix_dir_entry *minix_find_entry(struct dentry *, struct folio **);
> int minix_add_link(struct dentry*, struct inode*);
> @@ -80,10 +83,20 @@ int minix_set_link(struct minix_dir_entry *de, struct folio *folio,
> struct minix_dir_entry *minix_dotdot(struct inode*, struct folio **);
> ino_t minix_inode_by_name(struct dentry*);
>
> +extern int V1_minix_iomap_begin(struct inode *inode, loff_t offset,
> + loff_t length, unsigned int flags, struct iomap *iomap,
> + struct iomap *srcmap);
> +extern int V2_minix_iomap_begin(struct inode *inode, loff_t offset,
> + loff_t length, unsigned int flags, struct iomap *iomap,
> + struct iomap *srcmap);
> +
> +extern const struct address_space_operations minix_aops;
> extern const struct inode_operations minix_file_inode_operations;
> extern const struct inode_operations minix_dir_inode_operations;
> extern const struct file_operations minix_file_operations;
> extern const struct file_operations minix_dir_operations;
> +extern const struct iomap_ops V1_minix_iomap_ops;
> +extern const struct iomap_ops V2_minix_iomap_ops;
>
> static inline struct minix_sb_info *minix_sb(struct super_block *sb)
> {
> @@ -95,11 +108,17 @@ static inline struct minix_inode_info *minix_i(struct inode *inode)
> return container_of(inode, struct minix_inode_info, vfs_inode);
> }
>
> -static inline unsigned minix_blocks_needed(unsigned bits, unsigned blocksize)
> +static inline unsigned int minix_blocks_needed(unsigned int bits, unsigned int blocksize)
> {
> return DIV_ROUND_UP(bits, blocksize * 8);
> }
>
> +static inline const struct iomap_ops *minix_iomap_ops_ver(struct inode *inode)
> +{
> + return (INODE_VERSION(inode) == MINIX_V1) ?
> + &V1_minix_iomap_ops : &V2_minix_iomap_ops;
> +}
> +
> #if defined(CONFIG_MINIX_FS_NATIVE_ENDIAN) && \
> defined(CONFIG_MINIX_FS_BIG_ENDIAN_16BIT_INDEXED)
>
> @@ -129,7 +148,7 @@ static inline unsigned minix_blocks_needed(unsigned bits, unsigned blocksize)
> * big-endian 16bit indexed bitmaps
> */
>
> -static inline int minix_find_first_zero_bit(const void *vaddr, unsigned size)
> +static inline int minix_find_first_zero_bit(const void *vaddr, unsigned int size)
> {
> const unsigned short *p = vaddr, *addr = vaddr;
> unsigned short num;
> --
> 2.47.3
>
>
next prev parent reply other threads:[~2026-07-01 18:06 UTC|newest]
Thread overview: 17+ messages / expand[flat|nested] mbox.gz Atom feed top
2026-06-28 5:15 [PATCH v2 0/4] minix: convert to iomap and add direct I/O Jeremy Bingham
2026-06-28 5:15 ` [PATCH v2 1/4] minix: add iomap infrastructure Jeremy Bingham
2026-07-01 18:06 ` Darrick J. Wong [this message]
2026-07-01 18:32 ` Jeremy Bingham
2026-06-28 5:15 ` [PATCH v2 2/4] minix: convert address space operations to iomap Jeremy Bingham
2026-07-01 18:14 ` Darrick J. Wong
2026-07-01 18:37 ` Jeremy Bingham
2026-07-01 18:47 ` Darrick J. Wong
2026-06-28 5:15 ` [PATCH v2 3/4] minix: convert file operations to iomap and add Jeremy Bingham
2026-07-01 18:21 ` Darrick J. Wong
2026-07-01 18:42 ` Jeremy Bingham
2026-06-28 5:15 ` [PATCH v2 4/4] minix: fix symlink and truncate for iomap Jeremy Bingham
2026-06-29 5:27 ` [syzbot ci] Re: minix: convert to iomap and add direct I/O syzbot ci
2026-06-30 3:05 ` Jeremy Bingham
2026-06-30 4:05 ` syzbot ci
2026-07-01 18:00 ` [PATCH v2 0/4] " Darrick J. Wong
2026-07-02 18:42 ` Jeremy Bingham
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=20260701180656.GB6507@frogsfrogsfrogs \
--to=djwong@kernel.org \
--cc=brauner@kernel.org \
--cc=hch@infradead.org \
--cc=jack@suse.cz \
--cc=jbingham@gmail.com \
--cc=jkoolstra@xs4all.nl \
--cc=linux-fsdevel@vger.kernel.org \
--cc=linux-kernel@vger.kernel.org \
--cc=syzkaller@googlegroups.com \
--cc=viro@zeniv.linux.org.uk \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox