public inbox for linux-xfs@vger.kernel.org
 help / color / mirror / Atom feed
From: Dave Chinner <david@fromorbit.com>
To: "Darrick J. Wong" <djwong@kernel.org>
Cc: linux-xfs@vger.kernel.org, cem@kernel.org,
	Christoph Hellwig <hch@lst.de>
Subject: Re: [RFC] xfs: opting in or out of online repair
Date: Thu, 25 Jul 2024 12:05:26 +1000	[thread overview]
Message-ID: <ZqGy5qcZAbHtY61r@dread.disaster.area> (raw)
In-Reply-To: <20240724213852.GA612460@frogsfrogsfrogs>

On Wed, Jul 24, 2024 at 02:38:52PM -0700, Darrick J. Wong wrote:
> Hi everyone,
> 
> Over in this thread:
> https://lore.kernel.org/linux-xfs/20240724131825.GA12250@lst.de/
> 
> I originally proposed turning on background xfs_scrub by default in
> Debian.  Christoph wondered if it was such a good idea to unleash this
> upon hoards of debian sid users, considering that the introduction of
> e2scrub by default didn't go entirely smoothly.  Debian policy is to
> enable packaged services at installation time, which is reasonable for
> httpd but perhaps not so much for experimental services.
> 
> I proposed modifying xfsprogs so that for Debian builds, it would spit
> out a separate "xfsprogs-self-healing" package containing only the files
> needed to activate the background scrub services by default.  People
> would have to opt-in to installing it... insofar as one can configure
> apt to pull in Suggests: and Recommends: packages.
> 
> Christoph suggested adding a compat flag to the superblock so that
> sysadmins can mark filesystems for (or against) background online
> repair.  Therefore, I'm proposing adding such a flag.  Before I start
> any work on that, I thought I'd sketch out what I'd add to make that
> happen:
> 
> On Disk Format Changes
> ======================
> 
> Add a new COMPAT flag XFS_SB_FEAT_COMPAT_SELF_HEALING.  If set,
> the background xfs_scrub_all service can periodically scan the
> filesystem and initiate repairs.  If not set, then background repairs
> are not allowed.

Maybe I'm missing something important - this doesn't feel like
on-disk format stuff. Why would having online repair enabled make
the fileystem unmountable on older kernels?

Hmmm. Could this be implemented with an xattr on the root inode
that says "self healing allowed"?

That way installing the xfsprogs-self-healing package can turn
itself on at install/upgrade time when we decide that online reapir
is no longer experimental without on-disk format implications. In
the mean time we could add an xfs_admin command to set/clear the
xattr on the mounted filesystem for the admin to chose to enable it.

> Note that administrator-initated scans (e.g. invoking xfs_scrub from the
> CLI) would not be blocked by this flag.
> 
> Question: Should this compat flag control background scrubs as well?

Probably. scrub is less intrusive, but I can see people wanting to
avoid it because it can have a perf impact. Could this be done with
a different xattr on the root inode?

> Userspace ABI Changes
> =====================

Answers from here are purely about mkfs based compat bit, though I'd
think this is unnecesary if we use an xattr.

> The first thing is adding a new CLI option to mkfs.xfs to set the value
> of the flag:
> 
> # mkfs.xfs -m self_healing=1
> 
> Question: If rmap and parent pointers are turned on, should we turn on
> self healing by default?

No.

> Question: If self healing is turned on, should we turn on rmap and
> parent pointers as well?

Yes.

> For xfs_scrub_all, we'd need a way to query the status of the self
> healing bit; this could be snuck into the xfs geometry ioctl as a new
> feature bit XFS_FSOP_GEOM_FLAGS_SELF_HEALING.
> 
> # xfs_info /dev/sdf
> meta-data=/dev/sdf     isize=512    agcount=1, agsize=268435455 blks
>          =             sectsz=512   attr=2, projid32bit=1
>          =             crc=1        finobt=1, sparse=1, rmapbt=1
>          =             reflink=1    bigtime=1 inobtcount=1 nrext64=1
>          =             exchange=0   metadir=0 self_healing=1
> data     =             bsize=4096   blocks=5192704, imaxpct=25
>          =             sunit=0      swidth=0 blks
> naming   =version 2    bsize=4096   ascii-ci=0, ftype=1, parent=0
> log      =internal log bsize=4096   blocks=16384, version=2
>          =             sectsz=512   sunit=0 blks, lazy-count=1
> realtime =none         extsz=4096   blocks=0, rtextents=0
>          =             rgcount=0    rgsize=0 extents

OK.

> I think we also want to enable sysadmins to change this on the fly.
> For mounted filesystems, this could be done through an administrative
> ioctl or mount options.  Offline, this would become another xfs_admin
> command.
> 
> # xfs_spaceman -c 'self_healing on' /usr
> # xfs_spaceman -c 'self_healing off' /tmp/extraspace
> 
> # xfs_admin --self-healing=on /dev/sda1
> # xfs_admin --self-healing=off /dev/sda2

I'd prefer the primary admin to be done through an xfs_admin
command.

> # mount /dev/sda2 /mnt -o self_healing
> 
> Thoughts?  Flames?

This really seems like information that userspace can maintain on
the filesystem itself, not something that belongs in the on-disk
format for the filesystem. Implementing it as an xattr the
xfsprogs-self-healing package maintains means no
kernel, mkfs, spaceman, db or repair changes are needed.

-Dave.
-- 
Dave Chinner
david@fromorbit.com

  reply	other threads:[~2024-07-25  2:05 UTC|newest]

Thread overview: 8+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2024-07-24 21:38 [RFC] xfs: opting in or out of online repair Darrick J. Wong
2024-07-25  2:05 ` Dave Chinner [this message]
2024-07-25 14:14   ` Christoph Hellwig
2024-07-25 22:33     ` Dave Chinner
2024-07-26  0:41       ` Darrick J. Wong
2024-07-26  1:15         ` Dave Chinner
2024-07-26 13:59         ` Christoph Hellwig
2024-07-26 15:15           ` Darrick J. Wong

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=ZqGy5qcZAbHtY61r@dread.disaster.area \
    --to=david@fromorbit.com \
    --cc=cem@kernel.org \
    --cc=djwong@kernel.org \
    --cc=hch@lst.de \
    --cc=linux-xfs@vger.kernel.org \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox