All of lore.kernel.org
 help / color / mirror / Atom feed
From: Christoph Hellwig <hch@lst.de>
To: "Darrick J. Wong" <djwong@kernel.org>
Cc: Christoph Hellwig <hch@lst.de>, Carlos Maiolino <cem@kernel.org>,
	Hans Holmberg <hans.holmberg@wdc.com>,
	linux-xfs@vger.kernel.org
Subject: Re: [PATCH 25/43] xfs: add support for zoned space reservations
Date: Sun, 15 Dec 2024 06:31:35 +0100	[thread overview]
Message-ID: <20241215053135.GE10051@lst.de> (raw)
In-Reply-To: <20241213210140.GO6678@frogsfrogsfrogs>

On Fri, Dec 13, 2024 at 01:01:40PM -0800, Darrick J. Wong wrote:
> > +#define XFS_ZR_GREEDY		(1U << 0)
> > +#define XFS_ZR_NOWAIT		(1U << 1)
> > +#define XFS_ZR_RESERVED		(1U << 2)
> 
> What do these flag values mean?  Can we put that into comments?

Sure.

> > + * For XC_FREE_RTAVAILABLE only the smaller reservation required for GC and
> > + * block zeroing is excluded from the user capacity, while XC_FREE_RTEXTENTS
> > + * is further restricted by at least one zone as well as the optional
> > + * persistently reserved blocks.  This allows the allocator to run more
> > + * smoothly by not always triggering GC.
> 
> Hmm, so _RTAVAILABLE really means _RTNOGC?  That makes sense.

Yes, it means block available without doing further work.
I can't say _RTNOGC is very descriptive either, but I would not mind
a better name if someone came up with a good one :)

> > +		spin_unlock(&zi->zi_reservation_lock);
> > +		schedule();
> > +		spin_lock(&zi->zi_reservation_lock);
> > +	}
> > +	list_del(&reservation.entry);
> > +	spin_unlock(&zi->zi_reservation_lock);
> 
> Hmm.  So if I'm understanding correctly, threads wanting to write to a
> file try to locklessly reserve space from RTAVAILABLE.

At least if there are no waiters yet, yes.

> If they can't
> get space because the zone is nearly full / needs gc / etc then everyone
> gets to wait FIFO style in the reclaim_reservations list.

Yes (In a way modelled after the log grant waits).

> They can be
> woken up from the wait if either (a) someone gives back reserved space
> or (b) the copygc empties out this zone.
> 
> Or if the thread isn't willing to wait, we skip the fifo and either fail
> up to userspace

Yes.

> or just move on to the next zone?

No other zone to move to.

> I think I understand the general idea, but I don't quite know when we're
> going to use the greedy algorithm.  Later I see XFS_ZR_GREEDY gets used
> from the buffered write path, but there doesn't seem to be an obvious
> reason why?

Posix/Linux semantics for buffered writes require us to implement
short writes.  That is if a single (p)write(v) syscall for say 10MB
only find 512k of space it should write those instead of failing
with ENOSPC.  The XFS_ZR_GREEDY implements that by backing down to
what we can allocate (and the current implementation for that is
a little ugly, I plan to find some time for changes to the core
percpu_counters to improve this after the code is merged).


  reply	other threads:[~2024-12-15  5:31 UTC|newest]

Thread overview: 143+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2024-12-11  8:54 RFC: support for zoned devices Christoph Hellwig
2024-12-11  8:54 ` [PATCH 01/43] xfs: constify feature checks Christoph Hellwig
2024-12-12 20:44   ` Darrick J. Wong
2024-12-11  8:54 ` [PATCH 02/43] xfs: factor out a xfs_rt_check_size helper Christoph Hellwig
2024-12-12 21:11   ` Darrick J. Wong
2024-12-11  8:54 ` [PATCH 03/43] xfs: add a rtg_blocks helper Christoph Hellwig
2024-12-12 21:12   ` Darrick J. Wong
2024-12-13  5:00     ` Christoph Hellwig
2024-12-15 18:10       ` Darrick J. Wong
2024-12-11  8:54 ` [PATCH 04/43] xfs: move xfs_bmapi_reserve_delalloc to xfs_iomap.c Christoph Hellwig
2024-12-12 21:18   ` Darrick J. Wong
2024-12-13  5:04     ` Christoph Hellwig
2024-12-15 18:13       ` Darrick J. Wong
2024-12-11  8:54 ` [PATCH 05/43] xfs: don't take m_sb_lock in xfs_fs_statfs Christoph Hellwig
2024-12-12 21:42   ` Darrick J. Wong
2024-12-13  5:06     ` Christoph Hellwig
2024-12-15 18:16       ` Darrick J. Wong
2024-12-11  8:54 ` [PATCH 06/43] xfs: refactor xfs_fs_statfs Christoph Hellwig
2024-12-12 21:24   ` Darrick J. Wong
2024-12-13  5:08     ` Christoph Hellwig
2024-12-11  8:54 ` [PATCH 07/43] xfs: cleanup xfs_vn_getattr Christoph Hellwig
2024-12-12 21:24   ` Darrick J. Wong
2024-12-11  8:54 ` [PATCH 08/43] xfs: report the correct dio alignment for COW inodes Christoph Hellwig
2024-12-12 21:29   ` Darrick J. Wong
2024-12-13  5:09     ` Christoph Hellwig
2024-12-11  8:54 ` [PATCH 09/43] xfs: generalize the freespace and reserved blocks handling Christoph Hellwig
2024-12-12 21:37   ` Darrick J. Wong
2024-12-13  5:11     ` Christoph Hellwig
2024-12-11  8:54 ` [PATCH 10/43] xfs: preserve RT reservations across remounts Christoph Hellwig
2024-12-12 21:38   ` Darrick J. Wong
2024-12-13  9:15     ` Hans Holmberg
2024-12-15 18:42       ` Darrick J. Wong
2024-12-11  8:54 ` [PATCH 11/43] xfs: skip always_cow inodes in xfs_reflink_trim_around_shared Christoph Hellwig
2024-12-12 21:38   ` Darrick J. Wong
2024-12-13  5:12     ` Christoph Hellwig
2024-12-11  8:54 ` [PATCH 12/43] xfs: refine the unaligned check for always COW inodes in xfs_file_dio_write Christoph Hellwig
2024-12-12 21:44   ` Darrick J. Wong
2024-12-13  5:14     ` Christoph Hellwig
2024-12-13 23:14       ` Darrick J. Wong
2024-12-11  8:54 ` [PATCH 13/43] xfs: support XFS_BMAPI_REMAP in xfs_bmap_del_extent_delay Christoph Hellwig
2024-12-12 21:47   ` [PATCH 13/43] xfs: support XFS_BMAPI_REMAP in xfs_bmap_del_extent_delayOM Darrick J. Wong
2024-12-13  5:14     ` Christoph Hellwig
2024-12-11  8:54 ` [PATCH 14/43] xfs: add a xfs_rtrmap_first_unwritten_rgbno helper Christoph Hellwig
2024-12-12 21:48   ` Darrick J. Wong
2024-12-13  5:16     ` Christoph Hellwig
2024-12-11  8:54 ` [PATCH 15/43] xfs: define the zoned on-disk format Christoph Hellwig
2024-12-12 22:02   ` Darrick J. Wong
2024-12-13  5:22     ` Christoph Hellwig
2024-12-13 17:09       ` Darrick J. Wong
2024-12-15  5:20         ` Christoph Hellwig
2024-12-11  8:54 ` [PATCH 16/43] xfs: allow internal RT devices for zoned mode Christoph Hellwig
2024-12-12 22:06   ` Darrick J. Wong
2024-12-11  8:54 ` [PATCH 17/43] xfs: don't allow growfs of the data device with internal RT device Christoph Hellwig
2024-12-12 22:07   ` Darrick J. Wong
2024-12-13  5:22     ` Christoph Hellwig
2024-12-11  8:54 ` [PATCH 18/43] xfs: export zoned geometry via XFS_FSOP_GEOM Christoph Hellwig
2024-12-12 22:09   ` Darrick J. Wong
2024-12-13  5:23     ` Christoph Hellwig
2024-12-11  8:54 ` [PATCH 19/43] xfs: disable sb_frextents for zoned file systems Christoph Hellwig
2024-12-12 22:26   ` Darrick J. Wong
2024-12-13  5:29     ` Christoph Hellwig
2024-12-11  8:54 ` [PATCH 20/43] xfs: disable FITRIM for zoned RT devices Christoph Hellwig
2024-12-12 22:13   ` Darrick J. Wong
2024-12-11  8:54 ` [PATCH 21/43] xfs: don't call xfs_can_free_eofblocks from ->release for zoned inodes Christoph Hellwig
2024-12-12 22:15   ` Darrick J. Wong
2024-12-13  5:28     ` Christoph Hellwig
2024-12-13 17:13       ` Darrick J. Wong
2024-12-13 17:18         ` Christoph Hellwig
2024-12-11  8:54 ` [PATCH 22/43] xfs: skip zoned RT inodes in xfs_inodegc_want_queue_rt_file Christoph Hellwig
2024-12-12 22:15   ` Darrick J. Wong
2024-12-11  8:54 ` [PATCH 23/43] xfs: parse and validate hardware zone information Christoph Hellwig
2024-12-13 17:31   ` Darrick J. Wong
2024-12-15  5:24     ` Christoph Hellwig
2024-12-11  8:54 ` [PATCH 24/43] xfs: add the zoned space allocator Christoph Hellwig
2024-12-13 18:33   ` Darrick J. Wong
2024-12-15  5:27     ` Christoph Hellwig
2024-12-11  8:54 ` [PATCH 25/43] xfs: add support for zoned space reservations Christoph Hellwig
2024-12-13 21:01   ` Darrick J. Wong
2024-12-15  5:31     ` Christoph Hellwig [this message]
2024-12-17 16:59       ` Darrick J. Wong
2024-12-19  5:50         ` Christoph Hellwig
2024-12-19 16:00           ` Darrick J. Wong
2024-12-19 17:36             ` Christoph Hellwig
2024-12-19 17:37               ` Darrick J. Wong
2024-12-11  8:54 ` [PATCH 26/43] xfs: implement zoned garbage collection Christoph Hellwig
2024-12-13 22:18   ` Darrick J. Wong
2024-12-15  5:57     ` Christoph Hellwig
2024-12-17  1:27       ` Darrick J. Wong
2024-12-17  4:06         ` Christoph Hellwig
2024-12-17 17:42           ` Darrick J. Wong
2024-12-18  7:13             ` Christoph Hellwig
2024-12-11  8:54 ` [PATCH 27/43] xfs: implement buffered writes to zoned RT devices Christoph Hellwig
2024-12-13 22:37   ` Darrick J. Wong
2024-12-15  6:12     ` Christoph Hellwig
2024-12-11  8:54 ` [PATCH 28/43] xfs: implement direct " Christoph Hellwig
2024-12-13 22:39   ` Darrick J. Wong
2024-12-11  8:54 ` [PATCH 29/43] xfs: wire up zoned block freeing in xfs_rtextent_free_finish_item Christoph Hellwig
2024-12-13 22:40   ` Darrick J. Wong
2024-12-11  8:54 ` [PATCH 30/43] xfs: hide reserved RT blocks from statfs Christoph Hellwig
2024-12-13 22:43   ` Darrick J. Wong
2024-12-15  6:03     ` Christoph Hellwig
2024-12-11  8:54 ` [PATCH 31/43] xfs: support growfs on zoned file systems Christoph Hellwig
2024-12-13 22:45   ` Darrick J. Wong
2024-12-11  8:54 ` [PATCH 32/43] xfs: allow COW forks on zoned file systems in xchk_bmap Christoph Hellwig
2024-12-13 22:47   ` Darrick J. Wong
2024-12-11  8:54 ` [PATCH 33/43] xfs: support xchk_xref_is_used_rt_space on zoned file systems Christoph Hellwig
2024-12-13 22:49   ` Darrick J. Wong
2024-12-15  6:13     ` Christoph Hellwig
2024-12-17 17:02       ` Darrick J. Wong
2024-12-11  8:54 ` [PATCH 34/43] xfs: support xrep_require_rtext_inuse " Christoph Hellwig
2024-12-13 22:49   ` Darrick J. Wong
2024-12-11  8:55 ` [PATCH 35/43] xfs: enable fsmap reporting for internal RT devices Christoph Hellwig
2024-12-13 23:11   ` Darrick J. Wong
2024-12-15  6:26     ` Christoph Hellwig
2024-12-17 17:06       ` Darrick J. Wong
2024-12-11  8:55 ` [PATCH 36/43] xfs: disable reflink for zoned file systems Christoph Hellwig
2024-12-13 23:12   ` Darrick J. Wong
2024-12-15  6:26     ` Christoph Hellwig
2024-12-17 17:10       ` Darrick J. Wong
2024-12-18  7:09         ` Christoph Hellwig
2024-12-18 18:16           ` Darrick J. Wong
2024-12-11  8:55 ` [PATCH 37/43] xfs: disable rt quotas " Christoph Hellwig
2024-12-13 23:05   ` Darrick J. Wong
2024-12-15  6:21     ` Christoph Hellwig
2024-12-11  8:55 ` [PATCH 38/43] xfs: enable the zoned RT device feature Christoph Hellwig
2024-12-13 22:52   ` Darrick J. Wong
2024-12-15  6:15     ` Christoph Hellwig
2024-12-11  8:55 ` [PATCH 39/43] xfs: support zone gaps Christoph Hellwig
2024-12-13 22:55   ` Darrick J. Wong
2024-12-11  8:55 ` [PATCH 40/43] xfs: add a max_open_zones mount option Christoph Hellwig
2024-12-13 22:57   ` Darrick J. Wong
2024-12-15  6:16     ` Christoph Hellwig
2024-12-17 17:12       ` Darrick J. Wong
2024-12-11  8:55 ` [PATCH 41/43] xfs: support write life time based data placement Christoph Hellwig
2024-12-13 23:00   ` Darrick J. Wong
2024-12-15  6:19     ` Christoph Hellwig
2024-12-17 17:14       ` Darrick J. Wong
2024-12-18  7:10         ` Christoph Hellwig
2024-12-18 18:19           ` Darrick J. Wong
2024-12-11  8:55 ` [PATCH 42/43] xfs: wire up the show_stats super operation Christoph Hellwig
2024-12-13 23:01   ` Darrick J. Wong
2024-12-11  8:55 ` [PATCH 43/43] xfs: export zone stats in /proc/*/mountstats Christoph Hellwig
2024-12-13 23:04   ` Darrick J. Wong

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=20241215053135.GE10051@lst.de \
    --to=hch@lst.de \
    --cc=cem@kernel.org \
    --cc=djwong@kernel.org \
    --cc=hans.holmberg@wdc.com \
    --cc=linux-xfs@vger.kernel.org \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.