From: "Darrick J. Wong" <djwong@kernel.org>
To: Dave Chinner <david@fromorbit.com>
Cc: linux-xfs@vger.kernel.org
Subject: Re: [PATCH 20/45] xfs: only CIL pushes require a start record
Date: Mon, 8 Mar 2021 16:07:20 -0800 [thread overview]
Message-ID: <20210309000720.GG3419940@magnolia> (raw)
In-Reply-To: <20210305051143.182133-21-david@fromorbit.com>
On Fri, Mar 05, 2021 at 04:11:18PM +1100, Dave Chinner wrote:
> From: Dave Chinner <dchinner@redhat.com>
>
> So move the one-off start record writing in xlog_write() out into
> the static header that the CIL push builds to write into the log
> initially. This simplifes the xlog_write() logic a lot.
>
> pahole on x86-64 confirms that the xlog_cil_trans_hdr is correctly
> 32 bit aligned and packed for copying the log op and transaction
> headers directly into the log as a single log region copy.
>
> struct xlog_cil_trans_hdr {
> struct xlog_op_header oph[2]; /* 0 24 */
> struct xfs_trans_header thdr; /* 24 16 */
> struct xfs_log_iovec lhdr; /* 40 16 */
>
> /* size: 56, cachelines: 1, members: 3 */
> /* last cacheline: 56 bytes */
> };
>
> A wart is needed to handle the fact that length of the region the
> opheader points to doesn't include the opheader length. hence if
> we embed the opheader, we have to substract the opheader length from
> the length written into the opheader by the generic copying code.
> This will eventually go away when everything is converted to
> embedded opheaders.
>
> Signed-off-by: Dave Chinner <dchinner@redhat.com>
Looks ... ugly, but looking forward a few patches you're clearly getting
ready to refactor a bunch of grody 4-indent code so...
Reviewed-by: Darrick J. Wong <djwong@kernel.org>
--D
> ---
> fs/xfs/xfs_log.c | 90 ++++++++++++++++++++++----------------------
> fs/xfs/xfs_log_cil.c | 44 ++++++++++++++++++----
> 2 files changed, 81 insertions(+), 53 deletions(-)
>
> diff --git a/fs/xfs/xfs_log.c b/fs/xfs/xfs_log.c
> index f54d48f4584e..b2f9fb1b4fed 100644
> --- a/fs/xfs/xfs_log.c
> +++ b/fs/xfs/xfs_log.c
> @@ -2106,9 +2106,9 @@ xlog_print_trans(
> }
>
> /*
> - * Calculate the potential space needed by the log vector. We may need a start
> - * record, and each region gets its own struct xlog_op_header and may need to be
> - * double word aligned.
> + * Calculate the potential space needed by the log vector. If this is a start
> + * transaction, the caller has already accounted for both opheaders in the start
> + * transaction, so we don't need to account for them here.
> */
> static int
> xlog_write_calc_vec_length(
> @@ -2121,9 +2121,6 @@ xlog_write_calc_vec_length(
> int len = 0;
> int i;
>
> - if (optype & XLOG_START_TRANS)
> - headers++;
> -
> for (lv = log_vector; lv; lv = lv->lv_next) {
> /* we don't write ordered log vectors */
> if (lv->lv_buf_len == XFS_LOG_VEC_ORDERED)
> @@ -2139,24 +2136,20 @@ xlog_write_calc_vec_length(
> }
> }
>
> + /* Don't account for regions with embedded ophdrs */
> + if (optype && headers > 0) {
> + if (optype & XLOG_START_TRANS) {
> + ASSERT(headers >= 2);
> + headers -= 2;
> + }
> + }
> +
> ticket->t_res_num_ophdrs += headers;
> len += headers * sizeof(struct xlog_op_header);
>
> return len;
> }
>
> -static void
> -xlog_write_start_rec(
> - struct xlog_op_header *ophdr,
> - struct xlog_ticket *ticket)
> -{
> - ophdr->oh_tid = cpu_to_be32(ticket->t_tid);
> - ophdr->oh_clientid = ticket->t_clientid;
> - ophdr->oh_len = 0;
> - ophdr->oh_flags = XLOG_START_TRANS;
> - ophdr->oh_res2 = 0;
> -}
> -
> static xlog_op_header_t *
> xlog_write_setup_ophdr(
> struct xlog *log,
> @@ -2361,9 +2354,11 @@ xlog_write(
> * If this is a commit or unmount transaction, we don't need a start
> * record to be written. We do, however, have to account for the
> * commit or unmount header that gets written. Hence we always have
> - * to account for an extra xlog_op_header here.
> + * to account for an extra xlog_op_header here for commit and unmount
> + * records.
> */
> - ticket->t_curr_res -= sizeof(struct xlog_op_header);
> + if (optype & (XLOG_COMMIT_TRANS | XLOG_UNMOUNT_TRANS))
> + ticket->t_curr_res -= sizeof(struct xlog_op_header);
> if (ticket->t_curr_res < 0) {
> xfs_alert_tag(log->l_mp, XFS_PTAG_LOGRES,
> "ctx ticket reservation ran out. Need to up reservation");
> @@ -2411,7 +2406,7 @@ xlog_write(
> int copy_len;
> int copy_off;
> bool ordered = false;
> - bool wrote_start_rec = false;
> + bool added_ophdr = false;
>
> /* ordered log vectors have no regions to write */
> if (lv->lv_buf_len == XFS_LOG_VEC_ORDERED) {
> @@ -2425,25 +2420,24 @@ xlog_write(
> ASSERT((unsigned long)ptr % sizeof(int32_t) == 0);
>
> /*
> - * Before we start formatting log vectors, we need to
> - * write a start record. Only do this for the first
> - * iclog we write to.
> + * The XLOG_START_TRANS has embedded ophdrs for the
> + * start record and transaction header. They will always
> + * be the first two regions in the lv chain.
> */
> if (optype & XLOG_START_TRANS) {
> - xlog_write_start_rec(ptr, ticket);
> - xlog_write_adv_cnt(&ptr, &len, &log_offset,
> - sizeof(struct xlog_op_header));
> - optype &= ~XLOG_START_TRANS;
> - wrote_start_rec = true;
> - }
> -
> - ophdr = xlog_write_setup_ophdr(log, ptr, ticket, optype);
> - if (!ophdr)
> - return -EIO;
> + ophdr = reg->i_addr;
> + if (index)
> + optype &= ~XLOG_START_TRANS;
> + } else {
> + ophdr = xlog_write_setup_ophdr(log, ptr,
> + ticket, optype);
> + if (!ophdr)
> + return -EIO;
>
> - xlog_write_adv_cnt(&ptr, &len, &log_offset,
> + xlog_write_adv_cnt(&ptr, &len, &log_offset,
> sizeof(struct xlog_op_header));
> -
> + added_ophdr = true;
> + }
> len += xlog_write_setup_copy(ticket, ophdr,
> iclog->ic_size-log_offset,
> reg->i_len,
> @@ -2452,13 +2446,22 @@ xlog_write(
> &partial_copy_len);
> xlog_verify_dest_ptr(log, ptr);
>
> +
> + /*
> + * Wart: need to update length in embedded ophdr not
> + * to include it's own length.
> + */
> + if (!added_ophdr) {
> + ophdr->oh_len = cpu_to_be32(copy_len -
> + sizeof(struct xlog_op_header));
> + }
> /*
> * Copy region.
> *
> - * Unmount records just log an opheader, so can have
> - * empty payloads with no data region to copy. Hence we
> - * only copy the payload if the vector says it has data
> - * to copy.
> + * Commit and unmount records just log an opheader, so
> + * we can have empty payloads with no data region to
> + * copy. Hence we only copy the payload if the vector
> + * says it has data to copy.
> */
> ASSERT(copy_len >= 0);
> if (copy_len > 0) {
> @@ -2466,12 +2469,9 @@ xlog_write(
> xlog_write_adv_cnt(&ptr, &len, &log_offset,
> copy_len);
> }
> - copy_len += sizeof(struct xlog_op_header);
> - record_cnt++;
> - if (wrote_start_rec) {
> + if (added_ophdr)
> copy_len += sizeof(struct xlog_op_header);
> - record_cnt++;
> - }
> + record_cnt++;
> data_cnt += contwr ? copy_len : 0;
>
> error = xlog_write_copy_finish(log, iclog, optype,
> diff --git a/fs/xfs/xfs_log_cil.c b/fs/xfs/xfs_log_cil.c
> index b515002e7959..e9da074ecd69 100644
> --- a/fs/xfs/xfs_log_cil.c
> +++ b/fs/xfs/xfs_log_cil.c
> @@ -652,14 +652,22 @@ xlog_cil_process_committed(
> }
>
> struct xlog_cil_trans_hdr {
> + struct xlog_op_header oph[2];
> struct xfs_trans_header thdr;
> - struct xfs_log_iovec lhdr;
> + struct xfs_log_iovec lhdr[2];
> };
>
> /*
> * Build a checkpoint transaction header to begin the journal transaction. We
> * need to account for the space used by the transaction header here as it is
> * not accounted for in xlog_write().
> + *
> + * This is the only place we write a transaction header, so we also build the
> + * log opheaders that indicate the start of a log transaction and wrap the
> + * transaction header. We keep the start record in it's own log vector rather
> + * than compacting them into a single region as this ends up making the logic
> + * in xlog_write() for handling empty opheaders for start, commit and unmount
> + * records much simpler.
> */
> static void
> xlog_cil_build_trans_hdr(
> @@ -669,20 +677,40 @@ xlog_cil_build_trans_hdr(
> int num_iovecs)
> {
> struct xlog_ticket *tic = ctx->ticket;
> + uint32_t tid = cpu_to_be32(tic->t_tid);
>
> memset(hdr, 0, sizeof(*hdr));
>
> + /* Log start record */
> + hdr->oph[0].oh_tid = tid;
> + hdr->oph[0].oh_clientid = XFS_TRANSACTION;
> + hdr->oph[0].oh_flags = XLOG_START_TRANS;
> +
> + /* log iovec region pointer */
> + hdr->lhdr[0].i_addr = &hdr->oph[0];
> + hdr->lhdr[0].i_len = sizeof(struct xlog_op_header);
> + hdr->lhdr[0].i_type = XLOG_REG_TYPE_LRHEADER;
> +
> + /* log opheader */
> + hdr->oph[1].oh_tid = tid;
> + hdr->oph[1].oh_clientid = XFS_TRANSACTION;
> +
> + /* transaction header */
> hdr->thdr.th_magic = XFS_TRANS_HEADER_MAGIC;
> hdr->thdr.th_type = XFS_TRANS_CHECKPOINT;
> - hdr->thdr.th_tid = tic->t_tid;
> + hdr->thdr.th_tid = tid;
> hdr->thdr.th_num_items = num_iovecs;
> - hdr->lhdr.i_addr = &hdr->thdr;
> - hdr->lhdr.i_len = sizeof(xfs_trans_header_t);
> - hdr->lhdr.i_type = XLOG_REG_TYPE_TRANSHDR;
> - tic->t_curr_res -= hdr->lhdr.i_len + sizeof(xlog_op_header_t);
>
> - lvhdr->lv_niovecs = 1;
> - lvhdr->lv_iovecp = &hdr->lhdr;
> + /* log iovec region pointer */
> + hdr->lhdr[1].i_addr = &hdr->oph[1];
> + hdr->lhdr[1].i_len = sizeof(struct xlog_op_header) +
> + sizeof(struct xfs_trans_header);
> + hdr->lhdr[1].i_type = XLOG_REG_TYPE_TRANSHDR;
> +
> + tic->t_curr_res -= hdr->lhdr[0].i_len + hdr->lhdr[1].i_len;
> +
> + lvhdr->lv_niovecs = 2;
> + lvhdr->lv_iovecp = &hdr->lhdr[0];
> lvhdr->lv_next = ctx->lv_chain;
> }
>
> --
> 2.28.0
>
next prev parent reply other threads:[~2021-03-09 0:08 UTC|newest]
Thread overview: 145+ messages / expand[flat|nested] mbox.gz Atom feed top
2021-03-05 5:10 [PATCH 00/45 v3] xfs: consolidated log and optimisation changes Dave Chinner
2021-03-05 5:10 ` [PATCH 01/45] xfs: initialise attr fork on inode create Dave Chinner
2021-03-08 22:20 ` Darrick J. Wong
2021-03-16 8:35 ` Christoph Hellwig
2021-03-05 5:11 ` [PATCH 02/45] xfs: log stripe roundoff is a property of the log Dave Chinner
2021-03-05 5:11 ` [PATCH 03/45] xfs: separate CIL commit record IO Dave Chinner
2021-03-08 8:34 ` Chandan Babu R
2021-03-15 14:40 ` Brian Foster
2021-03-16 8:40 ` Christoph Hellwig
2021-03-05 5:11 ` [PATCH 04/45] xfs: remove xfs_blkdev_issue_flush Dave Chinner
2021-03-08 9:31 ` Chandan Babu R
2021-03-08 22:21 ` Darrick J. Wong
2021-03-15 14:40 ` Brian Foster
2021-03-16 8:41 ` Christoph Hellwig
2021-03-05 5:11 ` [PATCH 05/45] xfs: async blkdev cache flush Dave Chinner
2021-03-08 9:48 ` Chandan Babu R
2021-03-08 22:24 ` Darrick J. Wong
2021-03-15 14:41 ` Brian Foster
2021-03-15 16:32 ` Darrick J. Wong
2021-03-16 8:43 ` Christoph Hellwig
2021-03-08 22:26 ` Darrick J. Wong
2021-03-15 14:42 ` Brian Foster
2021-03-05 5:11 ` [PATCH 06/45] xfs: CIL checkpoint flushes caches unconditionally Dave Chinner
2021-03-15 14:43 ` Brian Foster
2021-03-16 8:47 ` Christoph Hellwig
2021-03-05 5:11 ` [PATCH 07/45] xfs: remove need_start_rec parameter from xlog_write() Dave Chinner
2021-03-15 14:45 ` Brian Foster
2021-03-16 14:15 ` Christoph Hellwig
2021-03-05 5:11 ` [PATCH 08/45] xfs: journal IO cache flush reductions Dave Chinner
2021-03-08 10:49 ` Chandan Babu R
2021-03-08 12:25 ` Brian Foster
2021-03-09 1:13 ` Dave Chinner
2021-03-10 20:49 ` Brian Foster
2021-03-10 21:28 ` Dave Chinner
2021-03-05 5:11 ` [PATCH 09/45] xfs: Fix CIL throttle hang when CIL space used going backwards Dave Chinner
2021-03-05 5:11 ` [PATCH 10/45] xfs: reduce buffer log item shadow allocations Dave Chinner
2021-03-15 14:52 ` Brian Foster
2021-03-05 5:11 ` [PATCH 11/45] xfs: xfs_buf_item_size_segment() needs to pass segment offset Dave Chinner
2021-03-05 5:11 ` [PATCH 12/45] xfs: optimise xfs_buf_item_size/format for contiguous regions Dave Chinner
2021-03-05 5:11 ` [PATCH 13/45] xfs: xfs_log_force_lsn isn't passed a LSN Dave Chinner
2021-03-08 22:53 ` Darrick J. Wong
2021-03-11 0:26 ` Dave Chinner
2021-03-05 5:11 ` [PATCH 14/45] xfs: AIL needs asynchronous CIL forcing Dave Chinner
2021-03-08 23:45 ` Darrick J. Wong
2021-03-05 5:11 ` [PATCH 15/45] xfs: CIL work is serialised, not pipelined Dave Chinner
2021-03-08 23:14 ` Darrick J. Wong
2021-03-08 23:38 ` Dave Chinner
2021-03-09 1:55 ` Darrick J. Wong
2021-03-09 22:35 ` Andi Kleen
2021-03-10 6:11 ` Dave Chinner
2021-03-05 5:11 ` [PATCH 16/45] xfs: type verification is expensive Dave Chinner
2021-03-05 5:11 ` [PATCH 17/45] xfs: No need for inode number error injection in __xfs_dir3_data_check Dave Chinner
2021-03-05 5:11 ` [PATCH 18/45] xfs: reduce debug overhead of dir leaf/node checks Dave Chinner
2021-03-05 5:11 ` [PATCH 19/45] xfs: factor out the CIL transaction header building Dave Chinner
2021-03-08 23:47 ` Darrick J. Wong
2021-03-16 14:50 ` Brian Foster
2021-03-05 5:11 ` [PATCH 20/45] xfs: only CIL pushes require a start record Dave Chinner
2021-03-09 0:07 ` Darrick J. Wong [this message]
2021-03-16 14:51 ` Brian Foster
2021-03-05 5:11 ` [PATCH 21/45] xfs: embed the xlog_op_header in the unmount record Dave Chinner
2021-03-09 0:15 ` Darrick J. Wong
2021-03-11 2:54 ` Dave Chinner
2021-03-05 5:11 ` [PATCH 22/45] xfs: embed the xlog_op_header in the commit record Dave Chinner
2021-03-09 0:17 ` Darrick J. Wong
2021-03-05 5:11 ` [PATCH 23/45] xfs: log tickets don't need log client id Dave Chinner
2021-03-09 0:21 ` Darrick J. Wong
2021-03-09 1:19 ` Dave Chinner
2021-03-09 1:48 ` Darrick J. Wong
2021-03-11 3:01 ` Dave Chinner
2021-03-16 14:51 ` Brian Foster
2021-03-05 5:11 ` [PATCH 24/45] xfs: move log iovec alignment to preparation function Dave Chinner
2021-03-09 2:14 ` Darrick J. Wong
2021-03-16 14:51 ` Brian Foster
2021-03-05 5:11 ` [PATCH 25/45] xfs: reserve space and initialise xlog_op_header in item formatting Dave Chinner
2021-03-09 2:21 ` Darrick J. Wong
2021-03-11 3:29 ` Dave Chinner
2021-03-11 3:41 ` Darrick J. Wong
2021-03-16 14:54 ` Brian Foster
2021-03-16 14:53 ` Brian Foster
2021-05-19 3:18 ` Dave Chinner
2021-03-05 5:11 ` [PATCH 26/45] xfs: log ticket region debug is largely useless Dave Chinner
2021-03-09 2:31 ` Darrick J. Wong
2021-03-16 14:55 ` Brian Foster
2021-05-19 3:27 ` Dave Chinner
2021-03-05 5:11 ` [PATCH 27/45] xfs: pass lv chain length into xlog_write() Dave Chinner
2021-03-09 2:36 ` Darrick J. Wong
2021-03-11 3:37 ` Dave Chinner
2021-03-16 18:38 ` Brian Foster
2021-03-05 5:11 ` [PATCH 28/45] xfs: introduce xlog_write_single() Dave Chinner
2021-03-09 2:39 ` Darrick J. Wong
2021-03-11 4:19 ` Dave Chinner
2021-03-16 18:39 ` Brian Foster
2021-05-19 3:44 ` Dave Chinner
2021-03-05 5:11 ` [PATCH 29/45] xfs:_introduce xlog_write_partial() Dave Chinner
2021-03-09 2:59 ` Darrick J. Wong
2021-03-11 4:33 ` Dave Chinner
2021-03-18 13:22 ` Brian Foster
2021-05-19 4:49 ` Dave Chinner
2021-05-20 12:33 ` Brian Foster
2021-05-27 18:03 ` Darrick J. Wong
2021-03-05 5:11 ` [PATCH 30/45] xfs: xlog_write() no longer needs contwr state Dave Chinner
2021-03-09 3:01 ` Darrick J. Wong
2021-03-05 5:11 ` [PATCH 31/45] xfs: CIL context doesn't need to count iovecs Dave Chinner
2021-03-09 3:16 ` Darrick J. Wong
2021-03-11 5:03 ` Dave Chinner
2021-03-05 5:11 ` [PATCH 32/45] xfs: use the CIL space used counter for emptiness checks Dave Chinner
2021-03-10 23:01 ` Darrick J. Wong
2021-03-05 5:11 ` [PATCH 33/45] xfs: lift init CIL reservation out of xc_cil_lock Dave Chinner
2021-03-10 23:25 ` Darrick J. Wong
2021-03-11 5:42 ` Dave Chinner
2021-03-05 5:11 ` [PATCH 34/45] xfs: rework per-iclog header CIL reservation Dave Chinner
2021-03-11 0:03 ` Darrick J. Wong
2021-03-11 6:03 ` Dave Chinner
2021-03-05 5:11 ` [PATCH 35/45] xfs: introduce per-cpu CIL tracking sructure Dave Chinner
2021-03-11 0:11 ` Darrick J. Wong
2021-03-11 6:33 ` Dave Chinner
2021-03-11 6:42 ` Dave Chinner
2021-03-05 5:11 ` [PATCH 36/45] xfs: implement percpu cil space used calculation Dave Chinner
2021-03-11 0:20 ` Darrick J. Wong
2021-03-11 6:51 ` Dave Chinner
2021-03-05 5:11 ` [PATCH 37/45] xfs: track CIL ticket reservation in percpu structure Dave Chinner
2021-03-11 0:26 ` Darrick J. Wong
2021-03-12 0:47 ` Dave Chinner
2021-03-05 5:11 ` [PATCH 38/45] xfs: convert CIL busy extents to per-cpu Dave Chinner
2021-03-11 0:36 ` Darrick J. Wong
2021-03-12 1:15 ` Dave Chinner
2021-03-05 5:11 ` [PATCH 39/45] xfs: Add order IDs to log items in CIL Dave Chinner
2021-03-11 1:00 ` Darrick J. Wong
2021-03-05 5:11 ` [PATCH 40/45] xfs: convert CIL to unordered per cpu lists Dave Chinner
2021-03-11 1:15 ` Darrick J. Wong
2021-03-12 2:18 ` Dave Chinner
2021-03-05 5:11 ` [PATCH 41/45] xfs: move CIL ordering to the logvec chain Dave Chinner
2021-03-11 1:34 ` Darrick J. Wong
2021-03-12 2:29 ` Dave Chinner
2021-03-05 5:11 ` [PATCH 42/45] xfs: __percpu_counter_compare() inode count debug too expensive Dave Chinner
2021-03-11 1:36 ` Darrick J. Wong
2021-03-05 5:11 ` [PATCH 43/45] xfs: avoid cil push lock if possible Dave Chinner
2021-03-11 1:47 ` Darrick J. Wong
2021-03-12 2:36 ` Dave Chinner
2021-03-05 5:11 ` [PATCH 44/45] xfs: xlog_sync() manually adjusts grant head space Dave Chinner
2021-03-11 2:00 ` Darrick J. Wong
2021-03-16 3:04 ` Dave Chinner
2021-03-05 5:11 ` [PATCH 45/45] xfs: expanding delayed logging design with background material Dave Chinner
2021-03-11 2:30 ` Darrick J. Wong
2021-03-16 3:28 ` Dave Chinner
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=20210309000720.GG3419940@magnolia \
--to=djwong@kernel.org \
--cc=david@fromorbit.com \
--cc=linux-xfs@vger.kernel.org \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox