public inbox for linux-kernel@vger.kernel.org
 help / color / mirror / Atom feed
From: Tejun Heo <tj@kernel.org>
To: jiangshanlai@gmail.com
Cc: linux-kernel@vger.kernel.org, kernel-team@meta.com,
	Tejun Heo <tj@kernel.org>, Christoph Hellwig <hch@infradead.org>
Subject: [PATCH 22/22] workqueue: Don't implicitly make UNBOUND workqueues w/ @max_active==1 ordered
Date: Thu, 20 Apr 2023 16:50:46 -1000	[thread overview]
Message-ID: <20230421025046.4008499-23-tj@kernel.org> (raw)
In-Reply-To: <20230421025046.4008499-1-tj@kernel.org>

5c0338c68706 ("workqueue: restore WQ_UNBOUND/max_active==1 to be ordered")
automoatically promoted UNBOUND workqueues w/ @max_active==1 to ordered
workqueues because UNBOUND workqueues w/ @max_active==1 used to be the way
to create ordered workqueues and the new NUMA support broke it. These
problems can be subtle and the fact that they can only trigger on NUMA
machines made them even more difficult to debug.

However, overloading the UNBOUND allocation interface this way creates other
issues. It's difficult to tell whether a given workqueue actually needs to
be ordered and users that legitimately want a min concurrency level wq
unexpectedly gets an ordered one instead. With planned UNBOUND workqueue
udpates to improve execution locality and more prevalence of chiplet designs
which can benefit from such improvements, this isn't a state we wanna be in
forever.

There aren't that many UNBOUND w/ @max_active==1 users in the tree and the
preceding patches audited all and converted them to
alloc_ordered_workqueue() as appropriate. This patch removes the implicit
promotion of UNBOUND w/ @max_active==1 workqueues to ordered ones.

Workqueue will also add a debug option to make all unordered UNBOUND
workqueues to use per-cpu pool_workqueues so that these problems can be
surfaced easier on most machines.

Signed-off-by: Tejun Heo <tj@kernel.org>
Cc: Christoph Hellwig <hch@infradead.org>
---
 include/linux/workqueue.h |  4 +---
 kernel/workqueue.c        | 23 ++++-------------------
 2 files changed, 5 insertions(+), 22 deletions(-)

diff --git a/include/linux/workqueue.h b/include/linux/workqueue.h
index ac551b8ee7d9..e547a90f0328 100644
--- a/include/linux/workqueue.h
+++ b/include/linux/workqueue.h
@@ -339,7 +339,6 @@ enum {
 	__WQ_DRAINING		= 1 << 16, /* internal: workqueue is draining */
 	__WQ_ORDERED		= 1 << 17, /* internal: workqueue is ordered */
 	__WQ_LEGACY		= 1 << 18, /* internal: create*_workqueue() */
-	__WQ_ORDERED_EXPLICIT	= 1 << 19, /* internal: alloc_ordered_workqueue() */
 
 	WQ_MAX_ACTIVE		= 512,	  /* I like 512, better ideas? */
 	WQ_MAX_UNBOUND_PER_CPU	= 4,	  /* 4 * #cpus for unbound wq */
@@ -417,8 +416,7 @@ alloc_workqueue(const char *fmt, unsigned int flags, int max_active, ...);
  * Pointer to the allocated workqueue on success, %NULL on failure.
  */
 #define alloc_ordered_workqueue(fmt, flags, args...)			\
-	alloc_workqueue(fmt, WQ_UNBOUND | __WQ_ORDERED |		\
-			__WQ_ORDERED_EXPLICIT | (flags), 1, ##args)
+	alloc_workqueue(fmt, WQ_UNBOUND | __WQ_ORDERED | (flags), 1, ##args)
 
 #define create_workqueue(name)						\
 	alloc_workqueue("%s", __WQ_LEGACY | WQ_MEM_RECLAIM, 1, (name))
diff --git a/kernel/workqueue.c b/kernel/workqueue.c
index b8b541caed48..00bdcc3c5b36 100644
--- a/kernel/workqueue.c
+++ b/kernel/workqueue.c
@@ -4180,12 +4180,8 @@ static int apply_workqueue_attrs_locked(struct workqueue_struct *wq,
 		return -EINVAL;
 
 	/* creating multiple pwqs breaks ordering guarantee */
-	if (!list_empty(&wq->pwqs)) {
-		if (WARN_ON(wq->flags & __WQ_ORDERED_EXPLICIT))
-			return -EINVAL;
-
-		wq->flags &= ~__WQ_ORDERED;
-	}
+	if (WARN_ON(wq->flags & __WQ_ORDERED))
+		return -EINVAL;
 
 	ctx = apply_wqattrs_prepare(wq, attrs, wq_unbound_cpumask);
 	if (!ctx)
@@ -4408,16 +4404,6 @@ struct workqueue_struct *alloc_workqueue(const char *fmt,
 	struct workqueue_struct *wq;
 	struct pool_workqueue *pwq;
 
-	/*
-	 * Unbound && max_active == 1 used to imply ordered, which is no
-	 * longer the case on NUMA machines due to per-node pools.  While
-	 * alloc_ordered_workqueue() is the right way to create an ordered
-	 * workqueue, keep the previous behavior to avoid subtle breakages
-	 * on NUMA.
-	 */
-	if ((flags & WQ_UNBOUND) && max_active == 1)
-		flags |= __WQ_ORDERED;
-
 	/* see the comment above the definition of WQ_POWER_EFFICIENT */
 	if ((flags & WQ_POWER_EFFICIENT) && wq_power_efficient)
 		flags |= WQ_UNBOUND;
@@ -4625,14 +4611,13 @@ void workqueue_set_max_active(struct workqueue_struct *wq, int max_active)
 	struct pool_workqueue *pwq;
 
 	/* disallow meddling with max_active for ordered workqueues */
-	if (WARN_ON(wq->flags & __WQ_ORDERED_EXPLICIT))
+	if (WARN_ON(wq->flags & __WQ_ORDERED))
 		return;
 
 	max_active = wq_clamp_max_active(max_active, wq->flags, wq->name);
 
 	mutex_lock(&wq->mutex);
 
-	wq->flags &= ~__WQ_ORDERED;
 	wq->saved_max_active = max_active;
 
 	for_each_pwq(pwq, wq)
@@ -5868,7 +5853,7 @@ int workqueue_sysfs_register(struct workqueue_struct *wq)
 	 * attributes breaks ordering guarantee.  Disallow exposing ordered
 	 * workqueues.
 	 */
-	if (WARN_ON(wq->flags & __WQ_ORDERED_EXPLICIT))
+	if (WARN_ON(wq->flags & __WQ_ORDERED))
 		return -EINVAL;
 
 	wq->wq_dev = wq_dev = kzalloc(sizeof(*wq_dev), GFP_KERNEL);
-- 
2.40.0


  parent reply	other threads:[~2023-04-21  2:52 UTC|newest]

Thread overview: 61+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2023-04-21  2:50 [PATCHSET wq/for-6.5] workqueue: Ordered workqueue creation cleanup Tejun Heo
2023-04-21  2:50 ` [PATCH 01/22] powerpc, workqueue: Use alloc_ordered_workqueue() to create ordered workqueues Tejun Heo
2023-04-21  5:20   ` Michael Ellerman
2023-05-08 23:55   ` Tejun Heo
2023-04-21  2:50 ` [PATCH 02/22] greybus: " Tejun Heo
2023-04-21  5:03   ` Greg Kroah-Hartman
2023-04-21  6:29   ` Johan Hovold
2023-04-21  8:15   ` Alex Elder
2023-05-08 23:55   ` Tejun Heo
2023-04-21  2:50 ` [PATCH 03/22] IB/hfi1: " Tejun Heo
2023-04-21 14:02   ` Dennis Dalessandro
2023-05-08 23:56     ` Tejun Heo
2023-05-09  1:35       ` Tejun Heo
2023-04-21  2:50 ` [PATCH 04/22] dm integrity: " Tejun Heo
2023-04-21  2:50 ` [PATCH 05/22] media: amphion: " Tejun Heo
2023-04-21  2:50 ` [PATCH 06/22] net: thunderx: " Tejun Heo
2023-04-21  6:19   ` [EXT] " Sunil Kovvuri Goutham
2023-04-21 14:01   ` Jakub Kicinski
2023-04-21 14:13     ` Tejun Heo
2023-04-21 14:28       ` Jakub Kicinski
2023-05-08 23:57   ` Tejun Heo
2023-04-21  2:50 ` [PATCH 07/22] net: octeontx2: " Tejun Heo
2023-04-21  6:16   ` [EXT] " Sunil Kovvuri Goutham
2023-05-08 23:56   ` Tejun Heo
2023-04-21  2:50 ` [PATCH 08/22] wifi: ath10/11/12k: " Tejun Heo
2023-04-21  2:50 ` [PATCH 09/22] wifi: iwlwifi: " Tejun Heo
2023-04-24 17:31   ` Johannes Berg
2023-05-05 22:52     ` [PATCH] wifi: iwlwifi: Use default @max_active for trans_pcie->rba.alloc_wq Tejun Heo
2023-05-08 16:05       ` Johannes Berg
2023-05-08 23:59   ` [PATCH 09/22] wifi: iwlwifi: Use alloc_ordered_workqueue() to create ordered workqueues Tejun Heo
2023-05-09 15:25     ` Tejun Heo
2023-04-21  2:50 ` [PATCH 10/22] wifi: mwifiex: " Tejun Heo
2023-04-25 18:14   ` Brian Norris
2023-05-05 22:53     ` [PATCH] wifi: mwifiex: Use default @max_active for workqueues Tejun Heo
2023-04-21  2:50 ` [PATCH 11/22] net: wwan: t7xx: Use alloc_ordered_workqueue() to create ordered workqueues Tejun Heo
2023-04-21  2:50 ` [PATCH 12/22] scsi: " Tejun Heo
2023-04-21 14:23   ` Benjamin Block
2023-05-08 23:57   ` Tejun Heo
2023-05-09  1:22     ` Tejun Heo
2023-04-21  2:50 ` [PATCH 13/22] virt: acrn: " Tejun Heo
2023-04-21  5:03   ` Greg Kroah-Hartman
2023-05-08 23:58   ` Tejun Heo
2023-04-21  2:50 ` [PATCH 14/22] soc: qcom: qmi: " Tejun Heo
2023-04-21  2:50 ` [PATCH 15/22] xen/pvcalls: " Tejun Heo
2023-05-08 11:58   ` Juergen Gross
2023-05-08 23:59   ` Tejun Heo
2023-04-21  2:50 ` [PATCH 16/22] btrfs: " Tejun Heo
2023-04-30  4:40   ` Wang Yugui
2023-05-05 22:58     ` [PATCH v2 " Tejun Heo
2023-05-06  1:40       ` Wang Yugui
2023-05-08 23:50         ` Tejun Heo
2023-04-21  2:50 ` [PATCH 17/22] cifs: " Tejun Heo
2023-04-21 18:38   ` Paulo Alcantara
2023-05-08 23:58     ` Tejun Heo
2023-04-21  2:50 ` [PATCH 18/22] net: qrtr: " Tejun Heo
2023-04-21  2:50 ` [PATCH 19/22] rxrpc: " Tejun Heo
2023-04-21  2:50 ` [PATCH 20/22] crypto: octeontx2: " Tejun Heo
2023-04-21  2:50 ` [PATCH 21/22] media: coda: " Tejun Heo
2023-04-21  2:50 ` Tejun Heo [this message]
2023-04-24  5:03   ` [PATCH 22/22] workqueue: Don't implicitly make UNBOUND workqueues w/ @max_active==1 ordered Christoph Hellwig
2023-05-25  4:54 ` (subset) [PATCHSET wq/for-6.5] workqueue: Ordered workqueue creation cleanup Bjorn Andersson

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=20230421025046.4008499-23-tj@kernel.org \
    --to=tj@kernel.org \
    --cc=hch@infradead.org \
    --cc=jiangshanlai@gmail.com \
    --cc=kernel-team@meta.com \
    --cc=linux-kernel@vger.kernel.org \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox