linux-kernel.vger.kernel.org archive mirror
From: Oleg Nesterov <oleg@redhat.com>
To: Linus Torvalds <torvalds@linux-foundation.org>
Cc: Alan Cox <alan@lxorguk.ukuu.org.uk>,
	Paul Fulghum <paulkf@microgate.com>,
	Boyan <btanastasov@yahoo.co.uk>,
	"Rafael J. Wysocki" <rjw@sisk.pl>,
	Linux Kernel Mailing List <linux-kernel@vger.kernel.org>,
	Kernel Testers List <kernel-testers@vger.kernel.org>,
	Dmitry Torokhov <dmitry.torokhov@gmail.com>,
	Ed Tomlinson <edt@aei.ca>,
	OGAWA Hirofumi <hirofumi@mail.parknet.co.jp>
Subject: Re: [Bug #14388] keyboard under X with 2.6.31
Date: Thu, 15 Oct 2009 17:29:59 +0200
Message-ID: <20091015152959.GA18681@redhat.com>
In-Reply-To: <20091015124730.GA9398@redhat.com>

On 10/15, Oleg Nesterov wrote:
>
> But, this can race with cpu_down(). I think this is solvable but needs
> more locking. I mean, the caller of queue_work_xxx() must not use the old
> get_wq_data(work) if this CPU is already dead, but a simple cpu_online()
> is not enough, we can race with workqueue_cpu_callback(CPU_POST_DEAD)
> flushing this cwq, in this case we should carefully insert this work
> into the almost-dead queue.
>
> Or, perhaps better, instead of new helper, we can probably use the free
> bit in work_struct->data to mark this work/dwork as "single-instance-work".
> In this case __queue_work and queue_delayed_work_on should check this bit.

Actually, this looks simple. Please see the patch below.

Of course, the horror in __queue_work() should be cleaned up somehow.
The change to queue_delayed_work_on() probably needs a separate patch.


All, what do you think? Do we need this?

Oleg.

If a work_struct/delayed_work has the WORK_STRUCT_XXX bit set, it can never
race with itself: at most one instance of it runs at any time.

Note: queue_work_on() and queue_delayed_work_on() must not be used on a
work for which work_xxx() is true.

Also, we can optimize flush/cancel operations to not scan all CPUs if this
work is "singlethreaded".

PROBLEM: a work_xxx() work can block cpu_down() if it constantly re-queues
itself; hopefully we don't have such stupid users.
---
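For reference, the new bit shares work_struct->data with the cwq pointer:
the low two bits are flags, the rest is the pointer. Below is a minimal
userspace sketch of that packing, not kernel code. The masks and bit numbers
are the ones from the patch; cwq_model, work_model and the *_model helpers
are hypothetical names made up for illustration, and the real kernel does
this with atomic_long_t, which the sketch leaves out.

```c
#include <assert.h>
#include <stdbool.h>
#include <stddef.h>
#include <stdint.h>

/* Same bit numbers and masks as in the patch. */
#define WORK_STRUCT_PENDING		0
#define WORK_STRUCT_XXX			1
#define WORK_STRUCT_FLAG_MASK		((uintptr_t)3)
#define WORK_STRUCT_WQ_DATA_MASK	(~WORK_STRUCT_FLAG_MASK)

/* Hypothetical stand-ins for cpu_workqueue_struct and work_struct. */
struct cwq_model { int cpu; };
struct work_model { uintptr_t data; };

static bool test_flag(const struct work_model *w, int bit)
{
	return (w->data >> bit) & 1;
}

static void set_flag(struct work_model *w, int bit)
{
	w->data |= (uintptr_t)1 << bit;
}

/* Store the cwq pointer and preserve whatever flag bits are already set. */
static void set_wq_data_model(struct work_model *w, struct cwq_model *cwq)
{
	uintptr_t ptr = (uintptr_t)cwq;

	/* The pointer must be 4-byte aligned to leave two flag bits free. */
	assert((ptr & WORK_STRUCT_FLAG_MASK) == 0);
	w->data = ptr | (w->data & WORK_STRUCT_FLAG_MASK);
}

/* Mask the flag bits off to recover the pointer (NULL if never queued). */
static struct cwq_model *get_wq_data_model(const struct work_model *w)
{
	return (struct cwq_model *)(w->data & WORK_STRUCT_WQ_DATA_MASK);
}
```

Because the pointer bits and the flag bits never overlap, storing a new cwq
leaves the XXX bit in place, and a work that was never queued reads back as
NULL from get_wq_data_model().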

--- TTT_32/include/linux/workqueue.h~WORK_XXX	2009-09-23 21:12:03.000000000 +0200
+++ TTT_32/include/linux/workqueue.h	2009-10-15 16:49:25.000000000 +0200
@@ -24,7 +24,8 @@ typedef void (*work_func_t)(struct work_
 
 struct work_struct {
 	atomic_long_t data;
-#define WORK_STRUCT_PENDING 0		/* T if work item pending execution */
+#define WORK_STRUCT_PENDING	0	/* T if work item pending execution */
+#define WORK_STRUCT_XXX		1	/* deny multiple running instances */
 #define WORK_STRUCT_FLAG_MASK (3UL)
 #define WORK_STRUCT_WQ_DATA_MASK (~WORK_STRUCT_FLAG_MASK)
 	struct list_head entry;
@@ -148,6 +149,9 @@ struct execute_work {
 #define work_pending(work) \
 	test_bit(WORK_STRUCT_PENDING, work_data_bits(work))
 
+#define work_xxx(work) \
+	test_bit(WORK_STRUCT_XXX, work_data_bits(work))
+
 /**
  * delayed_work_pending - Find out whether a delayable work item is currently
  * pending
--- TTT_32/kernel/workqueue.c~WORK_XXX	2009-09-12 21:40:11.000000000 +0200
+++ TTT_32/kernel/workqueue.c	2009-10-15 17:09:51.000000000 +0200
@@ -145,6 +145,35 @@ static void __queue_work(struct cpu_work
 {
 	unsigned long flags;
 
+	if (work_xxx(work)) {
+		struct cpu_workqueue_struct *old = get_wq_data(work);
+		bool done = false;
+
+		if (!old)
+			goto fallback;
+
+		// This lockless check is racy. We should either remove it
+		// or add smp_mb__before_clear_bit() into run_workqueue().
+		if (old->current_work != work)
+			goto fallback;
+
+		// OK, we should keep this old cwq. But its CPU can be dead,
+		// we have to recheck under old->lock
+		spin_lock_irqsave(&old->lock, flags);
+		if (old->current_work == work) {
+			// It is still running, queue the work here.
+			// even if this CPU is dead, run_workqueue()
+			// can't return without noticing this work
+			insert_work(old, work, &old->worklist);
+			done = true;
+		}
+		spin_unlock_irqrestore(&old->lock, flags);
+
+		if (done)
+			return;
+	}
+
+fallback:
 	spin_lock_irqsave(&cwq->lock, flags);
 	insert_work(cwq, work, &cwq->worklist);
 	spin_unlock_irqrestore(&cwq->lock, flags);
@@ -246,7 +275,8 @@ int queue_delayed_work_on(int cpu, struc
 		timer_stats_timer_set_start_info(&dwork->timer);
 
 		/* This stores cwq for the moment, for the timer_fn */
-		set_wq_data(work, wq_per_cpu(wq, raw_smp_processor_id()));
+		if (!get_wq_data(work))
+			set_wq_data(work, wq_per_cpu(wq, raw_smp_processor_id()));
 		timer->expires = jiffies + delay;
 		timer->data = (unsigned long)dwork;
 		timer->function = delayed_work_timer_fn;
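
The decision made in __queue_work() can be modelled in userspace roughly as
follows. This is only a single-threaded sketch under stated assumptions:
cwq_model, work_model and all *_model functions are hypothetical names, the
"lock" is a plain flag that merely asserts lock/unlock pairing on the same
queue, and the PENDING handling, the memory barriers and CPU hotplug are all
left out. It shows the intended ordering (lockless check, recheck under the
old queue's lock, otherwise fall back), nothing more.

```c
#include <assert.h>
#include <stdbool.h>
#include <stddef.h>

struct work_model;

/* Hypothetical stand-in for cpu_workqueue_struct. */
struct cwq_model {
	bool locked;				/* models cwq->lock, single-threaded only */
	struct work_model *current_work;	/* work being executed right now */
	struct work_model *queued[8];		/* models cwq->worklist */
	int nr_queued;
};

struct work_model {
	bool xxx;				/* models the WORK_STRUCT_XXX bit */
	struct cwq_model *last_cwq;		/* models get_wq_data(work) */
};

static void lock_model(struct cwq_model *cwq)
{
	assert(!cwq->locked);
	cwq->locked = true;
}

static void unlock_model(struct cwq_model *cwq)
{
	assert(cwq->locked);	/* unlocking a queue we never locked trips here */
	cwq->locked = false;
}

static void insert_work_model(struct cwq_model *cwq, struct work_model *work)
{
	assert(cwq->locked && cwq->nr_queued < 8);
	cwq->queued[cwq->nr_queued++] = work;
	work->last_cwq = cwq;
}

/* Returns the queue the work actually landed on. */
static struct cwq_model *queue_work_model(struct cwq_model *cwq,
					  struct work_model *work)
{
	struct cwq_model *old = work->last_cwq;

	/* Lockless pre-check: only bother if the old cwq seems to run us. */
	if (work->xxx && old && old->current_work == work) {
		bool done = false;

		lock_model(old);
		/* Recheck under the lock: the work may have finished meanwhile. */
		if (old->current_work == work) {
			insert_work_model(old, work);
			done = true;
		}
		unlock_model(old);
		if (done)
			return old;
	}

	/* Fallback: the usual path, queue on the requested cwq. */
	lock_model(cwq);
	insert_work_model(cwq, work);
	unlock_model(cwq);
	return cwq;
}
```

With this model a work marked xxx that is still running on one queue is
re-queued on that same queue even when another queue is requested, while a
work that is idle, or that does not have the bit, goes to the requested
queue as before.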

