public inbox for linux-ia64@vger.kernel.org
 help / color / mirror / Atom feed
* [Linux-ia64] switch_mm race condition with Ingo's scheduler
@ 2002-01-30 16:58 Erich Focht
  0 siblings, 0 replies; 2+ messages in thread
From: Erich Focht @ 2002-01-30 16:58 UTC (permalink / raw)
  To: linux-ia64

Hi,

I found another race condition with the O(1) MQ scheduler from Ingo. The
pattern is the following:

CPU#0:
------
schedule        -> take runqueue #0 lock
  context_switch
    activate_mm
      get_new_mmu_context
        wrap_mmu_context
          flush_tlb_all
            smp_call_function (and wait...)

CPU#1:
------
schedule        -> disable interrupts, take runqueue #1 lock
  load_balance  -> wants to get task from runqueue #0: release rq#1,
                   spinlock on rq #0 (interrupts disabled) => wait forever...


As the interrupts are disabled on CPU#1, the IPI cannot reach CPU#1.

Enabling interrupts while waiting for the rq#0 lock in load_balance is not
an option because it leads to other race conditions (the timer-interrupt
may call sched_tick and load_balance and end up spin-waiting on the same
lock).

Is it really necessary to flush all TLBs when doing a context switch on
one?

Any ideas, solutions?

Thanks,
Erich



^ permalink raw reply	[flat|nested] 2+ messages in thread
* [Linux-ia64] switch_mm race condition with Ingo's scheduler
@ 2002-07-10 17:27 Erich Focht
  0 siblings, 0 replies; 2+ messages in thread
From: Erich Focht @ 2002-07-10 17:27 UTC (permalink / raw)
  To: linux-ia64

[-- Attachment #1: Type: text/plain, Size: 175 bytes --]

Hi David,

the attached patch fixes the switch_mm -> wrap_mmu_context deadlock which
can occur in 2.5.18+ia64. We discussed this in January/February...

Regards,
Erich

[-- Attachment #2: wrap_mmu_ia64.patch --]
[-- Type: text/x-diff, Size: 2457 bytes --]

diff -urNp linux-2.5.18-ia64/arch/ia64/mm/tlb.c linux-2.5.18-ia64-wrap/arch/ia64/mm/tlb.c
--- linux-2.5.18-ia64/arch/ia64/mm/tlb.c	Sat May 25 03:55:29 2002
+++ linux-2.5.18-ia64-wrap/arch/ia64/mm/tlb.c	Wed Jul 10 17:08:46 2002
@@ -77,7 +77,11 @@ wrap_mmu_context (struct mm_struct *mm)
 			ia64_ctx.limit = tsk_context;
 	}
 	read_unlock(&tasklist_lock);
-	flush_tlb_all();
+	//flush_tlb_all(); /* potential race condition with O(1) scheduler [EF] */
+	for (i=0; i<smp_num_cpus; i++)
+		cpu_data(i)->tlb_flush = 1;
+	__flush_tlb_all();
+	local_cpu_data->tlb_flush = 0;
 }
 
 void
diff -urNp linux-2.5.18-ia64/include/asm-ia64/mmu_context.h linux-2.5.18-ia64-wrap/include/asm-ia64/mmu_context.h
--- linux-2.5.18-ia64/include/asm-ia64/mmu_context.h	Sat May 25 03:55:19 2002
+++ linux-2.5.18-ia64-wrap/include/asm-ia64/mmu_context.h	Wed Jul 10 17:03:03 2002
@@ -44,6 +44,23 @@ enter_lazy_tlb (struct mm_struct *mm, st
 {
 }
 
+/* 
+ * When the context counter wraps around all TLBs need to be flushed because
+ * an old context number might have been reused. This is signalled by a bit
+ * set in ia64_ctx.flush, which is checked in the routine below. Called by
+ * activate_mm().                                        <efocht@ess.nec.de>
+ */
+static inline void
+delayed_tlb_flush (void)
+{
+	extern void __flush_tlb_all (void);
+
+	if (unlikely(local_cpu_data->tlb_flush)) {
+		__flush_tlb_all();
+		local_cpu_data->tlb_flush = 0;
+	}
+}
+
 static inline void
 get_new_mmu_context (struct mm_struct *mm)
 {
@@ -54,7 +71,6 @@ get_new_mmu_context (struct mm_struct *m
 		mm->context = ia64_ctx.next++;
 	}
 	spin_unlock(&ia64_ctx.lock);
-
 }
 
 static inline void
@@ -113,6 +129,7 @@ activate_mm (struct mm_struct *prev, str
 	 * We may get interrupts here, but that's OK because interrupt
 	 * handlers cannot touch user-space.
 	 */
+	delayed_tlb_flush();
 	ia64_set_kr(IA64_KR_PT_BASE, __pa(next->pgd));
 	get_mmu_context(next);
 	reload_context(next);
diff -urNp linux-2.5.18-ia64/include/asm-ia64/processor.h linux-2.5.18-ia64-wrap/include/asm-ia64/processor.h
--- linux-2.5.18-ia64/include/asm-ia64/processor.h	Thu Jul  4 12:23:26 2002
+++ linux-2.5.18-ia64-wrap/include/asm-ia64/processor.h	Wed Jul 10 17:05:20 2002
@@ -151,6 +151,7 @@ extern struct cpuinfo_ia64 {
 	/* CPUID-derived information: */
 	__u64 ppn;
 	__u64 features;
+	__u8 tlb_flush;		/* flush TLB before next context switch if non-zero */
 	__u8 number;
 	__u8 revision;
 	__u8 model;

^ permalink raw reply	[flat|nested] 2+ messages in thread

end of thread, other threads:[~2002-07-10 17:27 UTC | newest]

Thread overview: 2+ messages (download: mbox.gz follow: Atom feed
-- links below jump to the message on this page --
2002-01-30 16:58 [Linux-ia64] switch_mm race condition with Ingo's scheduler Erich Focht
  -- strict thread matches above, loose matches on Subject: below --
2002-07-10 17:27 Erich Focht

This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox