Re: [PATCH] MIPS: ASID conflict after CPU hotplug

All of lore.kernel.org
 help / color / mirror / Atom feed

From: "Kevin D. Kissell" <kevink@paralogos.com>
To: Maksim Rayskiy <maksim.rayskiy@gmail.com>
Cc: Ralf Baechle <ralf@linux-mips.org>, linux-mips@linux-mips.org
Subject: Re: [PATCH] MIPS: ASID conflict after CPU hotplug
Date: Mon, 29 Nov 2010 18:53:53 -0800	[thread overview]
Message-ID: <4CF46741.9060902@paralogos.com> (raw)
In-Reply-To: <AANLkTinUSjvjwHVJoRW-Fr75WDfheq3hSM_hEBMsEUXK@mail.gmail.com>

Having done surgery in the past to the ASID management code, this sounds 
like
a much more rational explanation of the observed problem.  Your proposed mod
sounds like it might work, but local_flush_tlb_mm() is implemented in 
terms of
drop_mmu_context(), which only does what you want if the CPU executing the
code is *not* one of the CPUs participating in the memory map.  Otherwise,
instead of clearing the ASID in the table, it allocates a new one.  I 
have a concern
that this may re-randomize things in a way that will solve your problem 
*most*
of the time, but not always.

Now that we have a better understanding of the failure, your initial notion
of *not* restarting the ASID sequence on a hotplug insertion doesn't seem
as crazy - it's certainly the zen "doing by doing nothing" way to go, 
without
the iterative overhead of walking the full process table.  But as we 
discussed,
it has the downside of requiring new state infrastructure for tracking 
hotplugs,
and we'd want to be sure that it's well behaved in the case where we have a
post-initial-boot hotplug event that brings a CPU online that has never been
initialized.  To take that tack, we'd need a per-CPU-slot bit which says 
"I have
a valid ASID sequence, thank you", which is checked in per_cpu_trap_init()
(or some other appropriate hook), and the ASID "cache" is initialized only
if it's needed, which *might* be on a hotplug.

/K.

On 11/29/10 17:35, Maksim Rayskiy wrote:
> Kevin,
>
> Thank you for your suggestions. I think I see where the conflict is
> coming from.
> If a usermode process during its lifetime migrates between cpus (TPs
> in my case) it has a non-zero value in cpu_context(cpu, mm) fields
> (asid in mips case) for all cpus it has ever run on. Moving from one
> cpu to another does not clear cpu_context. This is okay as long as all
> cpus are running. When cpu hotplug occurs, cpu_context for this
> threads running on cpu being stopped does not get cleared either. It
> might have been cleared if local_flush_tlb_mm() is called for these
> threads, but I do not see it happening.
> As a result, after secondary cpu is brought online, some processes
> already have stale asid values stored in their mm structures. One of
> these processes in my case happens to be 'init' with asid=0x101, which
> causes a conflict because next asid value selected by
> get_new_mmu_context() is ASID_FIRST_VERSION+ASID_INC=0x101 as well.
> Basically, at this point there is a discrepancy between global
> (per-cpu) asid_cache value and some individual thread asid values
> stored in mm->context.asid[cpu] fields.
> Just adding a simple code to call drop_mmu_context() on every thread
> which has ever run on the cpu getting offlined solves my problem. It
> does not require distinguishing warm boot from cold as before.
> Something like (called from hotplugged cpu)
>
> void local_flush_tlb_all_mm(void)
> {
> 	struct task_struct *p;
> 	for_each_process(p)
> 		if (p->mm)
> 			local_flush_tlb_mm(p->mm);
> }
>
> I can generate a patch if you think it is a more viable solution.
>
> Regards,
> Maksim.
>
> On Thu, Nov 25, 2010 at 7:57 AM, Kevin D. Kissell<kevink@paralogos.com>  wrote:
>    
>> This approach certainly makes per_cpu_trap_init() more readable and
>> maintainable,  but it has the downside of creating state and infrastructure
>> that have footprints elsewhere that add to global cruft and complexity.
>>   Note that your patch, as written, wouldn't solve your problem, because it
>> doesn't include the code that would actually set and clear the elements of
>> your cpu_warm_boot[] array.  If we do need to pay attention to warm boot
>> state elsewhere in the kernel (Does any other architecture?  That should be
>> a clue...), then some bits in memory like that should perhaps be defined
>> (though I'd wonder why it couldn't be a bit in some existing per-CPU state
>> entity like cpu_data[]).  Otherwise, as I said earlier, the cleanest
>> approach strikes me as one of resetting the value of EntryHi as well as the
>> ASID cache when the hotplug event takes place.  The cleanest possible
>> outcome would be if one could *move* the reset initialization of EntryHi
>> from wherever it is now to per_cpu_trap_init(), so there would be *zero* net
>> additional code, but it may be (it's Thanksgiving and I'm limited in time
>> and internet access, so I can't really go look for you) that it's
>> initialized as a side effect of something that happens repeatedly, such that
>> actually *moving* it would be dangerous.  But if you have the time, try
>> setting up EntryHi explicitly and unconditionally in per_cpu_trap_init() and
>> see if it doesn't solve your initial problem.
>>
>> Happy Holiday to you all,
>>
>> /K.
>>
>> On 11/24/10 7:03 PM, Maksim Rayskiy wrote:
>>      
>>> I certainly agree that it is a bad idea to look at the current value
>>> of asid_cache when figuring out if it is a warm or cold boot.
>>> I could not tell how the code ended up with this entryHi value after
>>> the hotplug. So, I can only address the simplest portion of issues you
>>> mentioned.
>>> How about we add a variable to tell warm restart from cold and
>>> preserve asid_cache across hotplug event. It is not much of an
>>> improvement over the original code, I must admit.
>>>
>>> Signed-off-by: Maksim Rayskiy<maksim.rayskiy@gmail.com>
>>>
>>> diff --git a/arch/mips/kernel/traps.c b/arch/mips/kernel/traps.c
>>> index d83f325..9116adb 100644
>>> --- a/arch/mips/kernel/traps.c
>>> +++ b/arch/mips/kernel/traps.c
>>> @@ -1489,6 +1489,8 @@ static int __init ulri_disable(char *s)
>>>   }
>>>   __setup("noulri", ulri_disable);
>>>
>>> +static int cpu_warm_boot[NR_CPUS];
>>> +
>>>   void __cpuinit per_cpu_trap_init(void)
>>>   {
>>>          unsigned int cpu = smp_processor_id();
>>> @@ -1577,7 +1579,9 @@ void __cpuinit per_cpu_trap_init(void)
>>>          }
>>>   #endif /* CONFIG_MIPS_MT_SMTC */
>>>
>>> -       cpu_data[cpu].asid_cache = ASID_FIRST_VERSION;
>>> +       if (!cpu_warm_boot[cpu])
>>> +               cpu_data[cpu].asid_cache = ASID_FIRST_VERSION;
>>> +       cpu_warm_boot[cpu] = 1;
>>>          TLBMISS_HANDLER_SETUP();
>>>
>>>          atomic_inc(&init_mm.mm_count);
>>>
>>>
>>> On Mon, Nov 22, 2010 at 1:34 PM, Kevin D. Kissell<kevink@paralogos.com>
>>>   wrote:
>>>        
>>>> On 11/21/10 19:41, Ralf Baechle wrote:
>>>>          
>>>>> ...
>>>>> Need to think a little about potencial consequences of your suggested
>>>>> patch.  It seems ok.  Kevin, what do you think?
>>>>>
>>>>>            
>>>> Since you ask, while I would imagine that Maksim's patch works fine for
>>>> him,
>>>> I'm not sure that it's really the right fix.  I never did succeed in
>>>> getting
>>>> CPU hotplugging working back in the 2.6.18 days, so I don't know as much
>>>> about it as I'd like, but if per_cpu_trap_init() needs to be invoked on a
>>>> hot plugin event, and if its behavior needs to be different , I'd really,
>>>> really prefer to see that state propagated explicitly, rather than
>>>> inferring
>>>> it from whatever happens to be in cache/memory at
>>>> cpu_data[cpu].asid_cache.
>>>>   But beyond that, if the problem arises because setting
>>>> cpu_data[cpu].asid_cache to a known initial state on a plugin event can
>>>> conflict with the residual content of EntryHi, rather than creating a
>>>> special case where we don't initialize the ASID cache, since we seem to
>>>> be
>>>> (re)initializing a lot of other privileged state, why aren't we also
>>>> setting
>>>> a known sane initial EntryHi value?   Wouldn't that be a cleaner fix?
>>>>   (And
>>>> I don't mean that as a rhetorical question - there may be very good
>>>> reasons
>>>> to let EntryHi values persist across hot unplug/plug events.  I just
>>>> can't
>>>> imagine them offhand over coffee.)
>>>>
>>>> /K.
>>>>
>>>>
>>>>          
>>
>>

next prev parent reply	other threads:[~2010-11-30  2:54 UTC|newest]

Thread overview: 11+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2010-11-17 18:49 [PATCH] MIPS: ASID conflict after CPU hotplug Maksim Rayskiy
2010-11-22  3:41 ` Ralf Baechle
2010-11-22 18:38   ` Maksim Rayskiy
2010-11-22 21:34   ` Kevin D. Kissell
     [not found]     ` <AANLkTimuJLxG2KoibRxzcHkX3LoKsTWqJSF_e=ouFi+b@mail.gmail.com>
2010-11-25 15:57       ` Kevin D. Kissell
     [not found]         ` <AANLkTinUSjvjwHVJoRW-Fr75WDfheq3hSM_hEBMsEUXK@mail.gmail.com>
2010-11-30  2:53           ` Kevin D. Kissell [this message]
2010-11-30  3:21             ` Maksim Rayskiy
2010-11-30 10:05               ` Kevin D. Kissell
2010-11-30 19:49                 ` Maksim Rayskiy
2010-12-01 11:51                   ` Sergei Shtylyov
2011-11-10 13:09 ` Ralf Baechle

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=4CF46741.9060902@paralogos.com \
    --to=kevink@paralogos.com \
    --cc=linux-mips@linux-mips.org \
    --cc=maksim.rayskiy@gmail.com \
    --cc=ralf@linux-mips.org \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link

Be sure your reply has a Subject: header at the top and a blank line before the message body.

This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.