From: Saravana Kannan <skannan@codeaurora.org>
To: Mathieu Desnoyers <mathieu.desnoyers@efficios.com>
Cc: cpufreq <cpufreq@vger.kernel.org>,
linux-arm-msm <linux-arm-msm@vger.kernel.org>,
Dave Jones <davej@redhat.com>, Thomas Renninger <trenn@suse.de>,
Arjan van de Ven <arjan@infradead.org>,
linux-kernel@vger.kernel.org, Ingo Molnar <mingo@elte.hu>,
Peter Zijlstra <peterz@infradead.org>
Subject: Re: CPUfreq - udelay() interaction issues
Date: Tue, 27 Apr 2010 16:41:39 -0700 [thread overview]
Message-ID: <4BD77633.9050403@codeaurora.org> (raw)
In-Reply-To: <20100424135817.GA27322@Krystal>
Hi Mathieu,
Thanks for taking the time to provide your input. More responses below.
Mathieu Desnoyers wrote:
> * Saravana Kannan (skannan@codeaurora.org) wrote:
> [...]
>> Seems a bit more complicated than what I had in mind. This is touching
>> the scheduler I think we can get away without having to. Also, there is
>> no simple implementation for the "slowpath" that can guarantee the delay
>> without starting over the loop and hoping not to get interrupted or just
>> giving up and doing a massively inaccurate delay (like msleep, etc).
>
> Not necessarily. Another way to do it: we could keep the udelay loop counter in
> the task struct. When ondemand changes frequency, and upon migration, this
> counter would be adapted to the current cpu frequency.
This will take us back to the scalability problem because we now have to
go through every process running on a CPU to update their udelay loop
counters whenever the CPU freq changes.
>> I was thinking of something along the lines of this:
>>
>> udelay()
>> {
>> if (!is_atomic())
>
> see hardirq.h:
>
> /*
> * Are we running in atomic context? WARNING: this macro cannot
> * always detect atomic context; in particular, it cannot know about
> * held spinlocks in non-preemptible kernels. Thus it should not be
> * used in the general case to determine whether sleeping is possible.
> * Do not use in_atomic() in driver code.
> */
> #define in_atomic() ((preempt_count() & ~PREEMPT_ACTIVE) != PREEMPT_INATOMIC_BASE)
>
> Sorry, your scheme is broken on !PREEMPT kernels.
If it's a !PREEMPT kernel, we don't have to worry about the CPUfreq
changing on us. CPU freq is changed in a deferrable work queue context.
>> down_read(&freq_sem);
>> /* else
>> do nothing since cpufreq can't interrupt you.
>> */
>
> This comment seems broken. in_atomic() can return true because preemption is
> disabled, thus letting cpufreq interrupts coming in.
As mentioned earlier, cpufreq change can't happen when udelay is running
in !PREEMPT kernel (which is where in_atomic() won't work). Btw, I
actually wasn't referring to the real in_atomic() macro (I remembered it
having limitations). But now that you mentioned the limitation, it might
not be a problem after all.
>> call usual code since cpufreq is not going to preempt you.
>>
>> if (!is_atomic())
>> up_read(&freq_sem);
>> }
>>
>> __cpufreq_driver_target(...)
>> {
>> down_write(&freq_sem);
>> cpufreq_driver->target(...);
>> up_write(&freq_sem);
>> }
>>
>> In the implementation of the cpufreq driver, they just need to make sure
>> they always increase the LPJ _before_ increasing the freq and decrease
>> the LPJ _after_ decreasing the freq. This is make sure that when an
>> interrupt handler preempts the cpufreq driver code (since atomic
>> contexts aren't looking at the r/w semaphore) the LPJ value will be good
>> enough to satisfy the _at least_ guarantee of udelay().
>>
>> For the CPU switching issue, I think the solution I proposed is quite
>> simple and should work.
>
> You mean this ?
>
>>>>> udelay(us)
>>>>> {
>>>>> set cpu affinity to current CPU;
>>>>> Do the usual udelay code;
>>>>> restore cpu affinity status;
>>>>> }
>
> Things like lock scalability and performance degradations comes to my mind. We
> can expect some drivers to make very heavy use of udelay(). This should not
> bring a 4096-core box to its knees. sched_setaffinity() is very far from being
> lightweight, as it locks cpu hotplug (that's a global mutex protecting a
> refcount), allocates memory, manipulates cpumasks, etc...
Hmm... set affinity does seem more complicated than what I expected.
>> Does my better explained solution look palatable?
>
> Nope, not on a multiprocessor system.
Yes, set affinity seems to be a problem.
Didn't get to work on this for the past few days. Let me think more
about this before I get back. In the mean time, if you can come up with
a relatively simple solution without scalability issues, I would be glad
to drop my existing solution.
Thanks again for the input.
-Saravana
prev parent reply other threads:[~2010-04-27 23:41 UTC|newest]
Thread overview: 16+ messages / expand[flat|nested] mbox.gz Atom feed top
2010-04-22 3:34 CPUfreq - udelay() interaction issues Saravana Kannan
2010-04-22 21:22 ` Saravana Kannan
2010-04-22 23:18 ` Thomas Renninger
2010-04-22 23:37 ` Saravana Kannan
2010-04-22 23:21 ` Saravana Kannan
2010-04-23 18:40 ` Mathieu Desnoyers
2010-04-23 19:22 ` Arjan van de Ven
2010-04-23 19:55 ` Mathieu Desnoyers
2010-04-24 18:56 ` Arjan van de Ven
2010-04-24 21:00 ` Mathieu Desnoyers
2010-04-24 23:20 ` Arjan van de Ven
2010-04-24 2:57 ` Saravana Kannan
2010-04-24 2:49 ` Saravana Kannan
2010-04-24 5:56 ` Pavel Machek
2010-04-24 13:58 ` Mathieu Desnoyers
2010-04-27 23:41 ` Saravana Kannan [this message]
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=4BD77633.9050403@codeaurora.org \
--to=skannan@codeaurora.org \
--cc=arjan@infradead.org \
--cc=cpufreq@vger.kernel.org \
--cc=davej@redhat.com \
--cc=linux-arm-msm@vger.kernel.org \
--cc=linux-kernel@vger.kernel.org \
--cc=mathieu.desnoyers@efficios.com \
--cc=mingo@elte.hu \
--cc=peterz@infradead.org \
--cc=trenn@suse.de \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.