From: Michael Neuling <mikey@neuling.org>
To: Vaidyanathan Srinivasan <svaidy@linux.vnet.ibm.com>
Cc: Linux Kernel <linux-kernel@vger.kernel.org>,
venkatesh.pallipadi@intel.com, suresh.b.siddha@intel.com,
Balbir Singh <balbir@linux.vnet.ibm.com>,
"Amit K. Arora" <aarora@linux.vnet.ibm.com>
Subject: Re: [RFC PATCH v1 2/3] Make calls to account_scaled_stats
Date: Thu, 29 May 2008 10:18:56 -0500 [thread overview]
Message-ID: <19576.1212074336@neuling.org> (raw)
In-Reply-To: <20080526143146.24680.36724.stgit@drishya.in.ibm.com>
In message <20080526143146.24680.36724.stgit@drishya.in.ibm.com> you wrote:
> Hook various accounting functions to call scaled stats
>
> * Hook porcess contect switch: __switch_to()
> * Hook IRQ handling account_system_vtime() in hardirq.hA
> * Update __delayacct_add_tsk() to take care of scaling by 1000
> * Update bacct_add_tsk() to take care of scaling by 1000
>
> Signed-off-by: Amit K. Arora <aarora@linux.vnet.ibm.com>
> Signed-off-by: Vaidyanathan Srinivasan <svaidy@linux.vnet.ibm.com>
> ---
>
> arch/x86/kernel/process_32.c | 8 ++++++++
> include/linux/hardirq.h | 4 ++++
> kernel/delayacct.c | 7 ++++++-
> kernel/timer.c | 2 --
> kernel/tsacct.c | 10 ++++++++--
> 5 files changed, 26 insertions(+), 5 deletions(-)
>
> diff --git a/arch/x86/kernel/process_32.c b/arch/x86/kernel/process_32.c
> index f8476df..c81a783 100644
> --- a/arch/x86/kernel/process_32.c
> +++ b/arch/x86/kernel/process_32.c
> @@ -56,6 +56,9 @@
> #include <asm/cpu.h>
> #include <asm/kdebug.h>
>
> +extern void account_scaled_stats(struct task_struct *tsk);
> +extern void reset_for_scaled_stats(struct task_struct *tsk);
> +
> asmlinkage void ret_from_fork(void) __asm__("ret_from_fork");
>
> static int hlt_counter;
> @@ -660,6 +663,11 @@ struct task_struct * __switch_to(struct task_struct *pre
v_p, struct task_struct
> loadsegment(gs, next->gs);
>
> x86_write_percpu(current_task, next_p);
> + /* Account scaled statistics for the task leaving CPU */
> + account_scaled_stats(prev_p);
> + barrier();
> + /* Initialise stats counter for new task */
> + reset_for_scaled_stats(next_p);
>
> return prev_p;
> }
> diff --git a/include/linux/hardirq.h b/include/linux/hardirq.h
> index 181006c..4458736 100644
> --- a/include/linux/hardirq.h
> +++ b/include/linux/hardirq.h
> @@ -7,6 +7,9 @@
> #include <asm/hardirq.h>
> #include <asm/system.h>
>
> +/* TBD: Add config option */
> +extern void account_scaled_stats(struct task_struct *tsk);
> +
> /*
> * We put the hardirq and softirq counter into the preemption
> * counter. The bitmask has the following meaning:
> @@ -115,6 +118,7 @@ struct task_struct;
> #ifndef CONFIG_VIRT_CPU_ACCOUNTING
> static inline void account_system_vtime(struct task_struct *tsk)
> {
> + account_scaled_stats(tsk);
> }
> #endif
>
> diff --git a/kernel/delayacct.c b/kernel/delayacct.c
> index 10e43fd..3e2938f 100644
> --- a/kernel/delayacct.c
> +++ b/kernel/delayacct.c
> @@ -117,7 +117,12 @@ int __delayacct_add_tsk(struct taskstats *d, struct task
_struct *tsk)
>
> tmp = (s64)d->cpu_scaled_run_real_total;
> cputime_to_timespec(tsk->utimescaled + tsk->stimescaled, &ts);
> - tmp += timespec_to_ns(&ts);
> + /* HACK: Remember, we multipled the cputime_t by 1000 to include
> + * fraction. Now it is time to scale it back to correct 'ns' value.
> + * Perhaps, we should use nano second unit (u64 type) for utimescaled
> + * and stimescaled?
> + */
> + tmp += div_s64(timespec_to_ns(&ts),1000);
This is going to break other archs (specifically powerpc) which doesn't
do this magical scale by 1000.
How often is this function called as the divide is going to slow things
down?
> d->cpu_scaled_run_real_total =
> (tmp < (s64)d->cpu_scaled_run_real_total) ? 0 : tmp;
>
> diff --git a/kernel/timer.c b/kernel/timer.c
> index ceacc66..de8a615 100644
> --- a/kernel/timer.c
> +++ b/kernel/timer.c
> @@ -964,10 +964,8 @@ void account_process_tick(struct task_struct *p, int use
r_tick)
>
> if (user_tick) {
> account_user_time(p, one_jiffy);
> - account_user_time_scaled(p, cputime_to_scaled(one_jiffy));
> } else {
> account_system_time(p, HARDIRQ_OFFSET, one_jiffy);
> - account_system_time_scaled(p, cputime_to_scaled(one_jiffy));
> }
> }
Why did you remove this?
> #endif
> diff --git a/kernel/tsacct.c b/kernel/tsacct.c
> index 4ab1b58..ee0d93b 100644
> --- a/kernel/tsacct.c
> +++ b/kernel/tsacct.c
> @@ -62,10 +62,16 @@ void bacct_add_tsk(struct taskstats *stats, struct task_s
truct *tsk)
> rcu_read_unlock();
> stats->ac_utime = cputime_to_msecs(tsk->utime) * USEC_PER_MSEC;
> stats->ac_stime = cputime_to_msecs(tsk->stime) * USEC_PER_MSEC;
> + /* HACK: cputime unit is jiffies on x86 and not good for fractional
> + * additional. cputime_t type {u,s}timescaled is multiplied by
> + * 1000 for scaled accounting. Hence, cputime_to_msecs will actually
> + * give the required micro second value. The multiplier
> + * USEC_PER_MSEC has been dropped.
> + */
> stats->ac_utimescaled =
> - cputime_to_msecs(tsk->utimescaled) * USEC_PER_MSEC;
> + cputime_to_msecs(tsk->utimescaled);
> stats->ac_stimescaled =
> - cputime_to_msecs(tsk->stimescaled) * USEC_PER_MSEC;
> + cputime_to_msecs(tsk->stimescaled);
Again, isn't this going to effect other archs?
Mikey
> stats->ac_minflt = tsk->min_flt;
> stats->ac_majflt = tsk->maj_flt;
>
>
next prev parent reply other threads:[~2008-05-29 15:21 UTC|newest]
Thread overview: 31+ messages / expand[flat|nested] mbox.gz Atom feed top
2008-05-26 14:31 [RFC PATCH v1 0/3] Scaled statistics using APERF/MPERF in x86 Vaidyanathan Srinivasan
2008-05-26 14:31 ` [RFC PATCH v1 1/3] General framework for APERF/MPERF access and accounting Vaidyanathan Srinivasan
2008-05-26 18:11 ` Balbir Singh
2008-05-27 14:54 ` Vaidyanathan Srinivasan
2008-05-26 14:31 ` [RFC PATCH v1 2/3] Make calls to account_scaled_stats Vaidyanathan Srinivasan
2008-05-26 18:18 ` Balbir Singh
2008-05-27 15:02 ` Vaidyanathan Srinivasan
2008-05-29 15:18 ` Michael Neuling [this message]
2008-05-29 18:23 ` Vaidyanathan Srinivasan
2008-05-26 14:31 ` [RFC PATCH v1 3/3] Print scaled utime and stime in getdelays Vaidyanathan Srinivasan
2008-05-26 15:50 ` [RFC PATCH v1 0/3] Scaled statistics using APERF/MPERF in x86 Arjan van de Ven
2008-05-26 17:24 ` Balbir Singh
2008-05-26 18:00 ` Arjan van de Ven
2008-05-26 18:36 ` Balbir Singh
2008-05-26 18:51 ` Arjan van de Ven
2008-05-27 12:59 ` Balbir Singh
2008-05-27 13:19 ` Vaidyanathan Srinivasan
2008-05-27 14:15 ` Arjan van de Ven
2008-05-27 15:27 ` Vaidyanathan Srinivasan
2008-05-31 21:27 ` Pavel Machek
2008-06-02 17:54 ` Vaidyanathan Srinivasan
2008-06-03 2:20 ` Arjan van de Ven
2008-05-27 13:29 ` Vaidyanathan Srinivasan
2008-05-27 14:19 ` Arjan van de Ven
2008-05-27 15:20 ` Vaidyanathan Srinivasan
2008-05-27 14:04 ` Vaidyanathan Srinivasan
2008-05-27 16:40 ` Arjan van de Ven
2008-05-27 18:26 ` Vaidyanathan Srinivasan
2008-05-31 21:17 ` Pavel Machek
2008-05-31 21:13 ` Pavel Machek
2008-06-02 6:08 ` Balbir Singh
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=19576.1212074336@neuling.org \
--to=mikey@neuling.org \
--cc=aarora@linux.vnet.ibm.com \
--cc=balbir@linux.vnet.ibm.com \
--cc=linux-kernel@vger.kernel.org \
--cc=suresh.b.siddha@intel.com \
--cc=svaidy@linux.vnet.ibm.com \
--cc=venkatesh.pallipadi@intel.com \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox