* [PATCH] Fix ETA calculations
@ 2015-11-19 14:52 Jens Rosenboom
2015-11-19 20:32 ` Jens Axboe
0 siblings, 1 reply; 2+ messages in thread
From: Jens Rosenboom @ 2015-11-19 14:52 UTC (permalink / raw)
To: Fio
There are a couple of issues in eta.c:
- The comparison for time- vs. rate-based ETA calculation is done the
wrong way round.
- The elapsed ramp time should only be subtracted when the thread has
indeed entered the ramp phase.
- Currently jobs that are stonewalled are not taken into account for ETA
calculation. This patch at least fixes this for the case that there is
only a single job per stonewall group.
---
This patch tries to fix some issues I have found, this may
also be related to the ones mentioned earlier in
http://www.spinics.net/lists/fio/msg04187.html
The part regarding stonewalled jobs may require some further work, but
at least it gives proper values now for my standard fio runs containing
just a sequence of singular jobs.
Also, I'm still wondering about the intention behind the way done_secs
is meant to work. It cumulates time spent in finished jobs globally, but
is being used in per-job ETA calculations. This seems to break at
least for the case of stonewalled jobs, too.
eta.c | 29 ++++++++++++++++++-----------
1 file changed, 18 insertions(+), 11 deletions(-)
diff --git a/eta.c b/eta.c
index 7eb1c27..e73cc2c 100644
--- a/eta.c
+++ b/eta.c
@@ -214,7 +214,7 @@ static unsigned long thread_eta(struct thread_data *td)
if (td->o.time_based) {
if (timeout) {
perc_t = (double) elapsed / (double) timeout;
- if (perc_t < perc)
+ if (perc_t > perc)
perc = perc_t;
} else {
/*
@@ -226,6 +226,7 @@ static unsigned long thread_eta(struct thread_data *td)
}
eta_sec = (unsigned long) (elapsed * (1.0 / perc)) - elapsed;
+ // printf("eta_sec: %lu, elapsed: %lu, perc: %02.6f, done: %llu, total: %llu\n", eta_sec, elapsed, perc, bytes_done, bytes_total);
if (td->o.timeout &&
eta_sec > (timeout + done_secs - elapsed))
@@ -250,7 +251,7 @@ static unsigned long thread_eta(struct thread_data *td)
t_eta = __timeout + start_delay + ramp_time;
t_eta /= 1000000ULL;
- if (in_ramp_time(td)) {
+ if ((td->runstate == TD_RAMP) && in_ramp_time(td)) {
unsigned long ramp_left;
ramp_left = mtime_since_now(&td->epoch);
@@ -438,19 +439,25 @@ int calc_thread_status(struct jobs_eta *je, int force)
}
}
- if (exitall_on_terminate)
+ if (exitall_on_terminate) {
je->eta_sec = INT_MAX;
- else
- je->eta_sec = 0;
-
- for_each_td(td, i) {
- if (exitall_on_terminate) {
+ for_each_td(td, i) {
if (eta_secs[i] < je->eta_sec)
je->eta_sec = eta_secs[i];
- } else {
- if (eta_secs[i] > je->eta_sec)
- je->eta_sec = eta_secs[i];
}
+ } else {
+ unsigned long eta_stone = 0;
+ je->eta_sec = 0;
+ for_each_td(td, i) {
+ if ((td->runstate == TD_NOT_CREATED) && (td->o.stonewall)) {
+ eta_stone += eta_secs[i];
+ } else {
+ if (eta_secs[i] > je->eta_sec)
+ je->eta_sec = eta_secs[i];
+ }
+ }
+ je->eta_sec += eta_stone;
+ // printf("stoned: %lu\n", eta_stone);
}
free(eta_secs);
--
2.4.10
^ permalink raw reply related [flat|nested] 2+ messages in thread
* Re: [PATCH] Fix ETA calculations
2015-11-19 14:52 [PATCH] Fix ETA calculations Jens Rosenboom
@ 2015-11-19 20:32 ` Jens Axboe
0 siblings, 0 replies; 2+ messages in thread
From: Jens Axboe @ 2015-11-19 20:32 UTC (permalink / raw)
To: Jens Rosenboom, Fio
On 11/19/2015 07:52 AM, Jens Rosenboom wrote:
> There are a couple of issues in eta.c:
>
> - The comparison for time- vs. rate-based ETA calculation is done the
> wrong way round.
> - The elapsed ramp time should only be subtracted when the thread has
> indeed entered the ramp phase.
> - Currently jobs that are stonewalled are not taken into account for ETA
> calculation. This patch at least fixes this for the case that there is
> only a single job per stonewall group.
> ---
>
> This patch tries to fix some issues I have found, this may
> also be related to the ones mentioned earlier in
> http://www.spinics.net/lists/fio/msg04187.html
>
> The part regarding stonewalled jobs may require some further work, but
> at least it gives proper values now for my standard fio runs containing
> just a sequence of singular jobs.
Thanks this is great, I'll get this applied and tested.
> Also, I'm still wondering about the intention behind the way done_secs
> is meant to work. It cumulates time spent in finished jobs globally, but
> is being used in per-job ETA calculations. This seems to break at
> least for the case of stonewalled jobs, too.
The eta output could use some love, no doubt about that. It could easily
be a lot more clever wrt multiple jobs, fluctuations, etc.
--
Jens Axboe
^ permalink raw reply [flat|nested] 2+ messages in thread
end of thread, other threads:[~2015-11-19 20:32 UTC | newest]
Thread overview: 2+ messages (download: mbox.gz follow: Atom feed
-- links below jump to the message on this page --
2015-11-19 14:52 [PATCH] Fix ETA calculations Jens Rosenboom
2015-11-19 20:32 ` Jens Axboe
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox