From mboxrd@z Thu Jan 1 00:00:00 1970 Return-Path: Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1751585AbcFCHRo (ORCPT ); Fri, 3 Jun 2016 03:17:44 -0400 Received: from mail-wm0-f49.google.com ([74.125.82.49]:35350 "EHLO mail-wm0-f49.google.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1750923AbcFCHRm (ORCPT ); Fri, 3 Jun 2016 03:17:42 -0400 Date: Fri, 3 Jun 2016 09:16:56 +0200 From: Ingo Molnar To: Wanpeng Li Cc: linux-kernel@vger.kernel.org, kvm@vger.kernel.org, Wanpeng Li , "Peter Zijlstra (Intel)" , Rik van Riel , Thomas Gleixner , Frederic Weisbecker , Paolo Bonzini , Radim Subject: Re: [PATCH v3] sched/cputime: add steal time support to full dynticks CPU time accounting Message-ID: <20160603071656.GA7466@gmail.com> References: <1463574454-3587-1-git-send-email-wanpeng.li@hotmail.com> MIME-Version: 1.0 Content-Type: text/plain; charset=us-ascii Content-Disposition: inline In-Reply-To: <1463574454-3587-1-git-send-email-wanpeng.li@hotmail.com> User-Agent: Mutt/1.5.24 (2015-08-30) Sender: linux-kernel-owner@vger.kernel.org List-ID: X-Mailing-List: linux-kernel@vger.kernel.org * Wanpeng Li wrote: > From: Wanpeng Li > > This patch adds steal guest time support to full dynticks CPU > time accounting. After 'commit ff9a9b4c4334 ("sched, time: Switch > VIRT_CPU_ACCOUNTING_GEN to jiffy granularity")', time is jiffy > based sampling even if it's still listened to ring boundaries, so > steal_account_process_tick() is reused to account how much 'ticks' > are steal time after the last accumulation. WTF? This changelog has 4 grammar errors and it sails through review just like that? 1) What does 'time is jiffy based sampling' mean? 2) what does 'even if it's still listened to ring boundaries' mean? 3) "how muck 'ticks'"? 4) "are steal time"? So I fixed this to be at least parseable: This patch adds guest steal-time support to full dynticks CPU time accounting. After the following commit: ff9a9b4c4334 ("sched, time: Switch VIRT_CPU_ACCOUNTING_GEN to jiffy granularity") ... time sampling became jiffy based, even if it's still listened to ring boundaries, so steal_account_process_tick() is reused to account how many 'ticks' are stolen-time, after the last accumulation. Although I'm still wondering what this key phrase means: even if it's still listened to ring boundaries, Could someone please explain what this means? (Beyond the 5th grammar error this portion has, which I'll fix once it actually makes sense to me...) Furthermore, the real problem that made me go back and tear the changelog apart is that the code flow itself is incredibly ugly and fragile as hell: > write_seqcount_begin(&tsk->vtime_seqcount); > tsk->vtime_snap_whence = VTIME_SYS; > if (vtime_delta(tsk)) { > + cputime_t steal_time; > + unsigned long delta_st = steal_account_process_tick(); > delta_cpu = get_vtime_delta(tsk); > + steal_time = jiffies_to_cputime(delta_st); > + > + if (steal_time >= delta_cpu) { > + write_seqcount_end(&tsk->vtime_seqcount); > + return; > + } > + delta_cpu -= steal_time; > account_user_time(tsk, delta_cpu, cputime_to_scaled(delta_cpu)); > } > write_seqcount_end(&tsk->vtime_seqcount); > } Yeah, a return in the middle of a locking critical section, really?? Also, how about basic style details like leaving an extra newline after local variable definition sections, like every other scheduler function does? Also, what's this thing about calling a time unit variable 'delta_cpu'? When I reviewed this one of my first reactions was: "Why are we comparing time to CPU ID??". Plus as an added bonus a 'delta_st' variable name to count ticks, which variable is not just badly named but single-use. WTF? Something like this looks much better and shorter: void vtime_account_user(struct task_struct *tsk) { cputime_t delta_time, steal_time; write_seqcount_begin(&tsk->vtime_seqcount); tsk->vtime_snap_whence = VTIME_SYS; if (vtime_delta(tsk)) { delta_time = get_vtime_delta(tsk); steal_time = jiffies_to_cputime(steal_account_process_tick()); if (steal_time < delta_time) { delta_time -= steal_time; account_user_time(tsk, delta_time, cputime_to_scaled(delta_time)); } } write_seqcount_end(&tsk->vtime_seqcount); } See the consistent, obvious naming of the variables and the clear code flow? (Totally untested, etc.) But I'm very annoyed that this was in v3 and still has so many trivial problems: incredibly bad and confusing spelling, totally bad, fragile, apparently write-only code - and I counted like three acks from 3 other people who should really know better ... This is core scheduler code! Thanks, Ingo