From mboxrd@z Thu Jan 1 00:00:00 1970 From: Johannes Weiner Subject: Re: [PATCH v2 5/5] psi: introduce psi monitor Date: Wed, 16 Jan 2019 14:27:44 -0500 Message-ID: <20190116192744.GA1576@cmpxchg.org> References: <20190110220718.261134-1-surenb@google.com> <20190110220718.261134-6-surenb@google.com> <20190114102137.GB14054@worktop.programming.kicks-ass.net> <20190116132446.GF10803@hirez.programming.kicks-ass.net> <20190116191728.GA1380@cmpxchg.org> Mime-Version: 1.0 Return-path: DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=cmpxchg-org.20150623.gappssmtp.com; s=20150623; h=date:from:to:cc:subject:message-id:references:mime-version :content-disposition:in-reply-to:user-agent; bh=WiVtpIOuEwH4DhBlsSM5aJC00b41A66MDpCmjW42bSs=; b=cuMu4RiAnGXx2I+SUKExjCnU+aFOY/M6XwCvtAb5DrpByPtxk2pa4v5MavdkB+jxr5 vIFRDzdZVnLY62BzGNnvFt8620kZQA7tU36sJCp3WdWGSkpe9urA+idxSp6llrqwX6IK 385vnuqHJMXf5xvzdhZh70oiei31K9tsZ2USDFYJriBloD1IQNqjSx57DlElEfMeDi37 BhVLkctMRe/eiNKD5P92BvaziOx0fYopaziptyNkL64zKIds37UablJmBAK+YyZ5AeOk 4fj9W71/L5V9Vtoa3ea6i4/PydI+RaQG6mJqL9NFlzyh2y0HEx9b9s5DbFh2tG3I5wgE Rtfg== Content-Disposition: inline In-Reply-To: <20190116191728.GA1380@cmpxchg.org> Sender: linux-kernel-owner@vger.kernel.org List-ID: Content-Type: text/plain; charset="us-ascii" Content-Transfer-Encoding: 7bit To: Suren Baghdasaryan Cc: Peter Zijlstra , Greg Kroah-Hartman , Tejun Heo , lizefan@huawei.com, axboe@kernel.dk, dennis@kernel.org, Dennis Zhou , Ingo Molnar , Andrew Morton , Jonathan Corbet , cgroups@vger.kernel.org, linux-mm , linux-doc@vger.kernel.org, LKML , kernel-team@android.com On Wed, Jan 16, 2019 at 02:17:28PM -0500, Johannes Weiner wrote: > On Wed, Jan 16, 2019 at 09:39:13AM -0800, Suren Baghdasaryan wrote: > > On Wed, Jan 16, 2019 at 5:24 AM Peter Zijlstra wrote: > > > > > > On Mon, Jan 14, 2019 at 11:30:12AM -0800, Suren Baghdasaryan wrote: > > > > For memory ordering (which Johannes also pointed out) the critical point is: > > > > > > > > times[cpu] += delta | if g->polling: > > > > smp_wmb() | g->polling = polling = 0 > > > > cmpxchg(g->polling, 0, 1) | smp_rmb() > > > > | delta = times[*] (through goto SLOWPATH) > > > > > > > > So that hotpath writes to times[] then g->polling and slowpath reads > > > > g->polling then times[]. cmpxchg() implies a full barrier, so we can > > > > drop smp_wmb(). Something like this: > > > > > > > > times[cpu] += delta | if g->polling: > > > > cmpxchg(g->polling, 0, 1) | g->polling = polling = 0 > > > > | smp_rmb() > > > > | delta = times[*] (through goto SLOWPATH) > > > > > > > > Would that address your concern about ordering? > > > > > > cmpxchg() implies smp_mb() before and after, so the smp_wmb() on the > > > left column is superfluous. > > > > Should I keep it in the comments to make it obvious and add a note > > about implicit barriers being the reason we don't call smp_mb() in the > > code explicitly? > > I'd keep 'em out if they aren't actually in the code. But I'd switch > > delta = times[*] > > in this comment to to > > get_recent_times() // implies smp_mb() Actually, I might have been mistaken about this. The seqcount locking does an smp_rmb() and an smp_wmb(), and that orders reads and writes respectively, but doesn't necessarily order reads against writes. So I think we need an explicit smp_mb() after all.