From mboxrd@z Thu Jan 1 00:00:00 1970 From: Julien Grall Subject: Re: [PATCH v2 0/2] xen/arm: vcpu_yield on WFE Date: Wed, 23 Jul 2014 19:12:18 +0100 Message-ID: <53CFFB02.2030802@linaro.org> References: <53CFDEB9.5010007@linaro.org> <53CFE3DC.1070700@linaro.org> Mime-Version: 1.0 Content-Type: text/plain; charset="us-ascii" Content-Transfer-Encoding: 7bit Return-path: Received: from mail6.bemta4.messagelabs.com ([85.158.143.247]) by lists.xen.org with esmtp (Exim 4.72) (envelope-from ) id 1XA11d-0000Bw-4B for xen-devel@lists.xenproject.org; Wed, 23 Jul 2014 18:12:25 +0000 Received: by mail-wi0-f182.google.com with SMTP id d1so2659391wiv.15 for ; Wed, 23 Jul 2014 11:12:22 -0700 (PDT) In-Reply-To: <53CFE3DC.1070700@linaro.org> List-Unsubscribe: , List-Post: List-Help: List-Subscribe: , Sender: xen-devel-bounces@lists.xen.org Errors-To: xen-devel-bounces@lists.xen.org To: Stefano Stabellini Cc: xen-devel@lists.xenproject.org, Ian Campbell , anup.patel@linaro.org, pranavkumar@linaro.org List-Id: xen-devel@lists.xenproject.org On 07/23/2014 05:33 PM, Julien Grall wrote: > On 07/23/2014 05:21 PM, Stefano Stabellini wrote: >> On Wed, 23 Jul 2014, Julien Grall wrote: >>> On 07/23/2014 02:35 PM, Stefano Stabellini wrote: >>>> Hi all, >>> >>> Hi Stefano, >>> >>>> the patches should be self-explanatory. >>> >>> I gave a try of this series on midway and hackbench get stuck after few >>> minutes. >> >> Just as a confirmation, you did run hackbench before on midway and it >> didn't get stuck, right? > > Yes. The latest working version is c047211. > > I got 2 different kind of errors with a guest setup with 2 VCPUs: > 1) Hackbench gets stuck and use 100% of the CPU. > 2) Hackbench heap gets corrupted > > For the former, I was thinking about a possible missing SGI. > >>> FYI, I'm not able to reproduce on our local midway node. >> >> Without a reproducing environment is going to be very hard to fix it. >> It might be worth living it running in a loop for a few hours... > > Sorry I meant I'm able. I spent my afternoon to set up the same > environment as the CI loop. > > It looks like the bug is likely the same on the Arndale, even if WFE is > not trap. I will try to bisect on this board. Hrmmmm, I misread the CI log [1]. The Arndale is working fine without this patch. I'm able to confirm to confirm this behavior locally on my board. So both Midway and the Arndale is impacted with this bug. Even though it took a bit longer to reproduce the issue on the Arndale. With your series, the benchmark is broken on midway and working on the Arndale. So there is a hidden bug at least on ARM32. [1] https://validation.linaro.org/dashboard/streams/private/team/linaro/virtualization/bundles/067c7274dfb08eeed970cbc51b9acdd256a83aa9/ -- Julien Grall