From mboxrd@z Thu Jan 1 00:00:00 1970 Return-Path: X-Spam-Checker-Version: SpamAssassin 3.4.0 (2014-02-07) on aws-us-west-2-korg-lkml-1.web.codeaurora.org X-Spam-Level: X-Spam-Status: No, score=-0.8 required=3.0 tests=DKIMWL_WL_HIGH,DKIM_SIGNED, DKIM_VALID,HEADER_FROM_DIFFERENT_DOMAINS,MAILING_LIST_MULTI,SPF_HELO_NONE, SPF_PASS autolearn=no autolearn_force=no version=3.4.0 Received: from mail.kernel.org (mail.kernel.org [198.145.29.99]) by smtp.lore.kernel.org (Postfix) with ESMTP id 004C4C433FF for ; Wed, 14 Aug 2019 09:01:42 +0000 (UTC) Received: from bombadil.infradead.org (bombadil.infradead.org [198.137.202.133]) (using TLSv1.2 with cipher ECDHE-RSA-AES256-GCM-SHA384 (256/256 bits)) (No client certificate requested) by mail.kernel.org (Postfix) with ESMTPS id C6076205F4 for ; Wed, 14 Aug 2019 09:01:41 +0000 (UTC) Authentication-Results: mail.kernel.org; dkim=pass (2048-bit key) header.d=lists.infradead.org header.i=@lists.infradead.org header.b="meQFgd9O" DMARC-Filter: OpenDMARC Filter v1.3.2 mail.kernel.org C6076205F4 Authentication-Results: mail.kernel.org; dmarc=none (p=none dis=none) header.from=sntech.de Authentication-Results: mail.kernel.org; spf=none smtp.mailfrom=linux-arm-kernel-bounces+infradead-linux-arm-kernel=archiver.kernel.org@lists.infradead.org DKIM-Signature: v=1; a=rsa-sha256; q=dns/txt; c=relaxed/relaxed; d=lists.infradead.org; s=bombadil.20170209; h=Sender: Content-Transfer-Encoding:Content-Type:Cc:List-Subscribe:List-Help:List-Post: List-Archive:List-Unsubscribe:List-Id:MIME-Version:References:In-Reply-To: Message-ID:Date:Subject:To:From:Reply-To:Content-ID:Content-Description: Resent-Date:Resent-From:Resent-Sender:Resent-To:Resent-Cc:Resent-Message-ID: List-Owner; bh=5fzcXR4Z3XuVhFRPldtj6gM/Jcly9QZijPnarnvQkmE=; b=meQFgd9ORMVMvT roJ6Jf993lIJe24XmSwR4RMqr0ZBqBuDc6bw6BTlDg7PTO8c/kDHeJor229379ynlt/VcNDzInA4b jBIFaaONy3aUeNYqozWlfVXSpkKvgHWjJMBCU1yQOUJH4bVnYMn65JwqvWLsUX1j4qU5uMy6xnOkz NfnWUxlww6AF5TkEtyggoPwA4T4432mELuQb3iCdsa8utjULAvsMhJr+gr/GGr3mFz+CWVnlFhjVi TlwIuB9NSwQmspFz95wlZ2GBSHiMt/peFAr4NFgRSsBEQB3dUPZjMsW/LatP6LjcZ836fTeYFMYW9 BRPCWJn5aTi3gtsF3l2w==; Received: from localhost ([127.0.0.1] helo=bombadil.infradead.org) by bombadil.infradead.org with esmtp (Exim 4.92 #3 (Red Hat Linux)) id 1hxpA5-000062-4a; Wed, 14 Aug 2019 09:01:41 +0000 Received: from gloria.sntech.de ([185.11.138.130]) by bombadil.infradead.org with esmtps (Exim 4.92 #3 (Red Hat Linux)) id 1hxpA0-00005K-JT; Wed, 14 Aug 2019 09:01:38 +0000 Received: from wf0413.dip.tu-dresden.de ([141.76.181.157] helo=phil.localnet) by gloria.sntech.de with esmtpsa (TLS1.2:ECDHE_RSA_AES_256_GCM_SHA384:256) (Exim 4.89) (envelope-from ) id 1hxp9u-0004Ye-Dp; Wed, 14 Aug 2019 11:01:30 +0200 From: Heiko Stuebner To: Kevin Hilman Subject: Re: CPUfreq fail on rk3399-firefly (was: next/master boot: 285 boots: 16 failed, 264 passed with 3 offline, 1 untried/unknown, 1 conflict (next-20190718)) Date: Wed, 14 Aug 2019 11:01:29 +0200 Message-ID: <2314814.WbdfqDVNqK@phil> In-Reply-To: <7hmugdynmk.fsf@baylibre.com> References: <5d3057c8.1c69fb81.c6489.8ad2@mx.google.com> <20190718162005.GF5761@sirena.org.uk> <7hmugdynmk.fsf@baylibre.com> MIME-Version: 1.0 X-CRM114-Version: 20100106-BlameMichelson ( TRE 0.8.0 (BSD) ) MR-646709E3 X-CRM114-CacheID: sfid-20190814_020136_793237_620475A9 X-CRM114-Status: GOOD ( 24.26 ) X-BeenThere: linux-arm-kernel@lists.infradead.org X-Mailman-Version: 2.1.29 Precedence: list List-Id: List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Cc: linux-rockchip@lists.infradead.org, Mark Brown , linux-next@vger.kernel.org, linux-arm-kernel@lists.infradead.org, kernel-build-reports@lists.linaro.org Content-Type: text/plain; charset="us-ascii" Content-Transfer-Encoding: 7bit Sender: "linux-arm-kernel" Errors-To: linux-arm-kernel-bounces+infradead-linux-arm-kernel=archiver.kernel.org@lists.infradead.org Hi, Am Dienstag, 13. August 2019, 19:35:31 CEST schrieb Kevin Hilman: > [ resent with correct addr for linux-rockchip list ] > > Mark Brown writes: > > > On Thu, Jul 18, 2019 at 04:28:08AM -0700, kernelci.org bot wrote: > > > > Today's -next started failing to boot defconfig on rk3399-firefly: > > > >> arm64: > > > >> defconfig: > >> gcc-8: > >> rk3399-firefly: 1 failed lab > > > > It hits a BUG() trying to set up cpufreq: > > > > [ 87.381606] cpufreq: cpufreq_online: CPU0: Running at unlisted freq: 200000 KHz > > [ 87.393244] cpufreq: cpufreq_online: CPU0: Unlisted initial frequency changed to: 408000 KHz > > [ 87.469777] cpufreq: cpufreq_online: CPU4: Running at unlisted freq: 12000 KHz > > [ 87.488595] cpu cpu4: _generic_set_opp_clk_only: failed to set clock rate: -22 > > [ 87.491881] cpufreq: __target_index: Failed to change cpu frequency: -22 > > [ 87.495335] ------------[ cut here ]------------ > > [ 87.496821] kernel BUG at drivers/cpufreq/cpufreq.c:1438! > > [ 87.498462] Internal error: Oops - BUG: 0 [#1] PREEMPT SMP > > > > I'm struggling to see anything relevant in the diff from yesterday, the > > unlisted frequency warnings were there in the logs yesterday but no oops > > and I'm not seeing any changes in cpufreq, clk or anything relevant > > looking. > > > > Full bootlog and other info can be found here: > > > > https://kernelci.org/boot/id/5d302d8359b51498d049e983/ > > I confirm that disabling CPUfreq in the defconfig (CONFIG_CPU_FREQ=n) > makes the firefly board start working again. > > Note that the default defconfig enables the "performance" CPUfreq > governor as the default governor, so during kernel boot, it will always > switch to the max frequency. > > For fun, I set the default governor to "userspace" so the kernel > wouldn't make any OPP changes, and that leads to a slightly more > informative splat[1] > > There is still an OPP change happening because the detected OPP is not > one that's listed in the table, so it tries to change to a listed OPP > and fails in the bowels of clk_set_rate() Though I think that might only be a symptom as well. Both the PLL setting code as well as the actual cpu-clock implementation is unchanged since 2017 (and runs just fine on all boards in my farm). One source for these issues is often the regulator supplying the cpu going haywire - aka the voltage not matching the opp. As in this error-case it's CPU4 being set, this would mean it might be the big cluster supplied by the external syr825 (fan5355 clone) that might act up. In the Firefly-rk3399 case this is even stranger. There is a discrepancy between the "fcs,suspend-voltage-selector" between different bootloader versions (how the selection-pin is set up), so the kernel might actually write his requested voltage to the wrong register (not the one for actual voltage, but the second set used for the suspend voltage). Did you by chance swap bootloaders at some point in recent past? I'd assume [2] might actually be the same issue last year, though the CI-logs are not available anymore it seems. Could you try to set the vdd_cpu_b regulator to disabled, so that cpufreq for this cluster defers and see what happens? I don't really have a Firefly in my boardfarm, so I let 5.3-rc run on a Theobroma Puma which has the same regulator setup as the Firefly and all including the performance governor did run nicely, so it really looks like some sort of Firefly specific issue. Heiko > [1] https://termbin.com/3oum [2] https://lkml.org/lkml/2018/6/19/1167 _______________________________________________ linux-arm-kernel mailing list linux-arm-kernel@lists.infradead.org http://lists.infradead.org/mailman/listinfo/linux-arm-kernel