* next/master boot: 179 boots: 11 failed, 167 passed with 1 offline (next-20180731) [not found] <5b607cc4.1c69fb81.6c1d6.6534@mx.google.com> @ 2018-07-31 16:06 ` Mark Brown 2018-08-01 8:19 ` Ludovic BARRE 2018-08-01 10:05 ` Ulf Hansson 2018-07-31 16:11 ` Mark Brown 1 sibling, 2 replies; 8+ messages in thread From: Mark Brown @ 2018-07-31 16:06 UTC (permalink / raw) To: linux-arm-kernel On Tue, Jul 31, 2018 at 08:14:12AM -0700, kernelci.org bot wrote: Today's -next fails to boot on a variety of Qualcomm 32 bit platforms: > multi_v7_defconfig: > qcom-apq8064-cm-qs600: > lab-baylibre-seattle: new failure (last pass: next-20180730) > qcom-apq8064-ifc6410: > lab-baylibre-seattle: new failure (last pass: next-20180730) > > qcom_defconfig: > qcom-apq8064-cm-qs600: > lab-baylibre-seattle: new failure (last pass: next-20180730) The logs are all somewhat similar, for example: https://storage.kernelci.org/next/master/next-20180731/arm/multi_v7_defconfig/lab-baylibre-seattle/boot-qcom-apq8064-cm-qs600.html detects a DMA problem during MMCI initialization: [ 2.237566] mmci-pl18x 121c0000.sdcc: mmc2: PL180 manf 51 rev0 at 0x121c0000 irq 32,0 (pio) [ 2.244790] mmci-pl18x 121c0000.sdcc: DMA channels RX dma2chan1, TX dma2chan2 [ 2.271722] mmci-pl18x 12400000.sdcc: error during DMA transfer! [ 2.271757] mmci-pl18x 12400000.sdcc: buggy DMA detected. Taking evasive action. [ 2.276798] ------------[ cut here ]------------ [ 2.284185] WARNING: CPU: 0 PID: 0 at ../include/linux/dma-mapping.h:551 bam_free_chan+0x2d8/0x2e0 [ 2.288772] Modules linked in: [ 2.297534] CPU: 0 PID: 0 Comm: swapper/0 Not tainted 4.18.0-rc7-next-20180731 #1 then panics: [ 2.513796] ------------[ cut here ]------------ [ 2.518367] kernel BUG at ../mm/vmalloc.c:1608! [ 2.522968] Internal error: Oops - BUG: 0 [#1] SMP ARM trying to release the DMA channel. I've not done any bisection or anything but I do note 8bb2299d2d0b5cc (mmc: mmci: Add and implement a ->dma_setup() callback for qcom dml) and some related commits in the MMC tree. More details for each of the failed boots at: https://kernelci.org/boot/id/5b6054f559b5144b9396baa9/ https://kernelci.org/boot/id/5b60551259b5144abb96bab6/ https://kernelci.org/boot/id/5b6054e259b5144b1e96bab2/ including full logs, details of the build and so on. -------------- next part -------------- A non-text attachment was scrubbed... Name: signature.asc Type: application/pgp-signature Size: 488 bytes Desc: not available URL: <http://lists.infradead.org/pipermail/linux-arm-kernel/attachments/20180731/f514af29/attachment.sig> ^ permalink raw reply [flat|nested] 8+ messages in thread
* next/master boot: 179 boots: 11 failed, 167 passed with 1 offline (next-20180731) 2018-07-31 16:06 ` next/master boot: 179 boots: 11 failed, 167 passed with 1 offline (next-20180731) Mark Brown @ 2018-08-01 8:19 ` Ludovic BARRE 2018-08-01 9:58 ` Ulf Hansson 2018-08-01 10:05 ` Ulf Hansson 1 sibling, 1 reply; 8+ messages in thread From: Ludovic BARRE @ 2018-08-01 8:19 UTC (permalink / raw) To: linux-arm-kernel hi Mark, Ulf When I see log, I think the patch in attachment could fix this issue , but like I've not qcom board I can't test if it's fixed :-(. Ulf: for patch delivery, you prefer a patch delivery on mailing list ? BR Ludo On 07/31/2018 06:06 PM, Mark Brown wrote: > On Tue, Jul 31, 2018 at 08:14:12AM -0700, kernelci.org bot wrote: > > Today's -next fails to boot on a variety of Qualcomm 32 bit platforms: > >> multi_v7_defconfig: >> qcom-apq8064-cm-qs600: >> lab-baylibre-seattle: new failure (last pass: next-20180730) >> qcom-apq8064-ifc6410: >> lab-baylibre-seattle: new failure (last pass: next-20180730) >> >> qcom_defconfig: >> qcom-apq8064-cm-qs600: >> lab-baylibre-seattle: new failure (last pass: next-20180730) > > The logs are all somewhat similar, for example: > > https://storage.kernelci.org/next/master/next-20180731/arm/multi_v7_defconfig/lab-baylibre-seattle/boot-qcom-apq8064-cm-qs600.html > > detects a DMA problem during MMCI initialization: > > [ 2.237566] mmci-pl18x 121c0000.sdcc: mmc2: PL180 manf 51 rev0 at 0x121c0000 irq 32,0 (pio) > [ 2.244790] mmci-pl18x 121c0000.sdcc: DMA channels RX dma2chan1, TX dma2chan2 > [ 2.271722] mmci-pl18x 12400000.sdcc: error during DMA transfer! > [ 2.271757] mmci-pl18x 12400000.sdcc: buggy DMA detected. Taking evasive action. > [ 2.276798] ------------[ cut here ]------------ > [ 2.284185] WARNING: CPU: 0 PID: 0 at ../include/linux/dma-mapping.h:551 bam_free_chan+0x2d8/0x2e0 > [ 2.288772] Modules linked in: > [ 2.297534] CPU: 0 PID: 0 Comm: swapper/0 Not tainted 4.18.0-rc7-next-20180731 #1 > > then panics: > > [ 2.513796] ------------[ cut here ]------------ > [ 2.518367] kernel BUG at ../mm/vmalloc.c:1608! > [ 2.522968] Internal error: Oops - BUG: 0 [#1] SMP ARM > > trying to release the DMA channel. I've not done any bisection or > anything but I do note 8bb2299d2d0b5cc (mmc: mmci: Add and implement a > ->dma_setup() callback for qcom dml) and some related commits in the MMC > tree. > > More details for each of the failed boots at: > > https://kernelci.org/boot/id/5b6054f559b5144b9396baa9/ > https://kernelci.org/boot/id/5b60551259b5144abb96bab6/ > https://kernelci.org/boot/id/5b6054e259b5144b1e96bab2/ > > including full logs, details of the build and so on. > -------------- next part -------------- A non-text attachment was scrubbed... Name: 0001-mmc-mmci-fix-qcom-dma-issue-during-mmci-init-with-ne.patch Type: text/x-patch Size: 1267 bytes Desc: not available URL: <http://lists.infradead.org/pipermail/linux-arm-kernel/attachments/20180801/c20093b4/attachment.bin> ^ permalink raw reply [flat|nested] 8+ messages in thread
* next/master boot: 179 boots: 11 failed, 167 passed with 1 offline (next-20180731) 2018-08-01 8:19 ` Ludovic BARRE @ 2018-08-01 9:58 ` Ulf Hansson 0 siblings, 0 replies; 8+ messages in thread From: Ulf Hansson @ 2018-08-01 9:58 UTC (permalink / raw) To: linux-arm-kernel On 1 August 2018 at 10:19, Ludovic BARRE <ludovic.barre@st.com> wrote: > hi Mark, Ulf > > When I see log, I think the patch in attachment could fix this issue > , but like I've not qcom board I can't test if it's fixed :-(. > > Ulf: for patch delivery, you prefer a patch delivery on mailing list ? Thanks for looking into this. However, no need to post a fix this time (your patch fixed the issue, but should declare the qcom_variant_init() in mmci.h. I have already amended the patch, so no further actions is needed. [...] Kind regards Uffe ^ permalink raw reply [flat|nested] 8+ messages in thread
* next/master boot: 179 boots: 11 failed, 167 passed with 1 offline (next-20180731) 2018-07-31 16:06 ` next/master boot: 179 boots: 11 failed, 167 passed with 1 offline (next-20180731) Mark Brown 2018-08-01 8:19 ` Ludovic BARRE @ 2018-08-01 10:05 ` Ulf Hansson 1 sibling, 0 replies; 8+ messages in thread From: Ulf Hansson @ 2018-08-01 10:05 UTC (permalink / raw) To: linux-arm-kernel On 31 July 2018 at 18:06, Mark Brown <broonie@kernel.org> wrote: > On Tue, Jul 31, 2018 at 08:14:12AM -0700, kernelci.org bot wrote: > > Today's -next fails to boot on a variety of Qualcomm 32 bit platforms: > >> multi_v7_defconfig: >> qcom-apq8064-cm-qs600: >> lab-baylibre-seattle: new failure (last pass: next-20180730) >> qcom-apq8064-ifc6410: >> lab-baylibre-seattle: new failure (last pass: next-20180730) >> >> qcom_defconfig: >> qcom-apq8064-cm-qs600: >> lab-baylibre-seattle: new failure (last pass: next-20180730) > > The logs are all somewhat similar, for example: > > https://storage.kernelci.org/next/master/next-20180731/arm/multi_v7_defconfig/lab-baylibre-seattle/boot-qcom-apq8064-cm-qs600.html > > detects a DMA problem during MMCI initialization: > > [ 2.237566] mmci-pl18x 121c0000.sdcc: mmc2: PL180 manf 51 rev0 at 0x121c0000 irq 32,0 (pio) > [ 2.244790] mmci-pl18x 121c0000.sdcc: DMA channels RX dma2chan1, TX dma2chan2 > [ 2.271722] mmci-pl18x 12400000.sdcc: error during DMA transfer! > [ 2.271757] mmci-pl18x 12400000.sdcc: buggy DMA detected. Taking evasive action. > [ 2.276798] ------------[ cut here ]------------ > [ 2.284185] WARNING: CPU: 0 PID: 0 at ../include/linux/dma-mapping.h:551 bam_free_chan+0x2d8/0x2e0 > [ 2.288772] Modules linked in: > [ 2.297534] CPU: 0 PID: 0 Comm: swapper/0 Not tainted 4.18.0-rc7-next-20180731 #1 > > then panics: > > [ 2.513796] ------------[ cut here ]------------ > [ 2.518367] kernel BUG at ../mm/vmalloc.c:1608! > [ 2.522968] Internal error: Oops - BUG: 0 [#1] SMP ARM > > trying to release the DMA channel. I've not done any bisection or > anything but I do note 8bb2299d2d0b5cc (mmc: mmci: Add and implement a > ->dma_setup() callback for qcom dml) and some related commits in the MMC > tree. > > More details for each of the failed boots at: > > https://kernelci.org/boot/id/5b6054f559b5144b9396baa9/ > https://kernelci.org/boot/id/5b60551259b5144abb96bab6/ > https://kernelci.org/boot/id/5b6054e259b5144b1e96bab2/ > > including full logs, details of the build and so on. Mark, thanks for reporting. Problem was a simple one liner that should have been added to included in my patch "mmc: mmci: Add and implement a ->dma_setup() callback for qcom dml". The missing oneliner caused mmci to wrongly use dma for the qcom variant. I have amended the patch and published it, it should reach the next tree as of tomorrow. Apologize for the mess it created. Kind regards Uffe ^ permalink raw reply [flat|nested] 8+ messages in thread
* next/master boot: 179 boots: 11 failed, 167 passed with 1 offline (next-20180731) [not found] <5b607cc4.1c69fb81.6c1d6.6534@mx.google.com> 2018-07-31 16:06 ` next/master boot: 179 boots: 11 failed, 167 passed with 1 offline (next-20180731) Mark Brown @ 2018-07-31 16:11 ` Mark Brown 2018-07-31 19:50 ` Niklas Cassel 1 sibling, 1 reply; 8+ messages in thread From: Mark Brown @ 2018-07-31 16:11 UTC (permalink / raw) To: linux-arm-kernel On Tue, Jul 31, 2018 at 08:14:12AM -0700, kernelci.org bot wrote: Today's -next fails to boot on db820c: > arm64: > defconfig: > apq8096-db820c: > lab-bjorn: new failure (last pass: next-20180730) There's nothing immediately obvious as the boot failure cause in the logs, the last output is a failure to load the ath10k_pci firmware: 04:02:53.750283 [ 4.503980] ath10k_pci 0000:01:00.0: Failed to find firmware-N.bin (N between 2 and 6) from ath10k/QCA6174/hw3.0: -2 04:02:53.756384 [ 4.504010] ath10k_pci 0000:01:00.0: could not fetch firmware files (-2) 04:02:53.760522 [ 4.513736] ath10k_pci 0000:01:00.0: could not probe fw (-2) but I'm not sure that's the actual cause. More details, including the full boot log, here: https://kernelci.org/boot/id/5b6042b559b514136096babf/ -------------- next part -------------- A non-text attachment was scrubbed... Name: signature.asc Type: application/pgp-signature Size: 488 bytes Desc: not available URL: <http://lists.infradead.org/pipermail/linux-arm-kernel/attachments/20180731/ef582ea2/attachment.sig> ^ permalink raw reply [flat|nested] 8+ messages in thread
* next/master boot: 179 boots: 11 failed, 167 passed with 1 offline (next-20180731) 2018-07-31 16:11 ` Mark Brown @ 2018-07-31 19:50 ` Niklas Cassel 2018-08-01 9:31 ` Mark Brown 0 siblings, 1 reply; 8+ messages in thread From: Niklas Cassel @ 2018-07-31 19:50 UTC (permalink / raw) To: linux-arm-kernel On Tue, Jul 31, 2018 at 05:11:14PM +0100, Mark Brown wrote: > On Tue, Jul 31, 2018 at 08:14:12AM -0700, kernelci.org bot wrote: > > Today's -next fails to boot on db820c: > > > arm64: > > > defconfig: > > apq8096-db820c: > > lab-bjorn: new failure (last pass: next-20180730) > > There's nothing immediately obvious as the boot failure cause in the > logs, the last output is a failure to load the ath10k_pci firmware: > > 04:02:53.750283 [ 4.503980] ath10k_pci 0000:01:00.0: Failed to find firmware-N.bin (N between 2 and 6) from ath10k/QCA6174/hw3.0: -2 > 04:02:53.756384 [ 4.504010] ath10k_pci 0000:01:00.0: could not fetch firmware files (-2) > 04:02:53.760522 [ 4.513736] ath10k_pci 0000:01:00.0: could not probe fw (-2) > > but I'm not sure that's the actual cause. More details, including the > full boot log, here: > > https://kernelci.org/boot/id/5b6042b559b514136096babf/ I tried booting today's -next on db820c, using arm64 defconfig, and it booted correctly: I also tried removing the ath10k firmware from my initrd, but it still booted correctly. # cat /proc/version Linux version 4.18.0-rc7-next-20180731-00001-g47055e3ba913 (nks at centauri) (gcc version 7.2.1 20171011 (Linaro GCC 7.2-2017.11)) #9 SMP PREEMPT Tue Jul 31 21:34:43 CEST 2018 I guess it could be a bug that does not trigger on every boot, or it could be a problem in the kernelci infrastructure. Kind regards, Niklas ^ permalink raw reply [flat|nested] 8+ messages in thread
* next/master boot: 179 boots: 11 failed, 167 passed with 1 offline (next-20180731) 2018-07-31 19:50 ` Niklas Cassel @ 2018-08-01 9:31 ` Mark Brown 2018-08-01 20:50 ` Bjorn Andersson 0 siblings, 1 reply; 8+ messages in thread From: Mark Brown @ 2018-08-01 9:31 UTC (permalink / raw) To: linux-arm-kernel On Tue, Jul 31, 2018 at 09:50:37PM +0200, Niklas Cassel wrote: > I guess it could be a bug that does not trigger on every boot, > or it could be a problem in the kernelci infrastructure. Infrastructure bugs *tend* to manifest differently to this FWIW, though it can never be excluded. -------------- next part -------------- A non-text attachment was scrubbed... Name: signature.asc Type: application/pgp-signature Size: 488 bytes Desc: not available URL: <http://lists.infradead.org/pipermail/linux-arm-kernel/attachments/20180801/18e9f5f6/attachment.sig> ^ permalink raw reply [flat|nested] 8+ messages in thread
* next/master boot: 179 boots: 11 failed, 167 passed with 1 offline (next-20180731) 2018-08-01 9:31 ` Mark Brown @ 2018-08-01 20:50 ` Bjorn Andersson 0 siblings, 0 replies; 8+ messages in thread From: Bjorn Andersson @ 2018-08-01 20:50 UTC (permalink / raw) To: linux-arm-kernel On Wed 01 Aug 02:31 PDT 2018, Mark Brown wrote: > On Tue, Jul 31, 2018 at 09:50:37PM +0200, Niklas Cassel wrote: > > > I guess it could be a bug that does not trigger on every boot, > > or it could be a problem in the kernelci infrastructure. > > Infrastructure bugs *tend* to manifest differently to this FWIW, though > it can never be excluded. No, that's not an infrastructure issue. The board did warn about not finding the ath10k firmware, which is always does, so that's not the issue - in itself. Then nothing happened for 266 seconds, so my lab decided to terminate the agony. So this is either an issue with the stability of next-20180731 or with the specific board. PS. Today's next did boot successfully on the board. Regards, Bjorn ^ permalink raw reply [flat|nested] 8+ messages in thread
end of thread, other threads:[~2018-08-01 20:50 UTC | newest]
Thread overview: 8+ messages (download: mbox.gz follow: Atom feed
-- links below jump to the message on this page --
[not found] <5b607cc4.1c69fb81.6c1d6.6534@mx.google.com>
2018-07-31 16:06 ` next/master boot: 179 boots: 11 failed, 167 passed with 1 offline (next-20180731) Mark Brown
2018-08-01 8:19 ` Ludovic BARRE
2018-08-01 9:58 ` Ulf Hansson
2018-08-01 10:05 ` Ulf Hansson
2018-07-31 16:11 ` Mark Brown
2018-07-31 19:50 ` Niklas Cassel
2018-08-01 9:31 ` Mark Brown
2018-08-01 20:50 ` Bjorn Andersson
This is a public inbox, see mirroring instructions for how to clone and mirror all data and code used for this inbox; as well as URLs for NNTP newsgroup(s).