* [PATCH] mmc: dw_mmc: fix bug that cause 'Timeout sending command' @ 2015-02-05 11:13 Addy Ke 2015-02-09 4:48 ` Ulf Hansson [not found] ` <1423134801-23219-1-git-send-email-addy.ke-TNX95d0MmH7DzftRWevZcw@public.gmane.org> 0 siblings, 2 replies; 21+ messages in thread From: Addy Ke @ 2015-02-05 11:13 UTC (permalink / raw) To: robh+dt, pawel.moll, mark.rutland, ijc+devicetree, galak, rdunlap, tgih.jun, jh80.chung, chris, ulf.hansson, dinguyen, heiko, olof, dianders, sonnyrao, amstan Cc: devicetree, linux-doc, linux-kernel, linux-mmc, linux-arm-kernel, linux-rockchip, zhenfu.fang, cf, lintao, chenfen, zyf, xjq, huangtao, zyw, yzq, hj, kever.yang, zhangqing, hl, Addy Ke Because of some uncertain factors, such as worse card or worse hardware, DAT[3:0](the data lines) may be pulled down by card, and mmc controller will be in busy state. This should not happend when mmc controller send command to update card clocks. If this happends, mci_send_cmd will be failed and we will get 'Timeout sending command', and then system will be blocked. To avoid this, we need reset mmc controller. Signed-off-by: Addy Ke <addy.ke@rock-chips.com> --- drivers/mmc/host/dw_mmc.c | 23 +++++++++++++++++++++++ 1 file changed, 23 insertions(+) diff --git a/drivers/mmc/host/dw_mmc.c b/drivers/mmc/host/dw_mmc.c index 4d2e3c2..b1d6dfb 100644 --- a/drivers/mmc/host/dw_mmc.c +++ b/drivers/mmc/host/dw_mmc.c @@ -100,6 +100,7 @@ struct idmac_desc { }; #endif /* CONFIG_MMC_DW_IDMAC */ +static int dw_mci_card_busy(struct mmc_host *mmc); static bool dw_mci_reset(struct dw_mci *host); static bool dw_mci_ctrl_reset(struct dw_mci *host, u32 reset); @@ -888,6 +889,26 @@ static void mci_send_cmd(struct dw_mci_slot *slot, u32 cmd, u32 arg) cmd, arg, cmd_status); } +static void dw_mci_wait_busy(struct dw_mci_slot *slot) +{ + struct dw_mci *host = slot->host; + unsigned long timeout = jiffies + msecs_to_jiffies(500); + + while (time_before(jiffies, timeout)) { + if (!dw_mci_card_busy(slot->mmc)) + return; + } + dev_err(host->dev, "Data busy (status %#x)\n", + mci_readl(slot->host, STATUS)); + + /* + * Data busy, this should not happend when mmc controller send command + * to update card clocks in non-volt-switch state. If it happends, we + * should reset controller to avoid getting "Timeout sending command". + */ + dw_mci_ctrl_reset(host, SDMMC_CTRL_ALL_RESET_FLAGS); +} + static void dw_mci_setup_bus(struct dw_mci_slot *slot, bool force_clkinit) { struct dw_mci *host = slot->host; @@ -899,6 +920,8 @@ static void dw_mci_setup_bus(struct dw_mci_slot *slot, bool force_clkinit) /* We must continue to set bit 28 in CMD until the change is complete */ if (host->state == STATE_WAITING_CMD11_DONE) sdmmc_cmd_bits |= SDMMC_CMD_VOLT_SWITCH; + else + dw_mci_wait_busy(slot); if (!clock) { mci_writel(host, CLKENA, 0); -- 1.8.3.2 ^ permalink raw reply related [flat|nested] 21+ messages in thread
* Re: [PATCH] mmc: dw_mmc: fix bug that cause 'Timeout sending command' 2015-02-05 11:13 [PATCH] mmc: dw_mmc: fix bug that cause 'Timeout sending command' Addy Ke @ 2015-02-09 4:48 ` Ulf Hansson [not found] ` <1423134801-23219-1-git-send-email-addy.ke-TNX95d0MmH7DzftRWevZcw@public.gmane.org> 1 sibling, 0 replies; 21+ messages in thread From: Ulf Hansson @ 2015-02-09 4:48 UTC (permalink / raw) To: Addy Ke Cc: Rob Herring, Paweł Moll, Mark Rutland, Ian Campbell, Kumar Gala, Randy Dunlap, tgih.jun@samsung.com, Jaehoon Chung, Chris Ball, Dinh Nguyen, Heiko Stübner, Olof Johansson, Doug Anderson, Sonny Rao, Alexandru Stan, devicetree@vger.kernel.org, linux-doc@vger.kernel.org, linux-kernel@vger.kernel.org, linux-mmc, linux-arm-kernel@lists.infradead.org, open list:ARM/Rockchip SoC..., zhenfu.fang, Eddie Cai, lintao, chenfen, zyf, Jianqun Xu, Tao Huang, Chris Zhong, 姚智情, Han Jiang, Kever Yang, zhangqing, Lin Huang [-- Attachment #1: Type: text/plain, Size: 2982 bytes --] On 5 February 2015 at 12:13, Addy Ke <addy.ke@rock-chips.com> wrote: > Because of some uncertain factors, such as worse card or worse hardware, > DAT[3:0](the data lines) may be pulled down by card, and mmc controller > will be in busy state. This should not happend when mmc controller > send command to update card clocks. If this happends, mci_send_cmd will > be failed and we will get 'Timeout sending command', and then system will > be blocked. To avoid this, we need reset mmc controller. > > Signed-off-by: Addy Ke <addy.ke@rock-chips.com> > Hi Addy, Should I consider $subject patch as a better option to the one below? [PATCH] mmc: dw_mmc: rockchip: Add DW_MCI_QUIRK_RETRY_DELAY https://lkml.org/lkml/2015/1/13/562 Kind regards Uffe > --- > drivers/mmc/host/dw_mmc.c | 23 +++++++++++++++++++++++ > 1 file changed, 23 insertions(+) > > diff --git a/drivers/mmc/host/dw_mmc.c b/drivers/mmc/host/dw_mmc.c > index 4d2e3c2..b1d6dfb 100644 > --- a/drivers/mmc/host/dw_mmc.c > +++ b/drivers/mmc/host/dw_mmc.c > @@ -100,6 +100,7 @@ struct idmac_desc { > }; > #endif /* CONFIG_MMC_DW_IDMAC */ > > +static int dw_mci_card_busy(struct mmc_host *mmc); > static bool dw_mci_reset(struct dw_mci *host); > static bool dw_mci_ctrl_reset(struct dw_mci *host, u32 reset); > > @@ -888,6 +889,26 @@ static void mci_send_cmd(struct dw_mci_slot *slot, > u32 cmd, u32 arg) > cmd, arg, cmd_status); > } > > +static void dw_mci_wait_busy(struct dw_mci_slot *slot) > +{ > + struct dw_mci *host = slot->host; > + unsigned long timeout = jiffies + msecs_to_jiffies(500); > + > + while (time_before(jiffies, timeout)) { > + if (!dw_mci_card_busy(slot->mmc)) > + return; > + } > + dev_err(host->dev, "Data busy (status %#x)\n", > + mci_readl(slot->host, STATUS)); > + > + /* > + * Data busy, this should not happend when mmc controller send > command > + * to update card clocks in non-volt-switch state. If it happends, > we > + * should reset controller to avoid getting "Timeout sending > command". > + */ > + dw_mci_ctrl_reset(host, SDMMC_CTRL_ALL_RESET_FLAGS); > +} > + > static void dw_mci_setup_bus(struct dw_mci_slot *slot, bool force_clkinit) > { > struct dw_mci *host = slot->host; > @@ -899,6 +920,8 @@ static void dw_mci_setup_bus(struct dw_mci_slot *slot, > bool force_clkinit) > /* We must continue to set bit 28 in CMD until the change is > complete */ > if (host->state == STATE_WAITING_CMD11_DONE) > sdmmc_cmd_bits |= SDMMC_CMD_VOLT_SWITCH; > + else > + dw_mci_wait_busy(slot); > > if (!clock) { > mci_writel(host, CLKENA, 0); > -- > 1.8.3.2 > > > -- > To unsubscribe from this list: send the line "unsubscribe linux-mmc" in > the body of a message to majordomo@vger.kernel.org > More majordomo info at http://vger.kernel.org/majordomo-info.html > [-- Attachment #2: Type: text/html, Size: 4232 bytes --] ^ permalink raw reply [flat|nested] 21+ messages in thread
[parent not found: <1423134801-23219-1-git-send-email-addy.ke-TNX95d0MmH7DzftRWevZcw@public.gmane.org>]
* Re: [PATCH] mmc: dw_mmc: fix bug that cause 'Timeout sending command' [not found] ` <1423134801-23219-1-git-send-email-addy.ke-TNX95d0MmH7DzftRWevZcw@public.gmane.org> @ 2015-02-09 4:51 ` Ulf Hansson 2015-02-09 6:56 ` Addy 2015-02-09 7:25 ` [PATCH v2 0/2] about data busy Addy Ke 1 sibling, 1 reply; 21+ messages in thread From: Ulf Hansson @ 2015-02-09 4:51 UTC (permalink / raw) To: Addy Ke Cc: Rob Herring, Paweł Moll, Mark Rutland, Ian Campbell, Kumar Gala, Randy Dunlap, tgih.jun-Sze3O3UU22JBDgjK7y7TUQ@public.gmane.org, Jaehoon Chung, Chris Ball, Dinh Nguyen, Heiko Stübner, Olof Johansson, Doug Anderson, Sonny Rao, Alexandru Stan, devicetree-u79uwXL29TY76Z2rM5mHXA@public.gmane.org, linux-doc-u79uwXL29TY76Z2rM5mHXA@public.gmane.org, linux-kernel-u79uwXL29TY76Z2rM5mHXA@public.gmane.org, linux-mmc, linux-arm-kernel-IAPFreCvJWM7uuMidbF8XUB+6BGkLq7r@public.gmane.org On 5 February 2015 at 12:13, Addy Ke <addy.ke-TNX95d0MmH7DzftRWevZcw@public.gmane.org> wrote: > > Because of some uncertain factors, such as worse card or worse hardware, > DAT[3:0](the data lines) may be pulled down by card, and mmc controller > will be in busy state. This should not happend when mmc controller > send command to update card clocks. If this happends, mci_send_cmd will > be failed and we will get 'Timeout sending command', and then system will > be blocked. To avoid this, we need reset mmc controller. > > Signed-off-by: Addy Ke <addy.ke-TNX95d0MmH7DzftRWevZcw@public.gmane.org> Hi Addy, Should I consider $subject patch as a better option to the one below? [PATCH] mmc: dw_mmc: rockchip: Add DW_MCI_QUIRK_RETRY_DELAY https://lkml.org/lkml/2015/1/13/562 Kind regards Uffe > --- > drivers/mmc/host/dw_mmc.c | 23 +++++++++++++++++++++++ > 1 file changed, 23 insertions(+) > > diff --git a/drivers/mmc/host/dw_mmc.c b/drivers/mmc/host/dw_mmc.c > index 4d2e3c2..b1d6dfb 100644 > --- a/drivers/mmc/host/dw_mmc.c > +++ b/drivers/mmc/host/dw_mmc.c > @@ -100,6 +100,7 @@ struct idmac_desc { > }; > #endif /* CONFIG_MMC_DW_IDMAC */ > > +static int dw_mci_card_busy(struct mmc_host *mmc); > static bool dw_mci_reset(struct dw_mci *host); > static bool dw_mci_ctrl_reset(struct dw_mci *host, u32 reset); > > @@ -888,6 +889,26 @@ static void mci_send_cmd(struct dw_mci_slot *slot, u32 cmd, u32 arg) > cmd, arg, cmd_status); > } > > +static void dw_mci_wait_busy(struct dw_mci_slot *slot) > +{ > + struct dw_mci *host = slot->host; > + unsigned long timeout = jiffies + msecs_to_jiffies(500); > + > + while (time_before(jiffies, timeout)) { > + if (!dw_mci_card_busy(slot->mmc)) > + return; > + } > + dev_err(host->dev, "Data busy (status %#x)\n", > + mci_readl(slot->host, STATUS)); > + > + /* > + * Data busy, this should not happend when mmc controller send command > + * to update card clocks in non-volt-switch state. If it happends, we > + * should reset controller to avoid getting "Timeout sending command". > + */ > + dw_mci_ctrl_reset(host, SDMMC_CTRL_ALL_RESET_FLAGS); > +} > + > static void dw_mci_setup_bus(struct dw_mci_slot *slot, bool force_clkinit) > { > struct dw_mci *host = slot->host; > @@ -899,6 +920,8 @@ static void dw_mci_setup_bus(struct dw_mci_slot *slot, bool force_clkinit) > /* We must continue to set bit 28 in CMD until the change is complete */ > if (host->state == STATE_WAITING_CMD11_DONE) > sdmmc_cmd_bits |= SDMMC_CMD_VOLT_SWITCH; > + else > + dw_mci_wait_busy(slot); > > if (!clock) { > mci_writel(host, CLKENA, 0); > -- > 1.8.3.2 > > > -- > To unsubscribe from this list: send the line "unsubscribe linux-mmc" in > the body of a message to majordomo-u79uwXL29TY76Z2rM5mHXA@public.gmane.org > More majordomo info at http://vger.kernel.org/majordomo-info.html -- To unsubscribe from this list: send the line "unsubscribe devicetree" in the body of a message to majordomo-u79uwXL29TY76Z2rM5mHXA@public.gmane.org More majordomo info at http://vger.kernel.org/majordomo-info.html ^ permalink raw reply [flat|nested] 21+ messages in thread
* Re: [PATCH] mmc: dw_mmc: fix bug that cause 'Timeout sending command' 2015-02-09 4:51 ` Ulf Hansson @ 2015-02-09 6:56 ` Addy 2015-02-09 7:04 ` Jaehoon Chung 0 siblings, 1 reply; 21+ messages in thread From: Addy @ 2015-02-09 6:56 UTC (permalink / raw) To: Ulf Hansson Cc: Rob Herring, Paweł Moll, Mark Rutland, Ian Campbell, Kumar Gala, Randy Dunlap, tgih.jun@samsung.com, Jaehoon Chung, Chris Ball, Dinh Nguyen, Heiko Stübner, Olof Johansson, Doug Anderson, Sonny Rao, Alexandru Stan, devicetree@vger.kernel.org, linux-doc@vger.kernel.org, linux-kernel@vger.kernel.org, linux-mmc, linux-arm-kernel@lists.infradead.org On 2015.02.09 12:51, Ulf Hansson wrote: > On 5 February 2015 at 12:13, Addy Ke <addy.ke@rock-chips.com> wrote: >> Because of some uncertain factors, such as worse card or worse hardware, >> DAT[3:0](the data lines) may be pulled down by card, and mmc controller >> will be in busy state. This should not happend when mmc controller >> send command to update card clocks. If this happends, mci_send_cmd will >> be failed and we will get 'Timeout sending command', and then system will >> be blocked. To avoid this, we need reset mmc controller. >> >> Signed-off-by: Addy Ke <addy.ke@rock-chips.com> > > Hi Addy, > > Should I consider $subject patch as a better option to the one below? No: This patch fix the bug, which can be found by script: cd /sys/bus/platform/drivers/dwmmc_rockchip for i in $(seq 1 10000); do echo "========================" $i echo ff0c0000.dwmmc > unbind sleep .5 echo ff0c0000.dwmmc > bind sleep 2 done > [PATCH] mmc: dw_mmc: rockchip: Add DW_MCI_QUIRK_RETRY_DELAY This patch is for tuning issue: we should delay until card go to idle state, when the previous command return error. > https://lkml.org/lkml/2015/1/13/562 > > Kind regards > Uffe > > >> --- >> drivers/mmc/host/dw_mmc.c | 23 +++++++++++++++++++++++ >> 1 file changed, 23 insertions(+) >> >> diff --git a/drivers/mmc/host/dw_mmc.c b/drivers/mmc/host/dw_mmc.c >> index 4d2e3c2..b1d6dfb 100644 >> --- a/drivers/mmc/host/dw_mmc.c >> +++ b/drivers/mmc/host/dw_mmc.c >> @@ -100,6 +100,7 @@ struct idmac_desc { >> }; >> #endif /* CONFIG_MMC_DW_IDMAC */ >> >> +static int dw_mci_card_busy(struct mmc_host *mmc); >> static bool dw_mci_reset(struct dw_mci *host); >> static bool dw_mci_ctrl_reset(struct dw_mci *host, u32 reset); >> >> @@ -888,6 +889,26 @@ static void mci_send_cmd(struct dw_mci_slot *slot, u32 cmd, u32 arg) >> cmd, arg, cmd_status); >> } >> >> +static void dw_mci_wait_busy(struct dw_mci_slot *slot) >> +{ >> + struct dw_mci *host = slot->host; >> + unsigned long timeout = jiffies + msecs_to_jiffies(500); >> + >> + while (time_before(jiffies, timeout)) { >> + if (!dw_mci_card_busy(slot->mmc)) >> + return; >> + } >> + dev_err(host->dev, "Data busy (status %#x)\n", >> + mci_readl(slot->host, STATUS)); >> + >> + /* >> + * Data busy, this should not happend when mmc controller send command >> + * to update card clocks in non-volt-switch state. If it happends, we >> + * should reset controller to avoid getting "Timeout sending command". >> + */ >> + dw_mci_ctrl_reset(host, SDMMC_CTRL_ALL_RESET_FLAGS); >> +} >> + >> static void dw_mci_setup_bus(struct dw_mci_slot *slot, bool force_clkinit) >> { >> struct dw_mci *host = slot->host; >> @@ -899,6 +920,8 @@ static void dw_mci_setup_bus(struct dw_mci_slot *slot, bool force_clkinit) >> /* We must continue to set bit 28 in CMD until the change is complete */ >> if (host->state == STATE_WAITING_CMD11_DONE) >> sdmmc_cmd_bits |= SDMMC_CMD_VOLT_SWITCH; >> + else >> + dw_mci_wait_busy(slot); >> >> if (!clock) { >> mci_writel(host, CLKENA, 0); >> -- >> 1.8.3.2 >> >> >> -- >> To unsubscribe from this list: send the line "unsubscribe linux-mmc" in >> the body of a message to majordomo@vger.kernel.org >> More majordomo info at http://vger.kernel.org/majordomo-info.html > > ^ permalink raw reply [flat|nested] 21+ messages in thread
* Re: [PATCH] mmc: dw_mmc: fix bug that cause 'Timeout sending command' 2015-02-09 6:56 ` Addy @ 2015-02-09 7:04 ` Jaehoon Chung [not found] ` <54D85C15.8010007-Sze3O3UU22JBDgjK7y7TUQ@public.gmane.org> 0 siblings, 1 reply; 21+ messages in thread From: Jaehoon Chung @ 2015-02-09 7:04 UTC (permalink / raw) To: Addy, Ulf Hansson Cc: Rob Herring, Paweł Moll, Mark Rutland, Ian Campbell, Kumar Gala, Randy Dunlap, tgih.jun@samsung.com, Chris Ball, Dinh Nguyen, Heiko Stübner, Olof Johansson, Doug Anderson, Sonny Rao, Alexandru Stan, devicetree@vger.kernel.org, linux-doc@vger.kernel.org, linux-kernel@vger.kernel.org, linux-mmc, linux-arm-kernel@lists.infradead.org On 02/09/2015 03:56 PM, Addy wrote: > > > On 2015.02.09 12:51, Ulf Hansson wrote: >> On 5 February 2015 at 12:13, Addy Ke <addy.ke@rock-chips.com> wrote: >>> Because of some uncertain factors, such as worse card or worse hardware, >>> DAT[3:0](the data lines) may be pulled down by card, and mmc controller >>> will be in busy state. This should not happend when mmc controller >>> send command to update card clocks. If this happends, mci_send_cmd will >>> be failed and we will get 'Timeout sending command', and then system will >>> be blocked. To avoid this, we need reset mmc controller. I know that it needs to check whether card is busy or not, before clock-off. This patch seems to related with it. right? Best Regards, Jaehoon Chung >>> >>> Signed-off-by: Addy Ke <addy.ke@rock-chips.com> >> >> Hi Addy, >> >> Should I consider $subject patch as a better option to the one below? > No: > This patch fix the bug, which can be found by script: > cd /sys/bus/platform/drivers/dwmmc_rockchip > for i in $(seq 1 10000); do > echo "========================" $i > echo ff0c0000.dwmmc > unbind > sleep .5 > echo ff0c0000.dwmmc > bind > sleep 2 > done > >> [PATCH] mmc: dw_mmc: rockchip: Add DW_MCI_QUIRK_RETRY_DELAY > This patch is for tuning issue: we should delay until card go to idle state, when the previous command return error. >> https://lkml.org/lkml/2015/1/13/562 >> >> Kind regards >> Uffe >> >> >>> --- >>> drivers/mmc/host/dw_mmc.c | 23 +++++++++++++++++++++++ >>> 1 file changed, 23 insertions(+) >>> >>> diff --git a/drivers/mmc/host/dw_mmc.c b/drivers/mmc/host/dw_mmc.c >>> index 4d2e3c2..b1d6dfb 100644 >>> --- a/drivers/mmc/host/dw_mmc.c >>> +++ b/drivers/mmc/host/dw_mmc.c >>> @@ -100,6 +100,7 @@ struct idmac_desc { >>> }; >>> #endif /* CONFIG_MMC_DW_IDMAC */ >>> >>> +static int dw_mci_card_busy(struct mmc_host *mmc); >>> static bool dw_mci_reset(struct dw_mci *host); >>> static bool dw_mci_ctrl_reset(struct dw_mci *host, u32 reset); >>> >>> @@ -888,6 +889,26 @@ static void mci_send_cmd(struct dw_mci_slot *slot, u32 cmd, u32 arg) >>> cmd, arg, cmd_status); >>> } >>> >>> +static void dw_mci_wait_busy(struct dw_mci_slot *slot) >>> +{ >>> + struct dw_mci *host = slot->host; >>> + unsigned long timeout = jiffies + msecs_to_jiffies(500); >>> + >>> + while (time_before(jiffies, timeout)) { >>> + if (!dw_mci_card_busy(slot->mmc)) >>> + return; >>> + } >>> + dev_err(host->dev, "Data busy (status %#x)\n", >>> + mci_readl(slot->host, STATUS)); >>> + >>> + /* >>> + * Data busy, this should not happend when mmc controller send command >>> + * to update card clocks in non-volt-switch state. If it happends, we >>> + * should reset controller to avoid getting "Timeout sending command". >>> + */ >>> + dw_mci_ctrl_reset(host, SDMMC_CTRL_ALL_RESET_FLAGS); >>> +} >>> + >>> static void dw_mci_setup_bus(struct dw_mci_slot *slot, bool force_clkinit) >>> { >>> struct dw_mci *host = slot->host; >>> @@ -899,6 +920,8 @@ static void dw_mci_setup_bus(struct dw_mci_slot *slot, bool force_clkinit) >>> /* We must continue to set bit 28 in CMD until the change is complete */ >>> if (host->state == STATE_WAITING_CMD11_DONE) >>> sdmmc_cmd_bits |= SDMMC_CMD_VOLT_SWITCH; >>> + else >>> + dw_mci_wait_busy(slot); >>> >>> if (!clock) { >>> mci_writel(host, CLKENA, 0); >>> -- >>> 1.8.3.2 >>> >>> >>> -- >>> To unsubscribe from this list: send the line "unsubscribe linux-mmc" in >>> the body of a message to majordomo@vger.kernel.org >>> More majordomo info at http://vger.kernel.org/majordomo-info.html >> >> > > > ^ permalink raw reply [flat|nested] 21+ messages in thread
[parent not found: <54D85C15.8010007-Sze3O3UU22JBDgjK7y7TUQ@public.gmane.org>]
* Re: [PATCH] mmc: dw_mmc: fix bug that cause 'Timeout sending command' [not found] ` <54D85C15.8010007-Sze3O3UU22JBDgjK7y7TUQ@public.gmane.org> @ 2015-02-09 9:17 ` addy ke 0 siblings, 0 replies; 21+ messages in thread From: addy ke @ 2015-02-09 9:17 UTC (permalink / raw) To: jh80.chung-Sze3O3UU22JBDgjK7y7TUQ, ulf.hansson-QSEj5FYQhm4dnm+yROfE0A Cc: robh+dt-DgEjT+Ai2ygdnm+yROfE0A, pawel.moll-5wv7dgnIgG8, mark.rutland-5wv7dgnIgG8, ijc+devicetree-KcIKpvwj1kUDXYZnReoRVg, galak-sgV2jX0FEOL9JmXXK+q4OQ, rdunlap-wEGCiKHe2LqWVfeAwA7xHQ, tgih.jun-Sze3O3UU22JBDgjK7y7TUQ, chris-OsFVWbfNK3isTnJN9+BGXg, dinguyen-EIB2kfCEclfQT0dZR+AlfA, heiko-4mtYJXux2i+zQB+pC5nmwQ, olof-nZhT3qVonbNeoWH0uzbU5w, dianders-F7+t8E8rja9g9hUCZPvPmw, sonnyrao-F7+t8E8rja9g9hUCZPvPmw, amstan-F7+t8E8rja9g9hUCZPvPmw, devicetree-u79uwXL29TY76Z2rM5mHXA, linux-doc-u79uwXL29TY76Z2rM5mHXA, linux-kernel-u79uwXL29TY76Z2rM5mHXA, linux-mmc-u79uwXL29TY76Z2rM5mHXA, linux-arm-kernel-IAPFreCvJWM7uuMidbF8XUB+6BGkLq7r, linux-rockchip-IAPFreCvJWM7uuMidbF8XUB+6BGkLq7r, zhenfu.fang-TNX95d0MmH7DzftRWevZcw, cf-TNX95d0MmH7DzftRWevZcw, lintao-TNX95d0MmH7DzftRWevZcw, chenfen-TNX95d0MmH7DzftRWevZcw, zyf-TNX95d0MmH7DzftRWevZcw, xjq-TNX95d0MmH7DzftRWevZcw, huangtao-TNX95d0MmH7DzftRWevZcw, zyw-TNX95d0MmH7DzftRWevZcw, yzq-TNX95d0MmH7DzftRWevZcw, hj-TNX95d0MmH7DzftRWevZcw, kever.yang-TNX95d0MmH7DzftRWevZcw, zhangqing-TNX95d0MmH7DzftRWevZcw, hl-TNX95d0MmH7DzftRWevZcw On 2015/2/9 15:04, Jaehoon Chung wrote: > On 02/09/2015 03:56 PM, Addy wrote: >> >> >> On 2015.02.09 12:51, Ulf Hansson wrote: >>> On 5 February 2015 at 12:13, Addy Ke <addy.ke-TNX95d0MmH7DzftRWevZcw@public.gmane.org> wrote: >>>> Because of some uncertain factors, such as worse card or worse hardware, >>>> DAT[3:0](the data lines) may be pulled down by card, and mmc controller >>>> will be in busy state. This should not happend when mmc controller >>>> send command to update card clocks. If this happends, mci_send_cmd will >>>> be failed and we will get 'Timeout sending command', and then system will >>>> be blocked. To avoid this, we need reset mmc controller. > > I know that it needs to check whether card is busy or not, before clock-off. > This patch seems to related with it. right? Yes, it is. > > Best Regards, > Jaehoon Chung > >>>> >>>> Signed-off-by: Addy Ke <addy.ke-TNX95d0MmH7DzftRWevZcw@public.gmane.org> >>> >>> Hi Addy, >>> >>> Should I consider $subject patch as a better option to the one below? >> No: >> This patch fix the bug, which can be found by script: >> cd /sys/bus/platform/drivers/dwmmc_rockchip >> for i in $(seq 1 10000); do >> echo "========================" $i >> echo ff0c0000.dwmmc > unbind >> sleep .5 >> echo ff0c0000.dwmmc > bind >> sleep 2 >> done >> >>> [PATCH] mmc: dw_mmc: rockchip: Add DW_MCI_QUIRK_RETRY_DELAY >> This patch is for tuning issue: we should delay until card go to idle state, when the previous command return error. >>> https://lkml.org/lkml/2015/1/13/562 >>> >>> Kind regards >>> Uffe >>> >>> >>>> --- >>>> drivers/mmc/host/dw_mmc.c | 23 +++++++++++++++++++++++ >>>> 1 file changed, 23 insertions(+) >>>> >>>> diff --git a/drivers/mmc/host/dw_mmc.c b/drivers/mmc/host/dw_mmc.c >>>> index 4d2e3c2..b1d6dfb 100644 >>>> --- a/drivers/mmc/host/dw_mmc.c >>>> +++ b/drivers/mmc/host/dw_mmc.c >>>> @@ -100,6 +100,7 @@ struct idmac_desc { >>>> }; >>>> #endif /* CONFIG_MMC_DW_IDMAC */ >>>> >>>> +static int dw_mci_card_busy(struct mmc_host *mmc); >>>> static bool dw_mci_reset(struct dw_mci *host); >>>> static bool dw_mci_ctrl_reset(struct dw_mci *host, u32 reset); >>>> >>>> @@ -888,6 +889,26 @@ static void mci_send_cmd(struct dw_mci_slot *slot, u32 cmd, u32 arg) >>>> cmd, arg, cmd_status); >>>> } >>>> >>>> +static void dw_mci_wait_busy(struct dw_mci_slot *slot) >>>> +{ >>>> + struct dw_mci *host = slot->host; >>>> + unsigned long timeout = jiffies + msecs_to_jiffies(500); >>>> + >>>> + while (time_before(jiffies, timeout)) { >>>> + if (!dw_mci_card_busy(slot->mmc)) >>>> + return; >>>> + } >>>> + dev_err(host->dev, "Data busy (status %#x)\n", >>>> + mci_readl(slot->host, STATUS)); >>>> + >>>> + /* >>>> + * Data busy, this should not happend when mmc controller send command >>>> + * to update card clocks in non-volt-switch state. If it happends, we >>>> + * should reset controller to avoid getting "Timeout sending command". >>>> + */ >>>> + dw_mci_ctrl_reset(host, SDMMC_CTRL_ALL_RESET_FLAGS); >>>> +} >>>> + >>>> static void dw_mci_setup_bus(struct dw_mci_slot *slot, bool force_clkinit) >>>> { >>>> struct dw_mci *host = slot->host; >>>> @@ -899,6 +920,8 @@ static void dw_mci_setup_bus(struct dw_mci_slot *slot, bool force_clkinit) >>>> /* We must continue to set bit 28 in CMD until the change is complete */ >>>> if (host->state == STATE_WAITING_CMD11_DONE) >>>> sdmmc_cmd_bits |= SDMMC_CMD_VOLT_SWITCH; >>>> + else >>>> + dw_mci_wait_busy(slot); >>>> >>>> if (!clock) { >>>> mci_writel(host, CLKENA, 0); >>>> -- >>>> 1.8.3.2 >>>> >>>> >>>> -- >>>> To unsubscribe from this list: send the line "unsubscribe linux-mmc" in >>>> the body of a message to majordomo-u79uwXL29TY76Z2rM5mHXA@public.gmane.org >>>> More majordomo info at http://vger.kernel.org/majordomo-info.html >>> >>> >> >> >> > > > > -- To unsubscribe from this list: send the line "unsubscribe devicetree" in the body of a message to majordomo-u79uwXL29TY76Z2rM5mHXA@public.gmane.org More majordomo info at http://vger.kernel.org/majordomo-info.html ^ permalink raw reply [flat|nested] 21+ messages in thread
* [PATCH v2 0/2] about data busy [not found] ` <1423134801-23219-1-git-send-email-addy.ke-TNX95d0MmH7DzftRWevZcw@public.gmane.org> 2015-02-09 4:51 ` Ulf Hansson @ 2015-02-09 7:25 ` Addy Ke 2015-02-09 7:25 ` [PATCH v2 1/2] mmc: dw_mmc: fix bug that cause 'Timeout sending command' Addy Ke 2015-02-09 7:25 ` [PATCH v2 2/2] mmc: dw_mmc: Don't start command while data busy Addy Ke 1 sibling, 2 replies; 21+ messages in thread From: Addy Ke @ 2015-02-09 7:25 UTC (permalink / raw) To: robh+dt-DgEjT+Ai2ygdnm+yROfE0A, pawel.moll-5wv7dgnIgG8, mark.rutland-5wv7dgnIgG8, ijc+devicetree-KcIKpvwj1kUDXYZnReoRVg, galak-sgV2jX0FEOL9JmXXK+q4OQ, rdunlap-wEGCiKHe2LqWVfeAwA7xHQ, tgih.jun-Sze3O3UU22JBDgjK7y7TUQ, jh80.chung-Sze3O3UU22JBDgjK7y7TUQ, chris-OsFVWbfNK3isTnJN9+BGXg, ulf.hansson-QSEj5FYQhm4dnm+yROfE0A, dinguyen-EIB2kfCEclfQT0dZR+AlfA, heiko-4mtYJXux2i+zQB+pC5nmwQ, olof-nZhT3qVonbNeoWH0uzbU5w, dianders-F7+t8E8rja9g9hUCZPvPmw, sonnyrao-F7+t8E8rja9g9hUCZPvPmw, amstan-F7+t8E8rja9g9hUCZPvPmw, djkurtz-F7+t8E8rja9g9hUCZPvPmw Cc: devicetree-u79uwXL29TY76Z2rM5mHXA, linux-doc-u79uwXL29TY76Z2rM5mHXA, linux-kernel-u79uwXL29TY76Z2rM5mHXA, linux-mmc-u79uwXL29TY76Z2rM5mHXA, linux-arm-kernel-IAPFreCvJWM7uuMidbF8XUB+6BGkLq7r, linux-rockchip-IAPFreCvJWM7uuMidbF8XUB+6BGkLq7r, zhenfu.fang-TNX95d0MmH7DzftRWevZcw, cf-TNX95d0MmH7DzftRWevZcw, lintao-TNX95d0MmH7DzftRWevZcw, chenfen-TNX95d0MmH7DzftRWevZcw, zyf-TNX95d0MmH7DzftRWevZcw, xjq-TNX95d0MmH7DzftRWevZcw, huangtao-TNX95d0MmH7DzftRWevZcw, zyw-TNX95d0MmH7DzftRWevZcw, yzq-TNX95d0MmH7DzftRWevZcw, hj-TNX95d0MmH7DzftRWevZcw, kever.yang-TNX95d0MmH7DzftRWevZcw, zhangqing-TNX95d0MmH7DzftRWevZcw, hl-TNX95d0MmH7DzftRWevZcw, Addy Ke Addy Ke (2): mmc: dw_mmc: fix bug that cause 'Timeout sending command' mmc: dw_mmc: Don't start command while data busy drivers/mmc/host/dw_mmc.c | 35 +++++++++++++++++++++++++++++++++++ 1 file changed, 35 insertions(+) -- Changes in v2: - add new patch to handle data busy when start command - add cpu_relaxed, suggested by Daniel Kurtz <djkurtz-F7+t8E8rja9g9hUCZPvPmw@public.gmane.org> 1.8.3.2 -- To unsubscribe from this list: send the line "unsubscribe devicetree" in the body of a message to majordomo-u79uwXL29TY76Z2rM5mHXA@public.gmane.org More majordomo info at http://vger.kernel.org/majordomo-info.html ^ permalink raw reply [flat|nested] 21+ messages in thread
* [PATCH v2 1/2] mmc: dw_mmc: fix bug that cause 'Timeout sending command' 2015-02-09 7:25 ` [PATCH v2 0/2] about data busy Addy Ke @ 2015-02-09 7:25 ` Addy Ke 2015-02-09 10:01 ` Jaehoon Chung 2015-02-10 15:22 ` Alim Akhtar 2015-02-09 7:25 ` [PATCH v2 2/2] mmc: dw_mmc: Don't start command while data busy Addy Ke 1 sibling, 2 replies; 21+ messages in thread From: Addy Ke @ 2015-02-09 7:25 UTC (permalink / raw) To: robh+dt, pawel.moll, mark.rutland, ijc+devicetree, galak, rdunlap, tgih.jun, jh80.chung, chris, ulf.hansson, dinguyen, heiko, olof, dianders, sonnyrao, amstan, djkurtz Cc: devicetree, linux-doc, linux-kernel, linux-mmc, linux-arm-kernel, linux-rockchip, zhenfu.fang, cf, lintao, chenfen, zyf, xjq, huangtao, zyw, yzq, hj, kever.yang, zhangqing, hl, Addy Ke Because of some uncertain factors, such as worse card or worse hardware, DAT[3:0](the data lines) may be pulled down by card, and mmc controller will be in busy state. This should not happend when mmc controller send command to update card clocks. If this happends, mci_send_cmd will be failed and we will get 'Timeout sending command', and then system will be blocked. To avoid this, we need reset mmc controller. Signed-off-by: Addy Ke <addy.ke@rock-chips.com> --- drivers/mmc/host/dw_mmc.c | 28 ++++++++++++++++++++++++++++ 1 file changed, 28 insertions(+) diff --git a/drivers/mmc/host/dw_mmc.c b/drivers/mmc/host/dw_mmc.c index 4d2e3c2..b0b57e3 100644 --- a/drivers/mmc/host/dw_mmc.c +++ b/drivers/mmc/host/dw_mmc.c @@ -100,6 +100,7 @@ struct idmac_desc { }; #endif /* CONFIG_MMC_DW_IDMAC */ +static int dw_mci_card_busy(struct mmc_host *mmc); static bool dw_mci_reset(struct dw_mci *host); static bool dw_mci_ctrl_reset(struct dw_mci *host, u32 reset); @@ -888,6 +889,31 @@ static void mci_send_cmd(struct dw_mci_slot *slot, u32 cmd, u32 arg) cmd, arg, cmd_status); } +static void dw_mci_wait_busy(struct dw_mci_slot *slot) +{ + struct dw_mci *host = slot->host; + unsigned long timeout = jiffies + msecs_to_jiffies(500); + + do { + if (!dw_mci_card_busy(slot->mmc)) + return; + cpu_relax(); + } while (time_before(jiffies, timeout)); + + dev_err(host->dev, "Data busy (status %#x)\n", + mci_readl(slot->host, STATUS)); + + /* + * Data busy, this should not happend when mmc controller send command + * to update card clocks in non-volt-switch state. If it happends, we + * should reset controller to avoid getting "Timeout sending command". + */ + dw_mci_ctrl_reset(host, SDMMC_CTRL_ALL_RESET_FLAGS); + + /* Fail to reset controller or still data busy, WARN_ON! */ + WARN_ON(dw_mci_card_busy(slot->mmc)); +} + static void dw_mci_setup_bus(struct dw_mci_slot *slot, bool force_clkinit) { struct dw_mci *host = slot->host; @@ -899,6 +925,8 @@ static void dw_mci_setup_bus(struct dw_mci_slot *slot, bool force_clkinit) /* We must continue to set bit 28 in CMD until the change is complete */ if (host->state == STATE_WAITING_CMD11_DONE) sdmmc_cmd_bits |= SDMMC_CMD_VOLT_SWITCH; + else + dw_mci_wait_busy(slot); if (!clock) { mci_writel(host, CLKENA, 0); -- 1.8.3.2 ^ permalink raw reply related [flat|nested] 21+ messages in thread
* Re: [PATCH v2 1/2] mmc: dw_mmc: fix bug that cause 'Timeout sending command' 2015-02-09 7:25 ` [PATCH v2 1/2] mmc: dw_mmc: fix bug that cause 'Timeout sending command' Addy Ke @ 2015-02-09 10:01 ` Jaehoon Chung 2015-02-11 3:07 ` Addy 2015-02-10 15:22 ` Alim Akhtar 1 sibling, 1 reply; 21+ messages in thread From: Jaehoon Chung @ 2015-02-09 10:01 UTC (permalink / raw) To: Addy Ke, robh+dt, pawel.moll, mark.rutland, ijc+devicetree, galak, rdunlap, tgih.jun, chris, ulf.hansson, dinguyen, heiko, olof, dianders, sonnyrao, amstan, djkurtz Cc: devicetree, linux-doc, linux-kernel, linux-mmc, linux-arm-kernel, linux-rockchip, zhenfu.fang, cf, lintao, chenfen, zyf, xjq, huangtao, zyw, yzq, hj, kever.yang, zhangqing, hl Hi, Addy. On 02/09/2015 04:25 PM, Addy Ke wrote: > Because of some uncertain factors, such as worse card or worse hardware, > DAT[3:0](the data lines) may be pulled down by card, and mmc controller > will be in busy state. This should not happend when mmc controller > send command to update card clocks. If this happends, mci_send_cmd will > be failed and we will get 'Timeout sending command', and then system will > be blocked. To avoid this, we need reset mmc controller. > > Signed-off-by: Addy Ke <addy.ke@rock-chips.com> > --- > drivers/mmc/host/dw_mmc.c | 28 ++++++++++++++++++++++++++++ > 1 file changed, 28 insertions(+) > > diff --git a/drivers/mmc/host/dw_mmc.c b/drivers/mmc/host/dw_mmc.c > index 4d2e3c2..b0b57e3 100644 > --- a/drivers/mmc/host/dw_mmc.c > +++ b/drivers/mmc/host/dw_mmc.c > @@ -100,6 +100,7 @@ struct idmac_desc { > }; > #endif /* CONFIG_MMC_DW_IDMAC */ > > +static int dw_mci_card_busy(struct mmc_host *mmc); > static bool dw_mci_reset(struct dw_mci *host); > static bool dw_mci_ctrl_reset(struct dw_mci *host, u32 reset); > > @@ -888,6 +889,31 @@ static void mci_send_cmd(struct dw_mci_slot *slot, u32 cmd, u32 arg) > cmd, arg, cmd_status); > } > > +static void dw_mci_wait_busy(struct dw_mci_slot *slot) > +{ > + struct dw_mci *host = slot->host; > + unsigned long timeout = jiffies + msecs_to_jiffies(500); > + > + do { > + if (!dw_mci_card_busy(slot->mmc)) > + return; > + cpu_relax(); > + } while (time_before(jiffies, timeout)); > + > + dev_err(host->dev, "Data busy (status %#x)\n", > + mci_readl(slot->host, STATUS)); > + > + /* > + * Data busy, this should not happend when mmc controller send command > + * to update card clocks in non-volt-switch state. If it happends, we > + * should reset controller to avoid getting "Timeout sending command". > + */ > + dw_mci_ctrl_reset(host, SDMMC_CTRL_ALL_RESET_FLAGS); If reset is failed, then dw_mci_ctrl_reset should return "false". ret = dw_mci_ctrl_reset(); WARN_ON(!ret || dw_mci_card_busy(slot->mmc)); Is it right? In my experiment, if reset is failed or card is busy, eMMC can't work anymore..right? I think this patch is reasonable to prevent blocking issue. Best Regards, Jaehoon Chung > + > + /* Fail to reset controller or still data busy, WARN_ON! */ > + WARN_ON(dw_mci_card_busy(slot->mmc)); > +} > + > static void dw_mci_setup_bus(struct dw_mci_slot *slot, bool force_clkinit) > { > struct dw_mci *host = slot->host; > @@ -899,6 +925,8 @@ static void dw_mci_setup_bus(struct dw_mci_slot *slot, bool force_clkinit) > /* We must continue to set bit 28 in CMD until the change is complete */ > if (host->state == STATE_WAITING_CMD11_DONE) > sdmmc_cmd_bits |= SDMMC_CMD_VOLT_SWITCH; > + else > + dw_mci_wait_busy(slot); > > if (!clock) { > mci_writel(host, CLKENA, 0); > ^ permalink raw reply [flat|nested] 21+ messages in thread
* Re: [PATCH v2 1/2] mmc: dw_mmc: fix bug that cause 'Timeout sending command' 2015-02-09 10:01 ` Jaehoon Chung @ 2015-02-11 3:07 ` Addy 0 siblings, 0 replies; 21+ messages in thread From: Addy @ 2015-02-11 3:07 UTC (permalink / raw) To: Jaehoon Chung, robh+dt, pawel.moll, mark.rutland, ijc+devicetree, galak, rdunlap, tgih.jun, chris, ulf.hansson, dinguyen, heiko, olof, dianders, sonnyrao, amstan, djkurtz Cc: devicetree, linux-doc, linux-kernel, linux-mmc, linux-arm-kernel, linux-rockchip, zhenfu.fang, cf, lintao, chenfen, zyf, xjq, huangtao, zyw, yzq, hj, kever.yang, zhangqing, hl On 2015/02/09 18:01, Jaehoon Chung wrote: > Hi, Addy. > > On 02/09/2015 04:25 PM, Addy Ke wrote: >> Because of some uncertain factors, such as worse card or worse hardware, >> DAT[3:0](the data lines) may be pulled down by card, and mmc controller >> will be in busy state. This should not happend when mmc controller >> send command to update card clocks. If this happends, mci_send_cmd will >> be failed and we will get 'Timeout sending command', and then system will >> be blocked. To avoid this, we need reset mmc controller. >> >> Signed-off-by: Addy Ke <addy.ke@rock-chips.com> >> --- >> drivers/mmc/host/dw_mmc.c | 28 ++++++++++++++++++++++++++++ >> 1 file changed, 28 insertions(+) >> >> diff --git a/drivers/mmc/host/dw_mmc.c b/drivers/mmc/host/dw_mmc.c >> index 4d2e3c2..b0b57e3 100644 >> --- a/drivers/mmc/host/dw_mmc.c >> +++ b/drivers/mmc/host/dw_mmc.c >> @@ -100,6 +100,7 @@ struct idmac_desc { >> }; >> #endif /* CONFIG_MMC_DW_IDMAC */ >> >> +static int dw_mci_card_busy(struct mmc_host *mmc); >> static bool dw_mci_reset(struct dw_mci *host); >> static bool dw_mci_ctrl_reset(struct dw_mci *host, u32 reset); >> >> @@ -888,6 +889,31 @@ static void mci_send_cmd(struct dw_mci_slot *slot, u32 cmd, u32 arg) >> cmd, arg, cmd_status); >> } >> >> +static void dw_mci_wait_busy(struct dw_mci_slot *slot) >> +{ >> + struct dw_mci *host = slot->host; >> + unsigned long timeout = jiffies + msecs_to_jiffies(500); >> + >> + do { >> + if (!dw_mci_card_busy(slot->mmc)) >> + return; >> + cpu_relax(); >> + } while (time_before(jiffies, timeout)); >> + >> + dev_err(host->dev, "Data busy (status %#x)\n", >> + mci_readl(slot->host, STATUS)); >> + >> + /* >> + * Data busy, this should not happend when mmc controller send command >> + * to update card clocks in non-volt-switch state. If it happends, we >> + * should reset controller to avoid getting "Timeout sending command". >> + */ >> + dw_mci_ctrl_reset(host, SDMMC_CTRL_ALL_RESET_FLAGS); > If reset is failed, then dw_mci_ctrl_reset should return "false". > > ret = dw_mci_ctrl_reset(); > > WARN_ON(!ret || dw_mci_card_busy(slot->mmc)); > > Is it right? you are right, and I will update it in my next version patch. thank you. > > In my experiment, if reset is failed or card is busy, eMMC can't work anymore..right? > I think this patch is reasonable to prevent blocking issue. > > Best Regards, > Jaehoon Chung > > >> + >> + /* Fail to reset controller or still data busy, WARN_ON! */ >> + WARN_ON(dw_mci_card_busy(slot->mmc)); >> +} >> + >> static void dw_mci_setup_bus(struct dw_mci_slot *slot, bool force_clkinit) >> { >> struct dw_mci *host = slot->host; >> @@ -899,6 +925,8 @@ static void dw_mci_setup_bus(struct dw_mci_slot *slot, bool force_clkinit) >> /* We must continue to set bit 28 in CMD until the change is complete */ >> if (host->state == STATE_WAITING_CMD11_DONE) >> sdmmc_cmd_bits |= SDMMC_CMD_VOLT_SWITCH; >> + else >> + dw_mci_wait_busy(slot); >> >> if (!clock) { >> mci_writel(host, CLKENA, 0); >> > > > ^ permalink raw reply [flat|nested] 21+ messages in thread
* Re: [PATCH v2 1/2] mmc: dw_mmc: fix bug that cause 'Timeout sending command' 2015-02-09 7:25 ` [PATCH v2 1/2] mmc: dw_mmc: fix bug that cause 'Timeout sending command' Addy Ke 2015-02-09 10:01 ` Jaehoon Chung @ 2015-02-10 15:22 ` Alim Akhtar 2015-02-11 2:57 ` Addy 1 sibling, 1 reply; 21+ messages in thread From: Alim Akhtar @ 2015-02-10 15:22 UTC (permalink / raw) To: Addy Ke Cc: robh+dt, pawel.moll, Mark Rutland, ijc+devicetree, galak, rdunlap, Seungwon Jeon, Jaehoon Chung, Chris Ball, Ulf Hansson, dinguyen, Heiko Stübner, Olof Johansson, Douglas Anderson, Sonny Rao, amstan, djkurtz, huangtao, devicetree@vger.kernel.org, hl, linux-doc, yzq, zyw, zhangqing, linux-mmc@vger.kernel.org, linux-kernel@vger.kernel.org, kever.yang Hi Addy, On Mon, Feb 9, 2015 at 12:55 PM, Addy Ke <addy.ke@rock-chips.com> wrote: > Because of some uncertain factors, such as worse card or worse hardware, > DAT[3:0](the data lines) may be pulled down by card, and mmc controller > will be in busy state. This should not happend when mmc controller > send command to update card clocks. If this happends, mci_send_cmd will > be failed and we will get 'Timeout sending command', and then system will > be blocked. To avoid this, we need reset mmc controller. > > Signed-off-by: Addy Ke <addy.ke@rock-chips.com> > --- > drivers/mmc/host/dw_mmc.c | 28 ++++++++++++++++++++++++++++ > 1 file changed, 28 insertions(+) > > diff --git a/drivers/mmc/host/dw_mmc.c b/drivers/mmc/host/dw_mmc.c > index 4d2e3c2..b0b57e3 100644 > --- a/drivers/mmc/host/dw_mmc.c > +++ b/drivers/mmc/host/dw_mmc.c > @@ -100,6 +100,7 @@ struct idmac_desc { > }; > #endif /* CONFIG_MMC_DW_IDMAC */ > > +static int dw_mci_card_busy(struct mmc_host *mmc); > static bool dw_mci_reset(struct dw_mci *host); > static bool dw_mci_ctrl_reset(struct dw_mci *host, u32 reset); > > @@ -888,6 +889,31 @@ static void mci_send_cmd(struct dw_mci_slot *slot, u32 cmd, u32 arg) > cmd, arg, cmd_status); > } > > +static void dw_mci_wait_busy(struct dw_mci_slot *slot) > +{ > + struct dw_mci *host = slot->host; > + unsigned long timeout = jiffies + msecs_to_jiffies(500); > + Why 500 msec? > + do { > + if (!dw_mci_card_busy(slot->mmc)) > + return; > + cpu_relax(); > + } while (time_before(jiffies, timeout)); > + > + dev_err(host->dev, "Data busy (status %#x)\n", > + mci_readl(slot->host, STATUS)); > + > + /* > + * Data busy, this should not happend when mmc controller send command > + * to update card clocks in non-volt-switch state. If it happends, we > + * should reset controller to avoid getting "Timeout sending command". > + */ > + dw_mci_ctrl_reset(host, SDMMC_CTRL_ALL_RESET_FLAGS); > + Why you need to reset all blocks? may be CTRL_RESET is good enough here. > + /* Fail to reset controller or still data busy, WARN_ON! */ > + WARN_ON(dw_mci_card_busy(slot->mmc)); > +} > + > static void dw_mci_setup_bus(struct dw_mci_slot *slot, bool force_clkinit) > { > struct dw_mci *host = slot->host; > @@ -899,6 +925,8 @@ static void dw_mci_setup_bus(struct dw_mci_slot *slot, bool force_clkinit) > /* We must continue to set bit 28 in CMD until the change is complete */ > if (host->state == STATE_WAITING_CMD11_DONE) > sdmmc_cmd_bits |= SDMMC_CMD_VOLT_SWITCH; > + else > + dw_mci_wait_busy(slot); > hmm...I would suggest you to call dw_mci_wait_busy() from inside mci_send_cmd(), seems like dw_mmc hangs while sending update clock cmd in multiple cases.see [1] [1]: http://permalink.gmane.org/gmane.linux.kernel.mmc/31140 > if (!clock) { > mci_writel(host, CLKENA, 0); > -- > 1.8.3.2 > > > > _______________________________________________ > linux-arm-kernel mailing list > linux-arm-kernel@lists.infradead.org > http://lists.infradead.org/mailman/listinfo/linux-arm-kernel -- Regards, Alim ^ permalink raw reply [flat|nested] 21+ messages in thread
* Re: [PATCH v2 1/2] mmc: dw_mmc: fix bug that cause 'Timeout sending command' 2015-02-10 15:22 ` Alim Akhtar @ 2015-02-11 2:57 ` Addy 2015-02-11 11:58 ` Andrzej Hajda 0 siblings, 1 reply; 21+ messages in thread From: Addy @ 2015-02-11 2:57 UTC (permalink / raw) To: Alim Akhtar Cc: robh+dt, pawel.moll, Mark Rutland, ijc+devicetree, galak, rdunlap, Seungwon Jeon, Jaehoon Chung, Chris Ball, Ulf Hansson, dinguyen, Heiko Stübner, Olof Johansson, Douglas Anderson, Sonny Rao, amstan, djkurtz, huangtao, devicetree@vger.kernel.org, hl, linux-doc, yzq, zyw, zhangqing, linux-mmc@vger.kernel.org, linux-kernel@vger.kernel.org, kever.yang On 2015/02/10 23:22, Alim Akhtar wrote: > Hi Addy, > > On Mon, Feb 9, 2015 at 12:55 PM, Addy Ke <addy.ke@rock-chips.com> wrote: >> Because of some uncertain factors, such as worse card or worse hardware, >> DAT[3:0](the data lines) may be pulled down by card, and mmc controller >> will be in busy state. This should not happend when mmc controller >> send command to update card clocks. If this happends, mci_send_cmd will >> be failed and we will get 'Timeout sending command', and then system will >> be blocked. To avoid this, we need reset mmc controller. >> >> Signed-off-by: Addy Ke <addy.ke@rock-chips.com> >> --- >> drivers/mmc/host/dw_mmc.c | 28 ++++++++++++++++++++++++++++ >> 1 file changed, 28 insertions(+) >> >> diff --git a/drivers/mmc/host/dw_mmc.c b/drivers/mmc/host/dw_mmc.c >> index 4d2e3c2..b0b57e3 100644 >> --- a/drivers/mmc/host/dw_mmc.c >> +++ b/drivers/mmc/host/dw_mmc.c >> @@ -100,6 +100,7 @@ struct idmac_desc { >> }; >> #endif /* CONFIG_MMC_DW_IDMAC */ >> >> +static int dw_mci_card_busy(struct mmc_host *mmc); >> static bool dw_mci_reset(struct dw_mci *host); >> static bool dw_mci_ctrl_reset(struct dw_mci *host, u32 reset); >> >> @@ -888,6 +889,31 @@ static void mci_send_cmd(struct dw_mci_slot *slot, u32 cmd, u32 arg) >> cmd, arg, cmd_status); >> } >> >> +static void dw_mci_wait_busy(struct dw_mci_slot *slot) >> +{ >> + struct dw_mci *host = slot->host; >> + unsigned long timeout = jiffies + msecs_to_jiffies(500); >> + > Why 500 msec? This timeout value is the same as mci_send_cmd: static void mci_send_cmd(struct dw_mci_slot *slot, u32 cmd, u32 arg) { struct dw_mci *host = slot->host; unsigned long timeout = jiffies + msecs_to_jiffies(500); .... } I have not clear that which is suitable. Do you have any suggestion on it? > >> + do { >> + if (!dw_mci_card_busy(slot->mmc)) >> + return; >> + cpu_relax(); >> + } while (time_before(jiffies, timeout)); >> + >> + dev_err(host->dev, "Data busy (status %#x)\n", >> + mci_readl(slot->host, STATUS)); >> + >> + /* >> + * Data busy, this should not happend when mmc controller send command >> + * to update card clocks in non-volt-switch state. If it happends, we >> + * should reset controller to avoid getting "Timeout sending command". >> + */ >> + dw_mci_ctrl_reset(host, SDMMC_CTRL_ALL_RESET_FLAGS); >> + > Why you need to reset all blocks? may be CTRL_RESET is good enough here. I have tested on rk3288, if only reset ctroller, data busy bit will not be cleaned,and we will still get "Timeout sending command". > >> + /* Fail to reset controller or still data busy, WARN_ON! */ >> + WARN_ON(dw_mci_card_busy(slot->mmc)); >> +} >> + >> static void dw_mci_setup_bus(struct dw_mci_slot *slot, bool force_clkinit) >> { >> struct dw_mci *host = slot->host; >> @@ -899,6 +925,8 @@ static void dw_mci_setup_bus(struct dw_mci_slot *slot, bool force_clkinit) >> /* We must continue to set bit 28 in CMD until the change is complete */ >> if (host->state == STATE_WAITING_CMD11_DONE) >> sdmmc_cmd_bits |= SDMMC_CMD_VOLT_SWITCH; >> + else >> + dw_mci_wait_busy(slot); >> > hmm...I would suggest you to call dw_mci_wait_busy() from inside > mci_send_cmd(), seems like dw_mmc hangs while sending update clock cmd > in multiple cases.see [1] > > [1]: http://permalink.gmane.org/gmane.linux.kernel.mmc/31140 I think this patch is more reasonable. So I will resend patches based on this patch. thank you! > >> if (!clock) { >> mci_writel(host, CLKENA, 0); >> -- >> 1.8.3.2 >> >> >> >> _______________________________________________ >> linux-arm-kernel mailing list >> linux-arm-kernel@lists.infradead.org >> http://lists.infradead.org/mailman/listinfo/linux-arm-kernel > > ^ permalink raw reply [flat|nested] 21+ messages in thread
* Re: [PATCH v2 1/2] mmc: dw_mmc: fix bug that cause 'Timeout sending command' 2015-02-11 2:57 ` Addy @ 2015-02-11 11:58 ` Andrzej Hajda 2015-02-11 23:20 ` Alim Akhtar 0 siblings, 1 reply; 21+ messages in thread From: Andrzej Hajda @ 2015-02-11 11:58 UTC (permalink / raw) To: Addy, Alim Akhtar Cc: robh+dt, pawel.moll, Mark Rutland, ijc+devicetree, galak, rdunlap, Seungwon Jeon, Jaehoon Chung, Chris Ball, Ulf Hansson, dinguyen, Heiko Stübner, Olof Johansson, Douglas Anderson, Sonny Rao, amstan, djkurtz, huangtao, devicetree@vger.kernel.org, hl, linux-doc, yzq, zyw, zhangqing, linux-mmc@vger.kernel.org, linux-kernel@vger.kernel.org, kever.yang Hi Alim, On 02/11/2015 03:57 AM, Addy wrote: > > On 2015/02/10 23:22, Alim Akhtar wrote: >> Hi Addy, >> >> On Mon, Feb 9, 2015 at 12:55 PM, Addy Ke <addy.ke@rock-chips.com> wrote: >>> Because of some uncertain factors, such as worse card or worse hardware, >>> DAT[3:0](the data lines) may be pulled down by card, and mmc controller >>> will be in busy state. This should not happend when mmc controller >>> send command to update card clocks. If this happends, mci_send_cmd will >>> be failed and we will get 'Timeout sending command', and then system will >>> be blocked. To avoid this, we need reset mmc controller. >>> >>> Signed-off-by: Addy Ke <addy.ke@rock-chips.com> >>> --- >>> drivers/mmc/host/dw_mmc.c | 28 ++++++++++++++++++++++++++++ >>> 1 file changed, 28 insertions(+) >>> >>> diff --git a/drivers/mmc/host/dw_mmc.c b/drivers/mmc/host/dw_mmc.c >>> index 4d2e3c2..b0b57e3 100644 >>> --- a/drivers/mmc/host/dw_mmc.c >>> +++ b/drivers/mmc/host/dw_mmc.c >>> @@ -100,6 +100,7 @@ struct idmac_desc { >>> }; >>> #endif /* CONFIG_MMC_DW_IDMAC */ >>> >>> +static int dw_mci_card_busy(struct mmc_host *mmc); >>> static bool dw_mci_reset(struct dw_mci *host); >>> static bool dw_mci_ctrl_reset(struct dw_mci *host, u32 reset); >>> >>> @@ -888,6 +889,31 @@ static void mci_send_cmd(struct dw_mci_slot *slot, u32 cmd, u32 arg) >>> cmd, arg, cmd_status); >>> } >>> >>> +static void dw_mci_wait_busy(struct dw_mci_slot *slot) >>> +{ >>> + struct dw_mci *host = slot->host; >>> + unsigned long timeout = jiffies + msecs_to_jiffies(500); >>> + >> Why 500 msec? > This timeout value is the same as mci_send_cmd: > static void mci_send_cmd(struct dw_mci_slot *slot, u32 cmd, u32 arg) > { > struct dw_mci *host = slot->host; > unsigned long timeout = jiffies + msecs_to_jiffies(500); > .... > } > > I have not clear that which is suitable. > Do you have any suggestion on it? >> >>> + do { >>> + if (!dw_mci_card_busy(slot->mmc)) >>> + return; >>> + cpu_relax(); >>> + } while (time_before(jiffies, timeout)); >>> + >>> + dev_err(host->dev, "Data busy (status %#x)\n", >>> + mci_readl(slot->host, STATUS)); >>> + >>> + /* >>> + * Data busy, this should not happend when mmc controller send command >>> + * to update card clocks in non-volt-switch state. If it happends, we >>> + * should reset controller to avoid getting "Timeout sending command". >>> + */ >>> + dw_mci_ctrl_reset(host, SDMMC_CTRL_ALL_RESET_FLAGS); >>> + >> Why you need to reset all blocks? may be CTRL_RESET is good enough here. > I have tested on rk3288, if only reset ctroller, data busy bit will not > be cleaned,and we will still get > > "Timeout sending command". > >> >>> + /* Fail to reset controller or still data busy, WARN_ON! */ >>> + WARN_ON(dw_mci_card_busy(slot->mmc)); >>> +} >>> + >>> static void dw_mci_setup_bus(struct dw_mci_slot *slot, bool force_clkinit) >>> { >>> struct dw_mci *host = slot->host; >>> @@ -899,6 +925,8 @@ static void dw_mci_setup_bus(struct dw_mci_slot *slot, bool force_clkinit) >>> /* We must continue to set bit 28 in CMD until the change is complete */ >>> if (host->state == STATE_WAITING_CMD11_DONE) >>> sdmmc_cmd_bits |= SDMMC_CMD_VOLT_SWITCH; >>> + else >>> + dw_mci_wait_busy(slot); >>> >> hmm...I would suggest you to call dw_mci_wait_busy() from inside >> mci_send_cmd(), seems like dw_mmc hangs while sending update clock cmd >> in multiple cases.see [1] >> >> [1]: http://permalink.gmane.org/gmane.linux.kernel.mmc/31140 > I think this patch is more reasonable. > So I will resend patches based on this patch. > thank you! I have tested your patches instead [1] above and they do not solve my issue: Board: odroid-xu3/exynos5422/dw_mmc_250a. MMC card: absent, broken-cd quirk SD card: present System hangs during boot after few minutes kernel spits: [ 242.188098] INFO: task kworker/u16:1:50 blocked for more than 120 seconds. [ 242.193524] Not tainted 3.19.0-next-20150210-00002-gf96831b-dirty #3834 [ 242.200622] "echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this message. [ 242.208422] kworker/u16:1 D c04766ac 0 50 2 0x00000000 [ 242.214756] Workqueue: kmmcd mmc_rescan [ 242.218553] [<c04766ac>] (__schedule) from [<c0476a58>] (schedule+0x34/0x98) [ 242.225591] [<c0476a58>] (schedule) from [<c047a4dc>] (schedule_timeout+0x110/0x164) [ 242.233302] [<c047a4dc>] (schedule_timeout) from [<c04774f0>] (wait_for_common+0xb8/0x14c) [ 242.241539] [<c04774f0>] (wait_for_common) from [<c0362138>] (mmc_wait_for_req+0x68/0x17c) [ 242.249861] [<c0362138>] (mmc_wait_for_req) from [<c03622cc>] (mmc_wait_for_cmd+0x80/0xa0) [ 242.258002] [<c03622cc>] (mmc_wait_for_cmd) from [<c0367e50>] (mmc_go_idle+0x78/0xf8) [ 242.265796] [<c0367e50>] (mmc_go_idle) from [<c0363e2c>] (mmc_rescan+0x280/0x314) [ 242.273253] [<c0363e2c>] (mmc_rescan) from [<c0034764>] (process_one_work+0x120/0x324) [ 242.281135] [<c0034764>] (process_one_work) from [<c00349cc>] (worker_thread+0x30/0x42c) [ 242.289194] [<c00349cc>] (worker_thread) from [<c003926c>] (kthread+0xd8/0xf4) [ 242.296389] [<c003926c>] (kthread) from [<c000e7c0>] (ret_from_fork+0x14/0x34) Just for record, Exynos4412/dw_mmc_240a with the same configuration (no MMC card, broken-cd) works OK without patches. Regards Andrzej >> >>> if (!clock) { >>> mci_writel(host, CLKENA, 0); >>> -- >>> 1.8.3.2 >>> >>> >>> >>> _______________________________________________ >>> linux-arm-kernel mailing list >>> linux-arm-kernel@lists.infradead.org >>> http://lists.infradead.org/mailman/listinfo/linux-arm-kernel >> >> > > > -- > To unsubscribe from this list: send the line "unsubscribe linux-doc" in > the body of a message to majordomo@vger.kernel.org > More majordomo info at http://vger.kernel.org/majordomo-info.html > ^ permalink raw reply [flat|nested] 21+ messages in thread
* Re: [PATCH v2 1/2] mmc: dw_mmc: fix bug that cause 'Timeout sending command' 2015-02-11 11:58 ` Andrzej Hajda @ 2015-02-11 23:20 ` Alim Akhtar 2015-02-12 2:28 ` addy ke 2015-02-12 11:13 ` Andrzej Hajda 0 siblings, 2 replies; 21+ messages in thread From: Alim Akhtar @ 2015-02-11 23:20 UTC (permalink / raw) To: Andrzej Hajda Cc: Addy, robh+dt, pawel.moll, Mark Rutland, ijc+devicetree, Kumar Gala, rdunlap, Seungwon Jeon, Jaehoon Chung, Chris Ball, Ulf Hansson, dinguyen, Heiko Stübner, Olof Johansson, Douglas Anderson, Sonny Rao, Alexandru Stan, Daniel Kurtz, Tao Huang, devicetree@vger.kernel.org, Lin Huang, linux-doc, 姚智情, Chris Zhong, zhangqing Hi Andrzej, On Wed, Feb 11, 2015 at 5:28 PM, Andrzej Hajda <a.hajda@samsung.com> wrote: > Hi Alim, > > On 02/11/2015 03:57 AM, Addy wrote: >> >> On 2015/02/10 23:22, Alim Akhtar wrote: >>> Hi Addy, >>> >>> On Mon, Feb 9, 2015 at 12:55 PM, Addy Ke <addy.ke@rock-chips.com> wrote: >>>> Because of some uncertain factors, such as worse card or worse hardware, >>>> DAT[3:0](the data lines) may be pulled down by card, and mmc controller >>>> will be in busy state. This should not happend when mmc controller >>>> send command to update card clocks. If this happends, mci_send_cmd will >>>> be failed and we will get 'Timeout sending command', and then system will >>>> be blocked. To avoid this, we need reset mmc controller. >>>> >>>> Signed-off-by: Addy Ke <addy.ke@rock-chips.com> >>>> --- >>>> drivers/mmc/host/dw_mmc.c | 28 ++++++++++++++++++++++++++++ >>>> 1 file changed, 28 insertions(+) >>>> >>>> diff --git a/drivers/mmc/host/dw_mmc.c b/drivers/mmc/host/dw_mmc.c >>>> index 4d2e3c2..b0b57e3 100644 >>>> --- a/drivers/mmc/host/dw_mmc.c >>>> +++ b/drivers/mmc/host/dw_mmc.c >>>> @@ -100,6 +100,7 @@ struct idmac_desc { >>>> }; >>>> #endif /* CONFIG_MMC_DW_IDMAC */ >>>> >>>> +static int dw_mci_card_busy(struct mmc_host *mmc); >>>> static bool dw_mci_reset(struct dw_mci *host); >>>> static bool dw_mci_ctrl_reset(struct dw_mci *host, u32 reset); >>>> >>>> @@ -888,6 +889,31 @@ static void mci_send_cmd(struct dw_mci_slot *slot, u32 cmd, u32 arg) >>>> cmd, arg, cmd_status); >>>> } >>>> >>>> +static void dw_mci_wait_busy(struct dw_mci_slot *slot) >>>> +{ >>>> + struct dw_mci *host = slot->host; >>>> + unsigned long timeout = jiffies + msecs_to_jiffies(500); >>>> + >>> Why 500 msec? >> This timeout value is the same as mci_send_cmd: >> static void mci_send_cmd(struct dw_mci_slot *slot, u32 cmd, u32 arg) >> { >> struct dw_mci *host = slot->host; >> unsigned long timeout = jiffies + msecs_to_jiffies(500); >> .... >> } >> >> I have not clear that which is suitable. >> Do you have any suggestion on it? >>> >>>> + do { >>>> + if (!dw_mci_card_busy(slot->mmc)) >>>> + return; >>>> + cpu_relax(); >>>> + } while (time_before(jiffies, timeout)); >>>> + >>>> + dev_err(host->dev, "Data busy (status %#x)\n", >>>> + mci_readl(slot->host, STATUS)); >>>> + >>>> + /* >>>> + * Data busy, this should not happend when mmc controller send command >>>> + * to update card clocks in non-volt-switch state. If it happends, we >>>> + * should reset controller to avoid getting "Timeout sending command". >>>> + */ >>>> + dw_mci_ctrl_reset(host, SDMMC_CTRL_ALL_RESET_FLAGS); >>>> + >>> Why you need to reset all blocks? may be CTRL_RESET is good enough here. >> I have tested on rk3288, if only reset ctroller, data busy bit will not >> be cleaned,and we will still get >> >> "Timeout sending command". >> >>> >>>> + /* Fail to reset controller or still data busy, WARN_ON! */ >>>> + WARN_ON(dw_mci_card_busy(slot->mmc)); >>>> +} >>>> + >>>> static void dw_mci_setup_bus(struct dw_mci_slot *slot, bool force_clkinit) >>>> { >>>> struct dw_mci *host = slot->host; >>>> @@ -899,6 +925,8 @@ static void dw_mci_setup_bus(struct dw_mci_slot *slot, bool force_clkinit) >>>> /* We must continue to set bit 28 in CMD until the change is complete */ >>>> if (host->state == STATE_WAITING_CMD11_DONE) >>>> sdmmc_cmd_bits |= SDMMC_CMD_VOLT_SWITCH; >>>> + else >>>> + dw_mci_wait_busy(slot); >>>> >>> hmm...I would suggest you to call dw_mci_wait_busy() from inside >>> mci_send_cmd(), seems like dw_mmc hangs while sending update clock cmd >>> in multiple cases.see [1] >>> >>> [1]: http://permalink.gmane.org/gmane.linux.kernel.mmc/31140 >> I think this patch is more reasonable. >> So I will resend patches based on this patch. >> thank you! > > I have tested your patches instead [1] above and they do not solve my issue: > Board: odroid-xu3/exynos5422/dw_mmc_250a. > MMC card: absent, broken-cd quirk > SD card: present > I doubt $SUBJECT patch in current form can resolve you issue. I have already given comments on $subject patch. Can you try out below patch (I have not tested yet) on top of $SUBJECT patch? ======= diff --git a/drivers/mmc/host/dw_mmc.c b/drivers/mmc/host/dw_mmc.c index b0b57e3..ea87844 100644 --- a/drivers/mmc/host/dw_mmc.c +++ b/drivers/mmc/host/dw_mmc.c @@ -101,6 +101,7 @@ struct idmac_desc { #endif /* CONFIG_MMC_DW_IDMAC */ static int dw_mci_card_busy(struct mmc_host *mmc); +static void dw_mci_wait_busy(struct dw_mci_slot *slot); static bool dw_mci_reset(struct dw_mci *host); static bool dw_mci_ctrl_reset(struct dw_mci *host, u32 reset); @@ -874,16 +875,22 @@ static void mci_send_cmd(struct dw_mci_slot *slot, u32 cmd, u32 arg) struct dw_mci *host = slot->host; unsigned long timeout = jiffies + msecs_to_jiffies(500); unsigned int cmd_status = 0; + int re_try = 3; /* just random for now, 1 re-try should be ok */ - mci_writel(host, CMDARG, arg); - wmb(); - mci_writel(host, CMD, SDMMC_CMD_START | cmd); + while(re_try--) { + mci_writel(host, CMDARG, arg); + wmb(); + mci_writel(host, CMD, SDMMC_CMD_START | cmd); - while (time_before(jiffies, timeout)) { - cmd_status = mci_readl(host, CMD); - if (!(cmd_status & SDMMC_CMD_START)) - return; + while (time_before(jiffies, timeout)) { + cmd_status = mci_readl(host, CMD); + if (!(cmd_status & SDMMC_CMD_START)) + return; + } + + dw_mci_wait_busy(slot); } + dev_err(&slot->mmc->class_dev, "Timeout sending command (cmd %#x arg %#x status %#x)\n", cmd, arg, cmd_status); @@ -925,8 +932,6 @@ static void dw_mci_setup_bus(struct dw_mci_slot *slot, bool force_clkinit) /* We must continue to set bit 28 in CMD until the change is complete */ if (host->state == STATE_WAITING_CMD11_DONE) sdmmc_cmd_bits |= SDMMC_CMD_VOLT_SWITCH; - else - dw_mci_wait_busy(slot); if (!clock) { mci_writel(host, CLKENA, 0); ===== end ====== > System hangs during boot after few minutes kernel spits: > [ 242.188098] INFO: task kworker/u16:1:50 blocked for more than 120 > seconds. > [ 242.193524] Not tainted > 3.19.0-next-20150210-00002-gf96831b-dirty #3834 > [ 242.200622] "echo 0 > /proc/sys/kernel/hung_task_timeout_secs" > disables this message. > [ 242.208422] kworker/u16:1 D c04766ac 0 50 2 0x00000000 > [ 242.214756] Workqueue: kmmcd mmc_rescan > [ 242.218553] [<c04766ac>] (__schedule) from [<c0476a58>] > (schedule+0x34/0x98) > [ 242.225591] [<c0476a58>] (schedule) from [<c047a4dc>] > (schedule_timeout+0x110/0x164) > [ 242.233302] [<c047a4dc>] (schedule_timeout) from [<c04774f0>] > (wait_for_common+0xb8/0x14c) > [ 242.241539] [<c04774f0>] (wait_for_common) from [<c0362138>] > (mmc_wait_for_req+0x68/0x17c) > [ 242.249861] [<c0362138>] (mmc_wait_for_req) from [<c03622cc>] > (mmc_wait_for_cmd+0x80/0xa0) > [ 242.258002] [<c03622cc>] (mmc_wait_for_cmd) from [<c0367e50>] > (mmc_go_idle+0x78/0xf8) > [ 242.265796] [<c0367e50>] (mmc_go_idle) from [<c0363e2c>] > (mmc_rescan+0x280/0x314) > [ 242.273253] [<c0363e2c>] (mmc_rescan) from [<c0034764>] > (process_one_work+0x120/0x324) > [ 242.281135] [<c0034764>] (process_one_work) from [<c00349cc>] > (worker_thread+0x30/0x42c) > [ 242.289194] [<c00349cc>] (worker_thread) from [<c003926c>] > (kthread+0xd8/0xf4) > [ 242.296389] [<c003926c>] (kthread) from [<c000e7c0>] > (ret_from_fork+0x14/0x34) > > Just for record, Exynos4412/dw_mmc_240a with the same configuration > (no MMC card, broken-cd) works OK without patches. > > > Regards > Andrzej > >>> >>>> if (!clock) { >>>> mci_writel(host, CLKENA, 0); >>>> -- >>>> 1.8.3.2 >>>> >>>> >>>> >>>> _______________________________________________ >>>> linux-arm-kernel mailing list >>>> linux-arm-kernel@lists.infradead.org >>>> http://lists.infradead.org/mailman/listinfo/linux-arm-kernel >>> >>> >> >> >> -- >> To unsubscribe from this list: send the line "unsubscribe linux-doc" in >> the body of a message to majordomo@vger.kernel.org >> More majordomo info at http://vger.kernel.org/majordomo-info.html >> > -- Regards, Alim ^ permalink raw reply related [flat|nested] 21+ messages in thread
* Re: [PATCH v2 1/2] mmc: dw_mmc: fix bug that cause 'Timeout sending command' 2015-02-11 23:20 ` Alim Akhtar @ 2015-02-12 2:28 ` addy ke 2015-02-12 11:10 ` Andrzej Hajda 2015-02-12 11:13 ` Andrzej Hajda 1 sibling, 1 reply; 21+ messages in thread From: addy ke @ 2015-02-12 2:28 UTC (permalink / raw) To: alim.akhtar, a.hajda Cc: robh+dt, pawel.moll, mark.rutland, ijc+devicetree, galak, rdunlap, tgih.jun, jh80.chung, chris, ulf.hansson, dinguyen, heiko, olof, dianders, sonnyrao, amstan, djkurtz, huangtao, devicetree, hl, linux-doc, yzq, zyw, zhangqing, linux-mmc, linux-kernel, kever.yang Hi Andrzej and Alim On 2015/2/12 07:20, Alim Akhtar wrote: > Hi Andrzej, > > On Wed, Feb 11, 2015 at 5:28 PM, Andrzej Hajda <a.hajda@samsung.com> wrote: >> Hi Alim, >> >> On 02/11/2015 03:57 AM, Addy wrote: >>> >>> On 2015/02/10 23:22, Alim Akhtar wrote: >>>> Hi Addy, >>>> >>>> On Mon, Feb 9, 2015 at 12:55 PM, Addy Ke <addy.ke@rock-chips.com> wrote: >>>>> Because of some uncertain factors, such as worse card or worse hardware, >>>>> DAT[3:0](the data lines) may be pulled down by card, and mmc controller >>>>> will be in busy state. This should not happend when mmc controller >>>>> send command to update card clocks. If this happends, mci_send_cmd will >>>>> be failed and we will get 'Timeout sending command', and then system will >>>>> be blocked. To avoid this, we need reset mmc controller. >>>>> >>>>> Signed-off-by: Addy Ke <addy.ke@rock-chips.com> >>>>> --- >>>>> drivers/mmc/host/dw_mmc.c | 28 ++++++++++++++++++++++++++++ >>>>> 1 file changed, 28 insertions(+) >>>>> >>>>> diff --git a/drivers/mmc/host/dw_mmc.c b/drivers/mmc/host/dw_mmc.c >>>>> index 4d2e3c2..b0b57e3 100644 >>>>> --- a/drivers/mmc/host/dw_mmc.c >>>>> +++ b/drivers/mmc/host/dw_mmc.c >>>>> @@ -100,6 +100,7 @@ struct idmac_desc { >>>>> }; >>>>> #endif /* CONFIG_MMC_DW_IDMAC */ >>>>> >>>>> +static int dw_mci_card_busy(struct mmc_host *mmc); >>>>> static bool dw_mci_reset(struct dw_mci *host); >>>>> static bool dw_mci_ctrl_reset(struct dw_mci *host, u32 reset); >>>>> >>>>> @@ -888,6 +889,31 @@ static void mci_send_cmd(struct dw_mci_slot *slot, u32 cmd, u32 arg) >>>>> cmd, arg, cmd_status); >>>>> } >>>>> >>>>> +static void dw_mci_wait_busy(struct dw_mci_slot *slot) >>>>> +{ >>>>> + struct dw_mci *host = slot->host; >>>>> + unsigned long timeout = jiffies + msecs_to_jiffies(500); >>>>> + >>>> Why 500 msec? >>> This timeout value is the same as mci_send_cmd: >>> static void mci_send_cmd(struct dw_mci_slot *slot, u32 cmd, u32 arg) >>> { >>> struct dw_mci *host = slot->host; >>> unsigned long timeout = jiffies + msecs_to_jiffies(500); >>> .... >>> } >>> >>> I have not clear that which is suitable. >>> Do you have any suggestion on it? >>>> >>>>> + do { >>>>> + if (!dw_mci_card_busy(slot->mmc)) >>>>> + return; >>>>> + cpu_relax(); >>>>> + } while (time_before(jiffies, timeout)); >>>>> + >>>>> + dev_err(host->dev, "Data busy (status %#x)\n", >>>>> + mci_readl(slot->host, STATUS)); >>>>> + >>>>> + /* >>>>> + * Data busy, this should not happend when mmc controller send command >>>>> + * to update card clocks in non-volt-switch state. If it happends, we >>>>> + * should reset controller to avoid getting "Timeout sending command". >>>>> + */ >>>>> + dw_mci_ctrl_reset(host, SDMMC_CTRL_ALL_RESET_FLAGS); >>>>> + >>>> Why you need to reset all blocks? may be CTRL_RESET is good enough here. >>> I have tested on rk3288, if only reset ctroller, data busy bit will not >>> be cleaned,and we will still get >>> >>> "Timeout sending command". >>> >>>> >>>>> + /* Fail to reset controller or still data busy, WARN_ON! */ >>>>> + WARN_ON(dw_mci_card_busy(slot->mmc)); >>>>> +} >>>>> + >>>>> static void dw_mci_setup_bus(struct dw_mci_slot *slot, bool force_clkinit) >>>>> { >>>>> struct dw_mci *host = slot->host; >>>>> @@ -899,6 +925,8 @@ static void dw_mci_setup_bus(struct dw_mci_slot *slot, bool force_clkinit) >>>>> /* We must continue to set bit 28 in CMD until the change is complete */ >>>>> if (host->state == STATE_WAITING_CMD11_DONE) >>>>> sdmmc_cmd_bits |= SDMMC_CMD_VOLT_SWITCH; >>>>> + else >>>>> + dw_mci_wait_busy(slot); >>>>> >>>> hmm...I would suggest you to call dw_mci_wait_busy() from inside >>>> mci_send_cmd(), seems like dw_mmc hangs while sending update clock cmd >>>> in multiple cases.see [1] >>>> >>>> [1]: http://permalink.gmane.org/gmane.linux.kernel.mmc/31140 >>> I think this patch is more reasonable. >>> So I will resend patches based on this patch. >>> thank you! >> >> I have tested your patches instead [1] above and they do not solve my issue: >> Board: odroid-xu3/exynos5422/dw_mmc_250a. >> MMC card: absent, broken-cd quirk >> SD card: present >> > I doubt $SUBJECT patch in current form can resolve you issue. I have > already given comments on $subject patch. > > Can you try out below patch (I have not tested yet) on top of $SUBJECT patch? > > ======= > diff --git a/drivers/mmc/host/dw_mmc.c b/drivers/mmc/host/dw_mmc.c > index b0b57e3..ea87844 100644 > --- a/drivers/mmc/host/dw_mmc.c > +++ b/drivers/mmc/host/dw_mmc.c > @@ -101,6 +101,7 @@ struct idmac_desc { > #endif /* CONFIG_MMC_DW_IDMAC */ > > static int dw_mci_card_busy(struct mmc_host *mmc); > +static void dw_mci_wait_busy(struct dw_mci_slot *slot); > static bool dw_mci_reset(struct dw_mci *host); > static bool dw_mci_ctrl_reset(struct dw_mci *host, u32 reset); > > @@ -874,16 +875,22 @@ static void mci_send_cmd(struct dw_mci_slot > *slot, u32 cmd, u32 arg) > struct dw_mci *host = slot->host; > unsigned long timeout = jiffies + msecs_to_jiffies(500); > unsigned int cmd_status = 0; > + int re_try = 3; /* just random for now, 1 re-try should be ok */ > > - mci_writel(host, CMDARG, arg); > - wmb(); > - mci_writel(host, CMD, SDMMC_CMD_START | cmd); > + while(re_try--) { > + mci_writel(host, CMDARG, arg); > + wmb(); > + mci_writel(host, CMD, SDMMC_CMD_START | cmd); > > - while (time_before(jiffies, timeout)) { > - cmd_status = mci_readl(host, CMD); > - if (!(cmd_status & SDMMC_CMD_START)) > - return; > + while (time_before(jiffies, timeout)) { > + cmd_status = mci_readl(host, CMD); > + if (!(cmd_status & SDMMC_CMD_START)) > + return; > + } > + > + dw_mci_wait_busy(slot); > } > + > dev_err(&slot->mmc->class_dev, > "Timeout sending command (cmd %#x arg %#x status %#x)\n", > cmd, arg, cmd_status); > @@ -925,8 +932,6 @@ static void dw_mci_setup_bus(struct dw_mci_slot > *slot, bool force_clkinit) > /* We must continue to set bit 28 in CMD until the change is complete */ > if (host->state == STATE_WAITING_CMD11_DONE) > sdmmc_cmd_bits |= SDMMC_CMD_VOLT_SWITCH; > - else > - dw_mci_wait_busy(slot); > > if (!clock) { > mci_writel(host, CLKENA, 0); > > ===== end ====== The reason why we are fail to send command is that we got data busy in none-switch-volt state(host->state != STATE_WAITING_CMD11_DONE). So: if(host->state != STATE_WAITING_CMD11_DONE), we must wait until data not busy, And if (host->state == STATE_WAITING_CMD11_DONE) we should not wait. >> System hangs during boot after few minutes kernel spits: >> [ 242.188098] INFO: task kworker/u16:1:50 blocked for more than 120 >> seconds. >> [ 242.193524] Not tainted >> 3.19.0-next-20150210-00002-gf96831b-dirty #3834 >> [ 242.200622] "echo 0 > /proc/sys/kernel/hung_task_timeout_secs" >> disables this message. >> [ 242.208422] kworker/u16:1 D c04766ac 0 50 2 0x00000000 >> [ 242.214756] Workqueue: kmmcd mmc_rescan >> [ 242.218553] [<c04766ac>] (__schedule) from [<c0476a58>] >> (schedule+0x34/0x98) >> [ 242.225591] [<c0476a58>] (schedule) from [<c047a4dc>] >> (schedule_timeout+0x110/0x164) >> [ 242.233302] [<c047a4dc>] (schedule_timeout) from [<c04774f0>] >> (wait_for_common+0xb8/0x14c) >> [ 242.241539] [<c04774f0>] (wait_for_common) from [<c0362138>] >> (mmc_wait_for_req+0x68/0x17c) >> [ 242.249861] [<c0362138>] (mmc_wait_for_req) from [<c03622cc>] >> (mmc_wait_for_cmd+0x80/0xa0) >> [ 242.258002] [<c03622cc>] (mmc_wait_for_cmd) from [<c0367e50>] >> (mmc_go_idle+0x78/0xf8) >> [ 242.265796] [<c0367e50>] (mmc_go_idle) from [<c0363e2c>] >> (mmc_rescan+0x280/0x314) >> [ 242.273253] [<c0363e2c>] (mmc_rescan) from [<c0034764>] >> (process_one_work+0x120/0x324) >> [ 242.281135] [<c0034764>] (process_one_work) from [<c00349cc>] >> (worker_thread+0x30/0x42c) >> [ 242.289194] [<c00349cc>] (worker_thread) from [<c003926c>] >> (kthread+0xd8/0xf4) >> [ 242.296389] [<c003926c>] (kthread) from [<c000e7c0>] >> (ret_from_fork+0x14/0x34) >> >> Just for record, Exynos4412/dw_mmc_240a with the same configuration >> (no MMC card, broken-cd) works OK without patches. This is because mmc start command,but mmc_request_done() is't called. I have ever found this issue. I found that host does't get DTO interrupt when mmc send command to read data. I have sent a patch for it, see: https://patchwork.kernel.org/patch/5426531/ Would you please merge it and test again? >> >> >> Regards >> Andrzej >> >>>> >>>>> if (!clock) { >>>>> mci_writel(host, CLKENA, 0); >>>>> -- >>>>> 1.8.3.2 >>>>> >>>>> >>>>> >>>>> _______________________________________________ >>>>> linux-arm-kernel mailing list >>>>> linux-arm-kernel@lists.infradead.org >>>>> http://lists.infradead.org/mailman/listinfo/linux-arm-kernel >>>> >>>> >>> >>> >>> -- >>> To unsubscribe from this list: send the line "unsubscribe linux-doc" in >>> the body of a message to majordomo@vger.kernel.org >>> More majordomo info at http://vger.kernel.org/majordomo-info.html >>> >> > > > ^ permalink raw reply [flat|nested] 21+ messages in thread
* Re: [PATCH v2 1/2] mmc: dw_mmc: fix bug that cause 'Timeout sending command' 2015-02-12 2:28 ` addy ke @ 2015-02-12 11:10 ` Andrzej Hajda [not found] ` <54DC8A1B.7070402-Sze3O3UU22JBDgjK7y7TUQ@public.gmane.org> 0 siblings, 1 reply; 21+ messages in thread From: Andrzej Hajda @ 2015-02-12 11:10 UTC (permalink / raw) To: addy ke, alim.akhtar Cc: robh+dt, pawel.moll, mark.rutland, ijc+devicetree, galak, rdunlap, tgih.jun, jh80.chung, chris, ulf.hansson, dinguyen, heiko, olof, dianders, sonnyrao, amstan, djkurtz, huangtao, devicetree, hl, linux-doc, yzq, zyw, zhangqing, linux-mmc, linux-kernel, kever.yang On 02/12/2015 03:28 AM, addy ke wrote: > Hi Andrzej and Alim > > On 2015/2/12 07:20, Alim Akhtar wrote: >> Hi Andrzej, >> >> On Wed, Feb 11, 2015 at 5:28 PM, Andrzej Hajda <a.hajda@samsung.com> wrote: >>> Hi Alim, >>> >>> On 02/11/2015 03:57 AM, Addy wrote: >>>> On 2015/02/10 23:22, Alim Akhtar wrote: >>>>> Hi Addy, >>>>> >>>>> On Mon, Feb 9, 2015 at 12:55 PM, Addy Ke <addy.ke@rock-chips.com> wrote: >>>>>> Because of some uncertain factors, such as worse card or worse hardware, >>>>>> DAT[3:0](the data lines) may be pulled down by card, and mmc controller >>>>>> will be in busy state. This should not happend when mmc controller >>>>>> send command to update card clocks. If this happends, mci_send_cmd will >>>>>> be failed and we will get 'Timeout sending command', and then system will >>>>>> be blocked. To avoid this, we need reset mmc controller. >>>>>> >>>>>> Signed-off-by: Addy Ke <addy.ke@rock-chips.com> >>>>>> --- >>>>>> drivers/mmc/host/dw_mmc.c | 28 ++++++++++++++++++++++++++++ >>>>>> 1 file changed, 28 insertions(+) >>>>>> >>>>>> diff --git a/drivers/mmc/host/dw_mmc.c b/drivers/mmc/host/dw_mmc.c >>>>>> index 4d2e3c2..b0b57e3 100644 >>>>>> --- a/drivers/mmc/host/dw_mmc.c >>>>>> +++ b/drivers/mmc/host/dw_mmc.c >>>>>> @@ -100,6 +100,7 @@ struct idmac_desc { >>>>>> }; >>>>>> #endif /* CONFIG_MMC_DW_IDMAC */ >>>>>> >>>>>> +static int dw_mci_card_busy(struct mmc_host *mmc); >>>>>> static bool dw_mci_reset(struct dw_mci *host); >>>>>> static bool dw_mci_ctrl_reset(struct dw_mci *host, u32 reset); >>>>>> >>>>>> @@ -888,6 +889,31 @@ static void mci_send_cmd(struct dw_mci_slot *slot, u32 cmd, u32 arg) >>>>>> cmd, arg, cmd_status); >>>>>> } >>>>>> >>>>>> +static void dw_mci_wait_busy(struct dw_mci_slot *slot) >>>>>> +{ >>>>>> + struct dw_mci *host = slot->host; >>>>>> + unsigned long timeout = jiffies + msecs_to_jiffies(500); >>>>>> + >>>>> Why 500 msec? >>>> This timeout value is the same as mci_send_cmd: >>>> static void mci_send_cmd(struct dw_mci_slot *slot, u32 cmd, u32 arg) >>>> { >>>> struct dw_mci *host = slot->host; >>>> unsigned long timeout = jiffies + msecs_to_jiffies(500); >>>> .... >>>> } >>>> >>>> I have not clear that which is suitable. >>>> Do you have any suggestion on it? >>>>>> + do { >>>>>> + if (!dw_mci_card_busy(slot->mmc)) >>>>>> + return; >>>>>> + cpu_relax(); >>>>>> + } while (time_before(jiffies, timeout)); >>>>>> + >>>>>> + dev_err(host->dev, "Data busy (status %#x)\n", >>>>>> + mci_readl(slot->host, STATUS)); >>>>>> + >>>>>> + /* >>>>>> + * Data busy, this should not happend when mmc controller send command >>>>>> + * to update card clocks in non-volt-switch state. If it happends, we >>>>>> + * should reset controller to avoid getting "Timeout sending command". >>>>>> + */ >>>>>> + dw_mci_ctrl_reset(host, SDMMC_CTRL_ALL_RESET_FLAGS); >>>>>> + >>>>> Why you need to reset all blocks? may be CTRL_RESET is good enough here. >>>> I have tested on rk3288, if only reset ctroller, data busy bit will not >>>> be cleaned,and we will still get >>>> >>>> "Timeout sending command". >>>> >>>>>> + /* Fail to reset controller or still data busy, WARN_ON! */ >>>>>> + WARN_ON(dw_mci_card_busy(slot->mmc)); >>>>>> +} >>>>>> + >>>>>> static void dw_mci_setup_bus(struct dw_mci_slot *slot, bool force_clkinit) >>>>>> { >>>>>> struct dw_mci *host = slot->host; >>>>>> @@ -899,6 +925,8 @@ static void dw_mci_setup_bus(struct dw_mci_slot *slot, bool force_clkinit) >>>>>> /* We must continue to set bit 28 in CMD until the change is complete */ >>>>>> if (host->state == STATE_WAITING_CMD11_DONE) >>>>>> sdmmc_cmd_bits |= SDMMC_CMD_VOLT_SWITCH; >>>>>> + else >>>>>> + dw_mci_wait_busy(slot); >>>>>> >>>>> hmm...I would suggest you to call dw_mci_wait_busy() from inside >>>>> mci_send_cmd(), seems like dw_mmc hangs while sending update clock cmd >>>>> in multiple cases.see [1] >>>>> >>>>> [1]: http://permalink.gmane.org/gmane.linux.kernel.mmc/31140 >>>> I think this patch is more reasonable. >>>> So I will resend patches based on this patch. >>>> thank you! >>> I have tested your patches instead [1] above and they do not solve my issue: >>> Board: odroid-xu3/exynos5422/dw_mmc_250a. >>> MMC card: absent, broken-cd quirk >>> SD card: present >>> >> I doubt $SUBJECT patch in current form can resolve you issue. I have >> already given comments on $subject patch. >> >> Can you try out below patch (I have not tested yet) on top of $SUBJECT patch? >> >> ======= >> diff --git a/drivers/mmc/host/dw_mmc.c b/drivers/mmc/host/dw_mmc.c >> index b0b57e3..ea87844 100644 >> --- a/drivers/mmc/host/dw_mmc.c >> +++ b/drivers/mmc/host/dw_mmc.c >> @@ -101,6 +101,7 @@ struct idmac_desc { >> #endif /* CONFIG_MMC_DW_IDMAC */ >> >> static int dw_mci_card_busy(struct mmc_host *mmc); >> +static void dw_mci_wait_busy(struct dw_mci_slot *slot); >> static bool dw_mci_reset(struct dw_mci *host); >> static bool dw_mci_ctrl_reset(struct dw_mci *host, u32 reset); >> >> @@ -874,16 +875,22 @@ static void mci_send_cmd(struct dw_mci_slot >> *slot, u32 cmd, u32 arg) >> struct dw_mci *host = slot->host; >> unsigned long timeout = jiffies + msecs_to_jiffies(500); >> unsigned int cmd_status = 0; >> + int re_try = 3; /* just random for now, 1 re-try should be ok */ >> >> - mci_writel(host, CMDARG, arg); >> - wmb(); >> - mci_writel(host, CMD, SDMMC_CMD_START | cmd); >> + while(re_try--) { >> + mci_writel(host, CMDARG, arg); >> + wmb(); >> + mci_writel(host, CMD, SDMMC_CMD_START | cmd); >> >> - while (time_before(jiffies, timeout)) { >> - cmd_status = mci_readl(host, CMD); >> - if (!(cmd_status & SDMMC_CMD_START)) >> - return; >> + while (time_before(jiffies, timeout)) { >> + cmd_status = mci_readl(host, CMD); >> + if (!(cmd_status & SDMMC_CMD_START)) >> + return; >> + } >> + >> + dw_mci_wait_busy(slot); >> } >> + >> dev_err(&slot->mmc->class_dev, >> "Timeout sending command (cmd %#x arg %#x status %#x)\n", >> cmd, arg, cmd_status); >> @@ -925,8 +932,6 @@ static void dw_mci_setup_bus(struct dw_mci_slot >> *slot, bool force_clkinit) >> /* We must continue to set bit 28 in CMD until the change is complete */ >> if (host->state == STATE_WAITING_CMD11_DONE) >> sdmmc_cmd_bits |= SDMMC_CMD_VOLT_SWITCH; >> - else >> - dw_mci_wait_busy(slot); >> >> if (!clock) { >> mci_writel(host, CLKENA, 0); >> >> ===== end ====== > The reason why we are fail to send command is that we got data busy in > none-switch-volt state(host->state != STATE_WAITING_CMD11_DONE). > So: > if(host->state != STATE_WAITING_CMD11_DONE), we must wait until data not busy, > And if (host->state == STATE_WAITING_CMD11_DONE) we should not wait. > >>> System hangs during boot after few minutes kernel spits: >>> [ 242.188098] INFO: task kworker/u16:1:50 blocked for more than 120 >>> seconds. >>> [ 242.193524] Not tainted >>> 3.19.0-next-20150210-00002-gf96831b-dirty #3834 >>> [ 242.200622] "echo 0 > /proc/sys/kernel/hung_task_timeout_secs" >>> disables this message. >>> [ 242.208422] kworker/u16:1 D c04766ac 0 50 2 0x00000000 >>> [ 242.214756] Workqueue: kmmcd mmc_rescan >>> [ 242.218553] [<c04766ac>] (__schedule) from [<c0476a58>] >>> (schedule+0x34/0x98) >>> [ 242.225591] [<c0476a58>] (schedule) from [<c047a4dc>] >>> (schedule_timeout+0x110/0x164) >>> [ 242.233302] [<c047a4dc>] (schedule_timeout) from [<c04774f0>] >>> (wait_for_common+0xb8/0x14c) >>> [ 242.241539] [<c04774f0>] (wait_for_common) from [<c0362138>] >>> (mmc_wait_for_req+0x68/0x17c) >>> [ 242.249861] [<c0362138>] (mmc_wait_for_req) from [<c03622cc>] >>> (mmc_wait_for_cmd+0x80/0xa0) >>> [ 242.258002] [<c03622cc>] (mmc_wait_for_cmd) from [<c0367e50>] >>> (mmc_go_idle+0x78/0xf8) >>> [ 242.265796] [<c0367e50>] (mmc_go_idle) from [<c0363e2c>] >>> (mmc_rescan+0x280/0x314) >>> [ 242.273253] [<c0363e2c>] (mmc_rescan) from [<c0034764>] >>> (process_one_work+0x120/0x324) >>> [ 242.281135] [<c0034764>] (process_one_work) from [<c00349cc>] >>> (worker_thread+0x30/0x42c) >>> [ 242.289194] [<c00349cc>] (worker_thread) from [<c003926c>] >>> (kthread+0xd8/0xf4) >>> [ 242.296389] [<c003926c>] (kthread) from [<c000e7c0>] >>> (ret_from_fork+0x14/0x34) >>> >>> Just for record, Exynos4412/dw_mmc_240a with the same configuration >>> (no MMC card, broken-cd) works OK without patches. > This is because mmc start command,but mmc_request_done() is't called. > I have ever found this issue. > I found that host does't get DTO interrupt when mmc send command to read data. > I have sent a patch for it, see: > https://patchwork.kernel.org/patch/5426531/ > > Would you please merge it and test again? I have merged it and added quirk to exynos, but it does not help. There is still timeout: [ 242.188178] INFO: task kworker/u16:1:50 blocked for more than 120 seconds. [ 242.193605] Not tainted 3.19.0-next-20150212-00003-g7850750-dirty #3841 [ 242.200703] "echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this message. [ 242.208592] kworker/u16:1 D c04755f4 0 50 2 0x00000000 [ 242.214840] Workqueue: kmmcd mmc_rescan [ 242.218635] [<c04755f4>] (__schedule) from [<c04759a0>] (schedule+0x34/0x98) [ 242.225671] [<c04759a0>] (schedule) from [<c0479424>] (schedule_timeout+0x110/0x164) [ 242.233383] [<c0479424>] (schedule_timeout) from [<c0476438>] (wait_for_common+0xb8/0x14c) [ 242.241619] [<c0476438>] (wait_for_common) from [<c0361600>] (mmc_wait_for_req+0xb0/0x13c) [ 242.249848] [<c0361600>] (mmc_wait_for_req) from [<c036170c>] (mmc_wait_for_cmd+0x80/0xa0) [ 242.258086] [<c036170c>] (mmc_wait_for_cmd) from [<c03676e0>] (mmc_go_idle+0x78/0xf8) [ 242.265876] [<c03676e0>] (mmc_go_idle) from [<c0363700>] (mmc_rescan+0x25c/0x2e4) [ 242.273333] [<c0363700>] (mmc_rescan) from [<c0034764>] (process_one_work+0x120/0x324) [ 242.281216] [<c0034764>] (process_one_work) from [<c00349cc>] (worker_thread+0x30/0x42c) [ 242.289275] [<c00349cc>] (worker_thread) from [<c003926c>] (kthread+0xd8/0xf4) [ 242.296469] [<c003926c>] (kthread) from [<c000e7c0>] (ret_from_fork+0x14/0x34) Regards Andrzej >>> >>> Regards >>> Andrzej >>> >>>>>> if (!clock) { >>>>>> mci_writel(host, CLKENA, 0); >>>>>> -- >>>>>> 1.8.3.2 >>>>>> >>>>>> >>>>>> >>>>>> _______________________________________________ >>>>>> linux-arm-kernel mailing list >>>>>> linux-arm-kernel@lists.infradead.org >>>>>> http://lists.infradead.org/mailman/listinfo/linux-arm-kernel >>>>> >>>> >>>> -- >>>> To unsubscribe from this list: send the line "unsubscribe linux-doc" in >>>> the body of a message to majordomo@vger.kernel.org >>>> More majordomo info at http://vger.kernel.org/majordomo-info.html >>>> >> >> > ^ permalink raw reply [flat|nested] 21+ messages in thread
[parent not found: <54DC8A1B.7070402-Sze3O3UU22JBDgjK7y7TUQ@public.gmane.org>]
* Re: [PATCH v2 1/2] mmc: dw_mmc: fix bug that cause 'Timeout sending command' [not found] ` <54DC8A1B.7070402-Sze3O3UU22JBDgjK7y7TUQ@public.gmane.org> @ 2015-02-12 13:59 ` Alim Akhtar 2015-02-13 8:15 ` addy ke 0 siblings, 1 reply; 21+ messages in thread From: Alim Akhtar @ 2015-02-12 13:59 UTC (permalink / raw) To: Andrzej Hajda Cc: addy ke, robh+dt, pawel.moll-5wv7dgnIgG8, Mark Rutland, ijc+devicetree-KcIKpvwj1kUDXYZnReoRVg, Kumar Gala, rdunlap-wEGCiKHe2LqWVfeAwA7xHQ, Seungwon Jeon, Jaehoon Chung, Chris Ball, Ulf Hansson, dinguyen-EIB2kfCEclfQT0dZR+AlfA, Heiko Stübner, Olof Johansson, Douglas Anderson, Sonny Rao, Alexandru Stan, Daniel Kurtz, Tao Huang, devicetree-u79uwXL29TY76Z2rM5mHXA@public.gmane.org, Lin Huang, linux-doc-u79uwXL29TY76Z2rM5mHXA, 姚智情, Chris Zhong, zhangqing-TNX95d0MmH7DzftRWevZcw On Thu, Feb 12, 2015 at 4:40 PM, Andrzej Hajda <a.hajda-Sze3O3UU22JBDgjK7y7TUQ@public.gmane.org> wrote: > On 02/12/2015 03:28 AM, addy ke wrote: >> Hi Andrzej and Alim >> >> On 2015/2/12 07:20, Alim Akhtar wrote: >>> Hi Andrzej, >>> >>> On Wed, Feb 11, 2015 at 5:28 PM, Andrzej Hajda <a.hajda-Sze3O3UU22JBDgjK7y7TUQ@public.gmane.org> wrote: >>>> Hi Alim, >>>> >>>> On 02/11/2015 03:57 AM, Addy wrote: >>>>> On 2015/02/10 23:22, Alim Akhtar wrote: >>>>>> Hi Addy, >>>>>> >>>>>> On Mon, Feb 9, 2015 at 12:55 PM, Addy Ke <addy.ke-TNX95d0MmH7DzftRWevZcw@public.gmane.org> wrote: >>>>>>> Because of some uncertain factors, such as worse card or worse hardware, >>>>>>> DAT[3:0](the data lines) may be pulled down by card, and mmc controller >>>>>>> will be in busy state. This should not happend when mmc controller >>>>>>> send command to update card clocks. If this happends, mci_send_cmd will >>>>>>> be failed and we will get 'Timeout sending command', and then system will >>>>>>> be blocked. To avoid this, we need reset mmc controller. >>>>>>> >>>>>>> Signed-off-by: Addy Ke <addy.ke-TNX95d0MmH7DzftRWevZcw@public.gmane.org> >>>>>>> --- >>>>>>> drivers/mmc/host/dw_mmc.c | 28 ++++++++++++++++++++++++++++ >>>>>>> 1 file changed, 28 insertions(+) >>>>>>> >>>>>>> diff --git a/drivers/mmc/host/dw_mmc.c b/drivers/mmc/host/dw_mmc.c >>>>>>> index 4d2e3c2..b0b57e3 100644 >>>>>>> --- a/drivers/mmc/host/dw_mmc.c >>>>>>> +++ b/drivers/mmc/host/dw_mmc.c >>>>>>> @@ -100,6 +100,7 @@ struct idmac_desc { >>>>>>> }; >>>>>>> #endif /* CONFIG_MMC_DW_IDMAC */ >>>>>>> >>>>>>> +static int dw_mci_card_busy(struct mmc_host *mmc); >>>>>>> static bool dw_mci_reset(struct dw_mci *host); >>>>>>> static bool dw_mci_ctrl_reset(struct dw_mci *host, u32 reset); >>>>>>> >>>>>>> @@ -888,6 +889,31 @@ static void mci_send_cmd(struct dw_mci_slot *slot, u32 cmd, u32 arg) >>>>>>> cmd, arg, cmd_status); >>>>>>> } >>>>>>> >>>>>>> +static void dw_mci_wait_busy(struct dw_mci_slot *slot) >>>>>>> +{ >>>>>>> + struct dw_mci *host = slot->host; >>>>>>> + unsigned long timeout = jiffies + msecs_to_jiffies(500); >>>>>>> + >>>>>> Why 500 msec? >>>>> This timeout value is the same as mci_send_cmd: >>>>> static void mci_send_cmd(struct dw_mci_slot *slot, u32 cmd, u32 arg) >>>>> { >>>>> struct dw_mci *host = slot->host; >>>>> unsigned long timeout = jiffies + msecs_to_jiffies(500); >>>>> .... >>>>> } >>>>> >>>>> I have not clear that which is suitable. >>>>> Do you have any suggestion on it? >>>>>>> + do { >>>>>>> + if (!dw_mci_card_busy(slot->mmc)) >>>>>>> + return; >>>>>>> + cpu_relax(); >>>>>>> + } while (time_before(jiffies, timeout)); >>>>>>> + >>>>>>> + dev_err(host->dev, "Data busy (status %#x)\n", >>>>>>> + mci_readl(slot->host, STATUS)); >>>>>>> + >>>>>>> + /* >>>>>>> + * Data busy, this should not happend when mmc controller send command >>>>>>> + * to update card clocks in non-volt-switch state. If it happends, we >>>>>>> + * should reset controller to avoid getting "Timeout sending command". >>>>>>> + */ >>>>>>> + dw_mci_ctrl_reset(host, SDMMC_CTRL_ALL_RESET_FLAGS); >>>>>>> + >>>>>> Why you need to reset all blocks? may be CTRL_RESET is good enough here. >>>>> I have tested on rk3288, if only reset ctroller, data busy bit will not >>>>> be cleaned,and we will still get >>>>> >>>>> "Timeout sending command". >>>>> >>>>>>> + /* Fail to reset controller or still data busy, WARN_ON! */ >>>>>>> + WARN_ON(dw_mci_card_busy(slot->mmc)); >>>>>>> +} >>>>>>> + >>>>>>> static void dw_mci_setup_bus(struct dw_mci_slot *slot, bool force_clkinit) >>>>>>> { >>>>>>> struct dw_mci *host = slot->host; >>>>>>> @@ -899,6 +925,8 @@ static void dw_mci_setup_bus(struct dw_mci_slot *slot, bool force_clkinit) >>>>>>> /* We must continue to set bit 28 in CMD until the change is complete */ >>>>>>> if (host->state == STATE_WAITING_CMD11_DONE) >>>>>>> sdmmc_cmd_bits |= SDMMC_CMD_VOLT_SWITCH; >>>>>>> + else >>>>>>> + dw_mci_wait_busy(slot); >>>>>>> >>>>>> hmm...I would suggest you to call dw_mci_wait_busy() from inside >>>>>> mci_send_cmd(), seems like dw_mmc hangs while sending update clock cmd >>>>>> in multiple cases.see [1] >>>>>> >>>>>> [1]: http://permalink.gmane.org/gmane.linux.kernel.mmc/31140 >>>>> I think this patch is more reasonable. >>>>> So I will resend patches based on this patch. >>>>> thank you! >>>> I have tested your patches instead [1] above and they do not solve my issue: >>>> Board: odroid-xu3/exynos5422/dw_mmc_250a. >>>> MMC card: absent, broken-cd quirk >>>> SD card: present >>>> >>> I doubt $SUBJECT patch in current form can resolve you issue. I have >>> already given comments on $subject patch. >>> >>> Can you try out below patch (I have not tested yet) on top of $SUBJECT patch? >>> >>> ======= >>> diff --git a/drivers/mmc/host/dw_mmc.c b/drivers/mmc/host/dw_mmc.c >>> index b0b57e3..ea87844 100644 >>> --- a/drivers/mmc/host/dw_mmc.c >>> +++ b/drivers/mmc/host/dw_mmc.c >>> @@ -101,6 +101,7 @@ struct idmac_desc { >>> #endif /* CONFIG_MMC_DW_IDMAC */ >>> >>> static int dw_mci_card_busy(struct mmc_host *mmc); >>> +static void dw_mci_wait_busy(struct dw_mci_slot *slot); >>> static bool dw_mci_reset(struct dw_mci *host); >>> static bool dw_mci_ctrl_reset(struct dw_mci *host, u32 reset); >>> >>> @@ -874,16 +875,22 @@ static void mci_send_cmd(struct dw_mci_slot >>> *slot, u32 cmd, u32 arg) >>> struct dw_mci *host = slot->host; >>> unsigned long timeout = jiffies + msecs_to_jiffies(500); >>> unsigned int cmd_status = 0; >>> + int re_try = 3; /* just random for now, 1 re-try should be ok */ >>> >>> - mci_writel(host, CMDARG, arg); >>> - wmb(); >>> - mci_writel(host, CMD, SDMMC_CMD_START | cmd); >>> + while(re_try--) { >>> + mci_writel(host, CMDARG, arg); >>> + wmb(); >>> + mci_writel(host, CMD, SDMMC_CMD_START | cmd); >>> >>> - while (time_before(jiffies, timeout)) { >>> - cmd_status = mci_readl(host, CMD); >>> - if (!(cmd_status & SDMMC_CMD_START)) >>> - return; >>> + while (time_before(jiffies, timeout)) { >>> + cmd_status = mci_readl(host, CMD); >>> + if (!(cmd_status & SDMMC_CMD_START)) >>> + return; >>> + } >>> + >>> + dw_mci_wait_busy(slot); >>> } >>> + >>> dev_err(&slot->mmc->class_dev, >>> "Timeout sending command (cmd %#x arg %#x status %#x)\n", >>> cmd, arg, cmd_status); >>> @@ -925,8 +932,6 @@ static void dw_mci_setup_bus(struct dw_mci_slot >>> *slot, bool force_clkinit) >>> /* We must continue to set bit 28 in CMD until the change is complete */ >>> if (host->state == STATE_WAITING_CMD11_DONE) >>> sdmmc_cmd_bits |= SDMMC_CMD_VOLT_SWITCH; >>> - else >>> - dw_mci_wait_busy(slot); >>> >>> if (!clock) { >>> mci_writel(host, CLKENA, 0); >>> >>> ===== end ====== >> The reason why we are fail to send command is that we got data busy in >> none-switch-volt state(host->state != STATE_WAITING_CMD11_DONE). >> So: >> if(host->state != STATE_WAITING_CMD11_DONE), we must wait until data not busy, >> And if (host->state == STATE_WAITING_CMD11_DONE) we should not wait. >> >>>> System hangs during boot after few minutes kernel spits: >>>> [ 242.188098] INFO: task kworker/u16:1:50 blocked for more than 120 >>>> seconds. >>>> [ 242.193524] Not tainted >>>> 3.19.0-next-20150210-00002-gf96831b-dirty #3834 >>>> [ 242.200622] "echo 0 > /proc/sys/kernel/hung_task_timeout_secs" >>>> disables this message. >>>> [ 242.208422] kworker/u16:1 D c04766ac 0 50 2 0x00000000 >>>> [ 242.214756] Workqueue: kmmcd mmc_rescan >>>> [ 242.218553] [<c04766ac>] (__schedule) from [<c0476a58>] >>>> (schedule+0x34/0x98) >>>> [ 242.225591] [<c0476a58>] (schedule) from [<c047a4dc>] >>>> (schedule_timeout+0x110/0x164) >>>> [ 242.233302] [<c047a4dc>] (schedule_timeout) from [<c04774f0>] >>>> (wait_for_common+0xb8/0x14c) >>>> [ 242.241539] [<c04774f0>] (wait_for_common) from [<c0362138>] >>>> (mmc_wait_for_req+0x68/0x17c) >>>> [ 242.249861] [<c0362138>] (mmc_wait_for_req) from [<c03622cc>] >>>> (mmc_wait_for_cmd+0x80/0xa0) >>>> [ 242.258002] [<c03622cc>] (mmc_wait_for_cmd) from [<c0367e50>] >>>> (mmc_go_idle+0x78/0xf8) >>>> [ 242.265796] [<c0367e50>] (mmc_go_idle) from [<c0363e2c>] >>>> (mmc_rescan+0x280/0x314) >>>> [ 242.273253] [<c0363e2c>] (mmc_rescan) from [<c0034764>] >>>> (process_one_work+0x120/0x324) >>>> [ 242.281135] [<c0034764>] (process_one_work) from [<c00349cc>] >>>> (worker_thread+0x30/0x42c) >>>> [ 242.289194] [<c00349cc>] (worker_thread) from [<c003926c>] >>>> (kthread+0xd8/0xf4) >>>> [ 242.296389] [<c003926c>] (kthread) from [<c000e7c0>] >>>> (ret_from_fork+0x14/0x34) >>>> >>>> Just for record, Exynos4412/dw_mmc_240a with the same configuration >>>> (no MMC card, broken-cd) works OK without patches. >> This is because mmc start command,but mmc_request_done() is't called. >> I have ever found this issue. >> I found that host does't get DTO interrupt when mmc send command to read data. >> I have sent a patch for it, see: >> https://patchwork.kernel.org/patch/5426531/ >> >> Would you please merge it and test again? > > I have merged it and added quirk to exynos, but it does not help. There > is still timeout: > I don't think this DTO issue. I think we need a way to reproduce this, at least on Exyons5422/5800. what type of sd card is being used? Are you trying UHS-I mode by any chance? > [ 242.188178] INFO: task kworker/u16:1:50 blocked for more than 120 > seconds. > [ 242.193605] Not tainted > 3.19.0-next-20150212-00003-g7850750-dirty #3841 > [ 242.200703] "echo 0 > /proc/sys/kernel/hung_task_timeout_secs" > disables this message. > [ 242.208592] kworker/u16:1 D c04755f4 0 50 2 0x00000000 > [ 242.214840] Workqueue: kmmcd mmc_rescan > [ 242.218635] [<c04755f4>] (__schedule) from [<c04759a0>] > (schedule+0x34/0x98) > [ 242.225671] [<c04759a0>] (schedule) from [<c0479424>] > (schedule_timeout+0x110/0x164) > [ 242.233383] [<c0479424>] (schedule_timeout) from [<c0476438>] > (wait_for_common+0xb8/0x14c) > [ 242.241619] [<c0476438>] (wait_for_common) from [<c0361600>] > (mmc_wait_for_req+0xb0/0x13c) > [ 242.249848] [<c0361600>] (mmc_wait_for_req) from [<c036170c>] > (mmc_wait_for_cmd+0x80/0xa0) > [ 242.258086] [<c036170c>] (mmc_wait_for_cmd) from [<c03676e0>] > (mmc_go_idle+0x78/0xf8) > [ 242.265876] [<c03676e0>] (mmc_go_idle) from [<c0363700>] > (mmc_rescan+0x25c/0x2e4) > [ 242.273333] [<c0363700>] (mmc_rescan) from [<c0034764>] > (process_one_work+0x120/0x324) > [ 242.281216] [<c0034764>] (process_one_work) from [<c00349cc>] > (worker_thread+0x30/0x42c) > [ 242.289275] [<c00349cc>] (worker_thread) from [<c003926c>] > (kthread+0xd8/0xf4) > [ 242.296469] [<c003926c>] (kthread) from [<c000e7c0>] > (ret_from_fork+0x14/0x34) > > > Regards > Andrzej > >>>> >>>> Regards >>>> Andrzej >>>> >>>>>>> if (!clock) { >>>>>>> mci_writel(host, CLKENA, 0); >>>>>>> -- >>>>>>> 1.8.3.2 >>>>>>> >>>>>>> >>>>>>> >>>>>>> _______________________________________________ >>>>>>> linux-arm-kernel mailing list >>>>>>> linux-arm-kernel-IAPFreCvJWM7uuMidbF8XUB+6BGkLq7r@public.gmane.org >>>>>>> http://lists.infradead.org/mailman/listinfo/linux-arm-kernel >>>>>> >>>>> >>>>> -- >>>>> To unsubscribe from this list: send the line "unsubscribe linux-doc" in >>>>> the body of a message to majordomo-u79uwXL29TY76Z2rM5mHXA@public.gmane.org >>>>> More majordomo info at http://vger.kernel.org/majordomo-info.html >>>>> >>> >>> >> > -- Regards, Alim -- To unsubscribe from this list: send the line "unsubscribe devicetree" in the body of a message to majordomo-u79uwXL29TY76Z2rM5mHXA@public.gmane.org More majordomo info at http://vger.kernel.org/majordomo-info.html ^ permalink raw reply [flat|nested] 21+ messages in thread
* Re: [PATCH v2 1/2] mmc: dw_mmc: fix bug that cause 'Timeout sending command' 2015-02-12 13:59 ` Alim Akhtar @ 2015-02-13 8:15 ` addy ke 0 siblings, 0 replies; 21+ messages in thread From: addy ke @ 2015-02-13 8:15 UTC (permalink / raw) To: alim.akhtar, a.hajda Cc: robh+dt, pawel.moll, mark.rutland, ijc+devicetree, galak, rdunlap, tgih.jun, jh80.chung, chris, ulf.hansson, dinguyen, heiko, olof, dianders, sonnyrao, amstan, djkurtz, huangtao, devicetree, hl, linux-doc, yzq, zyw, zhangqing, linux-mmc, linux-kernel, kever.yang On 2015/2/12 21:59, Alim Akhtar wrote: > On Thu, Feb 12, 2015 at 4:40 PM, Andrzej Hajda <a.hajda@samsung.com> wrote: >> On 02/12/2015 03:28 AM, addy ke wrote: >>> Hi Andrzej and Alim >>> >>> On 2015/2/12 07:20, Alim Akhtar wrote: >>>> Hi Andrzej, >>>> >>>> On Wed, Feb 11, 2015 at 5:28 PM, Andrzej Hajda <a.hajda@samsung.com> wrote: >>>>> Hi Alim, >>>>> >>>>> On 02/11/2015 03:57 AM, Addy wrote: >>>>>> On 2015/02/10 23:22, Alim Akhtar wrote: >>>>>>> Hi Addy, >>>>>>> >>>>>>> On Mon, Feb 9, 2015 at 12:55 PM, Addy Ke <addy.ke@rock-chips.com> wrote: >>>>>>>> Because of some uncertain factors, such as worse card or worse hardware, >>>>>>>> DAT[3:0](the data lines) may be pulled down by card, and mmc controller >>>>>>>> will be in busy state. This should not happend when mmc controller >>>>>>>> send command to update card clocks. If this happends, mci_send_cmd will >>>>>>>> be failed and we will get 'Timeout sending command', and then system will >>>>>>>> be blocked. To avoid this, we need reset mmc controller. >>>>>>>> >>>>>>>> Signed-off-by: Addy Ke <addy.ke@rock-chips.com> >>>>>>>> --- >>>>>>>> drivers/mmc/host/dw_mmc.c | 28 ++++++++++++++++++++++++++++ >>>>>>>> 1 file changed, 28 insertions(+) >>>>>>>> >>>>>>>> diff --git a/drivers/mmc/host/dw_mmc.c b/drivers/mmc/host/dw_mmc.c >>>>>>>> index 4d2e3c2..b0b57e3 100644 >>>>>>>> --- a/drivers/mmc/host/dw_mmc.c >>>>>>>> +++ b/drivers/mmc/host/dw_mmc.c >>>>>>>> @@ -100,6 +100,7 @@ struct idmac_desc { >>>>>>>> }; >>>>>>>> #endif /* CONFIG_MMC_DW_IDMAC */ >>>>>>>> >>>>>>>> +static int dw_mci_card_busy(struct mmc_host *mmc); >>>>>>>> static bool dw_mci_reset(struct dw_mci *host); >>>>>>>> static bool dw_mci_ctrl_reset(struct dw_mci *host, u32 reset); >>>>>>>> >>>>>>>> @@ -888,6 +889,31 @@ static void mci_send_cmd(struct dw_mci_slot *slot, u32 cmd, u32 arg) >>>>>>>> cmd, arg, cmd_status); >>>>>>>> } >>>>>>>> >>>>>>>> +static void dw_mci_wait_busy(struct dw_mci_slot *slot) >>>>>>>> +{ >>>>>>>> + struct dw_mci *host = slot->host; >>>>>>>> + unsigned long timeout = jiffies + msecs_to_jiffies(500); >>>>>>>> + >>>>>>> Why 500 msec? >>>>>> This timeout value is the same as mci_send_cmd: >>>>>> static void mci_send_cmd(struct dw_mci_slot *slot, u32 cmd, u32 arg) >>>>>> { >>>>>> struct dw_mci *host = slot->host; >>>>>> unsigned long timeout = jiffies + msecs_to_jiffies(500); >>>>>> .... >>>>>> } >>>>>> >>>>>> I have not clear that which is suitable. >>>>>> Do you have any suggestion on it? >>>>>>>> + do { >>>>>>>> + if (!dw_mci_card_busy(slot->mmc)) >>>>>>>> + return; >>>>>>>> + cpu_relax(); >>>>>>>> + } while (time_before(jiffies, timeout)); >>>>>>>> + >>>>>>>> + dev_err(host->dev, "Data busy (status %#x)\n", >>>>>>>> + mci_readl(slot->host, STATUS)); >>>>>>>> + >>>>>>>> + /* >>>>>>>> + * Data busy, this should not happend when mmc controller send command >>>>>>>> + * to update card clocks in non-volt-switch state. If it happends, we >>>>>>>> + * should reset controller to avoid getting "Timeout sending command". >>>>>>>> + */ >>>>>>>> + dw_mci_ctrl_reset(host, SDMMC_CTRL_ALL_RESET_FLAGS); >>>>>>>> + >>>>>>> Why you need to reset all blocks? may be CTRL_RESET is good enough here. >>>>>> I have tested on rk3288, if only reset ctroller, data busy bit will not >>>>>> be cleaned,and we will still get >>>>>> >>>>>> "Timeout sending command". >>>>>> >>>>>>>> + /* Fail to reset controller or still data busy, WARN_ON! */ >>>>>>>> + WARN_ON(dw_mci_card_busy(slot->mmc)); >>>>>>>> +} >>>>>>>> + >>>>>>>> static void dw_mci_setup_bus(struct dw_mci_slot *slot, bool force_clkinit) >>>>>>>> { >>>>>>>> struct dw_mci *host = slot->host; >>>>>>>> @@ -899,6 +925,8 @@ static void dw_mci_setup_bus(struct dw_mci_slot *slot, bool force_clkinit) >>>>>>>> /* We must continue to set bit 28 in CMD until the change is complete */ >>>>>>>> if (host->state == STATE_WAITING_CMD11_DONE) >>>>>>>> sdmmc_cmd_bits |= SDMMC_CMD_VOLT_SWITCH; >>>>>>>> + else >>>>>>>> + dw_mci_wait_busy(slot); >>>>>>>> >>>>>>> hmm...I would suggest you to call dw_mci_wait_busy() from inside >>>>>>> mci_send_cmd(), seems like dw_mmc hangs while sending update clock cmd >>>>>>> in multiple cases.see [1] >>>>>>> >>>>>>> [1]: http://permalink.gmane.org/gmane.linux.kernel.mmc/31140 >>>>>> I think this patch is more reasonable. >>>>>> So I will resend patches based on this patch. >>>>>> thank you! >>>>> I have tested your patches instead [1] above and they do not solve my issue: >>>>> Board: odroid-xu3/exynos5422/dw_mmc_250a. >>>>> MMC card: absent, broken-cd quirk >>>>> SD card: present >>>>> >>>> I doubt $SUBJECT patch in current form can resolve you issue. I have >>>> already given comments on $subject patch. >>>> >>>> Can you try out below patch (I have not tested yet) on top of $SUBJECT patch? >>>> >>>> ======= >>>> diff --git a/drivers/mmc/host/dw_mmc.c b/drivers/mmc/host/dw_mmc.c >>>> index b0b57e3..ea87844 100644 >>>> --- a/drivers/mmc/host/dw_mmc.c >>>> +++ b/drivers/mmc/host/dw_mmc.c >>>> @@ -101,6 +101,7 @@ struct idmac_desc { >>>> #endif /* CONFIG_MMC_DW_IDMAC */ >>>> >>>> static int dw_mci_card_busy(struct mmc_host *mmc); >>>> +static void dw_mci_wait_busy(struct dw_mci_slot *slot); >>>> static bool dw_mci_reset(struct dw_mci *host); >>>> static bool dw_mci_ctrl_reset(struct dw_mci *host, u32 reset); >>>> >>>> @@ -874,16 +875,22 @@ static void mci_send_cmd(struct dw_mci_slot >>>> *slot, u32 cmd, u32 arg) >>>> struct dw_mci *host = slot->host; >>>> unsigned long timeout = jiffies + msecs_to_jiffies(500); >>>> unsigned int cmd_status = 0; >>>> + int re_try = 3; /* just random for now, 1 re-try should be ok */ >>>> >>>> - mci_writel(host, CMDARG, arg); >>>> - wmb(); >>>> - mci_writel(host, CMD, SDMMC_CMD_START | cmd); >>>> + while(re_try--) { >>>> + mci_writel(host, CMDARG, arg); >>>> + wmb(); >>>> + mci_writel(host, CMD, SDMMC_CMD_START | cmd); >>>> >>>> - while (time_before(jiffies, timeout)) { >>>> - cmd_status = mci_readl(host, CMD); >>>> - if (!(cmd_status & SDMMC_CMD_START)) >>>> - return; >>>> + while (time_before(jiffies, timeout)) { >>>> + cmd_status = mci_readl(host, CMD); >>>> + if (!(cmd_status & SDMMC_CMD_START)) >>>> + return; >>>> + } >>>> + >>>> + dw_mci_wait_busy(slot); >>>> } >>>> + >>>> dev_err(&slot->mmc->class_dev, >>>> "Timeout sending command (cmd %#x arg %#x status %#x)\n", >>>> cmd, arg, cmd_status); >>>> @@ -925,8 +932,6 @@ static void dw_mci_setup_bus(struct dw_mci_slot >>>> *slot, bool force_clkinit) >>>> /* We must continue to set bit 28 in CMD until the change is complete */ >>>> if (host->state == STATE_WAITING_CMD11_DONE) >>>> sdmmc_cmd_bits |= SDMMC_CMD_VOLT_SWITCH; >>>> - else >>>> - dw_mci_wait_busy(slot); >>>> >>>> if (!clock) { >>>> mci_writel(host, CLKENA, 0); >>>> >>>> ===== end ====== >>> The reason why we are fail to send command is that we got data busy in >>> none-switch-volt state(host->state != STATE_WAITING_CMD11_DONE). >>> So: >>> if(host->state != STATE_WAITING_CMD11_DONE), we must wait until data not busy, >>> And if (host->state == STATE_WAITING_CMD11_DONE) we should not wait. >>> >>>>> System hangs during boot after few minutes kernel spits: >>>>> [ 242.188098] INFO: task kworker/u16:1:50 blocked for more than 120 >>>>> seconds. >>>>> [ 242.193524] Not tainted >>>>> 3.19.0-next-20150210-00002-gf96831b-dirty #3834 >>>>> [ 242.200622] "echo 0 > /proc/sys/kernel/hung_task_timeout_secs" >>>>> disables this message. >>>>> [ 242.208422] kworker/u16:1 D c04766ac 0 50 2 0x00000000 >>>>> [ 242.214756] Workqueue: kmmcd mmc_rescan >>>>> [ 242.218553] [<c04766ac>] (__schedule) from [<c0476a58>] >>>>> (schedule+0x34/0x98) >>>>> [ 242.225591] [<c0476a58>] (schedule) from [<c047a4dc>] >>>>> (schedule_timeout+0x110/0x164) >>>>> [ 242.233302] [<c047a4dc>] (schedule_timeout) from [<c04774f0>] >>>>> (wait_for_common+0xb8/0x14c) >>>>> [ 242.241539] [<c04774f0>] (wait_for_common) from [<c0362138>] >>>>> (mmc_wait_for_req+0x68/0x17c) >>>>> [ 242.249861] [<c0362138>] (mmc_wait_for_req) from [<c03622cc>] >>>>> (mmc_wait_for_cmd+0x80/0xa0) >>>>> [ 242.258002] [<c03622cc>] (mmc_wait_for_cmd) from [<c0367e50>] >>>>> (mmc_go_idle+0x78/0xf8) >>>>> [ 242.265796] [<c0367e50>] (mmc_go_idle) from [<c0363e2c>] >>>>> (mmc_rescan+0x280/0x314) >>>>> [ 242.273253] [<c0363e2c>] (mmc_rescan) from [<c0034764>] >>>>> (process_one_work+0x120/0x324) >>>>> [ 242.281135] [<c0034764>] (process_one_work) from [<c00349cc>] >>>>> (worker_thread+0x30/0x42c) >>>>> [ 242.289194] [<c00349cc>] (worker_thread) from [<c003926c>] >>>>> (kthread+0xd8/0xf4) >>>>> [ 242.296389] [<c003926c>] (kthread) from [<c000e7c0>] >>>>> (ret_from_fork+0x14/0x34) >>>>> >>>>> Just for record, Exynos4412/dw_mmc_240a with the same configuration >>>>> (no MMC card, broken-cd) works OK without patches. >>> This is because mmc start command,but mmc_request_done() is't called. >>> I have ever found this issue. >>> I found that host does't get DTO interrupt when mmc send command to read data. >>> I have sent a patch for it, see: >>> https://patchwork.kernel.org/patch/5426531/ >>> >>> Would you please merge it and test again? >> >> I have merged it and added quirk to exynos, but it does not help. There >> is still timeout: >> > I don't think this DTO issue. I think we need a way to reproduce this, > at least on Exyons5422/5800. > what type of sd card is being used? Are you trying UHS-I mode by any chance? > Emmc, sd card and sdio, all have this issue on rk3288, I have NOT found data busy before send command, but still not get DTO interrupt. And we have ever found little probability of no command done interrupt. >> [ 242.188178] INFO: task kworker/u16:1:50 blocked for more than 120 >> seconds. >> [ 242.193605] Not tainted >> 3.19.0-next-20150212-00003-g7850750-dirty #3841 >> [ 242.200703] "echo 0 > /proc/sys/kernel/hung_task_timeout_secs" >> disables this message. >> [ 242.208592] kworker/u16:1 D c04755f4 0 50 2 0x00000000 >> [ 242.214840] Workqueue: kmmcd mmc_rescan >> [ 242.218635] [<c04755f4>] (__schedule) from [<c04759a0>] >> (schedule+0x34/0x98) >> [ 242.225671] [<c04759a0>] (schedule) from [<c0479424>] >> (schedule_timeout+0x110/0x164) >> [ 242.233383] [<c0479424>] (schedule_timeout) from [<c0476438>] >> (wait_for_common+0xb8/0x14c) >> [ 242.241619] [<c0476438>] (wait_for_common) from [<c0361600>] >> (mmc_wait_for_req+0xb0/0x13c) >> [ 242.249848] [<c0361600>] (mmc_wait_for_req) from [<c036170c>] >> (mmc_wait_for_cmd+0x80/0xa0) >> [ 242.258086] [<c036170c>] (mmc_wait_for_cmd) from [<c03676e0>] >> (mmc_go_idle+0x78/0xf8) >> [ 242.265876] [<c03676e0>] (mmc_go_idle) from [<c0363700>] >> (mmc_rescan+0x25c/0x2e4) >> [ 242.273333] [<c0363700>] (mmc_rescan) from [<c0034764>] >> (process_one_work+0x120/0x324) >> [ 242.281216] [<c0034764>] (process_one_work) from [<c00349cc>] >> (worker_thread+0x30/0x42c) >> [ 242.289275] [<c00349cc>] (worker_thread) from [<c003926c>] >> (kthread+0xd8/0xf4) >> [ 242.296469] [<c003926c>] (kthread) from [<c000e7c0>] >> (ret_from_fork+0x14/0x34) >> >> >> Regards >> Andrzej >> >>>>> >>>>> Regards >>>>> Andrzej >>>>> >>>>>>>> if (!clock) { >>>>>>>> mci_writel(host, CLKENA, 0); >>>>>>>> -- >>>>>>>> 1.8.3.2 >>>>>>>> >>>>>>>> >>>>>>>> >>>>>>>> _______________________________________________ >>>>>>>> linux-arm-kernel mailing list >>>>>>>> linux-arm-kernel@lists.infradead.org >>>>>>>> http://lists.infradead.org/mailman/listinfo/linux-arm-kernel >>>>>>> >>>>>> >>>>>> -- >>>>>> To unsubscribe from this list: send the line "unsubscribe linux-doc" in >>>>>> the body of a message to majordomo@vger.kernel.org >>>>>> More majordomo info at http://vger.kernel.org/majordomo-info.html >>>>>> >>>> >>>> >>> >> > > > ^ permalink raw reply [flat|nested] 21+ messages in thread
* Re: [PATCH v2 1/2] mmc: dw_mmc: fix bug that cause 'Timeout sending command' 2015-02-11 23:20 ` Alim Akhtar 2015-02-12 2:28 ` addy ke @ 2015-02-12 11:13 ` Andrzej Hajda 2015-02-12 13:53 ` Alim Akhtar 1 sibling, 1 reply; 21+ messages in thread From: Andrzej Hajda @ 2015-02-12 11:13 UTC (permalink / raw) To: Alim Akhtar Cc: Addy, robh+dt, pawel.moll, Mark Rutland, ijc+devicetree, Kumar Gala, rdunlap, Seungwon Jeon, Jaehoon Chung, Chris Ball, Ulf Hansson, dinguyen, Heiko Stübner, Olof Johansson, Douglas Anderson, Sonny Rao, Alexandru Stan, Daniel Kurtz, Tao Huang, devicetree@vger.kernel.org, Lin Huang, linux-doc, 姚智情, Chris Zhong, zhangqing On 02/12/2015 12:20 AM, Alim Akhtar wrote: > Hi Andrzej, > > On Wed, Feb 11, 2015 at 5:28 PM, Andrzej Hajda <a.hajda@samsung.com> wrote: >> Hi Alim, >> >> On 02/11/2015 03:57 AM, Addy wrote: >>> On 2015/02/10 23:22, Alim Akhtar wrote: >>>> Hi Addy, >>>> >>>> On Mon, Feb 9, 2015 at 12:55 PM, Addy Ke <addy.ke@rock-chips.com> wrote: >>>>> Because of some uncertain factors, such as worse card or worse hardware, >>>>> DAT[3:0](the data lines) may be pulled down by card, and mmc controller >>>>> will be in busy state. This should not happend when mmc controller >>>>> send command to update card clocks. If this happends, mci_send_cmd will >>>>> be failed and we will get 'Timeout sending command', and then system will >>>>> be blocked. To avoid this, we need reset mmc controller. >>>>> >>>>> Signed-off-by: Addy Ke <addy.ke@rock-chips.com> >>>>> --- >>>>> drivers/mmc/host/dw_mmc.c | 28 ++++++++++++++++++++++++++++ >>>>> 1 file changed, 28 insertions(+) >>>>> >>>>> diff --git a/drivers/mmc/host/dw_mmc.c b/drivers/mmc/host/dw_mmc.c >>>>> index 4d2e3c2..b0b57e3 100644 >>>>> --- a/drivers/mmc/host/dw_mmc.c >>>>> +++ b/drivers/mmc/host/dw_mmc.c >>>>> @@ -100,6 +100,7 @@ struct idmac_desc { >>>>> }; >>>>> #endif /* CONFIG_MMC_DW_IDMAC */ >>>>> >>>>> +static int dw_mci_card_busy(struct mmc_host *mmc); >>>>> static bool dw_mci_reset(struct dw_mci *host); >>>>> static bool dw_mci_ctrl_reset(struct dw_mci *host, u32 reset); >>>>> >>>>> @@ -888,6 +889,31 @@ static void mci_send_cmd(struct dw_mci_slot *slot, u32 cmd, u32 arg) >>>>> cmd, arg, cmd_status); >>>>> } >>>>> >>>>> +static void dw_mci_wait_busy(struct dw_mci_slot *slot) >>>>> +{ >>>>> + struct dw_mci *host = slot->host; >>>>> + unsigned long timeout = jiffies + msecs_to_jiffies(500); >>>>> + >>>> Why 500 msec? >>> This timeout value is the same as mci_send_cmd: >>> static void mci_send_cmd(struct dw_mci_slot *slot, u32 cmd, u32 arg) >>> { >>> struct dw_mci *host = slot->host; >>> unsigned long timeout = jiffies + msecs_to_jiffies(500); >>> .... >>> } >>> >>> I have not clear that which is suitable. >>> Do you have any suggestion on it? >>>>> + do { >>>>> + if (!dw_mci_card_busy(slot->mmc)) >>>>> + return; >>>>> + cpu_relax(); >>>>> + } while (time_before(jiffies, timeout)); >>>>> + >>>>> + dev_err(host->dev, "Data busy (status %#x)\n", >>>>> + mci_readl(slot->host, STATUS)); >>>>> + >>>>> + /* >>>>> + * Data busy, this should not happend when mmc controller send command >>>>> + * to update card clocks in non-volt-switch state. If it happends, we >>>>> + * should reset controller to avoid getting "Timeout sending command". >>>>> + */ >>>>> + dw_mci_ctrl_reset(host, SDMMC_CTRL_ALL_RESET_FLAGS); >>>>> + >>>> Why you need to reset all blocks? may be CTRL_RESET is good enough here. >>> I have tested on rk3288, if only reset ctroller, data busy bit will not >>> be cleaned,and we will still get >>> >>> "Timeout sending command". >>> >>>>> + /* Fail to reset controller or still data busy, WARN_ON! */ >>>>> + WARN_ON(dw_mci_card_busy(slot->mmc)); >>>>> +} >>>>> + >>>>> static void dw_mci_setup_bus(struct dw_mci_slot *slot, bool force_clkinit) >>>>> { >>>>> struct dw_mci *host = slot->host; >>>>> @@ -899,6 +925,8 @@ static void dw_mci_setup_bus(struct dw_mci_slot *slot, bool force_clkinit) >>>>> /* We must continue to set bit 28 in CMD until the change is complete */ >>>>> if (host->state == STATE_WAITING_CMD11_DONE) >>>>> sdmmc_cmd_bits |= SDMMC_CMD_VOLT_SWITCH; >>>>> + else >>>>> + dw_mci_wait_busy(slot); >>>>> >>>> hmm...I would suggest you to call dw_mci_wait_busy() from inside >>>> mci_send_cmd(), seems like dw_mmc hangs while sending update clock cmd >>>> in multiple cases.see [1] >>>> >>>> [1]: http://permalink.gmane.org/gmane.linux.kernel.mmc/31140 >>> I think this patch is more reasonable. >>> So I will resend patches based on this patch. >>> thank you! >> I have tested your patches instead [1] above and they do not solve my issue: >> Board: odroid-xu3/exynos5422/dw_mmc_250a. >> MMC card: absent, broken-cd quirk >> SD card: present >> > I doubt $SUBJECT patch in current form can resolve you issue. I have > already given comments on $subject patch. > > Can you try out below patch (I have not tested yet) on top of $SUBJECT patch? > > ======= > diff --git a/drivers/mmc/host/dw_mmc.c b/drivers/mmc/host/dw_mmc.c > index b0b57e3..ea87844 100644 > --- a/drivers/mmc/host/dw_mmc.c > +++ b/drivers/mmc/host/dw_mmc.c > @@ -101,6 +101,7 @@ struct idmac_desc { > #endif /* CONFIG_MMC_DW_IDMAC */ > > static int dw_mci_card_busy(struct mmc_host *mmc); > +static void dw_mci_wait_busy(struct dw_mci_slot *slot); > static bool dw_mci_reset(struct dw_mci *host); > static bool dw_mci_ctrl_reset(struct dw_mci *host, u32 reset); > > @@ -874,16 +875,22 @@ static void mci_send_cmd(struct dw_mci_slot > *slot, u32 cmd, u32 arg) > struct dw_mci *host = slot->host; > unsigned long timeout = jiffies + msecs_to_jiffies(500); > unsigned int cmd_status = 0; > + int re_try = 3; /* just random for now, 1 re-try should be ok */ > > - mci_writel(host, CMDARG, arg); > - wmb(); > - mci_writel(host, CMD, SDMMC_CMD_START | cmd); > + while(re_try--) { > + mci_writel(host, CMDARG, arg); > + wmb(); > + mci_writel(host, CMD, SDMMC_CMD_START | cmd); > > - while (time_before(jiffies, timeout)) { > - cmd_status = mci_readl(host, CMD); > - if (!(cmd_status & SDMMC_CMD_START)) > - return; > + while (time_before(jiffies, timeout)) { > + cmd_status = mci_readl(host, CMD); > + if (!(cmd_status & SDMMC_CMD_START)) > + return; > + } > + > + dw_mci_wait_busy(slot); > } > + > dev_err(&slot->mmc->class_dev, > "Timeout sending command (cmd %#x arg %#x status %#x)\n", > cmd, arg, cmd_status); > @@ -925,8 +932,6 @@ static void dw_mci_setup_bus(struct dw_mci_slot > *slot, bool force_clkinit) > /* We must continue to set bit 28 in CMD until the change is complete */ > if (host->state == STATE_WAITING_CMD11_DONE) > sdmmc_cmd_bits |= SDMMC_CMD_VOLT_SWITCH; > - else > - dw_mci_wait_busy(slot); > > if (!clock) { > mci_writel(host, CLKENA, 0); It does not help, there is still the same timeout. Regards Andrzej > > ===== end ====== >> System hangs during boot after few minutes kernel spits: >> [ 242.188098] INFO: task kworker/u16:1:50 blocked for more than 120 >> seconds. >> [ 242.193524] Not tainted >> 3.19.0-next-20150210-00002-gf96831b-dirty #3834 >> [ 242.200622] "echo 0 > /proc/sys/kernel/hung_task_timeout_secs" >> disables this message. >> [ 242.208422] kworker/u16:1 D c04766ac 0 50 2 0x00000000 >> [ 242.214756] Workqueue: kmmcd mmc_rescan >> [ 242.218553] [<c04766ac>] (__schedule) from [<c0476a58>] >> (schedule+0x34/0x98) >> [ 242.225591] [<c0476a58>] (schedule) from [<c047a4dc>] >> (schedule_timeout+0x110/0x164) >> [ 242.233302] [<c047a4dc>] (schedule_timeout) from [<c04774f0>] >> (wait_for_common+0xb8/0x14c) >> [ 242.241539] [<c04774f0>] (wait_for_common) from [<c0362138>] >> (mmc_wait_for_req+0x68/0x17c) >> [ 242.249861] [<c0362138>] (mmc_wait_for_req) from [<c03622cc>] >> (mmc_wait_for_cmd+0x80/0xa0) >> [ 242.258002] [<c03622cc>] (mmc_wait_for_cmd) from [<c0367e50>] >> (mmc_go_idle+0x78/0xf8) >> [ 242.265796] [<c0367e50>] (mmc_go_idle) from [<c0363e2c>] >> (mmc_rescan+0x280/0x314) >> [ 242.273253] [<c0363e2c>] (mmc_rescan) from [<c0034764>] >> (process_one_work+0x120/0x324) >> [ 242.281135] [<c0034764>] (process_one_work) from [<c00349cc>] >> (worker_thread+0x30/0x42c) >> [ 242.289194] [<c00349cc>] (worker_thread) from [<c003926c>] >> (kthread+0xd8/0xf4) >> [ 242.296389] [<c003926c>] (kthread) from [<c000e7c0>] >> (ret_from_fork+0x14/0x34) >> >> Just for record, Exynos4412/dw_mmc_240a with the same configuration >> (no MMC card, broken-cd) works OK without patches. >> >> >> Regards >> Andrzej >> >>>>> if (!clock) { >>>>> mci_writel(host, CLKENA, 0); >>>>> -- >>>>> 1.8.3.2 >>>>> >>>>> >>>>> >>>>> _______________________________________________ >>>>> linux-arm-kernel mailing list >>>>> linux-arm-kernel@lists.infradead.org >>>>> http://lists.infradead.org/mailman/listinfo/linux-arm-kernel >>>> >>> >>> -- >>> To unsubscribe from this list: send the line "unsubscribe linux-doc" in >>> the body of a message to majordomo@vger.kernel.org >>> More majordomo info at http://vger.kernel.org/majordomo-info.html >>> > > ^ permalink raw reply [flat|nested] 21+ messages in thread
* Re: [PATCH v2 1/2] mmc: dw_mmc: fix bug that cause 'Timeout sending command' 2015-02-12 11:13 ` Andrzej Hajda @ 2015-02-12 13:53 ` Alim Akhtar 0 siblings, 0 replies; 21+ messages in thread From: Alim Akhtar @ 2015-02-12 13:53 UTC (permalink / raw) To: Andrzej Hajda Cc: Addy, robh+dt, pawel.moll, Mark Rutland, ijc+devicetree, Kumar Gala, rdunlap, Seungwon Jeon, Jaehoon Chung, Chris Ball, Ulf Hansson, dinguyen, Heiko Stübner, Olof Johansson, Douglas Anderson, Sonny Rao, Alexandru Stan, Daniel Kurtz, Tao Huang, devicetree@vger.kernel.org, Lin Huang, linux-doc, 姚智情, Chris Zhong, zhangqing Hi Andrzej, On Thu, Feb 12, 2015 at 4:43 PM, Andrzej Hajda <a.hajda@samsung.com> wrote: > On 02/12/2015 12:20 AM, Alim Akhtar wrote: >> Hi Andrzej, >> >> On Wed, Feb 11, 2015 at 5:28 PM, Andrzej Hajda <a.hajda@samsung.com> wrote: >>> Hi Alim, >>> >>> On 02/11/2015 03:57 AM, Addy wrote: >>>> On 2015/02/10 23:22, Alim Akhtar wrote: >>>>> Hi Addy, >>>>> >>>>> On Mon, Feb 9, 2015 at 12:55 PM, Addy Ke <addy.ke@rock-chips.com> wrote: >>>>>> Because of some uncertain factors, such as worse card or worse hardware, >>>>>> DAT[3:0](the data lines) may be pulled down by card, and mmc controller >>>>>> will be in busy state. This should not happend when mmc controller >>>>>> send command to update card clocks. If this happends, mci_send_cmd will >>>>>> be failed and we will get 'Timeout sending command', and then system will >>>>>> be blocked. To avoid this, we need reset mmc controller. >>>>>> >>>>>> Signed-off-by: Addy Ke <addy.ke@rock-chips.com> >>>>>> --- >>>>>> drivers/mmc/host/dw_mmc.c | 28 ++++++++++++++++++++++++++++ >>>>>> 1 file changed, 28 insertions(+) >>>>>> >>>>>> diff --git a/drivers/mmc/host/dw_mmc.c b/drivers/mmc/host/dw_mmc.c >>>>>> index 4d2e3c2..b0b57e3 100644 >>>>>> --- a/drivers/mmc/host/dw_mmc.c >>>>>> +++ b/drivers/mmc/host/dw_mmc.c >>>>>> @@ -100,6 +100,7 @@ struct idmac_desc { >>>>>> }; >>>>>> #endif /* CONFIG_MMC_DW_IDMAC */ >>>>>> >>>>>> +static int dw_mci_card_busy(struct mmc_host *mmc); >>>>>> static bool dw_mci_reset(struct dw_mci *host); >>>>>> static bool dw_mci_ctrl_reset(struct dw_mci *host, u32 reset); >>>>>> >>>>>> @@ -888,6 +889,31 @@ static void mci_send_cmd(struct dw_mci_slot *slot, u32 cmd, u32 arg) >>>>>> cmd, arg, cmd_status); >>>>>> } >>>>>> >>>>>> +static void dw_mci_wait_busy(struct dw_mci_slot *slot) >>>>>> +{ >>>>>> + struct dw_mci *host = slot->host; >>>>>> + unsigned long timeout = jiffies + msecs_to_jiffies(500); >>>>>> + >>>>> Why 500 msec? >>>> This timeout value is the same as mci_send_cmd: >>>> static void mci_send_cmd(struct dw_mci_slot *slot, u32 cmd, u32 arg) >>>> { >>>> struct dw_mci *host = slot->host; >>>> unsigned long timeout = jiffies + msecs_to_jiffies(500); >>>> .... >>>> } >>>> >>>> I have not clear that which is suitable. >>>> Do you have any suggestion on it? >>>>>> + do { >>>>>> + if (!dw_mci_card_busy(slot->mmc)) >>>>>> + return; >>>>>> + cpu_relax(); >>>>>> + } while (time_before(jiffies, timeout)); >>>>>> + >>>>>> + dev_err(host->dev, "Data busy (status %#x)\n", >>>>>> + mci_readl(slot->host, STATUS)); >>>>>> + >>>>>> + /* >>>>>> + * Data busy, this should not happend when mmc controller send command >>>>>> + * to update card clocks in non-volt-switch state. If it happends, we >>>>>> + * should reset controller to avoid getting "Timeout sending command". >>>>>> + */ >>>>>> + dw_mci_ctrl_reset(host, SDMMC_CTRL_ALL_RESET_FLAGS); >>>>>> + >>>>> Why you need to reset all blocks? may be CTRL_RESET is good enough here. >>>> I have tested on rk3288, if only reset ctroller, data busy bit will not >>>> be cleaned,and we will still get >>>> >>>> "Timeout sending command". >>>> >>>>>> + /* Fail to reset controller or still data busy, WARN_ON! */ >>>>>> + WARN_ON(dw_mci_card_busy(slot->mmc)); >>>>>> +} >>>>>> + >>>>>> static void dw_mci_setup_bus(struct dw_mci_slot *slot, bool force_clkinit) >>>>>> { >>>>>> struct dw_mci *host = slot->host; >>>>>> @@ -899,6 +925,8 @@ static void dw_mci_setup_bus(struct dw_mci_slot *slot, bool force_clkinit) >>>>>> /* We must continue to set bit 28 in CMD until the change is complete */ >>>>>> if (host->state == STATE_WAITING_CMD11_DONE) >>>>>> sdmmc_cmd_bits |= SDMMC_CMD_VOLT_SWITCH; >>>>>> + else >>>>>> + dw_mci_wait_busy(slot); >>>>>> >>>>> hmm...I would suggest you to call dw_mci_wait_busy() from inside >>>>> mci_send_cmd(), seems like dw_mmc hangs while sending update clock cmd >>>>> in multiple cases.see [1] >>>>> >>>>> [1]: http://permalink.gmane.org/gmane.linux.kernel.mmc/31140 >>>> I think this patch is more reasonable. >>>> So I will resend patches based on this patch. >>>> thank you! >>> I have tested your patches instead [1] above and they do not solve my issue: >>> Board: odroid-xu3/exynos5422/dw_mmc_250a. >>> MMC card: absent, broken-cd quirk >>> SD card: present >>> >> I doubt $SUBJECT patch in current form can resolve you issue. I have >> already given comments on $subject patch. >> >> Can you try out below patch (I have not tested yet) on top of $SUBJECT patch? >> >> ======= >> diff --git a/drivers/mmc/host/dw_mmc.c b/drivers/mmc/host/dw_mmc.c >> index b0b57e3..ea87844 100644 >> --- a/drivers/mmc/host/dw_mmc.c >> +++ b/drivers/mmc/host/dw_mmc.c >> @@ -101,6 +101,7 @@ struct idmac_desc { >> #endif /* CONFIG_MMC_DW_IDMAC */ >> >> static int dw_mci_card_busy(struct mmc_host *mmc); >> +static void dw_mci_wait_busy(struct dw_mci_slot *slot); >> static bool dw_mci_reset(struct dw_mci *host); >> static bool dw_mci_ctrl_reset(struct dw_mci *host, u32 reset); >> >> @@ -874,16 +875,22 @@ static void mci_send_cmd(struct dw_mci_slot >> *slot, u32 cmd, u32 arg) >> struct dw_mci *host = slot->host; >> unsigned long timeout = jiffies + msecs_to_jiffies(500); >> unsigned int cmd_status = 0; >> + int re_try = 3; /* just random for now, 1 re-try should be ok */ >> >> - mci_writel(host, CMDARG, arg); >> - wmb(); >> - mci_writel(host, CMD, SDMMC_CMD_START | cmd); >> + while(re_try--) { >> + mci_writel(host, CMDARG, arg); >> + wmb(); >> + mci_writel(host, CMD, SDMMC_CMD_START | cmd); >> >> - while (time_before(jiffies, timeout)) { >> - cmd_status = mci_readl(host, CMD); >> - if (!(cmd_status & SDMMC_CMD_START)) >> - return; >> + while (time_before(jiffies, timeout)) { >> + cmd_status = mci_readl(host, CMD); >> + if (!(cmd_status & SDMMC_CMD_START)) >> + return; >> + } >> + >> + dw_mci_wait_busy(slot); >> } >> + >> dev_err(&slot->mmc->class_dev, >> "Timeout sending command (cmd %#x arg %#x status %#x)\n", >> cmd, arg, cmd_status); >> @@ -925,8 +932,6 @@ static void dw_mci_setup_bus(struct dw_mci_slot >> *slot, bool force_clkinit) >> /* We must continue to set bit 28 in CMD until the change is complete */ >> if (host->state == STATE_WAITING_CMD11_DONE) >> sdmmc_cmd_bits |= SDMMC_CMD_VOLT_SWITCH; >> - else >> - dw_mci_wait_busy(slot); >> >> if (!clock) { >> mci_writel(host, CLKENA, 0); > > It does not help, there is still the same timeout. > Did you tried this on top of the original $subject patch? What does STATUS register tells? Can you post the full log here? > Regards > Andrzej > >> >> ===== end ====== >>> System hangs during boot after few minutes kernel spits: >>> [ 242.188098] INFO: task kworker/u16:1:50 blocked for more than 120 >>> seconds. >>> [ 242.193524] Not tainted >>> 3.19.0-next-20150210-00002-gf96831b-dirty #3834 >>> [ 242.200622] "echo 0 > /proc/sys/kernel/hung_task_timeout_secs" >>> disables this message. >>> [ 242.208422] kworker/u16:1 D c04766ac 0 50 2 0x00000000 >>> [ 242.214756] Workqueue: kmmcd mmc_rescan >>> [ 242.218553] [<c04766ac>] (__schedule) from [<c0476a58>] >>> (schedule+0x34/0x98) >>> [ 242.225591] [<c0476a58>] (schedule) from [<c047a4dc>] >>> (schedule_timeout+0x110/0x164) >>> [ 242.233302] [<c047a4dc>] (schedule_timeout) from [<c04774f0>] >>> (wait_for_common+0xb8/0x14c) >>> [ 242.241539] [<c04774f0>] (wait_for_common) from [<c0362138>] >>> (mmc_wait_for_req+0x68/0x17c) >>> [ 242.249861] [<c0362138>] (mmc_wait_for_req) from [<c03622cc>] >>> (mmc_wait_for_cmd+0x80/0xa0) >>> [ 242.258002] [<c03622cc>] (mmc_wait_for_cmd) from [<c0367e50>] >>> (mmc_go_idle+0x78/0xf8) >>> [ 242.265796] [<c0367e50>] (mmc_go_idle) from [<c0363e2c>] >>> (mmc_rescan+0x280/0x314) >>> [ 242.273253] [<c0363e2c>] (mmc_rescan) from [<c0034764>] >>> (process_one_work+0x120/0x324) >>> [ 242.281135] [<c0034764>] (process_one_work) from [<c00349cc>] >>> (worker_thread+0x30/0x42c) >>> [ 242.289194] [<c00349cc>] (worker_thread) from [<c003926c>] >>> (kthread+0xd8/0xf4) >>> [ 242.296389] [<c003926c>] (kthread) from [<c000e7c0>] >>> (ret_from_fork+0x14/0x34) >>> >>> Just for record, Exynos4412/dw_mmc_240a with the same configuration >>> (no MMC card, broken-cd) works OK without patches. >>> >>> >>> Regards >>> Andrzej >>> >>>>>> if (!clock) { >>>>>> mci_writel(host, CLKENA, 0); >>>>>> -- >>>>>> 1.8.3.2 >>>>>> >>>>>> >>>>>> >>>>>> _______________________________________________ >>>>>> linux-arm-kernel mailing list >>>>>> linux-arm-kernel@lists.infradead.org >>>>>> http://lists.infradead.org/mailman/listinfo/linux-arm-kernel >>>>> >>>> >>>> -- >>>> To unsubscribe from this list: send the line "unsubscribe linux-doc" in >>>> the body of a message to majordomo@vger.kernel.org >>>> More majordomo info at http://vger.kernel.org/majordomo-info.html >>>> >> >> > -- Regards, Alim ^ permalink raw reply [flat|nested] 21+ messages in thread
* [PATCH v2 2/2] mmc: dw_mmc: Don't start command while data busy 2015-02-09 7:25 ` [PATCH v2 0/2] about data busy Addy Ke 2015-02-09 7:25 ` [PATCH v2 1/2] mmc: dw_mmc: fix bug that cause 'Timeout sending command' Addy Ke @ 2015-02-09 7:25 ` Addy Ke 1 sibling, 0 replies; 21+ messages in thread From: Addy Ke @ 2015-02-09 7:25 UTC (permalink / raw) To: robh+dt, pawel.moll, mark.rutland, ijc+devicetree, galak, rdunlap, tgih.jun, jh80.chung, chris, ulf.hansson, dinguyen, heiko, olof, dianders, sonnyrao, amstan, djkurtz Cc: devicetree, linux-doc, linux-kernel, linux-mmc, linux-arm-kernel, linux-rockchip, zhenfu.fang, cf, lintao, chenfen, zyf, xjq, huangtao, zyw, yzq, hj, kever.yang, zhangqing, hl, Addy Ke We should wait for data busy here in non-volt-switch state. This may happend when sdio sends CMD53. Signed-off-by: Addy Ke <addy.ke@rock-chips.com> --- drivers/mmc/host/dw_mmc.c | 7 +++++++ 1 file changed, 7 insertions(+) diff --git a/drivers/mmc/host/dw_mmc.c b/drivers/mmc/host/dw_mmc.c index b0b57e3..b40080d 100644 --- a/drivers/mmc/host/dw_mmc.c +++ b/drivers/mmc/host/dw_mmc.c @@ -1007,6 +1007,13 @@ static void __dw_mci_start_request(struct dw_mci *host, mci_writel(host, BLKSIZ, data->blksz); } + /* + * We should wait for data busy here in non-volt-switch state. + * This may happend when sdio sends CMD53. + */ + if (host->state != STATE_WAITING_CMD11_DONE) + dw_mci_wait_busy(slot); + cmdflags = dw_mci_prepare_command(slot->mmc, cmd); /* this is the first command, send the initialization clock */ -- 1.8.3.2 ^ permalink raw reply related [flat|nested] 21+ messages in thread
end of thread, other threads:[~2015-02-13 8:15 UTC | newest] Thread overview: 21+ messages (download: mbox.gz follow: Atom feed -- links below jump to the message on this page -- 2015-02-05 11:13 [PATCH] mmc: dw_mmc: fix bug that cause 'Timeout sending command' Addy Ke 2015-02-09 4:48 ` Ulf Hansson [not found] ` <1423134801-23219-1-git-send-email-addy.ke-TNX95d0MmH7DzftRWevZcw@public.gmane.org> 2015-02-09 4:51 ` Ulf Hansson 2015-02-09 6:56 ` Addy 2015-02-09 7:04 ` Jaehoon Chung [not found] ` <54D85C15.8010007-Sze3O3UU22JBDgjK7y7TUQ@public.gmane.org> 2015-02-09 9:17 ` addy ke 2015-02-09 7:25 ` [PATCH v2 0/2] about data busy Addy Ke 2015-02-09 7:25 ` [PATCH v2 1/2] mmc: dw_mmc: fix bug that cause 'Timeout sending command' Addy Ke 2015-02-09 10:01 ` Jaehoon Chung 2015-02-11 3:07 ` Addy 2015-02-10 15:22 ` Alim Akhtar 2015-02-11 2:57 ` Addy 2015-02-11 11:58 ` Andrzej Hajda 2015-02-11 23:20 ` Alim Akhtar 2015-02-12 2:28 ` addy ke 2015-02-12 11:10 ` Andrzej Hajda [not found] ` <54DC8A1B.7070402-Sze3O3UU22JBDgjK7y7TUQ@public.gmane.org> 2015-02-12 13:59 ` Alim Akhtar 2015-02-13 8:15 ` addy ke 2015-02-12 11:13 ` Andrzej Hajda 2015-02-12 13:53 ` Alim Akhtar 2015-02-09 7:25 ` [PATCH v2 2/2] mmc: dw_mmc: Don't start command while data busy Addy Ke
This is a public inbox, see mirroring instructions for how to clone and mirror all data and code used for this inbox; as well as URLs for NNTP newsgroup(s).