linux-mmc.vger.kernel.org archive mirror
 help / color / mirror / Atom feed
From: Vladimir Zapolskiy <vladimir_zapolskiy@mentor.com>
To: Adrian Hunter <adrian.hunter@intel.com>
Cc: linux-mmc@vger.kernel.org,
	Ed Sutter <ed.sutter@alcatel-lucent.com>,
	Chris Ball <cjb@laptop.org>
Subject: Re: [PATCH] mmc: sdhci: don't limit discard timeout by data line timeout
Date: Fri, 29 Nov 2013 09:33:16 +0200	[thread overview]
Message-ID: <5298433C.2020208@mentor.com> (raw)
In-Reply-To: <52973FCA.8000901@intel.com>

On 11/28/13 15:06, Adrian Hunter wrote:
> On 28/11/13 13:48, Vladimir Zapolskiy wrote:
>> On 11/28/13 09:12, Adrian Hunter wrote:
>>> On 27/11/13 16:57, Vladimir Zapolskiy wrote:
>>>> On 11/27/13 10:21, Adrian Hunter wrote:
>>>>> On 26/11/13 18:33, Vladimir Zapolskiy wrote:
>>>>>> On 11/26/13 11:04, Adrian Hunter wrote:
>>>>>>> On 22/11/13 17:21, Vladimir Zapolskiy wrote:
>>>>>>>> On 22.11.2013 16:04, Adrian Hunter wrote:
>>>>>>>>> On 22/11/13 15:50, Vladimir Zapolskiy wrote:
>>>>>>>>>> On 22.11.2013 14:04, Adrian Hunter wrote:
>>>>>>>>>>> On 22/11/13 14:24, Vladimir Zapolskiy wrote:
>>>>>>>>>>>> On 22.11.2013 12:38, Adrian Hunter wrote:
>>>>>>>>>>>>> On 21/11/13 17:07, Vladimir Zapolskiy wrote:
>>>>>>>>>>>>>> JEDEC specification defines quite high erase timeout value for
>>>>>>>>>>>>>> 300ms
>>>>>>>>>>>>>> multiplied by erase group number, and SD Host Controller
>>>>>>>>>>>>>> specification
>>>>>>>>>>>>>> data line timeout may be much less, e.g. 2^13 / 52MHz ~ 160us.
>>>>>>>>>>>>>>
>>>>>>>>>>>>>>>            From block layer and MMC perfromance perspective it is
>>>>>>>>>>>>>>> desirable
>>>>>>>>>>>>>>> that
>>>>>>>>>>>>>> millions of erase groups are discarded at once, so there is no
>>>>>>>>>>>>>> much
>>>>>>>>>>>>>> sense to limit maximum erase timeout by data line timeout, if a
>>>>>>>>>>>>>> controller handles correctly erase operation without indication of
>>>>>>>>>>>>>> data line timeout.
>>>>>>>>>>>>>
>>>>>>>>>>>>> Would you explain that some more.  Do you mean that:
>>>>>>>>>>>>>           a) it does not have a timeout
>>>>>>>>>>>>
>>>>>>>>>>>> JEDEC defines a timeout on erase/trim operations, also in
>>>>>>>>>>>> drivers/mmc/core/core.c
>>>>>>>>>>>> there is a reasonable enough 10 minutes limitation for discard
>>>>>>>>>>>> operations.
>>>>>>>>>>>>
>>>>>>>>>>>>>           b) it has a timeout which is less than the timeout
>>>>>>>>>>>>> specified
>>>>>>>>>>>>> by the
>>>>>>>>>>>>> standard but the operation nevertheless completes
>>>>>>>>>>>>
>>>>>>>>>>>> SDHC data line timeout is enormously less than erase group timeout,
>>>>>>>>>>>> and
>>>>>>>>>>>> trivial testing shows that those two timeouts are independent,
>>>>>>>>>>>> probably
>>>>>>>>>>>> except some particular cases of controllers not known before commits
>>>>>>>>>>>> 58d1246db3 and e056a1b5b.
>>>>>>>>>>>>
>>>>>>>>>>>> According to the currently implemented logic, mmc_do_erase()
>>>>>>>>>>>> commonly is
>>>>>>>>>>>> instructed to discard 1-2 erase groups at maximum, however it tends
>>>>>>>>>>>> to be
>>>>>>>>>>>> capable to successfully discard millions of erase groups at once
>>>>>>>>>>>> ignoring
>>>>>>>>>>>> that SDHC data line timeout limitation.
>>>>>>>>>>>>
>>>>>>>>>>>
>>>>>>>>>>> You seem to be trying to say that the SDHCI spec. says that the host
>>>>>>>>>>> controller does not timeout erase operations or uses a different
>>>>>>>>>>> timeout
>>>>>>>>>>> than the one programmed in the "Timeout Control Register".  Where is
>>>>>>>>>>> that is
>>>>>>>>>>> the SDHCI spec?
>>>>>>>>>>
>>>>>>>>>> According to the spec a host controller timeouts erase operations
>>>>>>>>>> like any
>>>>>>>>>> other R1B command.
>>>>>>>>>>
>>>>>>>>>> So in your opinion, should there be SDHCI_QUIRK_BROKEN_TIMEOUT_VAL
>>>>>>>>>> instead
>>>>>>>>>> of the new quirk?
>>>>>>>>>
>>>>>>>>> I don't understand how SDHCI_QUIRK_BROKEN_TIMEOUT_VAL would help.  It
>>>>>>>>> just
>>>>>>>>> sets the timeout to maximum but max_discard_to is the maximum timeout.
>>>>>>>>
>>>>>>>> Here I meant to do something like:
>>>>>>>>
>>>>>>>>         if (host->quirks&     SDHCI_QUIRK_BROKEN_TIMEOUT_VAL)
>>>>>>>>             mmc->max_discard_to = 0;
>>>>>>>>
>>>>>>>> Again I'm not sure that this applies well to all
>>>>>>>> SDHCI_QUIRK_BROKEN_TIMEOUT_VAL
>>>>>>>> controllers, therefore a new quirk might be better.
>>>>>>>>
>>>>>>>>> As I understand it you don't want to limit the discard size, either
>>>>>>>>> because
>>>>>>>>> your controller does not timeout, or because you are happy that the
>>>>>>>>> maximum
>>>>>>>>> timeout is enough for your users and their use-cases.
>>>>>>>>>
>>>>>>>>> If that is the case then the original patch just needs the quirk the
>>>>>>>>> other
>>>>>>>>> way around. i.e.
>>>>>>>>>
>>>>>>>>>          if (host->quirks2&      SDHCI_QUIRK2_NO_DISCARD_LIMIT)
>>>>>>>>>              mmc->max_discard_to = 0;
>>>>>>>>>          else
>>>>>>>>>              mmc->max_discard_to = (1<<      27) / host->timeout_clk;
>>>>>>>>
>>>>>>>> This suits me fine, thanks for review, and I'll resend a change based on
>>>>>>>> this.
>>>>>>>>
>>>>>>>> Also I'd like to pay your attention to (1<<     27) / host->timeout_clk
>>>>>>>> part of
>>>>>>>> calculation, following the spec it might be better to account the actual
>>>>>>>> value of Data Timeout Counter, otherwise a controller may get
>>>>>>>> unintentional
>>>>>>>> Data Timeout Error pretty soon. Please correct me, if I'm mistaken here.
>>>>>>>
>>>>>>> Not sure what you mean.  max_discard_to is the maximum timeout (in
>>>>>>> milliseconds) that the host controller supports.  The intent is to limit
>>>>>>> erase operations to ones that have a timeout that is less than or
>>>>>>> equal to
>>>>>>> that.
>>>>>>
>>>>>> That's clear. But it's not obvious why do you prefer (1<<    27) numerator
>>>>>> instead
>>>>>> of secure (1<<    13) or (1<<    (13 + sdhci_readl(host,
>>>>>> SDHCI_TIMEOUT_CONTROL))).
>>>>>
>>>>> The maximum value of "Data Timeout Counter Value" in "Timeout Control
>>>>> Register" is 14 and the maximum timeout is therefore (1<<    27).
>>>>
>>>> So, from this perspective I assume this is a potential theoretical maximum
>>>> timeout for a controller, which may be 16384 times more than the maximum
>>>> guaranteed timeout before getting a DAT timeout. Why is the theoretical
>>>> maximum
>>>
>>> Where do you get the notion of "maximum guaranteed timeout"?  The timeout is
>>> what is programmed in "Data Timeout Counter Value".
>>
>> And exactly this "Data Timeout Counter Value" is not used in your code to
>> predict controller's data line timeout.
>>
>>>> supposed to be used in calculations of a guaranteed discard operation
>>>> timeout
>>>> instead of promised DAT timeout by a controller?
>>>
>>> What is "promised DAT timeout"?
>>
>> This is a timeout with respect to "Data Timeout Counter Value".
>>
>> According to your words max_discard_to is the maximum timeout that the host
>> controller supports, but such a parameter is useless, because nobody sets
>> the host controller SDHCI_TIMEOUT_CONTROL register to maximum supported value,
>
> sdhci_prepare_data() ->  sdhci_calc_timeout() sets the timeout based on what
> the upper layers specify, up to and including the maximum value.
>
> So what max_discard_to does is to limit the erase size so that when
> sdhci_calc_timeout() is called it won't exceed the maximum.

Now the idea is clear, thank you.

>> so there is a probability that you greatly overestimate Data Timeout value,
>> and therefore block layer or other subsystem can't rely on it. Please correct
>> me here.
>>
>>>>
>>>>>>
>>>>>>> Currently, the limit gets applied by the block layer before the mmc
>>>>>>> layer is
>>>>>>> involved so there is no possibility to take the actual timeout into
>>>>>>> account.
>>>>>>>      However if you have erase_group_def set, then it won't make any
>>>>>>> difference
>>>>>>> i.e. the limit will be the same.
>>>>>>>
>>>>>>>>
>>>>>>>>>>
>>>>>>>>>>>>>>
>>>>>>>>>>>>>> Potentially the change may break some of the SDHCs on discard of
>>>>>>>>>>>>>> mmc,
>>>>>>>>>>>>>> and for backward compatibility a new quirk is introduced, which
>>>>>>>>>>>>>> is NOT
>>>>>>>>>>>>>> set by default.
>>>>>>>>>>>>>
>>>>>>>>>>>>> It sounds to me that what you want to do is not standard so the
>>>>>>>>>>>>> quirk
>>>>>>>>>>>>> should
>>>>>>>>>>>>> be the other way around.
>>>>>>>>>>>>
>>>>>>>>>>>> Please take a look at commits 58d1246db3 and e056a1b5b, I'd be
>>>>>>>>>>>> glad, if
>>>>>>>>>>>> you
>>>>>>>>>>>> could elaborate to which "some host controllers" the quirk in my
>>>>>>>>>>>> definition
>>>>>>>>>>>> applies, I believe all other host controllers present at that
>>>>>>>>>>>> time in
>>>>>>>>>>>> drivers/mmc/host/* are capable to discard without introduced
>>>>>>>>>>>> limitation.
>>>>>>>>>>>
>>>>>>>>>>> "some host controllers" == SDHCI i.e. to all of the ones you are
>>>>>>>>>>> applying
>>>>>>>>>>> the change.
>>>>>>>>>>>
>>>>>>>>>>>>
>>>>>>>>>>>>>>
>>>>>>>>>>>>>> Signed-off-by: Vladimir Zapolskiy<vladimir_zapolskiy@mentor.com>
>>>>>>>>>>>>>> Reported-by: Ed Sutter<ed.sutter@alcatel-lucent.com>
>>>>>>>>>>>>>> Cc: Chris Ball<cjb@laptop.org>
>>>>>>>>>>>>>> Cc: Adrian Hunter<adrian.hunter@intel.com>
>>>>>>>>>>>>>> ---
>>>>>>>>>>>>>>         drivers/mmc/host/sdhci.c  |    5 ++++-
>>>>>>>>>>>>>>         include/linux/mmc/sdhci.h |    1 +
>>>>>>>>>>>>>>         2 files changed, 5 insertions(+), 1 deletion(-)
>>>>>>>>>>>>>>
>>>>>>>>>>>>>> diff --git a/drivers/mmc/host/sdhci.c b/drivers/mmc/host/sdhci.c
>>>>>>>>>>>>>> index bd8a098..b1fdddb 100644
>>>>>>>>>>>>>> --- a/drivers/mmc/host/sdhci.c
>>>>>>>>>>>>>> +++ b/drivers/mmc/host/sdhci.c
>>>>>>>>>>>>>> @@ -2930,7 +2930,10 @@ int sdhci_add_host(struct sdhci_host *host)
>>>>>>>>>>>>>>             if (host->quirks&
>>>>>>>>>>>>>> SDHCI_QUIRK_DATA_TIMEOUT_USES_SDCLK)
>>>>>>>>>>>>>>                 host->timeout_clk = mmc->f_max / 1000;
>>>>>>>>>>>>>>
>>>>>>>>>>>>>> -    mmc->max_discard_to = (1<<        27) / host->timeout_clk;
>>>>>>>>>>>>>> +    if (host->quirks2&
>>>>>>>>>>>>>> SDHCI_QUIRK2_DATA_TIMEOUT_ON_DISCARD)
>>>>>>>>>>>>>> +        mmc->max_discard_to = (1<<        27) / host->timeout_clk;
>>>>>>>>>>>>>> +    else
>>>>>>>>>>>>>> +        mmc->max_discard_to = 0;
>>>>>>>>>>>>>>
>>>>>>>>>>>>>>             mmc->caps |= MMC_CAP_SDIO_IRQ | MMC_CAP_ERASE |
>>>>>>>>>>>>>> MMC_CAP_CMD23;
>>>>>>>>>>>>>>
>>>>>>>>>>>>>> diff --git a/include/linux/mmc/sdhci.h b/include/linux/mmc/sdhci.h
>>>>>>>>>>>>>> index 3e781b8..e7f6bd2 100644
>>>>>>>>>>>>>> --- a/include/linux/mmc/sdhci.h
>>>>>>>>>>>>>> +++ b/include/linux/mmc/sdhci.h
>>>>>>>>>>>>>> @@ -98,6 +98,7 @@ struct sdhci_host {
>>>>>>>>>>>>>>         #define SDHCI_QUIRK2_CARD_ON_NEEDS_BUS_ON        (1<<4)
>>>>>>>>>>>>>>         /* Controller has a non-standard host control register */
>>>>>>>>>>>>>>         #define SDHCI_QUIRK2_BROKEN_HOST_CONTROL        (1<<5)
>>>>>>>>>>>>>> +#define SDHCI_QUIRK2_DATA_TIMEOUT_ON_DISCARD        (1<<6)
>>>>>>>>>>>>>>
>>>>>>>>>>>>>>             int irq;        /* Device IRQ */
>>>>>>>>>>>>>>             void __iomem *ioaddr;    /* Mapped address */
>>>>>>>>>>>>>>
>>>>>>>>>>>>>
>>>>>>>>>>>>
>>>>>>>>>>>>
>>>>>>>>>>>
>>>>>>>>
>>>>>>>>
>>>>>>>
>>>>>>> --
>>>>>>> To unsubscribe from this list: send the line "unsubscribe linux-mmc" in
>>>>>>> the body of a message to majordomo@vger.kernel.org
>>>>>>> More majordomo info at  http://vger.kernel.org/majordomo-info.html
>>>>>>
>>>>>>
>>>>>
>>>>> --
>>>>> To unsubscribe from this list: send the line "unsubscribe linux-mmc" in
>>>>> the body of a message to majordomo@vger.kernel.org
>>>>> More majordomo info at  http://vger.kernel.org/majordomo-info.html
>>>>
>>>>
>>>
>>
>>
>
> --
> To unsubscribe from this list: send the line "unsubscribe linux-mmc" in
> the body of a message to majordomo@vger.kernel.org
> More majordomo info at  http://vger.kernel.org/majordomo-info.html

  reply	other threads:[~2013-11-29  7:33 UTC|newest]

Thread overview: 19+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2013-11-21 15:07 [PATCH] mmc: sdhci: don't limit discard timeout by data line timeout Vladimir Zapolskiy
2013-11-22 11:38 ` Adrian Hunter
2013-11-22 12:24   ` Vladimir Zapolskiy
2013-11-22 13:04     ` Adrian Hunter
2013-11-22 13:50       ` Vladimir Zapolskiy
2013-11-22 15:04         ` Adrian Hunter
2013-11-22 15:21           ` Vladimir Zapolskiy
2013-11-26  9:04             ` Adrian Hunter
2013-11-26 16:33               ` Vladimir Zapolskiy
2013-11-27  8:21                 ` Adrian Hunter
2013-11-27 14:57                   ` Vladimir Zapolskiy
2013-11-27 15:48                     ` Philip Rakity
2013-11-27 16:11                       ` Vladimir Zapolskiy
2013-11-28  7:12                     ` Adrian Hunter
2013-11-28 11:48                       ` Vladimir Zapolskiy
2013-11-28 13:06                         ` Adrian Hunter
2013-11-29  7:33                           ` Vladimir Zapolskiy [this message]
2013-11-25 18:20       ` Ed Sutter
2013-11-25 22:06         ` Vladimir Zapolskiy

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=5298433C.2020208@mentor.com \
    --to=vladimir_zapolskiy@mentor.com \
    --cc=adrian.hunter@intel.com \
    --cc=cjb@laptop.org \
    --cc=ed.sutter@alcatel-lucent.com \
    --cc=linux-mmc@vger.kernel.org \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).