linux-mmc.vger.kernel.org archive mirror
 help / color / mirror / Atom feed
From: Seungwon Jeon <tgih.jun@samsung.com>
To: 'Sonny Rao' <sonnyrao@chromium.org>,
	'Yuvaraj Kumar' <yuvaraj.cd@gmail.com>
Cc: 'Grant Grundler' <grundler@chromium.org>,
	linux-samsung-soc@vger.kernel.org,
	linux-arm-kernel@lists.infradead.org,
	'Jaehoon Chung' <jh80.chung@samsung.com>,
	'Chris Ball' <cjb@laptop.org>,
	'linux-mmc' <linux-mmc@vger.kernel.org>,
	kgene.kim@samsung.com, 'sunil joshi' <joshi@samsung.com>,
	'Tomasz Figa' <t.figa@samsung.com>,
	'Doug Anderson' <dianders@chromium.org>
Subject: RE: [PATCH] mmc: dw_mmc: Make sure we don't get stuck when we get an error
Date: Sat, 10 May 2014 23:11:17 +0900	[thread overview]
Message-ID: <002001cf6c59$b5b5e680$2121b380$%jun@samsung.com> (raw)
In-Reply-To: <CAPz6YkVwT0PrcLP1MJy0vVRw811+_7PwsV_ggKxQkqrjOWeWWw@mail.gmail.com>

On Fri, May 09, 2014, Sonny Rao wrote:
> On Thu, May 8, 2014 at 2:42 AM, Yuvaraj Kumar <yuvaraj.cd@gmail.com> wrote:
> > Any comments on this patch?
> >
> 
> I'll just add that without this fix, running the tuning loop for UHS
> modes is not reliable on dw_mmc because errors will happen and you
> will eventually hit this race and hang.  This can happen any time
> there is tuning like during boot or during resume from suspend.
> 
> > On Thu, Mar 27, 2014 at 11:48 AM, Yuvaraj Kumar C D
> > <yuvaraj.cd@gmail.com> wrote:
> >> From: Doug Anderson <dianders@chromium.org>
> >>
> >> If we happened to get a data error at just the wrong time the dw_mmc
> >> driver could get into a state where it would never complete its
> >> request.  That would leave the caller just hanging there.
> >>
> >> We fix this two ways and both of the two fixes on their own appear to
> >> fix the problems we've seen:
> >>
> >> 1. Fix a race in the tasklet where the interrupt setting the data
> >>    error happens _just after_ we check for it, then we get a
> >>    EVENT_XFER_COMPLETE.  We fix this by repeating a bit of code.
I think repeating is not good approach to fix race.
In your case, XFER_COMPLETE preceded data error and DTO didn't come?
It seems strange case.
I want to know actual error value if you can reproduce.

> >> 2. Fix it so that if we detect that we've got an error in the "data
> >>    busy" state and we're not going to do anything else we end the
> >>    request and unblock anyone waiting.
> >>
> >> Signed-off-by: Doug Anderson <dianders@chromium.org>
> >> Signed-off-by: Yuvaraj Kumar C D <yuvaraj.cd@gmail.com>
> >> ---
> >>  drivers/mmc/host/dw_mmc.c |   47 +++++++++++++++++++++++++++++++++++++++++++++
> >>  1 file changed, 47 insertions(+)
> >>
> >> diff --git a/drivers/mmc/host/dw_mmc.c b/drivers/mmc/host/dw_mmc.c
> >> index 1d77431..4c589f1 100644
> >> --- a/drivers/mmc/host/dw_mmc.c
> >> +++ b/drivers/mmc/host/dw_mmc.c
> >> @@ -1300,6 +1300,14 @@ static void dw_mci_tasklet_func(unsigned long priv)
> >>                         /* fall through */
> >>
> >>                 case STATE_SENDING_DATA:
> >> +                       /*
> >> +                        * We could get a data error and never a transfer
> >> +                        * complete so we'd better check for it here.
> >> +                        *
> >> +                        * Note that we don't really care if we also got a
> >> +                        * transfer complete; stopping the DMA and sending an
> >> +                        * abort won't hurt.
> >> +                        */
> >>                         if (test_and_clear_bit(EVENT_DATA_ERROR,
> >>                                                &host->pending_events)) {
> >>                                 dw_mci_stop_dma(host);
> >> @@ -1313,7 +1321,29 @@ static void dw_mci_tasklet_func(unsigned long priv)
> >>                                 break;
> >>
> >>                         set_bit(EVENT_XFER_COMPLETE, &host->completed_events);
> >> +
> >> +                       /*
> >> +                        * Handle an EVENT_DATA_ERROR that might have shown up
> >> +                        * before the transfer completed.  This might not have
> >> +                        * been caught by the check above because the interrupt
> >> +                        * could have gone off between the previous check and
> >> +                        * the check for transfer complete.
> >> +                        *
> >> +                        * Technically this ought not be needed assuming we
> >> +                        * get a DATA_COMPLETE eventually (we'll notice the
> >> +                        * error and end the request), but it shouldn't hurt.
> >> +                        *
> >> +                        * This has the advantage of sending the stop command.
> >> +                        */
> >> +                       if (test_and_clear_bit(EVENT_DATA_ERROR,
> >> +                                              &host->pending_events)) {
> >> +                               dw_mci_stop_dma(host);
> >> +                               send_stop_abort(host, data);
> >> +                               state = STATE_DATA_ERROR;
> >> +                               break;
> >> +                       }
> >>                         prev_state = state = STATE_DATA_BUSY;
> >> +
> >>                         /* fall through */
> >>
> >>                 case STATE_DATA_BUSY:
> >> @@ -1336,6 +1366,23 @@ static void dw_mci_tasklet_func(unsigned long priv)
> >>                                 /* stop command for open-ended transfer*/
> >>                                 if (data->stop)
> >>                                         send_stop_abort(host, data);
> >> +                       } else {
> >> +                               /*
> >> +                                * If we don't have a command complete now we'll
> >> +                                * never get one since we just reset everything;
> >> +                                * better end the request.
> >> +                                *
> >> +                                * If we do have a command complete we'll fall
> >> +                                * through to the SENDING_STOP command and
> >> +                                * everything will be peachy keen.
> >> +                                *
> >> +                                * TODO: I guess we shouldn't send a stop?
> >> +                                */
> >> +                               if (!test_bit(EVENT_CMD_COMPLETE,
> >> +                                             &host->pending_events)) {
> >> +                                       dw_mci_request_end(host, mrq);
> >> +                                       goto unlock;
> >> +                               }
Can you explain what happens above?
What is it for?

Thanks,
Seungwon Jeon

  reply	other threads:[~2014-05-10 14:11 UTC|newest]

Thread overview: 13+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2014-03-27  6:18 [PATCH] mmc: dw_mmc: Make sure we don't get stuck when we get an error Yuvaraj Kumar C D
2014-05-08  9:42 ` Yuvaraj Kumar
2014-05-09  3:21   ` Sonny Rao
2014-05-10 14:11     ` Seungwon Jeon [this message]
2014-05-12 21:50       ` Doug Anderson
2014-05-13  4:52         ` Seungwon Jeon
2014-05-13 15:56           ` Doug Anderson
2014-05-16  1:46             ` Seungwon Jeon
2014-05-16 16:21               ` Doug Anderson
2014-05-20  1:24                 ` Seungwon Jeon
2014-05-20  1:51           ` Seungwon Jeon
2014-05-20 22:08             ` Doug Anderson
2014-05-21  9:05               ` Seungwon Jeon

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to='002001cf6c59$b5b5e680$2121b380$%jun@samsung.com' \
    --to=tgih.jun@samsung.com \
    --cc=cjb@laptop.org \
    --cc=dianders@chromium.org \
    --cc=grundler@chromium.org \
    --cc=jh80.chung@samsung.com \
    --cc=joshi@samsung.com \
    --cc=kgene.kim@samsung.com \
    --cc=linux-arm-kernel@lists.infradead.org \
    --cc=linux-mmc@vger.kernel.org \
    --cc=linux-samsung-soc@vger.kernel.org \
    --cc=sonnyrao@chromium.org \
    --cc=t.figa@samsung.com \
    --cc=yuvaraj.cd@gmail.com \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).