public inbox for linux-kernel@vger.kernel.org
 help / color / mirror / Atom feed
* [PATCH] [OLPC] sdhci: add quirk for the Marvell CaFe's interrupt timeout
@ 2008-06-23 14:13 Andres Salomon
  2008-06-24  0:04 ` Andrew Morton
  2008-06-27 17:30 ` Pierre Ossman
  0 siblings, 2 replies; 7+ messages in thread
From: Andres Salomon @ 2008-06-23 14:13 UTC (permalink / raw)
  To: akpm; +Cc: drzeus-sdhci, sdhci-devel, linux-kernel


The CaFe chip has a hardware bug that ends up with us getting a timeout
value that's too small, causing the following sorts of problems:

[   60.525138] mmcblk0: error -110 transferring data
[   60.531477] end_request: I/O error, dev mmcblk0, sector 1484353
[   60.533371] Buffer I/O error on device mmcblk0p2, logical block 181632
[   60.533371] lost page write due to I/O error on mmcblk0p2

Presumably this is an off-by-one error in the hardware.  Incrementing
the timeout count value that we stuff into the TIMEOUT_CONTROL register
gets us a value that works.  This bug was originally discovered by
Pierre Ossman, I believe.

[thanks to Robert Millan for proving that this was still a problem]

Signed-off-by: Andres Salomon <dilinger@debian.org>
---
 drivers/mmc/host/sdhci.c |   12 +++++++++++-
 1 files changed, 11 insertions(+), 1 deletions(-)

diff --git a/drivers/mmc/host/sdhci.c b/drivers/mmc/host/sdhci.c
index 5b74c8c..2b3f06a 100644
--- a/drivers/mmc/host/sdhci.c
+++ b/drivers/mmc/host/sdhci.c
@@ -57,6 +57,8 @@ static unsigned int debug_quirks = 0;
 #define SDHCI_QUIRK_RESET_AFTER_REQUEST			(1<<8)
 /* Controller needs voltage and power writes to happen separately */
 #define SDHCI_QUIRK_NO_SIMULT_VDD_AND_POWER		(1<<9)
+/* Controller has an off-by-one issue with timeout value */
+#define SDHCI_QUIRK_INCR_TIMEOUT_CONTROL		(1<<10)
 
 static const struct pci_device_id pci_ids[] __devinitdata = {
 	{
@@ -134,7 +136,8 @@ static const struct pci_device_id pci_ids[] __devinitdata = {
 		.device         = PCI_DEVICE_ID_MARVELL_CAFE_SD,
 		.subvendor      = PCI_ANY_ID,
 		.subdevice      = PCI_ANY_ID,
-		.driver_data    = SDHCI_QUIRK_NO_SIMULT_VDD_AND_POWER,
+		.driver_data    = SDHCI_QUIRK_NO_SIMULT_VDD_AND_POWER |
+				  SDHCI_QUIRK_INCR_TIMEOUT_CONTROL,
 	},
 
 	{
@@ -479,6 +482,13 @@ static void sdhci_prepare_data(struct sdhci_host *host, struct mmc_data *data)
 			break;
 	}
 
+	/*
+	 * Compensate for an off-by-one error in the CaFe hardware; otherwise,
+	 * a too-small count gives us interrupt timeouts.
+	 */
+	if ((host->chip->quirks & SDHCI_QUIRK_INCR_TIMEOUT_CONTROL))
+		count++;
+
 	if (count >= 0xF) {
 		printk(KERN_WARNING "%s: Too large timeout requested!\n",
 			mmc_hostname(host->mmc));
-- 
1.5.5.3


^ permalink raw reply related	[flat|nested] 7+ messages in thread

* Re: [PATCH] [OLPC] sdhci: add quirk for the Marvell CaFe's interrupt timeout
  2008-06-23 14:13 [PATCH] [OLPC] sdhci: add quirk for the Marvell CaFe's interrupt timeout Andres Salomon
@ 2008-06-24  0:04 ` Andrew Morton
  2008-06-24  0:08   ` Andrew Morton
  2008-06-24  1:40   ` Andres Salomon
  2008-06-27 17:30 ` Pierre Ossman
  1 sibling, 2 replies; 7+ messages in thread
From: Andrew Morton @ 2008-06-24  0:04 UTC (permalink / raw)
  To: Andres Salomon; +Cc: drzeus-sdhci, sdhci-devel, linux-kernel

On Mon, 23 Jun 2008 10:13:52 -0400
Andres Salomon <dilinger@queued.net> wrote:

> 
> The CaFe chip has a hardware bug that ends up with us getting a timeout
> value that's too small, causing the following sorts of problems:
> 
> [   60.525138] mmcblk0: error -110 transferring data
> [   60.531477] end_request: I/O error, dev mmcblk0, sector 1484353
> [   60.533371] Buffer I/O error on device mmcblk0p2, logical block 181632
> [   60.533371] lost page write due to I/O error on mmcblk0p2
> 
> Presumably this is an off-by-one error in the hardware.  Incrementing
> the timeout count value that we stuff into the TIMEOUT_CONTROL register
> gets us a value that works.  This bug was originally discovered by
> Pierre Ossman, I believe.
> 
> [thanks to Robert Millan for proving that this was still a problem]
> 
> Signed-off-by: Andres Salomon <dilinger@debian.org>
> ---
>  drivers/mmc/host/sdhci.c |   12 +++++++++++-
>  1 files changed, 11 insertions(+), 1 deletions(-)
> 
> diff --git a/drivers/mmc/host/sdhci.c b/drivers/mmc/host/sdhci.c
> index 5b74c8c..2b3f06a 100644
> --- a/drivers/mmc/host/sdhci.c
> +++ b/drivers/mmc/host/sdhci.c
> @@ -57,6 +57,8 @@ static unsigned int debug_quirks = 0;
>  #define SDHCI_QUIRK_RESET_AFTER_REQUEST			(1<<8)
>  /* Controller needs voltage and power writes to happen separately */
>  #define SDHCI_QUIRK_NO_SIMULT_VDD_AND_POWER		(1<<9)
> +/* Controller has an off-by-one issue with timeout value */
> +#define SDHCI_QUIRK_INCR_TIMEOUT_CONTROL		(1<<10)
>  
>  static const struct pci_device_id pci_ids[] __devinitdata = {
>  	{
> @@ -134,7 +136,8 @@ static const struct pci_device_id pci_ids[] __devinitdata = {
>  		.device         = PCI_DEVICE_ID_MARVELL_CAFE_SD,
>  		.subvendor      = PCI_ANY_ID,
>  		.subdevice      = PCI_ANY_ID,
> -		.driver_data    = SDHCI_QUIRK_NO_SIMULT_VDD_AND_POWER,
> +		.driver_data    = SDHCI_QUIRK_NO_SIMULT_VDD_AND_POWER |
> +				  SDHCI_QUIRK_INCR_TIMEOUT_CONTROL,
>  	},
>  
>  	{
> @@ -479,6 +482,13 @@ static void sdhci_prepare_data(struct sdhci_host *host, struct mmc_data *data)
>  			break;
>  	}
>  
> +	/*
> +	 * Compensate for an off-by-one error in the CaFe hardware; otherwise,
> +	 * a too-small count gives us interrupt timeouts.
> +	 */
> +	if ((host->chip->quirks & SDHCI_QUIRK_INCR_TIMEOUT_CONTROL))
> +		count++;
> +
>  	if (count >= 0xF) {
>  		printk(KERN_WARNING "%s: Too large timeout requested!\n",
>  			mmc_hostname(host->mmc));

This is needed in 2.6.26, I assume?

If so, I can merge it unless Pierre has objections?

And it will cause conflicts with overlapping changes in linux-next.


^ permalink raw reply	[flat|nested] 7+ messages in thread

* Re: [PATCH] [OLPC] sdhci: add quirk for the Marvell CaFe's interrupt timeout
  2008-06-24  0:04 ` Andrew Morton
@ 2008-06-24  0:08   ` Andrew Morton
  2008-06-24  1:38     ` Andres Salomon
  2008-06-24  1:40   ` Andres Salomon
  1 sibling, 1 reply; 7+ messages in thread
From: Andrew Morton @ 2008-06-24  0:08 UTC (permalink / raw)
  To: dilinger, drzeus-sdhci, sdhci-devel, linux-kernel

On Mon, 23 Jun 2008 17:04:49 -0700
Andrew Morton <akpm@linux-foundation.org> wrote:

> And it will cause conflicts with overlapping changes in linux-next.


oops, I lied.  The problem was that it secretly depended upon
olpc-sdhci-add-quirk-for-the-marvell-cafes-vdd-powerup-issue.patch

So if we want to fix thsi issue in 2.6.26 we need to merge both

olpc-sdhci-add-quirk-for-the-marvell-cafes-vdd-powerup-issue.patch

and

olpc-sdhci-add-quirk-for-the-marvell-cafes-interrupt-timeout.patch

yes?

^ permalink raw reply	[flat|nested] 7+ messages in thread

* Re: [PATCH] [OLPC] sdhci: add quirk for the Marvell CaFe's interrupt timeout
  2008-06-24  0:08   ` Andrew Morton
@ 2008-06-24  1:38     ` Andres Salomon
  0 siblings, 0 replies; 7+ messages in thread
From: Andres Salomon @ 2008-06-24  1:38 UTC (permalink / raw)
  To: Andrew Morton; +Cc: drzeus-sdhci, sdhci-devel, linux-kernel

On Mon, 23 Jun 2008 17:08:50 -0700
Andrew Morton <akpm@linux-foundation.org> wrote:

> On Mon, 23 Jun 2008 17:04:49 -0700
> Andrew Morton <akpm@linux-foundation.org> wrote:
> 
> > And it will cause conflicts with overlapping changes in linux-next.
> 
> 
> oops, I lied.  The problem was that it secretly depended upon
> olpc-sdhci-add-quirk-for-the-marvell-cafes-vdd-powerup-issue.patch
> 
> So if we want to fix thsi issue in 2.6.26 we need to merge both
> 
> olpc-sdhci-add-quirk-for-the-marvell-cafes-vdd-powerup-issue.patch
> 
> and
> 
> olpc-sdhci-add-quirk-for-the-marvell-cafes-interrupt-timeout.patch
> 
> yes?


Correct.  I originally wasn't going to send the interrupt-timeout
patch (but was shown that the bug still existed), which is why the two
patches weren't sent as a series.  Sorry!

^ permalink raw reply	[flat|nested] 7+ messages in thread

* Re: [PATCH] [OLPC] sdhci: add quirk for the Marvell CaFe's interrupt timeout
  2008-06-24  0:04 ` Andrew Morton
  2008-06-24  0:08   ` Andrew Morton
@ 2008-06-24  1:40   ` Andres Salomon
  1 sibling, 0 replies; 7+ messages in thread
From: Andres Salomon @ 2008-06-24  1:40 UTC (permalink / raw)
  To: Andrew Morton; +Cc: drzeus-sdhci, sdhci-devel, linux-kernel

On Mon, 23 Jun 2008 17:04:49 -0700
Andrew Morton <akpm@linux-foundation.org> wrote:

> On Mon, 23 Jun 2008 10:13:52 -0400
> Andres Salomon <dilinger@queued.net> wrote:
> 
> > 
> > The CaFe chip has a hardware bug that ends up with us getting a
> > timeout value that's too small, causing the following sorts of
> > problems:
> > 
> > [   60.525138] mmcblk0: error -110 transferring data
> > [   60.531477] end_request: I/O error, dev mmcblk0, sector 1484353
> > [   60.533371] Buffer I/O error on device mmcblk0p2, logical block
> > 181632 [   60.533371] lost page write due to I/O error on mmcblk0p2
> > 
[...]
> 
> This is needed in 2.6.26, I assume?
> 


Yes, please.


> If so, I can merge it unless Pierre has objections?
> 
> And it will cause conflicts with overlapping changes in linux-next.
> 

^ permalink raw reply	[flat|nested] 7+ messages in thread

* Re: [PATCH] [OLPC] sdhci: add quirk for the Marvell CaFe's interrupt timeout
  2008-06-23 14:13 [PATCH] [OLPC] sdhci: add quirk for the Marvell CaFe's interrupt timeout Andres Salomon
  2008-06-24  0:04 ` Andrew Morton
@ 2008-06-27 17:30 ` Pierre Ossman
  2008-06-27 17:42   ` Andres Salomon
  1 sibling, 1 reply; 7+ messages in thread
From: Pierre Ossman @ 2008-06-27 17:30 UTC (permalink / raw)
  To: Andres Salomon; +Cc: akpm, drzeus-sdhci, sdhci-devel, linux-kernel

[-- Attachment #1: Type: text/plain, Size: 1840 bytes --]

On Mon, 23 Jun 2008 10:13:52 -0400
Andres Salomon <dilinger@queued.net> wrote:

> 
> The CaFe chip has a hardware bug that ends up with us getting a timeout
> value that's too small, causing the following sorts of problems:
> 
> [   60.525138] mmcblk0: error -110 transferring data
> [   60.531477] end_request: I/O error, dev mmcblk0, sector 1484353
> [   60.533371] Buffer I/O error on device mmcblk0p2, logical block 181632
> [   60.533371] lost page write due to I/O error on mmcblk0p2
> 
> Presumably this is an off-by-one error in the hardware.  Incrementing
> the timeout count value that we stuff into the TIMEOUT_CONTROL register
> gets us a value that works.  This bug was originally discovered by
> Pierre Ossman, I believe.
> 
> [thanks to Robert Millan for proving that this was still a problem]
> 
> Signed-off-by: Andres Salomon <dilinger@debian.org>

Hmm... I'm not entirely sure about the specifics of the workaround
here. It's likely that we'll have an off-by-minus-one in another
controller, and off-by-two in a third.

Perhaps we should just have "broken timeout" and set the timeout to
0xE. It doesn't cause any side-effects except that the user will have
to wait slightly longer for requests to fail if the card has decided to
crap out.

> @@ -479,6 +482,13 @@ static void sdhci_prepare_data(struct sdhci_host *host, struct mmc_data *data)
>  			break;
>  	}
>  
> +	/*
> +	 * Compensate for an off-by-one error in the CaFe hardware; otherwise,
> +	 * a too-small count gives us interrupt timeouts.
> +	 */

Same issue with "CaFE" as the previous patch.

-- 
     -- Pierre Ossman

  WARNING: This correspondence is being monitored by the
  Swedish government. Make sure your server uses encryption
  for SMTP traffic and consider using PGP for end-to-end
  encryption.

[-- Attachment #2: signature.asc --]
[-- Type: application/pgp-signature, Size: 197 bytes --]

^ permalink raw reply	[flat|nested] 7+ messages in thread

* Re: [PATCH] [OLPC] sdhci: add quirk for the Marvell CaFe's interrupt timeout
  2008-06-27 17:30 ` Pierre Ossman
@ 2008-06-27 17:42   ` Andres Salomon
  0 siblings, 0 replies; 7+ messages in thread
From: Andres Salomon @ 2008-06-27 17:42 UTC (permalink / raw)
  To: Pierre Ossman; +Cc: akpm, sdhci-devel, linux-kernel

On Fri, 27 Jun 2008 19:30:01 +0200
Pierre Ossman <drzeus@drzeus.cx> wrote:

> On Mon, 23 Jun 2008 10:13:52 -0400
> Andres Salomon <dilinger@queued.net> wrote:
> 
> > 
> > The CaFe chip has a hardware bug that ends up with us getting a
> > timeout value that's too small, causing the following sorts of
> > problems:
> > 
> > [   60.525138] mmcblk0: error -110 transferring data
> > [   60.531477] end_request: I/O error, dev mmcblk0, sector 1484353
> > [   60.533371] Buffer I/O error on device mmcblk0p2, logical block
> > 181632 [   60.533371] lost page write due to I/O error on mmcblk0p2
> > 
> > Presumably this is an off-by-one error in the hardware.
> > Incrementing the timeout count value that we stuff into the
> > TIMEOUT_CONTROL register gets us a value that works.  This bug was
> > originally discovered by Pierre Ossman, I believe.
> > 
> > [thanks to Robert Millan for proving that this was still a problem]
> > 
> > Signed-off-by: Andres Salomon <dilinger@debian.org>
> 
> Hmm... I'm not entirely sure about the specifics of the workaround
> here. It's likely that we'll have an off-by-minus-one in another
> controller, and off-by-two in a third.
> 
> Perhaps we should just have "broken timeout" and set the timeout to
> 0xE. It doesn't cause any side-effects except that the user will have
> to wait slightly longer for requests to fail if the card has decided
> to crap out.
> 

That would be fine.  OFW actually just hardcodes the timeout to 0xc,
with Mitch citing the same logic.  Just setting it to the upper bound
would certainly make it more applicable hardware other than the cafe.


^ permalink raw reply	[flat|nested] 7+ messages in thread

end of thread, other threads:[~2008-06-27 17:39 UTC | newest]

Thread overview: 7+ messages (download: mbox.gz follow: Atom feed
-- links below jump to the message on this page --
2008-06-23 14:13 [PATCH] [OLPC] sdhci: add quirk for the Marvell CaFe's interrupt timeout Andres Salomon
2008-06-24  0:04 ` Andrew Morton
2008-06-24  0:08   ` Andrew Morton
2008-06-24  1:38     ` Andres Salomon
2008-06-24  1:40   ` Andres Salomon
2008-06-27 17:30 ` Pierre Ossman
2008-06-27 17:42   ` Andres Salomon

This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox