Linux-mtd Archive on lore.kernel.org
 help / color / mirror / Atom feed
* [PATCH] mtd: nand: omap: fix race condition in omap_wait()
@ 2012-04-17 11:11 Ivan Djelic
  2012-04-27  5:50 ` Artem Bityutskiy
  0 siblings, 1 reply; 3+ messages in thread
From: Ivan Djelic @ 2012-04-17 11:11 UTC (permalink / raw)
  To: linux-mtd; +Cc: ivan.djelic

If a context switch occurs in function omap_wait() just before the
while loop is entered, then upon return from context switch the
timeout may already have elapsed: in that case, status is never
read from NAND device, and omap_wait() returns an error.
This failure has been experimentally observed during stress tests.

This patch ensures a NAND status read is always performed before
returning, as in the generic nand_wait() function.

Signed-off-by: Ivan Djelic <ivan.djelic@parrot.com>
---
 drivers/mtd/nand/omap2.c |    4 +++-
 1 file changed, 3 insertions(+), 1 deletion(-)

diff --git a/drivers/mtd/nand/omap2.c b/drivers/mtd/nand/omap2.c
index c2b0bba..45c6205 100644
--- a/drivers/mtd/nand/omap2.c
+++ b/drivers/mtd/nand/omap2.c
@@ -879,7 +879,7 @@ static int omap_wait(struct mtd_info *mtd, struct nand_chip *chip)
 	struct omap_nand_info *info = container_of(mtd, struct omap_nand_info,
 							mtd);
 	unsigned long timeo = jiffies;
-	int status = NAND_STATUS_FAIL, state = this->state;
+	int status, state = this->state;
 
 	if (state == FL_ERASING)
 		timeo += (HZ * 400) / 1000;
@@ -894,6 +894,8 @@ static int omap_wait(struct mtd_info *mtd, struct nand_chip *chip)
 			break;
 		cond_resched();
 	}
+
+	status = gpmc_nand_read(info->gpmc_cs, GPMC_NAND_DATA);
 	return status;
 }
 
-- 
1.7.10

^ permalink raw reply related	[flat|nested] 3+ messages in thread

* Re: [PATCH] mtd: nand: omap: fix race condition in omap_wait()
  2012-04-17 11:11 [PATCH] mtd: nand: omap: fix race condition in omap_wait() Ivan Djelic
@ 2012-04-27  5:50 ` Artem Bityutskiy
  2012-04-27 11:26   ` Mark Olleson
  0 siblings, 1 reply; 3+ messages in thread
From: Artem Bityutskiy @ 2012-04-27  5:50 UTC (permalink / raw)
  To: Ivan Djelic; +Cc: linux-mtd

[-- Attachment #1: Type: text/plain, Size: 656 bytes --]

On Tue, 2012-04-17 at 13:11 +0200, Ivan Djelic wrote:
> If a context switch occurs in function omap_wait() just before the
> while loop is entered, then upon return from context switch the
> timeout may already have elapsed: in that case, status is never
> read from NAND device, and omap_wait() returns an error.
> This failure has been experimentally observed during stress tests.
> 
> This patch ensures a NAND status read is always performed before
> returning, as in the generic nand_wait() function.
> 
> Signed-off-by: Ivan Djelic <ivan.djelic@parrot.com>

Pushed this one to l2-mtd.git, thanks!

-- 
Best Regards,
Artem Bityutskiy

[-- Attachment #2: This is a digitally signed message part --]
[-- Type: application/pgp-signature, Size: 836 bytes --]

^ permalink raw reply	[flat|nested] 3+ messages in thread

* Re: [PATCH] mtd: nand: omap: fix race condition in omap_wait()
  2012-04-27  5:50 ` Artem Bityutskiy
@ 2012-04-27 11:26   ` Mark Olleson
  0 siblings, 0 replies; 3+ messages in thread
From: Mark Olleson @ 2012-04-27 11:26 UTC (permalink / raw)
  To: linux-mtd; +Cc: Artem Bityutskiy


On 27 Apr 2012, at 06:50, Artem Bityutskiy wrote:

> On Tue, 2012-04-17 at 13:11 +0200, Ivan Djelic wrote:
>> If a context switch occurs in function omap_wait() just before the
>> while loop is entered, then upon return from context switch the
>> timeout may already have elapsed: in that case, status is never
>> read from NAND device, and omap_wait() returns an error.
>> This failure has been experimentally observed during stress tests.
>> 
>> This patch ensures a NAND status read is always performed before
>> returning, as in the generic nand_wait() function.
>> 
>> Signed-off-by: Ivan Djelic <ivan.djelic@parrot.com>
> 
> Pushed this one to l2-mtd.git, thanks!
> 

I'm investigating a problem where omap_wait() returns (apparently after the timeout) without the device being ready.    In my case, the loop has run at least once (my system is lightly loaded so cond_resched() is unlikely to block us for long).   This patch will help in the case where cond_resched() blocks the tread beyond the timeout as well as the case where a context switch occurs before ever reading a value. 

When the timeout is reached without the device becoming ready,  omap_wait() returns with a status value with the NAND_STATUS_FAIL clear, but in many places where omap_wait() is called from only check for  NAND_STATUS_FAIL, and then go on to issue further commands to the device - which fail.   

This includes the code enabled by CONFIG_MTD_NAND_VERIFY_WRITE when then reads back garbage and fails.



Mark
---
Mark Olleson - Senior R&D Engineer
Technology Research & Development Group
Yamaha R&D Centre London

^ permalink raw reply	[flat|nested] 3+ messages in thread

end of thread, other threads:[~2012-04-27 11:26 UTC | newest]

Thread overview: 3+ messages (download: mbox.gz follow: Atom feed
-- links below jump to the message on this page --
2012-04-17 11:11 [PATCH] mtd: nand: omap: fix race condition in omap_wait() Ivan Djelic
2012-04-27  5:50 ` Artem Bityutskiy
2012-04-27 11:26   ` Mark Olleson

This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox