Re: mtd/fs/jffs2 erase.c,1.84,1.85 roll back

public inbox for linux-mtd@lists.infradead.org
 help / color / mirror / Atom feed

* Re: mtd/fs/jffs2 erase.c,1.84,1.85 roll back
       [not found] ` <43303409.5030406@users.sourceforge.net>
@ 2005-09-20 16:48   ` Artem B. Bityutskiy
  2005-09-20 19:49     ` Anders Grafstrom
  0 siblings, 1 reply; 7+ messages in thread
From: Artem B. Bityutskiy @ 2005-09-20 16:48 UTC (permalink / raw)
  To: Anders Grafstrom; +Cc: linux-mtd

Hello,

Anders Grafstrom wrote:
> Artem Bityutskiy wrote:
>> Index: erase.c
>> ===================================================================
>> RCS file: /home/cvs/mtd/fs/jffs2/erase.c,v
>> retrieving revision 1.84
>> retrieving revision 1.85
>> diff -u -r1.84 -r1.85
>> --- erase.c    20 Sep 2005 14:27:34 -0000    1.84
>> +++ erase.c    20 Sep 2005 14:53:15 -0000    1.85
>> @@ -337,7 +337,6 @@
>>              if (*datum + 1) {
>>                  *bad_offset += i;
>>                  printk(KERN_WARNING "Newly-erased block contained 
>> word 0x%lx at offset 0x%08x\n", *datum, *bad_offset);
>> -                ret = -EIO;
>>                  goto fail;
>>              }
>>          }
> 
> 
> May I ask why you rolled back this change?
No problems :-)

> I have a problem, that I'm trying to solve, with blocks that are not 
> erased despite a "successful" return from c->mtd->erase(). The result of 
> this is a very broken filesystem when the blocks are reused. It seems to 
> me that the above rolled back change would prevent the filesystem from 
> being broken in this situation(?).

Let's analyse the situation.

JFFS2 issues the erase command, MTD returns OK, but the eraseblock does 
not contain all 0xFFs. Is this JFFS2's fault? No. this is driver's 
fault. The driver must not return success in this case. It must return 
an error.

And the check in JFFS2 that the eraseblock contains all FFs is *wrong*. 
That's not JFFS2's business at all. That's MTD's business. MTD 
guarantees that it either erases the eraseblock or returns error. My 
oppinion is that this check must go to MTD. Perhaps, you may embrace it 
by #ifdefs - this is the implementation detal.

So, JFFS2 does what it is not supposed to do at all. It slows down 
things by this. I wanted to remove the check, but dwmw2 loudly 
complained. He wants to be as reliable as it is possible.

Ok, now about the rollback. If we return -EIO, JFFS2 will mark the block 
as bad. Imagine your driver is broken, and the block is not actually 
bad. That's just driver. JFFS2 will mark nearly all eraseblocks as bad. 
What will you do next? In case of NAND, this assumes you're in trouble 
unless you have a list of real bad blocks.

So, this is why I rolled back. Probably the right approach will be to 
add such blocks to the JFFS2 in-RAM bad block list but not mark them as 
bad physically. But this is a distinct activity. I didn't explore this.

That "fix" slipped there accidentally. I didn't test it. So I removed 
it. If you are energetic enough, work this problem out please :-)

-- 
Best Regards,
Artem B. Bityuckiy,
St.-Petersburg, Russia.

^ permalink raw reply	[flat|nested] 7+ messages in thread

* Re: mtd/fs/jffs2 erase.c,1.84,1.85 roll back
  2005-09-20 16:48   ` mtd/fs/jffs2 erase.c,1.84,1.85 roll back Artem B. Bityutskiy
@ 2005-09-20 19:49     ` Anders Grafstrom
  2005-09-22 14:34       ` Artem B. Bityutskiy
  0 siblings, 1 reply; 7+ messages in thread
From: Anders Grafstrom @ 2005-09-20 19:49 UTC (permalink / raw)
  To: Artem B. Bityutskiy; +Cc: linux-mtd

[-- Attachment #1: Type: text/plain, Size: 2285 bytes --]

Artem B. Bityutskiy wrote:
> Let's analyse the situation.
> 
> JFFS2 issues the erase command, MTD returns OK, but the eraseblock does 
> not contain all 0xFFs. Is this JFFS2's fault? No. this is driver's 
> fault. The driver must not return success in this case. It must return 
> an error.
> 
> And the check in JFFS2 that the eraseblock contains all FFs is *wrong*. 
> That's not JFFS2's business at all. That's MTD's business. MTD 
> guarantees that it either erases the eraseblock or returns error. My 
> oppinion is that this check must go to MTD. Perhaps, you may embrace it 
> by #ifdefs - this is the implementation detal.
> 
> So, JFFS2 does what it is not supposed to do at all. It slows down 
> things by this. I wanted to remove the check, but dwmw2 loudly 
> complained. He wants to be as reliable as it is possible.
It would be nice if the MTD layer could be trusted :)

> Ok, now about the rollback. If we return -EIO, JFFS2 will mark the block 
> as bad. Imagine your driver is broken, and the block is not actually 
> bad. That's just driver. JFFS2 will mark nearly all eraseblocks as bad. 
> What will you do next? In case of NAND, this assumes you're in trouble 
> unless you have a list of real bad blocks.
> 
> So, this is why I rolled back. Probably the right approach will be to 
> add such blocks to the JFFS2 in-RAM bad block list but not mark them as 
> bad physically. But this is a distinct activity. I didn't explore this.
> 
> That "fix" slipped there accidentally. I didn't test it. So I removed 
> it. If you are energetic enough, work this problem out please :-)
> 

So if we return -EIO, then we go to jffs2_erase_failed(). For NAND, if 
the MTD driver reported erase failure (bad_offset != 0xffffffff), a 
retry is made before giving up on the block and marking it bad. Also the 
read operation may have failed, erase may have failed and MTD missed it, 
write of clean marker may have failed. As it is these cases will put the 
block on the bad_list. In the case of NOR it will also be put on the 
bad_list at once.
But how about retrying for all of these cases?

I'm thinking something like this patch...
(It does not make sure that it was a device level failure that occurred 
two times before marking NAND blocks bad physically. So it's flawed.)

[-- Attachment #2: erase_retry.patch --]
[-- Type: text/plain, Size: 3370 bytes --]

Index: mtd/fs/jffs2/erase.c
===================================================================
RCS file: /home/cvs/mtd/fs/jffs2/erase.c,v
retrieving revision 1.85
diff -u -r1.85 erase.c
--- mtd/fs/jffs2/erase.c	20 Sep 2005 14:53:15 -0000	1.85
+++ mtd/fs/jffs2/erase.c	20 Sep 2005 19:31:58 -0000
@@ -20,6 +20,9 @@
 #include <linux/pagemap.h>
 #include "nodelist.h"
 
+/* max. erase failures before we mark a block bad */
+#define MAX_ERASE_FAILURES 	2
+
 struct erase_priv_struct {
 	struct jffs2_eraseblock *jeb;
 	struct jffs2_sb_info *c;
@@ -171,24 +174,24 @@
 
 static void jffs2_erase_failed(struct jffs2_sb_info *c, struct jffs2_eraseblock *jeb, uint32_t bad_offset)
 {
-	/* For NAND, if the failure did not occur at the device level for a
-	   specific physical page, don't bother updating the bad block table. */
-	if (jffs2_cleanmarker_oob(c) && (bad_offset != 0xffffffff)) {
-		/* We had a device-level failure to erase.  Let's see if we've
-		   failed too many times. */
-		if (!jffs2_write_nand_badblock(c, jeb, bad_offset)) {
-			/* We'd like to give this block another try. */
-			spin_lock(&c->erase_completion_lock);
-			list_del(&jeb->list);
-			list_add(&jeb->list, &c->erase_pending_list);
-			c->erasing_size -= c->sector_size;
-			c->dirty_size += c->sector_size;
-			jeb->dirty_size = c->sector_size;
-			spin_unlock(&c->erase_completion_lock);
-			return;
-		}
+	if (++jeb->bad_count < MAX_ERASE_FAILURES) {
+		/* We'd like to give this block another try. */
+		printk(KERN_WARNING "Retrying erase at 0x%08x\n", jeb->offset);
+		spin_lock(&c->erase_completion_lock);
+		list_del(&jeb->list);
+		list_add(&jeb->list, &c->erase_pending_list);
+		c->erasing_size -= c->sector_size;
+		c->dirty_size += c->sector_size;
+		jeb->dirty_size = c->sector_size;
+		spin_unlock(&c->erase_completion_lock);
+		return;
 	}
 
+	/* For NAND, if the failure did occur at the device level for a
+	   specific physical page, update the bad block table. */
+	if (jffs2_cleanmarker_oob(c) && (bad_offset != 0xffffffff))
+		jffs2_write_nand_badblock(c, jeb, bad_offset);
+
 	spin_lock(&c->erase_completion_lock);
 	c->erasing_size -= c->sector_size;
 	c->bad_size += c->sector_size;
@@ -337,6 +340,7 @@
 			if (*datum + 1) {
 				*bad_offset += i;
 				printk(KERN_WARNING "Newly-erased block contained word 0x%lx at offset 0x%08x\n", *datum, *bad_offset);
+				ret = -EIO;
 				goto fail;
 			}
 		}
@@ -425,6 +429,8 @@
 		jeb->wasted_size = 0;
 	}
 
+	jeb->bad_count = 0;
+
 	spin_lock(&c->erase_completion_lock);
 	c->erasing_size -= c->sector_size;
 	c->free_size += jeb->free_size;
Index: mtd/fs/jffs2/wbuf.c
===================================================================
RCS file: /home/cvs/mtd/fs/jffs2/wbuf.c,v
retrieving revision 1.98
diff -u -r1.98 wbuf.c
--- mtd/fs/jffs2/wbuf.c	7 Sep 2005 08:34:55 -0000	1.98
+++ mtd/fs/jffs2/wbuf.c	20 Sep 2005 19:31:59 -0000
@@ -28,9 +28,6 @@
 static unsigned char *brokenbuf;
 #endif
 
-/* max. erase failures before we mark a block bad */
-#define MAX_ERASE_FAILURES 	2
-
 /* two seconds timeout for timed wbuf-flushing */
 #define WBUF_FLUSH_TIMEOUT	2 * HZ
 
@@ -1099,10 +1096,6 @@
 {
 	int 	ret;
 
-	/* if the count is < max, we try to write the counter to the 2nd page oob area */
-	if( ++jeb->bad_count < MAX_ERASE_FAILURES)
-		return 0;
-
 	if (!c->mtd->block_markbad)
 		return 1; // What else can we do?
 

^ permalink raw reply	[flat|nested] 7+ messages in thread

* Re: mtd/fs/jffs2 erase.c,1.84,1.85 roll back
  2005-09-20 19:49     ` Anders Grafstrom
@ 2005-09-22 14:34       ` Artem B. Bityutskiy
  2005-09-23 15:22         ` Anders Grafstrom
  0 siblings, 1 reply; 7+ messages in thread
From: Artem B. Bityutskiy @ 2005-09-22 14:34 UTC (permalink / raw)
  To: Anders Grafstrom; +Cc: linux-mtd

On Tue, 2005-09-20 at 21:49 +0200, Anders Grafstrom wrote:
> So if we return -EIO, then we go to jffs2_erase_failed(). For NAND, if 
> the MTD driver reported erase failure (bad_offset != 0xffffffff), a 
> retry is made before giving up on the block and marking it bad. Also the 
> read operation may have failed, erase may have failed and MTD missed it, 
> write of clean marker may have failed. As it is these cases will put the 
> block on the bad_list. In the case of NOR it will also be put on the 
> bad_list at once.
> But how about retrying for all of these cases?

Sorry, no more today. What does your patch do? Adds the second try? What
for?

-- 
Best Regards,
Artem B. Bityuckiy,
St.-Petersburg, Russia.

^ permalink raw reply	[flat|nested] 7+ messages in thread

* Re: mtd/fs/jffs2 erase.c,1.84,1.85 roll back
  2005-09-22 14:34       ` Artem B. Bityutskiy
@ 2005-09-23 15:22         ` Anders Grafstrom
  2005-09-25 12:54           ` Jörn Engel
  0 siblings, 1 reply; 7+ messages in thread
From: Anders Grafstrom @ 2005-09-23 15:22 UTC (permalink / raw)
  To: dedekind; +Cc: linux-mtd

Artem B. Bityutskiy wrote:
> On Tue, 2005-09-20 at 21:49 +0200, Anders Grafstrom wrote:
> 
>>So if we return -EIO, then we go to jffs2_erase_failed(). For NAND, if 
>>the MTD driver reported erase failure (bad_offset != 0xffffffff), a 
>>retry is made before giving up on the block and marking it bad. Also the 
>>read operation may have failed, erase may have failed and MTD missed it, 
>>write of clean marker may have failed. As it is these cases will put the 
>>block on the bad_list. In the case of NOR it will also be put on the 
>>bad_list at once.
>>But how about retrying for all of these cases?
> 
> 
> Sorry, no more today. What does your patch do? Adds the second try? What
> for?
> 

jffs2 makes a second try if a failed erase is reported by the mtd 
driver. But only for NAND (in jffs2_write_nand_badblock()). What I'm 
after is a retry for NOR as well. But I wrote the patch so it would 
retry for other cases of failure too.

Regards,
  Anders

^ permalink raw reply	[flat|nested] 7+ messages in thread

* Re: mtd/fs/jffs2 erase.c,1.84,1.85 roll back
  2005-09-23 15:22         ` Anders Grafstrom
@ 2005-09-25 12:54           ` Jörn Engel
  2005-09-26  8:03             ` Artem B. Bityutskiy
  0 siblings, 1 reply; 7+ messages in thread
From: Jörn Engel @ 2005-09-25 12:54 UTC (permalink / raw)
  To: Anders Grafstrom; +Cc: linux-mtd

On Fri, 23 September 2005 17:22:53 +0200, Anders Grafstrom wrote:
> 
> jffs2 makes a second try if a failed erase is reported by the mtd 
> driver. But only for NAND (in jffs2_write_nand_badblock()). What I'm 
> after is a retry for NOR as well. But I wrote the patch so it would 
> retry for other cases of failure too.

Is a retry really an improvement?  When a block cannot be properly
erased on the first try, I would not trust it anymore.  Instead of a
retry, I'd rather mark it as a bad block and not touch it again.

Jörn

-- 
When you close your hand, you own nothing. When you open it up, you
own the whole world.
-- Li Mu Bai in Tiger & Dragon

^ permalink raw reply	[flat|nested] 7+ messages in thread

* Re: mtd/fs/jffs2 erase.c,1.84,1.85 roll back
  2005-09-25 12:54           ` Jörn Engel
@ 2005-09-26  8:03             ` Artem B. Bityutskiy
  2005-09-26  8:31               ` Jörn Engel
  0 siblings, 1 reply; 7+ messages in thread
From: Artem B. Bityutskiy @ 2005-09-26  8:03 UTC (permalink / raw)
  To: Jörn Engel; +Cc: linux-mtd, Anders Grafstrom

Jörn Engel wrote:
> On Fri, 23 September 2005 17:22:53 +0200, Anders Grafstrom wrote:
> 
>>jffs2 makes a second try if a failed erase is reported by the mtd 
>>driver. But only for NAND (in jffs2_write_nand_badblock()). What I'm 
>>after is a retry for NOR as well. But I wrote the patch so it would 
>>retry for other cases of failure too.
> 
> Is a retry really an improvement?  When a block cannot be properly
> erased on the first try, I would not trust it anymore.  Instead of a
> retry, I'd rather mark it as a bad block and not touch it again.

 From the other hand, why not to retry? Does it hurt? Is this in the 
critical code-path? Deity knows what happend to the hardware/driver, may 
be there was a short magnetical disturbance and that was the reason why 
the block was not erased. May be one of the lines/cirquits outside of 
the flash was affected but the block is really erasable. Actually, after 
I've read the paper that *you* pointed 
(http://www.cs.wisc.edu/adsl/Publications/iron-sosp05.pdf), I'm rather 
positive about the idea of retrying. The articlae is about HDDs, but IMO 
is also acceptable flash devices in this respect.

-- 
Best Regards,
Artem B. Bityuckiy,
St.-Petersburg, Russia.

^ permalink raw reply	[flat|nested] 7+ messages in thread

* Re: mtd/fs/jffs2 erase.c,1.84,1.85 roll back
  2005-09-26  8:03             ` Artem B. Bityutskiy
@ 2005-09-26  8:31               ` Jörn Engel
  0 siblings, 0 replies; 7+ messages in thread
From: Jörn Engel @ 2005-09-26  8:31 UTC (permalink / raw)
  To: Artem B. Bityutskiy; +Cc: linux-mtd, Anders Grafstrom

On Mon, 26 September 2005 12:03:17 +0400, Artem B. Bityutskiy wrote:
> Jörn Engel wrote:
> >On Fri, 23 September 2005 17:22:53 +0200, Anders Grafstrom wrote:
> >
> >>jffs2 makes a second try if a failed erase is reported by the mtd 
> >>driver. But only for NAND (in jffs2_write_nand_badblock()). What I'm 
> >>after is a retry for NOR as well. But I wrote the patch so it would 
> >>retry for other cases of failure too.
> >
> >Is a retry really an improvement?  When a block cannot be properly
> >erased on the first try, I would not trust it anymore.  Instead of a
> >retry, I'd rather mark it as a bad block and not touch it again.
> 
> From the other hand, why not to retry? Does it hurt? Is this in the 
> critical code-path? Deity knows what happend to the hardware/driver, may 
> be there was a short magnetical disturbance and that was the reason why 
> the block was not erased. May be one of the lines/cirquits outside of 
> the flash was affected but the block is really erasable. Actually, after 
> I've read the paper that *you* pointed 
> (http://www.cs.wisc.edu/adsl/Publications/iron-sosp05.pdf), I'm rather 
> positive about the idea of retrying. The articlae is about HDDs, but IMO 
> is also acceptable flash devices in this respect.

Depends on where the problem is.  An alpha particle messing up the bus
- sure, just retry and it'll work.  Something in the flash - by all
chances the block is messed up and shouldn't be used again.

As a default, I'd consider the block in question to be dead.  I don't
mind a retry per se, but that should be the exception.  If the first
retry didn't work, you can be pretty sure that it was not just an
alpha particle and should declare your hardware dead.  It doesn't even
matter whether it's the flash, the bus or anything else.

Jörn

-- 
Ninety percent of everything is crap.
-- Sturgeon's Law

^ permalink raw reply	[flat|nested] 7+ messages in thread

end of thread, other threads:[~2005-09-26  8:33 UTC | newest]

Thread overview: 7+ messages (download: mbox.gz follow: Atom feed
-- links below jump to the message on this page --
     [not found] <E1EHjUk-0001IT-Dn@phoenix.infradead.org>
     [not found] ` <43303409.5030406@users.sourceforge.net>
2005-09-20 16:48   ` mtd/fs/jffs2 erase.c,1.84,1.85 roll back Artem B. Bityutskiy
2005-09-20 19:49     ` Anders Grafstrom
2005-09-22 14:34       ` Artem B. Bityutskiy
2005-09-23 15:22         ` Anders Grafstrom
2005-09-25 12:54           ` Jörn Engel
2005-09-26  8:03             ` Artem B. Bityutskiy
2005-09-26  8:31               ` Jörn Engel

This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox