public inbox for linux-mtd@lists.infradead.org
 help / color / mirror / Atom feed
* ubifs panic with 2.6.39 stable - followup
@ 2012-01-04 11:39 Brad Parker
  2012-01-04 16:42 ` Artem Bityutskiy
  2012-01-04 16:54 ` Artem Bityutskiy
  0 siblings, 2 replies; 3+ messages in thread
From: Brad Parker @ 2012-01-04 11:39 UTC (permalink / raw)
  To: linux-mtd@lists.infradead.org

[sorry for the duplicate post; I tried to post a followup with just
  the url to the pastebin log but for some reason  it got stuck
  waiting for the moderator]

Running and older 2.6.31 kernel with UBIFS I see a panic which appears
to be during recovery of a bad block. The root fs (which is UBIFS)
won't mount and the kernel panics.

I upgraded the kernel 2.6.39-stable, hoping that would fix the problem,
as I noticed a lot of recovery fixes had gone in.  It still panics;
it appears the recovery fails.

I rebooted with "ignore_loglevel" and the output is here:

     http://pastebin.com/ETJjP4uw

The board is essentially an Olimex SAM9-L9260, with Samsung NAND.

I'm curious if this looks familiar and if it might be fixed post 2.5.39

thanks for any insight.

-brad

UBIFS: recovery needed
UBI error: ubi_io_read: error -74 (ECC error) while reading 126976 bytes 
from PEB 2970:4096, read 126976 bytes
UBIFS error (pid 1): ubifs_check_node: bad CRC: calculated 0xf510fb95, 
read 0x4f0a3196
UBIFS error (pid 1): ubifs_check_node: bad node at LEB 3589:45128
         magic          0x6101831
         crc            0x4f0a3196
         node_type      0 (inode node)
         group_type     2 (last of node group)
         sqnum          69874250
         len            160
         key            (23417, inode)
         creat_sqnum    230076
         size           608
         nlink          2
         atime          1306744746.0
         mtime          1316662493.0
         ctime          1316662493.0
         uid            0
         gid            0
         mode           16832
         flags          0x1
         xattr_cnt      2147483648
         xattr_size     0
         xattr_names    0
         compr_type     0x0
         data len       0
...
UBIFS error (pid 1): ubifs_recover_leb: bad node
UBIFS error (pid 1): ubifs_scanned_corruption: corruption at LEB 3589:45128
UBIFS error (pid 1): ubifs_scanned_corruption: first 8192 bytes from LEB 
3589:45128
...
UBIFS error (pid 1): ubifs_recover_leb: LEB 3589 scanning failed

^ permalink raw reply	[flat|nested] 3+ messages in thread

* Re: ubifs panic with 2.6.39 stable - followup
  2012-01-04 11:39 ubifs panic with 2.6.39 stable - followup Brad Parker
@ 2012-01-04 16:42 ` Artem Bityutskiy
  2012-01-04 16:54 ` Artem Bityutskiy
  1 sibling, 0 replies; 3+ messages in thread
From: Artem Bityutskiy @ 2012-01-04 16:42 UTC (permalink / raw)
  To: Brad Parker; +Cc: linux-mtd@lists.infradead.org

[-- Attachment #1: Type: text/plain, Size: 1700 bytes --]

On Wed, 2012-01-04 at 06:39 -0500, Brad Parker wrote:
> [sorry for the duplicate post; I tried to post a followup with just
>   the url to the pastebin log but for some reason  it got stuck
>   waiting for the moderator]
> 
> Running and older 2.6.31 kernel with UBIFS I see a panic which appears
> to be during recovery of a bad block. The root fs (which is UBIFS)
> won't mount and the kernel panics.
> 
> I upgraded the kernel 2.6.39-stable, hoping that would fix the problem,
> as I noticed a lot of recovery fixes had gone in.  It still panics;
> it appears the recovery fails.
> 
> I rebooted with "ignore_loglevel" and the output is here:
> 
>      http://pastebin.com/ETJjP4uw
> 
> The board is essentially an Olimex SAM9-L9260, with Samsung NAND.
> 
> I'm curious if this looks familiar and if it might be fixed post 2.5.39
> 
> thanks for any insight.

Would you please also enable scanning, mount, and recovery debugging
messages and provide the log? 

http://www.linux-mtd.infradead.org/faq/ubifs.html#L_how_debug

But in newer kernels this has been turned into dynamic debug, see
Documentation/dynamic-debug-howto.txt

(Gosh, I have to update the web site!)

In older kernels you can do this via the configuration menu, in newer
kernel this has been turned into Documentation/dynamic-debug-howto.txt

To use them at boot time, use ddebug_query=...

You need to use 3 format string to show "UBIFS DBG mnt" and the like.

I think, but did not verify, you need to add kernel boot 3 parameters:

ddebug_query="format UBIFS DBG rcvry" ddebug_query="format UBIFS DBG
mnt" ddebug_query="format UBIFS DBG scan"

-- 
Best Regards,
Artem Bityutskiy

[-- Attachment #2: This is a digitally signed message part --]
[-- Type: application/pgp-signature, Size: 836 bytes --]

^ permalink raw reply	[flat|nested] 3+ messages in thread

* Re: ubifs panic with 2.6.39 stable - followup
  2012-01-04 11:39 ubifs panic with 2.6.39 stable - followup Brad Parker
  2012-01-04 16:42 ` Artem Bityutskiy
@ 2012-01-04 16:54 ` Artem Bityutskiy
  1 sibling, 0 replies; 3+ messages in thread
From: Artem Bityutskiy @ 2012-01-04 16:54 UTC (permalink / raw)
  To: Brad Parker; +Cc: linux-mtd@lists.infradead.org

[-- Attachment #1: Type: text/plain, Size: 1986 bytes --]

On Wed, 2012-01-04 at 06:39 -0500, Brad Parker wrote:
> [sorry for the duplicate post; I tried to post a followup with just
>   the url to the pastebin log but for some reason  it got stuck
>   waiting for the moderator]
> 
> Running and older 2.6.31 kernel with UBIFS I see a panic which appears
> to be during recovery of a bad block. The root fs (which is UBIFS)
> won't mount and the kernel panics.
> 
> I upgraded the kernel 2.6.39-stable, hoping that would fix the problem,
> as I noticed a lot of recovery fixes had gone in.  It still panics;
> it appears the recovery fails.
> 
> I rebooted with "ignore_loglevel" and the output is here:
> 
>      http://pastebin.com/ETJjP4uw
> 
> The board is essentially an Olimex SAM9-L9260, with Samsung NAND.
> 
> I'm curious if this looks familiar and if it might be fixed post 2.5.39
> 
> thanks for any insight.
> 
> -brad
> 
> UBIFS: recovery needed
> UBI error: ubi_io_read: error -74 (ECC error) while reading 126976 bytes 
> from PEB 2970:4096, read 126976 bytes
> UBIFS error (pid 1): ubifs_check_node: bad CRC: calculated 0xf510fb95, 
> read 0x4f0a3196

So there is a corrupted inode node, and UBIFS believes it has been
corrupted not because of a power cut. I do not know why it is corrupted,
but if you use MLC then this may be related to the paired pages problem.

Anyway, there is another issue I see from the dump. Even if you somehow
make the node good again, UBIFS will still fail saying something like
"corrupt empty space". Look at line 292 in your pastebin:

ffffffff ffffffff ffffffff ffffffff ffffffff ffdfffff ffffffff ffffffff

See that little "d"? It means that the empty space has a bit-flip. The
question is why? Unstable bit I guess? Does your NAND driver / HW
provides ECC protection for empty pages?

Anyway, currently UBIFS cannot handle these situation. Someone needs to
do this - I can assist by reviewing and advising.

-- 
Best Regards,
Artem Bityutskiy

[-- Attachment #2: This is a digitally signed message part --]
[-- Type: application/pgp-signature, Size: 836 bytes --]

^ permalink raw reply	[flat|nested] 3+ messages in thread

end of thread, other threads:[~2012-01-04 16:52 UTC | newest]

Thread overview: 3+ messages (download: mbox.gz follow: Atom feed
-- links below jump to the message on this page --
2012-01-04 11:39 ubifs panic with 2.6.39 stable - followup Brad Parker
2012-01-04 16:42 ` Artem Bityutskiy
2012-01-04 16:54 ` Artem Bityutskiy

This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox