* Unexpected reset corrupted Reiser4 filesystem @ 2005-05-25 3:38 Andrew James Wade 2005-05-25 4:49 ` David Masover 0 siblings, 1 reply; 4+ messages in thread From: Andrew James Wade @ 2005-05-25 3:38 UTC (permalink / raw) To: reiserfs-list Hello, One of my Reiser4 filesystems was corrupted by a power glitch. fsck fixed the corruption, but my understanding is that an unexpected reset should not have corrupted the filesystem. I have an image of the corrupted filesystem, is it of any use to anyone? Details: kernel: 2.6.12-rc4-mm2 fsck.reiser4: 1.0.4 I was installing oracle, when the power flickered. I was unable to delete oracle's directory due to what was reported as I/O errors. fsck revealed a corrupted filesystem (FSCK: Node (13142228), item (77), unit (0): Points to the block (12981542) which is in the tree already. The whole subtree is skipped.) I have an image of the partition at this point, and dd reported no errors while copying. The image is unfortunately a bit large to upload (70 GB), but I am happy to run diagnostic tools against it. Andrew Wade ^ permalink raw reply [flat|nested] 4+ messages in thread
* Re: Unexpected reset corrupted Reiser4 filesystem 2005-05-25 3:38 Unexpected reset corrupted Reiser4 filesystem Andrew James Wade @ 2005-05-25 4:49 ` David Masover 2005-05-25 10:46 ` John Dong 2005-05-25 20:39 ` Andrew James Wade 0 siblings, 2 replies; 4+ messages in thread From: David Masover @ 2005-05-25 4:49 UTC (permalink / raw) To: Andrew James Wade; +Cc: reiserfs-list -----BEGIN PGP SIGNED MESSAGE----- Hash: SHA1 Andrew James Wade wrote: > Hello, > > One of my Reiser4 filesystems was corrupted by a power glitch. No filesystem can prepare for a power glitch, AFAIK. > fsck fixed the corruption, but my understanding is that an > unexpected reset should not have corrupted the filesystem. I It's my understanding that an unexpected _should_ not have corrupted the filesystem. Generally, this means that if we have everything working the way it's supposed to right up until someone hits the Reset switch, there _should_ be zero corruption. Caveats: 1.) _should_ is not _is_. Just because the developers can't crash it doesn't mean it's invulnerable. 2.) power flicker is different than the power/reset button. It really all depends on what your hardware actually does when power is cut, but most hard drives do some sort of write caching, and some of them make it impossible to turn that off. To say more, I'd have to know about the physical mechanics involved, but even if you could make the system absolutely invulnerable to power loss, you're still going to lose data (not corrupt -- LOSE) in such an event, unless you have a battery backup. Even then, your drive is eventually going to fail -- so use RAID. And someone is eventually going to rm -rf your RAID, or spill coffee on it, or hit it with a tornado/earthquake/nuke, so have multiple sites and make backups. That's what it all comes down to -- make backups. The fact that you have journalling/transactions/fsck/batteries/RAID is all just to make it a little less catostrophic when stuff does fail. > have an image of the corrupted filesystem, is it of any use to > anyone? Probably someone, not me. I don't work here, but I'll bet money that they are going to ask for something from debugfs. -----BEGIN PGP SIGNATURE----- Version: GnuPG v1.4.1 (GNU/Linux) Comment: Using GnuPG with Thunderbird - http://enigmail.mozdev.org iQIVAwUBQpQD7ngHNmZLgCUhAQLvSA//bPqBcYMs/NRS7mT//zlGF0WK2YlnNUCd 2hvzXxQhKcLMIvaTo9DJ9wzkaDIHot1DZUcuFIU3f4qmb7QIlMbkVoYE1AW34Wg/ /G06KTkHUi0iU+GJ9Dq65Oj4nK1SM4AtBHMpLqhpcW0FExsj9ScmewcSehFNSzDS Frptzvk/iWtkip2rJxReF8rKr170Y2aR+1DNaCUiLLoESTgbF5WMmcJFyyuaa0/y JbP8/kTz4mbLuZuIt6qKswHqUACUA98sK32zeylUOeccKih35FkiaZtegIwti6Op QwfuCQNyhlODVxAUCghMxz6RtF59Zk4hfoWv5f1GHB7/MHgHGQmMeBy3G/Y/y8D/ P29LVyjhyStVmRyfYtDewMIBQz+nx1+StTqZ3jimfwlpMZxOxeGc0IbV9qiM4o6U uYgtf9nrdeywn2HruijYO5xiTUcIbaBUGsvCCxNYrdzMYKEDsk3iTR/Jrxl/8GJl d88NwSAdiU+uUcPkfdlcl+bHk3SB1yX7sjbItOtrUgZxFYsjtuFOusaOo5CkT84l 2lduqioG2u4zWirRk3d3QJEWETykhX4JnjnCjw/qAQ9+JOLc5xjUbgO2fB8d1toy QKGGVTH1P3i8Ux5ISrtP3noNtA/zzGRlhYShzc50bq8c8+/CAiqnx9FZGv854Ttp hpTuQ1Ht0FA= =1k9p -----END PGP SIGNATURE----- ^ permalink raw reply [flat|nested] 4+ messages in thread
* Re: Unexpected reset corrupted Reiser4 filesystem 2005-05-25 4:49 ` David Masover @ 2005-05-25 10:46 ` John Dong 2005-05-25 20:39 ` Andrew James Wade 1 sibling, 0 replies; 4+ messages in thread From: John Dong @ 2005-05-25 10:46 UTC (permalink / raw) To: reiserfs-list [-- Attachment #1: Type: text/plain, Size: 3173 bytes --] If thse were IDE drives, the IDE writeback cache is probably the bad boy -- on FreeBSD 5.x, Soft Updates is virtually broken on IDE drives because they simply haven't written all the data they promised the kernel that they had. On 5/25/05, David Masover <ninja@slaphack.com> wrote: > > -----BEGIN PGP SIGNED MESSAGE----- > Hash: SHA1 > > Andrew James Wade wrote: > > Hello, > > > > One of my Reiser4 filesystems was corrupted by a power glitch. > > No filesystem can prepare for a power glitch, AFAIK. > > > fsck fixed the corruption, but my understanding is that an > > unexpected reset should not have corrupted the filesystem. I > > It's my understanding that an unexpected _should_ not have corrupted the > filesystem. Generally, this means that if we have everything working > the way it's supposed to right up until someone hits the Reset switch, > there _should_ be zero corruption. > > Caveats: > 1.) _should_ is not _is_. Just because the developers can't crash it > doesn't mean it's invulnerable. > 2.) power flicker is different than the power/reset button. It really > all depends on what your hardware actually does when power is cut, but > most hard drives do some sort of write caching, and some of them make it > impossible to turn that off. > > To say more, I'd have to know about the physical mechanics involved, but > even if you could make the system absolutely invulnerable to power loss, > you're still going to lose data (not corrupt -- LOSE) in such an event, > unless you have a battery backup. Even then, your drive is eventually > going to fail -- so use RAID. And someone is eventually going to rm -rf > your RAID, or spill coffee on it, or hit it with a > tornado/earthquake/nuke, so have multiple sites and make backups. > > That's what it all comes down to -- make backups. The fact that you > have journalling/transactions/fsck/batteries/RAID is all just to make it > a little less catostrophic when stuff does fail. > > > have an image of the corrupted filesystem, is it of any use to > > anyone? > > Probably someone, not me. I don't work here, but I'll bet money that > they are going to ask for something from debugfs. > > -----BEGIN PGP SIGNATURE----- > Version: GnuPG v1.4.1 (GNU/Linux) > Comment: Using GnuPG with Thunderbird - http://enigmail.mozdev.org > > iQIVAwUBQpQD7ngHNmZLgCUhAQLvSA//bPqBcYMs/NRS7mT//zlGF0WK2YlnNUCd > 2hvzXxQhKcLMIvaTo9DJ9wzkaDIHot1DZUcuFIU3f4qmb7QIlMbkVoYE1AW34Wg/ > /G06KTkHUi0iU+GJ9Dq65Oj4nK1SM4AtBHMpLqhpcW0FExsj9ScmewcSehFNSzDS > Frptzvk/iWtkip2rJxReF8rKr170Y2aR+1DNaCUiLLoESTgbF5WMmcJFyyuaa0/y > JbP8/kTz4mbLuZuIt6qKswHqUACUA98sK32zeylUOeccKih35FkiaZtegIwti6Op > QwfuCQNyhlODVxAUCghMxz6RtF59Zk4hfoWv5f1GHB7/MHgHGQmMeBy3G/Y/y8D/ > P29LVyjhyStVmRyfYtDewMIBQz+nx1+StTqZ3jimfwlpMZxOxeGc0IbV9qiM4o6U > uYgtf9nrdeywn2HruijYO5xiTUcIbaBUGsvCCxNYrdzMYKEDsk3iTR/Jrxl/8GJl > d88NwSAdiU+uUcPkfdlcl+bHk3SB1yX7sjbItOtrUgZxFYsjtuFOusaOo5CkT84l > 2lduqioG2u4zWirRk3d3QJEWETykhX4JnjnCjw/qAQ9+JOLc5xjUbgO2fB8d1toy > QKGGVTH1P3i8Ux5ISrtP3noNtA/zzGRlhYShzc50bq8c8+/CAiqnx9FZGv854Ttp > hpTuQ1Ht0FA= > =1k9p > -----END PGP SIGNATURE----- > [-- Attachment #2: Type: text/html, Size: 3622 bytes --] ^ permalink raw reply [flat|nested] 4+ messages in thread
* Re: Unexpected reset corrupted Reiser4 filesystem 2005-05-25 4:49 ` David Masover 2005-05-25 10:46 ` John Dong @ 2005-05-25 20:39 ` Andrew James Wade 1 sibling, 0 replies; 4+ messages in thread From: Andrew James Wade @ 2005-05-25 20:39 UTC (permalink / raw) To: reiserfs-list John Dong wrote: > If thse were IDE drives, the IDE writeback cache is probably the bad boy -- > on FreeBSD 5.x, Soft Updates is virtually broken on IDE drives because they > simply haven't written all the data they promised the kernel that they had. I do indeed have an IDE drive (Seagate Barracuda) with a writeback cache. But I thought that write barriers were now working (by flushing the writeback cache if the drive doesn't support anything fancier). However, I couldn't find any updates on the write barrier work since March of last year. (<http://lwn.net/Articles/77074/>). So the writeback cache may indeed be the bad boy. On May 25, 2005 12:49 am, David Masover wrote: > That's what it all comes down to -- make backups. The fact that you > have journalling/transactions/fsck/batteries/RAID is all just to make it > a little less catostrophic when stuff does fail. Yup, time for me to make backups. Thanks for the nudge. Especially as I'm running a bleeding-edge kernel. (I did have one eat some of my data). Andrew P.S. My internet connection's been flaky lately, so apologies for any bounces. I check the mailing list archives for missed messages. ^ permalink raw reply [flat|nested] 4+ messages in thread
end of thread, other threads:[~2005-05-25 20:39 UTC | newest] Thread overview: 4+ messages (download: mbox.gz follow: Atom feed -- links below jump to the message on this page -- 2005-05-25 3:38 Unexpected reset corrupted Reiser4 filesystem Andrew James Wade 2005-05-25 4:49 ` David Masover 2005-05-25 10:46 ` John Dong 2005-05-25 20:39 ` Andrew James Wade
This is an external index of several public inboxes, see mirroring instructions on how to clone and mirror all data and code used by this external index.