public inbox for linux-kernel@vger.kernel.org
 help / color / mirror / Atom feed
* ext3 mount infinite loop over orphan list issue, please release 2.6.27
@ 2008-09-15 22:09 Amit Chaudhary
  2008-09-16 12:53 ` Theodore Tso
  0 siblings, 1 reply; 3+ messages in thread
From: Amit Chaudhary @ 2008-09-15 22:09 UTC (permalink / raw)
  To: linux-kernel

Hello,

Over the weekend, due to a crash, I ran into the ext3 mount infinite 
loop over orphan list issue. This was on Ubuntu 8.04. I tried many 
things, including using 18 month old distributions, nothing works. Only 
solution is to boot off a alpha version of next Ubunuty which has the 
2.6.27 kernel (rc1 has the fix), more details are below:

Can you please release 2.6.27 so that it can make it to stable 
distributions.

Needless to say, this is bad. End Users should not have to go through 
kernel patches, logs and try out alpha OSes to boot up from a crash and 
save their filesystem contents.

Thanks
Amit

-------------------------------------------------

After crash, Ubuntu does not come up, ext3 root filesystem fails to mount
https://bugs.launchpad.net/bugs/270371


** Affects: ubuntu
     Importance: Undecided
         Status: New


** Tags: ext3 kernel

-- 

Hello,

Ubuntu 8.0.4 crashed while opening a .mpg in totem player. it was a hang, no response to keyboard, the mouse was moving but not response to clicks. Ctrl+Alt+F1, etc did not open a console.
On a reboot, the system did not come up. Neither did rescue mode. Rescue mode shows the last few kernel messages were
    ext3: recovery required
   Orphan cleanup on readonly filesystem

which means Root filesystem with ext3 fails to mount.

I then tried booting with the 8.04 live CD, even the CD never boots up.
When running without the flags quiet & slpash, again the above
statements are the last printed.

I tried all of following Ubuntu live cds, 8.04.1, 7.10, 7.04, no luck.
The message is not printed in the last one, but it is stuck.

Does anyone know of any solution? Is there a way of not loading the
rootfs from the live CD?

I also found there is a fix for infinite loop in 2.6.27
(http://www.kernel.org/pub/linux/kernel/v2.6/testing/ChangeLog-2.6.27-rc1),
but am not sure if this is that issue

Thanks
Amit




------------------------------------

Ok, so it was the problem from the link, rebooting from a live cd for Intrepid (Ubunty 8.10) Alpha 5 which has kernel 2.6.27 fixed it.
Incase your box hangs or goes in an infinite loop after a crash, this is one of the solutions.

It printed some error messages such as bad orphan node, did you run
e2fsck, n_inode = 1 , etc and then rebooting off the installed Ubuntu
worked.

Should mention, this is bad. Users should not have to go through kernel
patches, logs and try out alpha OSes to boot up from a crash.

Amit



^ permalink raw reply	[flat|nested] 3+ messages in thread

* Re: ext3 mount infinite loop over orphan list issue, please release 2.6.27
  2008-09-15 22:09 ext3 mount infinite loop over orphan list issue, please release 2.6.27 Amit Chaudhary
@ 2008-09-16 12:53 ` Theodore Tso
  2008-09-16 16:42   ` Amit D. Chaudhary
  0 siblings, 1 reply; 3+ messages in thread
From: Theodore Tso @ 2008-09-16 12:53 UTC (permalink / raw)
  To: Amit Chaudhary; +Cc: linux-kernel

On Mon, Sep 15, 2008 at 03:09:22PM -0700, Amit Chaudhary wrote:
> Over the weekend, due to a crash, I ran into the ext3 mount infinite  
> loop over orphan list issue. This was on Ubuntu 8.04. I tried many  
> things, including using 18 month old distributions, nothing works. Only  
> solution is to boot off a alpha version of next Ubunuty which has the  
> 2.6.27 kernel (rc1 has the fix), more details are below:
>
> Can you please release 2.6.27 so that it can make it to stable  
> distributions.

As you point out, this problem has been fixed in 2.6.27-rc1.
Unfortunately, it is a problem which has been around for a very long
time --- perhaps since ext3 was first written.  No one had noticed the
problem for a very long time; in fact the first time someone reported
it as far I know was June 6, 2008, when Sami Liedes found it via a
synthetic testing program, fsfuzzer, which creates filesystems, then
corrupts them randomly and then sees whether or not they cause the
kernel to panic or hang when they are mounted.

You seem to have had the bad luck of running into the problem in a
real world situation relatively recently, but this has been a
long-standing problem.

As far as releasing 2.6.27, there still is a fairly large set of
regressions (i.e., bugs introduced in 2.6.27-rc1 or later) which need
to be fixed before we can release it; otherwise more users will have
bad experiences when older kernels that had worked fine for them will
break for them.  So while it might have helped you to have released
2.6.27, it could make things worse for many other users.  So that's
not a realistic option.  If I had to guess, given currently the
projected regression bug fix rates, there still is at least 2 or so of
bug fixing that still needs to be done before 2.6.27 is ready for
release.

We could take this bug fix and nominate it for release in the next
2.6.26-stable series, which distributions could then pick up --- maybe
we should have, but when the patch went in, it was seen was a fix for
largely theoretical problems, and not something that needed
accelerated handling.  This is still something we could do, but
realistically I'm not sure it's going to help the Ubuntu Intrepid
release, since it's pretty late in their schedule.  You might be
better off asking the Ubuntu kernel team if they are willing to cherry
pick the commit in question (ae76dd9a) and include it in their
release; they do have the ability to do that without waiting for an
upstream release, you know.

You also should have been able to work around the problem if you had
booted a rescue CD and checked your filesystem from the rescue CD.
That is the normal way these sorts of problems are fixed, and I'm a
bit surprised you didn't try this first.  It could be that Ubuntu
Rescue CD's could be made better by automating the ability of (upon
request) detecting the root filesystem on Ubuntu systems and then
running e2fsck on the filesystem before trying to mount them.

If you are willing to help out, we can always use more testers to test
the development kernels.  We can't do this alone, you know.  We've
wanted to shorten the release window, but in order to do that we need
more people helping to find and fix bugs during the development cycle.
Remember, this is open source.  If you see a problem you don't like,
you can help fix it.  And in fact, that's the most likely way that it
will get fixed.

Best regards,

						- Ted

^ permalink raw reply	[flat|nested] 3+ messages in thread

* Re: ext3 mount infinite loop over orphan list issue, please release 2.6.27
  2008-09-16 12:53 ` Theodore Tso
@ 2008-09-16 16:42   ` Amit D. Chaudhary
  0 siblings, 0 replies; 3+ messages in thread
From: Amit D. Chaudhary @ 2008-09-16 16:42 UTC (permalink / raw)
  To: Theodore Tso, linux-kernel

Hello Ted,

Thanks for noting that the problem does happen in the real world, though 
rarely or users do not narrow it down. Yes, it makes sense for all 
regressions to be fixed before releasing 2.6.27.

I have put your suggestion about the Rescue CD & patch to kernel to the 
Ubuntu forums & bug. The latter would solve it only for one 
distribution, it is a start. Good point about the Rescue CD, I will get 
one from other distributions next time, if needed. There is no way of 
skipping mount\scan of filesysytems on the disk for the Ubuntu live CD 
that I found.

I am aware of the Open Source implications & off the fact that 
unrecoverable crashes happen to all OSes\Kernel sometime or other.

Regards
Amit

PS: Resent as text

Theodore Tso wrote:
> On Mon, Sep 15, 2008 at 03:09:22PM -0700, Amit Chaudhary wrote:
>   
>> Over the weekend, due to a crash, I ran into the ext3 mount infinite  
>> loop over orphan list issue. This was on Ubuntu 8.04. I tried many  
>> things, including using 18 month old distributions, nothing works. Only  
>> solution is to boot off a alpha version of next Ubunuty which has the  
>> 2.6.27 kernel (rc1 has the fix), more details are below:
>>
>> Can you please release 2.6.27 so that it can make it to stable  
>> distributions.
>>     
>
> As you point out, this problem has been fixed in 2.6.27-rc1.
> Unfortunately, it is a problem which has been around for a very long
> time --- perhaps since ext3 was first written.  No one had noticed the
> problem for a very long time; in fact the first time someone reported
> it as far I know was June 6, 2008, when Sami Liedes found it via a
> synthetic testing program, fsfuzzer, which creates filesystems, then
> corrupts them randomly and then sees whether or not they cause the
> kernel to panic or hang when they are mounted.
>
> You seem to have had the bad luck of running into the problem in a
> real world situation relatively recently, but this has been a
> long-standing problem.
>
> As far as releasing 2.6.27, there still is a fairly large set of
> regressions (i.e., bugs introduced in 2.6.27-rc1 or later) which need
> to be fixed before we can release it; otherwise more users will have
> bad experiences when older kernels that had worked fine for them will
> break for them.  So while it might have helped you to have released
> 2.6.27, it could make things worse for many other users.  So that's
> not a realistic option.  If I had to guess, given currently the
> projected regression bug fix rates, there still is at least 2 or so of
> bug fixing that still needs to be done before 2.6.27 is ready for
> release.
>
> We could take this bug fix and nominate it for release in the next
> 2.6.26-stable series, which distributions could then pick up --- maybe
> we should have, but when the patch went in, it was seen was a fix for
> largely theoretical problems, and not something that needed
> accelerated handling.  This is still something we could do, but
> realistically I'm not sure it's going to help the Ubuntu Intrepid
> release, since it's pretty late in their schedule.  You might be
> better off asking the Ubuntu kernel team if they are willing to cherry
> pick the commit in question (ae76dd9a) and include it in their
> release; they do have the ability to do that without waiting for an
> upstream release, you know.
>
> You also should have been able to work around the problem if you had
> booted a rescue CD and checked your filesystem from the rescue CD.
> That is the normal way these sorts of problems are fixed, and I'm a
> bit surprised you didn't try this first.  It could be that Ubuntu
> Rescue CD's could be made better by automating the ability of (upon
> request) detecting the root filesystem on Ubuntu systems and then
> running e2fsck on the filesystem before trying to mount them.
>
> If you are willing to help out, we can always use more testers to test
> the development kernels.  We can't do this alone, you know.  We've
> wanted to shorten the release window, but in order to do that we need
> more people helping to find and fix bugs during the development cycle.
> Remember, this is open source.  If you see a problem you don't like,
> you can help fix it.  And in fact, that's the most likely way that it
> will get fixed.
>
> Best regards,
>
> 						- Ted
>   

^ permalink raw reply	[flat|nested] 3+ messages in thread

end of thread, other threads:[~2008-09-16 16:48 UTC | newest]

Thread overview: 3+ messages (download: mbox.gz follow: Atom feed
-- links below jump to the message on this page --
2008-09-15 22:09 ext3 mount infinite loop over orphan list issue, please release 2.6.27 Amit Chaudhary
2008-09-16 12:53 ` Theodore Tso
2008-09-16 16:42   ` Amit D. Chaudhary

This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox