public inbox for linux-kernel@vger.kernel.org
 help / color / mirror / Atom feed
From: Oleg Drokin <green@namesys.com>
To: Stephan von Krawczynski <skraw@ithnet.com>
Cc: marcelo@conectiva.com.br, akpm@osdl.org, andrea@suse.de,
	alan@lxorguk.ukuu.org.uk, linux-kernel@vger.kernel.org,
	mason@suse.com
Subject: Re: 2.4.22-pre lockups (now decoded oops for pre10)
Date: Wed, 13 Aug 2003 20:34:52 +0400	[thread overview]
Message-ID: <20030813163452.GC27515@namesys.com> (raw)
In-Reply-To: <20030813180405.3c45465d.skraw@ithnet.com>

Hello!

On Wed, Aug 13, 2003 at 06:04:05PM +0200, Stephan von Krawczynski wrote:
> > > Well, that's exactly the reason why I am awaiting some more days of
> > > up-and-running ext3. After how many days will you be convinced that a
> > > random memory corruption should have hit the ext3 system that bad, that it
> > > should have crashed?
> > Well, I'd prefer that you spend time to figure out at which exact
> > 2.4.21-pre version the crashes in reiserfs started to appear. ;)
> Well, Oleg, I'd love to, but there is an immanent problem with that. If
> I check pre-X and it crashes, everything is fine, because I have a certain
> result of the test. If it does not crash within 3 days, then I have a problem.
> How long do I wait before stating the pre is good? It could take months to test

You seem to be getting corruptions in at least 2 days for now, though.
And reiserfs seems to trigger the problem even faster (and may be
even more faster if you enable CONFIG_REISERFS_CHECK).

> 10 pre's ... That cannot be the way to find out what is going on. 
> On the other hand: 
> - no UP kernel ever crashed. So we can at least talk about an SMP-race.

There is still huge field to look at.

> - 2.4.20 does not crash
> - 2.4.21 does crash

diff is 20M in size.

> If we can add "ext3 does not crash" to the list, then I really hope we can use
> some brain and give good selection of patches between 2.4.20 and 2.4.21 that
> may cause the troubles.

There were not much changes in reiserfs. All those patches can easily be
reverted just for verification purposes. Let me know when you are ready/want
to test this variant and I will send you a diff.

> How many suspects do we have? We can at least begin to create a list of things

Well, suspects are all used drivers, VM, filesystem itself, arch code.

> that went in between .20 and .21, or not?

Lots of changes, 2.4.20->2.4.21 was a long trip.

> If possible I can then patch out all of them and retry. So there is much less
> time spent for testing. 
> I mean, have you looked at the length of this thread already?

Yes, I did.
Now if only we can get someone to reproduce your problems...

> > > I can add another week if you want me to, just tell me. The only thing I
> > > don't want is that any doubts are left after testing ...
> > It would be interesting to look at fsck results on the fs after some time of
> > testing.
> You mean I should do an fsck on sunday?

Yes, whenever you decide you have waited long enough (provided that it won't
crash) and decide to stop testing, please run fsck on that testing fs.

> > Probably it would be easier for you to make it crash (if there are crash
> > possibility at all) if you enable JBD debugging.
> I have never seen this in real life. Is it possible to turn this on when
> handling >100 GB of data or will some debug output flood the box?

It only enables some more checks, not debug output.

Bye,
    Oleg

  reply	other threads:[~2003-08-13 16:34 UTC|newest]

Thread overview: 56+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
     [not found] <20030808002918.723abb08.skraw@ithnet.com>
2003-08-08 14:54 ` 2.4.22-pre lockups (now decoded oops for pre10) Marcelo Tosatti
2003-08-08 15:05   ` Stephan von Krawczynski
2003-08-08 15:33     ` Marcelo Tosatti
2003-08-10 21:35       ` Stephan von Krawczynski
2003-08-10 23:23         ` Neil Brown
2003-08-11  9:33           ` Stephan von Krawczynski
2003-08-18 20:43             ` Mike Fedyk
2003-08-13 10:55       ` Stephan von Krawczynski
2003-08-13 14:53         ` Marcelo Tosatti
2003-08-13 14:59           ` Oleg Drokin
2003-08-13 15:12             ` Stephan von Krawczynski
2003-08-13 15:30               ` Oleg Drokin
2003-08-13 16:04                 ` Stephan von Krawczynski
2003-08-13 16:34                   ` Oleg Drokin [this message]
2003-08-13 22:19                     ` Stephan von Krawczynski
2003-08-14  8:45                       ` Oleg Drokin
2003-08-14 17:26                         ` Marcelo Tosatti
2003-08-14 17:42                           ` Stephan von Krawczynski
2003-08-15  2:08                             ` Chris Mason
2003-08-15  9:40                               ` Stephan von Krawczynski
2003-08-15 10:28                               ` Stephan von Krawczynski
2003-08-15 12:55                                 ` Chris Mason
2003-08-20 14:21                                   ` 2.4.22-pre lockups (yet another oops for rc2) Stephan von Krawczynski
2003-09-05  9:24                                   ` 2.4.22-pre lockups (case closed) Stephan von Krawczynski
2003-09-05 13:37                                     ` Andrea Arcangeli
2003-08-15 10:13                         ` 2.4.22-pre lockups (now decoded oops for pre10) Stephan von Krawczynski
2003-08-15 10:31                           ` Oleg Drokin
2003-08-18 15:06                   ` Andrea Arcangeli
2003-08-18 20:19                     ` Stephan von Krawczynski
2003-08-18 20:58                       ` Mike Fedyk
2003-08-18 22:31                       ` Andrea Arcangeli
2003-08-19  1:12                         ` Mike Fedyk
2003-08-19  7:12                           ` Stephan von Krawczynski
2003-08-19 13:10                             ` Alan Cox
2003-08-19 14:18                               ` Stephan von Krawczynski
2003-08-19 18:00                                 ` Mike Fedyk
2003-08-19 21:58                                   ` Stephan von Krawczynski
2003-08-19 13:27                             ` Andrea Arcangeli
2003-08-13 15:21           ` Jim Gifford
2003-08-13 17:08             ` Marcelo Tosatti
2003-08-10 14:23     ` Keith Owens
2003-08-02 12:27 2.4.22-pre lockups (decoded oops for pre8) Stephan von Krawczynski
2003-08-05 16:40 ` Marcelo Tosatti
2003-08-06  7:41   ` 2.4.22-pre lockups (now decoded oops for pre10) Stephan von Krawczynski
2003-08-06  8:58     ` Oleg Drokin
2003-08-06  9:09     ` Willy Tarreau
2003-08-06  9:36       ` Stephan von Krawczynski
2003-08-06 12:45         ` Willy Tarreau
2003-08-18 14:23       ` Andrea Arcangeli
2003-08-06 18:15     ` Marcelo Tosatti
2003-08-07  2:14       ` Stephan von Krawczynski
2003-08-07  5:35         ` Oleg Drokin
2003-08-07 12:45         ` Marcelo Tosatti
     [not found]           ` <3F325198.2010301@namesys.com>
2003-08-07 13:32             ` Stephan von Krawczynski
2003-08-18 20:29               ` Mike Fedyk
2003-08-18 20:39                 ` Stephan von Krawczynski
2003-08-18 21:09                   ` Mike Fedyk
2003-08-07 15:52           ` Stephan von Krawczynski

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=20030813163452.GC27515@namesys.com \
    --to=green@namesys.com \
    --cc=akpm@osdl.org \
    --cc=alan@lxorguk.ukuu.org.uk \
    --cc=andrea@suse.de \
    --cc=linux-kernel@vger.kernel.org \
    --cc=marcelo@conectiva.com.br \
    --cc=mason@suse.com \
    --cc=skraw@ithnet.com \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox