public inbox for linux-kernel@vger.kernel.org
 help / color / mirror / Atom feed
* BUG: Assertion failure with ext3-0.95 for 2.4.7
@ 2001-08-08  8:46 Christian Borntraeger
  2001-08-08  9:26 ` Arjan van de Ven
                   ` (2 more replies)
  0 siblings, 3 replies; 13+ messages in thread
From: Christian Borntraeger @ 2001-08-08  8:46 UTC (permalink / raw)
  To: ext3-users, linux-kernel; +Cc: Carsten Otte


Hello ext3-users,


I tested ext3 on a Linux for S/390 with several stress and benchmark test
tests and faced a kernel bug message.
The console showed the following output:

Message from syslogd@boeaet34 at Fri Aug  3 11:34:16 2001 ...
boeaet34 kernel: Assertion failure in journal_forget() at
transaction.c:1184: "!
jh->b_committed_data"

I tried the Patch from http://www.zip.com.au/~akpm/ext3-2.4-0.9.5-247.gz
with the kernel 2.4.7 with a new LVM- patch(0.9.1)  and some S/390 specific
patches. I use mke2fs version 1.22.
S/390 is a 32bit big endian machine. After compiling and running the kernel
I created an ext3-file system on an 70GB LVM. When running the postmark
test I get (reproduceable) the message from above. dmesg shows:

kernel BUG at transaction.c:1184!
illegal operation: 0001
CPU:    1
Process bench (pid: 2453, stackpage=08CEF000)

Kernel PSW:    07080000 8007f458         =journal_forget
task: 08cee000 ksp: 08cefaa8 pt_regs: 08cefa10
Kernel GPRS:
00000000  8001c118  00000022  00000001
8007f456  00c27000  00194f9a  00000001
030d2c80  074ed294  00001899  092ca350
0001f94c  8007f2c8  8007f456  08cefaa8
Kernel ACRS:
00000000  00000000  00000000  00000000
00000001  00000000  00000000  00000000
00000000  00000000  00000000  00000000
00000000  00000000  00000000  00000000
Kernel BackChain  CallChain
       08cefaa8   [<0007f456>]                =journal_forget
       08cefb10   [<000744c6>]                =ext3_forget
       08cefb70   [<000767b4>]                =ext3_clear_blocks
       08cefbd8   [<000768d4>]                =ext3_free_data
       08cefc50   [<00076c38>]                =ext3_truncate
       08cefd08   [<00074732>]                =ext3_delete_inode
       08cefd68   [<0006659a>]                =iput
       08cefdc8   [<00063dfc>]                =d_delete

I resolved the functions using the system.map file.

Has anyone saw this message before? Any ideas, clues, hints?

Please CC me , because I am not on the list.


--
Mit freundlichen Grüßen / Best Regards

Christian Bornträger
IBM Deutschland Entwicklung GmbH
eServer SW  System Evaluation + Test
email: CBORNTRA@de.ibm.com
Tel +49 7031-16-3507



^ permalink raw reply	[flat|nested] 13+ messages in thread
* Re: BUG: Assertion failure with ext3-0.95 for 2.4.7
@ 2001-08-08  9:36 Christian Borntraeger
  0 siblings, 0 replies; 13+ messages in thread
From: Christian Borntraeger @ 2001-08-08  9:36 UTC (permalink / raw)
  To: arjanv, linux-kernel



>> I tried the Patch from http://www.zip.com.au/~akpm/ext3-2.4-0.9.5-247.gz
>> with the kernel 2.4.7 with a new LVM- patch(0.9.1)  and some S/390
specific
>> patches. I use mke2fs version 1.22.
>> S/390 is a 32bit big endian machine. After compiling and running the
kernel
>> I created an ext3-file system on an 70GB LVM. When running the postmark
>> test I get (reproduceable) the message from above. dmesg shows:
>
>It would be interesting to know if this still happends without a beta
>version of LVM,
>and without LVM at all.

I will try it. But if I mount the same file system as ext2 (mount ... -t
ext2) the test succeeds, so I guess it is not LVM specific. I will inform
you if I know the result.

greetings



^ permalink raw reply	[flat|nested] 13+ messages in thread
* Re: BUG: Assertion failure with ext3-0.95 for 2.4.7
@ 2001-08-08 14:38 Christian Borntraeger
  2001-08-08 15:27 ` Tom Rini
  0 siblings, 1 reply; 13+ messages in thread
From: Christian Borntraeger @ 2001-08-08 14:38 UTC (permalink / raw)
  To: Arjan van de Ven, linux-kernel



>> >It would be interesting to know if this still happends without a beta
>> >version of LVM,
>> >and without LVM at all.
>>
>> I will try it. But if I mount the same file system as ext2 (mount ... -t
>> ext2) the test succeeds, so I guess it is not LVM specific. I will
inform
>> you if I know the result.
>
>Well ext3 has more debugging checks than ext3 at the moment, and also
requires the
>underlying blocklayers (LVM/RAID etc) to not lie. So the test _IS_
relevant,
>not to caste blame, but to find the interaction.....

OK.I tested it with a single 2GB disk without LVM and there was no error.

I also tested it with a 70GB LVM and /proc/sys/fs/jbd-debug set to 5.There
was also no error. After reset to 0 the error reoccured (???)
Next, I will try,using md instead of LVM to have a disk with a similar
size.

greetings

Christan Bornträger




^ permalink raw reply	[flat|nested] 13+ messages in thread
* Re: BUG: Assertion failure with ext3-0.95 for 2.4.7
@ 2001-08-09  9:26 Christian Borntraeger
  0 siblings, 0 replies; 13+ messages in thread
From: Christian Borntraeger @ 2001-08-09  9:26 UTC (permalink / raw)
  To: Arjan van de Ven, trini, linux-kernel, ext3-users; +Cc: Carsten Otte



>Well ext3 has more debugging checks than ext3 at the moment, and also
requires the
>underlying blocklayers (LVM/RAID etc) to not lie. So the test _IS_
relevant,
>not to caste blame, but to find the interaction.....


OK, I tried it now  with md  and the bug was there again. Now during a rm
-rf * .
As the bug ouccured with LVM __and__  with MD it looks like a ext3-problem.
Tom Rini reported the same problem on a PPC-Box. Possibly it is a big
endian problem.

The message was again:
kernel: Assertion failure in journal_forget() at transaction.c:1184:
"!jh->b_committed_data"


Here a have a new backtrace, I resolved the functions manually from
system.map -hopefully without a mistake.

kernel BUG at transaction.c:1184!
illegal operation: 0001
CPU:    1
Process rm (pid: 1917, stackpage=06125000)

Kernel PSW:    070c0000 8008023c   =journal_forget
task: 06124000 ksp: 06125938 pt_regs: 061258a0
Kernel GPRS:
00000000  8001c118  00000022  00000001
8008023a  00c2a000  00197198  00000001
12fc1880  08d06494  00001899  1575c9b0
0001f94c  800800ac  8008023a  06125938
Kernel ACRS:
00000000  00000000  00000000  00000000
00000001  00000000  00000000  00000000
00000000  00000000  00000000  00000000
00000000  00000000  00000000  00000000
Kernel BackChain  CallChain
       06125938   [<0008023a>]           =journal_forget
       061259a0   [<000747e6>]           =ext3_forget
       06125a08   [<00076c00>]           =ext3_clear_blocks
       06125a70   [<00076d20>]           =ext3_free_data
       06125ae8   [<00076ee4>]           =ext3_free_branches
       06125b60   [<00076e44>]           =ext3_free_branches
       06125bd8   [<00076e44>]           =ext3_free_branches
       06125c50   [<0007723c>]           =ext3_truncate


greetings




^ permalink raw reply	[flat|nested] 13+ messages in thread
* Re: BUG: Assertion failure with ext3-0.95 for 2.4.7
@ 2001-08-13  6:53 Christian Borntraeger
  0 siblings, 0 replies; 13+ messages in thread
From: Christian Borntraeger @ 2001-08-13  6:53 UTC (permalink / raw)
  To: Andrew Morton; +Cc: Tom Rini, ext3-users, linux-kernel, Carsten Otte



> If it's possible, could you please also test journalled data mode?
It will take a while, but it is already planned.

> It'd be interesting to sanity test recovery as well, but doing
> thorough testing of recovery is hard.  That's why the ext3 patch
> places interesting debug/devel code way down inside the IDE device
> driver...

S/390 has no IDE. That might be the reason why I faced depmod-problems if I
include ext3 and jbd-debug as module.
Nevertheless debugging seems to work if I compile ext3 in the kernel.

greetings




^ permalink raw reply	[flat|nested] 13+ messages in thread
* Re: BUG: Assertion failure with ext3-0.95 for 2.4.7
@ 2001-08-14  8:34 Christian Borntraeger
  0 siblings, 0 replies; 13+ messages in thread
From: Christian Borntraeger @ 2001-08-14  8:34 UTC (permalink / raw)
  To: Andrew Morton; +Cc: ext3-users, linux-kernel, Carsten Otte, Tom Rini



> If it's possible, could you please also test journalled data mode?

I finished some tests with journalled data mode and with ordered mode. Both
modes seems to run correct, in the meaning of stability and correct log
replay, after a forced restart on S390 architecture, but further testing is
planned.

Today I started testing ext3 on an IBM zSeries  (s390x in the kernel tree).
It might be interesting for you, that s390x is a 64bit big endian machine.
I will post the results if I face any problems.

--
Mit freundlichen Grüßen / Best Regards

Christian Bornträger
IBM Deutschland Entwicklung GmbH
eServer SW  System Evaluation + Test
email: CBORNTRA@de.ibm.com
Tel +49 7031-16-3507






^ permalink raw reply	[flat|nested] 13+ messages in thread

end of thread, other threads:[~2001-08-14  8:35 UTC | newest]

Thread overview: 13+ messages (download: mbox.gz follow: Atom feed
-- links below jump to the message on this page --
2001-08-08  8:46 BUG: Assertion failure with ext3-0.95 for 2.4.7 Christian Borntraeger
2001-08-08  9:26 ` Arjan van de Ven
2001-08-08 15:16 ` Tom Rini
2001-08-09 18:58 ` Andrew Morton
2001-08-10 17:44   ` Tom Rini
2001-08-10 18:09     ` Andrew Morton
2001-08-10 18:10       ` Tom Rini
  -- strict thread matches above, loose matches on Subject: below --
2001-08-08  9:36 Christian Borntraeger
2001-08-08 14:38 Christian Borntraeger
2001-08-08 15:27 ` Tom Rini
2001-08-09  9:26 Christian Borntraeger
2001-08-13  6:53 Christian Borntraeger
2001-08-14  8:34 Christian Borntraeger

This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox