All of lore.kernel.org
 help / color / mirror / Atom feed
* Problem with Kernel 2.4.19, reiser on sw-raid5 and rsync
@ 2003-03-05 12:23 Oliver Heering
  2003-03-05 14:20 ` Oleg Drokin
  0 siblings, 1 reply; 4+ messages in thread
From: Oliver Heering @ 2003-03-05 12:23 UTC (permalink / raw)
  To: reiserfs-list

Hi,

we've got a big problem with one of our servers using Kernel 2.4.19
(SuSE 8.1 standard kernel), rsync and reiserfs on a software-raid5
array consisting of 6x75GB (= 375GB usable capacity) harddrives.

The installation of SuSE works flawlessly, but when we try to copy
several thousand files from another server to this one (via rsync), we
get the following kernel-crash:

--------------- snip ---------------
kernel: kernel BUG at journal.c:592!
kernel: invalid operand: 0000 2.4.19-4GB #1 Fri Sep 13 13:19:15 UTC 2002
kernel: CPU:    0
kernel: EIP:    0010:[joydev:__insmod_joydev_O/lib/modules/2.4.19-4GB/kernel/drivers/inp+-32654158/96]    Not tainted
kernel: EIP:    0010:[<c1ddfcb2>]    Not tainted
kernel: EFLAGS: 00010282
kernel: eax: e2931fc8   ebx: 00000900   ecx: e28cb000   edx: 00001964
kernel: esi: 00000001   edi: 012c8000   ebp: 00000000   esp: d1459c20
kernel: ds: 0018   es: 0018   ss: 0018
kernel: Process rsync (pid: 1098, stackpage=d1459000)
kernel: Stack: 00000000 e28cb000 00001000 00008000 00000259 09008ccc 00001000 00000000
kernel:        00000000 00000000 c1dc03a5 c1b66800 00000900 012c8000 00001000 00000001
kernel:        d1459c78 00000000 00001000 00008000 c2de89c0 00000259 00000000 c1b66800
kernel: Call Trace:    [joydev:__insmod_joydev_O/lib/modules/2.4.19-4GB/kernel/drivers/inp+-32783451/96] [joydev:__insmod_joydev_O/lib/modules/2.4.19-4GB/kernel/drivers/inp+-32783121/96] [joydev:__insmod_joydev_O/lib/modules/2.4.19-4GB/kernel/drivers/inp+-32782641/96] [joydev:__insmod_joydev_O/lib/modules/2.4.19-4GB/kernel/drivers/inp+-32781493/96] [joydev:__insmod_joydev_O/lib/modules/2.4.19-4GB/kernel/drivers/inp+-32750223/96]
kernel: Call Trace:    [<c1dc03a5>] [<c1dc04ef>] [<c1dc06cf>] [<c1dc0b4b>] [<c1dc8571>]
kernel:   [joydev:__insmod_joydev_O/lib/modules/2.4.19-4GB/kernel/drivers/inp+-32745617/96] [joydev:__insmod_joydev_O/lib/modules/2.4.19-4GB/kernel/drivers/inp+-32678239/96] [joydev:__insmod_joydev_O/lib/modules/2.4.19-4GB/kernel/drivers/inp+-32744544/96] [__block_prepare_write+399/800] [block_prepare_write+57/144] [joydev:__insmod_joydev_O/lib/modules/2.4.19-4GB/kernel/drivers/inp+-32751312/96]
kernel:   [<c1dc976f>] [<c1dd9ea1>] [<c1dc9ba0>] [<c01480df>] [<c01489b9>] [<c1dc8130>]
kernel:   [joydev:__insmod_joydev_O/lib/modules/2.4.19-4GB/kernel/drivers/inp+-32736026/96] [joydev:__insmod_joydev_O/lib/modules/2.4.19-4GB/kernel/drivers/inp+-32751312/96] [do_generic_file_write+963/1936] [generic_file_write+154/208] [joydev:__insmod_joydev_O/lib/modules/2.4.19-4GB/kernel/drivers/inp+-32731463/96] [sys_write+163/304]
kernel:   [<c1dcbce6>] [<c1dc8130>] [<c01366b3>] [<c0136b1a>] [<c1dcceb9>] [<c0145523>]
kernel:   [system_call+51/56]
kernel:   [<c010907f>]
kernel: Modules: [(reiserfs:<c1dc0060>:<c1def390>)]
kernel: Code: 0f 0b 50 02 64 d2 de c1 b8 01 00 00 00 eb 9f 31 c0 eb 9b 0f
kernel: klogd 1.4.1, ---------- state change ----------
kernel: Inspecting /boot/System.map-2.4.19-4GB
kernel: Loaded 14220 symbols from /boot/System.map-2.4.19-4GB.
kernel: Symbols match kernel version 2.4.19.
kernel: Loaded 483 symbols from 14 modules.
--------------- snip ---------------

What could have gone wrong? We tried several newinstalls and changing
of kernel-version to 2.4.20 and nothing helped.

We're virtually out of ideas right now but if you take a look at...
http://groups.google.com/groups?hl=en&lr=&ie=UTF-8&oe=utf-8&threadm=avuid5%242kgl%241%40FreeBSD.csie.NCTU.edu.tw&rnum=1&prev=/groups%3Fq%3Dsoftware%2Braid%2Breiserfs%2Bkernel%2Bpanic%26hl%3Den%26lr%3D%26ie%3DUTF-8%26oe%3Dutf-8%26selm%3Davuid5%25242kgl%25241%2540FreeBSD.csie.NCTU.edu.tw%26rnum%3D1
...there must be some other people who have the same problem.

The last suspect I have is, that 2 of the 6 harddrivers are different
models but with around the same (a bit more) capacity. Could this be
the cause, too?

Or are there any known bugs when using rsync in combination with
reiserfs and software-raid5 ?

Thanks for answers.. :)

Oliver Heering
Medienzentrum - Universität Dortmund
http://www.medienzentrum.uni-dortmund.de



^ permalink raw reply	[flat|nested] 4+ messages in thread

* Re: Problem with Kernel 2.4.19, reiser on sw-raid5 and rsync
  2003-03-05 12:23 Problem with Kernel 2.4.19, reiser on sw-raid5 and rsync Oliver Heering
@ 2003-03-05 14:20 ` Oleg Drokin
  2003-03-05 14:43   ` Chris Mason
  0 siblings, 1 reply; 4+ messages in thread
From: Oleg Drokin @ 2003-03-05 14:20 UTC (permalink / raw)
  To: Oliver Heering; +Cc: reiserfs-list

Hello!

On Wed, Mar 05, 2003 at 01:23:56PM +0100, Oliver Heering wrote:

> we've got a big problem with one of our servers using Kernel 2.4.19
> (SuSE 8.1 standard kernel), rsync and reiserfs on a software-raid5
> array consisting of 6x75GB (= 375GB usable capacity) harddrives.

I suppose this is 2.4.19 shipped with SuSE? Is this the last version available from SuSE?
This is SuSE 8.1, I presume?
(and rpm -qa | grep k_i386 output please)

> --------------- snip ---------------
> kernel: kernel BUG at journal.c:592!
> kernel: invalid operand: 0000 2.4.19-4GB #1 Fri Sep 13 13:19:15 UTC 2002
> kernel: CPU:    0
> kernel: EIP:    0010:[joydev:__insmod_joydev_O/lib/modules/2.4.19-4GB/kernel/drivers/inp+-32654158/96]    Not tainted
> kernel: EIP:    0010:[<c1ddfcb2>]    Not tainted
> kernel: EFLAGS: 00010282
> kernel: eax: e2931fc8   ebx: 00000900   ecx: e28cb000   edx: 00001964
> kernel: esi: 00000001   edi: 012c8000   ebp: 00000000   esp: d1459c20
> kernel: ds: 0018   es: 0018   ss: 0018
> kernel: Process rsync (pid: 1098, stackpage=d1459000)
> kernel: Stack: 00000000 e28cb000 00001000 00008000 00000259 09008ccc 00001000 00000000
> kernel:        00000000 00000000 c1dc03a5 c1b66800 00000900 012c8000 00001000 00000001
> kernel:        d1459c78 00000000 00001000 00008000 c2de89c0 00000259 00000000 c1b66800
> kernel: Call Trace:    [joydev:__insmod_joydev_O/lib/modules/2.4.19-4GB/kernel/drivers/inp+-32783451/96] [joydev:__insmod_joydev_O/lib/modules/2.4.19-4GB/kernel/drivers/inp+-32783121/96] [joydev:__insmod_joydev_O/lib/modules/2.4.19-4GB/kernel/drivers/inp+-32782641/96] [joydev:__insmod_joydev_O/lib/modules/2.4.19-4GB/kernel/drivers/inp+-32781493/96] [joydev:__insmod_joydev_O/lib/modules/2.4.19-4GB/kernel/drivers/inp+-32750223/96]
> kernel: Call Trace:    [<c1dc03a5>] [<c1dc04ef>] [<c1dc06cf>] [<c1dc0b4b>] [<c1dc8571>]
> kernel:   [joydev:__insmod_joydev_O/lib/modules/2.4.19-4GB/kernel/drivers/inp+-32745617/96] [joydev:__insmod_joydev_O/lib/modules/2.4.19-4GB/kernel/drivers/inp+-32678239/96] [joydev:__insmod_joydev_O/lib/modules/2.4.19-4GB/kernel/drivers/inp+-32744544/96] [__block_prepare_write+399/800] [block_prepare_write+57/144] [joydev:__insmod_joydev_O/lib/modules/2.4.19-4GB/kernel/drivers/inp+-32751312/96]
> kernel:   [<c1dc976f>] [<c1dd9ea1>] [<c1dc9ba0>] [<c01480df>] [<c01489b9>] [<c1dc8130>]
> kernel:   [joydev:__insmod_joydev_O/lib/modules/2.4.19-4GB/kernel/drivers/inp+-32736026/96] [joydev:__insmod_joydev_O/lib/modules/2.4.19-4GB/kernel/drivers/inp+-32751312/96] [do_generic_file_write+963/1936] [generic_file_write+154/208] [joydev:__insmod_joydev_O/lib/modules/2.4.19-4GB/kernel/drivers/inp+-32731463/96] [sys_write+163/304]
> kernel:   [<c1dcbce6>] [<c1dc8130>] [<c01366b3>] [<c0136b1a>] [<c1dcceb9>] [<c0145523>]
> kernel:   [system_call+51/56]
> kernel:   [<c010907f>]
> kernel: Modules: [(reiserfs:<c1dc0060>:<c1def390>)]

The decoded output is just a mess.
You seem to need to restart klogd after reiserfs module was loaded, so that it will
find new module and it's symtab. (or even better: run dmesg | ksymoops immediately
after oops have happened). That info would be very useful.

Also what will pop up if you'd run reiserfsck on that device? Are there any weird errors?

> We're virtually out of ideas right now but if you take a look at...
> http://groups.google.com/groups?hl=en&lr=&ie=UTF-8&oe=utf-8&threadm=avuid5%242kgl%241%40FreeBSD.csie.NCTU.edu.tw&rnum=1&prev=/groups%3Fq%3Dsoftware%2Braid%2Breiserfs%2Bkernel%2Bpanic%26hl%3Den%26lr%3D%26ie%3DUTF-8%26oe%3Dutf-8%26selm%3Davuid5%25242kgl%25241%2540FreeBSD.csie.NCTU.edu.tw%26rnum%3D1
> ...there must be some other people who have the same problem.

Hm. This man never came to us, though.

> The last suspect I have is, that 2 of the 6 harddrivers are different
> models but with around the same (a bit more) capacity. Could this be
> the cause, too?

Not likely.

> Or are there any known bugs when using rsync in combination with
> reiserfs and software-raid5 ?

Not that I have heard of. I suppose you do not have power failures from time to time?

Bye,
    Oleg

^ permalink raw reply	[flat|nested] 4+ messages in thread

* Re: Problem with Kernel 2.4.19, reiser on sw-raid5 and rsync
  2003-03-05 14:20 ` Oleg Drokin
@ 2003-03-05 14:43   ` Chris Mason
  2003-03-05 14:51     ` Oleg Drokin
  0 siblings, 1 reply; 4+ messages in thread
From: Chris Mason @ 2003-03-05 14:43 UTC (permalink / raw)
  To: Oleg Drokin; +Cc: Oliver Heering, reiserfs-list

On Wed, 2003-03-05 at 09:20, Oleg Drokin wrote:
> Hello!
> 
> On Wed, Mar 05, 2003 at 01:23:56PM +0100, Oliver Heering wrote:
> 
> > we've got a big problem with one of our servers using Kernel 2.4.19
> > (SuSE 8.1 standard kernel), rsync and reiserfs on a software-raid5
> > array consisting of 6x75GB (= 375GB usable capacity) harddrives.
> 
> I suppose this is 2.4.19 shipped with SuSE? Is this the last version available from SuSE?
> This is SuSE 8.1, I presume?
> (and rpm -qa | grep k_i386 output please)

> > kernel: Modules: [(reiserfs:<c1dc0060>:<c1def390>)]
> 
> The decoded output is just a mess.
> You seem to need to restart klogd after reiserfs module was loaded, so that it will
> find new module and it's symtab. (or even better: run dmesg | ksymoops immediately
> after oops have happened). That info would be very useful.
> 

See that kernel: Modules line above?  That tells you were the reiserfs
module started and ended at the time of the oops.  We can use it to
decode the thing if Oliver sends us the output from 
nm -n /lib/modules/2.4.19-4GB/kernel/fs/reiserfs.o

Oliver, you might also want to look in the update directory on
ftp.suse.com for an updated kernel, there have been fixes since
September.  

On some machines, ACPI causes bugs and crashes in strange places, so you
could also trying booting with acpi=off or acpi=oldboot

-chris



^ permalink raw reply	[flat|nested] 4+ messages in thread

* Re: Problem with Kernel 2.4.19, reiser on sw-raid5 and rsync
  2003-03-05 14:43   ` Chris Mason
@ 2003-03-05 14:51     ` Oleg Drokin
  0 siblings, 0 replies; 4+ messages in thread
From: Oleg Drokin @ 2003-03-05 14:51 UTC (permalink / raw)
  To: Chris Mason; +Cc: Oliver Heering, reiserfs-list

Hello!

On Wed, Mar 05, 2003 at 09:43:20AM -0500, Chris Mason wrote:
> > I suppose this is 2.4.19 shipped with SuSE? Is this the last version available from SuSE?
> > This is SuSE 8.1, I presume?
> > (and rpm -qa | grep k_i386 output please)
> > > kernel: Modules: [(reiserfs:<c1dc0060>:<c1def390>)]
> > The decoded output is just a mess.
> > You seem to need to restart klogd after reiserfs module was loaded, so that it will
> > find new module and it's symtab. (or even better: run dmesg | ksymoops immediately
> > after oops have happened). That info would be very useful.
> See that kernel: Modules line above?  That tells you were the reiserfs
> module started and ended at the time of the oops.  We can use it to
> decode the thing if Oliver sends us the output from 
> nm -n /lib/modules/2.4.19-4GB/kernel/fs/reiserfs.o

Sure, but I prefer that stuff which can be automated was done automatically. ;)

Bye,
    Oleg

^ permalink raw reply	[flat|nested] 4+ messages in thread

end of thread, other threads:[~2003-03-05 14:51 UTC | newest]

Thread overview: 4+ messages (download: mbox.gz follow: Atom feed
-- links below jump to the message on this page --
2003-03-05 12:23 Problem with Kernel 2.4.19, reiser on sw-raid5 and rsync Oliver Heering
2003-03-05 14:20 ` Oleg Drokin
2003-03-05 14:43   ` Chris Mason
2003-03-05 14:51     ` Oleg Drokin

This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.