netdev.vger.kernel.org archive mirror
 help / color / mirror / Atom feed
* Re: oops with 2.6.23.1, marvel, software raid, reiserfs and samba
       [not found]       ` <20071208035018.JENM21128.mta13.adelphia.net@dual-xeon.jeffunit.com>
@ 2007-12-16 11:05         ` Andrew Morton
  2007-12-16 11:56           ` Herbert Xu
  2007-12-16 14:55           ` jeffunit
  0 siblings, 2 replies; 5+ messages in thread
From: Andrew Morton @ 2007-12-16 11:05 UTC (permalink / raw)
  To: jeffunit; +Cc: linux-kernel, netdev

On Fri, 07 Dec 2007 19:49:52 -0800 jeffunit <jeff@jeffunit.com> wrote:

> I am running linux kernel 2.6.23.1, which I compiled.
> The base system was mandriva 2008.
> 
> I have a dual processor pentium III 933 system.
> It has 3gb of ram, an intel stl-2 motherboard.
> It also has a promise 100 tx2 pata controller,
> a supermicro marvell based 8 port pcix sata controller,
> and a nvidia pci based video card.
> 
> I have the os on a pata drive, and have made a software raid array
> consisting of 4 sata drives attached to the pcix sata controller.
> I created the array, and formatted with reiserfs 3.6
> I have run bonnie++ (filesystem benchmark) on the array without incident.
> When I use samba-3.0.25b-4.3 and copy files from a windows machine to 
> the fileserver,
> every so often, the fileserver crashes or hangs. It seems to happen
> more often under heavy samba traffic.
> Enclosed is the oops from syslog.
> I also have a 'kernel bug' from syslog if that would be helpful.
> 
> jeff
> 
> 
> Dec  7 17:20:52 sata_fileserver kernel: BUG: unable to handle kernel 
> NULL pointer dereference at virtual address 0000000d
> Dec  7 17:20:52 sata_fileserver kernel:  printing eip:
> Dec  7 17:20:52 sata_fileserver kernel: c02cc820
> Dec  7 17:20:52 sata_fileserver kernel: *pde = 00000000
> Dec  7 17:20:52 sata_fileserver kernel: Oops: 0000 [#1]
> Dec  7 17:20:52 sata_fileserver kernel: SMP
> Dec  7 17:20:52 sata_fileserver kernel: Modules linked in: raid456 
> async_xor async_memcpy async_tx xor iptable_raw xt_comment xt_policy 
> xt_multiport ipt_ULOG ipt_TTL ipt_ttl ipt_TOS ipt_tos ipt_SAME 
> ipt_REJECT ipt_REDIRECT ipt_recent ipt_owner ipt_NETMAP 
> ipt_MASQUERADE ipt_LOG ipt_iprange ipt_ECN ipt_ecn ipt_CLUSTERIP 
> ipt_ah ipt_addrtype nf_nat_tftp nf_nat_snmp_basic nf_nat_sip 
> nf_nat_pptp nf_nat_proto_gre nf_nat_irc nf_nat_h323 nf_nat_ftp 
> nf_nat_amanda ts_kmp nf_conntrack_amanda nf_conntrack_tftp 
> nf_conntrack_sip nf_conntrack_proto_sctp nf_conntrack_pptp 
> nf_conntrack_proto_gre nf_conntrack_netlink nf_conntrack_netbios_ns 
> nf_conntrack_irc nf_conntrack_h323 nf_conntrack_ftp xt_tcpmss 
> xt_pkttype xt_physdev xt_NFQUEUE xt_NFLOG xt_MARK xt_mark xt_mac 
> xt_limit xt_length xt_helper xt_hashlimit ip6_tables xt_dccp 
> xt_conntrack xt_CONNMARK xt_connmark xt_CLASSIFY nfsd xt_tcpudp 
> exportfs auth_rpcgss xt_state iptable_nat nf_nat nf_conntrack_ipv4 
> nf_conntrack nfs iptable_mangle lockd nfs_acl sunrpc nfnetlink 
> iptable_filter ip_table
> Dec  7 17:20:52 sata_fileserver kernel:  x_tables af_packet ipv6 
> snd_seq_dummy snd_seq_oss snd_seq_midi_event snd_seq snd_pcm_oss 
> snd_mixer_oss ipmi_si ipmi_msghandler binfmt_misc loop nls_utf8 ntfs 
> dm_mod usb_storage sg sd_mod sata_mv libata scsi_mod video output 
> thermal sbs processor fan container button dock battery ac floppy 
> snd_emu10k1 snd_rawmidi snd_ac97_codec ac97_bus snd_pcm 
> snd_seq_device snd_timer snd_page_alloc snd_util_mem snd_hwdep 
> ehci_hcd snd ohci_hcd i2c_piix4 uhci_hcd soundcore e1000 sworks_agp 
> i2c_core ide_cd usbcore agpgart emu10k1_gp gameport tsdev evdev 
> reiserfs ide_disk serverworks pdc202xx_new ide_core
> Dec  7 17:20:52 sata_fileserver kernel: CPU:    1
> Dec  7 17:20:52 sata_fileserver kernel: 
> EIP:    0060:[<c02cc820>]    Not tainted VLI
> Dec  7 17:20:52 sata_fileserver kernel: EFLAGS: 00210202   (2.6.23.1 #1)
> Dec  7 17:20:52 sata_fileserver kernel: EIP is at tcp_recvmsg+0x150/0xbf0
> Dec  7 17:20:52 sata_fileserver kernel: eax: 00000000   ebx: 
> f55c4b60   ecx: 784e2c7c   edx: f63f63d8
> Dec  7 17:20:52 sata_fileserver kernel: esi: 784e2c7a   edi: 
> f63f614c   ebp: e21fde24   esp: e21fddc4
> Dec  7 17:20:52 sata_fileserver kernel: ds: 007b   es: 007b   fs: 
> 00d8  gs: 0033  ss: 0068
> Dec  7 17:20:52 sata_fileserver kernel: Process smbd (pid: 9524, 
> ti=e21fc000 task=f5109000 task.ti=e21fc000)
> Dec  7 17:20:52 sata_fileserver kernel: Stack: 00000000 ffffffff 
> 00000000 c13e5740 f557b000 c03fa300 00000000 e21fde90
> Dec  7 17:20:52 sata_fileserver kernel:        f63f60e0 00000000 
> 00000b64 f63f63d8 000005b4 00000001 00000000 00000000
> Dec  7 17:20:52 sata_fileserver kernel:        00000000 000005b4 
> e21fde4c 7fffffff e21fde28 00000000 c03a4de0 e21fde90
> Dec  7 17:20:52 sata_fileserver kernel: Call Trace:
> Dec  7 17:20:53 sata_fileserver kernel:  [<c010542a>] 
> show_trace_log_lvl+0x1a/0x30
> Dec  7 17:20:53 sata_fileserver kernel:  [<c01054eb>] 
> show_stack_log_lvl+0xab/0xd0
> Dec  7 17:20:53 sata_fileserver kernel:  [<c01056e1>] 
> show_registers+0x1d1/0x2d0
> Dec  7 17:20:53 sata_fileserver kernel:  [<c01058f6>] die+0x116/0x250
> Dec  7 17:20:53 sata_fileserver kernel:  [<c011f52b>] do_page_fault+0x28b/0x6a0
> Dec  7 17:20:53 sata_fileserver kernel:  [<c030938a>] error_code+0x72/0x78
> Dec  7 17:20:53 sata_fileserver kernel:  [<c0295423>] 
> sock_common_recvmsg+0x43/0x60
> Dec  7 17:20:53 sata_fileserver kernel:  [<c029301c>] sock_aio_read+0x11c/0x130
> Dec  7 17:20:53 sata_fileserver kernel:  [<c017db30>] do_sync_read+0xd0/0x110
> Dec  7 17:20:53 sata_fileserver kernel:  [<c017e47d>] vfs_read+0x12d/0x140
> Dec  7 17:20:53 sata_fileserver kernel:  [<c017e8bd>] sys_read+0x3d/0x70
> Dec  7 17:20:53 sata_fileserver kernel:  [<c01042fe>] 
> sysenter_past_esp+0x6b/0xa1
> Dec  7 17:20:53 sata_fileserver kernel:  =======================
> Dec  7 17:20:53 sata_fileserver kernel: Code: 6c 39 df 74 59 8d b6 00 
> 00 00 00 85 db 74 4f 8b 55 cc 8d 43 20 8b 0a 3b 48 18 0f 88 f4 05 00 
> 00 89 ce 2b 70 18 8b 83 90 00 00 00 <0f> b6 50 0d 89 d0 83 e0 02 3c 
> 01 8b 43 50 83 d6 ff 39 c6 0f 82
> Dec  7 17:20:53 sata_fileserver kernel: EIP: [<c02cc820>] 
> tcp_recvmsg+0x150/0xbf0 SS:ESP 0068:e21fddc4
> Dec  7 17:21:11 sata_fileserver kernel: 
> Shorewall:net2all:DROP:IN=eth0 OUT= 
> MAC=00:04:23:a8:12:cf:00:11:2f:42:d4:32:08:00 SRC=192.168.47.120 
> DST=192.168.47.101 LEN=60 TOS=0x00 PREC=0x00 TTL=32 ID=9964 
> PROTO=ICMP TYPE=8 CODE=0 ID=512 SEQ=24064
> Dec  7 17:21:13 sata_fileserver kernel: 
> Shorewall:net2all:DROP:IN=eth0 OUT= 
> MAC=00:04:23:a8:12:cf:00:11:2f:42:d4:32:08:00 SRC=192.168.47.120 
> DST=192.168.47.101 LEN=60 TOS=0x00 PREC=0x00 TTL=32 ID=9975 
> PROTO=ICMP TYPE=8 CODE=0 ID=512 SEQ=24320 

(Please try to avoid the wordwrapping).

That's a networking crash.  Do the oops traces which you're getting all look
like this one?

Pentium III's are getting a bit old (resistive connections, drooping
power supplies, etc) so there's a decent chance that you're seeing
hardware failures here.



^ permalink raw reply	[flat|nested] 5+ messages in thread

* Re: oops with 2.6.23.1, marvel, software raid, reiserfs and samba
  2007-12-16 11:05         ` oops with 2.6.23.1, marvel, software raid, reiserfs and samba Andrew Morton
@ 2007-12-16 11:56           ` Herbert Xu
  2007-12-16 12:21             ` Herbert Xu
  2007-12-16 14:55           ` jeffunit
  1 sibling, 1 reply; 5+ messages in thread
From: Herbert Xu @ 2007-12-16 11:56 UTC (permalink / raw)
  To: Andrew Morton; +Cc: jeff, linux-kernel, netdev, davem

Andrew Morton <akpm@linux-foundation.org> wrote:
>
>> Dec  7 17:20:53 sata_fileserver kernel: Code: 6c 39 df 74 59 8d b6 00 
>> 00 00 00 85 db 74 4f 8b 55 cc 8d 43 20 8b 0a 3b 48 18 0f 88 f4 05 00 
>> 00 89 ce 2b 70 18 8b 83 90 00 00 00 <0f> b6 50 0d 89 d0 83 e0 02 3c 
>> 01 8b 43 50 83 d6 ff 39 c6 0f 82

This means that skb->network_header == NULL so this line crashes:

	if (tcp_hdr(skb)->syn)
		offset--;

> That's a networking crash.  Do the oops traces which you're getting all look
> like this one?

What's spooky is that I just did a google and we've had reports
since 1998 all crashing on exactly the same line in tcp_recvmsg.

Cheers,
-- 
Visit Openswan at http://www.openswan.org/
Email: Herbert Xu ~{PmV>HI~} <herbert@gondor.apana.org.au>
Home Page: http://gondor.apana.org.au/~herbert/
PGP Key: http://gondor.apana.org.au/~herbert/pubkey.txt

^ permalink raw reply	[flat|nested] 5+ messages in thread

* Re: oops with 2.6.23.1, marvel, software raid, reiserfs and samba
  2007-12-16 11:56           ` Herbert Xu
@ 2007-12-16 12:21             ` Herbert Xu
  0 siblings, 0 replies; 5+ messages in thread
From: Herbert Xu @ 2007-12-16 12:21 UTC (permalink / raw)
  To: Andrew Morton; +Cc: jeff, linux-kernel, netdev, davem

On Sun, Dec 16, 2007 at 07:56:56PM +0800, Herbert Xu wrote:
>
> What's spooky is that I just did a google and we've had reports
> since 1998 all crashing on exactly the same line in tcp_recvmsg.

However, there's been no reports at all since 2000 apart from this
one so the earlier ones are probably not related.

Cheers,
-- 
Visit Openswan at http://www.openswan.org/
Email: Herbert Xu ~{PmV>HI~} <herbert@gondor.apana.org.au>
Home Page: http://gondor.apana.org.au/~herbert/
PGP Key: http://gondor.apana.org.au/~herbert/pubkey.txt

^ permalink raw reply	[flat|nested] 5+ messages in thread

* Re: oops with 2.6.23.1, marvel, software raid, reiserfs and samba
  2007-12-16 11:05         ` oops with 2.6.23.1, marvel, software raid, reiserfs and samba Andrew Morton
  2007-12-16 11:56           ` Herbert Xu
@ 2007-12-16 14:55           ` jeffunit
  2007-12-16 22:09             ` Andrew Morton
  1 sibling, 1 reply; 5+ messages in thread
From: jeffunit @ 2007-12-16 14:55 UTC (permalink / raw)
  To: Andrew Morton, jeffunit; +Cc: linux-kernel, netdev

At 03:05 AM 12/16/2007, Andrew Morton wrote:
>On Fri, 07 Dec 2007 19:49:52 -0800 jeffunit <jeff@jeffunit.com> wrote:
>
> > I am running linux kernel 2.6.23.1, which I compiled.
> > The base system was mandriva 2008.
> >
> > I have a dual processor pentium III 933 system.
> > It has 3gb of ram, an intel stl-2 motherboard.
> > It also has a promise 100 tx2 pata controller,
> > a supermicro marvell based 8 port pcix sata controller,
> > and a nvidia pci based video card.
> >
> > I have the os on a pata drive, and have made a software raid array
> > consisting of 4 sata drives attached to the pcix sata controller.
> > I created the array, and formatted with reiserfs 3.6
> > I have run bonnie++ (filesystem benchmark) on the array without incident.
> > When I use samba-3.0.25b-4.3 and copy files from a windows machine to
> > the fileserver,
> > every so often, the fileserver crashes or hangs. It seems to happen
> > more often under heavy samba traffic.
> > Enclosed is the oops from syslog.
> > I also have a 'kernel bug' from syslog if that would be helpful.
> >
> > jeff
> >
> >
> > Dec  7 17:20:52 sata_fileserver kernel: BUG: unable to handle kernel
> > NULL pointer dereference at virtual address 0000000d
> > Dec  7 17:20:52 sata_fileserver kernel:  printing eip:
> > Dec  7 17:20:52 sata_fileserver kernel: c02cc820
> > Dec  7 17:20:52 sata_fileserver kernel: *pde = 00000000
> > Dec  7 17:20:52 sata_fileserver kernel: Oops: 0000 [#1]
> > Dec  7 17:20:52 sata_fileserver kernel: SMP
> > Dec  7 17:20:52 sata_fileserver kernel: Modules linked in: raid456
> > async_xor async_memcpy async_tx xor iptable_raw xt_comment xt_policy
> > xt_multiport ipt_ULOG ipt_TTL ipt_ttl ipt_TOS ipt_tos ipt_SAME
> > ipt_REJECT ipt_REDIRECT ipt_recent ipt_owner ipt_NETMAP
> > ipt_MASQUERADE ipt_LOG ipt_iprange ipt_ECN ipt_ecn ipt_CLUSTERIP
> > ipt_ah ipt_addrtype nf_nat_tftp nf_nat_snmp_basic nf_nat_sip
> > nf_nat_pptp nf_nat_proto_gre nf_nat_irc nf_nat_h323 nf_nat_ftp
> > nf_nat_amanda ts_kmp nf_conntrack_amanda nf_conntrack_tftp
> > nf_conntrack_sip nf_conntrack_proto_sctp nf_conntrack_pptp
> > nf_conntrack_proto_gre nf_conntrack_netlink nf_conntrack_netbios_ns
> > nf_conntrack_irc nf_conntrack_h323 nf_conntrack_ftp xt_tcpmss
> > xt_pkttype xt_physdev xt_NFQUEUE xt_NFLOG xt_MARK xt_mark xt_mac
> > xt_limit xt_length xt_helper xt_hashlimit ip6_tables xt_dccp
> > xt_conntrack xt_CONNMARK xt_connmark xt_CLASSIFY nfsd xt_tcpudp
> > exportfs auth_rpcgss xt_state iptable_nat nf_nat nf_conntrack_ipv4
> > nf_conntrack nfs iptable_mangle lockd nfs_acl sunrpc nfnetlink
> > iptable_filter ip_table
> > Dec  7 17:20:52 sata_fileserver kernel:  x_tables af_packet ipv6
> > snd_seq_dummy snd_seq_oss snd_seq_midi_event snd_seq snd_pcm_oss
> > snd_mixer_oss ipmi_si ipmi_msghandler binfmt_misc loop nls_utf8 ntfs
> > dm_mod usb_storage sg sd_mod sata_mv libata scsi_mod video output
> > thermal sbs processor fan container button dock battery ac floppy
> > snd_emu10k1 snd_rawmidi snd_ac97_codec ac97_bus snd_pcm
> > snd_seq_device snd_timer snd_page_alloc snd_util_mem snd_hwdep
> > ehci_hcd snd ohci_hcd i2c_piix4 uhci_hcd soundcore e1000 sworks_agp
> > i2c_core ide_cd usbcore agpgart emu10k1_gp gameport tsdev evdev
> > reiserfs ide_disk serverworks pdc202xx_new ide_core
> > Dec  7 17:20:52 sata_fileserver kernel: CPU:    1
> > Dec  7 17:20:52 sata_fileserver kernel:
> > EIP:    0060:[<c02cc820>]    Not tainted VLI
> > Dec  7 17:20:52 sata_fileserver kernel: EFLAGS: 00210202   (2.6.23.1 #1)
> > Dec  7 17:20:52 sata_fileserver kernel: EIP is at tcp_recvmsg+0x150/0xbf0
> > Dec  7 17:20:52 sata_fileserver kernel: eax: 00000000   ebx:
> > f55c4b60   ecx: 784e2c7c   edx: f63f63d8
> > Dec  7 17:20:52 sata_fileserver kernel: esi: 784e2c7a   edi:
> > f63f614c   ebp: e21fde24   esp: e21fddc4
> > Dec  7 17:20:52 sata_fileserver kernel: ds: 007b   es: 007b   fs:
> > 00d8  gs: 0033  ss: 0068
> > Dec  7 17:20:52 sata_fileserver kernel: Process smbd (pid: 9524,
> > ti=e21fc000 task=f5109000 task.ti=e21fc000)
> > Dec  7 17:20:52 sata_fileserver kernel: Stack: 00000000 ffffffff
> > 00000000 c13e5740 f557b000 c03fa300 00000000 e21fde90
> > Dec  7 17:20:52 sata_fileserver kernel:        f63f60e0 00000000
> > 00000b64 f63f63d8 000005b4 00000001 00000000 00000000
> > Dec  7 17:20:52 sata_fileserver kernel:        00000000 000005b4
> > e21fde4c 7fffffff e21fde28 00000000 c03a4de0 e21fde90
> > Dec  7 17:20:52 sata_fileserver kernel: Call Trace:
> > Dec  7 17:20:53 sata_fileserver kernel:  [<c010542a>]
> > show_trace_log_lvl+0x1a/0x30
> > Dec  7 17:20:53 sata_fileserver kernel:  [<c01054eb>]
> > show_stack_log_lvl+0xab/0xd0
> > Dec  7 17:20:53 sata_fileserver kernel:  [<c01056e1>]
> > show_registers+0x1d1/0x2d0
> > Dec  7 17:20:53 sata_fileserver kernel:  [<c01058f6>] die+0x116/0x250
> > Dec  7 17:20:53 sata_fileserver kernel:  [<c011f52b>] 
> do_page_fault+0x28b/0x6a0
> > Dec  7 17:20:53 sata_fileserver kernel:  [<c030938a>] error_code+0x72/0x78
> > Dec  7 17:20:53 sata_fileserver kernel:  [<c0295423>]
> > sock_common_recvmsg+0x43/0x60
> > Dec  7 17:20:53 sata_fileserver kernel:  [<c029301c>] 
> sock_aio_read+0x11c/0x130
> > Dec  7 17:20:53 sata_fileserver kernel:  [<c017db30>] 
> do_sync_read+0xd0/0x110
> > Dec  7 17:20:53 sata_fileserver kernel:  [<c017e47d>] vfs_read+0x12d/0x140
> > Dec  7 17:20:53 sata_fileserver kernel:  [<c017e8bd>] sys_read+0x3d/0x70
> > Dec  7 17:20:53 sata_fileserver kernel:  [<c01042fe>]
> > sysenter_past_esp+0x6b/0xa1
> > Dec  7 17:20:53 sata_fileserver kernel:  =======================
> > Dec  7 17:20:53 sata_fileserver kernel: Code: 6c 39 df 74 59 8d b6 00
> > 00 00 00 85 db 74 4f 8b 55 cc 8d 43 20 8b 0a 3b 48 18 0f 88 f4 05 00
> > 00 89 ce 2b 70 18 8b 83 90 00 00 00 <0f> b6 50 0d 89 d0 83 e0 02 3c
> > 01 8b 43 50 83 d6 ff 39 c6 0f 82
> > Dec  7 17:20:53 sata_fileserver kernel: EIP: [<c02cc820>]
> > tcp_recvmsg+0x150/0xbf0 SS:ESP 0068:e21fddc4
> > Dec  7 17:21:11 sata_fileserver kernel:
> > Shorewall:net2all:DROP:IN=eth0 OUT=
> > MAC=00:04:23:a8:12:cf:00:11:2f:42:d4:32:08:00 SRC=192.168.47.120
> > DST=192.168.47.101 LEN=60 TOS=0x00 PREC=0x00 TTL=32 ID=9964
> > PROTO=ICMP TYPE=8 CODE=0 ID=512 SEQ=24064
> > Dec  7 17:21:13 sata_fileserver kernel:
> > Shorewall:net2all:DROP:IN=eth0 OUT=
> > MAC=00:04:23:a8:12:cf:00:11:2f:42:d4:32:08:00 SRC=192.168.47.120
> > DST=192.168.47.101 LEN=60 TOS=0x00 PREC=0x00 TTL=32 ID=9975
> > PROTO=ICMP TYPE=8 CODE=0 ID=512 SEQ=24320
>
>(Please try to avoid the wordwrapping).
>
>That's a networking crash.  Do the oops traces which you're getting all look
>like this one?
>
>Pentium III's are getting a bit old (resistive connections, drooping
>power supplies, etc) so there's a decent chance that you're seeing
>hardware failures here.

The other trace is a kernel bug. lt is included below.

It is true the hardware is a bit old, but I freshly assembled the system.
The power supply is new, everything has been re-seated.
I will be updating the hardware eventually, but I picked this hardware
because it is low power (@120watts), server grade, has ecc memory,
and has pcix- slots, which my ethernet card and 8 port sata controller need.

For what it is worth, the ethernet card is an intel  pro1000-mt.

Dec  3 15:44:50 sata_fileserver kernel: ------------[ cut here ]------------
Dec  3 15:44:50 sata_fileserver kernel: Kernel BUG at c0167b30 
[verbose debug info unavailable]
Dec  3 15:44:50 sata_fileserver kernel: invalid opcode: 0000 [#1]
Dec  3 15:44:51 sata_fileserver kernel: SMP
Dec  3 15:44:51 sata_fileserver kernel: Modules linked in: 
iptable_raw xt_comment xt_policy xt_multiport ipt_ULOG ipt_TTL 
ipt_ttl ipt_TOS ipt_tos ipt_SAME ipt_REJECT ipt_REDIRECT ipt_recent 
ipt_owner ipt_NETMAP ipt_MASQUERADE ipt_LOG ipt_iprange ipt_ECN 
ipt_ecn ipt_CLUSTERIP ipt_ah ipt_addrtype nf_nat_tftp 
nf_nat_snmp_basic nf_nat_sip nf_nat_pptp nf_nat_proto_gre nf_nat_irc 
nf_nat_h323 nf_nat_ftp nf_nat_amanda ts_kmp nf_conntrack_amanda 
nf_conntrack_tftp nf_conntrack_sip nf_conntrack_proto_sctp 
nf_conntrack_pptp nf_conntrack_proto_gre nf_conntrack_netlink 
nf_conntrack_netbios_ns nf_conntrack_irc nf_conntrack_h323 
nf_conntrack_ftp xt_tcpmss xt_pkttype xt_physdev xt_NFQUEUE xt_NFLOG 
xt_MARK xt_mark xt_mac xt_limit xt_length xt_helper xt_hashlimit 
ip6_tables xt_dccp xt_conntrack xt_CONNMARK xt_connmark xt_CLASSIFY 
xt_tcpudp nfsd xt_state iptable_nat nf_nat nf_conntrack_ipv4 exportfs 
auth_rpcgss nf_conntrack iptable_mangle nfnetlink nfs lockd nfs_acl 
sunrpc iptable_filter ip_tables x_tables af_packet ipv6 snd_seq_dummy snd_
Dec  3 15:44:51 sata_fileserver kernel: eq_oss snd_seq_midi_event 
snd_seq snd_pcm_oss snd_mixer_oss ipmi_si ipmi_msghandler binfmt_misc 
loop nls_utf8 ntfs raid456 async_xor async_memcpy async_tx xor dm_mod 
usb_storage sg sd_mod sata_mv libata scsi_mod video output thermal 
sbs processor fan container button dock battery ac floppy snd_emu10k1 
snd_rawmidi snd_ac97_codec ac97_bus snd_pcm ide_cd snd_seq_device 
snd_timer snd_page_alloc i2c_piix4 snd_util_mem ohci_hcd uhci_hcd 
i2c_core ehci_hcd snd_hwdep e1000 snd sworks_agp agpgart soundcore 
usbcore emu10k1_gp gameport tsdev evdev reiserfs ide_disk serverworks 
pdc202xx_new ide_core
Dec  3 15:44:51 sata_fileserver kernel: CPU:    1
Dec  3 15:44:51 sata_fileserver kernel: 
EIP:    0060:[<c0167b30>]    Not tainted VLI
Dec  3 15:44:51 sata_fileserver kernel: EFLAGS: 00210246   (2.6.23.1 #1)
Dec  3 15:44:51 sata_fileserver kernel: EIP is at set_page_address+0x170/0x180
Dec  3 15:44:51 sata_fileserver kernel: eax: ffbff000   ebx: 
ffbff000   ecx: c0005ffc   edx: ffbff000
Dec  3 15:44:51 sata_fileserver kernel: esi: c17d6c60   edi: 
c0443ec0   ebp: ea139c88   esp: ea139c74
Dec  3 15:44:51 sata_fileserver kernel: ds: 007b   es: 007b   fs: 
00d8  gs: 0033  ss: 0068
Dec  3 15:44:52 sata_fileserver kernel: Process smbd (pid: 6132, 
ti=ea138000 task=f139c000 task.ti=ea138000)
Dec  3 15:44:52 sata_fileserver kernel: Stack: ffbff000 00200286 
ffbff000 c17d6c60 3eb63163 ea139cb4 c0167ed2 ea139ca8
Dec  3 15:44:52 sata_fileserver kernel:        ea138000 804cbe2c 
804cce2c ea139cac c0125248 c17d6c60 804cbe2c 804cce2c
Dec  3 15:44:52 sata_fileserver kernel:        ea139cc0 c01209b0 
00000000 ea139cec f8aa86b5 0000000f 00000002 00000000
Dec  3 15:44:52 sata_fileserver kernel: Call Trace:
Dec  3 15:44:52 sata_fileserver kernel:  [<c010542a>] 
show_trace_log_lvl+0x1a/0x30
Dec  3 15:44:52 sata_fileserver kernel:  [<c01054eb>] 
show_stack_log_lvl+0xab/0xd0
Dec  3 15:44:52 sata_fileserver kernel:  [<c01056e1>] 
show_registers+0x1d1/0x2d0
Dec  3 15:44:52 sata_fileserver kernel:  [<c01058f6>] die+0x116/0x250
Dec  3 15:44:53 sata_fileserver kernel:  [<c0105ac1>] do_trap+0x91/0xc0
Dec  3 15:44:53 sata_fileserver kernel:  [<c0105dd8>] do_invalid_op+0x88/0xa0
Dec  3 15:44:53 sata_fileserver kernel:  [<c030938a>] error_code+0x72/0x78
Dec  3 15:44:53 sata_fileserver kernel:  [<c0167ed2>] kmap_high+0x152/0x1b0
Dec  3 15:44:53 sata_fileserver kernel:  [<c01209b0>] kmap+0x50/0x80
Dec  3 15:44:53 sata_fileserver kernel:  [<f8aa86b5>] 
reiserfs_copy_from_user_to_file_region+0xa5/0xf0 [reiserfs]
Dec  3 15:44:53 sata_fileserver kernel:  [<f8aa9c06>] 
reiserfs_file_write+0x746/0x1dd0 [reiserfs]
Dec  3 15:44:53 sata_fileserver kernel:  [<c017e2c5>] vfs_write+0xb5/0x140
Dec  3 15:44:53 sata_fileserver kernel:  [<c017ea43>] sys_pwrite64+0x63/0x80
Dec  3 15:44:54 sata_fileserver kernel:  [<c01042fe>] 
sysenter_past_esp+0x6b/0xa1
Dec  3 15:44:54 sata_fileserver kernel:  =======================
Dec  3 15:44:54 sata_fileserver kernel: Code: 3a 44 c0 89 1a 89 53 04 
89 c2 b8 0c 3a 44 c0 e8 67 15 1a 00 e9 6f ff ff ff 8b 45 f0 89 ca e8 
58 15 1a 00 83 c4 08 5b 5e 5f 5d c3 <0f> 0b eb fe 8d b6 00 00 00 00 
8d bf 00 00 00 00 55 89 e5 83 ec
Dec  3 15:44:54 sata_fileserver kernel: EIP: [<c0167b30>] 
set_page_address+0x170/0x180 SS:ESP 0068:ea139c74
Dec  3 15:44:54 sata_fileserver kernel: WARNING: at 
/usr/src/linux-2.6.23.1/kernel/exit.c:892 do_exit()
Dec  3 15:44:54 sata_fileserver kernel:  [<c010542a>] 
show_trace_log_lvl+0x1a/0x30
Dec  3 15:44:54 sata_fileserver kernel:  [<c0106022>] show_trace+0x12/0x20
Dec  3 15:44:54 sata_fileserver kernel:  [<c0106046>] dump_stack+0x16/0x20
Dec  3 15:44:54 sata_fileserver kernel:  [<c012d064>] do_exit+0x834/0x840
Dec  3 15:44:54 sata_fileserver kernel:  [<c0105a29>] die+0x249/0x250
Dec  3 15:44:54 sata_fileserver kernel:  [<c0105ac1>] do_trap+0x91/0xc0
Dec  3 15:44:54 sata_fileserver kernel:  [<c0105dd8>] do_invalid_op+0x88/0xa0
Dec  3 15:44:54 sata_fileserver kernel:  [<c030938a>] error_code+0x72/0x78
Dec  3 15:44:54 sata_fileserver kernel:  [<c0167ed2>] kmap_high+0x152/0x1b0
Dec  3 15:44:54 sata_fileserver kernel:  [<c01209b0>] kmap+0x50/0x80
Dec  3 15:44:54 sata_fileserver kernel:  [<f8aa86b5>] 
reiserfs_copy_from_user_to_file_region+0xa5/0xf0 [reiserfs]
Dec  3 15:44:54 sata_fileserver kernel:  [<f8aa9c06>] 
reiserfs_file_write+0x746/0x1dd0 [reiserfs]
Dec  3 15:44:54 sata_fileserver kernel:  [<c017e2c5>] vfs_write+0xb5/0x140
Dec  3 15:44:54 sata_fileserver kernel:  [<c017ea43>] sys_pwrite64+0x63/0x80
Dec  3 15:44:54 sata_fileserver kernel:  [<c01042fe>] 
sysenter_past_esp+0x6b/0xa1
Dec  3 15:44:54 sata_fileserver kernel:  =======================
Dec  3 15:44:54 sata_fileserver kernel: 
Shorewall:net2all:DROP:IN=eth0 OUT= 
MAC=00:04:23:a8:12:cf:00:11:2f:42:d4:32:08:00 SRC=192.168.47.120 
DST=192.168.47.101 LEN=60 TOS=0x00 PREC=0x00 TTL=32 ID=24365 
PROTO=ICMP TYPE=8 CODE=0 ID=512 SEQ=6912
Dec  3 15:44:54 sata_fileserver kernel: 
Shorewall:net2all:DROP:IN=eth0 OUT= 
MAC=00:04:23:a8:12:cf:00:11:2f:42:d4:32:08:00 SRC=192.168.47.120 
DST=192.168.47.101 LEN=60 TOS=0x00 PREC=0x00 TTL=32 ID=24381 
PROTO=ICMP TYPE=8 CODE=0 ID=512 SEQ=7168



^ permalink raw reply	[flat|nested] 5+ messages in thread

* Re: oops with 2.6.23.1, marvel, software raid, reiserfs and samba
  2007-12-16 14:55           ` jeffunit
@ 2007-12-16 22:09             ` Andrew Morton
  0 siblings, 0 replies; 5+ messages in thread
From: Andrew Morton @ 2007-12-16 22:09 UTC (permalink / raw)
  To: jeffunit; +Cc: linux-kernel, netdev

On Sun, 16 Dec 2007 06:55:51 -0800 jeffunit <jeff@jeffunit.com> wrote:

> At 03:05 AM 12/16/2007, Andrew Morton wrote:
> >On Fri, 07 Dec 2007 19:49:52 -0800 jeffunit <jeff@jeffunit.com> wrote:
> >
> > > I am running linux kernel 2.6.23.1, which I compiled.
> > > The base system was mandriva 2008.
> > >
> > > I have a dual processor pentium III 933 system.
> > > It has 3gb of ram, an intel stl-2 motherboard.
> > > It also has a promise 100 tx2 pata controller,
> > > a supermicro marvell based 8 port pcix sata controller,
> > > and a nvidia pci based video card.
> > >
> > > I have the os on a pata drive, and have made a software raid array
> > > consisting of 4 sata drives attached to the pcix sata controller.
> > > I created the array, and formatted with reiserfs 3.6
> > > I have run bonnie++ (filesystem benchmark) on the array without incident.
> > > When I use samba-3.0.25b-4.3 and copy files from a windows machine to
> > > the fileserver,
> > > every so often, the fileserver crashes or hangs. It seems to happen
> > > more often under heavy samba traffic.
> > > Enclosed is the oops from syslog.
> > > I also have a 'kernel bug' from syslog if that would be helpful.
> > >
>
> ...
>
> >
> >(Please try to avoid the wordwrapping).

(you didn't)

> >That's a networking crash.  Do the oops traces which you're getting all look
> >like this one?
> >
> >Pentium III's are getting a bit old (resistive connections, drooping
> >power supplies, etc) so there's a decent chance that you're seeing
> >hardware failures here.
> 
> The other trace is a kernel bug. lt is included below.
> 
> It is true the hardware is a bit old, but I freshly assembled the system.
> The power supply is new, everything has been re-seated.
> I will be updating the hardware eventually, but I picked this hardware
> because it is low power (@120watts), server grade, has ecc memory,
> and has pcix- slots, which my ethernet card and 8 port sata controller need.
> 
> For what it is worth, the ethernet card is an intel  pro1000-mt.
> 
> Dec  3 15:44:50 sata_fileserver kernel: ------------[ cut here ]------------
> Dec  3 15:44:50 sata_fileserver kernel: Kernel BUG at c0167b30 
> [verbose debug info unavailable]

I'd suggest that you enable CONFIG_DEBUG_BUGVERBOSE, especially when the
system is having trouble.  It's worth it.

> Dec  3 15:44:50 sata_fileserver kernel: invalid opcode: 0000 [#1]
> Dec  3 15:44:51 sata_fileserver kernel: SMP
> Dec  3 15:44:51 sata_fileserver kernel: Modules linked in: 
> iptable_raw xt_comment xt_policy xt_multiport ipt_ULOG ipt_TTL 
> ipt_ttl ipt_TOS ipt_tos ipt_SAME ipt_REJECT ipt_REDIRECT ipt_recent 
> ipt_owner ipt_NETMAP ipt_MASQUERADE ipt_LOG ipt_iprange ipt_ECN 
> ipt_ecn ipt_CLUSTERIP ipt_ah ipt_addrtype nf_nat_tftp 
> nf_nat_snmp_basic nf_nat_sip nf_nat_pptp nf_nat_proto_gre nf_nat_irc 
> nf_nat_h323 nf_nat_ftp nf_nat_amanda ts_kmp nf_conntrack_amanda 
> nf_conntrack_tftp nf_conntrack_sip nf_conntrack_proto_sctp 
> nf_conntrack_pptp nf_conntrack_proto_gre nf_conntrack_netlink 
> nf_conntrack_netbios_ns nf_conntrack_irc nf_conntrack_h323 
> nf_conntrack_ftp xt_tcpmss xt_pkttype xt_physdev xt_NFQUEUE xt_NFLOG 
> xt_MARK xt_mark xt_mac xt_limit xt_length xt_helper xt_hashlimit 
> ip6_tables xt_dccp xt_conntrack xt_CONNMARK xt_connmark xt_CLASSIFY 
> xt_tcpudp nfsd xt_state iptable_nat nf_nat nf_conntrack_ipv4 exportfs 
> auth_rpcgss nf_conntrack iptable_mangle nfnetlink nfs lockd nfs_acl 
> sunrpc iptable_filter ip_tables x_tables af_packet ipv6 snd_seq_dummy snd_
> Dec  3 15:44:51 sata_fileserver kernel: eq_oss snd_seq_midi_event 
> snd_seq snd_pcm_oss snd_mixer_oss ipmi_si ipmi_msghandler binfmt_misc 
> loop nls_utf8 ntfs raid456 async_xor async_memcpy async_tx xor dm_mod 
> usb_storage sg sd_mod sata_mv libata scsi_mod video output thermal 
> sbs processor fan container button dock battery ac floppy snd_emu10k1 
> snd_rawmidi snd_ac97_codec ac97_bus snd_pcm ide_cd snd_seq_device 
> snd_timer snd_page_alloc i2c_piix4 snd_util_mem ohci_hcd uhci_hcd 
> i2c_core ehci_hcd snd_hwdep e1000 snd sworks_agp agpgart soundcore 
> usbcore emu10k1_gp gameport tsdev evdev reiserfs ide_disk serverworks 
> pdc202xx_new ide_core
> Dec  3 15:44:51 sata_fileserver kernel: CPU:    1
> Dec  3 15:44:51 sata_fileserver kernel: 
> EIP:    0060:[<c0167b30>]    Not tainted VLI
> Dec  3 15:44:51 sata_fileserver kernel: EFLAGS: 00210246   (2.6.23.1 #1)
> Dec  3 15:44:51 sata_fileserver kernel: EIP is at set_page_address+0x170/0x180
> Dec  3 15:44:51 sata_fileserver kernel: eax: ffbff000   ebx: 
> ffbff000   ecx: c0005ffc   edx: ffbff000
> Dec  3 15:44:51 sata_fileserver kernel: esi: c17d6c60   edi: 
> c0443ec0   ebp: ea139c88   esp: ea139c74
> Dec  3 15:44:51 sata_fileserver kernel: ds: 007b   es: 007b   fs: 
> 00d8  gs: 0033  ss: 0068
> Dec  3 15:44:52 sata_fileserver kernel: Process smbd (pid: 6132, 
> ti=ea138000 task=f139c000 task.ti=ea138000)
> Dec  3 15:44:52 sata_fileserver kernel: Stack: ffbff000 00200286 
> ffbff000 c17d6c60 3eb63163 ea139cb4 c0167ed2 ea139ca8
> Dec  3 15:44:52 sata_fileserver kernel:        ea138000 804cbe2c 
> 804cce2c ea139cac c0125248 c17d6c60 804cbe2c 804cce2c
> Dec  3 15:44:52 sata_fileserver kernel:        ea139cc0 c01209b0 
> 00000000 ea139cec f8aa86b5 0000000f 00000002 00000000
> Dec  3 15:44:52 sata_fileserver kernel: Call Trace:
> Dec  3 15:44:52 sata_fileserver kernel:  [<c010542a>] 
> show_trace_log_lvl+0x1a/0x30
> Dec  3 15:44:52 sata_fileserver kernel:  [<c01054eb>] 
> show_stack_log_lvl+0xab/0xd0
> Dec  3 15:44:52 sata_fileserver kernel:  [<c01056e1>] 
> show_registers+0x1d1/0x2d0
> Dec  3 15:44:52 sata_fileserver kernel:  [<c01058f6>] die+0x116/0x250
> Dec  3 15:44:53 sata_fileserver kernel:  [<c0105ac1>] do_trap+0x91/0xc0
> Dec  3 15:44:53 sata_fileserver kernel:  [<c0105dd8>] do_invalid_op+0x88/0xa0
> Dec  3 15:44:53 sata_fileserver kernel:  [<c030938a>] error_code+0x72/0x78
> Dec  3 15:44:53 sata_fileserver kernel:  [<c0167ed2>] kmap_high+0x152/0x1b0
> Dec  3 15:44:53 sata_fileserver kernel:  [<c01209b0>] kmap+0x50/0x80
> Dec  3 15:44:53 sata_fileserver kernel:  [<f8aa86b5>] 
> reiserfs_copy_from_user_to_file_region+0xa5/0xf0 [reiserfs]
> Dec  3 15:44:53 sata_fileserver kernel:  [<f8aa9c06>] 
> reiserfs_file_write+0x746/0x1dd0 [reiserfs]
> Dec  3 15:44:53 sata_fileserver kernel:  [<c017e2c5>] vfs_write+0xb5/0x140
> Dec  3 15:44:53 sata_fileserver kernel:  [<c017ea43>] sys_pwrite64+0x63/0x80
> Dec  3 15:44:54 sata_fileserver kernel:  [<c01042fe>] 
> sysenter_past_esp+0x6b/0xa1

This is a totally different crash and I don't think I've ever before seen a
crash in kmap()->set_page_address().  I'm suspecting hardware problems. 
Can you run memtest86 on that box for a day or so?


^ permalink raw reply	[flat|nested] 5+ messages in thread

end of thread, other threads:[~2007-12-16 22:09 UTC | newest]

Thread overview: 5+ messages (download: mbox.gz follow: Atom feed
-- links below jump to the message on this page --
     [not found] <20071205151147.9db4640b.akpm@linux-foundation.org>
     [not found] ` <47573806.8000808@am.sony.com>
     [not found]   ` <20071206142552.694B.Y-GOTO@jp.fujitsu.com>
     [not found]     ` <475A0EF4.1020502@am.sony.com>
     [not found]       ` <20071208035018.JENM21128.mta13.adelphia.net@dual-xeon.jeffunit.com>
2007-12-16 11:05         ` oops with 2.6.23.1, marvel, software raid, reiserfs and samba Andrew Morton
2007-12-16 11:56           ` Herbert Xu
2007-12-16 12:21             ` Herbert Xu
2007-12-16 14:55           ` jeffunit
2007-12-16 22:09             ` Andrew Morton

This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).