From: Steffen Persvold <sp@scali.no>
To: lkml <linux-kernel@vger.kernel.org>, saw@saw.sw.com.sg
Subject: eepro100 ? network driver problem.
Date: Sat, 22 Dec 2001 11:55:13 +0100 [thread overview]
Message-ID: <3C246691.2CD6460F@scali.no> (raw)
Hi,
I have a 16 node cluster with dual Intel 82557 network controllers (eepro100) driver. All nodes have
RedHat 7.1 installed and using the latest kernel update from RH (2.4.9.-12). Here's a bit from dmesg
:
eepro100.c:v1.09j-t 9/29/99 Donald Becker http://cesdis.gsfc.nasa.gov/linux/drivers/eepro100.html
eepro100.c: $Revision: 1.36 $ 2000/11/17 Modified by Andrey V. Savochkin <saw@saw.sw.com.sg> and
others
eth0: OEM i82557/i82558 10/100 Ethernet, 00:30:48:11:3E:53, IRQ 31.
Board assembly 000000-000, Physical connectors present: RJ45
Primary interface chip i82555 PHY #1.
General self-test: passed.
Serial sub-system self-test: passed.
Internal registers self-test: passed.
ROM checksum self-test: passed (0x04f4518b).
eth1: OEM i82557/i82558 10/100 Ethernet, 00:30:48:11:3C:44, IRQ 28.
Board assembly 000000-000, Physical connectors present: RJ45
Primary interface chip i82555 PHY #1.
General self-test: passed.
Serial sub-system self-test: passed.
Internal registers self-test: passed.
ROM checksum self-test: passed (0x04f4518b).
Sometimes under heavy load one of the nodes (random node each time) crash with the following Oops :
Unable to handle kernel paging request at virtual address 0000d5ca
printing eip:
c0200002
*pde = 00000000
Oops: 0002
CPU: 1
EIP: 0010:[<c0200002>] Not tainted
EFLAGS: 00010282
eax: 00000000 ebx: 00000001 ecx: dfa36000 edx: c0311840
esi: dfa36000 edi: 0000d5c2 ebp: df7123c0 esp: c9bc5ee0
ds: 0018 es: 0018 ss: 0018
Process mpi_bomb (pid: 16020, stackpage=c9bc5000)
Stack: 00000001 df7123c0 00000001 df7123c0 c01d109c 00000108 00000002 00000020
0000003c c01ccd6c df7123c0 df7123c0 00000001 00000003 dfa36000 e0902188
dfa36000 00000001 dfa36144 0000000c 00000001 00000001 df7123c0 df7123c0
Call Trace: [<c01d109c>] netif_rx [kernel] 0x8c
[<c01ccd6c>] alloc_skb [kernel] 0xfc
[<e0902188>] __insmod_eepro100_S.text_L11712 [eepro100] 0x2128
[<c01d154b>] net_rx_action [kernel] 0x1eb
[<c011f74b>] do_softirq [kernel] 0x7b
[<c0108c4d>] do_IRQ [kernel] 0xdd
[<c022a550>] call_do_IRQ [kernel] 0x5
Code: 81 47 08 89 44 24 28 b8 b8 1a 31 c0 0f b7 5e 5c f0 83 28 01
<0>Kernel panic: Aiee, killing interrupt handler!
In interrupt handler - not syncing
Is this a known problem with the eepro100 driver in 2.4.9 and fixed in later kernels, or is it a
generic network driver problem (maybe VM related since the machine might be stressed on memory). I
also wonder why the process running at the time (mpi_bomb) shows up in the Oops.
Feedback highly appreciated.
Thanks,
--
Steffen Persvold | Scalable Linux Systems | Try out the world's best
mailto:sp@scali.no | http://www.scali.com | performing MPI implementation:
Tel: (+47) 2262 8950 | Olaf Helsets vei 6 | - ScaMPI 1.12.2 -
Fax: (+47) 2262 8951 | N0621 Oslo, NORWAY | >300MBytes/s and <4uS latency
next reply other threads:[~2001-12-22 10:56 UTC|newest]
Thread overview: 2+ messages / expand[flat|nested] mbox.gz Atom feed top
2001-12-22 10:55 Steffen Persvold [this message]
2001-12-22 14:24 ` eepro100 ? network driver problem Andrey Savochkin
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=3C246691.2CD6460F@scali.no \
--to=sp@scali.no \
--cc=linux-kernel@vger.kernel.org \
--cc=saw@saw.sw.com.sg \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.