public inbox for linux-kernel@vger.kernel.org
* Transmit timeout with 3c395, 2.4.19, 2.4.22
@ 2003-10-27 14:13 Norbert Preining
  2003-10-27 19:18 ` Andrew Morton
  2004-03-27 16:41 ` Oops with md/ext3 on 2.4.25 on alpha architecture Norbert Preining
  0 siblings, 2 replies; 13+ messages in thread
From: Norbert Preining @ 2003-10-27 14:13 UTC (permalink / raw)
  To: linux-kernel, Andrew Morton

Hi Andrew, hi list!

Suddenly, after 160 days of running, our bridged firewall started to
spit out this:
NETDEV WATCHDOG: eth1: transmit timed out
eth1: transmit timed out, tx_status 00 status e000.
  diagnostics: net 0cd8 media 8880 dma ffffffff.
  Flags; bus-master 1, dirty 2606584(8) current 2606600(8)
  Transmit list 07f2f400 vs. c7f6f400.
  0: @c7f6f200  length 800000a2 status 000000a2
  1: @c7f6f240  length 8000003c status 0000003c
  2: @c7f6f280  length 80000095 status 00000095
  3: @c7f6f2c0  length 80000092 status 00000092
  4: @c7f6f300  length 80000062 status 00000062
  5: @c7f6f340  length 8000009b status 0000009b
  6: @c7f6f380  length 800000a5 status 800000a5
  7: @c7f6f3c0  length 8000009d status 8000009d
  8: @c7f6f400  length 80000092 status 00000092
  9: @c7f6f440  length 80000072 status 00000072
  10: @c7f6f480  length 80000092 status 00000092
  11: @c7f6f4c0  length 80000072 status 00000072
  12: @c7f6f500  length 800000a2 status 000000a2
  13: @c7f6f540  length 80000089 status 00000089
  14: @c7f6f580  length 80000092 status 00000092
  15: @c7f6f5c0  length 80000091 status 00000091
eth1: Resetting the Tx ring pointer.
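
A note on reading the counters in the dump above. This is a minimal sketch, assuming the Tx ring has 16 entries (the dump lists slots 0 to 15) and that the number in parentheses after each counter is that counter modulo the ring size. The name `ring_state` is made up for illustration and is not part of the driver. Under those assumptions, `current` minus `dirty` equals the ring size, which would suggest that every slot was still waiting to be reclaimed when the watchdog fired.

```python
import re

TX_RING_SIZE = 16  # assumption: inferred from the 16 slots (0..15) in the dump


def ring_state(flags_line, ring_size=TX_RING_SIZE):
    """Parse the 'Flags;' line of the dump and summarise the Tx ring."""
    m = re.search(r"dirty (\d+)\((\d+)\) current (\d+)\((\d+)\)", flags_line)
    if m is None:
        raise ValueError("not a 'Flags;' line")
    dirty, dirty_slot, cur, cur_slot = map(int, m.groups())
    return {
        # slot index each counter maps to inside the ring
        "dirty_slot": dirty % ring_size,
        "current_slot": cur % ring_size,
        # sanity check against the values printed in parentheses
        "slots_match_dump": (dirty % ring_size, cur % ring_size)
        == (dirty_slot, cur_slot),
        # descriptors queued but not yet reclaimed
        "in_flight": cur - dirty,
        "ring_full": cur - dirty >= ring_size,
    }


line = "Flags; bus-master 1, dirty 2606584(8) current 2606600(8)"
state = ring_state(line)
print(state)
```

A full ring on its own does not say whether the card, the cable, or the peer is at fault; it only shows that queued frames were not being completed.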

cont...

This first happened with 2.4.19 plus bridge-firewall patches. I installed
2.4.22 with the latest ebtables/brfw patches, but got the same errors.
In all cases reconfiguring the interfaces didn't help to bring it
back to life, only a reboot helped. Now the very same error has already
happened several times in a short period.

From dmesg:
...
PCI: Found IRQ 5 for device 00:09.0
3c59x: Donald Becker and others. www.scyld.com/network/vortex.html
See Documentation/networking/vortex.txt
00:09.0: 3Com PCI 3c905B Cyclone 100baseTx at 0xe000. Vers LK1.1.18-ac
 00:50:04:4c:6f:22, IRQ 5
  product code 5451 rev 00.12 date 06-06-99
  Internal config register is 1800000, transceivers 0xa.
  8K byte-wide RAM 5:3 Rx:Tx split, autoselect/Autonegotiate interface.
  MII transceiver found at address 24, status 786d.
  Enabling bus-master transmits and whole-frame receives.
00:09.0: scatter/gather enabled. h/w checksums enabled
PCI: Found IRQ 10 for device 00:0c.0
See Documentation/networking/vortex.txt
00:0c.0: 3Com PCI 3c905B Cyclone 100baseTx at 0xe400. Vers LK1.1.18-ac
 00:50:04:4c:6f:97, IRQ 10
  product code 5451 rev 00.12 date 06-06-99
  Internal config register is 1800000, transceivers 0xa.
  8K byte-wide RAM 5:3 Rx:Tx split, autoselect/Autonegotiate interface.
  MII transceiver found at address 24, status 7849.
  Enabling bus-master transmits and whole-frame receives.
00:0c.0: scatter/gather enabled. h/w checksums enabled
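
The two cards report different MII status words above (786d and 7849). The following is a minimal sketch for decoding them, assuming the "status" value the driver prints is the PHY's basic status register (MII register 1) with the standard IEEE 802.3 clause 22 bit layout. The helper name `decode_bmsr` is made up for illustration.

```python
# Low bits of the MII basic status register (register 1),
# per IEEE 802.3 clause 22.
BMSR_BITS = {
    "extended_capability": 0x0001,
    "jabber_detected": 0x0002,
    "link_up": 0x0004,
    "autoneg_capable": 0x0008,
    "remote_fault": 0x0010,
    "autoneg_complete": 0x0020,
}


def decode_bmsr(status):
    """Return a dict of flag name -> bool for one status word."""
    return {name: bool(status & mask) for name, mask in BMSR_BITS.items()}


# Status words copied from the dmesg output above.
for dev, status in (("00:09.0", 0x786D), ("00:0c.0", 0x7849)):
    print(dev, decode_bmsr(status))
```

Under that assumption, 00:09.0 had link and completed autonegotiation when it was probed, while 00:0c.0 had neither at that moment. The link bit latches low and the value is a probe-time snapshot, so this is a hint at most and not proof of a fault.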

This always happened with eth1, so we think it *may* be a hardware error
creeping in. I would like to know whether this can be the case, or
whether there is something else (switches on the other side, ...) which
may have produced these errors.

Best wishes

Norbert

-------------------------------------------------------------------------------
Norbert Preining <preining AT logic DOT at>         Technische Universität Wien
gpg DSA: 0x09C5B094      fp: 14DF 2E6C 0307 BE6D AD76  A9C0 D2BF 4AA3 09C5 B094
-------------------------------------------------------------------------------
OBWESTRY (abs.n.)
Bloody-minded determination on part of a storyteller to continue a
story which both the teller and the listeners know has become
desperately tedious.
			--- Douglas Adams, The Meaning of Liff

* RE: md raid oops on 2.4.25/alpha
@ 2004-04-02  6:53 Lewis Shobbrook
  2004-04-02 12:03 ` Marcelo Tosatti
  0 siblings, 1 reply; 13+ messages in thread
From: Lewis Shobbrook @ 2004-04-02  6:53 UTC (permalink / raw)
  To: linux-kernel; +Cc: debian-testing

Hi All,

[Excerpt from linux-raid mailing list]
> We have some problems with the md code on alpha. We get
> regular oops when using the md raid1. Here we got another
> oops when fsck (at boot time) the raid:
> This was after a fresh reboot. As long as only the raid is
> *not* mounted or fsck the machine works without any oops.

I've found similar results with the unstable Debian 2.4.25-1-686
kernel.
 
> I also can mount the hard disks *without* raid directly as 
> hda1 and hdc1, and do NOT get any errors here, so I suspect 
> that only the md code is the culprit.

Same again here.

I have /dev/hda2 listed in raidtab as a failed disc, and lilo points to
hda as boot.  I get a kernel panic when attempting to boot to it.

I thought this strange as I have another system running the very same
kernel (even used the same copied kernel deb) with a UU (no failed)
raid 1 running off a hpt372 onboard controller (as soft md, not hardware
raid 1).  The raid device was present prior to the kernel in this instance.
I was impressed that the std initrd worked with both md and hpt372
without modification, where the process had been more involved in the
past, requiring a custom initrd or "compiled in" kernel.

I thought that devfs (compiled in with the std Debian kernel) may
have been an issue, but it has never been in the past and the same
filesystem on the raidtab listed "failed" drive /dev/hda2 is happy.

I can boot the raid 1 as root filesystem when passing root=/dev/md0
and loading through a system rescue disc (http://www.sysresccd.org/)
with a 2.4.25 kernel.

Booting to /dev/md0 with the Debian 2.4.25-1-686 kernel panics after it
tries to mount the device /dev2/root2 as ext2, minix & fat (possibly
others that disappear before they can be read), complaining in the std
way as when you attempt to mount with the wrong fs.
...
pivot_root: no such file or directory
/sbin/init: 347: cannot open dev/console : no such file
Kernel panic: Attempted to kill init !

I've attempted to pre-load the initrd modules, but didn't expect this to
be a solution and it wasn't.

I'm scratching the head and losing hair...

Never had any trouble of this sort before.

Any suggestions appreciated.

Cheers,

Lewis


end of thread, other threads:[~2004-04-02 12:17 UTC | newest]

Thread overview: 13+ messages
2003-10-27 14:13 Transmit timeout with 3c395, 2.4.19, 2.4.22 Norbert Preining
2003-10-27 19:18 ` Andrew Morton
2003-10-28  6:49   ` Norbert Preining
2003-10-30 17:50   ` Stian Jordet
2004-03-27 16:41 ` Oops with md/ext3 on 2.4.25 on alpha architecture Norbert Preining
2004-03-28 16:02   ` md raid oops on 2.4.25/alpha Norbert Preining
2004-03-28 17:30     ` Thomas Steudten
2004-03-28 16:45       ` Ross Dickson
2004-03-28 20:32         ` Norbert Preining
2004-03-28 18:30       ` Ivan Kokshaysky
2004-03-29 10:41         ` Thomas Steudten
  -- strict thread matches above, loose matches on Subject: below --
2004-04-02  6:53 Lewis Shobbrook
2004-04-02 12:03 ` Marcelo Tosatti
