* 3c59x driver: card not responding after a while
@ 2002-05-31 15:34 Ronny T. Lampert (EED)
2002-05-31 22:07 ` Scott McDermott
2002-06-03 7:59 ` Helge Hafting
0 siblings, 2 replies; 10+ messages in thread
From: Ronny T. Lampert (EED) @ 2002-05-31 15:34 UTC
To: linux-kernel
Hello,
I'm having (reproducible) problems with the 3c59x driver; after a while
(depends on card/traffic), the card neither sends nor receives anymore.
I have tried both 3c905B and 3c905C series cards; both are affected. I also
tried different mainboard slots.
My current card is identifying as:
00:0c.0 Ethernet controller: 3Com Corporation 3c905C-TX [Fast Etherlink] (rev 74)
        Subsystem: 3Com Corporation 3C905C-TX Fast Etherlink for PC Management NIC
        Flags: bus master, medium devsel, latency 64, IRQ 11
        I/O ports at e800 [size=128]
        Memory at ed000000 (32-bit, non-prefetchable) [size=128]
        Expansion ROM at ec000000 [disabled] [size=128K]
        Capabilities: [dc] Power Management version 2
Kernels tried were
o RH 7.2 stock (2.4.7)
o 2.4.17 (custom, driver builtin) (hangs after a while)
o 2.4.18 (custom, driver builtin) (hangs depending on traffic); last time:
[ifconfig output after the device is frozen]
RX packets:249782 errors:0 dropped:0 overruns:94 frame:0
TX packets:32712 errors:0 dropped:0 overruns:0 carrier:0
collisions:0 txqueuelen:100
RX bytes:49988234 (47.6 Mb) TX bytes:8716361 (8.3 Mb)
2.2.19 worked fine.
gcc -v is gcc version 2.96 20000731 (Red Hat Linux 7.1 2.96-98) (std
from RH 7.2; no 'kgcc' available)
I saw no changes in 2.4.19*, so I went out asking you :)
Machine is a K6-2 500 on a VIA MVP3 chipset, Shuttle mainboard (if that
matters).
If you do a /etc/init.d/network restart (or ifconfig eth0 down ;
ifconfig eth0 ... up), the card works again.
dmesg is not reporting anything.
I also forced the card to fixed 100BaseTx-FD; that didn't change
anything.
If you need further information or more testing please say so.
Please also include me on CC.
Thanks.
--
Ronny T. Lampert email: Ronny.Lampert@eed.ericsson.se
System Administrator voice: +49 911 255 1214
Ericsson Eurolab Deutschland fax: +49 911 255 1960
Nuernberg, Germany
* Re: 3c59x driver: card not responding after a while
2002-05-31 15:34 3c59x driver: card not responding after a while Ronny T. Lampert (EED)
@ 2002-05-31 22:07 ` Scott McDermott
2002-06-01 7:10 ` Zwane Mwaikambo
2002-06-03 7:59 ` Helge Hafting
1 sibling, 1 reply; 10+ messages in thread
From: Scott McDermott @ 2002-05-31 22:07 UTC
To: linux-kernel
Ronny T. Lampert (EED) on Fri 31/05 17:34 +0200:
> I'm having (reproducable) problems with the 3c59x driver; after a
> while (depends on card/traffic), the card doesn't send nor receive
> anymore.
- are you using netfilter?
- cat /proc/net/dev: what does the fifo counter say when the iface is hung?
- it degrades over time and hangs only after quite a bit of data has been
pumped through it, right? with slowly decreasing throughput?
- I feel much better knowing someone else has this bug! I thought for sure I
was crazy, since I had not heard of this problem from anyone else and the
905B is a very common card.
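
For anyone who wants to pull the fifo counters out of /proc/net/dev in a
repeatable way, here is a minimal sketch in Python. It assumes the usual
layout (two header lines, then one line per interface with 8 receive and 8
transmit columns, fifo being the 5th column of each group). The function name
and the sample numbers are made up for illustration; they are not taken from
an actual hung box.

```python
def parse_proc_net_dev(text):
    """Return {iface: {counter: value}} for a few columns of /proc/net/dev.

    Assumed column order per interface line:
      RX: bytes packets errs drop fifo frame compressed multicast
      TX: bytes packets errs drop fifo colls carrier compressed
    """
    stats = {}
    for line in text.splitlines()[2:]:      # skip the two header lines
        if ":" not in line:
            continue
        name, data = line.split(":", 1)     # the counter may touch the colon
        fields = data.split()
        if len(fields) < 16:
            continue                        # not the layout assumed above
        values = [int(f) for f in fields[:16]]
        stats[name.strip()] = {
            "rx_packets": values[1],
            "rx_fifo": values[4],
            "tx_packets": values[9],
            "tx_fifo": values[12],
        }
    return stats


# Hypothetical sample text, for illustration only.
SAMPLE = """\
Inter-|   Receive                                                |  Transmit
 face |bytes    packets errs drop fifo frame compressed multicast|bytes    packets errs drop fifo colls carrier compressed
    lo:    1000      10    0    0    0     0          0         0     1000      10    0    0    0     0       0          0
  eth0:49000000  250000    0    0   94     0          0         0  8700000   32000    0    0    0     0       0          0
"""

for iface, counters in parse_proc_net_dev(SAMPLE).items():
    print(iface, counters)
```

On a live machine you would pass in open("/proc/net/dev").read() instead of
SAMPLE, once while the interface works and once after it hangs, and compare
the rx_fifo/tx_fifo values.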
> o RH 7.2 stock (2.4.7)
wait, did this work or not? for me 2.4.7 works fine, 2.4.17 does not.
> If you do a /etc/init.d/network restart (or ifconfig eth0 down ;
> ifconfig eth0 ... up), the card works again.
yep, same thing here... I was going to try the 2.4.7 3c59x.c with an
otherwise recent kernel but have not gotten around to it yet...
* Re: 3c59x driver: card not responding after a while
2002-05-31 22:07 ` Scott McDermott
@ 2002-06-01 7:10 ` Zwane Mwaikambo
0 siblings, 0 replies; 10+ messages in thread
From: Zwane Mwaikambo @ 2002-06-01 7:10 UTC
To: Scott McDermott; +Cc: linux-kernel
hmm i can't say i'm experiencing any problems, perhaps this might help;
On Fri, 31 May 2002, Scott McDermott wrote:
> Ronny T. Lampert (EED) on Fri 31/05 17:34 +0200:
> > I'm having (reproducable) problems with the 3c59x driver; after a
> > while (depends on card/traffic), the card doesn't send nor receive
> > anymore.
>
> - are you using netfilter?
Yep
> - it degrades over time and only after quite a bit of data pumped
> through it does it hang right? and slowly decreasing throughput right?
Stays consistent, 60+ day uptime.
> - I feel much better to know someone else has this bug! I thought sure I
> was crazy since I did not hear of this problem from anyone else and
> 905B is very common card.
You might have to plead insanity ;)
> > o RH 7.2 stock (2.4.7)
>
> wait this worked or didn't? for me 2.4.7 works fine, 2.4.17 does not.
iirc stock RH7.2 and RH7.3 worked fine.
> yep, same thing here...I was to try 2.4.7 3c59x.c with otherwise recent
> kernel but have not got around to this yet...
I'm running 2.4.18-pre7-ac1, 2.4.19-pre8-ac(can't recall), all the boxes
transfer 2G+ of data a day (home network). IIRC i've been using those
cards since 2.4.2ish almost without skipping any version updates.
Cards are 3c905B on i440BX, SIS5595 and ServerWorks CNB20LE
I hope this information is of some use to you.
Regards,
Zwane
--
http://function.linuxpower.ca
* Re: 3c59x driver: card not responding after a while
2002-05-31 15:34 3c59x driver: card not responding after a while Ronny T. Lampert (EED)
2002-05-31 22:07 ` Scott McDermott
@ 2002-06-03 7:59 ` Helge Hafting
2002-06-03 8:51 ` Zwane Mwaikambo
2002-06-03 13:11 ` Joe Korty
1 sibling, 2 replies; 10+ messages in thread
From: Helge Hafting @ 2002-06-03 7:59 UTC
To: Ronny T. Lampert (EED), linux-kernel
"Ronny T. Lampert (EED)" wrote:
>
> Hello,
>
> I'm having (reproducable) problems with the 3c59x driver; after a while
> (depends on card/traffic), the card doesn't send nor receive anymore.
>
I see this too. I always thought it was the less-than-perfect ABIT BP6
losing an IRQ or something (odd that it is _always_ the NIC that goes,
though...). I also have a K6 with the same NIC, and another
UP machine at work. They never fail this way.
Could it be an SMP problem?
> If you do a /etc/init.d/network restart (or ifconfig eth0 down ;
> ifconfig eth0 ... up), the card works again.
That never helped me - "shutdown -r now" is my way to recovery.
The card driver is compiled in, perhaps I should try
making it modular and unload/reload.
Helge Hafting
* Re: 3c59x driver: card not responding after a while
2002-06-03 7:59 ` Helge Hafting
@ 2002-06-03 8:51 ` Zwane Mwaikambo
2002-06-03 12:57 ` john slee
2002-06-03 13:11 ` Joe Korty
1 sibling, 1 reply; 10+ messages in thread
From: Zwane Mwaikambo @ 2002-06-03 8:51 UTC
To: Helge Hafting; +Cc: Ronny T. Lampert (EED), linux-kernel
On Mon, 3 Jun 2002, Helge Hafting wrote:
> I see this too. I always thought it was the less-than-perfect ABIT BP6
> loosing an irq or something. (odd that it _always_ is the NIC that goes
> though...) I also have a k6 with the same NIC, and another
> UP machine at work. They never fail this way.
> Could it be a SMP problem?
I wouldn't think so, i use it on SMP extensively without a hitch.
Cheers,
Zwane
--
http://function.linuxpower.ca
* Re: 3c59x driver: card not responding after a while
2002-06-03 8:51 ` Zwane Mwaikambo
@ 2002-06-03 12:57 ` john slee
2002-06-03 20:12 ` Andrew Morton
0 siblings, 1 reply; 10+ messages in thread
From: john slee @ 2002-06-03 12:57 UTC
To: Zwane Mwaikambo; +Cc: Helge Hafting, Ronny T. Lampert (EED), linux-kernel
On Mon, Jun 03, 2002 at 10:51:34AM +0200, Zwane Mwaikambo wrote:
> On Mon, 3 Jun 2002, Helge Hafting wrote:
>
> > I see this too. I always thought it was the less-than-perfect ABIT BP6
> > loosing an irq or something. (odd that it _always_ is the NIC that goes
> > though...) I also have a k6 with the same NIC, and another
> > UP machine at work. They never fail this way.
> > Could it be a SMP problem?
>
> I wouldn't think so, i use it on SMP extensively without a hitch.
"me too" - have been using 3c905B cards in various SMP (and UP) boxes
for a couple of years now and they've never failed me, even on bp6. in
fact i seem to have missed out on the plague of bp6 problems, even when
running dual 300a overclocked to 450. strange.
j.
--
toyota power: http://indigoid.net/
* Re: 3c59x driver: card not responding after a while
2002-06-03 12:57 ` john slee
@ 2002-06-03 20:12 ` Andrew Morton
2002-06-03 23:35 ` François Cami
0 siblings, 1 reply; 10+ messages in thread
From: Andrew Morton @ 2002-06-03 20:12 UTC
To: john slee
Cc: Zwane Mwaikambo, Helge Hafting, Ronny T. Lampert (EED),
linux-kernel
john slee wrote:
>
> On Mon, Jun 03, 2002 at 10:51:34AM +0200, Zwane Mwaikambo wrote:
> > On Mon, 3 Jun 2002, Helge Hafting wrote:
> >
> > > I see this too. I always thought it was the less-than-perfect ABIT BP6
> > > loosing an irq or something. (odd that it _always_ is the NIC that goes
> > > though...) I also have a k6 with the same NIC, and another
> > > UP machine at work. They never fail this way.
> > > Could it be a SMP problem?
> >
> > I wouldn't think so, i use it on SMP extensively without a hitch.
>
> "me too" - have been using 3c905B cards in various SMP (and UP) boxes
> for a couple of years now and they've never failed me, even on bp6. in
> fact i seem to have missed out on the plague of bp6 problems, even when
> running dual 300a overclocked to 450. strange.
>
That driver is solid for SMP. It's possible that the BP6
is losing its IRQ routing assignments, or the APIC is
getting stuck. We had extensive problems with that last
year. A workaround was implemented and as far as I can tell,
the problem went away.
It seems to affect network cards most because they typically
generate the most interrupts.
Try booting the machine with the `noapic' option.
-
* Re: 3c59x driver: card not responding after a while
2002-06-03 20:12 ` Andrew Morton
@ 2002-06-03 23:35 ` François Cami
2002-06-05 8:11 ` Ronny T. Lampert (EED)
0 siblings, 1 reply; 10+ messages in thread
From: François Cami @ 2002-06-03 23:35 UTC
To: Andrew Morton
Cc: john slee, Zwane Mwaikambo, Helge Hafting, Ronny T. Lampert (EED),
linux-kernel
Andrew Morton wrote:
> john slee wrote:
>
>>On Mon, Jun 03, 2002 at 10:51:34AM +0200, Zwane Mwaikambo wrote:
>>
>>>On Mon, 3 Jun 2002, Helge Hafting wrote:
>>>
>>>
>>>>I see this too. I always thought it was the less-than-perfect ABIT BP6
>>>>loosing an irq or something. (odd that it _always_ is the NIC that goes
>>>>though...) I also have a k6 with the same NIC, and another
>>>>UP machine at work. They never fail this way.
>>>>Could it be a SMP problem?
>>>
>>>I wouldn't think so, i use it on SMP extensively without a hitch.
>>
>>"me too" - have been using 3c905B cards in various SMP (and UP) boxes
>>for a couple of years now and they've never failed me, even on bp6. in
>>fact i seem to have missed out on the plague of bp6 problems, even when
>>running dual 300a overclocked to 450. strange.
>>
>
>
> That driver is solid for SMP. It's possible that the BP6
> is losing its IRQ routing assignments, or the APIC is
> getting stuck. We had extensive problems with that last
> year. A workaround was implemented and as far as I can tell,
> the problem went away.
>
> It seems to affect network cards most because they typically
> generate the most interrupts.
>
> Try booting the machine with the `noapic' option.
If I remember correctly, back in January we (we being Andrew Morton
and myself) talked about my 3C905C-TX being unwilling to share an
interrupt with my SBLive!...
I've spotted the same problem again, this time when sharing an interrupt
between my 3C905C-TX and an Intel i82559 10/100 ethernet
controller (kernel is 2.4.19pre7).
The situation is a bit different: using the 3C905C with the
SBLive! resulted in a total lock-up of the machine, whereas using both
network cards usually generates network timeouts and weird errors.
Putting the cards in separate PCI slots (meaning different IRQ lines)
solved the problem (the same solution as for the 3C905 and the
SBLive!). All 3 cards are now working perfectly, at the same time.
Motherboard is MSI BXMaster (i440BX).
--
F. CAMI
----------------------------------------------------------
"To disable the Internet to save EMI and Disney is the
moral equivalent of burning down the library of Alexandria
to ensure the livelihood of monastic scribes."
- John Ippolito (Guggenheim)
* Re: 3c59x driver: card not responding after a while
2002-06-03 23:35 ` François Cami
@ 2002-06-05 8:11 ` Ronny T. Lampert (EED)
0 siblings, 0 replies; 10+ messages in thread
From: Ronny T. Lampert (EED) @ 2002-06-05 8:11 UTC
To: François Cami
Cc: Andrew Morton, john slee, Zwane Mwaikambo, Helge Hafting,
linux-kernel
Hi,
> > That driver is solid for SMP. It's possible that the BP6
> > is losing its IRQ routing assignments, or the APIC is
> > getting stuck. We had extensive problems with that last
...
> > It seems to affect network cards most because they typically
> > generate the most interrupts.
> > Try booting the machine with the `noapic' option.
Even on a UP, non-MP-capable K6? IMHO this won't help (at least not in
my case).
> and myself) talked about my 3C905C-TX not willing to share an
> interrupt with my SBLive!...
> I've spotted the same problem, this time sharing an interrupt
> between my 3C905C-TX and an Intel i82559 10/100 ethernet
> controller (kernel is 2.4.19pre7).
I forgot to mention that there is only the 3c905 and a gfxcard (AGP) in
the box, no interrupts shared - I THOUGHT!
Then a quick lspci -v revealed both sharing IRQ 11. Will try to reassign
IRQs.
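
In case it helps others check for the same thing, here is a rough Python
sketch that groups devices by IRQ from `lspci -v` output. It assumes each
device starts with an unindented "bus:slot.func" line and that the IRQ appears
on the indented "Flags:" line, as in my first mail; newer lspci versions may
print things differently. The function names and the sample devices other
than the 3c905C are hypothetical.

```python
import re
from collections import defaultdict

# Unindented device header, e.g. "00:0c.0 Ethernet controller: ..."
DEVICE_RE = re.compile(r"^([0-9a-f]{2}:[0-9a-f]{2}\.[0-9a-f]) ")
IRQ_RE = re.compile(r"\bIRQ (\d+)")


def devices_by_irq(lspci_text):
    """Map IRQ number -> list of PCI addresses, from `lspci -v` text."""
    by_irq = defaultdict(list)
    current = None
    for line in lspci_text.splitlines():
        header = DEVICE_RE.match(line)
        if header:
            current = header.group(1)
            continue
        # Only trust the IRQ on the Flags: line of the current device.
        if current is not None and "Flags:" in line:
            irq = IRQ_RE.search(line)
            if irq:
                by_irq[int(irq.group(1))].append(current)
    return dict(by_irq)


def shared_irqs(lspci_text):
    """Return only the IRQs that more than one device reports."""
    return {irq: devs
            for irq, devs in devices_by_irq(lspci_text).items()
            if len(devs) > 1}


# Illustrative input: the NIC lines follow my lspci output, the other
# two devices are invented for the example.
SAMPLE = """\
00:0c.0 Ethernet controller: 3Com Corporation 3c905C-TX [Fast Etherlink] (rev 74)
\tFlags: bus master, medium devsel, latency 64, IRQ 11
01:00.0 VGA compatible controller: Hypothetical AGP card
\tFlags: bus master, medium devsel, latency 64, IRQ 11
00:07.2 USB Controller: Hypothetical USB host
\tFlags: bus master, medium devsel, latency 64, IRQ 5
"""

print(shared_irqs(SAMPLE))
```

Feeding it the real `lspci -v` text (for example captured with
subprocess.run(["lspci", "-v"], capture_output=True, text=True).stdout)
should list any IRQ that more than one device reports.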
Thank you! :-)
Hope this is solved.
--
Ronny T. Lampert email: Ronny.Lampert@eed.ericsson.se
System Administrator voice: +49 911 255 1214
Ericsson Eurolab Deutschland fax: +49 911 255 1960
Nuernberg, Germany
* Re: 3c59x driver: card not responding after a while
2002-06-03 7:59 ` Helge Hafting
2002-06-03 8:51 ` Zwane Mwaikambo
@ 2002-06-03 13:11 ` Joe Korty
1 sibling, 0 replies; 10+ messages in thread
From: Joe Korty @ 2002-06-03 13:11 UTC
To: Helge Hafting, Ronny T. Lampert (EED), linux-kernel
At 09:59 AM 6/3/02 +0200, Helge Hafting wrote:
>"Ronny T. Lampert (EED)" wrote:
>
> > I'm having (reproducable) problems with the 3c59x driver; after a while
> > (depends on card/traffic), the card doesn't send nor receive anymore.
> >
>I see this too. I always thought it was the less-than-perfect ABIT BP6
>loosing an irq or something. (odd that it _always_ is the NIC that goes
>though...) I also have a k6 with the same NIC, and another
>UP machine at work. They never fail this way.
>Could it be a SMP problem?
Try increasing max_interrupt_work from 32 to 128 (located in
drivers/net/3c59x.c).
Joe