* r8169, 2.6.2-rc2, Sager 4780 laptop
From: bhartin @ 2004-01-26 21:19 UTC
To: netdev
(This was meant for the author of the patches located at
http://www.fr.zoreil.com/linux/kernel/2.6.x/2.6.2-rc1/, who asks that
requests for help with these patches be CCed to the netdev list.
Unfortunately, the author only asked for a CC to this list and didn't
include his own email address.)
I've been looking through your patches as a source for help with my new
laptop. It's a Sager 4780, which uses an RTL-8169 on board. It is an
internal PCI version.
Under kernel 2.4.22, this NIC works perfectly.
Under 2.6.1 and 2.6.2-rc2, I have the following situation.
When I scp a file onto the laptop across the NIC, I get about
4.5Mbyte/sec, and once it reaches about 13MByte transferred, the NIC
ceases to function. No oops messages, no errors. You can't ping in or
out of the interface any longer.
When you scp a file from the laptop to elsewhere, it transfers at about
700kbyte/sec, and once it reaches a little over 6MByte, the entire system
locks up, hard.
This has been reproduced on two completely different networks with different
hardware, to rule out a network-side cause.
I have tried your 2.6.2-rc1 patches against 2.6.2-rc2 (they apply
cleanly). The driver behaves the same as before.
When I apply your 2.6.1 patches against 2.6.1, I get varying results.
I tried a few combinations (following your order as listed). The most
notable effect comes from r8169-init-one.patch. When I apply
that, insmod or modprobe locks up when inserting the module, and the
process cannot be killed at that point. I even tried applying all the patches
except that one and the ethtool patch (which won't apply cleanly without the
init-one patch applied). When I did this, I started getting messages
about "eth0: Too much work at interrupt!" Once this occurred, I rebooted
back into a stable 2.4.22 kernel. The NIC wasn't responding. I even
tried booting into WinXP, but the NIC still wouldn't respond. I was
finally forced to remove the battery from the laptop (not meant to be a
user task, as it is under a screwed-down cover). Once this was done, the
NIC finally reset properly and I was able to continue.
I attempted a similar run with the 2.6.2-rc1 patches, removing patches one
at a time, starting with the last one on the list.
Attached is a tar.gz containing the outputs you request in your README,
plus 'lspci -v' output and my .config for the kernel. This data was
collected from a fresh 2.6.2-rc2 bootup, using your full set of 2.6.2-rc1
patches, manually doing a 'modprobe r8169' once booted up. The system is
running a P4 with HT enabled, with an SMP kernel.
I'm not at all familiar with kernel debugging, but I should be able to
carry out any tasks needed, given some minor instructions.
Please CC replies to my email address, as I'm not part of this mailing
list.
Thanks,
Bradley Hartin - bhartin@straus-frank.com
Communications and Network Administrator
Straus-Frank Company
[-- Attachment #2: Type: APPLICATION/octet-stream, Size: 14383 bytes --]
* Re: r8169, 2.6.2-rc2, Sager 4780 laptop
From: Francois Romieu @ 2004-01-26 22:55 UTC
To: bhartin; +Cc: netdev
bhartin@straus-frank.com <bhartin@straus-frank.com> :
[...]
> Attached is a tar.gz containing the outputs you request in your README,
> plus 'lspci -v' output and my .config for the kernel. This data was
> collected from a fresh 2.6.2-rc2 bootup, using your full set of 2.6.2-rc1
> patches, manually doing a 'modprobe r8169' once booted up. The system is
> running a P4 with HT enabled, with an SMP kernel.
There has been a report of non-regression on a non-SMP kernel with the following
patches applied (2.6.2-rc1 series):
r8169-tx-index-overflow.patch
r8169-dma-api-tx.patch
r8169-dma-api-rx-buffers.patch
r8169-dma-api-tx-buffers.patch
r8169-rx_copybreak.patch
r8169-mac-phy-version.patch
r8169-init_one.patch
r8169-timer.patch
r8169-hw_start.patch
r8169-intr_mask.patch
r8169-suspend.patch
r8169-endianness.patch
r8169-getstats.patch
Can you confirm that the driver behaves the same as the standard driver
with a non-SMP enabled kernel ?
Does it make a difference if you give an 'acpi=off' option at boot time ?
r8169-addr-high.patch is not doing its job on amd64 so it is not suggested
to use it at all.
There is something broken wrt SMP and the r8169 patches: do not use both at
the same time. I still have to find what happens here.
--
Ueimor
* Re: r8169, 2.6.2-rc2, Sager 4780 laptop
From: bhartin @ 2004-01-27 6:51 UTC
To: Francois Romieu; +Cc: netdev
Okay, I patched using only the patches listed below. These are actually
applied against 2.6.2-rc2, since I don't have 2.6.2-rc1 on hand at the
moment, and your patches still apply cleanly.
acpi=off
- RX - Pushed about a gig, averaging 8.2MB/sec. Appeared to work ok.
- TX - Once I hit 642MB transferred (avg 8.9MB/sec), the system locked up
hard.
SMP off, acpi on
- TX - Hard lock at 307MB.
SMP off, acpi off
- TX - Hard lock at 345MB.
I'll try to investigate further tomorrow. I have to go out of town
starting Thursday, and was hoping to have everything working by then.
Unfortunately, I have other problems to deal with as well (ati-drivers,
slmodem, etc).
Thanks
On Mon, 26 Jan 2004, Francois Romieu wrote:
> bhartin@straus-frank.com <bhartin@straus-frank.com> :
> [...]
> > Attached is a tar.gz containing the outputs you request in your README,
> > plus 'lspci -v' output and my .config for the kernel. This data was
> > collected from a fresh 2.6.2-rc2 bootup, using your full set of 2.6.2-rc1
> > patches, manually doing a 'modprobe r8169' once booted up. The system is
> > running a P4 with HT enabled, with an SMP kernel.
>
> There has been a report of non-regression on a non-SMP kernel with the following
> patches applied (2.6.2-rc1 series):
> r8169-tx-index-overflow.patch
> r8169-dma-api-tx.patch
> r8169-dma-api-rx-buffers.patch
> r8169-dma-api-tx-buffers.patch
> r8169-rx_copybreak.patch
> r8169-mac-phy-version.patch
> r8169-init_one.patch
> r8169-timer.patch
> r8169-hw_start.patch
> r8169-intr_mask.patch
> r8169-suspend.patch
> r8169-endianness.patch
> r8169-getstats.patch
>
> Can you confirm that the driver behaves the same as the standard driver
> with a non-SMP enabled kernel ?
>
> Does it make a difference if you give an 'acpi=off' option at boot time ?
>
> r8169-addr-high.patch is not doing its job on amd64 so it is not suggested
> to use it at all.
>
> There is something broken wrt SMP and the r8169 patches: do not use both at
> the same time. I still have to find what happens here.
>
> --
> Ueimor
>
Bradley Hartin - bhartin@straus-frank.com
Communications and Network Administrator
Straus-Frank Company
* Re: r8169, 2.6.2-rc2, Sager 4780 laptop
From: bhartin @ 2004-04-04 21:05 UTC
To: Francois Romieu; +Cc: netdev
Francois:
I don't remember if I ever did reply to you last time concerning this.
Since I emailed you, I've tried many, many versions of the 2.6.x series
kernels and superpatch variants. The only change was with today's release
of 2.6.5. I now get the message "eth0: Too much work at interrupt!" prior
to the interface locking up.
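For readers unfamiliar with the "Too much work at interrupt!" message: warnings of this kind are typically printed by Linux network drivers of this era when the interrupt handler uses up a fixed per-invocation work budget and the device still has events pending. The following is a minimal userspace sketch of that general pattern, not the actual r8169 code. The names `sim_nic`, `sim_interrupt`, and `SIM_MAX_INTERRUPT_WORK`, and the budget value, are hypothetical and chosen only for illustration.

```c
#include <assert.h>
#include <stdbool.h>

#define SIM_MAX_INTERRUPT_WORK 20 /* hypothetical per-interrupt budget */

/* Hypothetical device model: number of events still waiting for service. */
struct sim_nic {
    unsigned int pending_events;
};

/*
 * Service pending events, but never more than the budget in one call.
 * Returns true if the budget ran out with work still pending, which is
 * the condition under which a driver written in this style would log
 * a "Too much work at interrupt!" warning.
 */
static bool sim_interrupt(struct sim_nic *nic)
{
    int budget = SIM_MAX_INTERRUPT_WORK;

    while (nic->pending_events > 0 && budget > 0) {
        nic->pending_events--; /* handle one event */
        budget--;
    }
    return nic->pending_events > 0;
}
```

Calling `sim_interrupt()` on a device with 5 pending events drains it and returns false. With 50 pending events it stops after 20 and returns true, which corresponds to the warning case. A device that keeps raising events faster than they are serviced would hit this limit on every interrupt.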
I still have absolutely no problems with the 2.4.x kernels with respect to
the networking (I have video/AGP issues with it though, but that's another
story).
Trying the same kernel with SMP and ACPI disabled: Rather than the
interface locking up (the NIC failing to transmit or receive), the entire
system locks up hard.
All of these tests are done on a 100Mbit network, both on an unmanaged
Linksys switch and on a Cisco 2948G, with the same results. The test is
done by doing an scp to copy a remote file to the local filesystem. The
lockups occur anywhere from 45MBytes to 95MBytes transferred.
I haven't tried your latest set of patches because I haven't been able to
find the URL for them again (I got carried away cleaning up my bookmarks
recently). Let me know if there are any newer ones you'd like for me to
try.
Thanks,
Bradley Hartin - bhartin@straus-frank.com
Communications and Network Administrator
Straus-Frank Company
* Re: r8169, 2.6.2-rc2, Sager 4780 laptop
From: Francois Romieu @ 2004-04-04 22:20 UTC
To: bhartin; +Cc: netdev
bhartin@straus-frank.com <bhartin@straus-frank.com> :
[...]
> I haven't tried your latest set of patches because I haven't been able to
> find the URL for them again (I got carried away cleaning up my bookmarks
> recently). Let me know if there are any newer ones you'd like for me to
> try.
http://www.fr.zoreil.com/people/francois/misc/20040329-2.6.5-rc2-r8169.c-test.patch
It removes a possible loop in the Tx irq handler and should save some cpu
cycles as well. Apply against 2.6.5.
--
Ueimor
* Re: r8169, 2.6.2-rc2, Sager 4780 laptop
From: bhartin @ 2004-04-04 23:57 UTC
To: Francois Romieu; +Cc: netdev
Here are the results using this new patch:
No SMP, no ACPI: Appears to work well--transferred about 8 gigs okay
With SMP, with ACPI: Interface locks up after a few MB
With SMP, no ACPI: Interface locks up after a few MB
No SMP, with ACPI: Appears to work well--transferred about 8 gigs okay
My laptop is *finally* reaching full usability! Thanks!
I can live without SMP--this is just a single P4/HT, so I'm not losing
much.
Thanks again. If there is any debugging info you need towards fixing the
SMP compatibility issue, just let me know, but I'm not in a hurry for
it.
Thanks,
Bradley Hartin - bhartin@straus-frank.com
Communications and Network Administrator
Straus-Frank Company
* Re: r8169, 2.6.2-rc2, Sager 4780 laptop
From: bhartin @ 2004-04-06 23:03 UTC
To: Francois Romieu; +Cc: netdev
On Wed, 7 Apr 2004, Francois Romieu wrote:
> Hmmm... Something like "no PHY Reset ack. Giving up." I guess. Ok, I'll
> add it to the queue, the locking between the phy_timer() and the irq handler
> looks strange.
I got my laptop back up and found the messages logged:
kernel: r8169: eth0: Reset RTL8169s PHY
I didn't see exactly at what point this started happening during the
shutdown, but the message repeated about 500 times before the system
logger was stopped.
Again, I've had no issues so far during normal use. There are no abnormal
timeouts, and transfer rates appear within normal ranges. (I've only used
it on 100bT switches so far, no 10bT or gigabit.)
> As a general note, do not hesitate to post on/Cc: netdev as it can be of
> interest to other people as well.
I meant to, but must have hit no when Pine asked about replying to all
recipients.
> Thanks for the report.
Not a problem at all. I'm just glad to finally get this thing working!
Bradley Hartin - bhartin@straus-frank.com
Communications and Network Administrator
Straus-Frank Company
* Re: r8169, 2.6.2-rc2, Sager 4780 laptop
From: Francois Romieu @ 2004-04-06 23:38 UTC
To: bhartin; +Cc: netdev
bhartin@straus-frank.com <bhartin@straus-frank.com> :
[...]
> I got my laptop back up and found the messages logged:
>
> kernel: r8169: eth0: Reset RTL8169s PHY
>
> I didn't see exactly at what point this started happening during the
> shutdown, but the message repeated about 500 times before the system
> logger was stopped.
It implies a minimum of 6000 invocations of rtl8169_phy_timer().
*doh*
Please apply patch below.
mod_timer() expects an absolute time, not a relative offset.
drivers/net/r8169.c | 2 +-
1 files changed, 1 insertion(+), 1 deletion(-)
diff -puN drivers/net/r8169.c~r8169-mod_timer drivers/net/r8169.c
--- linux-2.6.5/drivers/net/r8169.c~r8169-mod_timer 2004-04-07 01:30:14.000000000 +0200
+++ linux-2.6.5-fr/drivers/net/r8169.c 2004-04-07 01:32:53.000000000 +0200
@@ -617,7 +617,7 @@ static void rtl8169_phy_timer(unsigned l
 		}
 	}
 
-	mod_timer(timer, RTL8169_PHY_TIMEOUT);
+	mod_timer(timer, jiffies + RTL8169_PHY_TIMEOUT);
 }
 
 static inline void rtl8169_delete_timer(struct net_device *dev)
_
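To illustrate why the one-line change matters: mod_timer() takes the absolute tick count at which the timer should fire, so passing a bare timeout value places the expiry in the past once the tick counter has grown beyond that value, and the timer is due again immediately. The following is a small userspace sketch of that behaviour, not kernel code. The names `sim_timer`, `sim_mod_timer`, `sim_timer_due`, `SIM_HZ`, and `SIM_PHY_TIMEOUT` are hypothetical stand-ins, and the tick rate and timeout values are assumptions made for the example.

```c
#include <assert.h>
#include <stdbool.h>

/* Stand-in for the kernel tick counter type (the kernel uses `jiffies`). */
typedef unsigned long ticks_t;

#define SIM_HZ 1000UL                 /* hypothetical tick rate */
#define SIM_PHY_TIMEOUT (10 * SIM_HZ) /* hypothetical timeout, in ticks */

struct sim_timer {
    ticks_t expires; /* absolute tick at which the timer should fire */
};

/* Mimics the mod_timer() contract: `expires` is an absolute tick value. */
static void sim_mod_timer(struct sim_timer *t, ticks_t expires)
{
    t->expires = expires;
}

/*
 * True once `now` has reached the expiry. The signed difference keeps the
 * comparison correct across counter wraparound, in the same spirit as the
 * kernel's time_after_eq() macro.
 */
static bool sim_timer_due(const struct sim_timer *t, ticks_t now)
{
    return (long)(now - t->expires) >= 0;
}
```

With the counter at, say, 5000000 ticks, `sim_mod_timer(&t, SIM_PHY_TIMEOUT)` leaves the timer already due, which would make the handler run again right away. `sim_mod_timer(&t, now + SIM_PHY_TIMEOUT)` leaves it pending until the full timeout has elapsed, which is what the patched line does with `jiffies + RTL8169_PHY_TIMEOUT`.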
* Re: r8169, 2.6.2-rc2, Sager 4780 laptop
From: bhartin @ 2004-04-07 5:15 UTC
To: Francois Romieu; +Cc: netdev
The patch resolved the PHY messages, and I stress-tested the networking
again--it's still stable.
Thanks!
Bradley Hartin - bhartin@straus-frank.com
Communications and Network Administrator
Straus-Frank Company
* Re: r8169, 2.6.2-rc2, Sager 4780 laptop
From: Francois Romieu @ 2004-04-07 9:26 UTC
To: bhartin; +Cc: netdev
bhartin@straus-frank.com <bhartin@straus-frank.com> :
[...]
> The patch resolved the PHY messages, and I stress-tested the networking
> again--it's still stable.
Does it allow the link to be recovered if the cable is removed/plugged ?
--
Ueimor
* Re: r8169, 2.6.2-rc2, Sager 4780 laptop
From: bhartin @ 2004-04-07 13:43 UTC
To: Francois Romieu; +Cc: netdev
I started a ping, and unplugged the network cable. I got the 'r8169:
eth0: Reset RTL8169s PHY' message about once every 11 to 15 seconds. Once
plugged back in, the connection recovered immediately. I repeated this
several times. So, it's still generating the messages, but they seem to
be harmless (at least for the few minutes I was testing this).
Bradley Hartin - bhartin@straus-frank.com
Communications and Network Administrator
Straus-Frank Company