* tg3 adapter losing link - PM related? @ 2013-05-03 9:28 Nikola Ciprich 2013-05-03 15:13 ` Nithin Nayak Sujir 0 siblings, 1 reply; 6+ messages in thread From: Nikola Ciprich @ 2013-05-03 9:28 UTC (permalink / raw) To: netdev [-- Attachment #1: Type: text/plain, Size: 2910 bytes --] Hello, I'd like to ask about trouble I've got with new HP server: tg3 adapter keeps losing link every few minutes: Aug 3 03:58:02 atlovav1a kernel: [616741.147598] tg3 0000:03:00.0: eth0: Link is down Aug 3 03:58:04 atlovav1a kernel: [616743.943456] tg3 0000:03:00.0: eth0: Link is up at 100 Mbps, full duplex Aug 3 03:58:04 atlovav1a kernel: [616743.943598] tg3 0000:03:00.0: eth0: Flow control is on for TX and on for RX Aug 3 03:58:04 atlovav1a kernel: [616743.943736] tg3 0000:03:00.0: eth0: EEE is enabled Aug 3 04:14:29 atlovav1a kernel: [617727.980487] tg3 0000:03:00.0: eth0: Link is down Aug 3 04:14:32 atlovav1a kernel: [617730.847245] tg3 0000:03:00.0: eth0: Link is up at 100 Mbps, full duplex Aug 3 04:14:32 atlovav1a kernel: [617730.847387] tg3 0000:03:00.0: eth0: Flow control is on for TX and on for RX Aug 3 04:14:32 atlovav1a kernel: [617730.847525] tg3 0000:03:00.0: eth0: EEE is enabled Aug 3 06:47:13 atlovav1a kernel: [626885.452974] tg3 0000:03:00.0: eth0: Link is down Aug 3 06:47:15 atlovav1a kernel: [626888.218702] tg3 0000:03:00.0: eth0: Link is up at 100 Mbps, full duplex Aug 3 06:47:15 atlovav1a kernel: [626888.218844] tg3 0000:03:00.0: eth0: Flow control is on for TX and on for RX Aug 3 06:47:15 atlovav1a kernel: [626888.218982] tg3 0000:03:00.0: eth0: EEE is enabled Aug 3 06:51:44 atlovav1a kernel: [627156.293386] tg3 0000:03:00.0: eth0: Link is down Aug 3 06:51:46 atlovav1a kernel: [627159.123347] tg3 0000:03:00.0: eth0: Link is up at 100 Mbps, full duplex Aug 3 06:51:46 atlovav1a kernel: [627159.123491] tg3 0000:03:00.0: eth0: Flow control is on for TX and on for RX Aug 3 06:51:46 atlovav1a kernel: [627159.123629] tg3 0000:03:00.0: eth0: EEE is enabled Aug 3 07:13:10 atlovav1a kernel: [628441.722197] tg3 0000:03:00.0: eth0: Link is down Aug 3 07:13:13 atlovav1a kernel: [628444.615548] tg3 0000:03:00.0: eth0: Link is up at 100 Mbps, full duplex Aug 3 07:13:13 atlovav1a kernel: [628444.615690] tg3 0000:03:00.0: eth0: Flow control is on for TX and on for RX Aug 3 07:13:13 atlovav1a kernel: [628444.615828] tg3 0000:03:00.0: eth0: EEE is enabled I can't exclude the possibility it's switch problem, but I don't have access to box now and the switch is not managed, so I'd like to try other ways first.. Could this somehow be power management related? I don't see what else PM-related could I disable in the bios to disable this EEE stuff.. Didn't anyone met similar problem? the system is running Centos6 with x86_64 3.0.76 kernel thanks in advance for any reply BR nik -- ------------------------------------- Ing. Nikola CIPRICH LinuxBox.cz, s.r.o. 28.rijna 168, 709 00 Ostrava tel.: +420 591 166 214 fax: +420 596 621 273 mobil: +420 777 093 799 www.linuxbox.cz mobil servis: +420 737 238 656 email servis: servis@linuxbox.cz ------------------------------------- [-- Attachment #2: Type: application/pgp-signature, Size: 198 bytes --] ^ permalink raw reply [flat|nested] 6+ messages in thread
* Re: tg3 adapter losing link - PM related? 2013-05-03 9:28 tg3 adapter losing link - PM related? Nikola Ciprich @ 2013-05-03 15:13 ` Nithin Nayak Sujir 2013-05-03 18:39 ` Ben Hutchings ` (2 more replies) 0 siblings, 3 replies; 6+ messages in thread From: Nithin Nayak Sujir @ 2013-05-03 15:13 UTC (permalink / raw) To: Nikola Ciprich; +Cc: netdev Hi Nikola, 1. What device is present on this server? Can you give the tg3 messages in /var/log/messages? Can you also give the output of "ethtool -i <iface>"? 2. Is it possible for you to try the latest 3.9 upstream kernel? 3. Any reason why the link is at 100Mb? The switch does not support gig? What switch is it? 4. I don't think you can turn off EEE in the bios, but you can try turning autoneg off. Try "ethtool -s <iface> speed 100 duplex full autoneg off". It's not exactly the same thing since the device still has EEE enabled but not negotiated. Nithin. On 5/3/2013 2:28 AM, Nikola Ciprich wrote: > Hello, > > I'd like to ask about trouble I've got with new HP server: tg3 adapter keeps losing > link every few minutes: > > Aug 3 03:58:02 atlovav1a kernel: [616741.147598] tg3 0000:03:00.0: eth0: Link is down > Aug 3 03:58:04 atlovav1a kernel: [616743.943456] tg3 0000:03:00.0: eth0: Link is up at 100 Mbps, full duplex > Aug 3 03:58:04 atlovav1a kernel: [616743.943598] tg3 0000:03:00.0: eth0: Flow control is on for TX and on for RX > Aug 3 03:58:04 atlovav1a kernel: [616743.943736] tg3 0000:03:00.0: eth0: EEE is enabled > Aug 3 04:14:29 atlovav1a kernel: [617727.980487] tg3 0000:03:00.0: eth0: Link is down > Aug 3 04:14:32 atlovav1a kernel: [617730.847245] tg3 0000:03:00.0: eth0: Link is up at 100 Mbps, full duplex > Aug 3 04:14:32 atlovav1a kernel: [617730.847387] tg3 0000:03:00.0: eth0: Flow control is on for TX and on for RX > Aug 3 04:14:32 atlovav1a kernel: [617730.847525] tg3 0000:03:00.0: eth0: EEE is enabled > Aug 3 06:47:13 atlovav1a kernel: [626885.452974] tg3 0000:03:00.0: eth0: Link is down > Aug 3 06:47:15 atlovav1a kernel: [626888.218702] tg3 0000:03:00.0: eth0: Link is up at 100 Mbps, full duplex > Aug 3 06:47:15 atlovav1a kernel: [626888.218844] tg3 0000:03:00.0: eth0: Flow control is on for TX and on for RX > Aug 3 06:47:15 atlovav1a kernel: [626888.218982] tg3 0000:03:00.0: eth0: EEE is enabled > Aug 3 06:51:44 atlovav1a kernel: [627156.293386] tg3 0000:03:00.0: eth0: Link is down > Aug 3 06:51:46 atlovav1a kernel: [627159.123347] tg3 0000:03:00.0: eth0: Link is up at 100 Mbps, full duplex > Aug 3 06:51:46 atlovav1a kernel: [627159.123491] tg3 0000:03:00.0: eth0: Flow control is on for TX and on for RX > Aug 3 06:51:46 atlovav1a kernel: [627159.123629] tg3 0000:03:00.0: eth0: EEE is enabled > Aug 3 07:13:10 atlovav1a kernel: [628441.722197] tg3 0000:03:00.0: eth0: Link is down > Aug 3 07:13:13 atlovav1a kernel: [628444.615548] tg3 0000:03:00.0: eth0: Link is up at 100 Mbps, full duplex > Aug 3 07:13:13 atlovav1a kernel: [628444.615690] tg3 0000:03:00.0: eth0: Flow control is on for TX and on for RX > Aug 3 07:13:13 atlovav1a kernel: [628444.615828] tg3 0000:03:00.0: eth0: EEE is enabled > > I can't exclude the possibility it's switch problem, but I don't have access to box now and the > switch is not managed, so I'd like to try other ways first.. > > Could this somehow be power management related? I don't see what else PM-related could I disable > in the bios to disable this EEE stuff.. Didn't anyone met similar problem? > > the system is running Centos6 with x86_64 3.0.76 kernel > > thanks in advance for any reply > > BR > > nik > > ^ permalink raw reply [flat|nested] 6+ messages in thread
* Re: tg3 adapter losing link - PM related? 2013-05-03 15:13 ` Nithin Nayak Sujir @ 2013-05-03 18:39 ` Ben Hutchings 2013-05-03 18:48 ` Nithin Nayak Sujir 2013-05-03 19:28 ` Nikola Ciprich 2013-05-04 6:45 ` Nikola Ciprich 2 siblings, 1 reply; 6+ messages in thread From: Ben Hutchings @ 2013-05-03 18:39 UTC (permalink / raw) To: Nithin Nayak Sujir; +Cc: Nikola Ciprich, netdev On Fri, 2013-05-03 at 08:13 -0700, Nithin Nayak Sujir wrote: > Hi Nikola, > 1. What device is present on this server? Can you give the tg3 messages > in /var/log/messages? Can you also give the output of "ethtool -i <iface>"? > > 2. Is it possible for you to try the latest 3.9 upstream kernel? > > 3. Any reason why the link is at 100Mb? The switch does not support gig? > What switch is it? > > 4. I don't think you can turn off EEE in the bios, but you can try > turning autoneg off. Try "ethtool -s <iface> speed 100 duplex full > autoneg off". It's not exactly the same thing since the device still has > EEE enabled but not negotiated. [...] It should be possible to disable EEE with ethtool. You really should implement the EEE configuration operations in tg3 if you're going to enable it at all. Ben. -- Ben Hutchings, Staff Engineer, Solarflare Not speaking for my employer; that's the marketing department's job. They asked us to note that Solarflare product names are trademarked. ^ permalink raw reply [flat|nested] 6+ messages in thread
* Re: tg3 adapter losing link - PM related? 2013-05-03 18:39 ` Ben Hutchings @ 2013-05-03 18:48 ` Nithin Nayak Sujir 0 siblings, 0 replies; 6+ messages in thread From: Nithin Nayak Sujir @ 2013-05-03 18:48 UTC (permalink / raw) To: Ben Hutchings; +Cc: Nikola Ciprich, netdev On 05/03/2013 11:39 AM, Ben Hutchings wrote: > On Fri, 2013-05-03 at 08:13 -0700, Nithin Nayak Sujir wrote: >> Hi Nikola, >> 1. What device is present on this server? Can you give the tg3 messages >> in /var/log/messages? Can you also give the output of "ethtool -i <iface>"? >> >> 2. Is it possible for you to try the latest 3.9 upstream kernel? >> >> 3. Any reason why the link is at 100Mb? The switch does not support gig? >> What switch is it? >> >> 4. I don't think you can turn off EEE in the bios, but you can try >> turning autoneg off. Try "ethtool -s <iface> speed 100 duplex full >> autoneg off". It's not exactly the same thing since the device still has >> EEE enabled but not negotiated. > [...] > > It should be possible to disable EEE with ethtool. You really should > implement the EEE configuration operations in tg3 if you're going to > enable it at all. > I agree. I plan to have a patch when netdev opens. > Ben. > ^ permalink raw reply [flat|nested] 6+ messages in thread
* Re: tg3 adapter losing link - PM related? 2013-05-03 15:13 ` Nithin Nayak Sujir 2013-05-03 18:39 ` Ben Hutchings @ 2013-05-03 19:28 ` Nikola Ciprich 2013-05-04 6:45 ` Nikola Ciprich 2 siblings, 0 replies; 6+ messages in thread From: Nikola Ciprich @ 2013-05-03 19:28 UTC (permalink / raw) To: Nithin Nayak Sujir; +Cc: netdev [-- Attachment #1: Type: text/plain, Size: 5109 bytes --] Hello Nithin, On Fri, May 03, 2013 at 08:13:15AM -0700, Nithin Nayak Sujir wrote: > Hi Nikola, > 1. What device is present on this server? Can you give the tg3 messages in > /var/log/messages? Can you also give the output of "ethtool -i <iface>"? sure, here it goes: [ +0.273255] tg3.c:v3.119 (May 18, 2011) [ +0.000184] tg3 0000:03:00.0: PCI INT A -> GSI 32 (level, low) -> IRQ 32 [ +0.000175] tg3 0000:03:00.0: setting latency timer to 64 [ +0.034616] tg3 0000:03:00.0: eth0: Tigon3 [partno(none) rev 5719001] (PCI Express) MAC address 2c:76:8a:52:a5:1c [ +0.000255] tg3 0000:03:00.0: eth0: attached PHY is 5719C (10/100/1000Base-T Ethernet) (WireSpeed[1], EEE[1]) [ +0.000251] tg3 0000:03:00.0: eth0: RXcsums[1] LinkChgREG[0] MIirq[0] ASF[1] TSOcap[0] [ +0.000251] tg3 0000:03:00.0: eth0: dma_rwctrl[00000001] dma_mask[64-bit] [root@atlovav1a ~]# ethtool -i eth0 driver: tg3 version: 3.119 firmware-version: 5719-v1.29 NCSI v1.0.88.0 bus-info: 0000:03:00.0 supports-statistics: yes supports-test: yes supports-eeprom-access: yes supports-register-dump: yes supports-priv-flags: no the box is HP ProLiant DL360p, device identifies itself in lspci as Broadcom Corporation NetXtreme BCM5719 > > 2. Is it possible for you to try the latest 3.9 upstream kernel? of course, It's compiling now. I'll report how it behaves. > > 3. Any reason why the link is at 100Mb? The switch does not support gig? > What switch is it? it's some lowcost cisco, not sure whether it's gigabit capable.. I'll find out on monday and report if it's important. > > 4. I don't think you can turn off EEE in the bios, but you can try turning > autoneg off. Try "ethtool -s <iface> speed 100 duplex full autoneg off". > It's not exactly the same thing since the device still has EEE enabled but > not negotiated. forcing speed helped, I no longer see link going down! of course I'll report about 3.9 kernel too.. thanks for Your reply! nik > > Nithin. > > > On 5/3/2013 2:28 AM, Nikola Ciprich wrote: > >Hello, > > > >I'd like to ask about trouble I've got with new HP server: tg3 adapter keeps losing > >link every few minutes: > > > >Aug 3 03:58:02 atlovav1a kernel: [616741.147598] tg3 0000:03:00.0: eth0: Link is down > >Aug 3 03:58:04 atlovav1a kernel: [616743.943456] tg3 0000:03:00.0: eth0: Link is up at 100 Mbps, full duplex > >Aug 3 03:58:04 atlovav1a kernel: [616743.943598] tg3 0000:03:00.0: eth0: Flow control is on for TX and on for RX > >Aug 3 03:58:04 atlovav1a kernel: [616743.943736] tg3 0000:03:00.0: eth0: EEE is enabled > >Aug 3 04:14:29 atlovav1a kernel: [617727.980487] tg3 0000:03:00.0: eth0: Link is down > >Aug 3 04:14:32 atlovav1a kernel: [617730.847245] tg3 0000:03:00.0: eth0: Link is up at 100 Mbps, full duplex > >Aug 3 04:14:32 atlovav1a kernel: [617730.847387] tg3 0000:03:00.0: eth0: Flow control is on for TX and on for RX > >Aug 3 04:14:32 atlovav1a kernel: [617730.847525] tg3 0000:03:00.0: eth0: EEE is enabled > >Aug 3 06:47:13 atlovav1a kernel: [626885.452974] tg3 0000:03:00.0: eth0: Link is down > >Aug 3 06:47:15 atlovav1a kernel: [626888.218702] tg3 0000:03:00.0: eth0: Link is up at 100 Mbps, full duplex > >Aug 3 06:47:15 atlovav1a kernel: [626888.218844] tg3 0000:03:00.0: eth0: Flow control is on for TX and on for RX > >Aug 3 06:47:15 atlovav1a kernel: [626888.218982] tg3 0000:03:00.0: eth0: EEE is enabled > >Aug 3 06:51:44 atlovav1a kernel: [627156.293386] tg3 0000:03:00.0: eth0: Link is down > >Aug 3 06:51:46 atlovav1a kernel: [627159.123347] tg3 0000:03:00.0: eth0: Link is up at 100 Mbps, full duplex > >Aug 3 06:51:46 atlovav1a kernel: [627159.123491] tg3 0000:03:00.0: eth0: Flow control is on for TX and on for RX > >Aug 3 06:51:46 atlovav1a kernel: [627159.123629] tg3 0000:03:00.0: eth0: EEE is enabled > >Aug 3 07:13:10 atlovav1a kernel: [628441.722197] tg3 0000:03:00.0: eth0: Link is down > >Aug 3 07:13:13 atlovav1a kernel: [628444.615548] tg3 0000:03:00.0: eth0: Link is up at 100 Mbps, full duplex > >Aug 3 07:13:13 atlovav1a kernel: [628444.615690] tg3 0000:03:00.0: eth0: Flow control is on for TX and on for RX > >Aug 3 07:13:13 atlovav1a kernel: [628444.615828] tg3 0000:03:00.0: eth0: EEE is enabled > > > >I can't exclude the possibility it's switch problem, but I don't have access to box now and the > >switch is not managed, so I'd like to try other ways first.. > > > >Could this somehow be power management related? I don't see what else PM-related could I disable > >in the bios to disable this EEE stuff.. Didn't anyone met similar problem? > > > >the system is running Centos6 with x86_64 3.0.76 kernel > > > >thanks in advance for any reply > > > >BR > > > >nik > > > > > -- ------------------------------------- Ing. Nikola CIPRICH LinuxBox.cz, s.r.o. 28. rijna 168, 709 00 Ostrava tel.: +420 591 166 214 fax: +420 596 621 273 mobil: +420 777 093 799 www.linuxbox.cz mobil servis: +420 737 238 656 email servis: servis@linuxbox.cz ------------------------------------- [-- Attachment #2: Type: application/pgp-signature, Size: 198 bytes --] ^ permalink raw reply [flat|nested] 6+ messages in thread
* Re: tg3 adapter losing link - PM related? 2013-05-03 15:13 ` Nithin Nayak Sujir 2013-05-03 18:39 ` Ben Hutchings 2013-05-03 19:28 ` Nikola Ciprich @ 2013-05-04 6:45 ` Nikola Ciprich 2 siblings, 0 replies; 6+ messages in thread From: Nikola Ciprich @ 2013-05-04 6:45 UTC (permalink / raw) To: Nithin Nayak Sujir; +Cc: netdev [-- Attachment #1: Type: text/plain, Size: 690 bytes --] Hello again, > 2. Is it possible for you to try the latest 3.9 upstream kernel? so in 3.9 it's the same.. the box is not used for production yet, so I have few days for tests and experiments if it helps. But I shouldn't cut myself off the network :) (although there's IPMI module present, so in worst case, I should be able to reboot the box..) nik -- ------------------------------------- Ing. Nikola CIPRICH LinuxBox.cz, s.r.o. 28. rijna 168, 709 00 Ostrava tel.: +420 591 166 214 fax: +420 596 621 273 mobil: +420 777 093 799 www.linuxbox.cz mobil servis: +420 737 238 656 email servis: servis@linuxbox.cz ------------------------------------- [-- Attachment #2: Type: application/pgp-signature, Size: 198 bytes --] ^ permalink raw reply [flat|nested] 6+ messages in thread
end of thread, other threads:[~2013-05-04 6:45 UTC | newest] Thread overview: 6+ messages (download: mbox.gz follow: Atom feed -- links below jump to the message on this page -- 2013-05-03 9:28 tg3 adapter losing link - PM related? Nikola Ciprich 2013-05-03 15:13 ` Nithin Nayak Sujir 2013-05-03 18:39 ` Ben Hutchings 2013-05-03 18:48 ` Nithin Nayak Sujir 2013-05-03 19:28 ` Nikola Ciprich 2013-05-04 6:45 ` Nikola Ciprich
This is a public inbox, see mirroring instructions for how to clone and mirror all data and code used for this inbox; as well as URLs for NNTP newsgroup(s).