* [REGRESSION] e1000e stopped working @ 2010-06-27 17:27 Maxim Levitsky 2010-06-27 17:29 ` Maxim Levitsky 0 siblings, 1 reply; 29+ messages in thread From: Maxim Levitsky @ 2010-06-27 17:27 UTC (permalink / raw) To: netdev@vger.kernel.org Just that, It doesn't receive anything from my internet router during DHCP. 00:19.0 Ethernet controller [0200]: Intel Corporation 82566DC Gigabit Network Connection [8086:104b] (rev 02) Subsystem: Intel Corporation Device [8086:0001] Control: I/O+ Mem+ BusMaster+ SpecCycle- MemWINV- VGASnoop- ParErr- Stepping- SERR- FastB2B- DisINTx+ Status: Cap+ 66MHz- UDF- FastB2B- ParErr- DEVSEL=fast >TAbort- <TAbort- <MAbort- >SERR- <PERR- INTx- Latency: 0 Interrupt: pin A routed to IRQ 47 Region 0: Memory at 50300000 (32-bit, non-prefetchable) [size=128K] Region 1: Memory at 50324000 (32-bit, non-prefetchable) [size=4K] Region 2: I/O ports at 30e0 [size=32] Capabilities: [c8] Power Management version 2 Flags: PMEClk- DSI+ D1- D2- AuxCurrent=0mA PME(D0+,D1-,D2-,D3hot+,D3cold+) Status: D0 PME-Enable- DSel=0 DScale=1 PME- Capabilities: [d0] Message Signalled Interrupts: Mask- 64bit+ Queue=0/0 Enable+ Address: 00000000fee0100c Data: 41c9 Kernel driver in use: e1000e Kernel modules: e1000e I use vanilla tree, commit bf2937695fe2330bfd8933a2310e7bdd2581dc2e Best regards, Maxim Levitsky ^ permalink raw reply [flat|nested] 29+ messages in thread
* Re: [REGRESSION] e1000e stopped working 2010-06-27 17:27 [REGRESSION] e1000e stopped working Maxim Levitsky @ 2010-06-27 17:29 ` Maxim Levitsky 2010-06-27 17:43 ` Maxim Levitsky 0 siblings, 1 reply; 29+ messages in thread From: Maxim Levitsky @ 2010-06-27 17:29 UTC (permalink / raw) To: netdev@vger.kernel.org On Sun, 2010-06-27 at 20:27 +0300, Maxim Levitsky wrote: > Just that, > > It doesn't receive anything from my internet router during DHCP. > > > 00:19.0 Ethernet controller [0200]: Intel Corporation 82566DC Gigabit Network Connection [8086:104b] (rev 02) > Subsystem: Intel Corporation Device [8086:0001] > Control: I/O+ Mem+ BusMaster+ SpecCycle- MemWINV- VGASnoop- ParErr- Stepping- SERR- FastB2B- DisINTx+ > Status: Cap+ 66MHz- UDF- FastB2B- ParErr- DEVSEL=fast >TAbort- <TAbort- <MAbort- >SERR- <PERR- INTx- > Latency: 0 > Interrupt: pin A routed to IRQ 47 > Region 0: Memory at 50300000 (32-bit, non-prefetchable) [size=128K] > Region 1: Memory at 50324000 (32-bit, non-prefetchable) [size=4K] > Region 2: I/O ports at 30e0 [size=32] > Capabilities: [c8] Power Management version 2 > Flags: PMEClk- DSI+ D1- D2- AuxCurrent=0mA PME(D0+,D1-,D2-,D3hot+,D3cold+) > Status: D0 PME-Enable- DSel=0 DScale=1 PME- > Capabilities: [d0] Message Signalled Interrupts: Mask- 64bit+ Queue=0/0 Enable+ > Address: 00000000fee0100c Data: 41c9 > Kernel driver in use: e1000e > Kernel modules: e1000e > > I use vanilla tree, commit bf2937695fe2330bfd8933a2310e7bdd2581dc2e > > > Best regards, > Maxim Levitsky > It appears to work now after reboot. Will keep a look for this. Disregard for now. Best regards, Maxim Levitsky ^ permalink raw reply [flat|nested] 29+ messages in thread
* Re: [REGRESSION] e1000e stopped working 2010-06-27 17:29 ` Maxim Levitsky @ 2010-06-27 17:43 ` Maxim Levitsky 2010-06-27 17:47 ` Maxim Levitsky 0 siblings, 1 reply; 29+ messages in thread From: Maxim Levitsky @ 2010-06-27 17:43 UTC (permalink / raw) To: netdev@vger.kernel.org On Sun, 2010-06-27 at 20:29 +0300, Maxim Levitsky wrote: > On Sun, 2010-06-27 at 20:27 +0300, Maxim Levitsky wrote: > > Just that, > > > > It doesn't receive anything from my internet router during DHCP. > > > > > > 00:19.0 Ethernet controller [0200]: Intel Corporation 82566DC Gigabit Network Connection [8086:104b] (rev 02) > > Subsystem: Intel Corporation Device [8086:0001] > > Control: I/O+ Mem+ BusMaster+ SpecCycle- MemWINV- VGASnoop- ParErr- Stepping- SERR- FastB2B- DisINTx+ > > Status: Cap+ 66MHz- UDF- FastB2B- ParErr- DEVSEL=fast >TAbort- <TAbort- <MAbort- >SERR- <PERR- INTx- > > Latency: 0 > > Interrupt: pin A routed to IRQ 47 > > Region 0: Memory at 50300000 (32-bit, non-prefetchable) [size=128K] > > Region 1: Memory at 50324000 (32-bit, non-prefetchable) [size=4K] > > Region 2: I/O ports at 30e0 [size=32] > > Capabilities: [c8] Power Management version 2 > > Flags: PMEClk- DSI+ D1- D2- AuxCurrent=0mA PME(D0+,D1-,D2-,D3hot+,D3cold+) > > Status: D0 PME-Enable- DSel=0 DScale=1 PME- > > Capabilities: [d0] Message Signalled Interrupts: Mask- 64bit+ Queue=0/0 Enable+ > > Address: 00000000fee0100c Data: 41c9 > > Kernel driver in use: e1000e > > Kernel modules: e1000e > > > > I use vanilla tree, commit bf2937695fe2330bfd8933a2310e7bdd2581dc2e > > > > > > Best regards, > > Maxim Levitsky > > > > It appears to work now after reboot. > Will keep a look for this. > > Disregard for now. Just s2ram cycle, problem is back. Did full reboot (power off then on), same thing card doesn't work... >Best regards, Maxim Levitsky ^ permalink raw reply [flat|nested] 29+ messages in thread
* Re: [REGRESSION] e1000e stopped working 2010-06-27 17:43 ` Maxim Levitsky @ 2010-06-27 17:47 ` Maxim Levitsky 2010-06-28 17:04 ` Allan, Bruce W 0 siblings, 1 reply; 29+ messages in thread From: Maxim Levitsky @ 2010-06-27 17:47 UTC (permalink / raw) To: netdev@vger.kernel.org On Sun, 2010-06-27 at 20:43 +0300, Maxim Levitsky wrote: > On Sun, 2010-06-27 at 20:29 +0300, Maxim Levitsky wrote: > > On Sun, 2010-06-27 at 20:27 +0300, Maxim Levitsky wrote: > > > Just that, > > > > > > It doesn't receive anything from my internet router during DHCP. > > > > > > > > > 00:19.0 Ethernet controller [0200]: Intel Corporation 82566DC Gigabit Network Connection [8086:104b] (rev 02) > > > Subsystem: Intel Corporation Device [8086:0001] > > > Control: I/O+ Mem+ BusMaster+ SpecCycle- MemWINV- VGASnoop- ParErr- Stepping- SERR- FastB2B- DisINTx+ > > > Status: Cap+ 66MHz- UDF- FastB2B- ParErr- DEVSEL=fast >TAbort- <TAbort- <MAbort- >SERR- <PERR- INTx- > > > Latency: 0 > > > Interrupt: pin A routed to IRQ 47 > > > Region 0: Memory at 50300000 (32-bit, non-prefetchable) [size=128K] > > > Region 1: Memory at 50324000 (32-bit, non-prefetchable) [size=4K] > > > Region 2: I/O ports at 30e0 [size=32] > > > Capabilities: [c8] Power Management version 2 > > > Flags: PMEClk- DSI+ D1- D2- AuxCurrent=0mA PME(D0+,D1-,D2-,D3hot+,D3cold+) > > > Status: D0 PME-Enable- DSel=0 DScale=1 PME- > > > Capabilities: [d0] Message Signalled Interrupts: Mask- 64bit+ Queue=0/0 Enable+ > > > Address: 00000000fee0100c Data: 41c9 > > > Kernel driver in use: e1000e > > > Kernel modules: e1000e > > > > > > I use vanilla tree, commit bf2937695fe2330bfd8933a2310e7bdd2581dc2e > > > > > > > > > Best regards, > > > Maxim Levitsky > > > > > > > It appears to work now after reboot. > > Will keep a look for this. > > > > Disregard for now. > > > Just s2ram cycle, problem is back. > Did full reboot (power off then on), same thing card doesn't work... > Yep, s2ram sometimes 'fixes', sometimes breaks the card. Something got broken in device initialization path. Best regards, Maxim Levitsky ^ permalink raw reply [flat|nested] 29+ messages in thread
* RE: [REGRESSION] e1000e stopped working 2010-06-27 17:47 ` Maxim Levitsky @ 2010-06-28 17:04 ` Allan, Bruce W 2010-06-28 17:14 ` Maxim Levitsky 0 siblings, 1 reply; 29+ messages in thread From: Allan, Bruce W @ 2010-06-28 17:04 UTC (permalink / raw) To: Maxim Levitsky, netdev@vger.kernel.org On Sunday, June 27, 2010 10:47 AM, Maxim Levitsky wrote: > On Sun, 2010-06-27 at 20:43 +0300, Maxim Levitsky wrote: >> On Sun, 2010-06-27 at 20:29 +0300, Maxim Levitsky wrote: >>> On Sun, 2010-06-27 at 20:27 +0300, Maxim Levitsky wrote: >>>> Just that, >>>> >>>> It doesn't receive anything from my internet router during DHCP. >>>> >>>> >>>> 00:19.0 Ethernet controller [0200]: Intel Corporation 82566DC >>>> Gigabit Network Connection [8086:104b] (rev 02) Subsystem: Intel >>>> Corporation Device [8086:0001] Control: I/O+ Mem+ BusMaster+ >>>> SpecCycle- MemWINV- VGASnoop- ParErr- Stepping- SERR- FastB2B- >>>> DisINTx+ Status: Cap+ 66MHz- UDF- FastB2B- ParErr- DEVSEL=fast >>>> >TAbort- <TAbort- <MAbort- >SERR- <PERR- INTx- Latency: 0 >>>> Interrupt: pin A routed to IRQ 47 Region 0: Memory at 50300000 >>>> (32-bit, non-prefetchable) [size=128K] Region 1: Memory at >>>> 50324000 (32-bit, non-prefetchable) [size=4K] Region 2: I/O ports >>>> at 30e0 [size=32] Capabilities: [c8] Power Management version 2 >>>> Flags: PMEClk- DSI+ D1- D2- AuxCurrent=0mA >>>> PME(D0+,D1-,D2-,D3hot+,D3cold+) Status: D0 PME-Enable- DSel=0 >>>> DScale=1 PME- Capabilities: [d0] Message Signalled Interrupts: >>>> Mask- 64bit+ Queue=0/0 Enable+ Address: 00000000fee0100c Data: >>>> 41c9 Kernel driver in use: e1000e Kernel modules: e1000e >>>> >>>> I use vanilla tree, commit bf2937695fe2330bfd8933a2310e7bdd2581dc2e >>>> >>>> >>>> Best regards, >>>> Maxim Levitsky >>>> >>> >>> It appears to work now after reboot. >>> Will keep a look for this. >>> >>> Disregard for now. >> >> >> Just s2ram cycle, problem is back. >> Did full reboot (power off then on), same thing card doesn't work... >> > > Yep, s2ram sometimes 'fixes', sometimes breaks the card. > Something got broken in device initialization path. > > Best regards, > Maxim Levitsky What distro are you using? If RedHat, since you are using DHCP will you please try putting a "LINKDELAY=10" in the /etc/sysconfig/network-scripts/ifcfg-ethX config file. Is there anything in the system log that might help narrow down the issue? Thanks, Bruce. ^ permalink raw reply [flat|nested] 29+ messages in thread
* RE: [REGRESSION] e1000e stopped working 2010-06-28 17:04 ` Allan, Bruce W @ 2010-06-28 17:14 ` Maxim Levitsky 2010-06-29 1:09 ` Allan, Bruce W 0 siblings, 1 reply; 29+ messages in thread From: Maxim Levitsky @ 2010-06-28 17:14 UTC (permalink / raw) To: Allan, Bruce W; +Cc: netdev@vger.kernel.org On Mon, 2010-06-28 at 10:04 -0700, Allan, Bruce W wrote: > On Sunday, June 27, 2010 10:47 AM, Maxim Levitsky wrote: > > On Sun, 2010-06-27 at 20:43 +0300, Maxim Levitsky wrote: > >> On Sun, 2010-06-27 at 20:29 +0300, Maxim Levitsky wrote: > >>> On Sun, 2010-06-27 at 20:27 +0300, Maxim Levitsky wrote: > >>>> Just that, > >>>> > >>>> It doesn't receive anything from my internet router during DHCP. > >>>> > >>>> > >>>> 00:19.0 Ethernet controller [0200]: Intel Corporation 82566DC > >>>> Gigabit Network Connection [8086:104b] (rev 02) Subsystem: Intel > >>>> Corporation Device [8086:0001] Control: I/O+ Mem+ BusMaster+ > >>>> SpecCycle- MemWINV- VGASnoop- ParErr- Stepping- SERR- FastB2B- > >>>> DisINTx+ Status: Cap+ 66MHz- UDF- FastB2B- ParErr- DEVSEL=fast > >>>> >TAbort- <TAbort- <MAbort- >SERR- <PERR- INTx- Latency: 0 > >>>> Interrupt: pin A routed to IRQ 47 Region 0: Memory at 50300000 > >>>> (32-bit, non-prefetchable) [size=128K] Region 1: Memory at > >>>> 50324000 (32-bit, non-prefetchable) [size=4K] Region 2: I/O ports > >>>> at 30e0 [size=32] Capabilities: [c8] Power Management version 2 > >>>> Flags: PMEClk- DSI+ D1- D2- AuxCurrent=0mA > >>>> PME(D0+,D1-,D2-,D3hot+,D3cold+) Status: D0 PME-Enable- DSel=0 > >>>> DScale=1 PME- Capabilities: [d0] Message Signalled Interrupts: > >>>> Mask- 64bit+ Queue=0/0 Enable+ Address: 00000000fee0100c Data: > >>>> 41c9 Kernel driver in use: e1000e Kernel modules: e1000e > >>>> > >>>> I use vanilla tree, commit bf2937695fe2330bfd8933a2310e7bdd2581dc2e > >>>> > >>>> > >>>> Best regards, > >>>> Maxim Levitsky > >>>> > >>> > >>> It appears to work now after reboot. > >>> Will keep a look for this. > >>> > >>> Disregard for now. > >> > >> > >> Just s2ram cycle, problem is back. > >> Did full reboot (power off then on), same thing card doesn't work... > >> > > > > Yep, s2ram sometimes 'fixes', sometimes breaks the card. > > Something got broken in device initialization path. > > > > Best regards, > > Maxim Levitsky > > What distro are you using? If RedHat, since you are using DHCP will you please try putting a "LINKDELAY=10" in the /etc/sysconfig/network-scripts/ifcfg-ethX config file. > I use ubuntu 9.10 > Is there anything in the system log that might help narrow down the issue? Nothing, really nothing. It seems to detect link, dhcp client sends requests, but doesn't recieve a thing (even tried promisc mode - doesn't help) Best regards, Maxim Levitsky ^ permalink raw reply [flat|nested] 29+ messages in thread
* RE: [REGRESSION] e1000e stopped working 2010-06-28 17:14 ` Maxim Levitsky @ 2010-06-29 1:09 ` Allan, Bruce W 2010-06-29 10:32 ` Maxim Levitsky 0 siblings, 1 reply; 29+ messages in thread From: Allan, Bruce W @ 2010-06-29 1:09 UTC (permalink / raw) To: Maxim Levitsky; +Cc: netdev@vger.kernel.org On Monday, June 28, 2010 10:14 AM, Maxim Levitsky wrote: > On Mon, 2010-06-28 at 10:04 -0700, Allan, Bruce W wrote: >> On Sunday, June 27, 2010 10:47 AM, Maxim Levitsky wrote: >>> On Sun, 2010-06-27 at 20:43 +0300, Maxim Levitsky wrote: >>>> On Sun, 2010-06-27 at 20:29 +0300, Maxim Levitsky wrote: >>>>> On Sun, 2010-06-27 at 20:27 +0300, Maxim Levitsky wrote: >>>>>> Just that, >>>>>> >>>>>> It doesn't receive anything from my internet router during DHCP. >>>>>> >>>>>> >>>>>> 00:19.0 Ethernet controller [0200]: Intel Corporation 82566DC >>>>>> Gigabit Network Connection [8086:104b] (rev 02) Subsystem: Intel >>>>>> Corporation Device [8086:0001] Control: I/O+ Mem+ BusMaster+ >>>>>> SpecCycle- MemWINV- VGASnoop- ParErr- Stepping- SERR- FastB2B- >>>>>> DisINTx+ Status: Cap+ 66MHz- UDF- FastB2B- ParErr- DEVSEL=fast >>>>>> >TAbort- <TAbort- <MAbort- >SERR- <PERR- INTx- Latency: 0 >>>>>> Interrupt: pin A routed to IRQ 47 Region 0: Memory at 50300000 >>>>>> (32-bit, non-prefetchable) [size=128K] Region 1: Memory at >>>>>> 50324000 (32-bit, non-prefetchable) [size=4K] Region 2: I/O >>>>>> ports at 30e0 [size=32] Capabilities: [c8] Power Management >>>>>> version 2 Flags: PMEClk- DSI+ D1- D2- AuxCurrent=0mA >>>>>> PME(D0+,D1-,D2-,D3hot+,D3cold+) Status: D0 PME-Enable- DSel=0 >>>>>> DScale=1 PME- Capabilities: [d0] Message Signalled Interrupts: >>>>>> Mask- 64bit+ Queue=0/0 Enable+ Address: 00000000fee0100c Data: >>>>>> 41c9 Kernel driver in use: e1000e Kernel modules: e1000e >>>>>> >>>>>> I use vanilla tree, commit >>>>>> bf2937695fe2330bfd8933a2310e7bdd2581dc2e >>>>>> >>>>>> >>>>>> Best regards, >>>>>> Maxim Levitsky >>>>>> >>>>> >>>>> It appears to work now after reboot. >>>>> Will keep a look for this. >>>>> >>>>> Disregard for now. >>>> >>>> >>>> Just s2ram cycle, problem is back. >>>> Did full reboot (power off then on), same thing card doesn't >>>> work... >>>> >>> >>> Yep, s2ram sometimes 'fixes', sometimes breaks the card. >>> Something got broken in device initialization path. >>> >>> Best regards, >>> Maxim Levitsky >> >> What distro are you using? If RedHat, since you are using DHCP will >> you please try putting a "LINKDELAY=10" in the >> /etc/sysconfig/network-scripts/ifcfg-ethX config file. >> > I use ubuntu 9.10 > >> Is there anything in the system log that might help narrow down the >> issue? > > Nothing, really nothing. > It seems to detect link, dhcp client sends requests, but doesn't > recieve a thing (even tried promisc mode - doesn't help) > > > > Best regards, > Maxim Levitsky Since you say this is a regression, when did this last work for you without this problem, i.e. which distro, which kernel? I have been unable to reproduce similar behavior. Thanks, Bruce. ^ permalink raw reply [flat|nested] 29+ messages in thread
* RE: [REGRESSION] e1000e stopped working 2010-06-29 1:09 ` Allan, Bruce W @ 2010-06-29 10:32 ` Maxim Levitsky 2010-06-29 18:37 ` Tantilov, Emil S 0 siblings, 1 reply; 29+ messages in thread From: Maxim Levitsky @ 2010-06-29 10:32 UTC (permalink / raw) To: Allan, Bruce W; +Cc: netdev@vger.kernel.org On Mon, 2010-06-28 at 18:09 -0700, Allan, Bruce W wrote: > On Monday, June 28, 2010 10:14 AM, Maxim Levitsky wrote: > > On Mon, 2010-06-28 at 10:04 -0700, Allan, Bruce W wrote: > >> On Sunday, June 27, 2010 10:47 AM, Maxim Levitsky wrote: > >>> On Sun, 2010-06-27 at 20:43 +0300, Maxim Levitsky wrote: > >>>> On Sun, 2010-06-27 at 20:29 +0300, Maxim Levitsky wrote: > >>>>> On Sun, 2010-06-27 at 20:27 +0300, Maxim Levitsky wrote: > >>>>>> Just that, > >>>>>> > >>>>>> It doesn't receive anything from my internet router during DHCP. > >>>>>> > >>>>>> > >>>>>> 00:19.0 Ethernet controller [0200]: Intel Corporation 82566DC > >>>>>> Gigabit Network Connection [8086:104b] (rev 02) Subsystem: Intel > >>>>>> Corporation Device [8086:0001] Control: I/O+ Mem+ BusMaster+ > >>>>>> SpecCycle- MemWINV- VGASnoop- ParErr- Stepping- SERR- FastB2B- > >>>>>> DisINTx+ Status: Cap+ 66MHz- UDF- FastB2B- ParErr- DEVSEL=fast > >>>>>> >TAbort- <TAbort- <MAbort- >SERR- <PERR- INTx- Latency: 0 > >>>>>> Interrupt: pin A routed to IRQ 47 Region 0: Memory at 50300000 > >>>>>> (32-bit, non-prefetchable) [size=128K] Region 1: Memory at > >>>>>> 50324000 (32-bit, non-prefetchable) [size=4K] Region 2: I/O > >>>>>> ports at 30e0 [size=32] Capabilities: [c8] Power Management > >>>>>> version 2 Flags: PMEClk- DSI+ D1- D2- AuxCurrent=0mA > >>>>>> PME(D0+,D1-,D2-,D3hot+,D3cold+) Status: D0 PME-Enable- DSel=0 > >>>>>> DScale=1 PME- Capabilities: [d0] Message Signalled Interrupts: > >>>>>> Mask- 64bit+ Queue=0/0 Enable+ Address: 00000000fee0100c Data: > >>>>>> 41c9 Kernel driver in use: e1000e Kernel modules: e1000e > >>>>>> > >>>>>> I use vanilla tree, commit > >>>>>> bf2937695fe2330bfd8933a2310e7bdd2581dc2e > >>>>>> > >>>>>> > >>>>>> Best regards, > >>>>>> Maxim Levitsky > >>>>>> > >>>>> > >>>>> It appears to work now after reboot. > >>>>> Will keep a look for this. > >>>>> > >>>>> Disregard for now. > >>>> > >>>> > >>>> Just s2ram cycle, problem is back. > >>>> Did full reboot (power off then on), same thing card doesn't > >>>> work... > >>>> > >>> > >>> Yep, s2ram sometimes 'fixes', sometimes breaks the card. > >>> Something got broken in device initialization path. > >>> > >>> Best regards, > >>> Maxim Levitsky > >> > >> What distro are you using? If RedHat, since you are using DHCP will > >> you please try putting a "LINKDELAY=10" in the > >> /etc/sysconfig/network-scripts/ifcfg-ethX config file. > >> > > I use ubuntu 9.10 > > > >> Is there anything in the system log that might help narrow down the > >> issue? > > > > Nothing, really nothing. > > It seems to detect link, dhcp client sends requests, but doesn't > > recieve a thing (even tried promisc mode - doesn't help) > > > > > > > > Best regards, > > Maxim Levitsky > > Since you say this is a regression, when did this last work for you without this problem, i.e. which distro, which kernel? I always compile kernel, and last kernel I compiled here was vanilla 2.6.33-rc4. It works just fine. I mostly use my laptop, and therefore didn't update kernel on my desktop for long time. If I find some free time I try to bisect the problem. Best regards, Maxim Levitsky ^ permalink raw reply [flat|nested] 29+ messages in thread
* RE: [REGRESSION] e1000e stopped working 2010-06-29 10:32 ` Maxim Levitsky @ 2010-06-29 18:37 ` Tantilov, Emil S 2010-06-30 22:59 ` Maxim Levitsky 0 siblings, 1 reply; 29+ messages in thread From: Tantilov, Emil S @ 2010-06-29 18:37 UTC (permalink / raw) To: Maxim Levitsky; +Cc: netdev@vger.kernel.org, Allan, Bruce W, Pieper, Jeffrey E Maxim Levitsky wrote: > On Mon, 2010-06-28 at 18:09 -0700, Allan, Bruce W wrote: >> On Monday, June 28, 2010 10:14 AM, Maxim Levitsky wrote: >>> On Mon, 2010-06-28 at 10:04 -0700, Allan, Bruce W wrote: >>>> On Sunday, June 27, 2010 10:47 AM, Maxim Levitsky wrote: >>>>> On Sun, 2010-06-27 at 20:43 +0300, Maxim Levitsky wrote: >>>>>> On Sun, 2010-06-27 at 20:29 +0300, Maxim Levitsky wrote: >>>>>>> On Sun, 2010-06-27 at 20:27 +0300, Maxim Levitsky wrote: >>>>>>>> Just that, >>>>>>>> >>>>>>>> It doesn't receive anything from my internet router during >>>>>>>> DHCP. >>>>>>>> >>>>>>>> >>>>>>>> 00:19.0 Ethernet controller [0200]: Intel Corporation 82566DC >>>>>>>> Gigabit Network Connection [8086:104b] (rev 02) Subsystem: >>>>>>>> Intel Corporation Device [8086:0001] Control: I/O+ Mem+ >>>>>>>> BusMaster+ SpecCycle- MemWINV- VGASnoop- ParErr- Stepping- >>>>>>>> SERR- FastB2B- DisINTx+ Status: Cap+ 66MHz- UDF- FastB2B- >>>>>>>> ParErr- DEVSEL=fast >TAbort- <TAbort- <MAbort- >SERR- <PERR- >>>>>>>> INTx- Latency: 0 Interrupt: pin A routed to IRQ 47 Region 0: >>>>>>>> Memory at 50300000 (32-bit, non-prefetchable) [size=128K] >>>>>>>> Region 1: Memory at 50324000 (32-bit, non-prefetchable) >>>>>>>> [size=4K] Region 2: I/O ports at 30e0 [size=32] >>>>>>>> Capabilities: [c8] Power Management version 2 Flags: PMEClk- >>>>>>>> DSI+ D1- D2- AuxCurrent=0mA PME(D0+,D1-,D2-,D3hot+,D3cold+) >>>>>>>> Status: D0 PME-Enable- DSel=0 DScale=1 PME- Capabilities: >>>>>>>> [d0] Message Signalled Interrupts: Mask- 64bit+ Queue=0/0 >>>>>>>> Enable+ Address: 00000000fee0100c Data: 41c9 Kernel driver >>>>>>>> in use: e1000e Kernel modules: e1000e >>>>>>>> >>>>>>>> I use vanilla tree, commit >>>>>>>> bf2937695fe2330bfd8933a2310e7bdd2581dc2e >>>>>>>> >>>>>>>> >>>>>>>> Best regards, >>>>>>>> Maxim Levitsky >>>>>>>> >>>>>>> >>>>>>> It appears to work now after reboot. >>>>>>> Will keep a look for this. >>>>>>> >>>>>>> Disregard for now. >>>>>> >>>>>> >>>>>> Just s2ram cycle, problem is back. >>>>>> Did full reboot (power off then on), same thing card doesn't >>>>>> work... >>>>>> >>>>> >>>>> Yep, s2ram sometimes 'fixes', sometimes breaks the card. >>>>> Something got broken in device initialization path. >>>>> >>>>> Best regards, >>>>> Maxim Levitsky >>>> >>>> What distro are you using? If RedHat, since you are using DHCP >>>> will you please try putting a "LINKDELAY=10" in the >>>> /etc/sysconfig/network-scripts/ifcfg-ethX config file. >>>> >>> I use ubuntu 9.10 >>> >>>> Is there anything in the system log that might help narrow down the >>>> issue? >>> >>> Nothing, really nothing. >>> It seems to detect link, dhcp client sends requests, but doesn't >>> recieve a thing (even tried promisc mode - doesn't help) >>> >>> >>> >>> Best regards, >>> Maxim Levitsky >> >> Since you say this is a regression, when did this last work for you >> without this problem, i.e. which distro, which kernel? > > I always compile kernel, and last kernel I compiled here was vanilla > 2.6.33-rc4. > It works just fine. > > I mostly use my laptop, and therefore didn't update kernel on my > desktop for long time. > > If I find some free time I try to bisect the problem. Could you provide some additional info about your setup: ethtool -e eth0 ethtool -d eth0 kernel config (if possible) What is the model of your system/MB? Thanks, Emil ^ permalink raw reply [flat|nested] 29+ messages in thread
* RE: [REGRESSION] e1000e stopped working 2010-06-29 18:37 ` Tantilov, Emil S @ 2010-06-30 22:59 ` Maxim Levitsky 2010-07-04 0:41 ` Maxim Levitsky 0 siblings, 1 reply; 29+ messages in thread From: Maxim Levitsky @ 2010-06-30 22:59 UTC (permalink / raw) To: Tantilov, Emil S Cc: netdev@vger.kernel.org, Allan, Bruce W, Pieper, Jeffrey E [-- Attachment #1: Type: text/plain, Size: 3835 bytes --] On Tue, 2010-06-29 at 12:37 -0600, Tantilov, Emil S wrote: > Maxim Levitsky wrote: > > On Mon, 2010-06-28 at 18:09 -0700, Allan, Bruce W wrote: > >> On Monday, June 28, 2010 10:14 AM, Maxim Levitsky wrote: > >>> On Mon, 2010-06-28 at 10:04 -0700, Allan, Bruce W wrote: > >>>> On Sunday, June 27, 2010 10:47 AM, Maxim Levitsky wrote: > >>>>> On Sun, 2010-06-27 at 20:43 +0300, Maxim Levitsky wrote: > >>>>>> On Sun, 2010-06-27 at 20:29 +0300, Maxim Levitsky wrote: > >>>>>>> On Sun, 2010-06-27 at 20:27 +0300, Maxim Levitsky wrote: > >>>>>>>> Just that, > >>>>>>>> > >>>>>>>> It doesn't receive anything from my internet router during > >>>>>>>> DHCP. > >>>>>>>> > >>>>>>>> > >>>>>>>> 00:19.0 Ethernet controller [0200]: Intel Corporation 82566DC > >>>>>>>> Gigabit Network Connection [8086:104b] (rev 02) Subsystem: > >>>>>>>> Intel Corporation Device [8086:0001] Control: I/O+ Mem+ > >>>>>>>> BusMaster+ SpecCycle- MemWINV- VGASnoop- ParErr- Stepping- > >>>>>>>> SERR- FastB2B- DisINTx+ Status: Cap+ 66MHz- UDF- FastB2B- > >>>>>>>> ParErr- DEVSEL=fast >TAbort- <TAbort- <MAbort- >SERR- <PERR- > >>>>>>>> INTx- Latency: 0 Interrupt: pin A routed to IRQ 47 Region 0: > >>>>>>>> Memory at 50300000 (32-bit, non-prefetchable) [size=128K] > >>>>>>>> Region 1: Memory at 50324000 (32-bit, non-prefetchable) > >>>>>>>> [size=4K] Region 2: I/O ports at 30e0 [size=32] > >>>>>>>> Capabilities: [c8] Power Management version 2 Flags: PMEClk- > >>>>>>>> DSI+ D1- D2- AuxCurrent=0mA PME(D0+,D1-,D2-,D3hot+,D3cold+) > >>>>>>>> Status: D0 PME-Enable- DSel=0 DScale=1 PME- Capabilities: > >>>>>>>> [d0] Message Signalled Interrupts: Mask- 64bit+ Queue=0/0 > >>>>>>>> Enable+ Address: 00000000fee0100c Data: 41c9 Kernel driver > >>>>>>>> in use: e1000e Kernel modules: e1000e > >>>>>>>> > >>>>>>>> I use vanilla tree, commit > >>>>>>>> bf2937695fe2330bfd8933a2310e7bdd2581dc2e > >>>>>>>> > >>>>>>>> > >>>>>>>> Best regards, > >>>>>>>> Maxim Levitsky > >>>>>>>> > >>>>>>> > >>>>>>> It appears to work now after reboot. > >>>>>>> Will keep a look for this. > >>>>>>> > >>>>>>> Disregard for now. > >>>>>> > >>>>>> > >>>>>> Just s2ram cycle, problem is back. > >>>>>> Did full reboot (power off then on), same thing card doesn't > >>>>>> work... > >>>>>> > >>>>> > >>>>> Yep, s2ram sometimes 'fixes', sometimes breaks the card. > >>>>> Something got broken in device initialization path. > >>>>> > >>>>> Best regards, > >>>>> Maxim Levitsky > >>>> > >>>> What distro are you using? If RedHat, since you are using DHCP > >>>> will you please try putting a "LINKDELAY=10" in the > >>>> /etc/sysconfig/network-scripts/ifcfg-ethX config file. > >>>> > >>> I use ubuntu 9.10 > >>> > >>>> Is there anything in the system log that might help narrow down the > >>>> issue? > >>> > >>> Nothing, really nothing. > >>> It seems to detect link, dhcp client sends requests, but doesn't > >>> recieve a thing (even tried promisc mode - doesn't help) > >>> > >>> > >>> > >>> Best regards, > >>> Maxim Levitsky > >> > >> Since you say this is a regression, when did this last work for you > >> without this problem, i.e. which distro, which kernel? > > > > I always compile kernel, and last kernel I compiled here was vanilla > > 2.6.33-rc4. > > It works just fine. > > > > I mostly use my laptop, and therefore didn't update kernel on my > > desktop for long time. > > > > If I find some free time I try to bisect the problem. > > Could you provide some additional info about your setup: > ethtool -e eth0 > ethtool -d eth0 > kernel config (if possible) > > What is the model of your system/MB? Sure, My motherboard on this system is Intel DG965RY The bug in about 90% reproducible. Doing several s2ram cycles, its possible to catch a moment when the device starts working. Best regards, Maxim Levitsky [-- Attachment #2: eeprom --] [-- Type: text/plain, Size: 14622 bytes --] Offset Values ------ ------ 0x0000 00 19 d1 ed 88 2a 00 08 ff ff 10 10 ff ff ff ff 0x0010 ff ff ff ff c7 10 01 00 86 80 4b 10 86 80 00 00 0x0020 01 0d 00 00 00 00 05 96 20 50 00 33 00 00 07 8d 0x0030 84 06 41 03 00 00 00 00 00 00 00 00 00 00 00 00 0x0040 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 0x0050 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 0x0060 00 01 00 40 2a 12 07 40 ff ff ff ff ff ff ff ff 0x0070 ff ff ff ff ff ff ff ff ff ff ff ff ff ff 1f ff 0x0080 20 61 1f 00 02 0e 12 00 40 2f 1f 00 18 90 1b 00 0x0090 00 00 12 00 a0 2f 1f 00 24 8b 11 00 f0 f8 12 00 0x00a0 00 20 1f 00 b0 10 10 00 00 00 11 00 c0 20 1f 00 0x00b0 9a 24 1d 00 d3 00 1e 00 a0 28 1f 00 ce 04 14 00 0x00c0 60 2f 1f 00 e4 29 10 00 00 00 1f 00 40 01 00 00 0x00d0 20 1f 1f 00 06 16 10 00 14 b8 11 00 2a 01 15 00 0x00e0 67 00 1e 00 40 1f 1f 00 65 00 14 00 2a 00 15 00 0x00f0 2a 00 16 00 60 1f 1f 00 b0 3f 12 00 ff c0 16 00 0x0100 ec 1d 17 00 ef f9 18 00 10 02 19 00 80 18 1f 00 0x0110 03 00 15 00 80 17 1f 00 08 00 16 00 80 17 1f 00 0x0120 08 d0 18 00 80 18 1f 00 18 d9 18 00 60 18 1f 00 0x0130 00 08 1a 00 00 00 1f 00 01 00 19 00 40 13 00 00 0x0140 51 60 1f 00 01 00 11 00 00 00 1f 00 ff ff ff ff 0x0150 ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff 0x0160 ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff 0x0170 ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff 0x0180 ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff 0x0190 ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff 0x01a0 ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff 0x01b0 ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff 0x01c0 ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff 0x01d0 ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff 0x01e0 ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff 0x01f0 ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff 0x0200 ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff 0x0210 ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff 0x0220 ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff 0x0230 ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff 0x0240 ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff 0x0250 ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff 0x0260 ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff 0x0270 ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff 0x0280 ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff 0x0290 ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff 0x02a0 ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff 0x02b0 ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff 0x02c0 ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff 0x02d0 ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff 0x02e0 ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff 0x02f0 ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff 0x0300 ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff 0x0310 ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff 0x0320 ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff 0x0330 ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff 0x0340 ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff 0x0350 ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff 0x0360 ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff 0x0370 ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff 0x0380 ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff 0x0390 ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff 0x03a0 ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff 0x03b0 ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff 0x03c0 ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff 0x03d0 ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff 0x03e0 ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff 0x03f0 ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff 0x0400 ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff 0x0410 ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff 0x0420 ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff 0x0430 ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff 0x0440 ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff 0x0450 ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff 0x0460 ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff 0x0470 ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff 0x0480 ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff 0x0490 ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff 0x04a0 ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff 0x04b0 ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff 0x04c0 ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff 0x04d0 ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff 0x04e0 ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff 0x04f0 ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff 0x0500 ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff 0x0510 ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff 0x0520 ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff 0x0530 ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff 0x0540 ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff 0x0550 ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff 0x0560 ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff 0x0570 ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff 0x0580 ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff 0x0590 ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff 0x05a0 ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff 0x05b0 ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff 0x05c0 ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff 0x05d0 ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff 0x05e0 ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff 0x05f0 ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff 0x0600 ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff 0x0610 ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff 0x0620 ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff 0x0630 ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff 0x0640 ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff 0x0650 ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff 0x0660 ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff 0x0670 ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff 0x0680 ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff 0x0690 ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff 0x06a0 ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff 0x06b0 ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff 0x06c0 ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff 0x06d0 ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff 0x06e0 ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff 0x06f0 ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff 0x0700 ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff 0x0710 ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff 0x0720 ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff 0x0730 ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff 0x0740 ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff 0x0750 ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff 0x0760 ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff 0x0770 ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff 0x0780 ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff 0x0790 ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff 0x07a0 ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff 0x07b0 ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff 0x07c0 ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff 0x07d0 ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff 0x07e0 ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff 0x07f0 ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff 0x0800 ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff 0x0810 ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff 0x0820 ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff 0x0830 ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff 0x0840 ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff 0x0850 ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff 0x0860 ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff 0x0870 ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff 0x0880 ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff 0x0890 ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff 0x08a0 ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff 0x08b0 ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff 0x08c0 ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff 0x08d0 ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff 0x08e0 ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff 0x08f0 ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff 0x0900 ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff 0x0910 ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff 0x0920 ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff 0x0930 ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff 0x0940 ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff 0x0950 ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff 0x0960 ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff 0x0970 ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff 0x0980 ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff 0x0990 ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff 0x09a0 ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff 0x09b0 ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff 0x09c0 ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff 0x09d0 ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff 0x09e0 ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff 0x09f0 ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff 0x0a00 ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff 0x0a10 ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff 0x0a20 ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff 0x0a30 ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff 0x0a40 ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff 0x0a50 ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff 0x0a60 ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff 0x0a70 ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff 0x0a80 ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff 0x0a90 ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff 0x0aa0 ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff 0x0ab0 ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff 0x0ac0 ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff 0x0ad0 ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff 0x0ae0 ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff 0x0af0 ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff 0x0b00 ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff 0x0b10 ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff 0x0b20 ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff 0x0b30 ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff 0x0b40 ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff 0x0b50 ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff 0x0b60 ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff 0x0b70 ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff 0x0b80 ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff 0x0b90 ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff 0x0ba0 ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff 0x0bb0 ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff 0x0bc0 ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff 0x0bd0 ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff 0x0be0 ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff 0x0bf0 ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff 0x0c00 ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff 0x0c10 ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff 0x0c20 ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff 0x0c30 ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff 0x0c40 ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff 0x0c50 ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff 0x0c60 ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff 0x0c70 ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff 0x0c80 ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff 0x0c90 ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff 0x0ca0 ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff 0x0cb0 ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff 0x0cc0 ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff 0x0cd0 ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff 0x0ce0 ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff 0x0cf0 ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff 0x0d00 ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff 0x0d10 ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff 0x0d20 ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff 0x0d30 ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff 0x0d40 ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff 0x0d50 ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff 0x0d60 ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff 0x0d70 ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff 0x0d80 ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff 0x0d90 ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff 0x0da0 ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff 0x0db0 ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff 0x0dc0 ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff 0x0dd0 ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff 0x0de0 ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff 0x0df0 ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff 0x0e00 ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff 0x0e10 ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff 0x0e20 ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff 0x0e30 ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff 0x0e40 ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff 0x0e50 ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff 0x0e60 ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff 0x0e70 ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff 0x0e80 ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff 0x0e90 ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff 0x0ea0 ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff 0x0eb0 ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff 0x0ec0 ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff 0x0ed0 ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff 0x0ee0 ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff 0x0ef0 ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff 0x0f00 ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff 0x0f10 ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff 0x0f20 ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff 0x0f30 ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff 0x0f40 ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff 0x0f50 ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff 0x0f60 ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff 0x0f70 ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff 0x0f80 ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff 0x0f90 ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff 0x0fa0 ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff 0x0fb0 ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff 0x0fc0 ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff 0x0fd0 ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff 0x0fe0 ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff 0x0ff0 ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff [-- Attachment #3: misc --] [-- Type: text/plain, Size: 1297 bytes --] maxim@MAIN:~$ sudo ethtool -i eth1 driver: e1000e version: 1.0.2-k4 firmware-version: 1.1-0 bus-info: 0000:00:19.0 maxim@MAIN:~$ sudo ethtool -g eth1 Ring parameters for eth1: Pre-set maximums: RX: 4096 RX Mini: 0 RX Jumbo: 0 TX: 4096 Current hardware settings: RX: 256 RX Mini: 0 RX Jumbo: 0 TX: 256 maxim@MAIN:~$ ifconfig eth1 Link encap:Ethernet HWaddr 00:19:d1:ed:88:2a UP BROADCAST RUNNING MULTICAST MTU:1500 Metric:1 RX packets:18 errors:0 dropped:0 overruns:0 frame:0 TX packets:8 errors:0 dropped:0 overruns:0 carrier:0 collisions:0 txqueuelen:1000 RX bytes:3411 (3.4 KB) TX bytes:2736 (2.7 KB) Interrupt:20 Memory:50300000-50320000 Number of RX packets seems to increase Wireshark doesn't see them Example: maxim@MAIN:~$ sudo dhclient eth1 Internet Systems Consortium DHCP Client V3.1.2 Copyright 2004-2008 Internet Systems Consortium. All rights reserved. For info, please visit http://www.isc.org/sw/dhcp/ Listening on LPF/eth1/00:19:d1:ed:88:2a Sending on LPF/eth1/00:19:d1:ed:88:2a Sending on Socket/fallback DHCPDISCOVER on eth1 to 255.255.255.255 port 67 interval 6 DHCPDISCOVER on eth1 to 255.255.255.255 port 67 interval 11 DHCPDISCOVER on eth1 to 255.255.255.255 port 67 interval 18 [-- Attachment #4: reg_dump --] [-- Type: audio/x-ape, Size: 2300 bytes --] [-- Attachment #5: .config.gz --] [-- Type: application/x-gzip, Size: 16213 bytes --] ^ permalink raw reply [flat|nested] 29+ messages in thread
* RE: [REGRESSION] e1000e stopped working 2010-06-30 22:59 ` Maxim Levitsky @ 2010-07-04 0:41 ` Maxim Levitsky 2010-07-04 22:48 ` [REGRESSION] e1000e stopped working [MANUALLY BISECTED] Maxim Levitsky 0 siblings, 1 reply; 29+ messages in thread From: Maxim Levitsky @ 2010-07-04 0:41 UTC (permalink / raw) To: Tantilov, Emil S Cc: netdev@vger.kernel.org, Allan, Bruce W, Pieper, Jeffrey E On Thu, 2010-07-01 at 01:59 +0300, Maxim Levitsky wrote: > On Tue, 2010-06-29 at 12:37 -0600, Tantilov, Emil S wrote: > > Maxim Levitsky wrote: > > > On Mon, 2010-06-28 at 18:09 -0700, Allan, Bruce W wrote: > > >> On Monday, June 28, 2010 10:14 AM, Maxim Levitsky wrote: > > >>> On Mon, 2010-06-28 at 10:04 -0700, Allan, Bruce W wrote: > > >>>> On Sunday, June 27, 2010 10:47 AM, Maxim Levitsky wrote: > > >>>>> On Sun, 2010-06-27 at 20:43 +0300, Maxim Levitsky wrote: > > >>>>>> On Sun, 2010-06-27 at 20:29 +0300, Maxim Levitsky wrote: > > >>>>>>> On Sun, 2010-06-27 at 20:27 +0300, Maxim Levitsky wrote: > > >>>>>>>> Just that, > > >>>>>>>> > > >>>>>>>> It doesn't receive anything from my internet router during > > >>>>>>>> DHCP. > > >>>>>>>> > > >>>>>>>> > > >>>>>>>> 00:19.0 Ethernet controller [0200]: Intel Corporation 82566DC > > >>>>>>>> Gigabit Network Connection [8086:104b] (rev 02) Subsystem: > > >>>>>>>> Intel Corporation Device [8086:0001] Control: I/O+ Mem+ > > >>>>>>>> BusMaster+ SpecCycle- MemWINV- VGASnoop- ParErr- Stepping- > > >>>>>>>> SERR- FastB2B- DisINTx+ Status: Cap+ 66MHz- UDF- FastB2B- > > >>>>>>>> ParErr- DEVSEL=fast >TAbort- <TAbort- <MAbort- >SERR- <PERR- > > >>>>>>>> INTx- Latency: 0 Interrupt: pin A routed to IRQ 47 Region 0: > > >>>>>>>> Memory at 50300000 (32-bit, non-prefetchable) [size=128K] > > >>>>>>>> Region 1: Memory at 50324000 (32-bit, non-prefetchable) > > >>>>>>>> [size=4K] Region 2: I/O ports at 30e0 [size=32] > > >>>>>>>> Capabilities: [c8] Power Management version 2 Flags: PMEClk- > > >>>>>>>> DSI+ D1- D2- AuxCurrent=0mA PME(D0+,D1-,D2-,D3hot+,D3cold+) > > >>>>>>>> Status: D0 PME-Enable- DSel=0 DScale=1 PME- Capabilities: > > >>>>>>>> [d0] Message Signalled Interrupts: Mask- 64bit+ Queue=0/0 > > >>>>>>>> Enable+ Address: 00000000fee0100c Data: 41c9 Kernel driver > > >>>>>>>> in use: e1000e Kernel modules: e1000e > > >>>>>>>> > > >>>>>>>> I use vanilla tree, commit > > >>>>>>>> bf2937695fe2330bfd8933a2310e7bdd2581dc2e > > >>>>>>>> > > >>>>>>>> > > >>>>>>>> Best regards, > > >>>>>>>> Maxim Levitsky > > >>>>>>>> > > >>>>>>> > > >>>>>>> It appears to work now after reboot. > > >>>>>>> Will keep a look for this. > > >>>>>>> > > >>>>>>> Disregard for now. > > >>>>>> > > >>>>>> > > >>>>>> Just s2ram cycle, problem is back. > > >>>>>> Did full reboot (power off then on), same thing card doesn't > > >>>>>> work... > > >>>>>> > > >>>>> > > >>>>> Yep, s2ram sometimes 'fixes', sometimes breaks the card. > > >>>>> Something got broken in device initialization path. > > >>>>> > > >>>>> Best regards, > > >>>>> Maxim Levitsky > > >>>> > > >>>> What distro are you using? If RedHat, since you are using DHCP > > >>>> will you please try putting a "LINKDELAY=10" in the > > >>>> /etc/sysconfig/network-scripts/ifcfg-ethX config file. > > >>>> > > >>> I use ubuntu 9.10 > > >>> > > >>>> Is there anything in the system log that might help narrow down the > > >>>> issue? > > >>> > > >>> Nothing, really nothing. > > >>> It seems to detect link, dhcp client sends requests, but doesn't > > >>> recieve a thing (even tried promisc mode - doesn't help) > > >>> > > >>> > > >>> > > >>> Best regards, > > >>> Maxim Levitsky > > >> > > >> Since you say this is a regression, when did this last work for you > > >> without this problem, i.e. which distro, which kernel? > > > > > > I always compile kernel, and last kernel I compiled here was vanilla > > > 2.6.33-rc4. > > > It works just fine. > > > > > > I mostly use my laptop, and therefore didn't update kernel on my > > > desktop for long time. > > > > > > If I find some free time I try to bisect the problem. > > > > Could you provide some additional info about your setup: > > ethtool -e eth0 > > ethtool -d eth0 > > kernel config (if possible) > > > > What is the model of your system/MB? > > > Sure, > > > My motherboard on this system is Intel DG965RY > > The bug in about 90% reproducible. > Doing several s2ram cycles, its possible to catch a moment when the > device starts working. > Just tested 2.6.34, and it works, so this is 2.6.35 regression. Best regards, Maxim Levitsky ^ permalink raw reply [flat|nested] 29+ messages in thread
* RE: [REGRESSION] e1000e stopped working [MANUALLY BISECTED] 2010-07-04 0:41 ` Maxim Levitsky @ 2010-07-04 22:48 ` Maxim Levitsky 2010-07-05 8:13 ` Jeff Kirsher 0 siblings, 1 reply; 29+ messages in thread From: Maxim Levitsky @ 2010-07-04 22:48 UTC (permalink / raw) To: Tantilov, Emil S Cc: netdev@vger.kernel.org, Allan, Bruce W, Pieper, Jeffrey E Did few guesses, and now I see that reverting the below commit fixes the problem. "e1000e: Fix/cleanup PHY reset code for ICHx/PCHx" e98cac447cc1cc418dff1d610a5c79c4f2bdec7f. Best regards, Maxim Levitsky ^ permalink raw reply [flat|nested] 29+ messages in thread
* Re: [REGRESSION] e1000e stopped working [MANUALLY BISECTED] 2010-07-04 22:48 ` [REGRESSION] e1000e stopped working [MANUALLY BISECTED] Maxim Levitsky @ 2010-07-05 8:13 ` Jeff Kirsher 2010-07-05 9:58 ` Maxim Levitsky 0 siblings, 1 reply; 29+ messages in thread From: Jeff Kirsher @ 2010-07-05 8:13 UTC (permalink / raw) To: Maxim Levitsky Cc: Tantilov, Emil S, netdev@vger.kernel.org, Allan, Bruce W, Pieper, Jeffrey E On Sun, Jul 4, 2010 at 15:48, Maxim Levitsky <maximlevitsky@gmail.com> wrote: > Did few guesses, and now I see that reverting the below commit fixes the > problem. > > "e1000e: Fix/cleanup PHY reset code for ICHx/PCHx" > e98cac447cc1cc418dff1d610a5c79c4f2bdec7f. > > > Best regards, > Maxim Levitsky > > -- Can you give us till Tuesday to respond? I know that there are some additional e1000e patches in my queue, which may resolve the issue, but this weekend the power is down to do some infrastructure upgrades which prevents us from doing any investigation.debugging until Tuesday. -- Cheers, Jeff ^ permalink raw reply [flat|nested] 29+ messages in thread
* Re: [REGRESSION] e1000e stopped working [MANUALLY BISECTED] 2010-07-05 8:13 ` Jeff Kirsher @ 2010-07-05 9:58 ` Maxim Levitsky 2010-07-12 15:56 ` Maxim Levitsky 0 siblings, 1 reply; 29+ messages in thread From: Maxim Levitsky @ 2010-07-05 9:58 UTC (permalink / raw) To: Jeff Kirsher Cc: Tantilov, Emil S, netdev@vger.kernel.org, Allan, Bruce W, Pieper, Jeffrey E On Mon, 2010-07-05 at 01:13 -0700, Jeff Kirsher wrote: > On Sun, Jul 4, 2010 at 15:48, Maxim Levitsky <maximlevitsky@gmail.com> wrote: > > Did few guesses, and now I see that reverting the below commit fixes the > > problem. > > > > "e1000e: Fix/cleanup PHY reset code for ICHx/PCHx" > > e98cac447cc1cc418dff1d610a5c79c4f2bdec7f. > > > > > > Best regards, > > Maxim Levitsky > > > > -- > > Can you give us till Tuesday to respond? I know that there are some > additional e1000e patches in my queue, which may resolve the issue, > but this weekend the power is down to do some infrastructure upgrades > which prevents us from doing any investigation.debugging until > Tuesday. > Sure. Best regards, Maxim Levitsky ^ permalink raw reply [flat|nested] 29+ messages in thread
* Re: [REGRESSION] e1000e stopped working [MANUALLY BISECTED] 2010-07-05 9:58 ` Maxim Levitsky @ 2010-07-12 15:56 ` Maxim Levitsky 2010-07-12 21:23 ` Tantilov, Emil S 0 siblings, 1 reply; 29+ messages in thread From: Maxim Levitsky @ 2010-07-12 15:56 UTC (permalink / raw) To: Jeff Kirsher Cc: Tantilov, Emil S, netdev@vger.kernel.org, Allan, Bruce W, Pieper, Jeffrey E On Mon, 2010-07-05 at 12:58 +0300, Maxim Levitsky wrote: > On Mon, 2010-07-05 at 01:13 -0700, Jeff Kirsher wrote: > > On Sun, Jul 4, 2010 at 15:48, Maxim Levitsky <maximlevitsky@gmail.com> wrote: > > > Did few guesses, and now I see that reverting the below commit fixes the > > > problem. > > > > > > "e1000e: Fix/cleanup PHY reset code for ICHx/PCHx" > > > e98cac447cc1cc418dff1d610a5c79c4f2bdec7f. > > > > > > > > > Best regards, > > > Maxim Levitsky > > > > > > -- > > > > Can you give us till Tuesday to respond? I know that there are some > > additional e1000e patches in my queue, which may resolve the issue, > > but this weekend the power is down to do some infrastructure upgrades > > which prevents us from doing any investigation.debugging until > > Tuesday. > > > > Sure. > > Best regards, > Maxim Levitsky > Updates? or 2.6.35 will ship with e0000e ? :-) I really have very little time to help further with that for now. Best regards, Maxim Levitsky ^ permalink raw reply [flat|nested] 29+ messages in thread
* RE: [REGRESSION] e1000e stopped working [MANUALLY BISECTED] 2010-07-12 15:56 ` Maxim Levitsky @ 2010-07-12 21:23 ` Tantilov, Emil S 2010-07-13 0:38 ` Maxim Levitsky 0 siblings, 1 reply; 29+ messages in thread From: Tantilov, Emil S @ 2010-07-12 21:23 UTC (permalink / raw) To: Maxim Levitsky, Kirsher, Jeffrey T Cc: netdev@vger.kernel.org, Allan, Bruce W, Pieper, Jeffrey E Maxim Levitsky wrote: > On Mon, 2010-07-05 at 12:58 +0300, Maxim Levitsky wrote: >> On Mon, 2010-07-05 at 01:13 -0700, Jeff Kirsher wrote: >>> On Sun, Jul 4, 2010 at 15:48, Maxim Levitsky >>> <maximlevitsky@gmail.com> wrote: >>>> Did few guesses, and now I see that reverting the below commit >>>> fixes the problem. >>>> >>>> "e1000e: Fix/cleanup PHY reset code for ICHx/PCHx" >>>> e98cac447cc1cc418dff1d610a5c79c4f2bdec7f. >>>> >>>> >>>> Best regards, >>>> Maxim Levitsky >>>> >>>> -- >>> >>> Can you give us till Tuesday to respond? I know that there are some >>> additional e1000e patches in my queue, which may resolve the issue, >>> but this weekend the power is down to do some infrastructure >>> upgrades >>> which prevents us from doing any investigation.debugging until >>> Tuesday. >>> >> >> Sure. >> >> Best regards, >> Maxim Levitsky >> > > Updates? We are working on reproducing the issue. So far we have not seen the problem when testing with net-next. I asked in previous email about some additional info from ethtool (-d, -e, -S) and kernel config. That would help us to narrow it down. Thanks, Emil ^ permalink raw reply [flat|nested] 29+ messages in thread
* RE: [REGRESSION] e1000e stopped working [MANUALLY BISECTED] 2010-07-12 21:23 ` Tantilov, Emil S @ 2010-07-13 0:38 ` Maxim Levitsky 2010-07-14 22:56 ` Tantilov, Emil S 0 siblings, 1 reply; 29+ messages in thread From: Maxim Levitsky @ 2010-07-13 0:38 UTC (permalink / raw) To: Tantilov, Emil S Cc: Kirsher, Jeffrey T, netdev@vger.kernel.org, Allan, Bruce W, Pieper, Jeffrey E On Mon, 2010-07-12 at 15:23 -0600, Tantilov, Emil S wrote: > Maxim Levitsky wrote: > > On Mon, 2010-07-05 at 12:58 +0300, Maxim Levitsky wrote: > >> On Mon, 2010-07-05 at 01:13 -0700, Jeff Kirsher wrote: > >>> On Sun, Jul 4, 2010 at 15:48, Maxim Levitsky > >>> <maximlevitsky@gmail.com> wrote: > >>>> Did few guesses, and now I see that reverting the below commit > >>>> fixes the problem. > >>>> > >>>> "e1000e: Fix/cleanup PHY reset code for ICHx/PCHx" > >>>> e98cac447cc1cc418dff1d610a5c79c4f2bdec7f. > >>>> > >>>> > >>>> Best regards, > >>>> Maxim Levitsky > >>>> > >>>> -- > >>> > >>> Can you give us till Tuesday to respond? I know that there are some > >>> additional e1000e patches in my queue, which may resolve the issue, > >>> but this weekend the power is down to do some infrastructure > >>> upgrades > >>> which prevents us from doing any investigation.debugging until > >>> Tuesday. > >>> > >> > >> Sure. > >> > >> Best regards, > >> Maxim Levitsky > >> > > > > Updates? > > We are working on reproducing the issue. So far we have not seen the problem when testing with net-next. > > I asked in previous email about some additional info from ethtool (-d, -e, -S) and kernel config. That would help us to narrow it down. > > Thanks, > Emil I did send -e and -d output. Since you probably want -S output during failure, I need to recompile kernel for that. I will do that soon. One question, in two weeks I hope 2.6.35 won't be released? If so, I will have enough free time then to narrow down this issue. Other solution, is to revert this commit. (I have never seen this problem with it reverted). Best regards, Maxim Levitsky ^ permalink raw reply [flat|nested] 29+ messages in thread
* RE: [REGRESSION] e1000e stopped working [MANUALLY BISECTED] 2010-07-13 0:38 ` Maxim Levitsky @ 2010-07-14 22:56 ` Tantilov, Emil S 2010-07-14 23:33 ` Maxim Levitsky 0 siblings, 1 reply; 29+ messages in thread From: Tantilov, Emil S @ 2010-07-14 22:56 UTC (permalink / raw) To: Maxim Levitsky Cc: Kirsher, Jeffrey T, netdev@vger.kernel.org, Allan, Bruce W, Pieper, Jeffrey E Maxim Levitsky wrote: > On Mon, 2010-07-12 at 15:23 -0600, Tantilov, Emil S wrote: >> Maxim Levitsky wrote: >>> On Mon, 2010-07-05 at 12:58 +0300, Maxim Levitsky wrote: >>>> On Mon, 2010-07-05 at 01:13 -0700, Jeff Kirsher wrote: >>>>> On Sun, Jul 4, 2010 at 15:48, Maxim Levitsky >>>>> <maximlevitsky@gmail.com> wrote: >>>>>> Did few guesses, and now I see that reverting the below commit >>>>>> fixes the problem. >>>>>> >>>>>> "e1000e: Fix/cleanup PHY reset code for ICHx/PCHx" >>>>>> e98cac447cc1cc418dff1d610a5c79c4f2bdec7f. >>>>>> >>>>>> >>>>>> Best regards, >>>>>> Maxim Levitsky >>>>>> >>>>>> -- >>>>> >>>>> Can you give us till Tuesday to respond? I know that there are >>>>> some additional e1000e patches in my queue, which may resolve the >>>>> issue, but this weekend the power is down to do some >>>>> infrastructure upgrades which prevents us from doing any >>>>> investigation.debugging until Tuesday. >>>>> >>>> >>>> Sure. >>>> >>>> Best regards, >>>> Maxim Levitsky >>>> >>> >>> Updates? >> >> We are working on reproducing the issue. So far we have not seen the >> problem when testing with net-next. >> >> I asked in previous email about some additional info from ethtool >> (-d, -e, -S) and kernel config. That would help us to narrow it >> down. >> >> Thanks, >> Emil > I did send -e and -d output. Sorry, looks like I lost the email with the attachements. Could you provide the output of dmesg after the failure occurs? > Since you probably want -S output during failure, I need to recompile > kernel for that. I will do that soon. > > > One question, in two weeks I hope 2.6.35 won't be released? > If so, I will have enough free time then to narrow down this issue. > > Other solution, is to revert this commit. > (I have never seen this problem with it reverted). We have been running reboot tests on 2 separate systems with recent net-next kernels using your config and so far no luck in reproducing this issue. What is the make model of your system (or MB)? Thanks, Emil ^ permalink raw reply [flat|nested] 29+ messages in thread
* RE: [REGRESSION] e1000e stopped working [MANUALLY BISECTED] 2010-07-14 22:56 ` Tantilov, Emil S @ 2010-07-14 23:33 ` Maxim Levitsky 2010-07-15 18:57 ` Maxim Levitsky 0 siblings, 1 reply; 29+ messages in thread From: Maxim Levitsky @ 2010-07-14 23:33 UTC (permalink / raw) To: Tantilov, Emil S Cc: Kirsher, Jeffrey T, netdev@vger.kernel.org, Allan, Bruce W, Pieper, Jeffrey E On Wed, 2010-07-14 at 16:56 -0600, Tantilov, Emil S wrote: > Maxim Levitsky wrote: > > On Mon, 2010-07-12 at 15:23 -0600, Tantilov, Emil S wrote: > >> Maxim Levitsky wrote: > >>> On Mon, 2010-07-05 at 12:58 +0300, Maxim Levitsky wrote: > >>>> On Mon, 2010-07-05 at 01:13 -0700, Jeff Kirsher wrote: > >>>>> On Sun, Jul 4, 2010 at 15:48, Maxim Levitsky > >>>>> <maximlevitsky@gmail.com> wrote: > >>>>>> Did few guesses, and now I see that reverting the below commit > >>>>>> fixes the problem. > >>>>>> > >>>>>> "e1000e: Fix/cleanup PHY reset code for ICHx/PCHx" > >>>>>> e98cac447cc1cc418dff1d610a5c79c4f2bdec7f. > >>>>>> > >>>>>> > >>>>>> Best regards, > >>>>>> Maxim Levitsky > >>>>>> > >>>>>> -- > >>>>> > >>>>> Can you give us till Tuesday to respond? I know that there are > >>>>> some additional e1000e patches in my queue, which may resolve the > >>>>> issue, but this weekend the power is down to do some > >>>>> infrastructure upgrades which prevents us from doing any > >>>>> investigation.debugging until Tuesday. > >>>>> > >>>> > >>>> Sure. > >>>> > >>>> Best regards, > >>>> Maxim Levitsky > >>>> > >>> > >>> Updates? > >> > >> We are working on reproducing the issue. So far we have not seen the > >> problem when testing with net-next. > >> > >> I asked in previous email about some additional info from ethtool > >> (-d, -e, -S) and kernel config. That would help us to narrow it > >> down. > >> > >> Thanks, > >> Emil > > I did send -e and -d output. > > Sorry, looks like I lost the email with the attachements. > > Could you provide the output of dmesg after the failure occurs? > > > Since you probably want -S output during failure, I need to recompile > > kernel for that. I will do that soon. > > > > > > One question, in two weeks I hope 2.6.35 won't be released? > > If so, I will have enough free time then to narrow down this issue. > > > > Other solution, is to revert this commit. > > (I have never seen this problem with it reverted). > > We have been running reboot tests on 2 separate systems with recent net-next kernels > using your config and so far no luck in reproducing this issue. > > What is the make model of your system (or MB)? the motherboard is Intel DG965RY. However, I am using vanilla kernel. net-next might contain further fixes. I see if net-next works here. Best regards, Maxim Levitsky ^ permalink raw reply [flat|nested] 29+ messages in thread
* RE: [REGRESSION] e1000e stopped working [MANUALLY BISECTED] 2010-07-14 23:33 ` Maxim Levitsky @ 2010-07-15 18:57 ` Maxim Levitsky 2010-07-15 19:02 ` Tantilov, Emil S 0 siblings, 1 reply; 29+ messages in thread From: Maxim Levitsky @ 2010-07-15 18:57 UTC (permalink / raw) To: Tantilov, Emil S Cc: Kirsher, Jeffrey T, netdev@vger.kernel.org, Allan, Bruce W, Pieper, Jeffrey E On Thu, 2010-07-15 at 02:33 +0300, Maxim Levitsky wrote: > On Wed, 2010-07-14 at 16:56 -0600, Tantilov, Emil S wrote: > > Maxim Levitsky wrote: > > > On Mon, 2010-07-12 at 15:23 -0600, Tantilov, Emil S wrote: > > >> Maxim Levitsky wrote: > > >>> On Mon, 2010-07-05 at 12:58 +0300, Maxim Levitsky wrote: > > >>>> On Mon, 2010-07-05 at 01:13 -0700, Jeff Kirsher wrote: > > >>>>> On Sun, Jul 4, 2010 at 15:48, Maxim Levitsky > > >>>>> <maximlevitsky@gmail.com> wrote: > > >>>>>> Did few guesses, and now I see that reverting the below commit > > >>>>>> fixes the problem. > > >>>>>> > > >>>>>> "e1000e: Fix/cleanup PHY reset code for ICHx/PCHx" > > >>>>>> e98cac447cc1cc418dff1d610a5c79c4f2bdec7f. > > >>>>>> > > >>>>>> > > >>>>>> Best regards, > > >>>>>> Maxim Levitsky > > >>>>>> > > >>>>>> -- > > >>>>> > > >>>>> Can you give us till Tuesday to respond? I know that there are > > >>>>> some additional e1000e patches in my queue, which may resolve the > > >>>>> issue, but this weekend the power is down to do some > > >>>>> infrastructure upgrades which prevents us from doing any > > >>>>> investigation.debugging until Tuesday. > > >>>>> > > >>>> > > >>>> Sure. > > >>>> > > >>>> Best regards, > > >>>> Maxim Levitsky > > >>>> > > >>> > > >>> Updates? > > >> > > >> We are working on reproducing the issue. So far we have not seen the > > >> problem when testing with net-next. > > >> > > >> I asked in previous email about some additional info from ethtool > > >> (-d, -e, -S) and kernel config. That would help us to narrow it > > >> down. > > >> > > >> Thanks, > > >> Emil > > > I did send -e and -d output. > > > > Sorry, looks like I lost the email with the attachements. > > > > Could you provide the output of dmesg after the failure occurs? > > > > > Since you probably want -S output during failure, I need to recompile > > > kernel for that. I will do that soon. > > > > > > > > > One question, in two weeks I hope 2.6.35 won't be released? > > > If so, I will have enough free time then to narrow down this issue. > > > > > > Other solution, is to revert this commit. > > > (I have never seen this problem with it reverted). > > > > We have been running reboot tests on 2 separate systems with recent net-next kernels > > using your config and so far no luck in reproducing this issue. > > > > What is the make model of your system (or MB)? > > the motherboard is Intel DG965RY. > > However, I am using vanilla kernel. > net-next might contain further fixes. > > I see if net-next works here. Yep, net-next works here. I have the problem on vanilla kernel. Last revision of it, I tested is 2.6.35-rc4 exactly (815c4163b6c8ebf8152f42b0a5fd015cfdcedc78) Maybe vanilla git master works, I test it too soon. Best regards, Maxim Levitsky ^ permalink raw reply [flat|nested] 29+ messages in thread
* RE: [REGRESSION] e1000e stopped working [MANUALLY BISECTED] 2010-07-15 18:57 ` Maxim Levitsky @ 2010-07-15 19:02 ` Tantilov, Emil S 2010-07-15 19:09 ` Maxim Levitsky 0 siblings, 1 reply; 29+ messages in thread From: Tantilov, Emil S @ 2010-07-15 19:02 UTC (permalink / raw) To: Maxim Levitsky Cc: Kirsher, Jeffrey T, netdev@vger.kernel.org, Allan, Bruce W, Pieper, Jeffrey E Maxim Levitsky wrote: > On Thu, 2010-07-15 at 02:33 +0300, Maxim Levitsky wrote: >> On Wed, 2010-07-14 at 16:56 -0600, Tantilov, Emil S wrote: >>> Maxim Levitsky wrote: >>>> On Mon, 2010-07-12 at 15:23 -0600, Tantilov, Emil S wrote: >>>>> Maxim Levitsky wrote: >>>>>> On Mon, 2010-07-05 at 12:58 +0300, Maxim Levitsky wrote: >>>>>>> On Mon, 2010-07-05 at 01:13 -0700, Jeff Kirsher wrote: >>>>>>>> On Sun, Jul 4, 2010 at 15:48, Maxim Levitsky >>>>>>>> <maximlevitsky@gmail.com> wrote: >>>>>>>>> Did few guesses, and now I see that reverting the below >>>>>>>>> commit fixes the problem. >>>>>>>>> >>>>>>>>> "e1000e: Fix/cleanup PHY reset code for ICHx/PCHx" >>>>>>>>> e98cac447cc1cc418dff1d610a5c79c4f2bdec7f. >>>>>>>>> >>>>>>>>> >>>>>>>>> Best regards, >>>>>>>>> Maxim Levitsky >>>>>>>>> >>>>>>>>> -- >>>>>>>> >>>>>>>> Can you give us till Tuesday to respond? I know that there are >>>>>>>> some additional e1000e patches in my queue, which may resolve >>>>>>>> the issue, but this weekend the power is down to do some >>>>>>>> infrastructure upgrades which prevents us from doing any >>>>>>>> investigation.debugging until Tuesday. >>>>>>>> >>>>>>> >>>>>>> Sure. >>>>>>> >>>>>>> Best regards, >>>>>>> Maxim Levitsky >>>>>>> >>>>>> >>>>>> Updates? >>>>> >>>>> We are working on reproducing the issue. So far we have not seen >>>>> the problem when testing with net-next. >>>>> >>>>> I asked in previous email about some additional info from ethtool >>>>> (-d, -e, -S) and kernel config. That would help us to narrow it >>>>> down. >>>>> >>>>> Thanks, >>>>> Emil >>>> I did send -e and -d output. >>> >>> Sorry, looks like I lost the email with the attachements. >>> >>> Could you provide the output of dmesg after the failure occurs? >>> >>>> Since you probably want -S output during failure, I need to >>>> recompile kernel for that. I will do that soon. >>>> >>>> >>>> One question, in two weeks I hope 2.6.35 won't be released? >>>> If so, I will have enough free time then to narrow down this issue. >>>> >>>> Other solution, is to revert this commit. >>>> (I have never seen this problem with it reverted). >>> >>> We have been running reboot tests on 2 separate systems with recent >>> net-next kernels using your config and so far no luck in >>> reproducing this issue. >>> >>> What is the make model of your system (or MB)? >> >> the motherboard is Intel DG965RY. >> >> However, I am using vanilla kernel. >> net-next might contain further fixes. >> >> I see if net-next works here. > > Yep, net-next works here. > > > I have the problem on vanilla kernel. > Last revision of it, I tested is 2.6.35-rc4 exactly > (815c4163b6c8ebf8152f42b0a5fd015cfdcedc78) > > > Maybe vanilla git master works, I test it too soon. Thanks for the information! Good to know that this issue does not exist in the latest branch. Have you by any chance tested a stable branch (2.6.34.x)? > > > Best regards, > Maxim Levitsky Thanks, Emil ^ permalink raw reply [flat|nested] 29+ messages in thread
* RE: [REGRESSION] e1000e stopped working [MANUALLY BISECTED] 2010-07-15 19:02 ` Tantilov, Emil S @ 2010-07-15 19:09 ` Maxim Levitsky 2010-07-16 19:25 ` Maxim Levitsky 0 siblings, 1 reply; 29+ messages in thread From: Maxim Levitsky @ 2010-07-15 19:09 UTC (permalink / raw) To: Tantilov, Emil S Cc: Kirsher, Jeffrey T, netdev@vger.kernel.org, Allan, Bruce W, Pieper, Jeffrey E On Thu, 2010-07-15 at 13:02 -0600, Tantilov, Emil S wrote: > Maxim Levitsky wrote: > > On Thu, 2010-07-15 at 02:33 +0300, Maxim Levitsky wrote: > >> On Wed, 2010-07-14 at 16:56 -0600, Tantilov, Emil S wrote: > >>> Maxim Levitsky wrote: > >>>> On Mon, 2010-07-12 at 15:23 -0600, Tantilov, Emil S wrote: > >>>>> Maxim Levitsky wrote: > >>>>>> On Mon, 2010-07-05 at 12:58 +0300, Maxim Levitsky wrote: > >>>>>>> On Mon, 2010-07-05 at 01:13 -0700, Jeff Kirsher wrote: > >>>>>>>> On Sun, Jul 4, 2010 at 15:48, Maxim Levitsky > >>>>>>>> <maximlevitsky@gmail.com> wrote: > >>>>>>>>> Did few guesses, and now I see that reverting the below > >>>>>>>>> commit fixes the problem. > >>>>>>>>> > >>>>>>>>> "e1000e: Fix/cleanup PHY reset code for ICHx/PCHx" > >>>>>>>>> e98cac447cc1cc418dff1d610a5c79c4f2bdec7f. > >>>>>>>>> > >>>>>>>>> > >>>>>>>>> Best regards, > >>>>>>>>> Maxim Levitsky > >>>>>>>>> > >>>>>>>>> -- > >>>>>>>> > >>>>>>>> Can you give us till Tuesday to respond? I know that there are > >>>>>>>> some additional e1000e patches in my queue, which may resolve > >>>>>>>> the issue, but this weekend the power is down to do some > >>>>>>>> infrastructure upgrades which prevents us from doing any > >>>>>>>> investigation.debugging until Tuesday. > >>>>>>>> > >>>>>>> > >>>>>>> Sure. > >>>>>>> > >>>>>>> Best regards, > >>>>>>> Maxim Levitsky > >>>>>>> > >>>>>> > >>>>>> Updates? > >>>>> > >>>>> We are working on reproducing the issue. So far we have not seen > >>>>> the problem when testing with net-next. > >>>>> > >>>>> I asked in previous email about some additional info from ethtool > >>>>> (-d, -e, -S) and kernel config. That would help us to narrow it > >>>>> down. > >>>>> > >>>>> Thanks, > >>>>> Emil > >>>> I did send -e and -d output. > >>> > >>> Sorry, looks like I lost the email with the attachements. > >>> > >>> Could you provide the output of dmesg after the failure occurs? > >>> > >>>> Since you probably want -S output during failure, I need to > >>>> recompile kernel for that. I will do that soon. > >>>> > >>>> > >>>> One question, in two weeks I hope 2.6.35 won't be released? > >>>> If so, I will have enough free time then to narrow down this issue. > >>>> > >>>> Other solution, is to revert this commit. > >>>> (I have never seen this problem with it reverted). > >>> > >>> We have been running reboot tests on 2 separate systems with recent > >>> net-next kernels using your config and so far no luck in > >>> reproducing this issue. > >>> > >>> What is the make model of your system (or MB)? > >> > >> the motherboard is Intel DG965RY. > >> > >> However, I am using vanilla kernel. > >> net-next might contain further fixes. > >> > >> I see if net-next works here. > > > > Yep, net-next works here. > > > > > > I have the problem on vanilla kernel. > > Last revision of it, I tested is 2.6.35-rc4 exactly > > (815c4163b6c8ebf8152f42b0a5fd015cfdcedc78) > > > > > > Maybe vanilla git master works, I test it too soon. > > Thanks for the information! Good to know that this issue does not exist in the latest branch. > > Have you by any chance tested a stable branch (2.6.34.x)? I only did test plain 2.6.34 (v2.6.34) Also I repeat that revert of e98cac447cc1cc418dff1d610a5c79c4f2bdec7f (e1000e: Fix/cleanup PHY reset code for ICHx/PCHx) fixes the bug on vanilla kernel. Also I just pulled latest vanilla git, and I according to diffstat I see no changes in e1000e, so its likely that bug remains there. I will test that soon. Best regards, Maxim Levitsky ^ permalink raw reply [flat|nested] 29+ messages in thread
* RE: [REGRESSION] e1000e stopped working [MANUALLY BISECTED] 2010-07-15 19:09 ` Maxim Levitsky @ 2010-07-16 19:25 ` Maxim Levitsky 2010-07-16 23:23 ` Tantilov, Emil S 0 siblings, 1 reply; 29+ messages in thread From: Maxim Levitsky @ 2010-07-16 19:25 UTC (permalink / raw) To: Tantilov, Emil S Cc: Kirsher, Jeffrey T, netdev@vger.kernel.org, Allan, Bruce W, Pieper, Jeffrey E On Thu, 2010-07-15 at 22:09 +0300, Maxim Levitsky wrote: > On Thu, 2010-07-15 at 13:02 -0600, Tantilov, Emil S wrote: > > Maxim Levitsky wrote: > > > On Thu, 2010-07-15 at 02:33 +0300, Maxim Levitsky wrote: > > >> On Wed, 2010-07-14 at 16:56 -0600, Tantilov, Emil S wrote: > > >>> Maxim Levitsky wrote: > > >>>> On Mon, 2010-07-12 at 15:23 -0600, Tantilov, Emil S wrote: > > >>>>> Maxim Levitsky wrote: > > >>>>>> On Mon, 2010-07-05 at 12:58 +0300, Maxim Levitsky wrote: > > >>>>>>> On Mon, 2010-07-05 at 01:13 -0700, Jeff Kirsher wrote: > > >>>>>>>> On Sun, Jul 4, 2010 at 15:48, Maxim Levitsky > > >>>>>>>> <maximlevitsky@gmail.com> wrote: > > >>>>>>>>> Did few guesses, and now I see that reverting the below > > >>>>>>>>> commit fixes the problem. > > >>>>>>>>> > > >>>>>>>>> "e1000e: Fix/cleanup PHY reset code for ICHx/PCHx" > > >>>>>>>>> e98cac447cc1cc418dff1d610a5c79c4f2bdec7f. > > >>>>>>>>> > > >>>>>>>>> > > >>>>>>>>> Best regards, > > >>>>>>>>> Maxim Levitsky > > >>>>>>>>> > > >>>>>>>>> -- > > >>>>>>>> > > >>>>>>>> Can you give us till Tuesday to respond? I know that there are > > >>>>>>>> some additional e1000e patches in my queue, which may resolve > > >>>>>>>> the issue, but this weekend the power is down to do some > > >>>>>>>> infrastructure upgrades which prevents us from doing any > > >>>>>>>> investigation.debugging until Tuesday. > > >>>>>>>> > > >>>>>>> > > >>>>>>> Sure. > > >>>>>>> > > >>>>>>> Best regards, > > >>>>>>> Maxim Levitsky > > >>>>>>> > > >>>>>> > > >>>>>> Updates? > > >>>>> > > >>>>> We are working on reproducing the issue. So far we have not seen > > >>>>> the problem when testing with net-next. > > >>>>> > > >>>>> I asked in previous email about some additional info from ethtool > > >>>>> (-d, -e, -S) and kernel config. That would help us to narrow it > > >>>>> down. > > >>>>> > > >>>>> Thanks, > > >>>>> Emil > > >>>> I did send -e and -d output. > > >>> > > >>> Sorry, looks like I lost the email with the attachements. > > >>> > > >>> Could you provide the output of dmesg after the failure occurs? > > >>> > > >>>> Since you probably want -S output during failure, I need to > > >>>> recompile kernel for that. I will do that soon. > > >>>> > > >>>> > > >>>> One question, in two weeks I hope 2.6.35 won't be released? > > >>>> If so, I will have enough free time then to narrow down this issue. > > >>>> > > >>>> Other solution, is to revert this commit. > > >>>> (I have never seen this problem with it reverted). > > >>> > > >>> We have been running reboot tests on 2 separate systems with recent > > >>> net-next kernels using your config and so far no luck in > > >>> reproducing this issue. > > >>> > > >>> What is the make model of your system (or MB)? > > >> > > >> the motherboard is Intel DG965RY. > > >> > > >> However, I am using vanilla kernel. > > >> net-next might contain further fixes. > > >> > > >> I see if net-next works here. > > > > > > Yep, net-next works here. > > > > > > > > > I have the problem on vanilla kernel. > > > Last revision of it, I tested is 2.6.35-rc4 exactly > > > (815c4163b6c8ebf8152f42b0a5fd015cfdcedc78) > > > > > > > > > Maybe vanilla git master works, I test it too soon. > > > > Thanks for the information! Good to know that this issue does not exist in the latest branch. > > > > Have you by any chance tested a stable branch (2.6.34.x)? > > I only did test plain 2.6.34 (v2.6.34) And forgot to add, that it did work. > > Also I repeat that revert of e98cac447cc1cc418dff1d610a5c79c4f2bdec7f > (e1000e: Fix/cleanup PHY reset code for ICHx/PCHx) fixes the bug on > vanilla kernel. > > Also I just pulled latest vanilla git, and I according to diffstat I see > no changes in e1000e, so its likely that bug remains there. > I will test that soon. Tested, broken as expected. Best regards, Maxim Levitsky ^ permalink raw reply [flat|nested] 29+ messages in thread
* RE: [REGRESSION] e1000e stopped working [MANUALLY BISECTED] 2010-07-16 19:25 ` Maxim Levitsky @ 2010-07-16 23:23 ` Tantilov, Emil S 2010-07-17 13:54 ` Maxim Levitsky 0 siblings, 1 reply; 29+ messages in thread From: Tantilov, Emil S @ 2010-07-16 23:23 UTC (permalink / raw) To: Maxim Levitsky Cc: Kirsher, Jeffrey T, netdev@vger.kernel.org, Allan, Bruce W, Pieper, Jeffrey E Maxim Levitsky wrote: > On Thu, 2010-07-15 at 22:09 +0300, Maxim Levitsky wrote: >> On Thu, 2010-07-15 at 13:02 -0600, Tantilov, Emil S wrote: >>> Maxim Levitsky wrote: >>>> On Thu, 2010-07-15 at 02:33 +0300, Maxim Levitsky wrote: >>>>> On Wed, 2010-07-14 at 16:56 -0600, Tantilov, Emil S wrote: >>>>>> Maxim Levitsky wrote: >>>>>>> On Mon, 2010-07-12 at 15:23 -0600, Tantilov, Emil S wrote: >>>>>>>> Maxim Levitsky wrote: >>>>>>>>> On Mon, 2010-07-05 at 12:58 +0300, Maxim Levitsky wrote: >>>>>>>>>> On Mon, 2010-07-05 at 01:13 -0700, Jeff Kirsher wrote: >>>>>>>>>>> On Sun, Jul 4, 2010 at 15:48, Maxim Levitsky >>>>>>>>>>> <maximlevitsky@gmail.com> wrote: >>>>>>>>>>>> Did few guesses, and now I see that reverting the below >>>>>>>>>>>> commit fixes the problem. >>>>>>>>>>>> >>>>>>>>>>>> "e1000e: Fix/cleanup PHY reset code for ICHx/PCHx" >>>>>>>>>>>> e98cac447cc1cc418dff1d610a5c79c4f2bdec7f. >>>>>>>>>>>> >>>>>>>>>>>> >>>>>>>>>>>> Best regards, >>>>>>>>>>>> Maxim Levitsky >>>>>>>>>>>> >>>>>>>>>>>> -- >>>>>>>>>>> >>>>>>>>>>> Can you give us till Tuesday to respond? I know that there >>>>>>>>>>> are some additional e1000e patches in my queue, which may >>>>>>>>>>> resolve the issue, but this weekend the power is down to do >>>>>>>>>>> some infrastructure upgrades which prevents us from doing >>>>>>>>>>> any investigation.debugging until Tuesday. >>>>>>>>>>> >>>>>>>>>> >>>>>>>>>> Sure. >>>>>>>>>> >>>>>>>>>> Best regards, >>>>>>>>>> Maxim Levitsky >>>>>>>>>> >>>>>>>>> >>>>>>>>> Updates? >>>>>>>> >>>>>>>> We are working on reproducing the issue. So far we have not >>>>>>>> seen the problem when testing with net-next. >>>>>>>> >>>>>>>> I asked in previous email about some additional info from >>>>>>>> ethtool (-d, -e, -S) and kernel config. That would help us to >>>>>>>> narrow it down. >>>>>>>> >>>>>>>> Thanks, >>>>>>>> Emil >>>>>>> I did send -e and -d output. >>>>>> >>>>>> Sorry, looks like I lost the email with the attachements. >>>>>> >>>>>> Could you provide the output of dmesg after the failure occurs? >>>>>> >>>>>>> Since you probably want -S output during failure, I need to >>>>>>> recompile kernel for that. I will do that soon. >>>>>>> >>>>>>> >>>>>>> One question, in two weeks I hope 2.6.35 won't be released? >>>>>>> If so, I will have enough free time then to narrow down this >>>>>>> issue. >>>>>>> >>>>>>> Other solution, is to revert this commit. >>>>>>> (I have never seen this problem with it reverted). >>>>>> >>>>>> We have been running reboot tests on 2 separate systems with >>>>>> recent net-next kernels using your config and so far no luck in >>>>>> reproducing this issue. >>>>>> >>>>>> What is the make model of your system (or MB)? >>>>> >>>>> the motherboard is Intel DG965RY. >>>>> >>>>> However, I am using vanilla kernel. >>>>> net-next might contain further fixes. >>>>> >>>>> I see if net-next works here. >>>> >>>> Yep, net-next works here. >>>> >>>> >>>> I have the problem on vanilla kernel. >>>> Last revision of it, I tested is 2.6.35-rc4 exactly >>>> (815c4163b6c8ebf8152f42b0a5fd015cfdcedc78) >>>> >>>> >>>> Maybe vanilla git master works, I test it too soon. >>> >>> Thanks for the information! Good to know that this issue does not >>> exist in the latest branch. >>> >>> Have you by any chance tested a stable branch (2.6.34.x)? >> >> I only did test plain 2.6.34 (v2.6.34) > And forgot to add, that it did work. > >> >> Also I repeat that revert of e98cac447cc1cc418dff1d610a5c79c4f2bdec7f >> (e1000e: Fix/cleanup PHY reset code for ICHx/PCHx) fixes the bug on >> vanilla kernel. >> >> Also I just pulled latest vanilla git, and I according to diffstat I >> see no changes in e1000e, so its likely that bug remains there. >> I will test that soon. > Tested, broken as expected. That makes sense. Unfortunately we are still not able to reproduce even on recent pull from Linus tree. If you want - you can look at the patches for e1000e in net-next and start applying those to your tree until the issue is resolved. I will keep trying it here, but none of the systems we have exhibit the issue you described, so the bug could be exposed by something in your system/config. Thanks, Emil ^ permalink raw reply [flat|nested] 29+ messages in thread
* RE: [REGRESSION] e1000e stopped working [MANUALLY BISECTED] 2010-07-16 23:23 ` Tantilov, Emil S @ 2010-07-17 13:54 ` Maxim Levitsky 2010-07-26 0:25 ` Maxim Levitsky 0 siblings, 1 reply; 29+ messages in thread From: Maxim Levitsky @ 2010-07-17 13:54 UTC (permalink / raw) To: Tantilov, Emil S Cc: Kirsher, Jeffrey T, netdev@vger.kernel.org, Allan, Bruce W, Pieper, Jeffrey E On Fri, 2010-07-16 at 17:23 -0600, Tantilov, Emil S wrote: > Maxim Levitsky wrote: > > On Thu, 2010-07-15 at 22:09 +0300, Maxim Levitsky wrote: > >> On Thu, 2010-07-15 at 13:02 -0600, Tantilov, Emil S wrote: > >>> Maxim Levitsky wrote: > >>>> On Thu, 2010-07-15 at 02:33 +0300, Maxim Levitsky wrote: > >>>>> On Wed, 2010-07-14 at 16:56 -0600, Tantilov, Emil S wrote: > >>>>>> Maxim Levitsky wrote: > >>>>>>> On Mon, 2010-07-12 at 15:23 -0600, Tantilov, Emil S wrote: > >>>>>>>> Maxim Levitsky wrote: > >>>>>>>>> On Mon, 2010-07-05 at 12:58 +0300, Maxim Levitsky wrote: > >>>>>>>>>> On Mon, 2010-07-05 at 01:13 -0700, Jeff Kirsher wrote: > >>>>>>>>>>> On Sun, Jul 4, 2010 at 15:48, Maxim Levitsky > >>>>>>>>>>> <maximlevitsky@gmail.com> wrote: > >>>>>>>>>>>> Did few guesses, and now I see that reverting the below > >>>>>>>>>>>> commit fixes the problem. > >>>>>>>>>>>> > >>>>>>>>>>>> "e1000e: Fix/cleanup PHY reset code for ICHx/PCHx" > >>>>>>>>>>>> e98cac447cc1cc418dff1d610a5c79c4f2bdec7f. > >>>>>>>>>>>> > >>>>>>>>>>>> > >>>>>>>>>>>> Best regards, > >>>>>>>>>>>> Maxim Levitsky > >>>>>>>>>>>> > >>>>>>>>>>>> -- > >>>>>>>>>>> > >>>>>>>>>>> Can you give us till Tuesday to respond? I know that there > >>>>>>>>>>> are some additional e1000e patches in my queue, which may > >>>>>>>>>>> resolve the issue, but this weekend the power is down to do > >>>>>>>>>>> some infrastructure upgrades which prevents us from doing > >>>>>>>>>>> any investigation.debugging until Tuesday. > >>>>>>>>>>> > >>>>>>>>>> > >>>>>>>>>> Sure. > >>>>>>>>>> > >>>>>>>>>> Best regards, > >>>>>>>>>> Maxim Levitsky > >>>>>>>>>> > >>>>>>>>> > >>>>>>>>> Updates? > >>>>>>>> > >>>>>>>> We are working on reproducing the issue. So far we have not > >>>>>>>> seen the problem when testing with net-next. > >>>>>>>> > >>>>>>>> I asked in previous email about some additional info from > >>>>>>>> ethtool (-d, -e, -S) and kernel config. That would help us to > >>>>>>>> narrow it down. > >>>>>>>> > >>>>>>>> Thanks, > >>>>>>>> Emil > >>>>>>> I did send -e and -d output. > >>>>>> > >>>>>> Sorry, looks like I lost the email with the attachements. > >>>>>> > >>>>>> Could you provide the output of dmesg after the failure occurs? > >>>>>> > >>>>>>> Since you probably want -S output during failure, I need to > >>>>>>> recompile kernel for that. I will do that soon. > >>>>>>> > >>>>>>> > >>>>>>> One question, in two weeks I hope 2.6.35 won't be released? > >>>>>>> If so, I will have enough free time then to narrow down this > >>>>>>> issue. > >>>>>>> > >>>>>>> Other solution, is to revert this commit. > >>>>>>> (I have never seen this problem with it reverted). > >>>>>> > >>>>>> We have been running reboot tests on 2 separate systems with > >>>>>> recent net-next kernels using your config and so far no luck in > >>>>>> reproducing this issue. > >>>>>> > >>>>>> What is the make model of your system (or MB)? > >>>>> > >>>>> the motherboard is Intel DG965RY. > >>>>> > >>>>> However, I am using vanilla kernel. > >>>>> net-next might contain further fixes. > >>>>> > >>>>> I see if net-next works here. > >>>> > >>>> Yep, net-next works here. > >>>> > >>>> > >>>> I have the problem on vanilla kernel. > >>>> Last revision of it, I tested is 2.6.35-rc4 exactly > >>>> (815c4163b6c8ebf8152f42b0a5fd015cfdcedc78) > >>>> > >>>> > >>>> Maybe vanilla git master works, I test it too soon. > >>> > >>> Thanks for the information! Good to know that this issue does not > >>> exist in the latest branch. > >>> > >>> Have you by any chance tested a stable branch (2.6.34.x)? > >> > >> I only did test plain 2.6.34 (v2.6.34) > > And forgot to add, that it did work. > > > >> > >> Also I repeat that revert of e98cac447cc1cc418dff1d610a5c79c4f2bdec7f > >> (e1000e: Fix/cleanup PHY reset code for ICHx/PCHx) fixes the bug on > >> vanilla kernel. > >> > >> Also I just pulled latest vanilla git, and I according to diffstat I > >> see no changes in e1000e, so its likely that bug remains there. > >> I will test that soon. > > Tested, broken as expected. > > That makes sense. Unfortunately we are still not able to reproduce even on recent pull from Linus tree. > > If you want - you can look at the patches for e1000e in net-next and start applying those to your tree until the issue is resolved. > That exactly what I will do soon. Also I can narrow down the problem by reverting the commit partially. After one week, I will have enough free time to do all the thing like above. Now I have none. > I will keep trying it here, but none of the systems we have exhibit the issue you described, so the bug could be exposed by something in your system/config. I also think so. Otherwise, we would see more bug-reports. You probably don't need to try anymore and reproduce that issue, because of that. Best regards, Maxim Levitsky ^ permalink raw reply [flat|nested] 29+ messages in thread
* RE: [REGRESSION] e1000e stopped working [MANUALLY BISECTED] 2010-07-17 13:54 ` Maxim Levitsky @ 2010-07-26 0:25 ` Maxim Levitsky 2010-07-28 7:04 ` Maxim Levitsky 0 siblings, 1 reply; 29+ messages in thread From: Maxim Levitsky @ 2010-07-26 0:25 UTC (permalink / raw) To: Tantilov, Emil S Cc: Kirsher, Jeffrey T, netdev@vger.kernel.org, Allan, Bruce W, Pieper, Jeffrey E On Sat, 2010-07-17 at 16:54 +0300, Maxim Levitsky wrote: > On Fri, 2010-07-16 at 17:23 -0600, Tantilov, Emil S wrote: > > Maxim Levitsky wrote: > > > On Thu, 2010-07-15 at 22:09 +0300, Maxim Levitsky wrote: > > >> On Thu, 2010-07-15 at 13:02 -0600, Tantilov, Emil S wrote: > > >>> Maxim Levitsky wrote: > > >>>> On Thu, 2010-07-15 at 02:33 +0300, Maxim Levitsky wrote: > > >>>>> On Wed, 2010-07-14 at 16:56 -0600, Tantilov, Emil S wrote: > > >>>>>> Maxim Levitsky wrote: > > >>>>>>> On Mon, 2010-07-12 at 15:23 -0600, Tantilov, Emil S wrote: > > >>>>>>>> Maxim Levitsky wrote: > > >>>>>>>>> On Mon, 2010-07-05 at 12:58 +0300, Maxim Levitsky wrote: > > >>>>>>>>>> On Mon, 2010-07-05 at 01:13 -0700, Jeff Kirsher wrote: > > >>>>>>>>>>> On Sun, Jul 4, 2010 at 15:48, Maxim Levitsky > > >>>>>>>>>>> <maximlevitsky@gmail.com> wrote: > > >>>>>>>>>>>> Did few guesses, and now I see that reverting the below > > >>>>>>>>>>>> commit fixes the problem. > > >>>>>>>>>>>> > > >>>>>>>>>>>> "e1000e: Fix/cleanup PHY reset code for ICHx/PCHx" > > >>>>>>>>>>>> e98cac447cc1cc418dff1d610a5c79c4f2bdec7f. > > >>>>>>>>>>>> > > >>>>>>>>>>>> > > >>>>>>>>>>>> Best regards, > > >>>>>>>>>>>> Maxim Levitsky > > >>>>>>>>>>>> > > >>>>>>>>>>>> -- > > >>>>>>>>>>> > > >>>>>>>>>>> Can you give us till Tuesday to respond? I know that there > > >>>>>>>>>>> are some additional e1000e patches in my queue, which may > > >>>>>>>>>>> resolve the issue, but this weekend the power is down to do > > >>>>>>>>>>> some infrastructure upgrades which prevents us from doing > > >>>>>>>>>>> any investigation.debugging until Tuesday. > > >>>>>>>>>>> > > >>>>>>>>>> > > >>>>>>>>>> Sure. > > >>>>>>>>>> > > >>>>>>>>>> Best regards, > > >>>>>>>>>> Maxim Levitsky > > >>>>>>>>>> > > >>>>>>>>> > > >>>>>>>>> Updates? > > >>>>>>>> > > >>>>>>>> We are working on reproducing the issue. So far we have not > > >>>>>>>> seen the problem when testing with net-next. > > >>>>>>>> > > >>>>>>>> I asked in previous email about some additional info from > > >>>>>>>> ethtool (-d, -e, -S) and kernel config. That would help us to > > >>>>>>>> narrow it down. > > >>>>>>>> > > >>>>>>>> Thanks, > > >>>>>>>> Emil > > >>>>>>> I did send -e and -d output. > > >>>>>> > > >>>>>> Sorry, looks like I lost the email with the attachements. > > >>>>>> > > >>>>>> Could you provide the output of dmesg after the failure occurs? > > >>>>>> > > >>>>>>> Since you probably want -S output during failure, I need to > > >>>>>>> recompile kernel for that. I will do that soon. > > >>>>>>> > > >>>>>>> > > >>>>>>> One question, in two weeks I hope 2.6.35 won't be released? > > >>>>>>> If so, I will have enough free time then to narrow down this > > >>>>>>> issue. > > >>>>>>> > > >>>>>>> Other solution, is to revert this commit. > > >>>>>>> (I have never seen this problem with it reverted). > > >>>>>> > > >>>>>> We have been running reboot tests on 2 separate systems with > > >>>>>> recent net-next kernels using your config and so far no luck in > > >>>>>> reproducing this issue. > > >>>>>> > > >>>>>> What is the make model of your system (or MB)? > > >>>>> > > >>>>> the motherboard is Intel DG965RY. > > >>>>> > > >>>>> However, I am using vanilla kernel. > > >>>>> net-next might contain further fixes. > > >>>>> > > >>>>> I see if net-next works here. > > >>>> > > >>>> Yep, net-next works here. > > >>>> > > >>>> > > >>>> I have the problem on vanilla kernel. > > >>>> Last revision of it, I tested is 2.6.35-rc4 exactly > > >>>> (815c4163b6c8ebf8152f42b0a5fd015cfdcedc78) > > >>>> > > >>>> > > >>>> Maybe vanilla git master works, I test it too soon. > > >>> > > >>> Thanks for the information! Good to know that this issue does not > > >>> exist in the latest branch. > > >>> > > >>> Have you by any chance tested a stable branch (2.6.34.x)? > > >> > > >> I only did test plain 2.6.34 (v2.6.34) > > > And forgot to add, that it did work. > > > > > >> > > >> Also I repeat that revert of e98cac447cc1cc418dff1d610a5c79c4f2bdec7f > > >> (e1000e: Fix/cleanup PHY reset code for ICHx/PCHx) fixes the bug on > > >> vanilla kernel. > > >> > > >> Also I just pulled latest vanilla git, and I according to diffstat I > > >> see no changes in e1000e, so its likely that bug remains there. > > >> I will test that soon. > > > Tested, broken as expected. > > > > That makes sense. Unfortunately we are still not able to reproduce even on recent pull from Linus tree. > > > > If you want - you can look at the patches for e1000e in net-next and start applying those to your tree until the issue is resolved. > > > That exactly what I will do soon. > > > Also I can narrow down the problem by reverting the commit partially. > > After one week, I will have enough free time to do all the thing like > above. Now I have none. > > > > I will keep trying it here, but none of the systems we have exhibit the issue you described, so the bug could be exposed by something in your system/config. > I also think so. Otherwise, we would see more bug-reports. > > You probably don't need to try anymore and reproduce that issue, because > of that. > This commit, present in net-next, solves the problem: commit 1286950690f0f82ffa504e1e149ee3fdb4c51478 Author: Bruce Allan <bruce.w.allan@intel.com> Date: Mon Jul 26 03:19:38 2010 +0300 e1000e: cleanup e1000_sw_lcd_config_ich8lan() Do not acquire and release the PHY unnecessarily for parts that return from this workaround without actually accessing the PHY registers. Signed-off-by: Bruce Allan <bruce.w.allan@intel.com> Tested-by: Jeff Pieper <jeffrey.e.pieper@intel.com> Signed-off-by: Jeff Kirsher <jeffrey.t.kirsher@intel.com> Signed-off-by: David S. Miller <davem@davemloft.net> Also, the above patch is part of whole series of patches with scary descriptions (that is these fix bugs). If I were you I would send them to Linus for 2.6.35 inclusion too. Best regards, Maxim Levitsky ^ permalink raw reply [flat|nested] 29+ messages in thread
* RE: [REGRESSION] e1000e stopped working [MANUALLY BISECTED] 2010-07-26 0:25 ` Maxim Levitsky @ 2010-07-28 7:04 ` Maxim Levitsky 2010-07-29 1:10 ` Jeff Kirsher 0 siblings, 1 reply; 29+ messages in thread From: Maxim Levitsky @ 2010-07-28 7:04 UTC (permalink / raw) To: Tantilov, Emil S Cc: Kirsher, Jeffrey T, netdev@vger.kernel.org, Allan, Bruce W, Pieper, Jeffrey E On Mon, 2010-07-26 at 03:25 +0300, Maxim Levitsky wrote: > On Sat, 2010-07-17 at 16:54 +0300, Maxim Levitsky wrote: > > On Fri, 2010-07-16 at 17:23 -0600, Tantilov, Emil S wrote: > > > Maxim Levitsky wrote: > > > > On Thu, 2010-07-15 at 22:09 +0300, Maxim Levitsky wrote: > > > >> On Thu, 2010-07-15 at 13:02 -0600, Tantilov, Emil S wrote: > > > >>> Maxim Levitsky wrote: > > > >>>> On Thu, 2010-07-15 at 02:33 +0300, Maxim Levitsky wrote: > > > >>>>> On Wed, 2010-07-14 at 16:56 -0600, Tantilov, Emil S wrote: > > > >>>>>> Maxim Levitsky wrote: > > > >>>>>>> On Mon, 2010-07-12 at 15:23 -0600, Tantilov, Emil S wrote: > > > >>>>>>>> Maxim Levitsky wrote: > > > >>>>>>>>> On Mon, 2010-07-05 at 12:58 +0300, Maxim Levitsky wrote: > > > >>>>>>>>>> On Mon, 2010-07-05 at 01:13 -0700, Jeff Kirsher wrote: > > > >>>>>>>>>>> On Sun, Jul 4, 2010 at 15:48, Maxim Levitsky > > > >>>>>>>>>>> <maximlevitsky@gmail.com> wrote: > > > >>>>>>>>>>>> Did few guesses, and now I see that reverting the below > > > >>>>>>>>>>>> commit fixes the problem. > > > >>>>>>>>>>>> > > > >>>>>>>>>>>> "e1000e: Fix/cleanup PHY reset code for ICHx/PCHx" > > > >>>>>>>>>>>> e98cac447cc1cc418dff1d610a5c79c4f2bdec7f. > > > >>>>>>>>>>>> > > > >>>>>>>>>>>> > > > >>>>>>>>>>>> Best regards, > > > >>>>>>>>>>>> Maxim Levitsky > > > >>>>>>>>>>>> > > > >>>>>>>>>>>> -- > > > >>>>>>>>>>> > > > >>>>>>>>>>> Can you give us till Tuesday to respond? I know that there > > > >>>>>>>>>>> are some additional e1000e patches in my queue, which may > > > >>>>>>>>>>> resolve the issue, but this weekend the power is down to do > > > >>>>>>>>>>> some infrastructure upgrades which prevents us from doing > > > >>>>>>>>>>> any investigation.debugging until Tuesday. > > > >>>>>>>>>>> > > > >>>>>>>>>> > > > >>>>>>>>>> Sure. > > > >>>>>>>>>> > > > >>>>>>>>>> Best regards, > > > >>>>>>>>>> Maxim Levitsky > > > >>>>>>>>>> > > > >>>>>>>>> > > > >>>>>>>>> Updates? > > > >>>>>>>> > > > >>>>>>>> We are working on reproducing the issue. So far we have not > > > >>>>>>>> seen the problem when testing with net-next. > > > >>>>>>>> > > > >>>>>>>> I asked in previous email about some additional info from > > > >>>>>>>> ethtool (-d, -e, -S) and kernel config. That would help us to > > > >>>>>>>> narrow it down. > > > >>>>>>>> > > > >>>>>>>> Thanks, > > > >>>>>>>> Emil > > > >>>>>>> I did send -e and -d output. > > > >>>>>> > > > >>>>>> Sorry, looks like I lost the email with the attachements. > > > >>>>>> > > > >>>>>> Could you provide the output of dmesg after the failure occurs? > > > >>>>>> > > > >>>>>>> Since you probably want -S output during failure, I need to > > > >>>>>>> recompile kernel for that. I will do that soon. > > > >>>>>>> > > > >>>>>>> > > > >>>>>>> One question, in two weeks I hope 2.6.35 won't be released? > > > >>>>>>> If so, I will have enough free time then to narrow down this > > > >>>>>>> issue. > > > >>>>>>> > > > >>>>>>> Other solution, is to revert this commit. > > > >>>>>>> (I have never seen this problem with it reverted). > > > >>>>>> > > > >>>>>> We have been running reboot tests on 2 separate systems with > > > >>>>>> recent net-next kernels using your config and so far no luck in > > > >>>>>> reproducing this issue. > > > >>>>>> > > > >>>>>> What is the make model of your system (or MB)? > > > >>>>> > > > >>>>> the motherboard is Intel DG965RY. > > > >>>>> > > > >>>>> However, I am using vanilla kernel. > > > >>>>> net-next might contain further fixes. > > > >>>>> > > > >>>>> I see if net-next works here. > > > >>>> > > > >>>> Yep, net-next works here. > > > >>>> > > > >>>> > > > >>>> I have the problem on vanilla kernel. > > > >>>> Last revision of it, I tested is 2.6.35-rc4 exactly > > > >>>> (815c4163b6c8ebf8152f42b0a5fd015cfdcedc78) > > > >>>> > > > >>>> > > > >>>> Maybe vanilla git master works, I test it too soon. > > > >>> > > > >>> Thanks for the information! Good to know that this issue does not > > > >>> exist in the latest branch. > > > >>> > > > >>> Have you by any chance tested a stable branch (2.6.34.x)? > > > >> > > > >> I only did test plain 2.6.34 (v2.6.34) > > > > And forgot to add, that it did work. > > > > > > > >> > > > >> Also I repeat that revert of e98cac447cc1cc418dff1d610a5c79c4f2bdec7f > > > >> (e1000e: Fix/cleanup PHY reset code for ICHx/PCHx) fixes the bug on > > > >> vanilla kernel. > > > >> > > > >> Also I just pulled latest vanilla git, and I according to diffstat I > > > >> see no changes in e1000e, so its likely that bug remains there. > > > >> I will test that soon. > > > > Tested, broken as expected. > > > > > > That makes sense. Unfortunately we are still not able to reproduce even on recent pull from Linus tree. > > > > > > If you want - you can look at the patches for e1000e in net-next and start applying those to your tree until the issue is resolved. > > > > > That exactly what I will do soon. > > > > > > Also I can narrow down the problem by reverting the commit partially. > > > > After one week, I will have enough free time to do all the thing like > > above. Now I have none. > > > > > > > I will keep trying it here, but none of the systems we have exhibit the issue you described, so the bug could be exposed by something in your system/config. > > I also think so. Otherwise, we would see more bug-reports. > > > > You probably don't need to try anymore and reproduce that issue, because > > of that. > > > > > This commit, present in net-next, solves the problem: > > commit 1286950690f0f82ffa504e1e149ee3fdb4c51478 > Author: Bruce Allan <bruce.w.allan@intel.com> > Date: Mon Jul 26 03:19:38 2010 +0300 > > e1000e: cleanup e1000_sw_lcd_config_ich8lan() > > Do not acquire and release the PHY unnecessarily for parts that return > from this workaround without actually accessing the PHY registers. > > Signed-off-by: Bruce Allan <bruce.w.allan@intel.com> > Tested-by: Jeff Pieper <jeffrey.e.pieper@intel.com> > Signed-off-by: Jeff Kirsher <jeffrey.t.kirsher@intel.com> > Signed-off-by: David S. Miller <davem@davemloft.net> > > > > > Also, the above patch is part of whole series of patches with scary descriptions (that is these fix bugs). > If I were you I would send them to Linus for 2.6.35 inclusion too. > > Best regards, > Maxim Levitsky > > > ping ^ permalink raw reply [flat|nested] 29+ messages in thread
* Re: [REGRESSION] e1000e stopped working [MANUALLY BISECTED] 2010-07-28 7:04 ` Maxim Levitsky @ 2010-07-29 1:10 ` Jeff Kirsher 2010-08-01 2:08 ` Jeff Kirsher 0 siblings, 1 reply; 29+ messages in thread From: Jeff Kirsher @ 2010-07-29 1:10 UTC (permalink / raw) To: Maxim Levitsky Cc: Tantilov, Emil S, netdev@vger.kernel.org, Allan, Bruce W, Pieper, Jeffrey E On Wed, Jul 28, 2010 at 00:04, Maxim Levitsky <maximlevitsky@gmail.com> wrote: > On Mon, 2010-07-26 at 03:25 +0300, Maxim Levitsky wrote: >> >> This commit, present in net-next, solves the problem: >> >> commit 1286950690f0f82ffa504e1e149ee3fdb4c51478 >> Author: Bruce Allan <bruce.w.allan@intel.com> >> Date: Mon Jul 26 03:19:38 2010 +0300 >> >> e1000e: cleanup e1000_sw_lcd_config_ich8lan() >> >> Do not acquire and release the PHY unnecessarily for parts that return >> from this workaround without actually accessing the PHY registers. >> >> Signed-off-by: Bruce Allan <bruce.w.allan@intel.com> >> Tested-by: Jeff Pieper <jeffrey.e.pieper@intel.com> >> Signed-off-by: Jeff Kirsher <jeffrey.t.kirsher@intel.com> >> Signed-off-by: David S. Miller <davem@davemloft.net> >> >> >> >> >> Also, the above patch is part of whole series of patches with scary descriptions (that is these fix bugs). >> If I were you I would send them to Linus for 2.6.35 inclusion too. >> >> Best regards, >> Maxim Levitsky >> >> >> > ping > Sorry for the delayed response. I am working on the issue. Here is the problem I am having, the patch that fixes the issue you are seeing is fairly large and is a cleanup to the ich8 function, which as it stands now, would not be accepted into net-2.6 tree this late into the -rc cycle. So, what I looking at is, what specifically fixed the issue you are seeing that resides in that patch, and come up with a smaller (acceptable) patch that I can submit to net-2.6 now to resolve your issue. I have dedicated most of this evening to finding a resolution to your issue that will be acceptable for the net-2.6 tree. As you noted, there were several patches before this particular commit that may play some part in the resolution as well, and that is what I will be looking into. I greatly appreciate the hard work you have done to help us resolve this issue, and will make sure you get credit for any solution I put together to resolve this issue. -- Cheers, Jeff ^ permalink raw reply [flat|nested] 29+ messages in thread
* Re: [REGRESSION] e1000e stopped working [MANUALLY BISECTED] 2010-07-29 1:10 ` Jeff Kirsher @ 2010-08-01 2:08 ` Jeff Kirsher 0 siblings, 0 replies; 29+ messages in thread From: Jeff Kirsher @ 2010-08-01 2:08 UTC (permalink / raw) To: Maxim Levitsky Cc: Tantilov, Emil S, netdev@vger.kernel.org, Allan, Bruce W, Pieper, Jeffrey E On Wed, Jul 28, 2010 at 18:10, Jeff Kirsher <jeffrey.t.kirsher@intel.com> wrote: > On Wed, Jul 28, 2010 at 00:04, Maxim Levitsky <maximlevitsky@gmail.com> wrote: >> On Mon, 2010-07-26 at 03:25 +0300, Maxim Levitsky wrote: >>> >>> This commit, present in net-next, solves the problem: >>> >>> commit 1286950690f0f82ffa504e1e149ee3fdb4c51478 >>> Author: Bruce Allan <bruce.w.allan@intel.com> >>> Date: Mon Jul 26 03:19:38 2010 +0300 >>> >>> e1000e: cleanup e1000_sw_lcd_config_ich8lan() >>> >>> Do not acquire and release the PHY unnecessarily for parts that return >>> from this workaround without actually accessing the PHY registers. >>> >>> Signed-off-by: Bruce Allan <bruce.w.allan@intel.com> >>> Tested-by: Jeff Pieper <jeffrey.e.pieper@intel.com> >>> Signed-off-by: Jeff Kirsher <jeffrey.t.kirsher@intel.com> >>> Signed-off-by: David S. Miller <davem@davemloft.net> >>> >>> >>> >>> >>> Also, the above patch is part of whole series of patches with scary descriptions (that is these fix bugs). >>> If I were you I would send them to Linus for 2.6.35 inclusion too. >>> >>> Best regards, >>> Maxim Levitsky >>> >>> >>> >> ping >> > > Sorry for the delayed response. I am working on the issue. Here is > the problem I am having, the patch that fixes the issue you are seeing > is fairly large and is a cleanup to the ich8 function, which as it > stands now, would not be accepted into net-2.6 tree this late into the > -rc cycle. So, what I looking at is, what specifically fixed the > issue you are seeing that resides in that patch, and come up with a > smaller (acceptable) patch that I can submit to net-2.6 now to resolve > your issue. > > I have dedicated most of this evening to finding a resolution to your > issue that will be acceptable for the net-2.6 tree. As you noted, > there were several patches before this particular commit that may play > some part in the resolution as well, and that is what I will be > looking into. I greatly appreciate the hard work you have done to > help us resolve this issue, and will make sure you get credit for any > solution I put together to resolve this issue. > > -- > Cheers, > Jeff > To keep everyone informed... We have found the root cause for this issue with the help of Maxim, and will have a patch to fix the issue in the next couple of days. -- Cheers, Jeff ^ permalink raw reply [flat|nested] 29+ messages in thread
end of thread, other threads:[~2010-08-01 2:08 UTC | newest] Thread overview: 29+ messages (download: mbox.gz follow: Atom feed -- links below jump to the message on this page -- 2010-06-27 17:27 [REGRESSION] e1000e stopped working Maxim Levitsky 2010-06-27 17:29 ` Maxim Levitsky 2010-06-27 17:43 ` Maxim Levitsky 2010-06-27 17:47 ` Maxim Levitsky 2010-06-28 17:04 ` Allan, Bruce W 2010-06-28 17:14 ` Maxim Levitsky 2010-06-29 1:09 ` Allan, Bruce W 2010-06-29 10:32 ` Maxim Levitsky 2010-06-29 18:37 ` Tantilov, Emil S 2010-06-30 22:59 ` Maxim Levitsky 2010-07-04 0:41 ` Maxim Levitsky 2010-07-04 22:48 ` [REGRESSION] e1000e stopped working [MANUALLY BISECTED] Maxim Levitsky 2010-07-05 8:13 ` Jeff Kirsher 2010-07-05 9:58 ` Maxim Levitsky 2010-07-12 15:56 ` Maxim Levitsky 2010-07-12 21:23 ` Tantilov, Emil S 2010-07-13 0:38 ` Maxim Levitsky 2010-07-14 22:56 ` Tantilov, Emil S 2010-07-14 23:33 ` Maxim Levitsky 2010-07-15 18:57 ` Maxim Levitsky 2010-07-15 19:02 ` Tantilov, Emil S 2010-07-15 19:09 ` Maxim Levitsky 2010-07-16 19:25 ` Maxim Levitsky 2010-07-16 23:23 ` Tantilov, Emil S 2010-07-17 13:54 ` Maxim Levitsky 2010-07-26 0:25 ` Maxim Levitsky 2010-07-28 7:04 ` Maxim Levitsky 2010-07-29 1:10 ` Jeff Kirsher 2010-08-01 2:08 ` Jeff Kirsher
This is a public inbox, see mirroring instructions for how to clone and mirror all data and code used for this inbox; as well as URLs for NNTP newsgroup(s).