From mboxrd@z Thu Jan 1 00:00:00 1970 From: Arkadiusz =?utf-8?q?Mi=C5=9Bkiewicz?= Subject: Re: tg3: BMC stops responding in 3.0 Date: Fri, 30 Sep 2011 10:06:25 +0200 Message-ID: <201109301006.25590.a.miskiewicz@gmail.com> References: <201109232145.50449.a.miskiewicz@gmail.com> <201109262031.33956.a.miskiewicz@gmail.com> <20110929235701.GA9513@mcarlson.broadcom.com> Mime-Version: 1.0 Content-Type: Text/Plain; charset=utf-8 Content-Transfer-Encoding: QUOTED-PRINTABLE Cc: "Michael Chan" , "netdev@vger.kernel.org" To: "Matt Carlson" Return-path: Received: from mail-bw0-f46.google.com ([209.85.214.46]:59257 "EHLO mail-bw0-f46.google.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1757744Ab1I3IGc convert rfc822-to-8bit (ORCPT ); Fri, 30 Sep 2011 04:06:32 -0400 Received: by bkbzt4 with SMTP id zt4so1460194bkb.19 for ; Fri, 30 Sep 2011 01:06:30 -0700 (PDT) In-Reply-To: <20110929235701.GA9513@mcarlson.broadcom.com> Sender: netdev-owner@vger.kernel.org List-ID: On Friday 30 of September 2011, Matt Carlson wrote: > On Mon, Sep 26, 2011 at 11:31:33AM -0700, Arkadiusz Mi??kiewicz wrote= : > > On Monday 26 of September 2011, Matt Carlson wrote: > > > On Fri, Sep 23, 2011 at 12:45:50PM -0700, Arkadiusz Mi??kiewicz w= rote: > > > > Hi, > > > >=20 > > > > I was using 2.6.38.8 and recently tried to switch to 3.0.4 on T= yan > > > > S2891 platform. > > > >=20 > > > > This platform uses tg3: > > > > tg3 0000:0a:09.1: eth1: Tigon3 [partno(BCM95704) rev 2003] > > > > (PCIX:133MHz:64- bit) MAC address 00:e0:81:33:5e:af > > > > tg3 0000:0a:09.1: eth1: attached PHY is 5704 (10/100/1000Base-T > > > > Ethernet) (WireSpeed[1], EEE[0]) > > > > tg3 0000:0a:09.1: eth1: RXcsums[1] LinkChgREG[0] MIirq[0] ASF[0= ] > > > > TSOcap[1] tg3 0000:0a:09.1: eth1: dma_rwctrl[769f4000] > > > > dma_mask[64-bit] > > > >=20 > > > > With 2.6.38.8 everything was working fine. With 3.0.4 there is = a > > > > problem. As soon as tg3 module is loaded or eth0 configured (ca= n't > > > > tell which one since the machine is 400km away from me and I ha= ve no > > > > way to play with it other than ipmi or ssh) BMC stops respondin= g (so > > > > all ipmitool commands over LAN stop working). Normal tg3 activi= ty is > > > > not affected - I can ssh-in without a problem etc but ipmi over= lan > > > > doesn't work. > > > >=20 > > > > From ssh console "ipmitool lan print" works, shows data but for > > > > example after "ipmitool mc reset cold" it doesn't recover - ipm= itool > > > > returns "Invalid channel: 255". I have to reboot to 2.6.38.8 an= d > > > > then issue "ipmitool mc reset cold" to recover. > > > >=20 > > > > Any idea which tg3 change could break this? Can't bisect this d= ue > > > > remote access only. > > > >=20 > > > > I was hoping that maybe 9e975cc291d80d5e4562d6bed15ec171e896d69= b > > > > "tg3: Fix io failures after chip reset" will fix things for me = but no > > > > - this doesn't help. > > >=20 > > > What version of the tg3 driver are you working with? > >=20 > > The one in 3.0.4 kernel. I think it's 3.119 (at least modinfo says = so). >=20 > Unfortunately there were a lot of changes between 3.117 and 3.119(+). > Is there any way you can narrow down the gap? The machines are 400km away from me and it's hard to debug that way the= n=20 ipmi/network conectivity is in stake :-/ I could try some form of bisec= ting=20 but need to know if all git versions between 3.117 and 3.119 were known= to be=20 safe and working? I don't want to loose any conectivity to this machine= =2E I was going to try 2.6.39 but it looks like it also uses 3.117 driver. --=20 Arkadiusz Mi=C5=9Bkiewicz PLD/Linux Team arekm / maven.pl http://ftp.pld-linux.org/