From mboxrd@z Thu Jan 1 00:00:00 1970 Return-path: Received: from mga14.intel.com ([192.55.52.115]:52218 "EHLO mga14.intel.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1752256AbbDWJMx (ORCPT ); Thu, 23 Apr 2015 05:12:53 -0400 From: "Grumbach, Emmanuel" To: "jkosina@suse.cz" CC: "linux-wireless@vger.kernel.org" , "linux-kernel@vger.kernel.org" , "ilw@linux.intel.com" , "Berg, Johannes" Subject: Re: iwlwifi getting stuck with current Linus' tree (646da63172) Date: Thu, 23 Apr 2015 09:12:46 +0000 Message-ID: <1429780366.11859.1.camel@intel.com> (sfid-20150423_111300_928672_9C5D4A6F) References: <1429764440.4084.5.camel@intel.com> In-Reply-To: Content-Type: text/plain; charset="utf-8" MIME-Version: 1.0 Sender: linux-wireless-owner@vger.kernel.org List-ID: T24gVGh1LCAyMDE1LTA0LTIzIGF0IDEwOjE1ICswMjAwLCBKaXJpIEtvc2luYSB3cm90ZToNCj4g T24gVGh1LCAyMyBBcHIgMjAxNSwgR3J1bWJhY2gsIEVtbWFudWVsIHdyb3RlOg0KPiANCj4gPiA+ IEkndmUgYmVlbiBydW5uaW5nIGN1cnJlbnQgTGludXMnIHRyZWUgYW5kIGhhdmUgYmVlbiBnZXR0 aW5nIHN5c3RlbSBsb2NrdXBzIA0KPiA+ID4gZnJlcXVlbnRseS4gQWZ0ZXIgYSBmZXcgInNpbGVu dCIgbG9ja3VwcywgSSB3YXMgYWJsZSB0byBvYnRhaW4gYSBkbWVzZyANCj4gPiA+IGJlZm9yZSB0 aGUgbWFjaGluZSB0dXJuZWQgZGVhZCBhZ2FpbiAod2lmaSBzdG9wcGVkIHdvcmtpbmcgc2hvcnRs eSBiZWZvcmUgDQo+ID4gPiB0aGF0KS4NCj4gPiA+IA0KPiA+ID4gQmVmb3JlIHN0YXJ0aW5nIHRv IGRlYnVnIC8gYmlzZWN0IChsYXN0IGtub3duIGdvb2Qgb24gdGhpcyBtYWNoaW5lIGlzIA0KPiA+ ID4gNC4wLXJjNiksIEkgYW0gYXR0YWNoaW5nIHRoZSBkbWVzZyBpbiBjYXNlIHNvbWVvbmUgYWxy ZWFkeSBrbm93cyB3aGF0IHRoZSANCj4gPiA+IGlzc3VlIGlzLg0KPiA+ID4gDQo+ID4gDQo+ID4g SSBicmllZmx5IHdlbnQgb3ZlciB0aGUgaXdsd2lmaSBjb21taXRzIGJldHdlZW4gNC4wLXJjNiBh bmQgbGludXgvbWFzdGVyDQo+ID4gYW5kIGNvdWxkbid0IGZpbmQgYW55dGhpbmcgb2J2aW91cy4N Cj4gPiBOb3RlIHRoYXQgZm9yIHRoZSBkZXZpY2UgeW91IGhhdmUsIHRoZSBjb21taXRzIHRoYXQg dG91Y2gNCj4gPiBkcml2ZXJzL25ldC93aXJlbGVzcy9pd2x3aWZpL212bSBhcmUgbm90IHJlbGV2 YW50Lg0KPiA+IA0KPiA+IFdoYXQgeW91IGFyZSBzZWVpbmcgaXMgdGhhdCB0aGUgUENJIGhvc3Qg aXMgZGlzY29ubmVjdGluZyB0aGUgV2lGaSBOSUMNCj4gPiBmb3Igc29tZSB3ZWlyZCByZWFzb24u IEl0IGlzIG5vdCB0aGUgZmlyc3QgdGltZSBJIHNlZSB0aGF0LCBidXQNCj4gPiB1bmZvcnR1bmF0 ZWx5LCBJIGhhdmUgbmV2ZXIgYmVlbiBhYmxlIHRvIGRlYnVnIHRoaXMuIEkgYW0gcGVyc29uYWxs eSBub3QNCj4gPiBhIEhXIFBDSSBleHBlcnQgYW5kIEkgY291bGRuJ3QgcmVwcm9kdWNlIGVpdGhl ci4uLg0KPiA+IA0KPiA+IEkgYW0gYWZyYWlkIEkgd29uJ3Qgc2F2ZSB5b3UgdGhlIHRpbWUgb2Yg dGhlIGJpc2VjdGlvbiwgYnV0IEkgYW0gbm90DQo+ID4gZW50aXJlbHkgc3VyZSB0aGF0IGJpc2Vj dGluZyB0aGUgaXdsd2lmaSBkcml2ZXIgaXMgZW5vdWdoIHRvIGZpbmQgdGhlDQo+ID4gY29tbWl0 IHRoYXQgYnJva2UgaXQuIFlvdSBtYXkgd2FudCB0byBiaXNlY3QgdGhlIHBjaSBidXMgZHJpdmVy IGFzIHdlbGwuDQo+IA0KPiBUaGUgcHJvYmxlbSBpcyB0aGF0IEkgY2FuJ3QgcmVhbGx5IHJlbGlh Ymx5IHJlcHJvZHVjZSBpdDsgaXQgaGFwcGVucyANCj4gcmF0aGVyIG9mdGVuLCBidXQgbm90IHNv IG9mdGVuIHRoYXQgSSBjb3VsZCBiZSBjZXJ0YWlubHkgc3VyZSB0aGF0IG15IA0KPiBkaXN0aW5j dGlvbiBvZiBnb29kIGFuZCBiYWQga2VybmVscyB3b3VsZCBiZSBhY2N1cmF0ZS4NCj4gDQo+IEkg d2lsbCB0cnkgaXQsIGJ1dCBJIGV4cGVjdCB0aGUgcmVzdWx0IHRvIGJlIGJvZ3VzIGJlY2F1c2Ug b2YgdGhpcywgDQo+IHVuZm9ydHVuYXRlbHkuDQo+IA0KDQpJIGNhbiB1bmRlcnN0YW5kLiBBIGZl dyB1c2VycyByZXBvcnRlZCB0aGF0IHRoaXMgYnVnIG9jY3VycmVkIG1vcmUNCnJlbGlhYmx5IHdo ZW4gbW92aW5nIHRoZWlyIHN5c3RlbSwgYWx0aG91Z2ggaXQgc2VlbXMgdmVyeSB3ZWlyZCB0byBt ZS4NCg0KPiA+IEZpcnN0IHF1ZXN0aW9uIGlzOiBBcmUgeW91IHN1cmUgdGhhdCA0LjAtcmM2IHdh cyBnb29kPw0KPiANCj4gUHJldHR5IG11Y2gsIHllcy4gSSd2ZSBiZWVuIHJ1bm5pbmcgaXQgZm9y IHF1aXRlIHNvbWUgdGltZSBvbiB0aGlzIA0KPiBtYWNoaW5lIHdpdGhvdXQgYW55IGlzc3Vlcy4g QnV0IGFmdGVyIHVwZGF0aW5nIHRvIGN1cnJlbnQgSEVBRCB0d28gZGF5cyANCj4gYWdvLCB0aGUg aXNzdWUgdHJpZ2dlcmVkIGxpa2UgNiBvciA3IHRpbWVzIGFscmVhZHkuDQo+IA0KDQpPayAtIEkg d2lsbCB0cnkgdG8gbG9vayBhdCB0aGUgUENJIGNvbW1pdHMgdGhlcmUgYWx0aG91Z2ggSSBhbSBu b3Qgc3VyZQ0KSSdsbCBiZSBhYmxlIHRvIG1ha2UgbXVjaCBzZW5zZSBvZiB0aGVtLi4uDQoNCj4g VGhhbmtzLA0KPiANCg0K From mboxrd@z Thu Jan 1 00:00:00 1970 Return-Path: Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S932980AbbDWJM6 (ORCPT ); Thu, 23 Apr 2015 05:12:58 -0400 Received: from mga14.intel.com ([192.55.52.115]:52218 "EHLO mga14.intel.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1752256AbbDWJMx (ORCPT ); Thu, 23 Apr 2015 05:12:53 -0400 X-ExtLoop1: 1 X-IronPort-AV: E=Sophos;i="5.11,629,1422950400"; d="scan'208";a="714170325" From: "Grumbach, Emmanuel" To: "jkosina@suse.cz" CC: "linux-wireless@vger.kernel.org" , "linux-kernel@vger.kernel.org" , "ilw@linux.intel.com" , "Berg, Johannes" Subject: Re: iwlwifi getting stuck with current Linus' tree (646da63172) Thread-Topic: iwlwifi getting stuck with current Linus' tree (646da63172) Thread-Index: AQHQfTzjJftp7aVbakmHVHohb9QTqJ1Z1GUAgAA6NoCAAA/zAA== Date: Thu, 23 Apr 2015 09:12:46 +0000 Message-ID: <1429780366.11859.1.camel@intel.com> References: <1429764440.4084.5.camel@intel.com> In-Reply-To: Accept-Language: en-US Content-Language: en-US X-MS-Has-Attach: X-MS-TNEF-Correlator: x-originating-ip: [10.254.151.56] Content-Type: text/plain; charset="utf-8" Content-ID: <11F035032F924B4BA848B282422A49B6@intel.com> MIME-Version: 1.0 Sender: linux-kernel-owner@vger.kernel.org List-ID: X-Mailing-List: linux-kernel@vger.kernel.org Content-Transfer-Encoding: 8bit X-MIME-Autoconverted: from base64 to 8bit by nfs id t3N9D6N7026322 On Thu, 2015-04-23 at 10:15 +0200, Jiri Kosina wrote: > On Thu, 23 Apr 2015, Grumbach, Emmanuel wrote: > > > > I've been running current Linus' tree and have been getting system lockups > > > frequently. After a few "silent" lockups, I was able to obtain a dmesg > > > before the machine turned dead again (wifi stopped working shortly before > > > that). > > > > > > Before starting to debug / bisect (last known good on this machine is > > > 4.0-rc6), I am attaching the dmesg in case someone already knows what the > > > issue is. > > > > > > > I briefly went over the iwlwifi commits between 4.0-rc6 and linux/master > > and couldn't find anything obvious. > > Note that for the device you have, the commits that touch > > drivers/net/wireless/iwlwifi/mvm are not relevant. > > > > What you are seeing is that the PCI host is disconnecting the WiFi NIC > > for some weird reason. It is not the first time I see that, but > > unfortunately, I have never been able to debug this. I am personally not > > a HW PCI expert and I couldn't reproduce either... > > > > I am afraid I won't save you the time of the bisection, but I am not > > entirely sure that bisecting the iwlwifi driver is enough to find the > > commit that broke it. You may want to bisect the pci bus driver as well. > > The problem is that I can't really reliably reproduce it; it happens > rather often, but not so often that I could be certainly sure that my > distinction of good and bad kernels would be accurate. > > I will try it, but I expect the result to be bogus because of this, > unfortunately. > I can understand. A few users reported that this bug occurred more reliably when moving their system, although it seems very weird to me. > > First question is: Are you sure that 4.0-rc6 was good? > > Pretty much, yes. I've been running it for quite some time on this > machine without any issues. But after updating to current HEAD two days > ago, the issue triggered like 6 or 7 times already. > Ok - I will try to look at the PCI commits there although I am not sure I'll be able to make much sense of them... > Thanks, > {.n++%ݶw{.n+{G{ayʇڙ,jfhz_(階ݢj"mG?&~iOzv^m ?I