* [iwl4965] Microcode SW error detected
From: Paul Bolle @ 2011-05-23 11:45 UTC
  To: Stanislaw Gruszka; +Cc: linux-wireless

0) Since I started running (release candidates of) kernel v2.6.39 errors
like these show up in my log, every now and then:

iwl4965 0000:03:00.0: Microcode SW error detected. Restarting 0x82000000.
iwl4965 0000:03:00.0: Loaded firmware version: 228.61.2.24
iwl4965 0000:03:00.0: Start IWL Error Log Dump:
iwl4965 0000:03:00.0: Status: 0x000213E4, count: 5
iwl4965 0000:03:00.0: Desc              Time       data1      data2      line
iwl4965 0000:03:00.0: FH_ERROR (0x000C) 1821446380 0x00000008 0x03130000 208
iwl4965 0000:03:00.0: pc      blink1  blink2  ilink1  ilink2  hcmd
iwl4965 0000:03:00.0: 0x0046C 0x04A46 0x004C2 0x006DA 0x04A46 0x4C700A8
iwl4965 0000:03:00.0: FH register values:
iwl4965 0000:03:00.0: FH_RSCSR_CHNL0_STTS_WPTR_REG: 0X132a9b00
iwl4965 0000:03:00.0: FH_RSCSR_CHNL0_RBDCB_BASE_REG: 0X01329ed0
iwl4965 0000:03:00.0: FH_RSCSR_CHNL0_WPTR: 0X000000d8
iwl4965 0000:03:00.0: FH_MEM_RCSR_CHNL0_CONFIG_REG: 0X00819000
iwl4965 0000:03:00.0: FH_MEM_RSSR_SHARED_CTRL_REG: 0X0000003c
iwl4965 0000:03:00.0: FH_MEM_RSSR_RX_STATUS_REG: 0X03130000
iwl4965 0000:03:00.0: FH_MEM_RSSR_RX_ENABLE_ERR_IRQ2DRV: 0X00000000
iwl4965 0000:03:00.0: FH_TSSR_TX_STATUS_REG: 0X07ff0002
iwl4965 0000:03:00.0: FH_TSSR_TX_ERROR_REG: 0X00000000
iwl4965 0000:03:00.0: Start IWL Event Log Dump: display last 20 entries
iwl4965 0000:03:00.0: EVT_LOGT:1821445332:0x00000000:1352
iwl4965 0000:03:00.0: EVT_LOGT:1821445332:0x00000001:1353
iwl4965 0000:03:00.0: EVT_LOGT:1821445336:0x0000000c:0357
iwl4965 0000:03:00.0: EVT_LOGT:1821445533:0x00000107:0106
iwl4965 0000:03:00.0: EVT_LOGT:1821445534:0x00000000:0302
iwl4965 0000:03:00.0: EVT_LOGT:1821445574:0x000000d4:0321
iwl4965 0000:03:00.0: EVT_LOGT:1821445575:0x00000000:1350
iwl4965 0000:03:00.0: EVT_LOGT:1821445576:0x00000000:1351
iwl4965 0000:03:00.0: EVT_LOGT:1821445576:0x00000000:1352
iwl4965 0000:03:00.0: EVT_LOGT:1821445577:0x00000001:1353
iwl4965 0000:03:00.0: EVT_LOGT:1821445581:0x0000000d:0357
iwl4965 0000:03:00.0: EVT_LOGT:1821446327:0x00000107:0106
iwl4965 0000:03:00.0: EVT_LOGT:1821446328:0x00000000:0302
iwl4965 0000:03:00.0: EVT_LOGT:1821446368:0x000000d4:0321
iwl4965 0000:03:00.0: EVT_LOGT:1821446369:0x00000000:1350
iwl4965 0000:03:00.0: EVT_LOGT:1821446370:0x00000000:1351
iwl4965 0000:03:00.0: EVT_LOGT:1821446370:0x00000000:1352
iwl4965 0000:03:00.0: EVT_LOGT:1821446371:0x00000001:1353
iwl4965 0000:03:00.0: EVT_LOGT:1821446375:0x0000000e:0357
iwl4965 0000:03:00.0: EVT_LOGT:1821446383:0x00000000:0125
ieee80211 phy0: Hardware restart was requested

1) Is there anything I'm supposed to do to (help) make these errors go
away?


Paul Bolle
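For anyone who wants to compare several of these dumps, here is a minimal sketch of a parser for the EVT_LOGT lines in the event log dump above. It assumes the three colon-separated fields are a timestamp, a hexadecimal data word, and a decimal event number; that layout is inferred only from the dump itself, and the field names are labels chosen for this sketch, not names taken from the driver.

```python
import re
from collections import Counter

# Assumed layout, inferred only from the dump in this thread:
#   EVT_LOGT:<time>:<0xdata>:<event number>
EVT_RE = re.compile(r"EVT_LOGT:(\d+):(0x[0-9a-fA-F]+):(\d+)")


def parse_event_log(text):
    """Return a list of (time, data, event) tuples, one per EVT_LOGT entry."""
    return [(int(t), int(d, 16), int(e)) for t, d, e in EVT_RE.findall(text)]


def event_counts(entries):
    """Count how often each event number occurs in the parsed entries."""
    return Counter(event for _, _, event in entries)


# The last four entries of the dump quoted in the first message.
sample = """\
iwl4965 0000:03:00.0: EVT_LOGT:1821446370:0x00000000:1352
iwl4965 0000:03:00.0: EVT_LOGT:1821446371:0x00000001:1353
iwl4965 0000:03:00.0: EVT_LOGT:1821446375:0x0000000e:0357
iwl4965 0000:03:00.0: EVT_LOGT:1821446383:0x00000000:0125
"""

entries = parse_event_log(sample)
for time, data, event in entries:
    print(f"t={time} data={data:#010x} event={event}")
```

Feeding whole dumps from different occurrences of the error through `event_counts` would show whether the same event numbers always precede the FH_ERROR; what those numbers mean is not stated anywhere in this thread.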
* Re: [iwl4965] Microcode SW error detected
From: Stanislaw Gruszka @ 2011-05-24 7:09 UTC
  To: Paul Bolle; +Cc: linux-wireless

On Mon, May 23, 2011 at 01:45:06PM +0200, Paul Bolle wrote:
> 0) Since I started running (release candidates of) kernel v2.6.39 errors
> like these show up in my log, every now and then:
>
> iwl4965 0000:03:00.0: Microcode SW error detected. Restarting 0x82000000.
[snip]
> 1) Is there anything I'm supposed to do to (help) make these errors go
> away?

Does it also happen on 2.6.39-rc1? What is your configuration (mode,
encryption, do you use wpa_supplicant or NetworkManager, which version,
etc ...)?

I will try to reproduce locally. If I am not able to reproduce it,
I'll ask you for debug logs from the broken and the working (i.e. 2.6.38)
versions to compare.

Thanks
Stanislaw
* Re: [iwl4965] Microcode SW error detected
From: Paul Bolle @ 2011-05-24 15:54 UTC
  To: Stanislaw Gruszka; +Cc: linux-wireless

On Tue, 2011-05-24 at 09:09 +0200, Stanislaw Gruszka wrote:
> Does it also happen on 2.6.39-rc1?

My logs do not go back anymore to the (short) period I was actually
running 2.6.39-rc1. Is this important enough for you that I should
reinstall 2.6.39-rc1 and try to see whether I can reproduce this error
with that release candidate?


Paul Bolle
* Re: [iwl4965] Microcode SW error detected
From: Stanislaw Gruszka @ 2011-05-24 16:03 UTC
  To: Paul Bolle; +Cc: linux-wireless

On Tue, May 24, 2011 at 05:54:05PM +0200, Paul Bolle wrote:
> On Tue, 2011-05-24 at 09:09 +0200, Stanislaw Gruszka wrote:
> > Does it also happen on 2.6.39-rc1?
>
> My logs do not go back anymore to the (short) period I was actually
> running 2.6.39-rc1. Is this important enough for you that I should
> reinstall 2.6.39-rc1 and try to see whether I can reproduce this error
> with that release candidate?

It's not so important at this point; it could become important if we do
not find a remedy some other way. Currently I would rather like to know
how to reproduce the problem.

Stanislaw
* Re: [iwl4965] Microcode SW error detected
From: Paul Bolle @ 2011-05-24 16:50 UTC
  To: Stanislaw Gruszka; +Cc: linux-wireless

On Tue, 2011-05-24 at 09:09 +0200, Stanislaw Gruszka wrote:
> What is your configuration (mode,
> encryption, do you use wpa_supplicant or NetworkManager, which version,
> etc ...)?

0) Mode as is displayed in iwconfig output? That is "Managed".

1) Encryption is wep (so, no wpa_supplicant).

2) Currently using NetworkManager-0.8.4-1.fc14.x86_64.

> I will try to reproduce locally. If I am not able to reproduce it,
> I'll ask you for debug logs from the broken and the working (i.e. 2.6.38)
> versions to compare.

3) Sure, no problem. But isn't 2.6.38 from before the iwlegacy split?


Paul Bolle
* Re: [iwl4965] Microcode SW error detected
From: Stanislaw Gruszka @ 2011-05-25 13:11 UTC
  To: Paul Bolle; +Cc: linux-wireless

On Tue, May 24, 2011 at 06:50:53PM +0200, Paul Bolle wrote:
> On Tue, 2011-05-24 at 09:09 +0200, Stanislaw Gruszka wrote:
> > What is your configuration (mode,
> > encryption, do you use wpa_supplicant or NetworkManager, which version,
> > etc ...)?
>
> 0) Mode as is displayed in iwconfig output? That is "Managed".
>
> 1) Encryption is wep (so, no wpa_supplicant).
>
> 2) Currently using NetworkManager-0.8.4-1.fc14.x86_64.

I tried WEP with one AP and saw no microcode errors, though with a
different NetworkManager version; I am not sure whether that matters.
I'll try later on F-14. What channel do you use? Is the network 11n or
11g or mixed? Is WEP 40 bits or other?

> > I will try to reproduce locally. If I am not able to reproduce it,
> > I'll ask you for debug logs from the broken and the working (i.e. 2.6.38)
> > versions to compare.
>
> 3) Sure, no problem. But isn't 2.6.38 from before the iwlegacy split?

Yes, it is. If the problem was introduced during the driver split (which
happened in 2.6.39), one way to find a fix would be to compare driver
behaviour before and after the split.

Stanislaw
* Re: [iwl4965] Microcode SW error detected
From: Paul Bolle @ 2011-05-25 13:31 UTC
  To: Stanislaw Gruszka; +Cc: linux-wireless

On Wed, 2011-05-25 at 15:11 +0200, Stanislaw Gruszka wrote:
> What channel do you use?

11, 2.462 GHz (according to my basestation)

> Is the network 11n or 11g or mixed?

Network is mixed 11b and 11g (according to my basestation)

> Is WEP 40 bits or other?

128 bits, 26 hex digits (again, according to my basestation)


Paul Bolle
* Re: [iwl4965] Microcode SW error detected
From: Stanislaw Gruszka @ 2011-05-31 13:37 UTC
  To: Paul Bolle; +Cc: linux-wireless

On Wed, May 25, 2011 at 03:31:53PM +0200, Paul Bolle wrote:
> On Wed, 2011-05-25 at 15:11 +0200, Stanislaw Gruszka wrote:
>
> > What channel do you use?
> 11, 2.462 GHz (according to my basestation)
>
> > Is the network 11n or 11g or mixed?
> Network is mixed 11b and 11g (according to my basestation)
>
> > Is WEP 40 bits or other?
> 128 bits, 26 hex digits (again, according to my basestation)

All my attempts to reproduce the problem were unsuccessful. Please
send me (off list, since this could be quite big) the dmesg output from
loading the module up to the first microcode error, when using verbose
debug, e.g.:

$ modprobe -r iwl4965
$ modprobe iwl4965 debug=0x47ffffff

Note: the driver has to be compiled with CONFIG_IWLWIFI_LEGACY_DEBUG.
Also, the dmesg circular buffer can be too small to contain all requested
messages. In that case, you will have to configure syslog as described
here:
https://bugzilla.kernel.org/show_bug.cgi?id=35932#c2

Stanislaw
* Re: [iwl4965] Microcode SW error detected
From: Paul Bolle @ 2011-06-06 8:58 UTC
  To: Stanislaw Gruszka; +Cc: linux-wireless

On Tue, 2011-05-31 at 15:37 +0200, Stanislaw Gruszka wrote:
> All my attempts to reproduce the problem were unsuccessful. Please
> send me (off list, since this could be quite big) the dmesg output from
> loading the module up to the first microcode error, when using verbose
> debug, e.g.:
>
> $ modprobe -r iwl4965
> $ modprobe iwl4965 debug=0x47ffffff
>
> Note: the driver has to be compiled with CONFIG_IWLWIFI_LEGACY_DEBUG.
> Also, the dmesg circular buffer can be too small to contain all requested
> messages. In that case, you will have to configure syslog as described
> here:
> https://bugzilla.kernel.org/show_bug.cgi?id=35932#c2

0) I have not yet done that, but I do hope to do it shortly.

1) This Microcode SW error pops up only every now and then, so I'm
afraid I might find myself logging debug messages for quite some time.
What is the average volume of those messages? If it's no more than, say,
10 MB/min I can still run in verbose debug mode for hours on end without
worrying about running out of disk space.


Paul Bolle
* Re: [iwl4965] Microcode SW error detected
From: Stanislaw Gruszka @ 2011-06-06 11:36 UTC
  To: Paul Bolle; +Cc: linux-wireless

On Mon, Jun 06, 2011 at 10:58:42AM +0200, Paul Bolle wrote:
> On Tue, 2011-05-31 at 15:37 +0200, Stanislaw Gruszka wrote:
> > All my attempts to reproduce the problem were unsuccessful. Please
> > send me (off list, since this could be quite big) the dmesg output from
> > loading the module up to the first microcode error, when using verbose
> > debug, e.g.:
> >
> > $ modprobe -r iwl4965
> > $ modprobe iwl4965 debug=0x47ffffff
> >
> > Note: the driver has to be compiled with CONFIG_IWLWIFI_LEGACY_DEBUG.
> > Also, the dmesg circular buffer can be too small to contain all requested
> > messages. In that case, you will have to configure syslog as described
> > here:
> > https://bugzilla.kernel.org/show_bug.cgi?id=35932#c2
>
> 0) I have not yet done that, but I do hope to do it shortly.
>
> 1) This Microcode SW error pops up only every now and then, so I'm
> afraid I might find myself logging debug messages for quite some time.

Hmm, perhaps there is some way to reproduce the error?

> What is the average volume of those messages? If it's no more than, say,
> 10 MB/min I can still run in verbose debug mode for hours on end without
> worrying about running out of disk space.

I don't know how fast verbose logging will eat disk space, perhaps
you can check that? :-) Maybe the logger could be configured not to log
date, hostname and component; that would decrease the size of the logs
vastly. If not, less verbose logging could be used, let's say
debug=0x00375fff. That will miss some information, but may still be
enough to figure out where the bug is.

Thanks
Stanislaw
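To see how much narrower the second debug mask is, the two values mentioned so far in the thread (0x47ffffff and 0x00375fff) can be compared bit by bit. The following is only a sketch of that arithmetic; it does not name the individual debug categories, since the thread does not say what each bit enables.

```python
FULL_MASK = 0x47FFFFFF     # verbose mask suggested earlier in the thread
REDUCED_MASK = 0x00375FFF  # less verbose mask suggested as a fallback


def set_bits(mask):
    """Return the positions of the bits set in mask, lowest bit first."""
    return [i for i in range(mask.bit_length()) if (mask >> i) & 1]


# Bits enabled by the verbose mask but switched off by the reduced one.
dropped = FULL_MASK & ~REDUCED_MASK

print(f"dropped mask: {dropped:#010x}")
print("dropped bit positions:", set_bits(dropped))
print("reduced mask is a subset of the full mask:",
      REDUCED_MASK & ~FULL_MASK == 0)
```

Which log categories those bit positions correspond to has to be looked up in the driver's debug header for the kernel version in use.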
* Re: [iwl4965] Microcode SW error detected
From: Paul Bolle @ 2011-06-06 18:29 UTC
  To: Stanislaw Gruszka; +Cc: linux-wireless

On Mon, 2011-06-06 at 13:36 +0200, Stanislaw Gruszka wrote:
> Hmm, perhaps there is some way to reproduce the error?

My current logs show no obvious - to me - pattern for this error, so I'm
afraid we must scan the debug output for some hints for
reproducibility ...


Paul Bolle
* Re: [iwl4965] Microcode SW error detected
From: Dan Williams @ 2011-05-26 17:26 UTC
  To: Stanislaw Gruszka; +Cc: Paul Bolle, linux-wireless

On Wed, 2011-05-25 at 15:11 +0200, Stanislaw Gruszka wrote:
> On Tue, May 24, 2011 at 06:50:53PM +0200, Paul Bolle wrote:
> > On Tue, 2011-05-24 at 09:09 +0200, Stanislaw Gruszka wrote:
> > > What is your configuration (mode,
> > > encryption, do you use wpa_supplicant or NetworkManager, which version,
> > > etc ...)?
> >
> > 0) Mode as is displayed in iwconfig output? That is "Managed".
> >
> > 1) Encryption is wep (so, no wpa_supplicant).
> >
> > 2) Currently using NetworkManager-0.8.4-1.fc14.x86_64.
>
> I tried WEP with one AP and saw no microcode errors, though with a
> different NetworkManager version; I am not sure whether that matters.
> I'll try later on F-14. What channel do you use? Is the network 11n or
> 11g or mixed? Is WEP 40 bits or other?

If it makes a difference, NM 0.8.x uses only the 'wext' supplicant
driver and uses wpa_supplicant 0.6.x. NM still requests periodic scans
about every 2 minutes as usual. (F15+, i.e. NM 0.9, uses "nl80211,wext"
and supplicant 0.7.3+git)

Dan

> > > I will try to reproduce locally. If I am not able to reproduce it,
> > > I'll ask you for debug logs from the broken and the working (i.e. 2.6.38)
> > > versions to compare.
> >
> > 3) Sure, no problem. But isn't 2.6.38 from before the iwlegacy split?
> Yes, it is. If the problem was introduced during the driver split (which
> happened in 2.6.39), one way to find a fix would be to compare driver
> behaviour before and after the split.
>
> Stanislaw
* Re: [iwl4965] Microcode SW error detected
From: Bernhard Schmidt @ 2011-06-07 6:32 UTC
  To: Paul Bolle; +Cc: Stanislaw Gruszka, linux-wireless

On Monday, May 23, 2011 13:45:06 Paul Bolle wrote:
> 0) Since I started running (release candidates of) kernel v2.6.39 errors
> like these show up in my log, every now and then:
>
> iwl4965 0000:03:00.0: Microcode SW error detected. Restarting 0x82000000.
> iwl4965 0000:03:00.0: Loaded firmware version: 228.61.2.24
> iwl4965 0000:03:00.0: Start IWL Error Log Dump:
> iwl4965 0000:03:00.0: Status: 0x000213E4, count: 5
> iwl4965 0000:03:00.0: Desc              Time       data1      data2      line
> iwl4965 0000:03:00.0: FH_ERROR (0x000C) 1821446380 0x00000008 0x03130000 208

On an unrelated side note, I have a case where I can trigger an
FH_ERROR at this line 100% reliably. Not on Linux though.
While associated to a 5GHz BSS, doing a scan chan by chan
(instead of all at once) is enough to trigger it. A workaround
is to not send probe requests for 2GHz channels at 1Mbps (CCK
flag) but at 6Mbps instead.

Maybe this bug report [1] is related too?

[1] http://bugzilla.intellinuxwireless.org/show_bug.cgi?id=1965

--
Bernhard
* Re: [iwl4965] Microcode SW error detected
From: Stanislaw Gruszka @ 2011-06-07 14:54 UTC
  To: Bernhard Schmidt; +Cc: Paul Bolle, linux-wireless

On Tue, Jun 07, 2011 at 08:32:48AM +0200, Bernhard Schmidt wrote:
> On Monday, May 23, 2011 13:45:06 Paul Bolle wrote:
> > 0) Since I started running (release candidates of) kernel v2.6.39 errors
> > like these show up in my log, every now and then:
> >
> > iwl4965 0000:03:00.0: Microcode SW error detected. Restarting 0x82000000.
> > iwl4965 0000:03:00.0: Loaded firmware version: 228.61.2.24
> > iwl4965 0000:03:00.0: Start IWL Error Log Dump:
> > iwl4965 0000:03:00.0: Status: 0x000213E4, count: 5
> > iwl4965 0000:03:00.0: Desc              Time       data1      data2      line
> > iwl4965 0000:03:00.0: FH_ERROR (0x000C) 1821446380 0x00000008 0x03130000 208
>
> On an unrelated side note, I have a case where I can trigger an
> FH_ERROR at this line 100% reliably. Not on Linux though.
> While associated to a 5GHz BSS, doing a scan chan by chan
> (instead of all at once) is enough to trigger it.

That could be useful hint, we do not scan chan by chan, but we
have thing called "plcp check health", which "restart radio"
by requesting one channel scan. So perhaps disabling that could
help.

> A workaround
> is to not send probe requests for 2GHz channels at 1Mbps (CCK
> flag) but at 6Mbps instead.
>
> Maybe this bug report [1] is related too?
>
> [1] http://bugzilla.intellinuxwireless.org/show_bug.cgi?id=1965

Hard to tell.

Thanks
Stanislaw

diff --git a/drivers/net/wireless/iwlegacy/iwl-rx.c b/drivers/net/wireless/iwlegacy/iwl-rx.c
index 654cf23..6062da0 100644
--- a/drivers/net/wireless/iwlegacy/iwl-rx.c
+++ b/drivers/net/wireless/iwlegacy/iwl-rx.c
@@ -230,6 +230,8 @@ EXPORT_SYMBOL(iwl_legacy_rx_spectrum_measure_notif);
 void iwl_legacy_recover_from_statistics(struct iwl_priv *priv,
 				struct iwl_rx_packet *pkt)
 {
+	return;
+
 	if (test_bit(STATUS_EXIT_PENDING, &priv->status))
 		return;
 	if (iwl_legacy_is_any_associated(priv)) {
* Re: [iwl4965] Microcode SW error detected
From: Paul Bolle @ 2011-06-07 19:23 UTC
  To: Stanislaw Gruszka; +Cc: Bernhard Schmidt, linux-wireless

On Tue, 2011-06-07 at 16:54 +0200, Stanislaw Gruszka wrote:
> That could be useful hint, we do not scan chan by chan, but we
> have thing called "plcp check health", which "restart radio"
> by requesting one channel scan. So perhaps disabling that could
> help.

At this moment I'm interested in something (a script, some sequence of
actions, whatever) that (somewhat) reliably triggers this error. Because
right now I have no clue what triggers it.

Is your patch in that category or is it a (crude) fix? If it's a fix,
I'm not sure it is of much help at this stage.


Paul Bolle
* Re: [iwl4965] Microcode SW error detected
From: Stanislaw Gruszka @ 2011-06-08 13:47 UTC
  To: Paul Bolle; +Cc: Bernhard Schmidt, linux-wireless

On Tue, Jun 07, 2011 at 09:23:00PM +0200, Paul Bolle wrote:
> On Tue, 2011-06-07 at 16:54 +0200, Stanislaw Gruszka wrote:
> > That could be useful hint, we do not scan chan by chan, but we
> > have thing called "plcp check health", which "restart radio"
> > by requesting one channel scan. So perhaps disabling that could
> > help.
>
> At this moment I'm interested in something (a script, some sequence of
> actions, whatever) that (somewhat) reliably triggers this error. Because
> right now I have no clue what triggers it.

Having a reliable reproducer would definitely be nice. But the bug
could be some kind of race condition that happens in the code flow once
per 10000000000 cases ...

> Is your patch in that category or is it a (crude) fix? If it's a fix,
> I'm not sure it is of much help at this stage.

It could be a possible fix. Why can't you simply apply the patch and see
if the errors are still there? If after a week or so there are no errors,
we could consider the bug fixed; otherwise, well ... we will still need
to look around for a fix.

I just posted a patch that removes this "plcp health check" and related
code on -next anyway, because I don't think this is something that we
need.

Stanislaw
* Re: [iwl4965] Microcode SW error detected
From: Paul Bolle @ 2011-08-15 10:51 UTC
  To: Stanislaw Gruszka; +Cc: Bernhard Schmidt, linux-wireless

On Wed, 2011-06-08 at 15:47 +0200, Stanislaw Gruszka wrote:
> It could be a possible fix. Why can't you simply apply the patch and see
> if the errors are still there? If after a week or so there are no errors,
> we could consider the bug fixed; otherwise, well ... we will still need
> to look around for a fix.
>
> I just posted a patch that removes this "plcp health check" and related
> code on -next anyway, because I don't think this is something that we
> need.

0) This is just to note that I haven't yet tried to see if your small
patch helps. I still hope to do that as I have not given up on this
issue. Feel free to prod me if I again disappear for too long and you
lose your patience.

1) By the way, I still see this error (every now and then) in my logs.
Most recently while running v3.0.1, so it appears not to be fixed by
recent updates for iwl4965 (if any).


Paul Bolle
* Re: [iwl4965] Microcode SW error detected
From: Paul Bolle @ 2011-09-04 8:28 UTC
  To: Stanislaw Gruszka; +Cc: Bernhard Schmidt, linux-wireless

On Tue, 2011-06-07 at 16:54 +0200, Stanislaw Gruszka wrote:
> That could be useful hint, we do not scan chan by chan, but we
> have thing called "plcp check health", which "restart radio"
> by requesting one channel scan. So perhaps disabling that could
> help.
>
> [...]
>
> diff --git a/drivers/net/wireless/iwlegacy/iwl-rx.c b/drivers/net/wireless/iwlegacy/iwl-rx.c
> index 654cf23..6062da0 100644
> --- a/drivers/net/wireless/iwlegacy/iwl-rx.c
> +++ b/drivers/net/wireless/iwlegacy/iwl-rx.c
> @@ -230,6 +230,8 @@ EXPORT_SYMBOL(iwl_legacy_rx_spectrum_measure_notif);
>  void iwl_legacy_recover_from_statistics(struct iwl_priv *priv,
>  				struct iwl_rx_packet *pkt)
>  {
> +	return;
> +
>  	if (test_bit(STATUS_EXIT_PENDING, &priv->status))
>  		return;
>  	if (iwl_legacy_is_any_associated(priv)) {

0) I finally got around to applying this patch (to v3.0.4).

1) After a few days of normal usage (with quite a bit of suspend and
resume cycles) this error was again triggered. So avoiding
check_plcp_health() doesn't seem to help.

2) I never sent you the debug output (ie, output after doing "modprobe
iwl4965 debug=0x47ffffff"), did I?


Paul Bolle
* Re: [iwl4965] Microcode SW error detected
From: Stanislaw Gruszka @ 2011-09-05 9:33 UTC
  To: Paul Bolle; +Cc: Bernhard Schmidt, linux-wireless

On Sun, Sep 04, 2011 at 10:28:35AM +0200, Paul Bolle wrote:
> On Tue, 2011-06-07 at 16:54 +0200, Stanislaw Gruszka wrote:
> > That could be useful hint, we do not scan chan by chan, but we
> > have thing called "plcp check health", which "restart radio"
> > by requesting one channel scan. So perhaps disabling that could
> > help.
> >
> > [...]
> >
> > diff --git a/drivers/net/wireless/iwlegacy/iwl-rx.c b/drivers/net/wireless/iwlegacy/iwl-rx.c
> > index 654cf23..6062da0 100644
> > --- a/drivers/net/wireless/iwlegacy/iwl-rx.c
> > +++ b/drivers/net/wireless/iwlegacy/iwl-rx.c
> > @@ -230,6 +230,8 @@ EXPORT_SYMBOL(iwl_legacy_rx_spectrum_measure_notif);
> >  void iwl_legacy_recover_from_statistics(struct iwl_priv *priv,
> >  				struct iwl_rx_packet *pkt)
> >  {
> > +	return;
> > +
> >  	if (test_bit(STATUS_EXIT_PENDING, &priv->status))
> >  		return;
> >  	if (iwl_legacy_is_any_associated(priv)) {
>
> 0) I finally got around to applying this patch (to v3.0.4).
>
> 1) After a few days of normal usage (with quite a bit of suspend and
> resume cycles) this error was again triggered. So avoiding
> check_plcp_health() doesn't seem to help.
>
> 2) I never sent you the debug output (ie, output after doing "modprobe
> iwl4965 debug=0x47ffffff"), did I?

No, but if the error shows up after a few days, gathering and analyzing
a few days of debug logs is impractical. Does wifi stop working after an
error, or is there some other negative impact? Or are the messages just
printed and the driver recovers by itself?

Stanislaw
* Re: [iwl4965] Microcode SW error detected
From: Paul Bolle @ 2011-09-05 10:32 UTC
  To: Stanislaw Gruszka; +Cc: Bernhard Schmidt, linux-wireless

On Mon, 2011-09-05 at 11:33 +0200, Stanislaw Gruszka wrote:
> On Sun, Sep 04, 2011 at 10:28:35AM +0200, Paul Bolle wrote:
> > 1) After a few days of normal usage (with quite a bit of suspend and
> > resume cycles) this error was again triggered. So avoiding
> > check_plcp_health() doesn't seem to help.
> >
> > 2) I never sent you the debug output (ie, output after doing "modprobe
> > iwl4965 debug=0x47ffffff"), did I?
>
> No, but if the error shows up after a few days, gathering and analyzing
> a few days of debug logs is impractical.

I see.

> Does wifi stop working after an
> error, or is there some other negative impact? Or are the messages just
> printed and the driver recovers by itself?

There doesn't seem to be any impact (ie, it might have some impact but
I'm too insensitive to notice). The driver does recover itself and I do
not have to mess with rfkill or "modprobe -r" or whatever. I actually
discovered this because I tend to regularly do
    dmesg -r | grep "^<[123]>"

to keep myself informed of any kernel errors (or worse). And then these
few dozen lines can't go unnoticed.


Paul Bolle
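The dmesg pipeline above selects raw kernel log lines whose priority prefix is 1, 2 or 3, which in the kernel's log level numbering are alert, critical and error. A minimal sketch of the same filter in Python follows; the function name is hypothetical, and the sample lines and their timestamps are made up for illustration (only the error message text comes from this thread).

```python
import re

# Matches the "<N>" priority prefix that raw dmesg output puts on each line.
PRIORITY_RE = re.compile(r"^<(\d+)>")


def filter_by_priority(lines, levels=(1, 2, 3)):
    """Keep only the lines whose priority prefix is one of the given levels."""
    selected = []
    for line in lines:
        match = PRIORITY_RE.match(line)
        if match and int(match.group(1)) in levels:
            selected.append(line)
    return selected


# Made-up sample input; timestamps and the first and last messages are
# illustrative only.
sample = [
    "<6>[  100.000000] an informational message",
    "<3>[  101.000000] iwl4965 0000:03:00.0: Microcode SW error detected. "
    "Restarting 0x82000000.",
    "<4>[  102.000000] a warning message",
]

for line in filter_by_priority(sample):
    print(line)
```

Passing `levels=(0, 1, 2, 3)` would also include emergency messages, which the grep pattern in the message above leaves out.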
* Re: [iwl4965] Microcode SW error detected
From: Paul Bolle @ 2012-09-14 12:17 UTC
  To: Stanislaw Gruszka; +Cc: Bernhard Schmidt, linux-wireless

On Mon, 2011-09-05 at 12:32 +0200, Paul Bolle wrote:
> On Mon, 2011-09-05 at 11:33 +0200, Stanislaw Gruszka wrote:
> > Does wifi stop working after an
> > error, or is there some other negative impact? Or are the messages just
> > printed and the driver recovers by itself?
>
> There doesn't seem to be any impact (ie, it might have some impact but
> I'm too insensitive to notice). The driver does recover itself and I do
> not have to mess with rfkill or "modprobe -r" or whatever. I actually
> discovered this because I tend to regularly do
>     dmesg -r | grep "^<[123]>"
>
> to keep myself informed of any kernel errors (or worse). And then these
> few dozen lines can't go unnoticed.

0) It's one year later now and this Microcode SW error again showed up
in the logs. I recently upgraded and I haven't kept any logs, but my
guess would be that I have run into that error once every week. (This
laptop is now running a v3.5.3 based kernel as shipped for Fedora 17.)

1) Would you have any suggestions how to pinpoint the cause of this
error? It is mainly annoying, and I managed to ignore it since my
previous message, but I still would like to free the logs from the noise
it makes.


Paul Bolle
* Re: [iwl4965] Microcode SW error detected
From: Paul Bolle @ 2012-10-15 14:51 UTC
  To: Stanislaw Gruszka; +Cc: Bernhard Schmidt, linux-wireless

On Fri, 2012-09-14 at 14:17 +0200, Paul Bolle wrote:
> 0) It's one year later now and this Microcode SW error again showed up
> in the logs. I recently upgraded and I haven't kept any logs, but my
> guess would be that I have run into that error once every week. (This
> laptop is now running a v3.5.3 based kernel as shipped for Fedora 17.)
>
> 1) Would you have any suggestions how to pinpoint the cause of this
> error? It is mainly annoying, and I managed to ignore it since my
> previous message, but I still would like to free the logs from the noise
> it makes.

0) I ported the "iwlegacy_tracing" patch from
https://bugzilla.kernel.org/show_bug.cgi?id=42766 to v3.6-rc7 and to
iwl4965. I've been running iwl4965 with tracing enabled ever since (that
is on: v3.6-rc7, v3.6, v3.6.1, and v3.6.2). Finally, after only three
weeks I hit our Microcode SW error again.

1) So now I've got a 600+k line (or 65 MB) trace dump. What should I do
with it?


Paul Bolle
* Re: [iwl4965] Microcode SW error detected
From: Stanislaw Gruszka @ 2012-10-15 15:17 UTC
  To: Paul Bolle; +Cc: Bernhard Schmidt, linux-wireless

On Mon, Oct 15, 2012 at 04:51:00PM +0200, Paul Bolle wrote:
> On Fri, 2012-09-14 at 14:17 +0200, Paul Bolle wrote:
> > 0) It's one year later now and this Microcode SW error again showed up
> > in the logs. I recently upgraded and I haven't kept any logs, but my
> > guess would be that I have run into that error once every week. (This
> > laptop is now running a v3.5.3 based kernel as shipped for Fedora 17.)
> >
> > 1) Would you have any suggestions how to pinpoint the cause of this
> > error? It is mainly annoying, and I managed to ignore it since my
> > previous message, but I still would like to free the logs from the noise
> > it makes.
>
> 0) I ported the "iwlegacy_tracing" patch from
> https://bugzilla.kernel.org/show_bug.cgi?id=42766 to v3.6-rc7 and to
> iwl4965. I've been running iwl4965 with tracing enabled ever since (that
> is on: v3.6-rc7, v3.6, v3.6.1, and v3.6.2). Finally, after only three
> weeks I hit our Microcode SW error again.
>
> 1) So now I've got a 600+k line (or 65 MB) trace dump. What should I do
> with it?

Just send it to me privately, let's say the last 10 MB of it...

Stanislaw
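Cutting the last 10 MB out of a large trace dump can be done with standard tools; as one possible illustration, here is a small Python sketch. The function name is hypothetical, and dropping the first, probably partial, line is a choice made for this sketch, not something requested in the thread. The demo runs on a tiny temporary file so the behaviour is easy to check.

```python
import os
import tempfile


def tail_bytes(path, max_bytes):
    """Return roughly the last max_bytes bytes of a file, starting on a line boundary."""
    size = os.path.getsize(path)
    start = max(0, size - max_bytes)
    with open(path, "rb") as f:
        f.seek(start)
        data = f.read()
    if start > 0:
        # The cut most likely landed mid-line, so drop everything up to the
        # first newline (this drops one full line if the cut was exactly
        # on a boundary).
        newline = data.find(b"\n")
        if newline != -1:
            data = data[newline + 1:]
    return data


# Demo on a small temporary file instead of a real 65 MB trace dump.
with tempfile.NamedTemporaryFile("wb", delete=False) as tmp:
    tmp.write(b"line one\nline two\nline three\n")
    demo_path = tmp.name

excerpt = tail_bytes(demo_path, 15)
os.remove(demo_path)
print(excerpt.decode(), end="")
```

For the case discussed here the call would be along the lines of `tail_bytes("trace.txt", 10 * 1024 * 1024)`, where `trace.txt` is a placeholder for wherever the dump was saved.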
Thread overview: 23+ messages
2011-05-23 11:45 [iwl4965] Microcode SW error detected Paul Bolle
2011-05-24  7:09 ` Stanislaw Gruszka
2011-05-24 15:54   ` Paul Bolle
2011-05-24 16:03     ` Stanislaw Gruszka
2011-05-24 16:50   ` Paul Bolle
2011-05-25 13:11     ` Stanislaw Gruszka
2011-05-25 13:31       ` Paul Bolle
2011-05-31 13:37         ` Stanislaw Gruszka
2011-06-06  8:58           ` Paul Bolle
2011-06-06 11:36             ` Stanislaw Gruszka
2011-06-06 18:29               ` Paul Bolle
2011-05-26 17:26       ` Dan Williams
2011-06-07  6:32 ` Bernhard Schmidt
2011-06-07 14:54   ` Stanislaw Gruszka
2011-06-07 19:23     ` Paul Bolle
2011-06-08 13:47       ` Stanislaw Gruszka
2011-08-15 10:51         ` Paul Bolle
2011-09-04  8:28     ` Paul Bolle
2011-09-05  9:33       ` Stanislaw Gruszka
2011-09-05 10:32         ` Paul Bolle
2012-09-14 12:17         ` Paul Bolle
2012-10-15 14:51           ` Paul Bolle
2012-10-15 15:17             ` Stanislaw Gruszka