linux-wireless.vger.kernel.org archive mirror
 help / color / mirror / Atom feed
* Kernel Panics from ath9k
@ 2012-04-01 18:20 Larry Finger
  2012-04-02  2:34 ` Sujith Manoharan
  0 siblings, 1 reply; 3+ messages in thread
From: Larry Finger @ 2012-04-01 18:20 UTC (permalink / raw)
  To: Luis R. Rodriguez, Jouni Malinen, Vasanthakumar Thiagarajan,
	Senthil Balasubramanian
  Cc: wireless

A few weeks ago, I was trying to help an inexperienced user on the openSUSE 
Wireless Forum who was having "crashes" caused by his AR9285 device. The crash 
turned out to be a kernel panic. As expected, I was not able to help him, and I 
ordered one of these cards through E-bay. It just arrived, and I am able to 
duplicate the panics. For me, they occur when changing from a WPA2-encrypted AP 
to one with WPA encryption. When switching from WPA2 to WEP, the NULL 
dereference does not occur; however, the connection fails.

I captured enough info from the debugging console to know that the panic results 
from "BUG: unable to handle kernel NULL pointer dereference at (null)" that 
originates at ath_tx_start+0x2c0/0x4b0. The kernel is x86_64 and I am currently 
testing kernel v3.4-rc1-214-g1ac7a92 from Linus's tree. I have not yet tested 
the wireless-testing tree, but I do not see any commits there that appear to 
address this issue.

The address of the traceback translates to line 1878 of 
drivers/net/wireless/ath/ath9k/xmit.c, which is (ironically) a WARN_ON that says

                 WARN_ON(tid->ac->txq != txctl->txq);

By testing each of the pointers in the above statement, I determined that 
tid->ac is NULL. Note: This statement is actually in ath_tx_start_dma(), which 
appears to be in-lined by the compiler. My tests process the pointers from left 
to right, and txctl or txctl->txq may also be NULL. The setting of tid is done with

                 tid = ATH_AN_2_TID(txctl->an, tidno);

I have not traced this any further yet to see why tid is not correct.

The full lspci output for my device is

06:00.0 Network controller [0280]: Atheros Communications Inc. AR9285 Wireless 
Network Adapter (PCI-Express) [168c:002b] (rev 01)
         Subsystem: Accton Technology Corporation Device [1113:d811]
         Flags: bus master, fast devsel, latency 0, IRQ 20
         Memory at f8000000 (64-bit, non-prefetchable) [size=64K]
         Capabilities: [40] Power Management version 3
         Capabilities: [50] MSI: Enable- Count=1/1 Maskable- 64bit-
         Capabilities: [60] Express Legacy Endpoint, MSI 00
         Capabilities: [100] Advanced Error Reporting
         Capabilities: [140] Virtual Channel
         Capabilities: [160] Device Serial Number 00-15-17-ff-ff-24-14-12
         Capabilities: [170] Power Budgeting <?>
         Kernel driver in use: ath9k

I will be happy to provide any further information that may be required.

Larry

^ permalink raw reply	[flat|nested] 3+ messages in thread

* Kernel Panics from ath9k
  2012-04-01 18:20 Kernel Panics from ath9k Larry Finger
@ 2012-04-02  2:34 ` Sujith Manoharan
  2012-04-02  3:54   ` Larry Finger
  0 siblings, 1 reply; 3+ messages in thread
From: Sujith Manoharan @ 2012-04-02  2:34 UTC (permalink / raw)
  To: Larry Finger
  Cc: Luis R. Rodriguez, Jouni Malinen, Vasanthakumar Thiagarajan,
	Senthil Balasubramanian, wireless

Larry Finger wrote:
> A few weeks ago, I was trying to help an inexperienced user on the openSUSE 
> Wireless Forum who was having "crashes" caused by his AR9285 device. The crash 
> turned out to be a kernel panic. As expected, I was not able to help him, and I 
> ordered one of these cards through E-bay. It just arrived, and I am able to 
> duplicate the panics. For me, they occur when changing from a WPA2-encrypted AP 
> to one with WPA encryption. When switching from WPA2 to WEP, the NULL 
> dereference does not occur; however, the connection fails.
> 
> I captured enough info from the debugging console to know that the panic results 
> from "BUG: unable to handle kernel NULL pointer dereference at (null)" that 
> originates at ath_tx_start+0x2c0/0x4b0. The kernel is x86_64 and I am currently 
> testing kernel v3.4-rc1-214-g1ac7a92 from Linus's tree. I have not yet tested 
> the wireless-testing tree, but I do not see any commits there that appear to 
> address this issue.

Should be fixed by this patch: http://marc.info/?l=linux-wireless&m=133282325727326&w=2

Sujith

^ permalink raw reply	[flat|nested] 3+ messages in thread

* Re: Kernel Panics from ath9k
  2012-04-02  2:34 ` Sujith Manoharan
@ 2012-04-02  3:54   ` Larry Finger
  0 siblings, 0 replies; 3+ messages in thread
From: Larry Finger @ 2012-04-02  3:54 UTC (permalink / raw)
  To: Sujith Manoharan, John Linville
  Cc: Luis R. Rodriguez, Jouni Malinen, Vasanthakumar Thiagarajan,
	Senthil Balasubramanian, wireless

On 04/01/2012 09:34 PM, Sujith Manoharan wrote:

> Should be fixed by this patch: http://marc.info/?l=linux-wireless&m=133282325727326&w=2

Yes, that patch does fix the problem.

BTW, that patch should be pushed to 3.4, and it should have a Cc: Stable, at 
least for 3.3. Make certain that stable gets an E-mail with the commit ID when 
the patch gets into mainline.

I wonder what the problem was with kernels as far back as 3.1. It must have a 
different cause that I still need to explore.

Thanks,

Larry



^ permalink raw reply	[flat|nested] 3+ messages in thread

end of thread, other threads:[~2012-04-02  3:54 UTC | newest]

Thread overview: 3+ messages (download: mbox.gz follow: Atom feed
-- links below jump to the message on this page --
2012-04-01 18:20 Kernel Panics from ath9k Larry Finger
2012-04-02  2:34 ` Sujith Manoharan
2012-04-02  3:54   ` Larry Finger

This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).