All of lore.kernel.org
 help / color / mirror / Atom feed
* [ath9k-devel] ath9k-issue with stop RX/TX DMA - correlation with WMM traffic classes?
@ 2013-04-12  8:27 Rougu
  2013-04-12 15:41 ` Felix Fietkau
  0 siblings, 1 reply; 4+ messages in thread
From: Rougu @ 2013-04-12  8:27 UTC (permalink / raw)
  To: ath9k-devel

Hi everyone,

while I might be totally wrong with my observation, I still would like 
to share it, because this issue is lingering for years now and many 
users are waiting to a solid grip on it to fix it.

I'm running several freifunk-nodes in Cologne, Germany, where the 
firmware contains a ath9k-watchdog that restarts the node, once the 
failure occurs. One of my nodes is affected in varying intervals of 200 
.. 10000 seconds uptime, depending on WLAN activity.

(openwrt, attitude adjustment, kernel 3.3.8)

Looking at "/sys/kernel/debug/ieee80211/phy0/ath9k/xmit", it seems that
MPDUs in the highest priority queue "VO" have a lower rate of 
"completion" (90,5%) than those in "BE" (>99,9%).

 From my laymen's perspective I would expect "VO" to have an even better 
completion rate than "BE".

Any chance to see a correlation between WMM priority queue handling and 
DMA access problems?

Regards,

Rougu




root at 64700284ede8:~# uptime
  09:52:13 up 14:33,  load average: 0.00, 0.01, 0.04

root at 64700284ede8:~# cat /sys/kernel/debug/ieee80211/phy0/ath9k/xmit
Num-Tx-Queues: 10  tx-queues-setup: 0x10f poll-work-seen: 52392
                             BE         BK        VI        VO

MPDUs Queued:          1349013         22         0      8497
MPDUs Completed:       1348756         22         0      7694
MPDUs XRetried:            257          0         0       803
Aggregates:                116         13         0         0
AMPDUs Queued HW:         2448        316         0         0
AMPDUs Queued SW:          364         72         0         0
AMPDUs Completed:         2767        388         0         0
AMPDUs Retried:            281         24         0         0
AMPDUs XRetried:            44          0         0         0
FIFO Underrun:               0          0         0         0
TXOP Exceeded:               0          0         0         0
TXTIMER Expiry:              0          0         0         0
DESC CFG Error:              0          0         0         0
DATA Underrun:               0          0         0         0
DELIM Underrun:              0          0         0         0
TX-Pkts-All:           1351824        410         0      8497
TX-Bytes-All:        184784983     454112         0   1280336
hw-put-tx-buf:               1          1         0         2
hw-tx-start:           1351887        376         0      8497
hw-tx-proc-desc:       1351887        376         0      8497
TX-Failed:                   0          0         0         0
txq-memory-address:   81b7a1b4   81b7a230  81b7a138  81b7a0bc
axq-qnum:                    2          3         1         0
axq-depth:                   0          0         0         0
axq-ampdu_depth:             0          0         0         0
axq-stopped                  0          0         0         0
tx-in-progress               0          0         0         0
pending-frames               0          0         0         0
txq_headidx:                 0          0         0         0
txq_tailidx:                 0          0         0         0
axq_q empty:                   0          0         1         0
axq_acq empty:                 1          1         1         1
txq_fifo[0] empty:             1          1         1         1
txq_fifo[1] empty:             1          1         1         1
txq_fifo[2] empty:             1          1         1         1
txq_fifo[3] empty:             1          1         1         1
txq_fifo[4] empty:             1          1         1         1
txq_fifo[5] empty:             1          1         1         1
txq_fifo[6] empty:             1          1         1         1
txq_fifo[7] empty:             1          1         1         1

^ permalink raw reply	[flat|nested] 4+ messages in thread

* [ath9k-devel] ath9k-issue with stop RX/TX DMA - correlation with WMM traffic classes?
  2013-04-12  8:27 [ath9k-devel] ath9k-issue with stop RX/TX DMA - correlation with WMM traffic classes? Rougu
@ 2013-04-12 15:41 ` Felix Fietkau
  2013-04-12 16:57   ` Jan Lühr
  0 siblings, 1 reply; 4+ messages in thread
From: Felix Fietkau @ 2013-04-12 15:41 UTC (permalink / raw)
  To: ath9k-devel

On 2013-04-12 10:27 AM, Rougu wrote:
> Hi everyone,
> 
> while I might be totally wrong with my observation, I still would like 
> to share it, because this issue is lingering for years now and many 
> users are waiting to a solid grip on it to fix it.
> 
> I'm running several freifunk-nodes in Cologne, Germany, where the 
> firmware contains a ath9k-watchdog that restarts the node, once the 
> failure occurs. One of my nodes is affected in varying intervals of 200 
> .. 10000 seconds uptime, depending on WLAN activity.
> 
> (openwrt, attitude adjustment, kernel 3.3.8)
Please also run a test with OpenWrt trunk and compare stability.

> Looking at "/sys/kernel/debug/ieee80211/phy0/ath9k/xmit", it seems that
> MPDUs in the highest priority queue "VO" have a lower rate of 
> "completion" (90,5%) than those in "BE" (>99,9%).
> 
>  From my laymen's perspective I would expect "VO" to have an even better 
> completion rate than "BE".
What you're missing in that perspective is the fact that not all traffic
is created equal :)
The VO queue carries management traffic. Probe responses frequently end
up in XRetry because the number of retries is intentionally limited to 1
(some clients are often either too far away to ACK the probe response,
or do not stick around long enough to receive it).

> Any chance to see a correlation between WMM priority queue handling and 
> DMA access problems?
No, the data is too limited for that.

- Felix

^ permalink raw reply	[flat|nested] 4+ messages in thread

* [ath9k-devel] ath9k-issue with stop RX/TX DMA - correlation with WMM traffic classes?
  2013-04-12 15:41 ` Felix Fietkau
@ 2013-04-12 16:57   ` Jan Lühr
  2013-04-12 20:49     ` Felix Fietkau
  0 siblings, 1 reply; 4+ messages in thread
From: Jan Lühr @ 2013-04-12 16:57 UTC (permalink / raw)
  To: ath9k-devel

Hallo Felix,

Am 12.04.2013 um 17:41 schrieb Felix Fietkau:
> On 2013-04-12 10:27 AM, Rougu wrote:
>> while I might be totally wrong with my observation, I still would like 
>> to share it, because this issue is lingering for years now and many 
>> users are waiting to a solid grip on it to fix it.
>> 
>> I'm running several freifunk-nodes in Cologne, Germany, where the 
>> firmware contains a ath9k-watchdog that restarts the node, once the 
>> failure occurs. One of my nodes is affected in varying intervals of 200 
>> .. 10000 seconds uptime, depending on WLAN activity.
>> 
>> (openwrt, attitude adjustment, kernel 3.3.8)
> Please also run a test with OpenWrt trunk and compare stability.

thanks for providing feedback on this issue. Please keep in mind that using OpenWRT-trunk is not that easy for node operators of freifunk networks. I hope Rouge is able to do so - at least we will provide help in our Freifunk-Community (kbu).

I'd really like to discuss this issue at the upcoming Wireless Community Weekend in Berlin - will I meet you there?
At the moment we really do suffer from excessive driver-woos - causing the need to reboot nodes:
~ 1450 incidents have been reported within one month (http://register.kbu.freifunk.net/watchdog_bites). Some nodes are almost DoS'ed by certain - yet undiscovered - clients.

Thanks,
yanosz

^ permalink raw reply	[flat|nested] 4+ messages in thread

* [ath9k-devel] ath9k-issue with stop RX/TX DMA - correlation with WMM traffic classes?
  2013-04-12 16:57   ` Jan Lühr
@ 2013-04-12 20:49     ` Felix Fietkau
  0 siblings, 0 replies; 4+ messages in thread
From: Felix Fietkau @ 2013-04-12 20:49 UTC (permalink / raw)
  To: ath9k-devel

On 2013-04-12 6:57 PM, Jan L?hr wrote:
> Hallo Felix,
> 
> Am 12.04.2013 um 17:41 schrieb Felix Fietkau:
>> On 2013-04-12 10:27 AM, Rougu wrote:
>>> while I might be totally wrong with my observation, I still would like 
>>> to share it, because this issue is lingering for years now and many 
>>> users are waiting to a solid grip on it to fix it.
>>> 
>>> I'm running several freifunk-nodes in Cologne, Germany, where the 
>>> firmware contains a ath9k-watchdog that restarts the node, once the 
>>> failure occurs. One of my nodes is affected in varying intervals of 200 
>>> .. 10000 seconds uptime, depending on WLAN activity.
>>> 
>>> (openwrt, attitude adjustment, kernel 3.3.8)
>> Please also run a test with OpenWrt trunk and compare stability.
> 
> thanks for providing feedback on this issue. Please keep in mind
> that using OpenWRT-trunk is not that easy for node operators of freifunk
> networks. I hope Rouge is able to do so - at least we will provide help
> in our Freifunk-Community (kbu).
I'm also considering backporting mac80211 from trunk to AA after the
release (meant for people that build the branch from source, not for a
follow-up release).

> I'd really like to discuss this issue at the upcoming Wireless
> Community Weekend in Berlin - will I meet you there?
Yes, definitely.

> At the moment we really do suffer from excessive driver-woos -
> causing the need to reboot nodes:
> ~ 1450 incidents have been reported within one month
> (http://register.kbu.freifunk.net/watchdog_bites). Some nodes are almost
> DoS'ed by certain - yet undiscovered - clients.
I'm really interested how such a setup behaves with either trunk or a
mac80211 backport to AA. Many fixes were added that could not easily be
backported to AA because they were too intrusive for the release.

- Felix

^ permalink raw reply	[flat|nested] 4+ messages in thread

end of thread, other threads:[~2013-04-12 20:49 UTC | newest]

Thread overview: 4+ messages (download: mbox.gz follow: Atom feed
-- links below jump to the message on this page --
2013-04-12  8:27 [ath9k-devel] ath9k-issue with stop RX/TX DMA - correlation with WMM traffic classes? Rougu
2013-04-12 15:41 ` Felix Fietkau
2013-04-12 16:57   ` Jan Lühr
2013-04-12 20:49     ` Felix Fietkau

This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.