public inbox for b.a.t.m.a.n@lists.open-mesh.org
 help / color / mirror / Atom feed
From: cmsv <cmsv@wirelesspt.net>
To: Russell Senior <russell@personaltelco.net>
Cc: Felix Fietkau <nbd@openwrt.org>,
	The list for a Better Approach To Mobile Ad-hoc Networking
	<b.a.t.m.a.n@lists.open-mesh.org>
Subject: Re: [B.A.T.M.A.N.] batman-adv: memory leak?
Date: Fri, 07 Feb 2014 22:08:00 -0500	[thread overview]
Message-ID: <52F59F90.7000204@wirelesspt.net> (raw)
In-Reply-To: <52E97715.3060306@wirelesspt.net>

[-- Attachment #1: Type: text/plain, Size: 7445 bytes --]

I have an update in regards to this matter and i have CC' ed Felix
Fietkau from openwrt (athk) here too since i am using
nbd.name/aa-mac80211.git

I decided to compile new images with the latest batman-adv stable
patches and in the process of testing the new image as well as the old
one i thought to be stable i got the routers to reboot.
This time i tested this with more routers in the mesh and was able to
replicate it.

It happens that the routers reboot when the gateway disappears either by
doing batctl gw client/off or rebooting the gw router.
This then causes the others to reboot  with Kernel panic - not syncing:
Fatal exception in interrupt.

Rebooting the gw router while maintaining gw off did not seem to reboot
the other routers. With me the problem is easy to replicate when the
router gateway which is providing gateway to the clients disappears. It'
s disappearance causes the clients to reboot.

Here is the reboot log:

[  239.410000] CPU 0 Unable to handle kernel paging request at virtual
address 0000000c, epc == 80ea7914, ra == 80ea7910
[  239.420000] Oops[#1]:
[  239.420000] Cpu 0
[  239.420000] $ 0   : 00000000 00000001 00000000 00000000
[  239.420000] $ 4   : 81b12380 80f7fb00 00000000 00000000
[  239.420000] $ 8   : 00000037 00000000 00000000 00000000
[  239.420000] $12   : 00000000 0000015f 80e82540 00000000
[  239.420000] $16   : 81adbc00 00000000 81b12380 80f3e802
[  239.420000] $20   : 80f7fb00 00000000 00000189 00000000
[  239.420000] $24   : 00000002 80e365f0
[  239.420000] $28   : 80fe6000 80fe7ae8 00000043 80ea7910
[  239.420000] Hi    : 000001d5
[  239.420000] Lo    : 0011e189
[  239.420000] epc   : 80ea7914 0x80ea7914
[  239.420000]     Tainted: G           O
[  239.420000] ra    : 80ea7910 0x80ea7910
[  239.420000] Status: 1000f403    KERNEL EXL IE
[  239.420000] Cause : 00800008
[  239.420000] BadVA : 0000000c
[  239.420000] PrId  : 00019374 (MIPS 24Kc)
[  239.420000] Modules linked in: ath79_wdt batman_adv(O) nf_nat_irc
nf_conntrack_irc nf_nat_ftp nf_conntrack_ftp ipt_MASQUERADE iptable_nat
nf_nat xt_conntrack xt_CT xt_NOTRACK iptable


_raw xt_state nf_conntrack_ipv4 nf_defrag_ipv4 nf_conntrack ipt_REJECT
xt_TCPMSS ipt_LOG xt_comment xt_multiport xt_mac xt_limit iptable_mangle
iptable_filter ip_tables xt_tcpudp x_tabl


es ath9k(O) ath9k_common(O) ath9k_hw(O) ath(O) mac80211(O) libcrc32c
crc16 cfg80211(O) compat(O) arc4 aes_generic crc32c crypto_hash
crypto_algapi gpio_button_hotplug(O)
[  239.420000] Process udhcpc (pid: 1267, threadinfo=80fe6000,
task=81af8850, tls=77929440)
[  239.420000] Stack : 00000000 00000000 00000000 00000000 0000002a
81adbc00 00000000 81adbc00
[  239.420000]         81b12000 80f3e802 81b12380 00000000 00000189
80eb1fbc 81b12000 00000000
[  239.420000]         80e8bd00 80eb86c0 00000000 00000000 00000000
801e98ac 81adbc00 00000000
[  239.420000]         81b12000 00000000 80e8bd00 80eb86c0 00000000
801ec874 00000000 80dae000
[  239.420000]         00000000 00000014 80fb7ca8 0200bc00 00000001
00000001 802e0000 81adbc00
[  239.420000]         ...
[  239.420000] Call Trace:[<80eb1fbc>] 0x80eb1fbc
[  239.420000] [<801e98ac>] 0x801e98ac
[  239.420000] [<801ec874>] 0x801ec874
[  239.420000] [<801ecd5c>] 0x801ecd5c
[  239.420000] [<8026a388>] 0x8026a388
[  239.420000] [<80218750>] 0x80218750
[  239.420000] [<802689a4>] 0x802689a4
[  239.420000] [<801dbf88>] 0x801dbf88
[  239.420000] [<80218750>] 0x80218750
[  239.420000] [<801ec874>] 0x801ec874
[  239.420000] [<80216c50>] 0x80216c50
[  239.420000] [<80218750>] 0x80218750
[  239.420000] [<801ecd5c>] 0x801ecd5c
[  239.420000] [<80216c50>] 0x80216c50
[  239.420000] [<802689b4>] 0x802689b4
[  239.420000] [<80219eb0>] 0x80219eb0
[  239.420000] [<80237bb8>] 0x80237bb8
[  239.420000] [<80239734>] 0x80239734
[  239.420000] [<8024f668>] 0x8024f668
[  239.420000] [<801101d4>] 0x801101d4
[  239.420000] [<8020e3dc>] 0x8020e3dc
[  239.420000] [<801fd38c>] 0x801fd38c
[  239.420000] [<802179f8>] 0x802179f8
[  239.420000] [<8020ff04>] 0x8020ff04
[  239.420000] [<801d8154>] 0x801d8154
[  239.420000] [<80211184>] 0x80211184
[  239.420000] [<800d8890>] 0x800d8890
[  239.420000] [<800ec6f0>] 0x800ec6f0
[  239.420000] [<801d9f58>] 0x801d9f58
[  239.420000] [<801d93dc>] 0x801d93dc
[  239.420000] [<800d9114>] 0x800d9114
[  239.420000] [<800d93dc>] 0x800d93dc
[  239.420000] [<801d9a70>] 0x801d9a70
[  239.420000] [<8006a284>] 0x8006a284
[  239.420000]
[  239.420000]
[  239.420000] Code: 0c3a9ac3  00402821  0040a821 <8c42000c> 54400052
00008021  8e050054  10a00005  8fb10010
[  239.730000] ---[ end trace 7d873dc004108502 ]---
[  239.740000] Kernel panic - not syncing: Fatal exception in interrupt
[  239.740000] Rebooting in 3 seconds..


Routers used:
dir 601a & 615c1
tplink wr703n

aa: DISTRIB_REVISION="r39154"
hostapd and mac80211 from git://nbd.name/aa-mac80211.git

hostapd: sync with trunk (as of r39155)
mac80211: sync with openwrt trunk (as of r39150)

I am able to confirm that this problem does not happen with
[batman-adv: 2013.4.0] but it does happen with 2014.0.0 and it is easy
to replicate.
currently my batman-adv 2014.0.0 package as the following patches:

$ ls feeds/routing/batman-adv/patches/
0001-batman-adv-fix-batman-adv-header-overhead-calculatio.patch
0003-batman-adv-fix-soft-interface-MTU-computation.patch
0005-batman-adv-release-vlan-object-after-checking-the-CR.patch
0002-batman-adv-fix-potential-kernel-paging-error-for-uni.patch
0004-batman-adv-fix-TT-TVLV-parsing-on-OGM-reception.patch
0007-batman-adv-use-vlan_-eth_hdr-instead-of-skb-data-in-.patch



On 01/29/2014 04:48 PM, cmsv wrote:
> inline reply:
> 
> On 01/29/2014 03:10 AM, Russell Senior wrote:
>>>>>>> "cmsv" == cmsv  <cmsv@wirelesspt.net> writes:
>>
>>
>>>> Can you paste your feeds.conf file?
>>
>> cmsv> Of course:
>>
>>
>> cmsv> for AA and batman-adv 2014.0.0 in feeds.default.conf
>>
>> cmsv> src-svn packages svn://svn.openwrt.org/openwrt/branches/packages_12.09 
>> cmsv> src-git routing git://github.com/openwrt-routing/packages.git
>>
>>
>> cmsv> For the hostapd and mentioned mac80211 you will need to clone
>> cmsv> git clone  git://nbd.name/aa-mac80211.git
>>
>> cmsv> Then obtain the specific revisions and replace the original
>> cmsv> hostapd and mac80211 from AA.
>>
>> I am not following exactly.  Do you know which change in particular
>> makes the memory leak come and go?  
> I do not know exactly what causes the leak because i don't have the leak
> in my builds and have not found a better way than the ones mentioned
> before to try to find what may cause it.
> 
> 
>> AA implies an older kernel, 3.3.8
>> or something.
> Yes 3.3.8
> 
>>
>> Also, obtain specific revisions from trunk? and then copy
>> them into the AA tree?  
> 
> Not from trunk.  I posted the wrong git before.
> git clone git://nbd.name/aa-mac80211.git
> 
>> package/kernel/mac80211 r39150 = commit 886b3c876b71122ed9523834488f373908224663
>> package/network/services/hostapd r39155 = commit 64820db4b264472e03acb9ea6b5536fa7633a8ca
>>
>> Is that right?  Do those mac80211/hostapd revisions come from
>> bisection (i.e. the last "good" rev) or happenstance?
> You have to ask the maintainer. To me they are in between AA and trunk
> in terms of stability.
> 
> 
> 
>> Thanks for clarification!
>>
>>
> 


[-- Attachment #2: OpenPGP digital signature --]
[-- Type: application/pgp-signature, Size: 901 bytes --]

  reply	other threads:[~2014-02-08  3:08 UTC|newest]

Thread overview: 35+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2014-01-21 10:22 [B.A.T.M.A.N.] [PATCH maint] batman-adv: fix soft-interface MTU computation Antonio Quartulli
2014-01-21 10:31 ` Antonio Quartulli
2014-01-27  7:57   ` Marek Lindner
2014-01-21 18:43 ` Russell Senior
2014-01-21 19:00   ` Antonio Quartulli
2014-01-22  6:04   ` Russell Senior
2014-01-22  7:03     ` Antonio Quartulli
2014-01-22  7:04     ` [B.A.T.M.A.N.] memleak (Was: [PATCH maint] batman-adv: fix soft-interface MTU computation) Antonio Quartulli
2014-01-22  7:37     ` [B.A.T.M.A.N.] [PATCH maint] batman-adv: fix soft-interface MTU computation Daniel
2014-01-22 17:45       ` [B.A.T.M.A.N.] batman-adv: memory leak? Russell Senior
2014-01-22 17:46         ` Antonio Quartulli
2014-01-22 19:18           ` Russell Senior
2014-01-22 20:49           ` cmsv
2014-01-22 23:57             ` Russell Senior
2014-01-23  0:10               ` cmsv
2014-01-23  3:35                 ` Daniel
2014-01-26 12:57                   ` Daniel
2014-01-26 14:21                     ` Antonio Quartulli
2014-01-26 14:24                       ` [B.A.T.M.A.N.] [PATCH maint] batman-adv: release vlan object after checking the CRC Antonio Quartulli
2014-01-27 18:43                         ` Russell Senior
2014-01-26 16:05                       ` [B.A.T.M.A.N.] batman-adv: memory leak? cmsv
2014-01-26 16:07                         ` Antonio Quartulli
2014-01-26 16:13                           ` Antonio Quartulli
2014-01-27 17:55                             ` cmsv
2014-01-28  1:21                               ` Russell Senior
2014-01-28  1:30                                 ` cmsv
2014-01-29  8:10                                   ` Russell Senior
2014-01-29 21:48                                     ` cmsv
2014-02-08  3:08                                       ` cmsv [this message]
2014-02-08 10:53                                         ` Felix Fietkau
2014-02-12  7:23                                         ` Antonio Quartulli
2014-02-12 10:40                                           ` cmsv
2014-02-12 11:41                                             ` Antonio Quartulli
2014-02-13  0:55                                               ` cmsv
2014-02-13  7:23                                                 ` Antonio Quartulli

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=52F59F90.7000204@wirelesspt.net \
    --to=cmsv@wirelesspt.net \
    --cc=b.a.t.m.a.n@lists.open-mesh.org \
    --cc=nbd@openwrt.org \
    --cc=russell@personaltelco.net \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox