From: Simon Wunderlich <sw@simonwunderlich.de>
To: b.a.t.m.a.n@lists.open-mesh.org
Subject: Re: [B.A.T.M.A.N.] broadcast storms
Date: Mon, 22 Oct 2018 16:26:49 +0200 [thread overview]
Message-ID: <3126005.3y7xrHjZEe@prime> (raw)
In-Reply-To: <B99A1AB9A294834C8DF94185A5E9D18101784CE751@NRTV96002.america.zf-world.com>
[-- Attachment #1: Type: text/plain, Size: 2440 bytes --]
Hi Jake,
could you make some pcap dumps on the wlan device where batman runs, and
provide that to us? Just the the full tcpdump (tcpdump -s 2000 -w /tmp/my.pcap
wlan0, assuming that wlan0 is your interface), not batctl dump? Then we can
check sequence numbers etc in wireshark.
Do you have some of your mesh nodes connected and bridged to Ethernet? If yes,
you should check the bridge loop avoidance which could also be causing this
effect, if you don't have it enabled and use such a topology:
https://www.open-mesh.org/projects/batman-adv/wiki/Bridge-loop-avoidance-II
Cheers,
Simon
On Monday, October 22, 2018 1:07:29 PM CEST Jake.Harris@zf.com wrote:
> I'm sure a similar question to this has been answered, but I am new to this
> mailing list format and don't know an efficient way to search
> https://lists.open-mesh.org/pipermail/b.a.t.m.a.n/
>
> I'm having problems with broadcast messages effectively echoing around the
> network of 50ish nodes. I attached a few seconds of the batctl tcpdump
> output. I can't seem to find a pattern to what causes this, it tends to
> happen once every two or three weeks, the storm causes problems with the
> batman program where during the storm nodes drop all their neighbors
> (batctl n shows an empty list) indefinitely, which I have worked around
> that issue via a batch script that reloads batman if the neighbor list is
> empty. Reloading successfully reconnects to the network but the storm still
> persists.
>
> The only way I've found to fix this is to reboot all the nodes at the same
> time such that the whole network is down to kill the echos.
>
> I believe I had this problem much more frequently (every 4 days or so) a
> while ago on the same network when using discrete tcp destinations for the
> nodes to communicate, the storm frequency was reduced to what it is now by
> using broadcast packets and reducing the communication rate from 12 seconds
> to once every 40 seconds.
>
> Rebooting the nodes that are responsible for the echoing messages has no
> effect, I rebooted 192.168.1.230 before running tcpdump that is attached
> and as it shows packets from 230 continued to bounce around while the node
> was powered off and after it rejoined the network. It doesn't appear
> broadcast uses a time-to-live parameter to limit the hops the packets will
> make.
>
> I'm at a loss for a way to remedy this, there seems to only be multicast
> optimizations.
[-- Attachment #2: This is a digitally signed message part. --]
[-- Type: application/pgp-signature, Size: 833 bytes --]
next prev parent reply other threads:[~2018-10-22 14:26 UTC|newest]
Thread overview: 8+ messages / expand[flat|nested] mbox.gz Atom feed top
2018-10-22 13:07 [B.A.T.M.A.N.] broadcast storms Jake.Harris
2018-10-22 14:26 ` Simon Wunderlich [this message]
2018-10-22 17:27 ` Jake.Harris
2018-10-22 18:17 ` Simon Wunderlich
2018-11-12 14:29 ` Jake.Harris
2018-11-12 17:13 ` Simon Wunderlich
2018-11-13 14:55 ` Jake.Harris
2018-11-13 15:26 ` Simon Wunderlich
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=3126005.3y7xrHjZEe@prime \
--to=sw@simonwunderlich.de \
--cc=b.a.t.m.a.n@lists.open-mesh.org \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox