public inbox for b.a.t.m.a.n@lists.open-mesh.org
* [B.A.T.M.A.N.] Rev 972: batmand -c -d n hangs sometimes
@ 2008-02-03 11:50 Freifunk Dresden
  2008-02-03 18:27 ` elektra
  2008-02-11 12:04 ` Axel Neumann
  0 siblings, 2 replies; 12+ messages in thread
From: Freifunk Dresden @ 2008-02-03 11:50 UTC
  To: Open-Mesh, Mailinglist

Hi,

I'm currently trying out batmand-experimental Rev. 972.

I have encountered a strange behaviour. I'm running two
WRT54 routers (GL and GS).
Routing is working and there is enough memory. I have set up
a cron job that calls "batmand -c -d [2,7,8,9]" every minute to
update the status files on the ramdisk. The web interface then
reads the content of the status files, which reduces the CPU load.

The WRT54GS mostly works, but the WRT54GL hangs after a while
as described below:

The call to "batmand -c -d [2,7,8,9]" blocks batmand completely.
batmand no longer does any routing or OGM processing. As a result the
router drops out of the network.
I can still call "batmand -c -r 3" and verify with "batmand -c" that the
option was set, but OGMs are not processed.
Any call to access the debug information blocks.
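
Until the root cause is found, the status job can at least avoid piling up blocked clients. A minimal sketch in C, assuming a client reads the debug output from batmand's Unix socket: wait with poll() and give up after a deadline instead of blocking forever. The name recv_with_timeout and the -2 timeout code are made up for this example and are not part of batmand.

```c
#include <errno.h>
#include <poll.h>
#include <stddef.h>
#include <sys/types.h>
#include <sys/socket.h>

/*
 * Hypothetical helper, not batmand code.
 * Returns the number of bytes received (0 means the peer closed the
 * connection), -1 on error, or -2 if nothing arrived within timeout_ms.
 */
ssize_t recv_with_timeout(int fd, void *buf, size_t len, int timeout_ms)
{
    struct pollfd pfd = { .fd = fd, .events = POLLIN };
    int rc;

    /* Retry if a signal interrupts poll(); this restarts the full timeout. */
    do {
        rc = poll(&pfd, 1, timeout_ms);
    } while (rc < 0 && errno == EINTR);

    if (rc < 0)
        return -1;      /* poll() itself failed */
    if (rc == 0)
        return -2;      /* deadline passed, the daemon did not answer */

    return recv(fd, buf, len, 0);
}
```

A caller would treat -2 as "daemon not responding", close the socket and exit, so that a hung daemon does not leave one more blocked client behind every minute.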

After "killall batmand" and a restart, the call to "batmand -c -d x" works
several times until batmand hangs again. The process list then still shows
the "batmand -c -d x" process.

I have compiled batmand for whiterussian_rc6 with the following options:
(The email server has a problem with the assignment character, so I have
removed it in this email)
CFLAGS         -Wall -Os
LDFLAGS        -lpthread
CFLAGS_MIPS    -Wall -Os -DREVISION_VERSION $(REVISION_VERSION)
LDFLAGS_MIPS   -lpthread

I had to remove the -pg option because it would not compile with it.
Also, in whiterussian_rc the CFLAGS_MIPS/LDFLAGS_MIPS are not used (I think).

Any ideas?
/Stephan


* Re: [B.A.T.M.A.N.] Rev 972: batmand -c -d n hangs sometimes
@ 2008-02-14 20:15 Freifunk Dresden
  2008-02-15 18:27 ` Axel Neumann
  0 siblings, 1 reply; 12+ messages in thread
From: Freifunk Dresden @ 2008-02-14 20:15 UTC
  To: b.a.t.m.a.n

Hi Axel,

I have done some checks. First I added the tincd tap interface to
a bridge and passed the bridge to batmand as the interface parameter.
This has now been running for 10 hours.

The second thing I tried was calling "kill -STOP $(pidof tincd)".
On the WRT54GL batmand stopped quite quickly, but it also stopped quickly
without the "kill". On the WRT54GS, where I had never seen batmand stop, the
"kill" stopped batmand after 5-10 minutes. I could reproduce this several
times with the GS.

When calling batmand -c -a 120.56.56.56/32 I get the following output on
batmand -cd3:

[    275531] Unix socket: got connection
[    275539] got request: 10
[    275540] Unix socket: Requesting adding of HNA 120.56.56.34/32 - put this on todo list...
[    275544] got request: 10
[    275545] Unix client closed connection ...

Three batmand processes are running and batmand -c returns:
WARNING: You are using BatMan-eXp 0.3-alpha (compatibility version 10) !
/sbin/batmand [not-all-options-displayed] -r 2 -a 10.12.10.16/28 eth1 tbb

After calling "killall batmand" only one of the three batmand processes is
killed. I have to use "killall -9" to get rid of the rest.

Regards
/Stephan







* Re: [B.A.T.M.A.N.] Rev 972: batmand -c -d n hangs sometimes
@ 2008-02-17 20:48 Freifunk Dresden
  2008-02-25 11:00 ` Axel Neumann
  0 siblings, 1 reply; 12+ messages in thread
From: Freifunk Dresden @ 2008-02-17 20:48 UTC
  To: b.a.t.m.a.n

Hi Axel,

when I use rv980 and call "kill -STOP $(pidof tincd)", batmand works
for a while (1-2 minutes) and then exits with the following error:

[    411413] Error - can't send udp packet: Resource temporarily unavailable.
Connection terminated by remote host

When I restart batmand while tincd is still stopped, the OGMs are processed and
I see the neighbour nodes until I get the same error after about 1-2 minutes.
The good news: it does not block.

Because of the error message I have inserted "return 0" at posix/posix.c:705
(send_udp_packet()).
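
For context: "Resource temporarily unavailable" is the message for EAGAIN, which sendto() reports on a non-blocking socket when the packet cannot be queued right now. With tincd stopped nobody drains the tap interface, so a full send queue seems a plausible cause (an assumption, not verified against the batmand source). A narrower alternative to an unconditional "return 0" might be to ignore only such transient errors and still report everything else. The sketch below is not the code in posix/posix.c; send_udp_packet_tolerant and send_errno_is_transient are hypothetical names.

```c
#include <assert.h>
#include <errno.h>
#include <stddef.h>
#include <sys/types.h>
#include <sys/socket.h>
#include <netinet/in.h>

/* Returns 1 if a failed send only means "this packet could not go out
 * right now", 0 if it is an error the caller should know about. */
int send_errno_is_transient(int err)
{
    return err == EAGAIN || err == EWOULDBLOCK || err == EINTR;
}

/*
 * Hypothetical helper, not the batmand function.
 * Returns 0 if the packet was sent or dropped because of a transient
 * condition, -1 on any other error (errno is left set by sendto()).
 */
int send_udp_packet_tolerant(int fd, const void *buf, size_t len,
                             const struct sockaddr_in *dst)
{
    assert(buf != NULL && dst != NULL);

    ssize_t n = sendto(fd, buf, len, 0,
                       (const struct sockaddr *)dst, sizeof(*dst));
    if (n >= 0)
        return 0;

    if (send_errno_is_transient(errno))
        return 0;   /* drop this packet; OGMs are re-sent periodically anyway */

    return -1;      /* e.g. EBADF or ENETDOWN: still reported to the caller */
}
```

Compared with returning 0 unconditionally, this keeps the daemon running while an interface is temporarily blocked, but a closed or broken socket would still be noticed.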

I have tested two WRT54GLs that had the problems before, with rev980 and
the "return 0" statement. Both routers ran for 18 hours while
tincd was running without establishing a connection.
After that I called "kill -STOP $(pidof tincd)" on both routers. Both routers
have now been running for 13 hours without problems.


Please check and insert the "return 0" at posix/posix.c:705 if it is the right
solution.

Have a nice week

/Stephan


Quoting Axel Neumann <axel@open-mesh.net>:

> Hi Stephan,
>
> can you check if rv980 solves the problem ?
>
> ciao,
> axel

end of thread, other threads:[~2008-02-25 11:00 UTC | newest]

Thread overview: 12+ messages
2008-02-03 11:50 [B.A.T.M.A.N.] Rev 972: batmand -c -d n hangs sometimes Freifunk Dresden
2008-02-03 18:27 ` elektra
2008-02-12  8:27   ` Freifunk Dresden
2008-02-11 12:04 ` Axel Neumann
2008-02-12  8:47   ` Freifunk Dresden
2008-02-12 19:34     ` Axel Neumann
2008-02-13 16:48       ` Freifunk Dresden
2008-02-14  9:19         ` Axel Neumann
  -- strict thread matches above, loose matches on Subject: below --
2008-02-14 20:15 Freifunk Dresden
2008-02-15 18:27 ` Axel Neumann
2008-02-17 20:48 Freifunk Dresden
2008-02-25 11:00 ` Axel Neumann
