andy motten wrote:
>>
>> I will run several tests continuously on the PC (including "latency -f")
>> for the rest of this week, since I am not in the office during this period
>> (so not in the neighborhood of this problematic PC).
>> I hope (not in vain) that the latency tracer will give us a hint about the
>> reason for the hard lock-ups (if a hard lock-up happens during this
>> period).
>>
>> andy
>>
>
>
> Hello,
>
> Since we are having a hard time finding the hard lock-ups, we have taken a
> closer look at the failed tests of orocos (maybe the source of the problem
> is the same). These failures occur during the make check execution.
>
> The following tests FAILED:
> 2 - task-test (OTHER-FAULT)
> 3 - event-test (OTHER-FAULT)
> 4 - taskcontext-test (OTHER-FAULT)
>
> When we perform a single test, e.g. task-test, we get the following
> message: Killed
> The OROCOS messages are then:
>
> 0.000 [ Info ][Logger] Successfully extracted environment variable
> ORO_LOGLEVEL
> 0.001 [ Info ][Logger] OROCOS version '1.2.1' compiled with GCC
> 4.1.2.Orocos Logging Activated at level : [ Debug ] ( 6 )
> 0.001 [ Info ][Logger] Reference System Time is : 880886725351 ticks (
> 315.369 seconds ).
> 0.002 [ Info ][Logger] Logging is relative to this time.
> 0.002 [ Info ][Logger] Xenomai Periodic Timer runs in preemptive
> 'one-shot' mode.
> 0.003 [ Debug ][Logger] Xenomai Timer and Main Task Created
> 0.003 [ Debug ][Logger] MainThread started.
> 0.003 [ Debug ][Logger] Starting StartStopManager.
> 0.004 [ Info ][Toolkit] Loading Tool RealTime.
> 0.005 [ Debug ][Toolkit] Registered Type 'int' to the Orocos Type System.
> 0.005 [ Debug ][Toolkit] Registered Type 'uint' to the Orocos Type System.
> 0.006 [ Debug ][Toolkit] Registered Type 'double' to the Orocos Type
> System.
> 0.006 [ Debug ][Toolkit] Registered Type 'bool' to the Orocos Type System.
> 0.006 [ Debug ][Toolkit] Registered Type 'PropertyBag' to the Orocos Type
> System.
> 0.007 [ Debug ][Toolkit] Registered Type 'float' to the Orocos Type System.
> 0.007 [ Debug ][Toolkit] Registered Type 'char' to the Orocos Type System.
> 0.008 [ Debug ][Toolkit] Registered Type 'array' to the Orocos Type System.
> 0.008 [ Debug ][Toolkit] Registered Type 'string' to the Orocos Type
> System.
> 0.010 [ Debug ][./task-test::main()] ORO_main starting...
> 0.010 [ Info ][./task-test::main()] LogLevel unaltered by test-runner.
> 0.011 [ Info ][./task-test::main()] Creating PeriodicThread for scheduler:
> 0
> 0.012 [ Info ][TimerThreadInstance] PeriodicThread created with scheduler
> type '0', priority 15 and period 0.01.
> 0.013 [ Debug ][Logger] Periodic Thread TimerThreadInstance started.
> 0.014 [ Info ][PThread] PeriodicThread created with scheduler type '0',
> priority 99 and period 0.1.
> 0.014 [ Debug ][Logger] Periodic Thread PThread started.
> 0.115 [ Debug ][Logger] Periodic Thread PThread stopping... done.
> 0.115 [ Debug ][Logger] Periodic Thread PThread started.
> 1.216 [ Debug ][Logger] Periodic Thread PThread stopping... done.
> 1.216 [ Debug ][~PeriodicThread] Terminating PThread
>
> On the serial console we get the following listing (complete listing in
> appendix):
>
> Xenomai: starting native API services.
> I-pipe: Detected illicit call from domain 'Xenomai'
> into a service reserved for domain 'Linux' and below.
> f635be74 00000000 00000000 52544149 f635be98 c0104789 c02cfa4f
> c02f5b80
> f6c4e2f0 f635beb0 c0137d69 c02c256c c02c1186 c02c01b8 f8c0b280
> f635bebc
> c0132981 f60a1730 f635bed8 f8bd8570 c010ef8c 00000000 f60a0120
> f8beefe0
> Call Trace:
> [] show_trace_log_lvl+0x1f/0x35
> [] show_stack_log_lvl+0xaa/0xcf
> [] show_stack+0x2f/0x36
> [] ipipe_check_context+0x7a/0x81
> [] module_put+0x19/0x7d
> [] xnshadow_unmap+0xbc/0xff [xeno_nucleus]
> [] __shadow_delete_hook+0x25/0x27 [xeno_native]
> [] xnpod_delete_thread+0x1b9/0x2aa [xeno_nucleus]
> [] rt_task_delete+0x140/0x145 [xeno_native]
> [] __rt_task_delete+0x58/0x69 [xeno_native]
> [] hisyscall_event+0x185/0x291 [xeno_nucleus]
> [] __ipipe_dispatch_event+0xc0/0x1da
> [] __ipipe_syscall_root+0x43/0x10a
> [] system_call+0x29/0x41
> =======================

That specific Xenomai bug should be fixed in 2.4; please check your test
case against e.g. -rc1. Unfortunately, we have no backport of the fix to
2.3 yet. I can't tell right now whether that is tricky, but this test
demonstrates that $SOMETHING should be done...

Jan
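
P.S. When re-running the three failing tests one by one against -rc1, it may help to record how each one ends, since a bare "Killed" is what the shell typically prints for a process terminated by SIGKILL. The following is only a rough sketch in Python, not part of Orocos or Xenomai: the helper names (describe_exit, run_test) are made up for illustration, and the test paths are assumed to be relative to the directory where make check built them.

```python
import signal
import subprocess


def describe_exit(returncode):
    """Turn a subprocess return code into a short human-readable string."""
    if returncode < 0:
        # On POSIX, subprocess reports death by signal N as return code -N.
        try:
            name = signal.Signals(-returncode).name
        except ValueError:
            # Signal number without a symbolic name (e.g. a real-time signal).
            name = "signal %d" % -returncode
        return "killed by %s" % name
    return "exited with status %d" % returncode


def run_test(argv):
    """Run one test binary (hypothetical helper) and report how it ended."""
    proc = subprocess.run(argv)
    return describe_exit(proc.returncode)


if __name__ == "__main__":
    # Test names come from the 'make check' report; the paths are an assumption.
    for test in ["./task-test", "./event-test", "./taskcontext-test"]:
        try:
            print("%s: %s" % (test, run_test([test])))
        except OSError as exc:
            print("%s: could not be started (%s)" % (test, exc))
```

This only tells you which signal ended the process; the I-pipe message and call trace still have to be read from the serial console.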