* Putting for discussion "__qdisc_run" bug - strange application-dependant error
@ 2009-05-09 19:55 Panadero Pizza
2009-05-10 7:33 ` Jarek Poplawski
0 siblings, 1 reply; 5+ messages in thread
From: Panadero Pizza @ 2009-05-09 19:55 UTC (permalink / raw)
To: netdev
Hello netdev group,
I would like to put for discussion, if a bug in the __qdsic_run function could explain the problem I have described in my post "Kernel networking problem (>=2.6.24.7)" two days ago.
"__qdisc_run" is the last common function in the kernel traces I receive for the kernels:
2.6.24.7
2.6.27.12
with a multi-threaded software, using fputc, putc, puts - calls, mixed with fflush()-calls on a remote file, opened on a cifs-handle. (Errors occur extremely seldom, although the software is running for hours - day after day)
I don't receive the error with other standard software (e.g. "dd"), writing to a file on the same cifs-handle/session and more interesting:
The error does not occur with the afore-mentioned multi-threaded program (or any other software), using a
2.6.16.62
kernel.
I have found that from 2.6.24.x on, the __qdisc_run function has been made preemptible and this feature has been enabled for the realization of the afore-said 2.6.24.7 and 2.6.27.12 kernels.
Though 2.6.16.62 solves my problem, I am offering participation in finding a solution for this (which should be fixed, if it is a kernel bug).
Best regards,
David
^ permalink raw reply [flat|nested] 5+ messages in thread* Re: Putting for discussion "__qdisc_run" bug - strange application-dependant error
2009-05-09 19:55 Putting for discussion "__qdisc_run" bug - strange application-dependant error Panadero Pizza
@ 2009-05-10 7:33 ` Jarek Poplawski
2009-05-10 11:54 ` Jarek Poplawski
2009-05-11 11:33 ` Panadero Pizza
0 siblings, 2 replies; 5+ messages in thread
From: Jarek Poplawski @ 2009-05-10 7:33 UTC (permalink / raw)
To: Panadero Pizza; +Cc: netdev
Panadero Pizza wrote, On 05/09/2009 09:55 PM:
> Hello netdev group,
>
> I would like to put for discussion, if a bug in the __qdsic_run function could explain the problem I have described in my post "Kernel networking problem (>=2.6.24.7)" two days ago.
>
> "__qdisc_run" is the last common function in the kernel traces I receive for the kernels:
>
> 2.6.24.7
> 2.6.27.12
>
> with a multi-threaded software, using fputc, putc, puts - calls, mixed with fflush()-calls on a remote file, opened on a cifs-handle. (Errors occur extremely seldom, although the software is running for hours - day after day)
>
> I don't receive the error with other standard software (e.g. "dd"), writing to a file on the same cifs-handle/session and more interesting:
> The error does not occur with the afore-mentioned multi-threaded program (or any other software), using a
>
> 2.6.16.62
>
> kernel.
>
> I have found that from 2.6.24.x on, the __qdisc_run function has been made preemptible and this feature has been enabled for the realization of the afore-said 2.6.24.7 and 2.6.27.12 kernels.
>
> Though 2.6.16.62 solves my problem, I am offering participation in finding a solution for this (which should be fixed, if it is a kernel bug).
>
Since you're suggesting the specific patch has broken this for you, could
you simply write if 2.6.24.4 (or 2.6.24.7 with "NET: Add preemption point
in qdisc_run" patch reverted) also solves your problem?
Btw., you've admitted the stack could be a problem; did you try to enable
any kernel debugging for this?
Cheers,
Jarek P.
PS: continuing the discussion within one thread and limiting line width
would make it easier to track this report.
^ permalink raw reply [flat|nested] 5+ messages in thread
* Re: Putting for discussion "__qdisc_run" bug - strange application-dependant error
2009-05-10 7:33 ` Jarek Poplawski
@ 2009-05-10 11:54 ` Jarek Poplawski
2009-05-11 11:33 ` Panadero Pizza
1 sibling, 0 replies; 5+ messages in thread
From: Jarek Poplawski @ 2009-05-10 11:54 UTC (permalink / raw)
Cc: Panadero Pizza, netdev
Jarek Poplawski wrote, On 05/10/2009 09:33 AM:
> Panadero Pizza wrote, On 05/09/2009 09:55 PM
...
>> I have found that from 2.6.24.x on, the __qdisc_run function has been made preemptible and this feature has been enabled for the realization of the afore-said 2.6.24.7 and 2.6.27.12 kernels
...
> Since you're suggesting the specific patch has broken this for you, could
> you simply write if 2.6.24.4 (or 2.6.24.7 with "NET: Add preemption point
> in qdisc_run" patch reverted) also solves your problem?
Btw., let's make it clear: after this patch the __qdisc_run function hasn't
been made preemptible; it could simply exit before all work is finished and
schedule the rest for later.
Jarek P.
^ permalink raw reply [flat|nested] 5+ messages in thread
* Re: Putting for discussion "__qdisc_run" bug - strange application-dependant error
2009-05-10 7:33 ` Jarek Poplawski
2009-05-10 11:54 ` Jarek Poplawski
@ 2009-05-11 11:33 ` Panadero Pizza
2009-05-11 16:34 ` Jarek Poplawski
1 sibling, 1 reply; 5+ messages in thread
From: Panadero Pizza @ 2009-05-11 11:33 UTC (permalink / raw)
To: netdev; +Cc: Jarek Poplawski
De: Jarek Poplawski
> Since you're suggesting the specific patch has broken this for you, could
> you simply write if 2.6.24.4 (or 2.6.24.7 with "NET: Add preemption point
> in qdisc_run" patch reverted) also solves your problem?
I'm now testing the 2.6.24.7 kernel with CONFIG_PREEMPT_NONE.
(If I knew how to easily revert the patch for the __qdisc_run function,
I would like to test it, too) ...
> Btw., you've admitted the stack could be a problem; did you try to enable
> any kernel debugging for this?
Thanks for the hint! I have found the DEBUG_STACKOVERFLOW option,
which seems reasonable. Using it now, too.
Testing will now take some days, because errors are very seldom - I will
post my results here again.
Cheers,
David
^ permalink raw reply [flat|nested] 5+ messages in thread* Re: Putting for discussion "__qdisc_run" bug - strange application-dependant error
2009-05-11 11:33 ` Panadero Pizza
@ 2009-05-11 16:34 ` Jarek Poplawski
0 siblings, 0 replies; 5+ messages in thread
From: Jarek Poplawski @ 2009-05-11 16:34 UTC (permalink / raw)
To: Panadero Pizza; +Cc: netdev
Panadero Pizza wrote, On 05/11/2009 01:33 PM:
>
> De: Jarek Poplawski
>
>> Since you're suggesting the specific patch has broken this for you, could
>
>> you simply write if 2.6.24.4 (or 2.6.24.7 with "NET: Add preemption point
>> in qdisc_run" patch reverted) also solves your problem?
>
> I'm now testing the 2.6.24.7 kernel with CONFIG_PREEMPT_NONE.
> (If I knew how to easily revert the patch for the __qdisc_run function,
> I would like to test it, too) ...
You can try 2.6.24.4 which doesn't have this patch. But I doubt this
patch really matters here.
Jarek P.
^ permalink raw reply [flat|nested] 5+ messages in thread
end of thread, other threads:[~2009-05-11 16:35 UTC | newest]
Thread overview: 5+ messages (download: mbox.gz follow: Atom feed
-- links below jump to the message on this page --
2009-05-09 19:55 Putting for discussion "__qdisc_run" bug - strange application-dependant error Panadero Pizza
2009-05-10 7:33 ` Jarek Poplawski
2009-05-10 11:54 ` Jarek Poplawski
2009-05-11 11:33 ` Panadero Pizza
2009-05-11 16:34 ` Jarek Poplawski
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).