* Why "No buffer space available"? [not found] <cc73a9220912281846h6a337b76ud7d9d1f61371f21e@mail.gmail.com> @ 2009-12-29 2:49 ` Medialy 2009-12-30 12:10 ` Pablo Neira Ayuso 0 siblings, 1 reply; 3+ messages in thread From: Medialy @ 2009-12-29 2:49 UTC (permalink / raw) To: netfilter Hi, I have written a program to log the nat behavior. the program works well when traffic is low. But when the traffic reaches 1Gb, program always error. According to the previous discussions about this problem, I even set the recv buffer size to 50MB and the error still exists. For every callback, format the data and then put it into the queue directly. The formating of data causes less then 1 second for 0.65 million records. Errors always occurs when there are less than 10 log records. Is anyone who can help? Thanks. Setting: Redhat Enterprise Linux 5 libnetfilter_conntrack-0.0.100 libnfnetlink-1.0.0 recv buffer size: 50MB nfct_open(CONNTRACK, NF_NETLINK_CONNTRACK_NEW|NF_NETLINK_CONNTRACK_DESTROY) 1Gb nat traffic, 0.65 million records per minute circular queue size: 1 million Error: nfct_catch error: No buffer space available Program Structure: Callback: lock; if log number > MAX_LOG_NUM: discard else: put log in circular queue log number += 1 unlock Thread 2: lock; if log number >0 : get lock number unlock process log data in circular queue lock log number = log number - log number processed unlock ^ permalink raw reply [flat|nested] 3+ messages in thread
* Re: Why "No buffer space available"? 2009-12-29 2:49 ` Why "No buffer space available"? Medialy @ 2009-12-30 12:10 ` Pablo Neira Ayuso [not found] ` <c29e3bea0912301743k7f7bda48o17ce78d4521a7585@mail.gmail.com> 0 siblings, 1 reply; 3+ messages in thread From: Pablo Neira Ayuso @ 2009-12-30 12:10 UTC (permalink / raw) To: Medialy; +Cc: netfilter Medialy wrote: > Hi, > I have written a program to log the nat behavior. the program works > well when traffic is low. But when the traffic reaches 1Gb, program > always error. > According to the previous discussions about this problem, I even set > the recv buffer size to 50MB and the error still exists. Increasing the buffer size would not solve the problem, that will only delay the ENOBUFS error. There are several reasons why you may hit ENOBUFS: a) your program is too slow to handle the Netlink messages that you receive from the kernel at a given rate. This is easier to trigger if the handling that you perform on every message takes too long. b) the queue size is too small, but this does not seem to be your case. ENOBUFS basically means that the kernel has to drop Netlink messages because your user-space program cannot back-off. > For every callback, format the data and then put it into the queue > directly. The formating of data causes less then 1 second for 0.65 > million records. > Errors always occurs when there are less than 10 log records. I don't understand what you mean here above. BTW, if you use a recent Linux kernel (>=2.6.30) you can set these two socket options not to get ENOBUFS error and to try to improve ctnetlink reliability. int on = 1; setsockopt(nfct_fd(h), SOL_NETLINK, NETLINK_BROADCAST_SEND_ERROR, &on, sizeof(int)); setsockopt(nfct_fd(h), SOL_NETLINK, NETLINK_NO_ENOBUFS, &on, sizeof(int)); ^ permalink raw reply [flat|nested] 3+ messages in thread
[parent not found: <c29e3bea0912301743k7f7bda48o17ce78d4521a7585@mail.gmail.com>]
* Re: Why "No buffer space available"? [not found] ` <c29e3bea0912301743k7f7bda48o17ce78d4521a7585@mail.gmail.com> @ 2009-12-31 11:31 ` Pablo Neira Ayuso 0 siblings, 0 replies; 3+ messages in thread From: Pablo Neira Ayuso @ 2009-12-31 11:31 UTC (permalink / raw) To: Medialy; +Cc: netfilter Medialy wrote: > Problem solved. Thanks. > BTW, sometimes the program stops at function nfct_close() and never return! I don't have an explanation for that, but it should not happen. > On Wed, Dec 30, 2009 at 8:10 PM, Pablo Neira Ayuso <pablo@netfilter.org > <mailto:pablo@netfilter.org>> wrote: > > Medialy wrote: > > Hi, > I have written a program to log the nat behavior. the program works > well when traffic is low. But when the traffic reaches 1Gb, program > always error. > According to the previous discussions about this problem, I even set > the recv buffer size to 50MB and the error still exists. > > > Increasing the buffer size would not solve the problem, that will > only delay the ENOBUFS error. There are several reasons why you may > hit ENOBUFS: > > a) your program is too slow to handle the Netlink messages that you > receive from the kernel at a given rate. This is easier to trigger > if the handling that you perform on every message takes too long. > b) the queue size is too small, but this does not seem to be your case. > > ENOBUFS basically means that the kernel has to drop Netlink messages > because your user-space program cannot back-off. > > > Medialy: > Reason: system was overloaded due to the storage capability. The > program (2 threads) was set to use last CPU. When the traffic was > heavy, most of the computing power of last CPU was occupied by the > thread which wrote Netlink messages to the stroage. Good analysis. It is a good idea to put the thread that digest the Netlink message in a spare CPU. That reduces the chances to hit ENOBUFS. I forgot to say but reducing the nice() value also help to avoid ENOBUFS. ^ permalink raw reply [flat|nested] 3+ messages in thread
end of thread, other threads:[~2009-12-31 11:31 UTC | newest] Thread overview: 3+ messages (download: mbox.gz follow: Atom feed -- links below jump to the message on this page -- [not found] <cc73a9220912281846h6a337b76ud7d9d1f61371f21e@mail.gmail.com> 2009-12-29 2:49 ` Why "No buffer space available"? Medialy 2009-12-30 12:10 ` Pablo Neira Ayuso [not found] ` <c29e3bea0912301743k7f7bda48o17ce78d4521a7585@mail.gmail.com> 2009-12-31 11:31 ` Pablo Neira Ayuso
This is a public inbox, see mirroring instructions for how to clone and mirror all data and code used for this inbox; as well as URLs for NNTP newsgroup(s).