* 2.6.4 e100 NAPI - dst cache overflow and network unavailability
@ 2004-04-09 0:45 Christopher Chan
2004-04-13 13:06 ` Robert Olsson
0 siblings, 1 reply; 2+ messages in thread
From: Christopher Chan @ 2004-04-09 0:45 UTC (permalink / raw)
To: netdev
I previously sent email to the list about seeing dst cache overflows and
the kernel spewing gibberish (at which point the box usually needs to be
rebooted as you'd wait a long time for the kernel to recover) to serial
console.
For all those cases with different values in route/max-size,
route/gc_thresh and tcp_max_syn_backlog, I'd get the problems even under
sufficient network load.
However, I had NAPI enabled in the e100 driver then.
Turning NAPI off for the e100 driver has meant that the box has now been
up several days without any problems under heavy network load.
I have not tried out 2.6.5 with NAPI enabled but 2.6.5 without NAPI
enabled is stable.
^ permalink raw reply [flat|nested] 2+ messages in thread
* 2.6.4 e100 NAPI - dst cache overflow and network unavailability
2004-04-09 0:45 2.6.4 e100 NAPI - dst cache overflow and network unavailability Christopher Chan
@ 2004-04-13 13:06 ` Robert Olsson
0 siblings, 0 replies; 2+ messages in thread
From: Robert Olsson @ 2004-04-13 13:06 UTC (permalink / raw)
To: Christopher Chan; +Cc: netdev, Robert.Olsson
Christopher Chan writes:
> However, I had NAPI enabled in the e100 driver then.
>
> Turning NAPI off for the e100 driver has meant that the box has now been
> up several days without any problems under heavy network load.
>
> I have not tried out 2.6.5 with NAPI enabled but 2.6.5 without NAPI
> enabled is stable.
dst cache overflows when garbage collection cannot keep up dst entries
freed so we exceed max_size. GC is run after gc_min_interval and eventually
a RCU delay which we have discussed here and are looking into now.
So if you increase your network performance/load for any reason so more
dst entries are freed you can reach the overflow threshold. This is probably
what happens for you with NAPI driver.
You can try to decrease gc_min_interval a bit but if you are unlucky you
have run into RCU problem as well. There is one experimental patch that
seems help.
Tuning just to avoid dst cache overflows can mean you sacrifice a lot of
network performance. Anyway monitor your route cache to start with. There
is interesting stats in /proc/net/rt_cache_stat. The rtstat utility can
be handy parsing it.
Cheers.
--ro
^ permalink raw reply [flat|nested] 2+ messages in thread
end of thread, other threads:[~2004-04-13 13:06 UTC | newest]
Thread overview: 2+ messages (download: mbox.gz follow: Atom feed
-- links below jump to the message on this page --
2004-04-09 0:45 2.6.4 e100 NAPI - dst cache overflow and network unavailability Christopher Chan
2004-04-13 13:06 ` Robert Olsson
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).