All of lore.kernel.org
 help / color / mirror / Atom feed
* [RFC] ct_sync 0.15
@ 2004-08-13 13:35 KOVACS Krisztian
  2004-08-13 14:01 ` [nf-failover] " Jonathan G - Mailing Lists
  0 siblings, 1 reply; 4+ messages in thread
From: KOVACS Krisztian @ 2004-08-13 13:35 UTC (permalink / raw)
  To: Netfilter-failover list; +Cc: netfilter-devel


  Hi,

  I think finally I managed to get ct_sync in a state where some more
public review/testing would certainly do good for the project. This week
I've been successfully running overnight stress tests on our two-node
cluster, and both of the nodes survived all of the tests without locking
up, panicking, etc. OK, I know this is not too much, but somehow I felt
this would be a bare minimum we should reach before asking anyone to try
out ct_sync. Of course, there is still a lot to do. Just to mention two
things which certainly need a lot of work: protocol implementation is
really the bare minimum which is capable of doing anything, and
expectation support is missing completely. The current implementation
supports replicating conntrack entries, and supports NAT. So basically
all the simple things (not using expectations) should work reasonably
well. This is why I've bumped the version number to 0.15.

  The aim of this e-mail is twofold:

      * First, I just thought that ct_sync is now ready for some
        testing. Unfortunately our development and testing environment
        is very limited, so anyone being able to help us out and do some
        testing would be of a big help.
      * After doing some work to stabilize the current code, there is
        now need to discuss a few things before going on with
        implementation. So, I'd be happy if there was some discussion on
        these things before starting to implement anything.

  I'll try to summarize the most important open problems I've come
through. This list does not contain things I consider trivial, for
example exporting important internal constants as tunable parameters
through sysctl.

     1. There should some facility by which one can select which
        connections have to be replicated. This way it would be possible
        to limit replication traffic to the bare minimum. For example,
        there is no point in replicating conntrack entries for
        connections whose endpoint is one of the nodes (administrative
        SSH traffic, for example). A per-conntrack flag would be needed,
        just like CONNMARK, which could be set for conntracks needing
        replication with a simple iptables rule. Actually, CONNMARK is
        enough, if we choose a given bit of the mark as the SYNC bit.
        Besides this, we should decide if we needed a SYNC or a NOSYNC
        bit, that is, if the default mode of operation should be "sync
        or not to sync".
     2. The error recovery functions in the protocol layer should be
        revamped. The protocol is a plain sequence numbered, NACK-based
        one using multicast UDP. Lost packets are detected by the
        receiver when receiving the next packet with the inappropriate
        sequence number (not 'seq of the last packet + 1') is received.
        When this occurs, the node sends a recovery request containing
        the last successfully received sequence number. When the 

-- 
 Regards,
   Krisztian KOVACS

^ permalink raw reply	[flat|nested] 4+ messages in thread

end of thread, other threads:[~2004-08-13 18:07 UTC | newest]

Thread overview: 4+ messages (download: mbox.gz follow: Atom feed
-- links below jump to the message on this page --
2004-08-13 13:35 [RFC] ct_sync 0.15 KOVACS Krisztian
2004-08-13 14:01 ` [nf-failover] " Jonathan G - Mailing Lists
2004-08-13 17:35   ` KOVACS Krisztian
2004-08-13 18:07     ` Jonathan G - Mailing Lists

This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.