All of lore.kernel.org
 help / color / mirror / Atom feed
* IPVS Health Checking Best Practices
@ 2014-09-18 21:26 Alex Gartrell
  2014-09-19  4:31 ` Alexey Andriyanov
  0 siblings, 1 reply; 2+ messages in thread
From: Alex Gartrell @ 2014-09-18 21:26 UTC (permalink / raw)
  To: lvs-devel; +Cc: dsp, kernel-team, ps

Hello All,

Today, we run IPVS on a number of hosts.  Each of these hosts has a 
python process responsible for ensuring the health of pool members and 
then updating their weights as necessary.

We do these health checks via IPVS for two reasons:
1) Different VIPs have different listeners on our real servers, so we 
can't just use the regular host address
2) We want to ensure that decapsulation is happening appropriately.

The way we do this today is a giant hack.  We have a scheduler that 
we've not (yet) open sourced that does consistent hashing, and someone 
just wired in a couple additional sysctls that will allow you to do the 
following:

If a request is from $MAGIC_IP and the source port is >= $MAGIC_PORT, 
then send it to pool->members[($SRC_PORT - $MAGIC_PORT) % $N].

I'd like to solve this problem more generally.

The other solution I've heard of is using fwmarks, but that kind of 
sucks from a configuration perspective (because you have to add in all 
of the persistent vips and everything).

Here are some other ideas:

1) Map the socket itself to a particular pool with a netlink invocation 
or something

2) Provide a way to bind specific src addr, port tuples to specific 
destination (though this is a bummer because you have to reserve port space)

But I'm completely open to ideas and I think we're willing to do the 
work to make this happen.


Thanks,

-- 
Alex Gartrell <agartrell@fb.com>

^ permalink raw reply	[flat|nested] 2+ messages in thread

end of thread, other threads:[~2014-09-19  4:31 UTC | newest]

Thread overview: 2+ messages (download: mbox.gz follow: Atom feed
-- links below jump to the message on this page --
2014-09-18 21:26 IPVS Health Checking Best Practices Alex Gartrell
2014-09-19  4:31 ` Alexey Andriyanov

This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.