Multiple load balancers problem

To:	lvs-devel@xxxxxxxxxxxxxxx
Subject:	Multiple load balancers problem
From:	Dmitry Akindinov <dimak@xxxxxxxxxxx>
Date:	Sat, 25 Aug 2012 11:37:08 +0400

Hello,

We are currently stuck with the following ipvs problem:

1. The configuration includes a (potentially large) set of servers providing various services besides HTTP (POP, IMAP, LDAP, SMTP, XMPP, etc.). The test setup includes just 2 servers, though.

2. Each server runs a stock version of CentOS 6.0

3. The application software (CommuniGate Pro) controls the ipvs kernel module using ipvsadm commands.

4. On each server, iptables is configured to:
  a) disable connection tracking for the VIP address(es)
  b) mark all packets coming to the VIP address(es) with the mark value of 100.

5. On the currently active load balancer, ipvsadm is used to configure ipvs to load-balance packets with the mark 100:

-A -f 100 -s rr -p 1
-a -f 100 -r <server1> -g
-a -f 100 -r <server2> -g
....
where the active balancer itself is one of the <serverN>

6. All other servers (just 1 "other" server in our test config) are running ipvs, but with an empty rule set.

7. The active load balancer runs the sync daemon started with ipvsadm --start-daemon master.

8. All other servers run the sync daemon started with ipvsadm --start-daemon backup.
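
For anyone trying to reproduce this setup, steps 4 through 8 can be sketched as a dry-run script that only prints the commands. The addresses are placeholders, and the choice of the raw and mangle PREROUTING chains is an assumption, since the description above does not say which tables hold the rules. The ipvsadm arguments are the ones listed in step 5.

```shell
#!/bin/sh
# Dry-run sketch: prints commands instead of executing them.
# Assumed values (not from the original message):
VIP="192.0.2.10"                  # hypothetical virtual IP
SERVERS="192.0.2.11 192.0.2.12"   # hypothetical real servers
FWMARK=100                        # mark value from step 4b

# Step 4: every server skips conntrack for the VIP and marks its packets.
# Assumption: NOTRACK in raw/PREROUTING, MARK in mangle/PREROUTING.
iptables_cmds() {
    echo "iptables -t raw -A PREROUTING -d $VIP -j NOTRACK"
    echo "iptables -t mangle -A PREROUTING -d $VIP -j MARK --set-mark $FWMARK"
}

# Step 5: active balancer only. Round-robin, 1 second persistence,
# direct routing (-g) to each real server.
active_ipvs_cmds() {
    echo "ipvsadm -A -f $FWMARK -s rr -p 1"
    for s in $SERVERS; do
        echo "ipvsadm -a -f $FWMARK -r $s -g"
    done
}

# Steps 7 and 8: "master" on the active balancer, "backup" elsewhere.
sync_daemon_cmd() {
    echo "ipvsadm --start-daemon $1"
}

iptables_cmds
active_ipvs_cmds
sync_daemon_cmd master
```

On an inactive server only iptables_cmds and "sync_daemon_cmd backup" would apply. The printed lines can be reviewed and then piped to sh as root.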

As a result, all servers have duplicate ipvs connection tables. If the active balancer fails, some other server assumes its role by arp-broadcasting the VIP and loading the ipvs rule set listed above.
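
A minimal sketch of such a takeover, again as a dry run that only prints commands. It assumes the iputils version of arping, that the VIP is already configured on the host, and that the rule set is kept in a file; the interface name and file path are placeholders, not details from this setup.

```shell
#!/bin/sh
# Dry-run sketch of a takeover; prints commands only.
# Assumed values (not from the original message):
VIP="192.0.2.10"                  # hypothetical virtual IP
IFACE="eth0"                      # hypothetical interface facing the clients
RULES="/etc/sysconfig/ipvsadm"    # hypothetical file holding the rule set

takeover_cmds() {
    # Announce the VIP with unsolicited ARP so neighbours update their caches.
    echo "arping -U -c 3 -I $IFACE $VIP"
    # Load the rule set; ipvsadm --restore reads rules from standard input.
    echo "ipvsadm --restore < $RULES"
}

takeover_cmds
```

Whether the sync daemon role should also be switched from backup to master at this point is not stated above, so the sketch leaves it out.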

When a connection is being established to the VIP address, and the active load balancer directs it to itself, everything works fine.

When a connection is being established to the VIP address, and the active load balancer directs it to some other server, the connection is established fine, and if the protocol is POP, IMAP, or SMTP, the server prompt is sent to the client via the VIP, and the client sees it just fine.

But when the client tries to send anything to the server, the packet (according to tcpdump) reaches the load balancer server, and from there it reaches the "other" server, where the packet is dropped. The client resends that packet, it goes to the active balancer, then to the "other" server, and it is dropped again.



Observations:

*) if ipvs is switched off on that "other" server (service ipvsadm stop), everything works just fine.

*) if ipvs is left running on that "other" server, but the sync daemon is switched off, everything works just fine. We are 95% sure that the problem appears only if the "other" server's ipvs connection table gets a copy of this connection from the active balancer. If the copy is not there (the sync daemon was stopped when the connection was established, and restarted immediately after), everything works just fine.

*) the problem exists for protocols like POP, IMAP, SMTP, where the server immediately sends some data (a prompt) to the client as soon as the connection is established. When the HTTP protocol is used, the problem does not exist, but only if the entire request is sent as one packet. If the HTTP connection is a "keep-alive" one, subsequent requests in the same connection do not reach the application either. I.e. it looks like the "idling" ipvs allows only one incoming data packet in, and only if there has been no outgoing packet on that connection yet.

*) Sometimes (we still cannot reproduce this reliably) the ksoftirqd threads on the "other" server jump to 100% CPU utilization, and when it happens, it happens in reaction to one connection being established.
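
One way to check the "copy of the connection" theory is to look for the client's entry in the connection table of the "other" server. The helper below is a small sketch: it filters the output of ipvsadm -L -n -c, read from standard input, for a given client address. The function name and the example address are hypothetical.

```shell
#!/bin/sh
# Reads "ipvsadm -L -n -c" output on stdin and prints the lines that
# contain the given client address followed by a colon (address:port).
# The leading space keeps 198.51.100.7 from matching inside a longer
# address that merely ends with the same digits.
conn_entries_for() {
    grep -F -- " $1:"
}

# Usage on the "other" server, as root (address is a placeholder):
#   ipvsadm -L -n -c | conn_entries_for 198.51.100.7
```

If the entry shows up there while the sync daemon is running and is absent when it is stopped, that would support the observation above.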


Received suggestions:

*) it was suggested that we use iptables to filter the packets to the VIP that come from other servers in the farm (using their MAC addresses) and direct them directly to the local application, bypassing ipvs processing. We cannot do that, as servers in the farm can be added at any moment, and updating the list of MACs on all servers is not trivial. It may be easier to filter the packets that come from the router(s), which are less numerous and do not change that often. But it does not look like a good solution. If the ipvs table on an "inactive" balancer drops packets, why would it stop dropping them when it becomes an "active" balancer? Just because there will be ipvs rules present?

*) The suggestion to separate load balancer(s) and real servers won't work for us at all.

*) We tried not to empty the ipvs table on the "other" server(s). Instead, we left it balancing, but with only one "real server": this server itself. Now, the "active" load balancer distributes packets to itself and other servers, and when the packets hit the "other" server(s), they get to ipvs again, where they are balanced again, but to the local server only.

It looks like it does solve the problem. But now the ipvs connection table on the "other" server(s) is filled both by that server's ipvs itself and by the sync daemon. While the locally generated connection table entries should be the same as the corresponding entries received from the sync daemon, it does not look good when the same table is modified from two sources.
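
For reference, the rule set on an "other" server in this last variant can be sketched as below. It assumes the same scheduler and persistence options as on the active balancer, which is not stated above, and the server address is a placeholder.

```shell
#!/bin/sh
# Dry-run sketch: prints the rule set for an inactive ("other") server,
# in the same form as the active balancer's rules.
FWMARK=100           # mark value used in this setup
SELF="192.0.2.12"    # hypothetical address of this server

local_only_rules() {
    # Assumption: same scheduler and persistence as the active balancer.
    echo "-A -f $FWMARK -s rr -p 1"
    # The only real server is this host itself, reached by direct routing.
    echo "-a -f $FWMARK -r $SELF -g"
}

local_only_rules
```

The printed lines are in the form that ipvsadm --restore reads from standard input, so "local_only_rules | ipvsadm --restore" (as root) should load them.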


Any comment, please? Should we use the last suggestion?


--
Best regards,
Dmitry Akindinov