Re: ActiveConn problem

To: micah@xxxxxxxxxxxxxxx, <lvs-users@xxxxxxxxxxxxxxxxxxxxxx>
Subject: Re: ActiveConn problem
From: awysocki@xxxxxxxxxxxxxx
Date: Wed, 3 Dec 2003 18:26:36 -0800
Found this in the documentation at 
http://www.austintek.com/LVS/LVS-HOWTO/HOWTO/LVS-HOWTO.ipvsadm.html#using_ipvsadm


I did some testing (with netmon) on this and here are my observations:
1. A connection becomes active when LVS sees the ACK flag in the TCP
header of a packet arriving at the cluster, i.e. when the socket becomes
established on the real server.
2. A connection becomes inactive when LVS sees the ACK-FIN flags in the
TCP header of a packet arriving at the cluster. This does NOT correspond
to the socket closing on the real server.
Example with my Apache Web server. 

Client                   <---> Server

A client requests an object from the web server on port 80:

SYN REQUEST     ---->
SYN ACK                  <----
ACK             ----> *** ActiveConn=1 and 1 ESTABLISHED socket on real server
HTTP GET        ----> *** The client requests the object
HTTP response   <---- *** The server sends the object
APACHE closes the socket: *** ActiveConn=1 and 0 ESTABLISHED sockets on real server
The CLIENT receives the object. (This took 15 seconds in my test.)
ACK-FIN         ----> *** ActiveConn=0 and 0 ESTABLISHED sockets on real server
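A minimal sketch for reproducing this observation, assuming shell access to
both the director and the real server (the one-second interval and port 80
simply match the test above):

   # on the director: watch the ActiveConn/InActConn columns
   watch -n1 'ipvsadm -L -n'

   # on the real server: count ESTABLISHED sockets on port 80
   watch -n1 'netstat -ant | grep ":80 " | grep -c ESTABLISHED'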


Conclusion: ActiveConn is the number of connections still active on the
CLIENT side, not on the server, in the case of short transmissions (like
objects on a web page). It's hard to calculate a server's capacity from
this number, because slower clients make ActiveConn greater than what the
server is really processing. You won't be able to reproduce that effect on
a LAN, because the client receives the segments too quickly.
On the LVS mailing list, many people have explained that the correct way
to balance the connections is to use monitoring software: the weights must
be evaluated using values measured on the real servers. In LVS-DR and
LVS-Tun, the Director can easily be fooled with invalid packets for some
period, and this can be enough to unbalance the cluster when using the
"*lc" schedulers. I reproduced the effect by connecting at 9600 bps and
fetching a 100k gif from Apache, while monitoring established sockets on
port 80 on the real server and ipvsadm on the director.
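A rough sketch of that monitoring approach, assuming LVS-DR (hence -g);
the addresses 192.168.1.1:80 and 192.168.1.10:80, the host alias
"director", and the load-to-weight formula are all placeholders:

   # run periodically on the real server, e.g. from cron:
   # derive a weight from the 1-minute load average, push it to the director
   load=$(cut -d' ' -f1 /proc/loadavg)
   weight=$(awk -v l="$load" 'BEGIN { w = int(100 / (1 + l)); print (w > 0) ? w : 1 }')
   ssh director "ipvsadm -e -t 192.168.1.1:80 -r 192.168.1.10:80 -g -w $weight"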
Julian:
You are probably using LVS-DR or LVS-Tun in your test, right? With these
methods, the LVS changes the TCP state based on the incoming packets only,
i.e. those from the clients. This is the reason the Director can't see the
FIN packet from the real server, and the reason LVS can be easily SYN
flooded, or even flooded with an ACK following the SYN packet. The LVS
can't change the TCP state according to the state in the real server; that
is possible only in VS/NAT mode. So in some situations you can have
invalid entries in ESTABLISHED state which do not correspond to any
connection in the real server (the real server effectively ignores these
SYN packets by using SYN cookies). VS/NAT looks like the better solution
against SYN flood attacks.

Of course, the ESTABLISHED timeout can be changed, to 5 minutes for
example. Currently, the maximum timeout interval (excluding the
ESTABLISHED state) is 2 minutes. If you think you can serve the clients
with a smaller timeout for the ESTABLISHED state when under an "ACK after
SYN" attack, you can change it with ipchains. You don't need to change it
below 2 minutes in LVS 0.9.7. In the latest LVS version, SYN+FIN switches
the state to TIME_WAIT, which can't be controlled using ipchains. In other
cases, you can change the timeout for the ESTABLISHED and FIN-WAIT states,
but only down to 1 minute. If this doesn't help, buy 2GB of RAM or more
for the Director.
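A sketch of the ipchains invocation for the timeouts discussed above,
assuming the masquerading timeout interface of that era (the three values
are TCP ESTABLISHED, TCP after FIN, and UDP, in seconds; 0 leaves a timer
unchanged):

   # ESTABLISHED to 5 minutes, FIN-WAIT to 1 minute, UDP untouched
   ipchains -M -S 300 60 0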
One thing that can be done, but this may be paranoia:
change the INPUT_ONLY table:

from:
           FIN
        SR ---> TW
to:
           FIN
        SR ---> FW


OK, this is an incorrect interpretation of the TCP states (on a FIN it
moves SYN_RECV to FIN_WAIT instead of TIME_WAIT), but it is a hack which
allows the minimum state timeout to be 1 minute. Now, using ipchains, we
can set the timeout for all TCP states to 1 minute.
Once this is changed, you can set the ESTABLISHED and FIN-WAIT timeouts
down to 1 minute; in the current LVS version the minimum effective timeout
for the ESTABLISHED and FIN-WAIT states is 2 minutes.
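With the table changed as above, clamping both timers to the 1-minute
minimum would then look like this (same masquerading timeout interface,
same value order as before):

   # TCP ESTABLISHED and TCP-after-FIN both to 60 seconds, UDP unchanged
   ipchains -M -S 60 60 0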