Antonio Forster wrote:
We had load ip_conntrack_ftp in both situations, with modular and
static kernel. thanks for the comment anyway!
Are you also load-balancing *inbound* FTP sessions in this LVS?
Humour me for a moment. On the face of it, from here, it seems highly
likely that the N:1 SNAT rule for outbound initiated connections is
incorrect - not that I'm accusing you of anything here, I am trying to
simplify the conditions.
Can you do a sequence of tests? Below, the word "active" indicates that
*either*:
A: The "active" server has all services up, the others are down, the LVS
remains configured on the director for all four; or
B: The "active" server is the *only* server configured for LVS service
on the director.
1. Attempt an FTP connection from server1 (each time) with server1,
server2, server3, server4 active in the LVS on their own (four tests).
2. Do the same sequence but with the FTP connection coming from server2,
server3, server4 in turn (with the other servers active in turn as in 1).
3. Test from server(1,2,3,4) with pairs of servers active.
4. Test from server(1,2,3,4) with triplets of servers active.
5. Finally, test from server(1,2,3,4) with all servers active.
This way, although a bit long-winded, should at least throw some light
on the problem - bear in mind that we can only see what you're telling
us, so any additional info will help!
Graeme
|