Questions, connection resets and other items...

To:	<lvs-users@xxxxxxxxxxxxxxxxxxxxxx>
Subject:	Questions, connection resets and other items...
From:	"Dan Brown" <danb@xxxxxx>
Date:	Wed, 22 Feb 2006 13:31:52 -0600

So I've got my two server LVS setup going with UltraMonkey 3 on a Fedora FC
4 box.  My setup has 115 IPs running under IPVS direct routing using the
UltraMonkey Streamline High Availability and Load Balanced setup.  After a
time consuming and frustrating configuration period to get both servers not
to both actively running arps for all of the Ips at the same time, I've got
a few other questions concerning configuration for which documents I've
found for are either not very specific or don't provide an answer. 

The main problem is that while my configuration is running and switching
back and forth between servers when one dies, according to the examples in
the Ultramonkey 3 docs, it shouldn't be.

1. While both servers appear to be running (I can get proper pages from both
servers either by reloading the sites from different clients or pulling the
network cables on the servers) the results of ipvsadm -L -n do not seem to
coincide properly with the examples listed.  
I've commented out all of the other IPs except for the internal IPs and
restarted heartbeat to simplify the matter.

Cluster/Virtual Service IP: 192.168.0.3
Nitehawk's IP: 192.168.0.4
Seahawk's IP: 192.168.0.5

On the server which appears to be master (going by who is accepting) it
looks like:

[root@nitehawk ~]# /etc/ha.d/resource.d/LVSSyncDaemonSwap master status
master running
(ipvs_syncmaster pid: 7194)
[root@nitehawk logs]# ldirectord ldirectord.cf status
ldirectord is stopped for /etc/ha.d/ldirectord.cf
[root@nitehawk logs]# ipvsadm -L -n
IP Virtual Server version 1.2.1 (size=4096)
Prot LocalAddress:Port Scheduler Flags
  -> RemoteAddress:Port           Forward Weight ActiveConn InActConn
TCP  192.168.0.3:80 rr
  -> 192.168.0.5:80               Route   0      0          0         
  -> 192.168.0.4:80               Local   0      0          0         
  -> 127.0.0.1:80                 Local   1      0          0         

On the server not running the ldirector:
[root@seahawk ha.d]# /etc/ha.d/resource.d/LVSSyncDaemonSwap master status
master running
(ipvs_syncmaster pid: 30630)
[root@seahawk ~]# ldirectord ldirectord.cf status
ldirectord is stopped for /etc/ha.d/ldirectord.cf
[root@seahawk ~]# ipvsadm -L -n
IP Virtual Server version 1.2.1 (size=4096)
Prot LocalAddress:Port Scheduler Flags
  -> RemoteAddress:Port           Forward Weight ActiveConn InActConn
TCP  192.168.0.3:80 rr
  -> 127.0.0.1:80                 Local   1      0          0       

The first problem I see is that both hosts are using the fallback rather
than either actual IP.  Other than removing the fallback hosts, I'm not too
sure what is going on.


The other problem I've noticed (which may be a result of the above) is that
occassionally my browser alert a message like: "The connection to the server
was reset while the page was loading".  There are no connection reset
messages in /var/log/ha-log, /var/log/ldirectord.log, nor in the Apache
logs.

Configs:
-----------------------------------------------------------------
ha.cf
-----------------------------------------------------------------
#       File to write debug messages to
debugfile /var/log/ha-debug
#
logfile /var/log/ha-log
#
keepalive 1
#
deadtime 10 
#
warntime 2
#
initdead 60 
#
udpport 694
bcast   eth1            # Linux
#
ucast eth1 192.168.0.4
#
auto_failback on
#       node    nodename ...    -- must match uname -n
node    192.168.0.4 nitehawk
node    192.168.0.5 seahawk.zu.com
#
ping 192.168.0.4
#
respawn hacluster /usr/lib64/heartbeat/ipfail
#       deadping - dead time for ping nodes
deadping 10
#               defaults to zero
debug 1
#
apiauth ipfail uid=hacluster gid=haclient




-----------------------------------------------------------------
haresources
-----------------------------------------------------------------
# actual configuration for cluster
seahawk.zu.com \
        ldirectord \
        LVSSyncDaemonSwap::master \
        IPaddr2::192.168.0.3/24/eth1/192.168.0.255 


-----------------------------------------------------------------
ldirectord.cf
-----------------------------------------------------------------
# Global Directives
checktimeout=3
checkinterval=1
#fallback=127.0.0.1:80
autoreload=no
logfile="/var/log/ldirectord.log"
#logfile="local0"
quiescent=no

# internal virtual host for all sites
virtual=192.168.0.3:80
        real=192.168.0.5:80 gate
        real=192.168.0.4:80 gate
        fallback=127.0.0.1:80 gate
        service=http
        request="index.html"
        receive="Test Message"
        scheduler=rr
        #persistent=600
        #netmask=255.255.255.255
        protocol=tcp
        checktype=negotiate


__________________________________________________________
Dan Brown
zu.com communications inc.       http://www.zu.com
Web Design - Databases - Creative
~ ~ ~ ~ ~ ~ ~ ~ ~ ~ ~ ~ ~ ~ ~ ~ ~ ~ ~ ~ ~ ~ ~ ~ ~ ~
324 Duchess Street      Saskatoon  SK     S7K 0R1
tel.1.306.653.4747      fax.1.306.653.4774

<Prev in Thread]	Current Thread	[Next in Thread>
Questions, connection resets and other items..., Dan Brown <=

Previous by Date:	Re: Unable to forward packets, Bill Omer
Next by Date:	Re: Unable to forward packets, Joseph Mack NA3T
Previous by Thread:	French LVS-mini-HOWTO on HOWTO website, Joseph Mack NA3T
Next by Thread:	persistent service & HA, Matthias Krauss
Indexes:	[Date] [Thread] [Top] [All Lists]