Sorry this reply is so long.
ha.cf:
#Working config file for Director # 1
debugfile /var/log/ha-debug
logfile /var/log/ha-log
logfacility local0
keepalive 2
deadtime 10
#serial /dev/ttyS0 # Linux
initdead 30
bcast eth0
node Director1
node Director2
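
For anyone reading along, here is a minimal Python sketch (not part of heartbeat; the function names `parse_ha_cf` and `timing_summary` are made up for illustration) that reads the timing directives from the ha.cf above. It works out how many keepalive intervals fit into deadtime, and checks the rule of thumb that initdead should be at least twice deadtime. It assumes the values are plain integers in seconds, as they are in this file.

```python
# The active (non-comment) directives from the ha.cf posted above.
HA_CF = """\
debugfile /var/log/ha-debug
logfile /var/log/ha-log
logfacility local0
keepalive 2
deadtime 10
#serial /dev/ttyS0 # Linux
initdead 30
bcast eth0
node Director1
node Director2
"""


def parse_ha_cf(text):
    """Map each directive name to the list of values it was given."""
    directives = {}
    for raw in text.splitlines():
        line = raw.split("#", 1)[0].strip()  # drop comments
        if not line:
            continue
        parts = line.split(None, 1)
        value = parts[1].strip() if len(parts) > 1 else ""
        directives.setdefault(parts[0], []).append(value)
    return directives


def timing_summary(directives):
    """Derive simple timing facts, assuming integer values in seconds."""
    keepalive = int(directives["keepalive"][0])
    deadtime = int(directives["deadtime"][0])
    initdead = int(directives["initdead"][0])
    return {
        "missed_beats_before_dead": deadtime // keepalive,
        "initdead_at_least_twice_deadtime": initdead >= 2 * deadtime,
    }


cfg = parse_ha_cf(HA_CF)
summary = timing_summary(cfg)
print(summary)
```

With these values a node should be declared dead after about five missed heartbeats, so the timers alone would not seem to account for a takeover that takes minutes.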
haresources:
#Working haresources for Director # 1
Director1 IPaddr::xx.xxx.191.4/24/eth0:1 ldirectord::/usr/local/etc/ha.d/ldirectord.cf EmailAdmin
Director1 IPaddr::xx.xxx.191.5/24/eth0:2 ldirectord::/usr/local/etc/ha.d/ldirectord.cf EmailAdmin
Director1 IPaddr::xx.xxx.191.6/24/eth0:3 ldirectord::/usr/local/etc/ha.d/ldirectord.cf EmailAdmin
Director1 IPaddr::xx.xxx.191.7/24/eth0:4 ldirectord::/usr/local/etc/ha.d/ldirectord.cf EmailAdmin
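
To make the structure of these entries easier to see, here is a rough Python sketch (the helper name `parse_haresources` is invented for this post) that splits each haresources entry into the node name and its resources, with `::` separating a resource script from its arguments. It then counts which resources appear in more than one group. The email wrapped each entry over two lines; the sketch assumes each entry is a single line in the real file.

```python
from collections import Counter

# The four resource groups from the haresources above, one entry per line.
HARESOURCES = """\
Director1 IPaddr::xx.xxx.191.4/24/eth0:1 ldirectord::/usr/local/etc/ha.d/ldirectord.cf EmailAdmin
Director1 IPaddr::xx.xxx.191.5/24/eth0:2 ldirectord::/usr/local/etc/ha.d/ldirectord.cf EmailAdmin
Director1 IPaddr::xx.xxx.191.6/24/eth0:3 ldirectord::/usr/local/etc/ha.d/ldirectord.cf EmailAdmin
Director1 IPaddr::xx.xxx.191.7/24/eth0:4 ldirectord::/usr/local/etc/ha.d/ldirectord.cf EmailAdmin
"""


def parse_haresources(text):
    """Return a list of (node, [(resource_name, args_tuple), ...])."""
    groups = []
    for raw in text.splitlines():
        line = raw.split("#", 1)[0].strip()  # drop comments
        if not line:
            continue
        node, *resources = line.split()
        parsed = []
        for res in resources:
            name, *args = res.split("::")  # script name, then its arguments
            parsed.append((name, tuple(args)))
        groups.append((node, parsed))
    return groups


groups = parse_haresources(HARESOURCES)

# Count how many groups each (resource, arguments) pair appears in.
usage = Counter(res for _, parsed in groups for res in parsed)
shared = {res: n for res, n in usage.items() if n > 1}
print(shared)
```

On this file the sketch reports that the same ldirectord entry (with the same ldirectord.cf) and EmailAdmin are listed in all four groups, while each IPaddr entry is unique.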
ps -ax | grep heartbeat:
14160 ? SL 0:00 heartbeat: control process
14163 ? SL 0:00 heartbeat: write: bcast eth0
14164 ? SL 0:00 heartbeat: read: bcast eth0
14165 ? SL 0:01 heartbeat: master status process
ha-log:
heartbeat: 2003/06/04_10:51:21 /usr/local/lib/heartbeat/send_arp eth0
xx.xxx.191.7 00034725F49F xx.xxx.191.7 ffffffffffff
heartbeat: 2003/06/04_10:51:21 /usr/local/lib/heartbeat/send_arp eth0
xx.xxx.191.6 00034725F49F xx.xxx.191.6 ffffffffffff
heartbeat: 2003/06/04_10:51:21 /usr/local/lib/heartbeat/send_arp eth0
xx.xxx.191.5 00034725F49F xx.xxx.191.5 ffffffffffff
heartbeat: 2003/06/04_10:51:22 /usr/local/lib/heartbeat/send_arp eth0
xx.xxx.191.4 00034725F49F xx.xxx.191.4 ffffffffffff
heartbeat: 2003/06/04_10:51:23 /usr/local/lib/heartbeat/send_arp eth0
xx.xxx.191.7 00034725F49F xx.xxx.191.7 ffffffffffff
heartbeat: 2003/06/04_10:51:23 /usr/local/lib/heartbeat/send_arp eth0
xx.xxx.191.6 00034725F49F xx.xxx.191.6 ffffffffffff
heartbeat: 2003/06/04_10:51:23 /usr/local/lib/heartbeat/send_arp eth0
xx.xxx.191.5 00034725F49F xx.xxx.191.5 ffffffffffff
heartbeat: 2003/06/04_10:51:25 /usr/local/lib/heartbeat/send_arp eth0
xx.xxx.191.7 00034725F49F xx.xxx.191.7 ffffffffffff
heartbeat: 2003/06/04_10:51:25 /usr/local/lib/heartbeat/send_arp eth0
xx.xxx.191.6 00034725F49F xx.xxx.191.6 ffffffffffff
heartbeat: 2003/06/04_10:51:27 /usr/local/lib/heartbeat/send_arp eth0
xx.xxx.191.7 00034725F49F xx.xxx.191.7 ffffffffffff
heartbeat: 2003/06/04_10:51:41 info: Link Director2:eth0 up.
heartbeat: 2003/06/04_10:51:41 info: Status update for node Director2:
status up
heartbeat: 2003/06/04_10:51:41 info: Running
/usr/local/etc/ha.d/rc.d/status status
heartbeat: 2003/06/04_10:51:41 info: Status update for node Director2:
status active
heartbeat: 2003/06/04_10:51:41 info: Running
/usr/local/etc/ha.d/rc.d/status status
heartbeat: 2003/06/04_10:51:42 info: Running
/usr/local/etc/ha.d/rc.d/ip-request ip-request
heartbeat: 2003/06/04_10:51:42 info: Releasing resource group: Director1
IPaddr::xx.xxx.191.4/24/eth0:1
ldirectord::/usr/local/etc/ha.d/ldirectord.cf EmailAdmin
heartbeat: 2003/06/04_10:51:42 info: Running
/usr/local/etc/ha.d/resource.d/EmailAdmin stop
heartbeat: 2003/06/04_10:51:43 info: Running
/usr/local/etc/ha.d/resource.d/ldirectord
/usr/local/etc/ha.d/ldirectord.cf stop
heartbeat: 2003/06/04_10:51:43 info: Running
/usr/local/etc/ha.d/resource.d/IPaddr xx.xxx.191.4/24/eth0:1 stop
heartbeat: 2003/06/04_10:51:43 info: /sbin/route del -host xx.xxx.191.4
heartbeat: 2003/06/04_10:51:43 info: /sbin/ifconfig eth0:1:0 down
heartbeat: 2003/06/04_10:51:43 info: IP Address xx.xxx.191.4 released
heartbeat: 2003/06/04_10:51:43 info: Running
/usr/local/etc/ha.d/rc.d/ip-request ip-request
heartbeat: 2003/06/04_10:51:44 info: Releasing resource group: Director1
IPaddr::xx.xxx.191.5/24/eth0:2
ldirectord::/usr/local/etc/ha.d/ldirectord.cf EmailAdmin
heartbeat: 2003/06/04_10:51:44 info: Running
/usr/local/etc/ha.d/resource.d/EmailAdmin stop
heartbeat: 2003/06/04_10:51:45 info: Running
/usr/local/etc/ha.d/resource.d/ldirectord
/usr/local/etc/ha.d/ldirectord.cf stop
heartbeat: 2003/06/04_10:51:45 warning: Return code 1 from
/usr/local/etc/ha.d/resource.d/ldirectord
heartbeat: 2003/06/04_10:51:45 info: Running
/usr/local/etc/ha.d/resource.d/IPaddr xx.xxx.191.5/24/eth0:2 stop
heartbeat: 2003/06/04_10:51:45 info: /sbin/route del -host xx.xxx.191.5
heartbeat: 2003/06/04_10:51:45 info: /sbin/ifconfig eth0:2:0 down
heartbeat: 2003/06/04_10:51:45 info: IP Address xx.xxx.191.5 released
heartbeat: 2003/06/04_10:51:45 info: Running
/usr/local/etc/ha.d/rc.d/ip-request ip-request
heartbeat: 2003/06/04_10:51:45 info: Releasing resource group: Director1
IPaddr::xx.xxx.191.6/24/eth0:3
ldirectord::/usr/local/etc/ha.d/ldirectord.cf EmailAdmin
heartbeat: 2003/06/04_10:51:45 info: Running
/usr/local/etc/ha.d/resource.d/EmailAdmin stop
heartbeat: 2003/06/04_10:51:47 info: Running
/usr/local/etc/ha.d/resource.d/ldirectord
/usr/local/etc/ha.d/ldirectord.cf stop
heartbeat: 2003/06/04_10:51:47 warning: Return code 1 from
/usr/local/etc/ha.d/resource.d/ldirectord
heartbeat: 2003/06/04_10:51:47 info: Running
/usr/local/etc/ha.d/resource.d/IPaddr xx.xxx.191.6/24/eth0:3 stop
heartbeat: 2003/06/04_10:51:47 info: /sbin/route del -host xx.xxx.191.6
heartbeat: 2003/06/04_10:51:47 info: /sbin/ifconfig eth0:3:0 down
heartbeat: 2003/06/04_10:51:47 info: IP Address xx.xxx.191.6 released
heartbeat: 2003/06/04_10:51:47 info: Running
/usr/local/etc/ha.d/rc.d/ip-request ip-request
heartbeat: 2003/06/04_10:51:47 info: Releasing resource group: Director1
IPaddr::xx.xxx.191.7/24/eth0:4
ldirectord::/usr/local/etc/ha.d/ldirectord.cf EmailAdmin
heartbeat: 2003/06/04_10:51:47 info: Running
/usr/local/etc/ha.d/resource.d/EmailAdmin stop
heartbeat: 2003/06/04_10:51:48 info: Running
/usr/local/etc/ha.d/resource.d/ldirectord
/usr/local/etc/ha.d/ldirectord.cf stop
heartbeat: 2003/06/04_10:51:48 warning: Return code 1 from
/usr/local/etc/ha.d/resource.d/ldirectord
heartbeat: 2003/06/04_10:51:49 info: Running
/usr/local/etc/ha.d/resource.d/IPaddr xx.xxx.191.7/24/eth0:4 stop
heartbeat: 2003/06/04_10:51:49 info: /sbin/route del -host xx.xxx.191.7
heartbeat: 2003/06/04_10:51:49 info: /sbin/ifconfig eth0:4:0 down
heartbeat: 2003/06/04_10:51:49 info: IP Address xx.xxx.191.7 released
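
Since the mail client wrapped the log lines, here is a small Python sketch (helper names `unwrap` and `summarize` are my own, not heartbeat tools) that rejoins wrapped records and then pulls out the warnings and the released addresses. It assumes every real record starts with `heartbeat:`, which holds for the excerpt above. The sample below is copied from that excerpt.

```python
import re

# A few records copied from the log above, still wrapped as in the email.
LOG_EXCERPT = """\
heartbeat: 2003/06/04_10:51:43 info: Running
/usr/local/etc/ha.d/resource.d/ldirectord
/usr/local/etc/ha.d/ldirectord.cf stop
heartbeat: 2003/06/04_10:51:43 info: IP Address xx.xxx.191.4 released
heartbeat: 2003/06/04_10:51:45 warning: Return code 1 from
/usr/local/etc/ha.d/resource.d/ldirectord
heartbeat: 2003/06/04_10:51:45 info: IP Address xx.xxx.191.5 released
"""


def unwrap(text):
    """Join continuation lines onto the record that precedes them."""
    records = []
    for line in text.splitlines():
        if line.startswith("heartbeat:") or not records:
            records.append(line.strip())
        else:
            records[-1] += " " + line.strip()
    return records


def summarize(records):
    """Return (warning records, list of released IP addresses)."""
    warnings = [r for r in records if " warning: " in r]
    released = re.findall(r"IP Address (\S+) released", "\n".join(records))
    return warnings, released


records = unwrap(LOG_EXCERPT)
warnings, released = summarize(records)
for w in warnings:
    print(w)
print(released)
```

In the full log above, the "Return code 1" warning from the ldirectord script shows up for three of the four groups, each time after ldirectord had already been stopped once for the first group. That may just be the script reporting that it was not running, but I have not confirmed this.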
-----Original Message-----
From: lvs-users-bounces@xxxxxxxxxxxxxxxxxxxxxx
[mailto:lvs-users-bounces@xxxxxxxxxxxxxxxxxxxxxx] On Behalf Of Markus
Markert
Sent: Wednesday, June 04, 2003 11:48 AM
To: LinuxVirtualServer.org users mailing list.
Subject: Re: Fail Over
On Wednesday, 4 June 2003 18:42, AJ Lemke wrote:
> Hello List,
>
> I am running a 2 Node Cluster with fail over using Heartbeat. We
> recently noticed that when the Primary Node (Director1) is
> taken down or fails, the Secondary Node (Director2) takes up to 6 minutes
> to assume the Virtual IPs. Sometimes Director2 doesn't take over
> at all. Heartbeat checks the servers every 2 seconds and the Deadtime
> is 10 seconds. If I restart the heartbeat service on both Nodes they
> seem to work within 15 seconds the first couple of tries but then
> they seem to get confused as Director2 will not give up its resources
> when Director1 comes back on line. This is tested by shutting off the
> port on the switch or by starting and stopping the Heartbeat service.
> Any ideas as to what could be causing this problem?
>
> AJ
Please send some more information (ha.cf, haresources, a log file
excerpt, and the output of ps -ax | grep heartbeat).
Greetings,
Markus
>
> _______________________________________________
> LinuxVirtualServer.org mailing list - lvs-users@xxxxxxxxxxxxxxxxxxxxxx
> Send requests to lvs-users-request@xxxxxxxxxxxxxxxxxxxxxx
> or go to http://www.in-addr.de/mailman/listinfo/lvs-users
--
-----------------------------------------------------------
Suchtreffer AG
Bleicherstrasse 20
D-78467 Konstanz
Germany
fon: +49-(0)7531-89207-17
fax: +49-(0)7531-89207-13
e-mail: mma@xxxxxxxxxxxxxx
internet: http://www.suchtreffer.de
-----------------------------------------------------------
In a world without walls and fences,
who need gates?