LVS
lvs-users
Google
 
Web LinuxVirtualServer.org

heartbeat servers rebooting automatically

To: "LinuxVirtualServer.org users mailing list." <lvs-users@xxxxxxxxxxxxxxxxxxxxxx>
Subject: heartbeat servers rebooting automatically
From: ankush grover <ankushgrover1711@xxxxxxxxx>
Date: Mon, 17 Oct 2005 19:04:51 +0530
hey friends,

I am trying to configure StreamLine High Availability and Load
Balancing.Ihave 2 pcs both running FC3.
The pc linux.sunupdelhi.net <http://linux.sunupdelhi.net> is the primary one
and cluster1.sunupdelhi.net <http://cluster1.sunupdelhi.net> is the standy
linux-director.

For ex Server 1 goes down ,the 2nd server keeps on working but reboots
after the 1st Server comes up and
same happens with server1 when 2nd server comes up after rebooting.


I have added this line in haresources on both the servers

linux.sunupdelhi.net <http://linux.sunupdelhi.net/>
ldirectord::ldirectord.cf <http://ldirectord.cf/>  
LVSSyncDaemonSwap::master IPaddr2::192.168.1.59/24/eth0/192.168.1.255
<http://192.168.1.59/24/eth0/192.168.1.255>


I am using arp_ignore and apr_announce in /etc/sysctl.conf as FC3 supports
these parameters.

I am using lo:0 for VIP on both the Servers.

DEVICE=lo:0
IPADDR=192.168.1.59 <http://192.168.1.59>
NETMASK=255.255.255.255 <http://255.255.255.255>
NETWORK=192.168.1.0 <http://192.168.1.0>
BROADCAST=192.168.1.255 <http://192.168.1.255>
ONBOOT=yes
NAME=loopback

eth0 on First LVS server

DEVICE=eth0
BOOTPROTO=static
BROADCAST=192.168.1.255 <http://192.168.1.255>
IPADDR=192.168.1.42 <http://192.168.1.42>
NETMASK=255.255.255.0 <http://255.255.255.0>
NETWORK=192.168.1.0 <http://192.168.1.0>
ONBOOT=yes
TYPE=Ethernet


eth0 on Second LVS server

DEVICE=eth0
BOOTPROTO=static
BROADCAST=192.168.1.255 <http://192.168.1.255>
IPADDR=192.168.1.68 <http://192.168.1.68>
NETMASK=255.255.255.0 <http://255.255.255.0>
NETWORK=192.168.1.0 <http://192.168.1.0>
ONBOOT=yes
TYPE=Ethernet


a) 1st LVS server : RIP 192.168.1.42 <http://192.168.1.42> on eth0
VIP 192.168.1.59 <http://192.168.1.59> on lo:0
2) 2nd LVS Server: RIP 192.168.1.68 <http://192.168.1.68> on eth0
VIP 192.168.1.59 <http://192.168.1.59> on lo:0

the ldirectord .cf
# Global Directives
checktimeout=10
checkinterval=2
#fallback=127.0.0.1:80 <http://127.0.0.1:80>
autoreload=yes
logfile="/var/log/ldirectord.log"
#logfile="local0"
quiescent=yes
#autofallback
# Virtual Server for HTTP
virtual=192.168.1.59:80 <http://192.168.1.59:80>
fallback=127.0.0.1:80 <http://127.0.0.1:80>
real=192.168.1.42:80 <http://192.168.1.42:80> gate
real=192.168.1.68:80 <http://192.168.1.68:80> gate
service=http
request="index.html"
receive="Test Page"
scheduler=wlc
# persistent=600
protocol=tcp
checktype=negotiate


in /etc/ha.d/haresources

I haven't added any other line other than below line

linux.sunupdelhi.net <http://linux.sunupdelhi.net>
ldirectord::ldirectord.cf<http://ldirectord.cf>LVSSyncDaemonSwap::master
IPaddr2::
192.168.1.59/24/eth0/192.168.1.255<http://192.168.1.59/24/eth0/192.168.1.255>


in /etc/ha.d/ha.cf file

I have disabled
serial /dev/ttyS0 # Linux

enabled

bcast eth0 # Linux
node linux.sunupdelhi.net <http://linux.sunupdelhi.net>
node cluster1.sunupdelhi.net <http://cluster1.sunupdelhi.net>
ping 192.168.1.1 <http://192.168.1.1> (Gateway)
udpport 694
warntime 10
deadtime 30
keepalive 2

This is testing scenario in real time I will have 2 cards in each machine
and these 2 servers will be connected directly to each other through 2nd
cards.

in /etc/ha.d/authkeys I have changed this

2 sha1 sunupdelhi

logs file from one of the server.

/var/log/ha-log
heartbeat: 2005/10/17_18:49:37 info: /sbin/ip link set lo up
heartbeat: 2005/10/17_18:49:37 info: IP Address
192.168.1.59<http://192.168.1.59>released
heartbeat: 2005/10/17_18:49:37 info: Running
/etc/ha.d/resource.d/LVSSyncDaemonSwap master stop
heartbeat: 2005/10/17_18:49:37 ERROR: Return code 255 from
/etc/ha.d/resource.d/LVSSyncDaemonSwap
heartbeat: 2005/10/17_18:49:38 info: Retrying failed stop operation
[LVSSyncDaemonSwap::master]
heartbeat: 2005/10/17_18:49:38 info: Running
/etc/ha.d/resource.d/LVSSyncDaemonSwap master stop
heartbeat: 2005/10/17_18:49:38 ERROR: Return code 255 from
/etc/ha.d/resource.d/LVSSyncDaemonSwap
heartbeat: 2005/10/17_18:49:39 info: Retrying failed stop operation
[LVSSyncDaemonSwap::master]
heartbeat: 2005/10/17_18:49:39 info: Running
/etc/ha.d/resource.d/LVSSyncDaemonSwap master stop
heartbeat: 2005/10/17_18:49:39 ERROR: Return code 255 from
/etc/ha.d/resource.d/LVSSyncDaemonSwap
heartbeat: 2005/10/17_18:49:40 info: Retrying failed stop operation
[LVSSyncDaemonSwap::master]
heartbeat: 2005/10/17_18:49:40 info: Running
/etc/ha.d/resource.d/LVSSyncDaemonSwap master stop
heartbeat: 2005/10/17_18:49:40 ERROR: Return code 255 from
/etc/ha.d/resource.d/LVSSyncDaemonSwap
heartbeat: 2005/10/17_18:49:41 info: Retrying failed stop operation
[LVSSyncDaemonSwap::master]
heartbeat: 2005/10/17_18:49:41 info: Running
/etc/ha.d/resource.d/LVSSyncDaemonSwap master stop
heartbeat: 2005/10/17_18:49:41 ERROR: Return code 255 from
/etc/ha.d/resource.d/LVSSyncDaemonSwap
heartbeat: 2005/10/17_18:49:42 info: Retrying failed stop operation
[LVSSyncDaemonSwap::master]
heartbeat: 2005/10/17_18:49:42 info: Running
/etc/ha.d/resource.d/LVSSyncDaemonSwap master stop
heartbeat: 2005/10/17_18:49:42 ERROR: Return code 255 from
/etc/ha.d/resource.d/LVSSyncDaemonSwap
heartbeat: 2005/10/17_18:49:43 info: Retrying failed stop operation
[LVSSyncDaemonSwap::master]
heartbeat: 2005/10/17_18:49:43 info: Running
/etc/ha.d/resource.d/LVSSyncDaemonSwap master stop
heartbeat: 2005/10/17_18:49:43 ERROR: Return code 255 from
/etc/ha.d/resource.d/LVSSyncDaemonSwap
heartbeat: 2005/10/17_18:49:44 info: Retrying failed stop operation
[LVSSyncDaemonSwap::master]
heartbeat: 2005/10/17_18:49:44 info: Running
/etc/ha.d/resource.d/LVSSyncDaemonSwap master stop
heartbeat: 2005/10/17_18:49:45 ERROR: Return code 255 from
/etc/ha.d/resource.d/LVSSyncDaemonSwap
heartbeat: 2005/10/17_18:49:46 info: Retrying failed stop operation
[LVSSyncDaemonSwap::master]
heartbeat: 2005/10/17_18:49:46 info: Running
/etc/ha.d/resource.d/LVSSyncDaemonSwap master stop
heartbeat: 2005/10/17_18:49:46 ERROR: Return code 255 from
/etc/ha.d/resource.d/LVSSyncDaemonSwap
heartbeat: 2005/10/17_18:49:47 info: Retrying failed stop operation
[LVSSyncDaemonSwap::master]
heartbeat: 2005/10/17_18:49:47 info: Running
/etc/ha.d/resource.d/LVSSyncDaemonSwap master stop
heartbeat: 2005/10/17_18:49:47 ERROR: Return code 255 from
/etc/ha.d/resource.d/LVSSyncDaemonSwap
heartbeat: 2005/10/17_18:49:48 info: Retrying failed stop operation
[LVSSyncDaemonSwap::master]
heartbeat: 2005/10/17_18:49:48 info: Running
/etc/ha.d/resource.d/LVSSyncDaemonSwap master stop
heartbeat: 2005/10/17_18:49:48 ERROR: Return code 255 from
/etc/ha.d/resource.d/LVSSyncDaemonSwap
heartbeat: 2005/10/17_18:49:48 CRIT: Resource STOP failure. Reboot required!
heartbeat: 2005/10/17_18:49:48 CRIT: Killing heartbeat ungracefully!
heartbeat: 2005/10/17_18:51:12 info: **************************

Any pointer why they are rebooting.


Thanks & Regards

Ankush Grover

<Prev in Thread] Current Thread [Next in Thread>