LVS
lvs-users
Google
 
Web LinuxVirtualServer.org

ldirectord error when heartbeat stops

To: "LinuxVirtualServer.org users mailing list." <lvs-users@xxxxxxxxxxxxxxxxxxxxxx>
Subject: ldirectord error when heartbeat stops
From: "Leon Keijser" <errtu@xxxxxxx>
Date: Mon, 24 Oct 2005 13:02:42 +0200 (MEST)
Hi,

Every time i stop heartbeat on the primary LVS, i see this in syslog:

Oct 24 12:47:10 rpzlvs01 heartbeat: debug: /etc/ha.d/resource.d/ldirectord 
stop done. RC=0
Oct 24 12:47:10 rpzlvs01 ldirectord[11928]: Removed real server:
192.168.50.121:1494 ( x 192.168.51.201:1494
Oct 24 12:47:10 rpzlvs01 ldirectord[11928]: Removed real server:
192.168.50.122:1494 ( x 192.168.51.201:1494
Oct 24 12:47:10 rpzlvs01 ldirectord[11928]: Removed virtual server:
192.168.51.201:1494
Oct 24 12:47:10 rpzlvs01 ldirectord[11928]: Linux Director Daemon terminated
on signal: TERM
Oct 24 12:47:11 rpzlvs01 heartbeat: info: Running
/etc/ha.d/resource.d/IPaddr 192.168.51.200 stop
Oct 24 12:47:11 rpzlvs01 heartbeat: debug: Starting
/etc/ha.d/resource.d/IPaddr 192.168.51.200 stop
Oct 24 12:47:11 rpzlvs01 heartbeat: info: /sbin/route -n del -host
192.168.51.200
Oct 24 12:47:11 rpzlvs01 heartbeat: info: /sbin/ifconfig eth0:0 down
Oct 24 12:47:11 rpzlvs01 heartbeat: info: IP Address 192.168.51.200 released
Oct 24 12:47:11 rpzlvs01 heartbeat: debug: /etc/ha.d/resource.d/IPaddr
192.168.51.200 stop done. RC=0
Oct 24 12:47:11 rpzlvs01 heartbeat: info: Releasing resource group: rpzlvs01
192.168.51.201 ldirectord
Oct 24 12:47:11 rpzlvs01 heartbeat: info: Running
/etc/ha.d/resource.d/ldirectord  stop
Oct 24 12:47:11 rpzlvs01 heartbeat: debug: Starting
/etc/ha.d/resource.d/ldirectord  stop
Oct 24 12:47:12 rpzlvs01 heartbeat: debug: /etc/ha.d/resource.d/ldirectord 
stop done. RC=1
Oct 24 12:47:12 rpzlvs01 heartbeat: ERROR: Return code 1 from
/etc/ha.d/resource.d/ldirectord
Oct 24 12:47:13 rpzlvs01 heartbeat: info: Retrying failed stop operation
[ldirectord]
Oct 24 12:47:13 rpzlvs01 heartbeat: info: Running
/etc/ha.d/resource.d/ldirectord  stop
Oct 24 12:47:13 rpzlvs01 heartbeat: debug: Starting
/etc/ha.d/resource.d/ldirectord  stop
Oct 24 12:47:14 rpzlvs01 heartbeat: debug: /etc/ha.d/resource.d/ldirectord 
stop done. RC=1

repeat last 4 messages .. alot of times
then:

Oct 24 12:47:30 rpzlvs01 heartbeat: ERROR: Return code 1 from
/etc/ha.d/resource.d/ldirectord
Oct 24 12:47:31 rpzlvs01 heartbeat: ERROR: Resource script for ldirectord
probably not LSB-compliant.
Oct 24 12:47:31 rpzlvs01 heartbeat: WARN: it (ldirectord) MUST succeed on a
stop when already stopped
Oct 24 12:47:31 rpzlvs01 heartbeat: WARN: Machine reboot narrowly avoided!
Oct 24 12:47:31 rpzlvs01 heartbeat: info: Running
/etc/ha.d/resource.d/IPaddr 192.168.51.201 stop
Oct 24 12:47:31 rpzlvs01 heartbeat: debug: Starting
/etc/ha.d/resource.d/IPaddr 192.168.51.201 stop
Oct 24 12:47:31 rpzlvs01 heartbeat: info: /sbin/route -n del -host
192.168.51.201
Oct 24 12:47:31 rpzlvs01 heartbeat: info: /sbin/ifconfig eth0:1 down
Oct 24 12:47:31 rpzlvs01 heartbeat: info: IP Address 192.168.51.201 released
Oct 24 12:47:31 rpzlvs01 heartbeat: debug: /etc/ha.d/resource.d/IPaddr
192.168.51.201 stop done. RC=0
Oct 24 12:47:31 rpzlvs01 heartbeat[12898]: info: All HA resources
relinquished.
Oct 24 12:47:32 rpzlvs01 heartbeat[11714]: WARN: 1 lost packet(s) for
[rpzlvs02] [1590:1592]
Oct 24 12:47:32 rpzlvs01 heartbeat[11714]: info: No pkts missing from
rpzlvs02!
Oct 24 12:47:32 rpzlvs01 heartbeat[11714]: info: killing HBFIFO process
11729 with signal 15
Oct 24 12:47:32 rpzlvs01 heartbeat[11714]: info: killing HBWRITE process
11730 with signal 15
Oct 24 12:47:32 rpzlvs01 heartbeat[11714]: info: killing HBREAD process
11731 with signal 15
Oct 24 12:47:32 rpzlvs01 heartbeat[11714]: info: Core process 11729 exited.
3 remaining
Oct 24 12:47:32 rpzlvs01 heartbeat[11714]: info: Core process 11730 exited.
2 remaining
Oct 24 12:47:32 rpzlvs01 heartbeat[11714]: info: Core process 11731 exited.
1 remaining
Oct 24 12:47:32 rpzlvs01 heartbeat[11714]: info: Heartbeat shutdown
complete.


Is this something to worry about? Heartbeat takes a long time to stop and
some clients were disconnected during the failover (even though on both
LVS's the sync daemon is running (master and backup on both boxes))


Leon

-- 
10 GB Mailbox, 100 FreeSMS/Monat http://www.gmx.net/de/go/topmail
+++ GMX - die erste Adresse für Mail, Message, More +++

<Prev in Thread] Current Thread [Next in Thread>