[lvs-users] ldirectord problem on slave node

To: lvs-users@xxxxxxxxxxxxxxxxxxxxxx
Subject: [lvs-users] ldirectord problem on slave node
From: "Tears !" <unix.co@xxxxxxxxx>
Date: Wed, 17 Sep 2008 21:27:17 +0500
Dear Members,

ldirectord is not working on the secondary node whenever the primary node is
unavailable.

Here is the heartbeat log from the secondary node:

heartbeat[21555]: 2008/09/17_21:04:42 info: Received shutdown notice from
'node1'.
heartbeat[21555]: 2008/09/17_21:04:42 info: Resources being acquired from
node1.
heartbeat[22828]: 2008/09/17_21:04:42 info: acquire local HA resources
(standby).
heartbeat[22828]: 2008/09/17_21:04:42 info: local HA resource acquisition
completed (standby).
heartbeat[22829]: 2008/09/17_21:04:42 info: No local resources
[/usr/share/heartbeat/ResourceManager listkeys tears] to acquire.
heartbeat[21555]: 2008/09/17_21:04:42 info: Standby resource acquisition
done [foreign].
harc[22854]:    2008/09/17_21:04:42 info: Running /etc/ha.d/rc.d/status
status
mach_down[22869]:       2008/09/17_21:04:42 info: Taking over resource group
192.168.2.25/24/eth0
ResourceManager[22894]: 2008/09/17_21:04:42 info: Acquiring resource group:
node1 192.168.2.25/24/eth0 ldirectord
IPaddr[22920]:  2008/09/17_21:04:43 INFO:  Resource is stopped
ResourceManager[22894]: 2008/09/17_21:04:43 info: Running
/etc/ha.d/resource.d/IPaddr 192.168.2.25/24/eth0 start
IPaddr[23017]:  2008/09/17_21:04:43 INFO: Using calculated netmask for
192.168.2.25: 255.255.255.0
IPaddr[23017]:  2008/09/17_21:04:43 INFO: eval ifconfig eth0:0
192.168.2.25 netmask 255.255.255.0 broadcast 192.168.2.255
IPaddr[22988]:  2008/09/17_21:04:43 INFO:  Success
ResourceManager[22894]: 2008/09/17_21:04:43 info: Running
/etc/ha.d/resource.d/ldirectord  start
ResourceManager[22894]: 2008/09/17_21:04:43 ERROR: Return code 2 from
/etc/ha.d/resource.d/ldirectord
ResourceManager[22894]: 2008/09/17_21:04:43 CRIT: Giving up resources due to
failure of ldirectord
ResourceManager[22894]: 2008/09/17_21:04:43 info: Releasing resource group:
node1 192.168.2.25/24/eth0 ldirectord
ResourceManager[22894]: 2008/09/17_21:04:43 info: Running
/etc/ha.d/resource.d/ldirectord  stop
ResourceManager[22894]: 2008/09/17_21:04:43 ERROR: Return code 2 from
/etc/ha.d/resource.d/ldirectord
ResourceManager[22894]: 2008/09/17_21:04:44 info: Retrying failed stop
operation [ldirectord]
ResourceManager[22894]: 2008/09/17_21:04:44 info: Running
/etc/ha.d/resource.d/ldirectord  stop
ResourceManager[22894]: 2008/09/17_21:04:44 ERROR: Return code 2 from
/etc/ha.d/resource.d/ldirectord
ResourceManager[22894]: 2008/09/17_21:04:45 info: Retrying failed stop
operation [ldirectord]
ResourceManager[22894]: 2008/09/17_21:04:45 info: Running
/etc/ha.d/resource.d/ldirectord  stop
ResourceManager[22894]: 2008/09/17_21:04:45 ERROR: Return code 2 from
/etc/ha.d/resource.d/ldirectord
ResourceManager[22894]: 2008/09/17_21:04:46 info: Retrying failed stop
operation [ldirectord]
ResourceManager[22894]: 2008/09/17_21:04:46 info: Running
/etc/ha.d/resource.d/ldirectord  stop
ResourceManager[22894]: 2008/09/17_21:04:46 ERROR: Return code 2 from
/etc/ha.d/resource.d/ldirectord
ResourceManager[22894]: 2008/09/17_21:04:47 info: Retrying failed stop
operation [ldirectord]
ResourceManager[22894]: 2008/09/17_21:04:47 info: Running
/etc/ha.d/resource.d/ldirectord  stop
ResourceManager[22894]: 2008/09/17_21:04:47 ERROR: Return code 2 from
/etc/ha.d/resource.d/ldirectord
ResourceManager[22894]: 2008/09/17_21:04:48 info: Retrying failed stop
operation [ldirectord]
ResourceManager[22894]: 2008/09/17_21:04:48 info: Running
/etc/ha.d/resource.d/ldirectord  stop
ResourceManager[22894]: 2008/09/17_21:04:48 ERROR: Return code 2 from
/etc/ha.d/resource.d/ldirectord
ResourceManager[22894]: 2008/09/17_21:04:50 info: Retrying failed stop
operation [ldirectord]
ResourceManager[22894]: 2008/09/17_21:04:50 info: Running
/etc/ha.d/resource.d/ldirectord  stop
ResourceManager[22894]: 2008/09/17_21:04:50 ERROR: Return code 2 from
/etc/ha.d/resource.d/ldirectord
ResourceManager[22894]: 2008/09/17_21:04:51 info: Retrying failed stop
operation [ldirectord]
ResourceManager[22894]: 2008/09/17_21:04:51 info: Running
/etc/ha.d/resource.d/ldirectord  stop
ResourceManager[22894]: 2008/09/17_21:04:51 ERROR: Return code 2 from
/etc/ha.d/resource.d/ldirectord
ResourceManager[22894]: 2008/09/17_21:04:52 info: Retrying failed stop
operation [ldirectord]
ResourceManager[22894]: 2008/09/17_21:04:52 info: Running
/etc/ha.d/resource.d/ldirectord  stop
ResourceManager[22894]: 2008/09/17_21:04:52 ERROR: Return code 2 from
/etc/ha.d/resource.d/ldirectord
ResourceManager[22894]: 2008/09/17_21:04:53 info: Retrying failed stop
operation [ldirectord]
ResourceManager[22894]: 2008/09/17_21:04:53 info: Running
/etc/ha.d/resource.d/ldirectord  stop
ResourceManager[22894]: 2008/09/17_21:04:53 ERROR: Return code 2 from
/etc/ha.d/resource.d/ldirectord
heartbeat[21555]: 2008/09/17_21:04:54 WARN: node node1: is dead
heartbeat[21555]: 2008/09/17_21:04:54 info: Dead node node1 gave up
resources.
heartbeat[21555]: 2008/09/17_21:04:54 info: Link node1:eth0 dead.
ResourceManager[22894]: 2008/09/17_21:04:54 info: Retrying failed stop
operation [ldirectord]
ResourceManager[22894]: 2008/09/17_21:04:54 info: Running
/etc/ha.d/resource.d/ldirectord  stop
ResourceManager[22894]: 2008/09/17_21:04:54 ERROR: Return code 2 from
/etc/ha.d/resource.d/ldirectord
ResourceManager[22894]: 2008/09/17_21:04:54 ERROR: Resource script for
ldirectord probably not LSB-compliant.
ResourceManager[22894]: 2008/09/17_21:04:54 WARN: it (ldirectord) MUST
succeed on a stop when already stopped
ResourceManager[22894]: 2008/09/17_21:04:54 WARN: Machine reboot narrowly
avoided!
ResourceManager[22894]: 2008/09/17_21:04:54 info: Running
/etc/ha.d/resource.d/IPaddr 192.168.2.25/24/eth0 stop
IPaddr[23503]:  2008/09/17_21:04:54 INFO: ifconfig eth0:0 down
IPaddr[23474]:  2008/09/17_21:04:54 INFO:  Success
mach_down[22869]:       2008/09/17_21:04:54 info:
/usr/share/heartbeat/mach_down: nice_failback: foreign resources acquired
mach_down[22869]:       2008/09/17_21:04:54 info: mach_down takeover
complete for node node1.
heartbeat[21555]: 2008/09/17_21:04:54 info: mach_down takeover complete.
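
To make the failure pattern in the log above easier to see, here is a minimal
Python sketch that rejoins the mail-wrapped log lines and counts the
"Return code N" errors per resource script and action. It is only an
illustration: the function names (join_wrapped, summarize_failures) are
hypothetical, and the regular expressions assume the log layout shown in this
message, not every heartbeat version.

```python
import re
from collections import Counter

# A new log entry starts with "name[pid]:"; anything else is a wrapped
# continuation of the previous entry (assumption based on the log above).
ENTRY_START = re.compile(r"^\S+\[\d+\]:")
# "info: Running <script> [args] start|stop"
RUNNING = re.compile(r"info: Running (\S+)(?: .*)? (start|stop)$")
# "ERROR: Return code <N> from <script>"
RETURN = re.compile(r"ERROR: Return code (\d+) from (\S+)$")


def join_wrapped(log_text):
    """Rejoin entries that the mail client wrapped across several lines."""
    entries = []
    for line in log_text.splitlines():
        line = line.strip()
        if not line:
            continue
        if ENTRY_START.match(line) or not entries:
            entries.append(line)
        else:
            entries[-1] += " " + line
    # Collapse runs of whitespace so the patterns above stay simple.
    return [" ".join(entry.split()) for entry in entries]


def summarize_failures(log_text):
    """Count failures keyed by (script, last action run, return code)."""
    counts = Counter()
    last_action = {}
    for entry in join_wrapped(log_text):
        run = RUNNING.search(entry)
        if run:
            last_action[run.group(1)] = run.group(2)
            continue
        err = RETURN.search(entry)
        if err:
            script = err.group(2)
            action = last_action.get(script, "unknown")
            counts[(script, action, int(err.group(1)))] += 1
    return counts
```

Passing the saved log text to summarize_failures() should group the errors by
script, so it is easy to see that both the start and the repeated stop calls to
/etc/ha.d/resource.d/ldirectord return code 2, which is what triggers the
"probably not LSB-compliant" message.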

Regards,

Umar
