RE: Help debugging director problems after a period of time

To: "'LinuxVirtualServer.org users mailing list.'" <lvs-users@xxxxxxxxxxxxxxxxxxxxxx>
Subject: RE: Help debugging director problems after a period of time
From: "Randy Paries" <randy.paries@xxxxxxxxxxx>
Date: Tue, 6 Apr 2004 08:47:48 -0500
Hello,

It is me again, still fighting this battle. (I am pretty sure the cause is
something I did while setting up my network and not LVS, but I hope someone
can see what I have done incorrectly.)
I did not include all the past responses, since that would have been huge.
BTW, I am a programmer, but since I own my own little company I am forced to be
a sysadmin/network guy. So be kind if I say something stupid.

Thanks for any help

Here is the Reader's Digest version of the problem:

After some amount of time (usually overnight), the director can no longer get
to the outside world.
(Config stuff at the bottom)

When the director boots, everything works just fine. After a while I can
no longer get to the outside world (not entirely true; more on that in a
minute). I notice the problem when I run
ipvsadm --list (it takes about 40 seconds to come back).
If I run ipvsadm --list -n, it comes back immediately.
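
The only difference between those two commands is that -n prints numeric
addresses instead of resolving them to host names, so the 40-second delay may
be reverse DNS lookups timing out. That is an assumption, not something
confirmed here. A minimal Python sketch to time one reverse lookup from the
director; the name time_reverse_lookup and its resolver parameter are made up
for illustration:

```python
import socket
import time


def time_reverse_lookup(ip, resolver=socket.gethostbyaddr):
    """Return (seconds_taken, hostname_or_None) for one reverse lookup.

    `resolver` is a hypothetical hook so the function can be exercised
    without touching the network; by default it uses the system resolver.
    """
    start = time.monotonic()
    try:
        # gethostbyaddr returns (hostname, aliaslist, ipaddrlist)
        name = resolver(ip)[0]
    except OSError:
        # socket.herror and socket.gaierror are both OSError subclasses
        name = None
    return time.monotonic() - start, name
```

Calling time_reverse_lookup("192.168.0.203") on the director once while it is
healthy and once after the problem appears would show whether name resolution
is where the time goes.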

Here is where it gets a little strange (at least to me):
I can ping some boxes on our network (by their public IPs).

Before that, here is some config info:
=======================================================================================
eth0      Link encap:Ethernet  HWaddr 00:08:54:D0:9E:06  
          inet addr:192.168.0.205  Bcast:192.168.0.255  Mask:255.255.255.0
          UP BROADCAST RUNNING MULTICAST  MTU:1500  Metric:1
          RX packets:52798 errors:0 dropped:0 overruns:0 frame:0
          TX packets:52953 errors:0 dropped:0 overruns:0 carrier:0
          collisions:0 txqueuelen:100 
          RX bytes:6627836 (6.3 Mb)  TX bytes:0 (0.0 b)
          Interrupt:10 Base address:0x800 

eth1      Link encap:Ethernet  HWaddr 00:01:02:E8:65:FF  
          inet addr:66.208.187.25  Bcast:66.208.187.127  Mask:255.255.255.128
          UP BROADCAST RUNNING MULTICAST  MTU:1500  Metric:1
          RX packets:22518 errors:0 dropped:0 overruns:0 frame:0
          TX packets:2082 errors:0 dropped:0 overruns:0 carrier:0
          collisions:0 txqueuelen:100 
          RX bytes:1811056 (1.7 Mb)  TX bytes:222715 (217.4 Kb)
          Interrupt:9 Base address:0x1400 

eth1:0    Link encap:Ethernet  HWaddr 00:01:02:E8:65:FF  
          inet addr:66.208.187.27  Bcast:66.208.187.27  Mask:255.255.255.255
          UP BROADCAST RUNNING MULTICAST  MTU:1500  Metric:1
          RX packets:22518 errors:0 dropped:0 overruns:0 frame:0
          TX packets:2082 errors:0 dropped:0 overruns:0 carrier:0
          collisions:0 txqueuelen:100 
          RX bytes:1811056 (1.7 Mb)  TX bytes:222715 (217.4 Kb)
          Interrupt:9 Base address:0x1400 

lo        Link encap:Local Loopback  
          inet addr:127.0.0.1  Mask:255.0.0.0
          UP LOOPBACK RUNNING  MTU:16436  Metric:1
          RX packets:8 errors:0 dropped:0 overruns:0 frame:0
          TX packets:8 errors:0 dropped:0 overruns:0 carrier:0
          collisions:0 txqueuelen:0 
          RX bytes:588 (588.0 b)  TX bytes:588 (588.0 b)

=======================================================================================
I.e.:

Ping 66.208.187.21 (one of my web servers, on a separate box) works fine

Ping 66.208.187.126 (which is the gateway) works fine

Ping 66.208.187.129 (one of the name servers) fails

Ping 66.208.187.133 (a web server on another box) fails

Ping anywhere in the outside world fails
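
The ping results line up with the eth1 netmask shown above: the addresses that
answer sit inside the director's own subnet, and the ones that fail have to go
through the gateway. A minimal Python sketch of that arithmetic, assuming the
standard ipaddress module; the names ETH1, TARGETS, on_link and classify are
only for illustration:

```python
import ipaddress

# Address and netmask copied from the eth1 ifconfig output above.
ETH1 = ipaddress.ip_interface("66.208.187.25/255.255.255.128")

# The four hosts that were pinged from the director.
TARGETS = [
    "66.208.187.21",   # web server on a separate box
    "66.208.187.126",  # gateway
    "66.208.187.129",  # name server
    "66.208.187.133",  # web server on another box
]


def on_link(target, iface=ETH1):
    """True if target is inside the interface's own subnet (no router needed)."""
    return ipaddress.ip_address(target) in iface.network


def classify(targets=TARGETS, iface=ETH1):
    """Map each target to 'on-link' or 'via gateway'."""
    return {t: ("on-link" if on_link(t, iface) else "via gateway") for t in targets}
```

Here classify() reports .21 and .126 as on-link and .129 and .133 as via
gateway, which matches which pings work and which fail. That points at routed
traffic (the default route or the gateway path) rather than the link itself,
though it does not say why it breaks after a while.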

=======================================================================================
One addition I made yesterday, taken from
http://www.linuxvirtualserver.org/VS-DRouting.html
was to put in the ARP workaround:
echo 1 > /proc/sys/net/ipv4/conf/eth1/hidden
echo 1 > /proc/sys/net/ipv4/all/hidden

Here is my ldirectord.cf
# Global Directives
checktimeout=3
checkinterval=10
fallback=127.0.0.1:80
autoreload=yes
logfile="/var/log/ldirectord.log"
#logfile="local0"
quiescent=yes

# 
virtual=66.208.187.27:80
        real=192.168.0.203:80 gate
        real=192.168.0.204:80 gate
        fallback=127.0.0.1:80 gate
        service=http
        request="ldirectordping.html"
        receive="Test Page"
        scheduler=wlc
        persistent=1800
        netmask=255.255.255.0
        protocol=tcp
#SSL part
virtual=66.208.187.27:443
        real=192.168.0.203:443 gate
        fallback=127.0.0.1:80 gate
        service=https
        request="ldirectordping.html"
        receive="Test Page"
        scheduler=wlc
        persistent=1800
        netmask=255.255.255.0
        protocol=tcp
