On Thursday, March 14, 2002, at 05:44 AM, Radomski, Mike wrote:
I have an update to the problem. It has been over 10 hours since the last
[snip]
After shutting off the serial heartbeat, the overall load dropped by
about 0.02. I have not seen the sustained spike since.
Does anyone have any suggestions on the serial connection problem?
I had major reliability issues when I tried using the serial connection
with heartbeat. I attributed it to poor chipset design from Intel. My
load balancers are 1U Celeron systems that use that crappy i810
chipset. Pretty much whenever there was any load on the server (such as
during rsync replication between the master and slave load balancers),
the serial connection would time out completely, which would cause
complete havoc on my LVS. The slave would then think the master was
down and start to bring itself up as the director. Let me tell you from
experience, it really sucks when you have two directors fighting over
ARP for the IP addresses of the LVS. So I just decided to bite the
bullet and switch from serial heartbeats to UDP. I haven't had a problem
since.
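
For anyone wanting to try the same switch, here is a minimal sketch of how it might look in heartbeat's ha.cf. The interface name, timing values, and node names are assumptions chosen for illustration and are not taken from the setup described above. The name of the broadcast directive also depends on the heartbeat release, so check the sample ha.cf shipped with your version.

```
# /etc/ha.d/ha.cf (illustrative fragment; all values are assumptions)

# Serial heartbeat disabled.
#serial /dev/ttyS0
#baud   19200

# Heartbeat over UDP broadcast instead. Older heartbeat releases
# spell this directive "udp"; later ones use "bcast".
bcast   eth1
udpport 694

# Seconds between heartbeats.
keepalive 2
# Seconds of silence before the peer is declared dead. A generous
# value makes a false takeover under load less likely.
deadtime  30

# Hypothetical hostnames; each must match `uname -n` on that machine.
node    director1
node    director2
```

Running the heartbeat over a dedicated interface or crossover cable, rather than the interface that carries LVS traffic, keeps heartbeat packets from competing with load such as rsync replication.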
-Paul Baker