Both the secondary and the master are doing the same thing intermittently.
The strange thing is that servers have been running fine for about 2 months.
The the master LB has a completely different hardware configuration then
does the backup. The only common piece of hardware is the hub to which all
the servers connect.
When the cluster does fail, only the front (incoming) side of the cluster is
failing. What I mean is that the real servers continue to pass the
keepalived tcp checks and all the real servers remain in the cluster
(ipvs -L lists all the realservers). Unfortunately, no incoming traffic is
routed to the real servers. If I simply ssh in and restart keepalived, the
cluster is brought back online right away and normal traffic resumes.
Micah
> -----Original Message-----
> From: Peter Mueller [mailto:pmueller@xxxxxxxxxxxx]
> Sent: Monday, January 12, 2004 3:44 PM
> To: 'LinuxVirtualServer.org users mailing list.'; micah@xxxxxxxxxxxxxxx
> Subject: RE: LVS freezing up
>
>
> > We've never had lockups with LVS. However in the (far
> > distance) past, we've heard about lockups from ethernet
> > drivers overflowing their counters etc. Since this is
> > occuring at short intervals and suddenly, it sounds like
> > hardware. Are you getting voltage spikes anywhere
> > (lightning?, is everything on UPSs?), any bad ethernet
> > cables? Does the node die when you push on the mobo in
> > various places, shake all the cables?
>
> It sure sounds like hardware, probably PS -> motherboard related. If you
> have an oscillascope (probably not) you can check directly!
>
> When you are failing over, are you failing back to the primary? Keep the
> secondary as the master until you figure out what's going on with the
> primary.
>
> P
>
>
|