Hello,
We are problems with keepalived 1.1.7 on RHEL 5.3, running on a DL380 with 6
ethernet ports, two onboard and four on a quad card. The quad uses the
e1000 drivers. There has been a lot of discussion on Red Hat and other
sites about MSI, much of which is too detailed for me to understand, but it
does point to some issues with RHEL 5.3, e1000 and bonding which are similar
to what we are seeing here.
Can anyone help, or give me a pointer to another mail list ?
For the sake of making the configuration as simple as possible, we have
stripped down the configuration file.
! configuration file for keepalived - MASTER conf
global_defs {
router_id VRRP_01
}
vrrp_instance VI_1 {
! we are the Master
state MASTER
interface eth0
virtual_router_id 51
priority 100
advert_int 1
virtual_ipaddress {
10.150.25.52
}
}
With version 1.1.7 we get the following errors
[root@Server1 ~]# tail -f /var/log/messages
Nov 2 13:51:10 Server1 Keepalived: Starting Keepalived v1.1.17 (07/27,2009)
Nov 2 13:51:10 Server1 Keepalived: Remove a zombie pid file
/var/run/keepalived.pid
Nov 2 13:51:10 Server1 Keepalived_healthcheckers: Using MII-BMSR NIC
polling thread...
Nov 2 13:51:10 Server1 Keepalived_healthcheckers: SIOCGMIIREG on eth2
failed: Input/output error
Nov 2 13:51:10 Server1 last message repeated 8 times
Nov 2 13:51:10 Server1 Keepalived: Starting Healthcheck child process,
pid=13094
Nov 2 13:51:10 Server1 Keepalived: Using MII-BMSR NIC polling thread...
Nov 2 13:51:10 Server1 Keepalived: SIOCGMIIREG on eth2 failed: Input/output
error
Nov 2 13:51:10 Server1 last message repeated 10 times
Nov 2 13:51:10 Server1 Keepalived_healthcheckers: SIOCGMIIREG on eth2
failed: Input/output error
Nov 2 13:51:10 Server1 Keepalived: SIOCGMIIREG on eth2 failed: Input/output
error
Nov 2 13:51:10 Server1 last message repeated 10 times
Nov 2 13:51:10 Server1 Keepalived: SIOCGMIIREG on eth3 failed: Input/output
error
Nov 2 13:51:10 Server1 last message repeated 21 times
Nov 2 13:51:10 Server1 Keepalived: SIOCGMIIREG on eth4 failed: Input/output
error
Nov 2 13:51:10 Server1 last message repeated 8 times
On version 1.1.16 we also get errors
[root@Server1 ~]# tail -f /var/log/messages
Nov 2 14:06:32 Server1 kernel: device eth0 entered promiscuous mode
Nov 2 14:06:34 Server1 Keepalived: Starting Keepalived v1.1.16 (07/27,2009)
Nov 2 14:06:34 Server1 Keepalived_healthcheckers: Using MII-BMSR NIC
polling thread...
Nov 2 14:06:34 Server1 Keepalived_healthcheckers: SIOCGMIIREG on eth2
failed: Input/output error
Nov 2 14:06:34 Server1 last message repeated 8 times
Nov 2 14:06:34 Server1 Keepalived: Starting Healthcheck child process,
pid=19543
Nov 2 14:06:34 Server1 Keepalived: Starting VRRP child process, pid=19544
Nov 2 14:06:34 Server1 Keepalived_healthcheckers: SIOCGMIIREG on eth2
failed: Input/output error
Nov 2 14:06:34 Server1 last message repeated 10 times
Nov 2 14:06:34 Server1 Keepalived_vrrp: Using MII-BMSR NIC polling
thread...
Nov 2 14:06:34 Server1 Keepalived_healthcheckers: SIOCGMIIREG on eth2
failed: Input/output error
Nov 2 14:06:34 Server1 Keepalived_healthcheckers: SIOCGMIIREG on eth2
failed: Input/output error
Nov 2 14:06:34 Server1 Keepalived_healthcheckers: SIOCGMIIREG on eth3
failed: Input/output error
Nov 2 14:06:34 Server1 last message repeated 12 times
Nov 2 14:06:34 Server1 Keepalived_vrrp: SIOCGMIIREG on eth2 failed:
Input/output error
Nov 2 14:06:34 Server1 Keepalived_healthcheckers: SIOCGMIIREG on eth3
failed: Input/output error
Nov 2 14:06:34 Server1 Keepalived_vrrp: SIOCGMIIREG on eth2 failed:
Input/output error
Nov 2 14:06:34 Server1 Keepalived_healthcheckers: SIOCGMIIREG on eth3
failed: Input/output error
On version 1.1.15 we do not get errors
[root@Server1 home]# tail -f /var/log/messages
Nov 2 14:12:35 Server1 kernel: device eth0 entered promiscuous mode
Nov 2 14:12:37 Server1 Keepalived: Starting Keepalived v1.1.15 (07/27,2009)
Nov 2 14:12:37 Server1 Keepalived: Starting Healthcheck child process,
pid=22110
Nov 2 14:12:37 Server1 Keepalived_healthcheckers: Using MII-BMSR NIC
polling thread...
Nov 2 14:12:37 Server1 Keepalived_vrrp: Using MII-BMSR NIC polling
thread...
Nov 2 14:12:37 Server1 Keepalived: Starting VRRP child process, pid=22111
Nov 2 14:12:37 Server1 Keepalived_healthcheckers: Registering Kernel
netlink reflector
Nov 2 14:12:37 Server1 Keepalived_healthcheckers: Registering Kernel
netlink command channel
Nov 2 14:12:37 Server1 Keepalived_vrrp: Registering Kernel netlink
reflector
Nov 2 14:12:37 Server1 Keepalived_vrrp: Registering Kernel netlink command
channel
Nov 2 14:12:37 Server1 Keepalived_vrrp: Registering gratutious ARP shared
channel
Nov 2 14:12:37 Server1 Keepalived_vrrp: Opening file
'/etc/keepalived/keepalived.conf'.
Nov 2 14:12:37 Server1 Keepalived_vrrp: Configuration is using : 33972
Bytes
Nov 2 14:12:37 Server1 Keepalived_healthcheckers: Opening file
'/etc/keepalived/keepalived.conf'.
Nov 2 14:12:37 Server1 Keepalived_healthcheckers: Configuration is using :
5105 Bytes
Nov 2 14:12:38 Server1 Keepalived_vrrp: VRRP_Instance(VI_1) Transition to
MASTER STATE
Nov 2 14:12:39 Server1 Keepalived_vrrp: VRRP_Instance(VI_1) Entering MASTER
STATE
Nov 2 14:12:39 Server1 avahi-daemon[3597]: Registering new address record
for 10.150.25.52 on eth0.
Nov 2 14:13:12 Server1 kernel: device eth0 left promiscuous mode
Here is the system and network configuration
KERNEL VERSION
[root@Server1 ~]# cat /proc/version
Linux version 2.6.18-128.el5 (mockbuild@xxxxxxxxxxxxxxxxxxxxxxxxxxx) (gcc
version 4.1.2 20080704 (Red Hat 4.1.2-44)) #1 SMP Wed Dec 17 11:42:39 EST
2008
[root@Server1 ~]# uname -a
Linux Server1 2.6.18-128.el5 #1 SMP Wed Dec 17 11:42:39 EST 2008 i686 i686
i386 GNU/Linux
[root@Server1 ~]# cat /etc/issue
Red Hat Enterprise Linux Server release 5.3 (Tikanga)
Kernel \r on an \m
ETHRENET CONFIGURATION
[root@Server1 ~]# ifconfig
eth0 Link encap:Ethernet HWaddr 00:23:7D:5A:3E:3A
inet addr:10.150.25.50 Bcast:10.150.25.55 Mask:255.255.255.248
inet6 addr: fe80::223:7dff:fe5a:3e3a/64 Scope:Link
UP BROADCAST RUNNING MULTICAST MTU:1500 Metric:1
RX packets:36803 errors:0 dropped:0 overruns:0 frame:0
TX packets:1326 errors:0 dropped:0 overruns:0 carrier:0
collisions:0 txqueuelen:1000
RX bytes:2681663 (2.5 MiB) TX bytes:95379 (93.1 KiB)
Interrupt:169 Memory:f8000000-f8012100
eth1 Link encap:Ethernet HWaddr 00:23:7D:5A:3E:38
inet addr:212.165.108.114 Bcast:212.165.108.119
Mask:255.255.255.248
inet6 addr: fe80::223:7dff:fe5a:3e38/64 Scope:Link
UP BROADCAST RUNNING MULTICAST MTU:1500 Metric:1
RX packets:34715 errors:0 dropped:0 overruns:0 frame:0
TX packets:1262 errors:0 dropped:0 overruns:0 carrier:0
collisions:0 txqueuelen:1000
RX bytes:2495100 (2.3 MiB) TX bytes:86802 (84.7 KiB)
Interrupt:177 Memory:fa000000-fa012100
eth5 Link encap:Ethernet HWaddr 00:24:81:7C:8B:C2
inet addr:10.230.238.74 Bcast:10.230.239.255 Mask:255.255.252.0
inet6 addr: fe80::224:81ff:fe7c:8bc2/64 Scope:Link
UP BROADCAST RUNNING MULTICAST MTU:1500 Metric:1
RX packets:38444 errors:2 dropped:0 overruns:0 frame:1
TX packets:4772 errors:0 dropped:0 overruns:0 carrier:0
collisions:0 txqueuelen:100
RX bytes:2794272 (2.6 MiB) TX bytes:1768728 (1.6 MiB)
Memory:fdee0000-fdf00000
lo Link encap:Local Loopback
inet addr:127.0.0.1 Mask:255.0.0.0
inet6 addr: ::1/128 Scope:Host
UP LOOPBACK RUNNING MTU:16436 Metric:1
RX packets:1215 errors:0 dropped:0 overruns:0 frame:0
TX packets:1215 errors:0 dropped:0 overruns:0 carrier:0
collisions:0 txqueuelen:0
RX bytes:98692 (96.3 KiB) TX bytes:98692 (96.3 KiB)
[root@Server1 network-scripts]# more ifcfg*
::::::::::::::
ifcfg-eth0
::::::::::::::
# Broadcom Corporation NetXtreme II BCM5708 Gigabit Ethernet
DEVICE=eth0
HWADDR=00:23:7D:5A:3E:3A
ONBOOT=yes
IPADDR=10.150.25.50
GATEWAY=10.150.25.49
NETMASK=255.255.255.248
::::::::::::::
ifcfg-eth1
::::::::::::::
# Broadcom Corporation NetXtreme II BCM5708 Gigabit Ethernet
DEVICE=eth1
HWADDR=00:23:7D:5A:3E:38
ONBOOT=yes
IPADDR=212.165.108.114
GATEWAY=212.165.108.113
NETMASK=255.255.255.248
::::::::::::::
ifcfg-eth2
::::::::::::::
# Intel Corporation 82571EB Gigabit Ethernet Controller (Copper)
DEVICE=eth2
HWADDR=00:24:81:7C:8B:C1
ONBOOT=no
::::::::::::::
ifcfg-eth3
::::::::::::::
# Intel Corporation 82571EB Gigabit Ethernet Controller (Copper)
DEVICE=eth3
HWADDR=00:24:81:7C:8B:C0
ONBOOT=no
::::::::::::::
ifcfg-eth4
::::::::::::::
# Intel Corporation 82571EB Gigabit Ethernet Controller (Copper)
DEVICE=eth4
HWADDR=00:24:81:7C:8B:C3
ONBOOT=no
::::::::::::::
ifcfg-eth5
::::::::::::::
# Intel Corporation 82571EB Gigabit Ethernet Controller (Copper)
DEVICE=eth5
HWADDR=00:24:81:7C:8B:C2
ONBOOT=yes
IPADDR=10.230.238.74
NETMASK=255.255.252.0
[root@Server1 network-scripts]# ethtool eth0
Settings for eth0:
Supported ports: [ TP ]
Supported link modes: 10baseT/Half 10baseT/Full
100baseT/Half 100baseT/Full
1000baseT/Full
Supports auto-negotiation: Yes
Advertised link modes: 10baseT/Half 10baseT/Full
100baseT/Half 100baseT/Full
1000baseT/Full
Advertised auto-negotiation: Yes
Speed: 100Mb/s
Duplex: Full
Port: Twisted Pair
PHYAD: 1
Transceiver: internal
Auto-negotiation: on
Supports Wake-on: g
Wake-on: g
Link detected: yes
[root@Server1 network-scripts]# ethtool eth1
Settings for eth1:
Supported ports: [ TP ]
Supported link modes: 10baseT/Half 10baseT/Full
100baseT/Half 100baseT/Full
1000baseT/Full
Supports auto-negotiation: Yes
Advertised link modes: 10baseT/Half 10baseT/Full
100baseT/Half 100baseT/Full
1000baseT/Full
Advertised auto-negotiation: Yes
Speed: 100Mb/s
Duplex: Full
Port: Twisted Pair
PHYAD: 1
Transceiver: internal
Auto-negotiation: on
Supports Wake-on: g
Wake-on: g
Link detected: yes
[root@Server1 network-scripts]# ethtool eth2
Settings for eth2:
Supported ports: [ TP ]
Supported link modes: 10baseT/Half 10baseT/Full
100baseT/Half 100baseT/Full
1000baseT/Full
Supports auto-negotiation: Yes
Advertised link modes: 10baseT/Half 10baseT/Full
100baseT/Half 100baseT/Full
1000baseT/Full
Advertised auto-negotiation: Yes
Speed: 100Mb/s
Duplex: Full
Port: Twisted Pair
PHYAD: 1
Transceiver: internal
Auto-negotiation: on
Supports Wake-on: pumbag
Wake-on: d
Current message level: 0x00000001 (1)
Link detected: yes
[root@Server1 network-scripts]# ethtool eth3
Settings for eth3:
Supported ports: [ TP ]
Supported link modes: 10baseT/Half 10baseT/Full
100baseT/Half 100baseT/Full
1000baseT/Full
Supports auto-negotiation: Yes
Advertised link modes: 10baseT/Half 10baseT/Full
100baseT/Half 100baseT/Full
1000baseT/Full
Advertised auto-negotiation: Yes
Speed: 100Mb/s
Duplex: Full
Port: Twisted Pair
PHYAD: 1
Transceiver: internal
Auto-negotiation: on
Supports Wake-on: d
Wake-on: d
Current message level: 0x00000001 (1)
Link detected: yes
[root@Server1 network-scripts]# ethtool eth4
Settings for eth4:
Supported ports: [ TP ]
Supported link modes: 10baseT/Half 10baseT/Full
100baseT/Half 100baseT/Full
1000baseT/Full
Supports auto-negotiation: Yes
Advertised link modes: 10baseT/Half 10baseT/Full
100baseT/Half 100baseT/Full
1000baseT/Full
Advertised auto-negotiation: Yes
Speed: 100Mb/s
Duplex: Full
Port: Twisted Pair
PHYAD: 1
Transceiver: internal
Auto-negotiation: on
Supports Wake-on: d
Wake-on: d
Current message level: 0x00000001 (1)
Link detected: yes
[root@Server1 network-scripts]# ethtool eth5
Settings for eth5:
Supported ports: [ TP ]
Supported link modes: 10baseT/Half 10baseT/Full
100baseT/Half 100baseT/Full
1000baseT/Full
Supports auto-negotiation: Yes
Advertised link modes: 10baseT/Half 10baseT/Full
100baseT/Half 100baseT/Full
1000baseT/Full
Advertised auto-negotiation: Yes
Speed: 100Mb/s
Duplex: Full
Port: Twisted Pair
PHYAD: 1
Transceiver: internal
Auto-negotiation: on
Supports Wake-on: d
Wake-on: d
Current message level: 0x00000001 (1)
Link detected: yes
[root@Server1 network-scripts]# mii-tool
eth0: negotiated 100baseTx-FD, link ok
eth1: negotiated 100baseTx-FD, link ok
SIOCGMIIREG on eth2 failed: Input/output error
eth2: 10 Mbit, half duplex, no link
SIOCGMIIREG on eth3 failed: Input/output error
eth3: 10 Mbit, half duplex, no link
SIOCGMIIREG on eth4 failed: Input/output error
eth4: 10 Mbit, half duplex, no link
SIOCGMIIREG on eth5 failed: Input/output error
eth5: negotiated 100baseTx-FD, link ok
[root@Server1 network-scripts]# cat /etc/modprobe.conf
alias eth0 bnx2
alias eth1 bnx2
alias eth2 e1000e
alias eth3 e1000e
alias eth4 e1000e
alias eth5 e1000e
alias scsi_hostadapter cciss
alias scsi_hostadapter1 ata_piix
[root@Server1 network-scripts]# modinfo e1000e | head -n 2
filename:
/lib/modules/2.6.18-128.el5/kernel/drivers/net/e1000e/e1000e.ko
version: 0.3.3.3-k4
[root@Server1 network-scripts]# modinfo bnx2 | head -n 2
filename: /lib/modules/2.6.18-128.el5/kernel/drivers/net/bnx2.ko
version: 1.7.9-1
[root@Server1 network-scripts]# lspci | grep -i eth
03:00.0 Ethernet controller: Broadcom Corporation NetXtreme II BCM5708
Gigabit Ethernet (rev 12)
05:00.0 Ethernet controller: Broadcom Corporation NetXtreme II BCM5708
Gigabit Ethernet (rev 12)
10:00.0 Ethernet controller: Intel Corporation 82571EB Gigabit Ethernet
Controller (Copper) (rev 06)
10:00.1 Ethernet controller: Intel Corporation 82571EB Gigabit Ethernet
Controller (Copper) (rev 06)
11:00.0 Ethernet controller: Intel Corporation 82571EB Gigabit Ethernet
Controller (Copper) (rev 06)
11:00.1 Ethernet controller: Intel Corporation 82571EB Gigabit Ethernet
Controller (Copper) (rev 06)
[root@Server1 network-scripts]# cat /proc/interrupts
CPU0 CPU1 CPU2 CPU3
0: 10833296 10815243 10853784 10817369 IO-APIC-edge timer
1: 1 0 0 2 IO-APIC-edge i8042
8: 1 1 1 0 IO-APIC-edge rtc
9: 0 0 0 0 IO-APIC-level acpi
12: 1 0 1 2 IO-APIC-edge i8042
14: 34 35 193797 194324 IO-APIC-edge ide0
169: 18 100610 22 18 IO-APIC-level
ehci_hcd:usb1, uhci_hcd:usb2, eth0
177: 23 26 18 105406 IO-APIC-level
uhci_hcd:usb3, eth1, eth5
185: 151128 101949 1090 1160 IO-APIC-level
uhci_hcd:usb4, cciss0
193: 0 0 0 0 IO-APIC-level
uhci_hcd:usb5
201: 23 25 24 24 IO-APIC-level
uhci_hcd:usb6
NMI: 0 0 0 0
LOC: 43319487 43319511 43319532 43319563
ERR: 0
MIS: 0
[root@Server1 network-scripts]# cat /etc/grub.conf
# grub.conf generated by anaconda
#
# Note that you do not have to rerun grub after making changes to this file
# NOTICE: You have a /boot partition. This means that
# all kernel and initrd paths are relative to /boot/, eg.
# root (hd0,0)
# kernel /vmlinuz-version ro root=/dev/VolGroup00/LogVol00
# initrd /initrd-version.img
#boot=/dev/cciss/c0d0
default=0
timeout=5
splashimage=(hd0,0)/grub/splash.xpm.gz
hiddenmenu
title Red Hat Enterprise Linux Server (2.6.18-128.el5)
root (hd0,0)
kernel /vmlinuz-2.6.18-128.el5 ro root=/dev/VolGroup00/LogVol00
pci=nomsi
initrd /initrd-2.6.18-128.el5.img
_______________________________________________
Please read the documentation before posting - it's available at:
http://www.linuxvirtualserver.org/
LinuxVirtualServer.org mailing list - lvs-users@xxxxxxxxxxxxxxxxxxxxxx
Send requests to lvs-users-request@xxxxxxxxxxxxxxxxxxxxxx
or go to http://lists.graemef.net/mailman/listinfo/lvs-users
|