I am attempting to use NLB on VMware Server. My VMs are Windows 2003 Std SP1. VMware was kind enough to include a section on this in their manual. That was helpful: I have used NLB in the past, but I would not have known I needed to use multicast mode instead of unicast.
Anyway, I have VMware Server running on a physical server called isweb2. The VM on isweb2 is isweb4. I wanted to test VMware Server before putting other servers up, so I NLB'ed isweb4 (the VM) with a physical server, isweb1. I used to have isweb1 and isweb2 NLB'ed, but I removed isweb2 from the cluster and added isweb4.
I have two NICs in isweb1 and two in isweb2: one used for NLB and the other for external traffic. In the VM setup on isweb2 I set up two separate bridges, one for each NIC. I use the NICs in the same manner on isweb4: the isweb2 external NIC is the isweb4 external NIC, and the one that used to be for NLB on isweb2 is used for NLB on isweb4.
Today a program I use to monitor the servers told me it could not ping isweb4. I found these entries in the System event log:
(from isweb1)
Event Type: Information
Event Source: WLBS
Event Category: None
Event ID: 69
Date: 8/25/2006
Time: 5:39:22 AM
User: N/A
Computer: ISWEB1
Description:
NLB Cluster 10.3.1.97 : Initiating convergence on host 1. Reason: Host 2 is leaving the cluster.
(from isweb4)
Event Type: Information
Event Source: WLBS
Event Category: None
Event ID: 65
Date: 8/25/2006
Time: 5:39:18 AM
User: N/A
Computer: ISWEB4
Description:
NLB Cluster 10.3.1.97 : Initiating convergence on host 2. Reason: Host 1 is converging for an unknown reason.
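To line the two entries up by time, a minimal sketch like the one below could be used. It assumes the header fields are pasted in as plain "Key: value" text exactly as shown above, and that the clocks on both machines are in sync (I have not verified that). The names `parse_event` and `event_time` are made up for illustration and are not part of any NLB or Windows tool.

```python
from datetime import datetime

# Header fields copied from the two event log entries above.
ISWEB1_EVENT = """
Event Source: WLBS
Event ID: 69
Date: 8/25/2006
Time: 5:39:22 AM
Computer: ISWEB1
"""

ISWEB4_EVENT = """
Event Source: WLBS
Event ID: 65
Date: 8/25/2006
Time: 5:39:18 AM
Computer: ISWEB4
"""


def parse_event(text):
    """Turn 'Key: value' lines into a dict (hypothetical helper)."""
    fields = {}
    for line in text.strip().splitlines():
        # Split on the first colon only, so '5:39:22 AM' stays intact.
        key, sep, value = line.partition(":")
        if sep:
            fields[key.strip()] = value.strip()
    return fields


def event_time(fields):
    """Combine the Date and Time fields into one datetime (hypothetical helper)."""
    stamp = fields["Date"] + " " + fields["Time"]
    return datetime.strptime(stamp, "%m/%d/%Y %I:%M:%S %p")


# Order the events by timestamp; this is only meaningful if both clocks agree.
events = sorted(
    (parse_event(ISWEB1_EVENT), parse_event(ISWEB4_EVENT)),
    key=event_time,
)
gap = event_time(events[-1]) - event_time(events[0])

for fields in events:
    print(event_time(fields), fields["Computer"], "Event ID", fields["Event ID"])
print("Gap between first and last event:", gap)
```

Going by the timestamps alone, isweb4 logged its convergence event (ID 65) about 4 seconds before isweb1 logged host 2 leaving (ID 69).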
Not sure what to look for -- are there different NIC drivers that can be used? Any other settings I should check?