I've got an interesting issue that has now bit me twice.
My organization has three Microsoft DNS servers (Windows Server 2003 SP1). Normally, they work just as advertised and we have no issues from clients.
Unfortunately, we've experienced a scenario that is impacting our UNIX clients (~200). One DNS server will be in a limbo state. By this I mean that the service is up, it is accepting connections, but not responding with any information. Our Microsoft servers and clients accessing this particular server don't miss a beat. They automtically skip this one and move on to the next DNS server. If they manually lookup a DNS entry from that specific server, they simply timeout after 2 seconds.
The UNIX servers (AIX, Solaris, RedHat, probably others) all hang for about 10 seconds before moving on to the next DNS server.
As this problem is only affecting UNIX clients, our Microsoft DNS admins don't really see it as an issue. Further, they believe that it is a "UNIX" problem and I should deal with it.
The particular DNS server that has the issues requires a reboot to bring the DNS service back online.
You can see my problem. I've got a variety of UNIX flavors that are all experiencing the same symptom. Microsoft clients (all versions?) seem to be working fine.
Has anyone out there seen this before? I'd love to identify a Microsoft patch for the DNS server, but the Microsoft support guys tell me that there isn't one available.
Alas, google hasn't been able to show me the way either.
I could set some timeouts for all of our UNIX clients, but this would be time consuming and I don't believe it addresses the root cause of the problem.
My organization has three Microsoft DNS servers (Windows Server 2003 SP1). Normally, they work just as advertised and we have no issues from clients.
Unfortunately, we've experienced a scenario that is impacting our UNIX clients (~200). One DNS server will be in a limbo state. By this I mean that the service is up, it is accepting connections, but not responding with any information. Our Microsoft servers and clients accessing this particular server don't miss a beat. They automtically skip this one and move on to the next DNS server. If they manually lookup a DNS entry from that specific server, they simply timeout after 2 seconds.
The UNIX servers (AIX, Solaris, RedHat, probably others) all hang for about 10 seconds before moving on to the next DNS server.
As this problem is only affecting UNIX clients, our Microsoft DNS admins don't really see it as an issue. Further, they believe that it is a "UNIX" problem and I should deal with it.
The particular DNS server that has the issues requires a reboot to bring the DNS service back online.
You can see my problem. I've got a variety of UNIX flavors that are all experiencing the same symptom. Microsoft clients (all versions?) seem to be working fine.
Has anyone out there seen this before? I'd love to identify a Microsoft patch for the DNS server, but the Microsoft support guys tell me that there isn't one available.
Alas, google hasn't been able to show me the way either.
I could set some timeouts for all of our UNIX clients, but this would be time consuming and I don't believe it addresses the root cause of the problem.