Puzzling DHCP Issue - Assistance Requested

Posted by Jet_mech91@reddit | sysadmin | View on Reddit | 27 comments

I work as a sysadmin for a moderately sized environment (\~1000 systems). We have several DHCP scopes in our domain, with one being a build VLAN for imaging new systems and the rest being various user scopes. Our Domain Controllers double as our DHCP and DNS servers for the entire domain.

Normally we image workstations on the build VLAN, from which they join our domain and get drivers/software/updates through the task sequence and MECM, before we move them over to our primary user VLAN (802.1x enabled) to receive a DHCP lease. This has historically worked fine for years, but as of last week weve suddenly found that newly imaged systems are no longer receiving DHCP leases on the primary user VLAN.

We've confirmed that when connected, we can track the device MAC across the network devices up to the switch bordering our DHCP server, so the requests seem to be getting out there. Our two load balanced DHCP servers are showing hits for the workstation MAC addresses for lease requests on the build VLAN, but zero hits at all for the primary user VLAN after switching.

DHCP for the primary user VLAN works for all existing systems in the environment, even after I released the lease on a test system, ensured it was removed from DHPC and DNS, and left it powered down until it fell off the switch MAC Address Tables. Expanding on this, newly imaged devices that are given a static IP on the primary user VLAN are subsequently able to pull new DHCP leases when the static IP is deconfigured.

The only error message of note I have found is a DHCP event viewer log that shows error 0x79, however based on my reading that suggests either our scopes are full (theyre not), there is an IP conflict (not sure how this would be relevant for a new device on DHCP), or our network settings are "misconfigured" (dhcp scope settings look correct and do not appear to have changed relative to before/after the issue started. The only recent change to our knowledge is a GPO update that enabled Windows Defender Firewall on our servers with domain policy traffic set to Allow All Inbound/Outbound (Public and Private are set to block inbound default). All other administrative entities (network, forest level) deny making any changes on their end.

Due to separation of duties and red tape from security policy, I am not currently approved to utilize packet sniffing software to try and trace the DHCP traffic.

Any ideas or thoughts as to why only one out of 5 DHCP scopes have decided to stop leasing brand new devices are greatly appreciated.