The resolver walk: referrals, record types, and glue

How a recursive resolver follows referrals from root to authoritative, what the DNS message sections mean, and why missing glue breaks everything.

A resolver asking for cdn.example.co.uk does not get an answer from the root server. It gets a referral — “ask the .uk TLD.” Then another referral — “ask example.co.uk’s authoritative server.” Only on the third query does it get an actual IP address. Understanding what travels in those referral responses is what separates a developer who can debug DNS from one who just runs dig and hopes.

Iterative vs recursive queries

A client's original query to the resolver carries RD=1 (Recursion Desired): it asks the resolver to do the full walk. The resolver's own queries to root, TLD, and authoritative servers are iterative, with the RD bit cleared. Those servers return only what they know, plus a referral; they never chase the tree for the caller.

The distinction matters in production because a server that blurs the line becomes a free amplifier for DDoS attackers. A server that honors RD=1 from arbitrary external clients is an open resolver and can be abused for DNS reflection: an attacker sends queries with a spoofed source address, and the server sends its much larger responses to the victim. Production authoritatives (Route 53, Cloudflare DNS) never recurse for external clients.
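To make the RD bit concrete, here is a minimal sketch that builds a raw DNS query by hand and toggles Recursion Desired. It assumes the standard RFC 1035 header layout; the function names `build_query` and `has_rd` and the query ID are illustrative, not part of any library.

```python
import struct

RD_FLAG = 0x0100  # Recursion Desired bit inside the 16-bit header flags field


def build_query(qname: str, qtype: int = 1, recursion_desired: bool = True,
                query_id: int = 0x1234) -> bytes:
    """Build a minimal DNS query: 12-byte header plus one question."""
    flags = RD_FLAG if recursion_desired else 0
    # Header fields: ID, flags, QDCOUNT=1, ANCOUNT=0, NSCOUNT=0, ARCOUNT=0
    header = struct.pack("!HHHHHH", query_id, flags, 1, 0, 0, 0)
    # QNAME is a sequence of length-prefixed labels ending with a zero byte
    labels = qname.rstrip(".").split(".")
    encoded = b"".join(bytes([len(l)]) + l.encode("ascii") for l in labels) + b"\x00"
    question = encoded + struct.pack("!HH", qtype, 1)  # QTYPE, QCLASS=IN
    return header + question


def has_rd(message: bytes) -> bool:
    """Report whether the RD bit is set in a DNS message header."""
    flags = struct.unpack("!H", message[2:4])[0]
    return bool(flags & RD_FLAG)


# What a stub sends to its resolver (RD=1) versus what the resolver
# sends to root, TLD, and authoritative servers (RD=0).
stub_query = build_query("cdn.example.co.uk", recursion_desired=True)
iterative_query = build_query("cdn.example.co.uk", recursion_desired=False)
```

The two messages differ in a single bit, which is why an authoritative server can cheaply refuse recursion: it only has to inspect the flags field.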

DNS message sections

A DNS message has four sections:

Section	Purpose
Question	The query name, type (A, MX, …), class (IN)
Answer	Records that directly answer the question
Authority	NS records pointing to the next-hop nameserver
Additional	Pre-fetched A/AAAA records for names in Authority

A referral leaves Answer empty and fills Authority + Additional. The AA (Authoritative Answer) bit is set only when the responding server is authoritative for the zone that holds the name, so a referral from a TLD server has AA=0.

Same DNS message format, two roles: a referral fills Authority + Additional with AA=0 to point onward; only the zone owner fills Answer with AA=1.
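The referral-versus-answer distinction can be read straight from the 12-byte header. The sketch below is a simplified classifier, assuming the RFC 1035 header layout; the function name `classify` and the "other" bucket (which lumps together cached answers, NXDOMAIN, and NODATA) are choices made for this example.

```python
import struct

QR_FLAG = 0x8000  # set on responses, clear on queries
AA_FLAG = 0x0400  # Authoritative Answer


def classify(message: bytes) -> str:
    """Classify a DNS message using only its header counts and flags."""
    _id, flags, _qd, an, ns, _ar = struct.unpack("!HHHHHH", message[:12])
    if not flags & QR_FLAG:
        return "query"
    if an > 0 and flags & AA_FLAG:
        return "authoritative answer"
    if an == 0 and ns > 0 and not flags & AA_FLAG:
        return "referral"
    # Cached (non-authoritative) answers, NXDOMAIN, NODATA, errors, ...
    return "other"


# Synthetic headers: ID, flags, QDCOUNT, ANCOUNT, NSCOUNT, ARCOUNT
referral = struct.pack("!HHHHHH", 1, QR_FLAG, 1, 0, 1, 1)
answer = struct.pack("!HHHHHH", 1, QR_FLAG | AA_FLAG, 1, 1, 0, 0)
```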

The referral chain for `cdn.example.co.uk`

1 Resolver → root: asks for cdn.example.co.uk. Root has no answer. It returns Authority: ns.nic.uk (NS record for .uk), plus Additional: ns.nic.uk A 193.0.0.1 (glue). No Answer section.
2 Resolver → .uk TLD: asks same QNAME. TLD returns Authority: ns1.example.co.uk (NS for example.co.uk), plus Additional: ns1.example.co.uk A 198.51.100.4 (glue). Still no Answer.
3 Resolver → authoritative: asks same QNAME. Authoritative returns Answer: cdn.example.co.uk A 203.0.113.10 TTL=300. AA bit set.

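Together these three hops mean the resolver never has to guess the next nameserver's address in this walk: each referral hands it over explicitly as glue. Without step 1's glue, step 2 is a dead end, because ns.nic.uk sits inside the very zone the resolver is trying to reach. When a referral names an out-of-bailiwick nameserver and carries no glue, the resolver has to resolve that nameserver's name separately before it can continue.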
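The three hops above can be modelled without touching the network. This is a toy sketch, not a real resolver: `SERVERS` is a hypothetical in-memory table of canned responses keyed by server address, reusing the names and addresses from the walk above, and the root address is only a stand-in starting point.

```python
from dataclasses import dataclass, field


@dataclass
class Response:
    answer: list = field(default_factory=list)      # (name, type, value)
    authority: list = field(default_factory=list)   # NS records
    additional: list = field(default_factory=list)  # glue A records
    aa: bool = False


ROOT_ADDR = "198.41.0.4"  # stand-in root server address for this sketch

# Canned responses: each "server" always returns the same message.
SERVERS = {
    ROOT_ADDR: Response(
        authority=[("uk", "NS", "ns.nic.uk")],
        additional=[("ns.nic.uk", "A", "193.0.0.1")]),
    "193.0.0.1": Response(
        authority=[("example.co.uk", "NS", "ns1.example.co.uk")],
        additional=[("ns1.example.co.uk", "A", "198.51.100.4")]),
    "198.51.100.4": Response(
        answer=[("cdn.example.co.uk", "A", "203.0.113.10")], aa=True),
}


def resolve(qname: str, start: str = ROOT_ADDR, max_hops: int = 8):
    """Follow referrals until some server answers; return (address, path)."""
    server, path = start, []
    for _ in range(max_hops):
        path.append(server)
        resp = SERVERS[server]  # stands in for sending the query over UDP
        for name, rtype, value in resp.answer:
            if name == qname and rtype == "A":
                return value, path
        # No answer: look for a referral whose NS name has glue attached.
        glue = {name: value for name, rtype, value in resp.additional if rtype == "A"}
        next_server = next(
            (glue[ns] for _zone, _t, ns in resp.authority if ns in glue), None)
        if next_server is None:
            raise LookupError(f"no usable glue in referral from {server}")
        server = next_server
    raise LookupError("too many referrals")
```

Calling `resolve("cdn.example.co.uk")` visits the three servers in order. Emptying the root entry's `additional` list makes the walk fail at the first hop, which is the dead end described above.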

Trace it

Trace a cold DNS lookup of cdn.example.co.uk from a clean cache.

1 Resolver has nothing cached. First step?
2 Root responds. What does the resolver receive?
3 Resolver queries .uk TLD. What does it ask?
4 TLD responds. What now?
5 Resolver queries example.co.uk authoritative. What does it receive?
6 Next identical query within TTL?

Glue records and the circular dependency

When a zone delegates to a nameserver inside its own zone (the NS for example.com is ns1.example.com), the parent zone must include the A record for ns1.example.com as glue in the referral. Without glue, the resolver cannot find the nameserver without first resolving example.com — a chicken-and-egg loop.

Missing glue causes intermittent SERVFAIL or referral loops. Out-of-bailiwick NS records (e.g., ns1.somednsprovider.org) need no glue because the resolver can ask a different zone for their IP.
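A delegation can be checked for this problem mechanically. The sketch below is a simplified check, assuming names are compared case-insensitively and that `glue` is a dict keyed by lowercase nameserver name without a trailing dot; both helper names are made up for this example.

```python
def is_in_bailiwick(ns_name: str, zone: str) -> bool:
    """True if the nameserver's name lies at or below the delegated zone."""
    ns = ns_name.rstrip(".").lower()
    z = zone.rstrip(".").lower()
    return ns == z or ns.endswith("." + z)


def missing_glue(zone: str, ns_names: list, glue: dict) -> list:
    """Return in-bailiwick nameservers that have no glue address."""
    return [
        ns for ns in ns_names
        if is_in_bailiwick(ns, zone) and ns.rstrip(".").lower() not in glue
    ]


delegation = ["ns1.example.com", "ns1.somednsprovider.org"]
problems = missing_glue("example.com", delegation, glue={})
```

Only ns1.example.com is reported: the out-of-bailiwick name can be resolved through a different branch of the tree, so its absence from the glue is harmless.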

Common DNS record types

A: IPv4 address
AAAA: IPv6 address
CNAME: alias → another name (never an IP)
MX: mail server hostname + priority
TXT: free text — SPF, DKIM, verification
NS: delegation — nameserver for the zone
SOA: zone authority: serial, refresh/retry/expire timers, negative-caching TTL
CAA: which CAs may issue certs for this domain
HTTPS/SVCB: advertises ALPN + ECH keys (RFC 9460)
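On the wire, each of these types is a 16-bit number in the QTYPE or TYPE field. The enum below lists the assigned codes for the types above; the class name `RRType` and the helper `type_name` are illustrative, and the `TYPEnnn` fallback follows the generic naming convention for unknown types.

```python
from enum import IntEnum


class RRType(IntEnum):
    """Numeric codes for the record types listed above."""
    A = 1
    NS = 2
    CNAME = 5
    SOA = 6
    MX = 15
    TXT = 16
    AAAA = 28
    SVCB = 64
    HTTPS = 65
    CAA = 257


def type_name(code: int) -> str:
    """Map a numeric type code to its mnemonic, or TYPEnnn if not listed here."""
    try:
        return RRType(code).name
    except ValueError:
        return f"TYPE{code}"
```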

CNAME and the apex rule

A CNAME record points a name to another name: www CNAME example.com. The resolver then looks up the target. CNAME is forbidden at the zone apex (the bare domain, like example.com) per RFC 1034: a name that has a CNAME may not hold any other record data, yet the apex must carry SOA and NS records. Provider-specific ALIAS/ANAME records and the newer HTTPS record (RFC 9460) work around this.
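The coexistence rule is easy to lint for. This is a minimal sketch, assuming records arrive as (name, type, value) tuples; the function name `cname_conflicts` is hypothetical, and the target cdn.provider.example is a placeholder. RRSIG and NSEC are exempted because DNSSEC permits them alongside a CNAME.

```python
from collections import defaultdict

# Record types that may legitimately share a name with a CNAME (DNSSEC).
ALLOWED_WITH_CNAME = {"CNAME", "RRSIG", "NSEC"}


def cname_conflicts(records: list) -> list:
    """Return names that hold a CNAME together with other record data."""
    types_by_name = defaultdict(set)
    for name, rtype, _value in records:
        types_by_name[name.rstrip(".").lower()].add(rtype.upper())
    return sorted(
        name for name, types in types_by_name.items()
        if "CNAME" in types and types - ALLOWED_WITH_CNAME
    )


zone = [
    ("example.com", "SOA", "ns1.example.com. hostmaster.example.com. 1 3600 600 86400 300"),
    ("example.com", "NS", "ns1.example.com"),
    ("example.com", "CNAME", "cdn.provider.example"),  # illegal at the apex
    ("www.example.com", "CNAME", "example.com"),       # fine: nothing else at www
]
conflicts = cname_conflicts(zone)
```

The apex is flagged because SOA and NS already live there; www is not, because the CNAME is the only data at that name.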

Quiz

When a recursive resolver gets a referral from a TLD server, where does next-hop information come from?

Quiz

Why is CNAME forbidden at the zone apex (example.com) per RFC 1034?

UDP, TCP, and EDNS0

DNS defaults to UDP port 53. If a response does not fit in the UDP payload size the client can accept (512 bytes without EDNS0), the server sets the TC (truncation) flag and the resolver retries over TCP port 53. Zone transfers (AXFR, and in practice IXFR) also run over TCP. A firewall that blocks TCP/53 while allowing UDP/53 silently breaks zone transfers and any response too large for UDP, which often means DNSSEC-signed ones.

EDNS0 (RFC 6891) adds an OPT pseudo-record that advertises a larger UDP payload size (typically 1232 or 4096 bytes) and carries the DNSSEC OK (DO) bit plus extension options such as ECS (client subnet) and DNS Cookies. Every modern resolver advertises EDNS0; without it, UDP responses are capped at 512 bytes and the resolver cannot set DO to request DNSSEC records at all.
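Both mechanisms are small on the wire. The sketch below builds an OPT pseudo-record with no options and checks the TC flag in a response header. It assumes the RFC 6891 field layout (the CLASS field carries the UDP payload size and the TTL field carries the extended RCODE, version, and DO bit); the function names are illustrative.

```python
import struct

TC_FLAG = 0x0200   # truncation bit in the header flags
OPT_TYPE = 41      # type code of the OPT pseudo-record
DO_BIT = 0x8000    # DNSSEC OK, top bit of the 16-bit EDNS flags


def build_opt(udp_payload_size: int = 1232, dnssec_ok: bool = False) -> bytes:
    """Build an OPT pseudo-record with empty RDATA for the Additional section."""
    edns_flags = DO_BIT if dnssec_ok else 0
    # NAME=root (single zero byte), TYPE=41, CLASS=payload size,
    # then extended RCODE=0, version=0, flags, RDLENGTH=0
    return b"\x00" + struct.pack(
        "!HHBBHH", OPT_TYPE, udp_payload_size, 0, 0, edns_flags, 0)


def should_retry_over_tcp(response: bytes) -> bool:
    """True if the server marked the UDP response as truncated."""
    flags = struct.unpack("!H", response[2:4])[0]
    return bool(flags & TC_FLAG)


opt = build_opt(1232, dnssec_ok=True)
```

A query that appends this record must also set ARCOUNT to 1 in its header, since OPT travels in the Additional section.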

Order the steps

Order the steps when a client opens https://shop.example.com from a clean cache:

1 Browser asks OS resolver for shop.example.com
2 OS resolver forwards to configured upstream (ISP or 1.1.1.1)
3 Resolver walks root → .com → example.com authoritative
4 Resolver returns A record (IP) to browser
5 Browser performs TCP handshake to that IP
6 Browser performs TLS handshake on top of TCP
7 Browser sends HTTPS request and receives the page

Why this works

Stub resolver vs full recursive resolver. Your operating system runs a stub resolver: a thin client that forwards queries to a configured upstream and caches briefly. Linux commonly uses systemd-resolved or nscd; macOS uses mDNSResponder; Windows uses the DNS Client service. The stub does almost no work itself. The full recursive resolver, the one that walks root→TLD→authoritative, lives upstream (your router, ISP, or a public resolver like 1.1.1.1). Browsers can bypass the OS stub entirely: Chrome's built-in async DNS client and Firefox's TRR (Trusted Recursive Resolver) can query a DoH server directly. Flushing the OS stub cache (sudo resolvectl flush-caches on current systemd releases, sudo systemd-resolve --flush-caches on older ones) does not flush the upstream resolver's cache.
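The layering of caches can be illustrated with a toy model. This is a sketch under simplifying assumptions: `TtlCache` is a hypothetical class, time is passed in explicitly as `now` rather than read from a clock, and the address for shop.example.com is a documentation placeholder.

```python
class TtlCache:
    """A minimal TTL cache standing in for a stub or recursive resolver cache."""

    def __init__(self):
        self._entries = {}

    def put(self, name: str, value: str, ttl: int, now: int) -> None:
        self._entries[name] = (value, now + ttl)

    def get(self, name: str, now: int):
        entry = self._entries.get(name)
        if entry is None or now >= entry[1]:
            return None  # never cached, or TTL has expired
        return entry[0]

    def flush(self) -> None:
        self._entries.clear()


stub = TtlCache()      # the OS stub resolver's cache
upstream = TtlCache()  # the recursive resolver's cache

# Both layers cache the same answer at t=0 with a 300-second TTL.
upstream.put("shop.example.com", "203.0.113.10", ttl=300, now=0)
stub.put("shop.example.com", "203.0.113.10", ttl=300, now=0)

# Flushing the stub only empties the local layer.
stub.flush()
after_flush_stub = stub.get("shop.example.com", now=10)
after_flush_upstream = upstream.get("shop.example.com", now=10)
```

After the flush, the stub misses and asks upstream, which still returns the cached record until its own TTL runs out. That is why a local flush often appears to change nothing.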

Each referral carries NS records in the Authority section and glue A/AAAA in Additional — pointing the resolver one hop deeper without a circular dependency. Only the authoritative reply has a non-empty Answer and the AA bit set.

Recall before you leave

01 What is the operational difference between an authoritative server and a recursive resolver?
02 Why must glue records exist for in-bailiwick nameservers?
03 Why does DNS fall back from UDP to TCP for some queries?

Recap

A recursive resolver walks the DNS tree iteratively, collecting referrals at each level. Each referral carries NS records in the Authority section plus glue A/AAAA records in Additional; together they break the circular dependency for in-bailiwick nameservers. The AA (Authoritative Answer) bit is set only when the responding server is authoritative for the zone. DNS record types go far beyond A records: CNAME aliases names, MX routes mail, TXT carries SPF/DKIM, NS delegates zones, SOA defines zone authority. CNAME is forbidden at the zone apex because it cannot coexist with the SOA and NS records that must live there. DNS defaults to UDP; responses too large for UDP (often DNSSEC-signed ones) fall back to TCP, and zone transfers use TCP from the start. EDNS0 extends the UDP payload size and carries the DNSSEC OK bit. When you see a SERVFAIL or a referral loop in dig output, check the glue early: a missing in-bailiwick A record in the Additional section is a common silent culprit.
