2.6.2. Recursion
You may have noticed a big difference in
the amount of work done by the name servers in the previous example.
Four of the name servers simply returned the best answer they already
had -- mostly referrals to other name servers -- to the queries
they received. They didn't have to send their own queries to
find the data requested. But one name server -- the one queried by
the resolver -- had to follow successive referrals until it
received an answer.
Why couldn't the local name server simply have referred the
resolver to another name server? Because a stub resolver
wouldn't have had the intelligence to follow a referral. And
how did the name server know not to answer with a referral? Because
the resolver issued a recursive query.
Queries come in two flavors,
recursive and
iterative
(or
nonrecursive).
Recursive queries place most of the burden of resolution on a single
name server. Recursion, or recursive
resolution, is just a name for the resolution process used
by a name server when it receives recursive queries. As with
recursive algorithms in programming, the name server repeats the same
basic process (querying a remote name server and following any
referrals) until it receives an answer.
Iteration,
or iterative
resolution, described in the next section,
refers to the resolution process used by a name server when it
receives iterative queries.
In recursion, a resolver sends a recursive query to a name server for
information about a particular domain name. The queried name server
is then obliged to respond with the requested data or with an error
stating that data of the requested type doesn't exist or that
the domain name specified doesn't exist.[14] The name server can't just refer
the querier to a different name server because the query was
recursive.
If the queried name server isn't authoritative for the data
requested, it will have to query other name servers to find the
answer. It could send recursive queries to those name servers,
thereby obliging them to find the answer and return it (and passing
the buck). Or it could send iterative queries and possibly be
referred to other name servers "closer" to the domain
name it's looking for. Current implementations are polite and
do the latter, following the referrals until an answer is
found.
[15]
A name server that receives a recursive query that it can't
answer itself will query the "closest known" name
servers. The
closest known name servers are the
servers authoritative for the zone closest to the domain name being
looked up. For example, if the name server receives a recursive query
for the address of the domain name
girigiri.gbrmpa.gov.au, it will first check
whether it knows which name servers are authoritative for
girigiri.gbrmpa.gov.au. If it does, it will send
the query to one of them. If not, it will check whether it knows the
name servers for gbrmpa.gov.au, and after that
gov.au, and then au. The
default, where the check is guaranteed to stop, is the root zone,
since every name server knows the domain names and addresses of the
root name servers.
Using the closest known name servers ensures that the resolution
process is as short as possible. A berkeley.edu
name server receiving a recursive query for the address of
waxwing.ce.berkeley.edu shouldn't have to
consult the root name servers; it can simply follow delegation
information directly to the ce.berkeley.edu name
servers. Likewise, a name server that has just looked up a domain
name in ce.berkeley.edu shouldn't have to
start resolution at the roots to look up another
ce.berkeley.edu (or
berkeley.edu) domain name; we'll show how
this works in Section 2.7, "Caching".
The name server that receives the recursive query always sends the
same query that the resolver sends it, for example, for the address
of waxwing.ce.berkeley.edu. It never sends
explicit queries for the name servers for
ce.berkeley.edu or
berkeley.edu, though this information is also
stored in the namespace. Sending explicit queries could cause
problems: there may be no ce.berkeley.edu name
servers (that is, ce.berkeley.edu may be part of
the berkeley.edu zone). Also, it's always
possible that an edu or
berkeley.edu name server already knows
waxwing.ce.berkeley.edu's address. An
explicit query for the berkeley.edu or
ce.berkeley.edu name servers would miss this
information.