Cache Communication Protocols (Building Internet Firewalls, 2nd Edition)

15.5. Cache Communication Protocols

When we discussed proxying and HTTP, we also discussed caching, which is one of the primary uses of web proxies. Caching is very important as a way of speeding up transfers and reducing the amount of data transferred across crowded links. Once cache servers are set up, the next logical step is to use multiple cache servers and have them coordinate operations. A lot of active development is going on, and it's not at all clear what protocol is going to win out in the long run.

15.5.1. Internet Cache Protocol (ICP)

ICP is the oldest of the cache management protocols in current use and is supported by the largest number of caches, including Netscape Proxy, Harvest, and Squid. The principle behind ICP is that cache servers operate independently, but when a cache server gets a request for a document that it does not have cached, it asks other cache servers for the document, and retrieves the document from its source only if no other cache server has the document. ICP has a number of drawbacks; it requires a considerable amount of communication between caches, it slows down document retrieval, it provides no security or authentication, and it searches the cache based only on URL, not on document header information, which may cause it to return incorrect document versions. On the other hand, it has the noticeable advantage of being both standardized (it is documented in IETF RFCs 2186 and 2187) and in widespread use.

15.5.1.1. Packet filtering characteristics of ICP

ICP normally uses UDP; the port number is configurable but defaults to 3130. ICP can also be run over TCP, once again at any port. Caches exchange documents via HTTP. Once again, the port used for HTTP is configurable, but it defaults to 3128.

Direction

Source Addr.

Dest. Addr.

Protocol

Source Port

Dest. Port

ACK Set

Notes

Ext

Int

UDP

>1023

3130[44]

[45]

ICP request or response, external cache to internal cache

Out

Int

Ext

UDP

3130[44]

>1023

[45]

ICP request or response, internal cache to external cache

Ext

Int

TCP

>1023

3128[46]

[47]

HTTP request, external cache to internal cache

Out

Int

Ext

TCP

3128[46]

>1023

Yes

HTTP response, internal cache to external cache

Out

Int

Ext

TCP

>1023

3128[46]

[47]

HTTP request, internal cache to external cache

Ext

Int

TCP

3128[46]

>1023

Yes

HTTP response, external cache to internal cache

[44]3130 is the standard port number for ICP, but some servers run on different port numbers.

[45]UDP has no ACK equivalent.

[46]3128 is the standard port number for intercache HTTP servers, but some servers run on different port numbers.

[47]ACK is not set on the first packet of this type (establishing connection) but will be set on the rest.

15.5.1.2. Proxying characteristics of ICP

ICP, like SMTP and NNTP, is a self-proxying protocol, one that allows for queries to be passed from server to server. In general, if you are configuring ICP in a firewall environment, you will use this facility and set all internal cache servers to peer with a cache server that's part of the firewall and serves as a proxy.

Since ICP is a straightforward TCP-based protocol, it would also be possible to proxy it through a proxy system like SOCKS; the only difficulty is that you would end up with a one-way relationship, since the external cache would not be able to send queries to the internal cache. This would slow down performance without providing any more security than doing self-proxying, and no current implementations support it.