19.1.3. Discussion
CGI is just a protocol, a formal agreement between a web server and a
separate program. The server encodes the client's form input data,
and the CGI program decodes the form and generates output. The
protocol says nothing regarding which language the program must be
written in; programs and scripts that obey the CGI protocol have been
written in C, shell, Rexx, C++, VMS DCL, Smalltalk, Tcl, Python, and
of course Perl.
The full CGI specification lays out which environment variables hold
which data (such as form input parameters) and how it's all encoded.
In theory, it should be easy to follow the protocol to decode the
input, but in practice, it is surprisingly tricky to get right.
That's why we strongly recommend using the CGI
module. The hard work of handling the CGI requirements correctly and
conveniently has already been done, freeing you to write the core of
your program without getting bogged down in network protocols.
With few exceptions, mainly related to file permissions and highly
interactive work, CGI scripts can do nearly anything other programs
can do. They can send results back in many formats: plain text, HTML
documents, XML files, sound files, pictures, or anything else
specified in the HTTP header. Besides producing plain text or HTML
text, they can also redirect the client browser to another location,
set server cookies, request authentication, and give errors.
That's all there is to accessing the user's input. Do with it
whatever you please, then generate properly formatted output. This is
nearly as easy. Remember that unlike regular programs, a CGI script's
output must be formatted in a particular way: it must first emit a
set of headers followed by a blank line before any normal output.
As shown in the Solution, the CGI module helps with output as well as
input. The module provides functions for generating HTTP headers and
HTML code. The header function builds the header
for you. By default, it produces headers for a
text/html document, but you can change the
Content-Type and supply other optional header parameters as well:
print header( -TYPE => 'text/plain',
-EXPIRES => '+3d' );
CGI.pm can also be used to generate HTML. It may seem trivial, but
this is where the CGI module shines: the creation of dynamic forms,
especially stateful ones such as shopping carts. The CGI module even
has functions for generating forms and tables.
When printing form widgets, the characters &,
<, >, and " in HTML output are automatically
replaced with their entity equivalents. This is not the case with
arbitrary user output. That's why the Solution imports and makes use
of the escapeHTML function—if the user types
any of those special characters, they won't cause formatting errors
in the HTML.
For a full list of functions and their calling conventions, see
CGI.pm's documentation.