Variable-length (Text) Databases (Learning Perl, 3rd Edition)

He had bought a large map representing the sea,\n Without the least vestige of land:\nAnd the crew were much pleased when they found it to be\n A map they could all understand.\n\n"What's the good of Mercator's North Poles and Equators,\n Tropics, Zones, and Meridian Lines?"\nSo the Bellman would cry: and the crew would reply\n "They are merely conventional signs!\n\n"Other maps are such shapes, with their islands and capes!\n But we've got our brave Captain to thank:"\n(So the crew would protest) "that he's bought us the best-\n A perfect and absolute blank!"\n\n

#!/usr/bin/perl -w

use strict;

chomp(my $date = `date`);
@ARGV = glob "fred*.dat" or die "no files found";
$^I = ".bak";

while (<>) {
  s/^Author:.*/Author: Randal L. Schwartz/;
  s/^Phone:.*\n//;
  s/^Date:.*/Date: $date/;
  print;
}

16.4.1. In-place Editing from the Command Line

A program like the example from the previous section is fairly easy to write. But Larry decided it wasn't easy enough.

Imagine that you need to update hundreds of files that have the misspelling Randall instead of the one-l name Randal. You could write a program like the one in the previous section. Or you could do it all with a one-line program, right on the command line:

$ perl -p -i.bak -w -e 's/Randall/Randal/g' fred*.dat

Perl has a whole slew of command-line options that can be used to build a complete program in a few keystrokes.[359] Let's see what these few do.

[359] See the perlrun manpage for the complete list.

Starting the command with perl does something like putting #!/usr/bin/perl at the top of a file does: it says to use the program perl to process what follows.

The -p option tells Perl to write a program for you. It's not much of a program, though; it looks something like this:[360]

[360] Actually, the print occurs in a continue block. See the perlsyn and perlrun manpages for more information.

while (<>) {
  print;
}

If you want even less, you could use -n instead; that leaves out the print statement. (Fans of awk will recognize -p and -n.) Again, it's not much of a program, but it's pretty good for the price of a few keystrokes.
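As a rough illustration of the difference, the sketch below spells out the kind of loop that -n would build around a chunk of code such as print if /^Author:/ (a hypothetical one-liner, not one from the text). The scratch file, its contents, and the @shown array are assumptions made only so the example can run on its own.

```perl
use strict;
use warnings;
use File::Temp qw(tempfile);

# Build a small scratch file (hypothetical contents).
my ($fh, $file) = tempfile(UNLINK => 1);
print $fh "Author: Randal L. Schwartz\n",
          "Phone: 555-0100\n",
          "Date: today\n";
close $fh;

my @shown;    # copies of the printed lines, kept only for checking
{
    local @ARGV = ($file);

    # Roughly the loop that -n supplies: there is no automatic print,
    # so only lines the code chooses to print will appear.
    while (<>) {
        next unless /^Author:/;
        print;
        push @shown, $_;
    }
}
```

With -p in place of -n, every line would be printed after the code ran, whether or not it matched.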

The next option is -i.bak, which you might have guessed sets $^I to ".bak" before the program starts. If you don't want a backup file, you can use -i alone, with no extension.
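To see the backup behavior without touching real files, here is a minimal sketch that sets $^I inside a program and edits one scratch file. The directory, the file name fred01.dat, and the slurp helper are hypothetical, chosen only for illustration.

```perl
use strict;
use warnings;
use File::Temp qw(tempdir);

# Work in a throwaway directory so no real files are touched.
my $dir  = tempdir(CLEANUP => 1);
my $file = "$dir/fred01.dat";    # hypothetical file name

# Create a one-line sample file to edit.
open my $out, '>', $file or die "Cannot create $file: $!";
print $out "Author: Randall L. Schwartz\n";
close $out;

{
    # Localize the special variables so the edit stays contained.
    local @ARGV = ($file);
    local $^I   = '.bak';        # same effect as -i.bak on the command line

    while (<>) {
        s/Randall/Randal/g;
        print;                   # goes to the new version of the file
    }
}

# Hypothetical helper: read a whole file back as one string.
sub slurp {
    my ($path) = @_;
    open my $in, '<', $path or die "Cannot read $path: $!";
    local $/;
    return scalar <$in>;
}

my $edited = slurp($file);          # the corrected text
my $backup = slurp("$file.bak");    # the original text, kept as the backup
print "edited: $edited";
print "backup: $backup";
```

The edited file holds the corrected line, and the .bak file keeps the original.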

We've seen -w before -- it turns on warnings.

The -e option says "executable code follows." That means that the s/Randall/Randal/g string is treated as Perl code. Since we've already got a while loop (from the -p option), this code is put inside the loop, before the print. For technical reasons, the last semicolon in the -e code is optional. But if you have more than one -e, and thus more than one chunk of code, only the semicolon at the end of the last one may safely be omitted.
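The semicolon rule for several -e chunks can be tried out with a sketch like the following, which launches a second perl with two -e options and captures what it prints. The scratch file and its contents are hypothetical. No -i is given, so the file itself is left alone and the result goes to standard output.

```perl
use strict;
use warnings;
use File::Temp qw(tempfile);

# Scratch input file (hypothetical contents).
my ($fh, $file) = tempfile(UNLINK => 1);
print $fh "Author: Randall L. Schwartz\n",
          "Phone: 555-0100\n",
          "Title: Llamas\n";
close $fh;

# $^X is the path of the perl that is running this script.
my @command = (
    $^X, '-p',
    '-e', 's/Randall/Randal/g;',    # semicolon needed: another chunk follows
    '-e', 's/^Phone:.*\n//',        # last chunk: semicolon may be omitted
    $file,
);

# List-form pipe open avoids any shell quoting issues.
open my $pipe, '-|', @command or die "Cannot run perl: $!";
my @lines = <$pipe>;
close $pipe or die "perl exited abnormally";

print @lines;
```

The first chunk fixes the name and the second deletes the Phone line, so only the Author and Title lines come back.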

The last command-line parameter is fred*.dat, which says that @ARGV should hold the list of filenames that match that glob. Put the pieces all together, and it's as if we had written a program like this:

#!/usr/bin/perl -w

@ARGV = glob "fred*.dat";
$^I = ".bak";

while (<>) {
  s/Randall/Randal/g;
  print;
}

Compare this program to the one we used in the previous section. It's pretty similar. These command-line options are pretty handy, aren't they?
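One way to check what Perl builds from a set of options is to ask it to print the program back. The sketch below assumes the B::Deparse module (part of the standard distribution) is available, and runs a second perl with -MO=Deparse. The code is only compiled, not run, so no files are read or changed. The exact text printed varies between perl versions.

```perl
use strict;
use warnings;

# $^X is the perl that is running this script. No file names are passed,
# because -MO=Deparse only compiles the code and prints it back.
my @command = ($^X, '-MO=Deparse', '-p', '-i.bak', '-w',
               '-e', 's/Randall/Randal/g');

open my $pipe, '-|', @command or die "Cannot run perl: $!";
my $generated = do { local $/; <$pipe> };    # slurp all of the output
close $pipe;
$generated = '' unless defined $generated;

# The "-e syntax OK" message goes to standard error, not into $generated.
print $generated;
```

The printed program should include the while loop and the substitution, much like the hand-written version.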
