home | O'Reilly's CD bookshelfs | FreeBSD | Linux | Cisco | Cisco Exam    

Book HomeDesigning Web AudioSearch this book

8.4. Making MP3 files

There are three steps to creating MP3 files for Internet distribution:

  1. Your first step will often be to save your audio stream into AIFF or WAV format. This typically starts with a compact disc, in which case the raw bits on the CD need to be turned into a computer-recognizable audio format. The process of extracting audio CD data is called ripping. However, you can also encode live audio streams to a raw format, which is typically not ripping but rather recording.

  2. The next step is to encode your raw audio stream into the MP3 format. Fortunately, most modern encoding tools handle ripping and encoding all in one step, even if they work in two steps behind the scenes.

  3. Optionally, and assuming the user holds the legal copyright to the music in question, you may want to upload your MP3 files to the Internet.

To work with and create MP3 audio, you need:

  • An MP3 ripping/encoding tool

  • An MP3 player so you can audition your MP3s for quality control before adding them to your permanent archive

  • A digital audio file, compact disc, or live audio stream to encode

You'll find an abundance of free MP3 players and encoders online (see Section 8.8, "MP3 resources" later in this chapter for more information).

8.4.1. Encoding MP3 files

Since most encoding is done directly from compact discs, we'll look at that process most closely, but keep in mind that most of these steps apply equally to audio streams coming from pre-existing WAV or AIFF audio files and to live streams. The single most important thing you'll need to keep in mind when encoding is the quality level, or bitrate. Here's a general guideline; a more complete quality comparison chart is shown in Table 8-1 later in this chapter:

  • 64 Kbps and below: voice, lectures, streaming over modems

  • 96 Kbps: low-fi music or hi-fi voice (such as newscasts)

  • 128 Kbps: music files destined for Internet distribution

  • 160 Kpbs or 192 Kbps: music files destined for your personal collection

  • 256 Kbps or 320 Kbps: files for the most demanding audiophiles -- virtually perfect quality, but much larger files

These bitrates all use the CD-quality 44.1 kHz sample rate, except the 64 Kbps settings and below, which use lesser rates of 22 kHz and 11 kHz, respectively, to achieve adequate compression for the lower bitrates.

In addition, you'll need to decide whether you want to create CBR or VBR files. CBR is constant bitrate and generates files in which every frame has the same bitrate, regardless of the complexity of the passage. Use the list above to determine a CBR setting. VBR means variable bitrate and generates files in which the bitrate is different for each frame. In VBR mode, the bitrate is adjusted dynamically, on the fly, to accommodate the simplicity or complexity of the passage in question. In VBR mode, quality thresholds are set by adjusting a slider from low quality to high quality (much as you would when saving a JPEG image, adjusting the quality level at output time). In VBR mode, a setting of 60% to 80% creates files that sound roughly equivalent to 128 Kbps to 160 Kbps CBR files.

CBR encoding is recommended for general purposes; in our opinion, VBR doesn't actually win you that much, and CBR files are more broadly compatible with a wide range of players and are less prone to glitches and errors. Some encoders create VBR files with track length written to incorrect portions of the file, which really messes up some non-mainstream players and players on other operating systems.

VBR files

Some older MP3 players may not be capable of playing VBR files or may yield inaccurate time/progress indicators. However, most modern players handle VBR files just fine. Working with the CDDB

If you've ever played a CD through your computer, you know that you are presented with only Track 1, Track 2, Track 3, and so on, in the information window and not with the names of the actual songs. Many people don't realize that artist, album, and track names are not on the disc itself. This is where the CDDB (compact disc database) servers come in to play. On the Internet is a collection of servers that provide access to an extremely large database of CD titles. If you run a CDDB-savvy CD or MP3 player or encoder, you have access to the CD database. When you insert a CD, it is identified by a special "hash" number, and a few seconds later, the CDDB returns artist, album, and track names, giving you much more useful information to work with. The main database is at http://www.cddb.com, though there are many mirrors and competing CD database servers out there.

Why is it called "ripping?"


The practice of extracting music from an audio CD to a digital audio file on your hard drive is called "ripping." Why? Because audio CDs don't use a filesystem corresponding to that of any particular operating system. Your OS's file manager can't see the tracks as it sees normal files on a data CD, so you either need to trick your operating system into seeing the tracks' file "handles" as actual WAV or AIFF files, or use a special utility to read the raw data from the audio CD and turn it into a WAV or AIFF file. Because special techniques or tools are required and you're not doing a straight file copy, another name for the process was required. No one seems to know where the term "ripping" originated, but it sounds intuitive and has been floating around in the computer vernacular for many years. Don't worry -- ripping is a very simple process, and many tools completely hide the process for you, so you may not need to think about it at all.

You'll find CDDB lookups so powerful that you probably won't want to use an encoder without this feature. All the encoders featured in this chapter contain CDDB. By using a CDDB-savvy encoder, you can have meaningful ID3 tags (covered later in this chapter) inserted into your MP3 files automatically, rather than having to enter them by hand. Getting started

For the purposes of this chapter, we will encode an MP3 file using two encoders: Xing AudioCatalyst and MusicMatch Jukebox. Xing has long been a pioneer in MPEG encoding, and its encoding algorithms are reliable and popular. In fact, MusicMatch also used Xing's codec in Jukebox until late 1999, when they switched to a slightly faster and better quality codec from Fraunhofer and Thomson. Xing's AudioCatalyst also features faster-than-real-time encoding on modern machines, which is essential if you need to encode live streams. The downside is that you have to pay for the quality and extended features, but the price is modest. However, you can download a trial version of AudioCatalyst at http://www.xingtech.com, so you can try before you buy.

The second encoder we will use is MusicMatch Jukebox, a shareware encoder available at http://www.musicmatch.com. While early versions of MusicMatch were limited in encoding quality until you paid for the upgrade, MusicMatch changed their policy in early 2000; you can now download the free version and get full CD-quality encoding. MusicMatch Jukebox is an all-in-one ripping /encoding / playback/database application, and offers very good encoding capabilities. New users may find MusicMatch a little easier to use than AudioCatalyst, while more practiced users often gravitate toward AudioCatalyst.

Copyright issues

You should be aware that copyright issues are involved when using other people's music. You may legally encode CD tracks of music you have already purchased for your own use. However, you may not post or trade them over the Internet unless you hold the copyright. The Recording Industry Association of America (RIAA) is deeply involved in protecting artists and their copyrighted work from unauthorized distribution on the Internet. You should always be aware of who holds the copyright to a piece of music you intend to distribute. Furthermore, you must receive permission from the copyright holder to post any copyrighted work for distribution (and it's highly unlikely that any label will grant you this permission). For more information about the legalities of MP3, see Section 8.7, "Legalities of MP3" later in this chapter.

8.4.2. Using Xing AudioCatalyst

Encoding tracks from CD with the Xing AudioCatalyst is a straightforward and relatively easy process. The interface is easy to follow, with the features you will use most often arranged in the main window, as shown in Figure 8-8.

Figure 8-8

Figure 8-8. The most important features in the AudioCatalyst interface, Add from File, Add from CD, Player, and Encode, are all located in the main window. 128 Kbps Internet standard

The MP3 quality standard that has emerged for general Internet distribution is 128 Kbps, which maintains near-CD quality sound at a 44.1 kHz sampling rate and provides a fairly good quality-to-file-size ratio for Internet delivery. You can use the next higher encoding levels (160 Kbps or 192 Kbps) for your personal collection or the next lower (112 Kbps) if you need lower file sizes. Anything higher than 192 Kbps is overkill for most purposes, and anything lower than 112 Kbps loses the benefit of high fidelity sound. For perspective, consider that one minute of stereo 16-bit, 44.1 kHz CD-quality digital audio is roughly 10 MB, whereas the 128 Kbps encoded MP3 version will occupy less than 1 MB. At this rate, the average 4 1/2 minute song uses less than 4 MB, instead of the over 45 MB needed for the standard, uncompressed 44.1 kHz WAV or AIFF file. At the reduced file size of 128 Kbps, 44.1 kHz MP3 encoding, distributing near CD-quality music over the Internet is attainable and sensible, even when delivered at analog modem speeds. Table 8-1 details the relationship between bitrate, file size, and quality.

Table 8-1. Results of encoding at different bitrates

Bitrate (Kilobits per Second)

Size of Song



41.3 MB



5.6 MB

Very close to CD quality, indistinguishable for all but the most demanding audiophiles


4.7 MB

High quality


3.8 MB

Approaches CD quality


3.3 MB

Below CD quality


2.8 MB

Well below CD quality MP3; "swishiness" almost always present


1.9 MB

Acceptable for voice, but not music; very noticeable quality artifacts


0.9 MB

Tinny, AM radio quality, lots of artifacts


0.5 MB

Shortwave radio quality -- pretty much unusable Working with ID3 tags

Storing additional information such as title, artist, album, year, comments, and so on, is performed by creating or editing ID3 tags. ID3 tags are bits of information stored within the MP3 file itself and are accessible and editable using most of MP3 players. A good MP3 encoding tool will also offer to store ID3 data within the files it creates as it's running. In other words, ID3 data can be created along with the file, or added or changed later through an MP3 player or with a third-party utility. To have AudioCatalyst store ID3 data, select a file or CD track within the main window and choose Track Information in the edit menu to display the window shown in Figure 8-10. Be sure to check the box next to ID3 in the Preferences window.

Figure 8-10

Figure 8-10. Use the Track Information window to include output filename, title, artist, and album.

Notes on numbering


If you're encoding your own albums, it's a good idea to include track numbers from the CD in the filenames themselves. Without them, the files will appear in the encoding output directory in alphabetical order, rather than in album order. But an album is a complete entity, not just a pile of singles. Artists put a great deal of effort into determining the ideal order in which tracks should be heard. If you choose to respect this prerogative of the artist, have your encoder prepend track numbers to the filenames so they appear in Explorer/Finder/Tracker in the proper order. When you drag the folder onto your MP3 player, you'll hear the album as it was meant to be heard, not willy-nilly. And remember: if there are more than nine tracks on the album, you'll need to make sure the encoder uses "01" rather than simply "1." Otherwise, your computer will order track 10 before track 1, and so on. To control this behavior in AudioCatalyst, look for the Track Number checkbox in the Naming section of AudioCatalyst's Settings panel.

Trial download

Trial versions of AudioCatalyst are available for download from Xing's web site at http://www.xingtech.com.

8.4.3. Using MusicMatch Jukebox

Using MusicMatch Jukebox is similar to using AudioCatalyst, though the interface differs somewhat. MusicMatch is an all-in-one tool that combines all the features you need to create and database your MP3 files. MusicMatch Jukebox also includes a built-in player, whereas AudioCatalyst requires you to install a separate player.

Recorder or encoder?

Though MusicMatch uses the term "recorder," this isn't technically correct. The signal you are working with, whether it comes from CD, a file, or a live stream, has already been recorded. The application really should call this feature "encoder," because that's what it is and does. MusicMatch probably calls it "recorder" just to make things easy for new users, but we feel MusicMatch is doing a disservice to the community by confusing -- not clarifying -- the difference.

To encode to MP3 from MusicMatch, place an audio CD into your computer's CD drive and launch MusicMatch Jukebox. Click the button that says Recorder and the encoding recorder window will appear as shown in Figure 8-12. The tracks on the CD will be listed with track numbers.

Figure 8-12

Figure 8-12. Click Recorder in the MusicMatch Jukebox interface to enter the encoding section of Jukebox. Selecting tracks

The next step is to select the tracks you wish to encode by clicking the checkbox next to each desired track. You may also select other audio files from your hard drive. When you have finished selecting your tracks, click the Start button found in the upper-right side of the window.

Blame it on the Rio -- and other MP3 hardware


Diamond's Rio player is a compact, portable music player capable of playing MP3 music files downloaded from the Internet or created from a CD or other digital audio source. This type of device signals a new era in personal music listening.


The player is about the size of a deck of cards and roughly the same weight. Even better, it has no moving parts so it won't skip, which is an important feature for people who use portable devices while driving or exercising. Because the device is energy-efficient, a single AA battery will power the Rio for up to 12 hours.


The low-end version of the Rio comes with 32 MB of rewriteable, built-in flash memory that holds about 30 to 35 minutes of 128 Kbps MP3 audio. External flash memory cards can increase playing time and are available where you purchase the Rio.


User accounts of the Rio in action are largely positive. Like the Sony Walkman of old, the Rio seems to be defining a lifestyle. With music moving from disc to chip, the Rio was the first of many players ushering in the new era of file-based music consumption. Samsung, Sony, Fisher, and Sanyo have introduced MP3 players similar to the Rio, as well as a new line-up of other personal home and car audio players.


Another product of interest is called the Personal Jukebox PJB-100 from HanGo, a Korean manufacturer. This product holds over 1,000 hours of music on a 4.5 GB internal hard drive. The beauty of this product is that it not only can it store so much music (roughly 3.2 days of continuous, nonrepeating music, assuming 128 Kbps files), but it also includes other handy features. It features a built-in organizer, which is ID3-ready, allowing easy access and categorization of your music titles from http://www.cddb.com databases, including song title, artist, album, and other information. HanGo's Personal Jukebox, like similar products, includes its own ripping/encoding utility.


Yet another product, the Yamakawa AVPhile 715 player, has taken the concept one step further and included the ability to play back multiple formats of audio and video. Besides playback of MP3 files, this unit will play back DVD, CD, VCD (MPEG-1 video), and SVCD (MPEG-1 supervideo) files. This type of player paves the way for a centralized player that is connected to the Internet and will play all types of media in your home (or anywhere else for that matter).


Beyond these new products, larger companies such as Sony, Sanyo, and Fisher have released next-generation portable and home players. These devices play both MP3 and other, more secure file formats and are destined to be compliant with SDMI (Secure Digital Music Initiative), which will give music retailers a modicum of security against piracy. One such device is the sleek and slender Sony Music Clip. It can hold 64 MB of MP3s (actually, the unit converts your MP3s into the less common but more efficient ATRAC format) and has a futuristic look, like a small microphone or fountain pen. However these product lines continue, the direction has been established. Future MP3 hardware devices will be smaller, play longer, and hold more music than previous generations of this new technology. CDDB database support

Also included in MusicMatch is CDDB support, as described earlier in this chapter. This option allows you connect to the online CD database at http://www.cddb.com, as shown in Figure 8-13, to automatically include artist, album, and track information with your encoded files. This feature is a great time-saver because it relieves you from having to locate and enter the appropriate track information for each track on a given CD.

Figure 8-13

Figure 8-13. The CDDB web site features track information for a variety of CDs from around the world.

Library Navigation Links

Copyright © 2002 O'Reilly & Associates. All rights reserved.