The MP3 Format

If you have read How CDs Work, then you know something about how CDs store music. A CD stores a song as digital information. The data on a CD uses an uncompressed, high-resolution format. Here's what happens when a CD is created:
  • Music is sampled 44,100 times per second. The samples are 2 bytes (16 bits) long.
  • Separate samples are taken for the left and right speakers in a stereo system.

So a CD stores a huge number of bits for each second of music:

 

44,100 samples/second * 16 bits/sample * 2 channels = 1,411,200 bits per second

Let's break that down: 1.4 million bits per second equals 176,000 bytes per second. If an average song is three minutes long, then the average song on a CD consumes about 32 million bytes of space. That's a lot of space for one song, and it's especially large when you consider that over a 56K modem, it would take close to two hours to download that one song.

The MP3 format is a compression system for music. The MP3 format helps reduce the number of bytes in a song without hurting the quality of the song's sound. The goal of the MP3 format is to compress a CD-quality song by a factor of 10 to 14 without noticeably affecting the CD-quality sound. With MP3, a 32-megabyte (MB) song on a CD compresses down to about 3 MB. This lets you download a song in minutes rather than hours, and store hundreds of songs on your computer's hard disk without taking up that much space.

Is it possible to compress a song without hurting its quality? We use compression algorithms for images all the time. For example, a GIF file is a compressed image. So is a JPG file. We create Zip files to compress text. So we are familiar with compression algorithms for images and words and we know they work. To make a good compression algorithm for sound, a technique called perceptual noise shaping is used. It is "perceptual" partly because the MP3 format uses characteristics of the human ear to design the compression algorithm. For example:

  • There are certain sounds that the human ear cannot hear.
  • There are certain sounds that the human ear hears much better than others.
  • If there are two sounds playing simultaneously, we hear the louder one but cannot hear the softer one.
Using facts like these, certain parts of a song can be eliminated without significantly hurting the quality of the song for the listener. Compressing the rest of the song with well-known compression techniques shrinks the song considerably -- by a factor of 10 at least. (If you would like to learn more about the specific compression algorithms, see the links at the end this article.) When you are done creating an MP3 file, what you have is a "near CD quality" song. The MP3 version of the song does not sound exactly the same as the original CD song because some of it has been removed, but it's very close.

From this description, you can see that MP3 is nothing magical. It is simply a file format that compresses a song into a smaller size so it is easier to move around on the Internet and store.

The Name

MPEG is the acronym for Moving Picture Experts Group. This group has developed compression systems used for video data. For example, DVD movies, HDTV broadcasts and DSS satellite systems use MPEG compression to fit video and movie data into smaller spaces. The MPEG compression system includes a subsystem to compress sound, called MPEG audio Layer-3. We know it by its abbreviation, MP3.