Why are MP3 bitrates usually multiples of 32? (powers of 2)

audiobitratemp3

I understand why multiples of 2 show up so often in with computers because of their binary nature, but I haven't been able to figure out the most common mp3 bitrates (64kbps, 128kbps, 160kbps, 192kbps, 256kbps, 320kbps, etc.) tend to follow this rule as well.

  • Since MP3s are just sequential encodings of sound waves, why does it matter that each second is represented with a number of kilobits that is divisible by 2?
  • Do music players like iTunes just keep reading the file and reproducing the encoded sound regardless of where the second boundaries are, or do they read the file second by second?
  • In the later case, reading a 256kbps file would require reading slightly fewer pages in memory than a 257kbps file, but the player could always just read in 256-kilobit chunks regardless of them bitrate and just process it gradually, right?
  • Are MP3s at 128kbps popular just because this is a generally accepted bitrate or do they actually have some advantage over files at 126kbps and 131kbps other than a very slight difference in quality/filesize?

Best Answer

For constant bit rate encoding (CBR), the MPEG-1 Audio Layer III standard specifies standard bit rates of 32, 40, 48, 56, 64, 80, 96, 112, 128, 160, 192, 224, 256 and 320 kbit/s. There are others defined in the MPEG-2 standard, but they are all multiples of 2 as well (actually, all multiples of 8 in the range 8 to 160 - see the table labelled "Bitrate index" in the link above).

Technically, there is nothing limiting an MP3's bitrate to a multiple of two, as variable bit-rate encoding may be used, or a custom bitrate may be implemented using some of the unused flags from the MPEG specification (although this would have to be implemented manually). For an MP3 to be compliant with the MPEG specification, and thus compatible with most MP3 decoders, it must have a bit-rate as defined per the specification - thus, all CBR-encoded MP3 files will have a bit-rate as a multiple of two.

According to the resource, VBR can be encoded by either switching the bitrate between the fixed rates above per-frame, or it can be encoded by sharing the available bits in adjacent frames (effectively yielding a non-standard bitrate for the two combined frames). The length of a given frame is dependent on the sampling rate, with 1152 samples per frame. There is nothing restricting the size of a frame itself, nor is there any restriction for a frame to be a base-2 size (i.e. a 128 kbit/s MP3 with a sampling rate of 44.1 kHz will have a frame size of 417 bytes).


Lastly, a file encoded at 126 kbps will sound worse than one at 128 kbps, and likewise, one at 131 kbps will sound better. However, MP3's are encoded according to a particular encoder's psychoacoustic model for compression. The amount a file will sound "better" or "worse" at a given bitrate is highly dependent on the algorithms used to implement the model - however, in general, higher bitrates allow for more data, presumably reconstructing a more accurate stream of the original audio signal.

Related Question