Dolby AC-3

 

What is AC-3 ?

AC-3 (Audio Code number 3) refers to a multichannel music compression technology that has been developed by Dolby Laboratories. The goal of this digital compression technology is to produce a digital representation of an audio signal which, when decoded and reproduced, sounds the same as the original signal, while using a minimum of digital information (bit-rate) for the compressed (or encoded) representation, providing true surround sound.
    Over the past years Dolby Laboratories has used the term Dolby Digital to refer to this digital system in the film and theatre industries, and has used the term Dolby Surround AC-3 to refer to the system in the home theatre market. In practice both the cinema and home theater applications are variations of the same basic technology (slightly different data rates). So to minimize the possibility of confusion by consumers the Company has decided that Dolby's consumer multichannel digital format will be known by the same name used in the film and theatre industries ie Dolby Digital.



    The AC-3 can carry from 1 to 5.1 channels. It provides five full range channels (3 Hz to 20,000 Hz) in what is sometimes referred to as a "3/2" configuration: three front channels (left, center, and right), plus two surround channels. A sixth bass-only effects channel (3 Hz to 120 Hz), also called sometimes "low frequencies enhancement channel" (LFE), is also provided, giving rise to the term "5.1" channels. As AC-3 is mainly designed for providing true surround sound, it also includes information about the room size and differences in dB between the channels levels.


How does AC-3 work ?

Like MP3 or AAC, AC-3 uses masking properties of sounds to achieve its compression. Input uncompressed PCM samples must be 32, 44.1 or 48 kHz on up to 20 bits.



  1.     The first step in the encoding process is to transform the representation of audio from a sequence of PCM time samples into a sequence of frequencies coefficients blocks. This is done in the analysis filter bank. Overlapping blocks of 512 time samples are multiplied by a time window and transformed into the frequency domain. Due to the overlapping blocks, each PCM input sample is represented in two sequential transformed blocks. The frequency domain representation may then be decimated by a factor of two so that each block contains 256 frequency coefficients. The individual frequency coefficients are represented in binary exponential notation as a binary exponent and a mantissa.

  2.     The set of exponents is encoded into a coarse representation of the signal spectrum which is referred to as the spectral envelope.

  3.     This spectral envelope is used by the core bit allocation routine which determines how many bits to use to encode each individual mantissa.

  4.     The mantissa is quantized according to the bit allocation informations.

  5.     The spectral envelope and the coarsely quantized mantissas for 6 audio blocks (1536 audio samples) are formatted into an AC-3 frame.

  6.     The AC-3 bit stream (from 32 to 640 kbps) is a sequence of AC-3 frames.


AC-3 utilizations

Dolby AC-3 is used intensively in the cinema at 640 kbps data rate. The THX quality label is also in 95% of the cases based on some AC-3 installations. It is used on laserdiscs at 384 kbps bitrate, and now in DVD at similar bitrates. The 5 channels of the old Dolby Pro-Logic are extracted from the 2 stereo channels, so they only reproduce parts of the audio spectrum range. As AC-3 provides only full range channels, its sound is really much better in terms of quality and spatialisation. You can notice that an AC-3 bitstream can carry a Pro-Logic signal in its two front channels for compatibility with old systems.
    One of the main advantages of AC-3 is its utilization: since several years, it has intensively been used, so it became a real world "standard".

© 1998 Gabriel Bouvigne for MP3Bench
© 1999-2001 Gabriel Bouvigne for MP3'Tech - www.mp3-tech.org