A-4
Video/Audio Compression and Decompression Concepts
The DCT coefcient in the upper left location (0, 0) of the block
represents the zero horizontal and zero vertical frequencies and is known
as the DC coefcient
. The DC coefcient is proportional to the average
pixel value of the 8 x 8 block, and additional compression is provided
through predictive coding because the difference in the average value of
neighboring 8 x 8 blocks tends to be relatively small.
The other coefcients represent one or more nonzero horizontal or
nonzero vertical spatial frequencies, and are called AC coefcients
. The
quantization level of the coefcients corresponding to the higher spatial
frequencies favors the creation of an AC coefcient of zero by choosing
a quantization step size such that the human visual system is unlikely to
perceive the loss of the particular spatial frequency, unless the coefcient
value lies above the particular quantization level. The statistical encoding
of the expected runs of consecutive zero-valued coefcients of higher-
order coefcients accounts for some coding gain.
To cluster nonzero coefcients early in the series and to encode as many
zero coefcients as possible following the last nonzero coefcient in the
ordering, the coefcient sequence is specied to be a
zigzag ordering.
Zigzag ordering concentrates the highest spatial frequencies at the end
of the series. The MPEG-2 standard includes additional block scanning
orders.
After block scanning has been performed, the encoder performs
run-length coding on the AC coefcients. This process reduces each
8 x 8 block of DCT coefcients to a number of events represented by a
nonzero coefcient and the number of preceding zero coefcients.
Because many coefcients are likely to be zero after quantization,
run-length coding increases the overall compression ratio.
The encoder then performs variable-length coding (VLC) on the resulting
data. VLC is a reversible procedure for coding that assigns shorter
codewords to frequent events and longer codewords to less frequent
events, thereby reducing the number of bits necessary to represent a
data set without losing any information. Huffman encoding is a
particularly well known form of VLC.
The nal compressed video data is now ready for transmission to either
a local storage device from which a video decoder may later retrieve and
decompress the data, or to a remote video decoder using cable, or direct
satellite broadcast, as examples.