a codec, short for coder-decoder, does two basic operations −
first, it converts an analog voice signal to its equivalent digital form so that it can be easily transmitted.
thereafter, it converts the compressed digital signal back to its original analog form so that it can be replayed.
there are many codecs available in the market – some are free while others require licensing. codecs vary in the sound quality and vary in bandwidth accordingly.
hardware devices such as phones and gateways support several different codecs. while talking to each other, they negotiate which codec they will use.
here, in this chapter, we will discuss a few popular sip audio codecs that are widely used.
g.711
g.711 is a codec that was introduced by itu in 1972 for use in digital telephony. the codec has two variants: a-law is being used in europe and in international telephone links, ulaw is used in the u.s.a. and japan.
g.711 uses a logarithmic compression. it squeezes each 16-bit sample to 8 bits, thus it achieves a compression ratio of 1:2.
the bitrate is 64 kbit/s for one direction, so a call consumes 128 kbit/s.
g.711 is the same codec used by the pstn network, hence it provides the best voice quality. however it consumes more bandwidth than other codecs.
it works best in local area networks where we have a lot of bandwidth available.
g.729
g.729 is a codec with low bandwidth requirements; it provides good audio quality.
the codec encodes audio in frames of 10 ms long. given a sampling frequency of 8 khz, a 10 ms frame contains 80 audio samples.
the codec algorithm encodes each frame into 10 bytes, so the resulting bitrate is 8 kbit/s in one direction.
g.729 is a licensed codec. end-users who want to use this codec should buy a hardware that implements it (be it a voip phone or gateway).
a frequently used variant of g.729 is g.729a. it is wire-compatible with the original codec but has lower cpu requirements.
g.723.1
g.723.1 is the result of a competition that itu announced with the aim to design a codec that would allow calls over 28.8 and 33 kbit/s modem links.
we have two variants of g.723.1. they both operate on audio frames of 30 ms (i.e. 240 samples), but the algorithms differ.
the bitrate of the first variant is 6.4 kbit/s, while for the second variant, it is 5.3 kbit/s.
the encoded frames for the two variants are 24 and 20 bytes long, respectively.
gsm 06.10
gsm 06.10 is a codec designed for gsm mobile networks. it is also known as gsm full rate.
this variant of the gsm codec can be freely used, so you will often find it in open source voip applications.
the codec operates on audio frames 20 ms long (i.e. 160 samples) and it compresses each frame to 33 bytes, so the resulting bitrate is 13 kbit/.