Issue No.02 - April-June (2013 vol.20)
pp: 72-78
Schuyler Quackenbush, Audio Research Labs
ABSTRACT
The MPEG Audio Subgroup has a rich history of accomplishments in creating music coding technology. At higher bit rates, MPEG technology can represent arbitrary sounds, including the human voice, with excellent quality. MPEG-1 and MPEG-2 Audio coders use perceptually shaped quantization noise as the primary tool for achieving compression. The MPEG Unified Speech and Audio Coding (USAC) standard is a single technology capable of compressing speech, speech mixed with music, or music signals with quality that is always at least as good as the better of two state-of-the-art reference codecs, one optimized for speech and mixed content (AMR-WB+) and the other optimized for music and general audio (HE-AACv2). This article provides an overview of the USAC architecture and summarizes its performance relative to the best state-of-the-art speech and audio codecs.
INDEX TERMS
Music, Coding, MPEG standards, HE-AACv2, multimedia, multimedia applications, MPEG, MPEG-4 High-Efficiency Advanced Audio Coding (AAC) standard, audio codecs, music coding technology, compressed speech, AMR-WB+
CITATION
Schuyler Quackenbush, "MPEG Unified Speech and Audio Coding", IEEE MultiMedia, vol.20, no. 2, pp. 72-78, April-June 2013, doi:10.1109/MMUL.2013.24