Unified Speech and Audio Coding

Unified Speech and Audio Coding (USAC) is an audio compression format and codec for both music and speech or any mix of speech and audio using very low bit rates between 12 and 64 kbit/s.^[1] It was developed by Moving Picture Experts Group (MPEG) and was published as an international standard ISO/IEC 23003-3 (a.k.a. MPEG-D Part 3)^[2] and also as an MPEG-4 Audio Object Type in ISO/IEC 14496-3:2009/Amd 3 in 2012.^[3]

It uses time-domain linear prediction and residual coding tools (ACELP-like techniques) for speech signal segments and transform coding tools (MDCT-based techniques) for music signal segments and it is able to switch between the tool sets dynamically in a signal-responsive manner. It is being developed with the aim of a single, unified coder with performance that equals or surpasses that of dedicated speech coders and dedicated music coders over a broad range of bitrates. Enhanced variations of the MPEG-4 Spectral Band Replication (SBR) and MPEG-D MPEG Surround parametric coding tools are integrated into the USAC codec.^[4]^[5]

xHE-AAC

The MPEG-D USAC standard (ISO/IEC 23003-3) defines the xHE-AAC profile (Extended High Efficiency AAC), which contains all of the tools of the HE-AAC v2 profile plus the mono/stereo capabilities of the Baseline USAC profile. As a result, a decoder built according to the xHE-AAC profile is able to also decode the bit streams created for the previous members of the AAC family profile(s). The xHE-AAC profile was designed for applications relying on a consistent performance at low data rates while being able to decode all existing AAC-LC, HE-AAC and HE-AACv2 content.^[6]

xHE-AAC is a mandatory audio codec in the Digital Radio Mondiale standard.^[7]^[8]^[9]

In April 2016, Via Licensing announced the launch of a xHE-AAC patent pool licensing program for 2016.^[10]

References

↑ MPEG. "Unified Speech and Audio Coding". The Moving Pictures Experts Group. Retrieved 2016-11-11.
↑ "ISO/IEC DIS 23003-3 - Information technology -- MPEG audio technologies -- Part 3: Unified speech and audio coding". 2011-02-15. Retrieved 2011-07-18.
↑ "ISO/IEC 14496-3:2009/PDAM 3 - Transport of unified speech and audio coding (USAC)". 2011-06-30. Retrieved 2011-07-18.
↑ Neuendorf; et al. (2013-12-20), The ISO/MPEG Unified Speech and Audio Coding Standard—Consistent High Quality for All Content Types and at All Bit Rates (PDF), retrieved 2015-06-13
↑ Neuendorf; et al. (2012-04-26), MPEG Unified Speech and Audio Coding-The ISO/MPEG standard for high-efficiency audio coding of all content types (PDF), retrieved 2015-06-13
↑ Neuendorf, Max; Multrus, Markus; Rettelbach, Nikolaus; Fuchs, Guillaume; Robilliard, Julien; Lecomte, Jérémie; Wilde, Stephan; Bayer, Stefan; Disch, Sascha; Helmrich, Christian; Lefebvre, Roch; Gournay, Philippe; Bessette, Bruno; Lapierre, Jimmy; Kjörling, Kristofer; Purnhagen, Heiko; Villemoes, Lars; Oomen, Werner; Schuijers, Erik; Kikuiri, Kei; Chinen, Toru; Norimatsu, Takeshi; Chong, Kok Seng; Oh, Eunmi; Kim, Miyoung; Quackenbush, Schuyler; Grill, Bernhard (2013-12-01). "The ISO/MPEG Unified Speech and Audio Coding Standard - Consistent High Quality for all Content Types and at all Bit Rates". Journal of the AES. 61 (12): 956–977. ISSN 0004-7554.
↑ "Technical Info | Digital Radio Mondiale". www.drm.org. Retrieved 2016-08-02.
↑ "xHE-AAC". Fraunhofer Institute for Integrated Circuits IIS. Retrieved 2016-08-02.
↑ xHE-AAC in Digital Radio Mondiale (DRM) (PDF). Fraunhofer IIS. 2015.
↑ "Via Licensing Announces Extended High Efficiency AAC Patent Pool - Via Corp". www.via-corp.com. Retrieved 2016-08-02.

Multimedia compression and container formats

Video
compression

ISO/IEC	MJPEG Motion JPEG 2000 MPEG-1 MPEG-2 Part 2 MPEG-4 Part 2/ASP Part 10/AVC MPEG-H Part 2/HEVC

ITU-T	H.120 H.261 H.262 H.263 H.264 H.265

SMPTE	VC-1 VC-2 VC-3 VC-5

Others	Apple Video AV1 AVS Bink Cinepak Daala Dirac DV DVI FFV1 Huffyuv Indeo Lagarith Microsoft Video 1 MSU Lossless OMS Video Pixlet ProRes 422 ProRes 4444 QuickTime Animation Graphics RealVideo RTVideo SheerVideo Smacker Sorenson Video, Spark Theora Thor VP3 VP6 VP7 VP8 VP9 WMV XEB YULS

Audio
compression

ISO/IEC	MPEG-1 Layer III (MP3) MPEG-1 Layer II Multichannel MPEG-1 Layer I AAC HE-AAC AAC-LD MPEG Surround MPEG-4 ALS MPEG-4 SLS MPEG-4 DST MPEG-4 HVXC MPEG-4 CELP MPEG-D USAC MPEG-H 3D Audio

ITU-T	G.711 (A-law, µ-law) G.718 G.719 G.722 G.722.1 G.722.2 G.723 G.723.1 G.726 G.728 G.729 G.729.1

IETF	Opus iLBC

3GPP	AMR AMR-WB AMR-WB+ EVRC EVRC-B GSM-HR GSM-FR GSM-EFR

Others	ACELP AC-3 ALAC Asao ATRAC CELT Codec2 DRA DTS FLAC iSAC Monkey's Audio TTA True Audio MT9 Musepack OptimFROG OSQ QCELP RCELP RealAudio RTAudio SD2 SHN SILK Siren SMV Speex SVOPC TwinVQ VMR-WB Vorbis VSELP WavPack WMA MQA aptX

Image
compression

IEC, ISO, ITU-T, W3C, IETF	CCITT Group 4 GIF HEVC JBIG JBIG2 JPEG JPEG 2000 JPEG XR Lossless JPEG PNG TIFF TIFF/EP TIFF/IT

Others	APNG BPG DjVu EXR FLIF ICER MNG PGF QTVR WBMP WebP

Containers

ISO/IEC	MPEG-ES MPEG-PES MPEG-PS MPEG-TS ISO base media file format MPEG-4 Part 14 (MP4) Motion JPEG 2000 MPEG-21 Part 9 MPEG media transport

ITU-T	H.222.0 T.802

IETF	RTP

Others	3GP and 3G2 AMV ASF AIFF AVI AU BPG Bink Smacker BMP DivX Media Format EVO Flash Video GXF IFF M2TS Matroska WebM MXF Ogg QuickTime File Format RatDVD RealMedia RIFF WAV MOD and TOD VOB, IFO and BUP

Collaborations

See Compression methods for methods and Compression software for codecs

MPEG (Moving Picture Experts Group)

MPEG-1 2 3 4 7 21 A B C D E V M U H

MPEG-1 Parts	Part 1: Systems Program stream Part 2: Video based on H.261 Part 3: Audio Layer I Layer II Layer III

MPEG-2 Parts	Part 1: Systems (H.222.0) Transport stream Program stream Part 2: Video (H.262) Part 3: Audio Layer I Layer II Layer III MPEG Multichannel Part 6: DSM CC Part 7: Advanced Audio Coding

MPEG-4 Parts	Part 2: Video based on H.263 Part 3: Audio Part 6: DMIF Part 10: Advanced Video Coding (H.264) Part 11: Scene description Part 12: ISO base media file format Part 14: MP4 file format Part 17: Streaming text format Part 20: LASeR Part 22: Open Font Format

MPEG-7 Parts	Part 2: Description definition language

MPEG-21 Parts	Parts 2, 3 and 9: Digital Item Part 5: Rights Expression Language

MPEG-D Parts	Part 1: MPEG Surround Part 3: Unified Speech and Audio Coding

MPEG-H Parts	Part 1: MPEG media transport Part 2: High Efficiency Video Coding Part 3: MPEG-H 3D Audio

Other	MPEG-DASH

This article is issued from Wikipedia - version of the 11/11/2016. The text is available under the Creative Commons Attribution/Share Alike but additional terms may apply for the media files.

Unified Speech and Audio Coding

xHE-AAC

See also

References