MPEG-4

MPEG-4 is a method of defining compression of audio and visual (AV) digital data. It was introduced in late 1998 and designated a standard for a group of audio and video coding formats and related technology agreed upon by the ISO/IEC Moving Picture Experts Group (MPEG) (ISO/IEC JTC1/SC29/WG11) under the formal standard ISO/IEC 14496 Coding of audio-visual objects. Uses of MPEG-4 include compression of AV data for web (streaming media) and CD distribution, voice (telephone, videophone) and broadcast television applications.

Background

MPEG-4 absorbs many of the features of MPEG-1 and MPEG-2 and other related standards, adding new features such as (extended) VRML support for 3D rendering, object-oriented composite files (including audio, video and VRML objects), support for externally specified Digital Rights Management and various types of interactivity. AAC (Advanced Audio Coding) was standardized as an adjunct to MPEG-2 (as Part 7) before MPEG-4 was issued.

MPEG-4 is still an evolving standard and is divided into a number of parts. Companies promoting MPEG-4 compatibility do not always clearly state which "part" level compatibility they are referring to. The key parts to be aware of are MPEG-4 Part 2 (including Advanced Simple Profile, used by codecs such as DivX, Xvid, Nero Digital and 3ivx and by QuickTime 6) and MPEG-4 part 10 (MPEG-4 AVC/H.264 or Advanced Video Coding, used by the x264 encoder, Nero Digital AVC, QuickTime 7, and high-definition video media like Blu-ray Disc).

Most of the features included in MPEG-4 are left to individual developers to decide whether or not to implement. This means that there are probably no complete implementations of the entire MPEG-4 set of standards. To deal with this, the standard includes the concept of "profiles" and "levels", allowing a specific set of capabilities to be defined in a manner appropriate for a subset of applications.

Initially, MPEG-4 was aimed primarily at low bit-rate video communications; however, its scope as a multimedia coding standard was later expanded. MPEG-4 is efficient across a variety of bit-rates ranging from a few kilobits per second to tens of megabits per second. MPEG-4 provides the following functions:

Overview

MPEG-4 provides a series of technologies for developers, for various service-providers and for end users:

The MPEG-4 format can perform various functions, among which might be the following:

Profiles and Levels

MPEG-4 provides a large and rich set of tools for encoding. Subsets of the MPEG-4 tool sets have been provided for use in specific applications. These subsets, called 'Profiles', limit the size of the tool set a decoder is required to implement.[1] In order to restrict computational complexity, one or more 'Levels' are set for each Profile.[1] A Profile and Level combination allows:[1]

MPEG-4 Parts

MPEG-4 consists of several standards—termed "parts"—including the following (each part covers a certain aspect of the whole specification):

MPEG-4 parts[2][3]
Part Number First public release date (first edition) Latest public release date (last edition) Latest amendment Title Description
Part 1 ISO/IEC 14496-1 1999 2010[4] 2010[5] Systems Describes synchronization and multiplexing of video and audio. For example, the MPEG-4 file format version 1 (obsoleted by version 2 defined in MPEG-4 Part 14). The functionality of a transport protocol stack for transmitting and/or storing content complying with ISO/IEC 14496 is not within the scope of 14496-1 and only the interface to this layer is considered (DMIF). Information about transport of MPEG-4 content is defined e.g. in MPEG-2 Transport Stream, RTP Audio Video Profiles and others.[6][7][8][9][10]
Part 2 ISO/IEC 14496-2 1999 2004[11] 2009 Visual A compression format for visual data (video, still textures, synthetic images, etc.). One of the many "profiles" in Part 2 is the Advanced Simple Profile (ASP).
Part 3 ISO/IEC 14496-3 1999 2009[12] 2010[13][14] Audio A set of compression formats for perceptual coding of audio signals, including some variations of Advanced Audio Coding (AAC) as well as other audio/speech coding formats and tools (such as Audio Lossless Coding (ALS), Scalable Lossless Coding (SLS), Structured Audio, Text-To-Speech Interface (TTSI), HVXC, CELP and others)
Part 4 ISO/IEC 14496-4 2000 2004[15] 2010 (2011) Conformance testing Describes procedures for testing conformance to other parts of the standard.
Part 5 ISO/IEC 14496-5 2000 2001[16] 2010 (2011) Reference software Provides reference software for demonstrating and clarifying the other parts of the standard.
Part 6 ISO/IEC 14496-6 1999 2000[17] Delivery Multimedia Integration Framework (DMIF)
Part 7 ISO/IEC TR 14496-7 2002 2004[18] Optimized reference software for coding of audio-visual objects Provides examples of how to make improved implementations (e.g., in relation to Part 5).
Part 8 ISO/IEC 14496-8 2004 2004[19] Carriage of ISO/IEC 14496 contents over IP networks Specifies a method to carry MPEG-4 content on IP networks. It also includes guidelines to design RTP payload formats, usage rules of SDP to transport ISO/IEC 14496-1-related information, MIME type definitions, analysis on RTP security and multicasting.
Part 9 ISO/IEC TR 14496-9 2004 2009[20] Reference hardware description Provides hardware designs for demonstrating how to implement the other parts of the standard.
Part 10 ISO/IEC 14496-10 2003 2012[21] (2010[22]) Advanced Video Coding (AVC) A compression format for video signals which is technically identical to the ITU-T H.264 standard.
Part 11 ISO/IEC 14496-11 2005 2005[23] 2009 Scene description and application engine Can be used for rich, interactive content with multiple profiles, including 2D and 3D versions. MPEG-4 Part 11 revised MPEG-4 Part 1 – ISO/IEC 14496-1:2001 and two amendments to MPEG-4 Part 1. It describes a system level description of an application engine (delivery, lifecycle, format and behaviour of downloadable Java byte code applications) and the Binary Format for Scene (BIFS) and the Extensible MPEG-4 Textual (XMT) format – a textual representation of the MPEG-4 multimedia content using XML, etc.[23] (It is also known as BIFS, XMT, MPEG-J.[24] MPEG-J was defined in MPEG-4 Part 21)
Part 12 ISO/IEC 14496-12 2004 2012[25] 2009[26] (2010[2]) ISO base media file format A file format for storing time-based media content. It is a general format forming the basis for a number of other more specific file formats (e.g. 3GP, Motion JPEG 2000, MPEG-4 Part 14). It is technically identical to ISO/IEC 15444-12 (JPEG 2000 image coding system – Part 12).
Part 13 ISO/IEC 14496-13 2004 2004[27] Intellectual Property Management and Protection (IPMP) Extensions MPEG-4 Part 13 revised an amendment to MPEG-4 Part 1 – ISO/IEC 14496-1:2001/Amd 3:2004. It specifies common Intellectual Property Management and Protection (IPMP) processing, syntax and semantics for the carriage of IPMP tools in the bit stream, IPMP information carriage, mutual authentication for IPMP tools, a list of registration authorities required for the support of the amended specifications (e.g. CISAC), etc. It was defined due to the lack of interoperability of different protection mechanisms (different DRM systems) for protecting and distributing copyrighted digital content such as music or video.[28][29][30][31][32][33][34][35][36]
Part 14 ISO/IEC 14496-14 2003 2003[37] (2010[38]) MP4 file format It is also known as "MPEG-4 file format version 2". The designated container file format for MPEG-4 content, which is based on Part 12. It revises and completely replaces Clause 13 of ISO/IEC 14496-1 (MPEG-4 Part 1: Systems), in which the MPEG-4 file format was previously specified.
Part 15 ISO/IEC 14496-15 2004 2010[39] 2008 (2010[40]) Advanced Video Coding (AVC) file format For storage of Part 10 video. File format is based on Part 12, but also allows storage in other file formats.
Part 16 ISO/IEC 14496-16 2004 2011[41] (2010[42]) Animation Framework eXtension (AFX) It specifies MPEG-4 Animation Framework eXtension (AFX) model for representing 3D Graphics content. MPEG-4 is extended with higher-level synthetic objects for specifying geometry, texture, animation and dedicated compression algorithms.
Part 17 ISO/IEC 14496-17 2006 2006[43] Streaming text format Timed Text subtitle format
Part 18 ISO/IEC 14496-18 2004 2004[44] Font compression and streaming For Open Font Format defined in Part 22.
Part 19 ISO/IEC 14496-19 2004 2004[45] Synthesized texture stream Synthesized texture streams are used for creation of very low bitrate synthetic video clips.
Part 20 ISO/IEC 14496-20 2006 2008[46] 2009 Lightweight Application Scene Representation (LASeR) and Simple Aggregation Format (SAF) LASeR requirements (compression efficiency, code and memory footprint) are fulfilled by building upon the existing the Scalable Vector Graphics (SVG) format defined by the World Wide Web Consortium.[47]
Part 21 ISO/IEC 14496-21 2006 2006[48] MPEG-J Graphics Framework eXtensions (GFX) Describes a lightweight programmatic environment for advanced interactive multimedia applications – a framework that marries a subset of the MPEG standard Java application environment (MPEG-J) with a Java API.[24][48][49][50] (at "FCD" stage in July 2005, FDIS January 2006, published as ISO standard on 2006-11-22).
Part 22 ISO/IEC 14496-22 2007 2009[51] Open Font Format OFFS is based on the OpenType version 1.4 font format specification, and is technically equivalent to that specification.[52][53] Reached "CD" stage in July 2005, published as ISO standard in 2007
Part 23 ISO/IEC 14496-23 2008 2008[54] Symbolic Music Representation (SMR) Reached "FCD" stage in October 2006, published as ISO standard in 2008-01-28
Part 24 ISO/IEC TR 14496-24 2008 2008[55] Audio and systems interaction Describes the desired joint behavior of MPEG-4 File Format and MPEG-4 Audio.
Part 25 ISO/IEC 14496-25 2009 2009[56] 3D Graphics Compression Model Defines a model for connecting 3D Graphics Compression tools defined in MPEG-4 standards to graphics primitives defined in any other standard or specification.
Part 26 ISO/IEC 14496-26 2010 2010[57] Audio Conformance
Part 27 ISO/IEC 14496-27 2009[58] (2010[59]) 3D Graphics conformance 3D Graphics Conformance summarizes the requirements, cross references them to characteristics, and defines how conformance with them can be tested. Guidelines are given on constructing tests to verify decoder conformance.
Part 28 ISO/IEC 14496-28 2012[60] Composite font representation
Part 29 ISO/IEC 14496-29 2014 Web video coding Text of Part 29 is derived from Part 10 - ISO/IEC 14496-10. Web video coding is a technology that is compatible with the Constrained Baseline Profile of ISO/IEC 14496-10 (the subset that is specified in Annex A for Constrained Baseline is a normative specification, while all remaining parts are informative).
Part 30 ISO/IEC 14496-30 2014 Timed text and other visual overlays in ISO base media file format It describes the carriage of some forms of timed text and subtitle streams in files based on ISO/IEC 14496-12 - W3C Timed Text Markup Language 1.0, W3C WebVTT (Web Video Text Tracks). The documentation of these forms does not preclude other definition of carriage of timed text or subtitles; see, for example, 3GPP Timed Text (3GPP TS 26.245).
Part 31 ISO/IEC DIS 14496-31 Under development Video Coding for Browsers Video Coding for Browsers (VCB) codec - a video compression technology that is intended for use within World Wide Web browser

Profiles are also defined within the individual "parts", so an implementation of a part is ordinarily not an implementation of an entire part.

MPEG-1, MPEG-2, MPEG-7 and MPEG-21 are other suites of MPEG standards.

MPEG-4 Levels

The low profile levels are part of the MPEG-4 video encoding/decoding constraints and are compatible with the older ITU H.261 standard, also compatible with former analog TV standards for broadcast and records (such as NTSC or PAL video). The ASP profile in its highest level is suitable for most usual DVD medias and players or for many online video sites, but not for Blu-ray records or online HD video contents.

Profile Level Max.
buffer
Max.
bitrate
Max.
delay
Max.
VP
size
Max.
VOP
size
Max.
decoder
rate
Max. framesize
@ max.
bitrate
@ 30 Hz @ 25 Hz @ 24 Hz @ 15 Hz @ 12.5 Hz
SP L0 160642.50 2,048991,485 QCIF (176×144)
L0b 320128
L1 16064 128×96144×96160×96
L2 6401285.00 4,0963965,940 256×192304×192, 288×208304×208CIF (352×288)
L3 3841.66 8,19211,880 CIF (352×288)
L4a 1,2804,0000.32 16,384 1,20036,000 VGA (640×480)
L5 1,792 8,0000.22 1,62040,500 D1 NTSC (720×480)D1 PAL (720×576)
L6 3,968 12,0000.33 3,600108,000 720p (1280x720)
ASP L0 1601281.25 2,048992,970 QCIF (176×144)
L1
L2 6403841.66 4,0963965,940 256×192304×192, 288×208304×208CIF (352×288)
L3 7680.86 11,880 CIF (352×288)
L3b 1,0401,5000.69
L4 1,2803,0000.43 8,19279223,760 352×576, 704×288
L5 1,7928,0000.22 16,3841,62048,600 720×576
Units kbits kbits/s seconds bits macroblocks macroblocks/s pixels

More advanced profiles for HD media have been defined later in the AVC profile, which is functionally identical to the ITU H.264 standard but are now also integrated in MPEG-4 Part 10 (see H.264/MPEG-4 AVC for the list of defined levels in this AVC profile).

Licensing

MPEG-4 contains patented technologies, the use of which requires licensing in countries that acknowledge software algorithm patents. Over two dozen companies claim to have patents covering MPEG-4. MPEG LA[61] licenses patents required for MPEG-4 Part 2 Visual from a wide range of companies (audio is licensed separately) and lists all of its licensors and licensees on the site. New licenses for MPEG-4 System patents are under development[62] and no new licenses are being offered while holders of its old MPEG-4 Systems license are still covered under the terms of that license for the patents listed (MPEG LA – Patent List).

AT&T is trying to sue companies such as Apple Inc. over alleged MPEG-4 patent infringement.[63] The terms of Apple's QuickTime 7 license for users[64] describes in paragraph 14 the terms under Apple's existing MPEG-4 System Patent Portfolio license from MPEG LA.

See also

References

  1. 1 2 3 4 5 RFC 3640, IETF, p.  31.
  2. 1 2 MPEG. "MPEG standards – Full list of standards developed or under development". Chiariglione. Archived from the original on 2010-04-20. Retrieved 2010-02-09.
  3. ISO/IEC JTC 1/SC 29 (2009-11-09). "Programme of Work – MPEG-4 (Coding of audio-visual objects)". Retrieved 2009-11-10.
  4. "ISO/IEC 14496-1:2010 – Information technology — Coding of audio-visual objects — Part 1: Systems". Retrieved 2011-01-28.
  5. ISO. "ISO/IEC 14496-1:2010/Amd 1:2010 – Usage of LASeR in MPEG-4 systems and Registration Authority for MPEG-4 descriptors". Retrieved 2011-01-28.
  6. ISO/IEC (2004-11-15), ISO/IEC 14496-1:2004 – Third edition 2004-11-15 – Information technology — Coding of audio-visual objects — Part 1: Systems (PDF), retrieved 2010-04-11
  7. WG11 (MPEG) (March 2002). "Overview of the MPEG-4 Standard". Retrieved 2010-04-11.
  8. WG11 (1997-11-21), Text for CD 14496-1 Systems (MS Word .doc), retrieved 2010-04-11
  9. "MPEG-4 Systems Elementary Stream Management (ESM)". July 2001. Retrieved 2010-04-11.
  10. "MPEG Systems (1-2-4-7) FAQ, Version 17.0". July 2001. Retrieved 2010-04-11.
  11. "ISO/IEC 14496-2:2004 – Information technology — Coding of audio-visual objects — Part 2: Visual". ISO. Retrieved 2009-10-30.
  12. "ISO/IEC 14496-3:2009 – Information technology — Coding of audio-visual objects — Part 3: Audio". ISO. Retrieved 2009-10-30.
  13. ISO (2009). "ISO/IEC 14496-3:2009/FPDAmd 2, ALS simple profile and transport of SAOC". ISO. Retrieved 2009-10-13.
  14. "ISO/IEC 14496-3:2009/Amd 1:2009 – HD-AAC profile and MPEG Surround signaling". ISO. 2009-09-11. Retrieved 2009-10-15.
  15. "ISO/IEC 14496-4:2004 – Information technology — Coding of audio-visual objects — Part 4: Conformance testing". ISO. Retrieved 2009-10-30.
  16. "ISO/IEC 14496-5:2001 – Information technology — Coding of audio-visual objects — Part 5: Reference software". ISO. Retrieved 2009-10-30.
  17. "ISO/IEC 14496-6:2000 – Information technology — Coding of audio-visual objects — Part 6: Delivery Multimedia Integration Framework (DMIF)". ISO. Retrieved 2009-10-30.
  18. "ISO/IEC TR 14496-7:2004 – Information technology — Coding of audio-visual objects — Part 7: Optimized reference software for coding of audio-visual objects". ISO. Retrieved 2009-10-30.
  19. "ISO/IEC 14496-8:2004 – Information technology — Coding of audio-visual objects — Part 8: Carriage of ISO/IEC 14496 contents over IP networks". ISO. Retrieved 2009-10-30.
  20. "ISO/IEC TR 14496-9:2009 – Information technology — Coding of audio-visual objects — Part 9: Reference hardware description". ISO. Retrieved 2009-10-30.
  21. "ISO/IEC 14496-10:2012 – Information technology — Coding of audio-visual objects — Part 10: Advanced Video Coding". ISO. Retrieved 2012-06-11.
  22. "ISO/IEC 14496-10:2009/FPDAmd 1 – Constrained baseline profile, stereo high profile and frame packing arrangement SEI message". ISO. Retrieved 2009-12-29.
  23. 1 2 "ISO/IEC 14496-11:2005 – Information technology — Coding of audio-visual objects — Part 11: Scene description and application engine". ISO. Retrieved 2009-10-30.
  24. 1 2 "MPEG-J White Paper". July 2005. Retrieved 2010-04-11.
  25. "ISO/IEC 14496-12:2012 – Information technology — Coding of audio-visual objects — Part 12: ISO base media file format". ISO. Retrieved 2014-01-19.
  26. ISO. "ISO/IEC 14496-12:2008/Amd 1:2009 – General improvements including hint tracks, metadata support and sample groups". Retrieved 2009-12-30.
  27. "ISO/IEC 14496-13:2004 – Information technology — Coding of audio-visual objects — Part 13: Intellectual Property Management and Protection (IPMP) extensions". ISO. Retrieved 2009-10-30.
  28. MPEG (March 2002), FPDAM ISO/IEC 14496-1:2001 / AMD3 (Final Proposed Draft Amendment) (MS Word .doc), retrieved 2010-08-01
  29. "MPEG-4 IPMPX white paper". MPEG. July 2005. Retrieved 2010-08-01.
  30. "MPEG Intellectual Property Management and Protection". MPEG. April 2009. Retrieved 2010-08-01.
  31. MPEG-4 IPMP Extension – For Interoperable Protection of Multimedia Content (PDF), 2004, retrieved 2010-08-01
  32. "MPEG Registration Authority – IPMP". MPEG RA International Agency (CISAC). Retrieved 2010-08-01.
  33. "MPEG RA – FAQ IPMP". MPEG RA International Agency (CISAC). Retrieved 2010-08-01.
  34. "Intellectual Property Management and Protection Registration Authority". CISAC. 2004-12-05. Archived from the original on 2004-12-05. Retrieved 2010-08-01.
  35. Chiariglione, Leonardo (2003), Digital media: Can content, business and users coexist?, Torino, IT: Telecom Italia Lab, retrieved 2010-08-01
  36. IPMP in MPEG – W3C DRM workshop 22/23 January 2001 (PPT), retrieved 2010-08-01
  37. ISO. "ISO/IEC 14496-14:2003 – Information technology — Coding of audio-visual objects — Part 14: MP4 file format". Retrieved 2009-10-30.
  38. "ISO/IEC 14496-14:2003/FPDAmd 1 – Handling of MPEG-4 audio enhancement layers". ISO. Retrieved 2009-12-29.
  39. "ISO/IEC 14496-15:2004 – Information technology — Coding of audio-visual objects — Part 15: Advanced Video Coding (AVC) file format". ISO. Retrieved 2009-10-30.
  40. "ISO/IEC 14496-15:2004/FDAmd 3 – File format support for Multiview Video Coding". ISO. Retrieved 2009-12-29.
  41. "ISO/IEC 14496-16:2009 – Information technology — Coding of audio-visual objects — Part 16: Animation Framework eXtension (AFX)". ISO. Retrieved 2009-12-29.
  42. "ISO/IEC 14496-16:2009/FPDAmd 1 – Scalable complexity 3D mesh coding". Retrieved 2009-12-29.
  43. "ISO/IEC 14496-17:2006 – Information technology — Coding of audio-visual objects — Part 17: Streaming text format". ISO. Retrieved 2009-10-30.
  44. "ISO/IEC 14496-18:2004 – Information technology — Coding of audio-visual objects — Part 18: Font compression and streaming". ISO. Retrieved 2009-10-30.
  45. "ISO/IEC 14496-19:2004 – Information technology – Coding of audio-visual objects — Part 19: Synthesized texture stream". ISO. Retrieved 2009-10-30.
  46. "ISO/IEC 14496-20:2008 – Information technology — Coding of audio-visual objects — Part 20: Lightweight Application Scene Representation (LASeR) and Simple Aggregation Format (SAF)". ISO. Retrieved 2009-10-30.
  47. "MPEG-4 LASeR white paper". July 2005. Retrieved 2010-04-11.
  48. 1 2 "ISO/IEC 14496-21:2006 – Information technology — Coding of audio-visual objects — Part 21: MPEG-J Graphics Framework eXtensions (GFX)". ISO. Retrieved 2009-10-30.
  49. "MPEG-4 Systems MPEG-J". July 2001. Retrieved 2010-04-11.
  50. "MPEG-J GFX white paper". July 2005. Retrieved 2010-04-11.
  51. "ISO/IEC 14496-22:2009 – Information technology — Coding of audio-visual objects — Part 22: Open Font Format". ISO. Retrieved 2009-10-30.
  52. ISO/IEC JTC 1/SC 29/WG 11 (July 2008). "ISO/IEC 14496-22 "Open Font Format"". Chiariglione. Retrieved 2010-02-09.
  53. "ISO/IEC 14496-22 Information technology — Coding of audio-visual objects — Part 22: Open Font Format" (Zip) (first ed.). 2007-03-15. Retrieved 2010-01-28.
  54. "ISO/IEC 14496-23:2008 – Information technology — Coding of audio-visual objects — Part 23: Symbolic Music Representation". ISO. Retrieved 2009-10-30.
  55. "ISO/IEC TR 14496-24:2008 – Information technology — Coding of audio-visual objects — Part 24: Audio and systems interaction". ISO. Retrieved 2009-10-30.
  56. "ISO/IEC 14496-25:2009 – Information technology — Coding of audio-visual objects — Part 25: 3D Graphics Compression Model". ISO. Retrieved 2009-10-30.
  57. "ISO/IEC 14496-26:2010 – Information technology — Coding of audio-visual objects — Part 26: Audio conformance". ISO. Retrieved 2010-05-16.
  58. "ISO/IEC 14496-27:2009 – Information technology — Coding of audio-visual objects — Part 27: 3D Graphics conformance". ISO. Retrieved 2009-12-29.
  59. ISO. "ISO/IEC 14496-27:2009/FPDAmd 2 – Handling of MPEG-4 audio enhancement layers". Retrieved 2009-12-29.
  60. "ISO/IEC CD 14496-28 – Information technology — Coding of audio-visual objects — Part 28: Composite font representation". ISO. Retrieved 2011-04-06.
  61. MPEG Licensing Authority – MPEG-4 Visual: Introduction
  62. MPEG Licensing Authority – MPEG-4 Systems: Introduction
  63. "AT&T Warns Apple, Others, Of Patent Infringement". PC Magazine. February 9, 2006. Retrieved 2007-08-10.
  64. Apple QuickTime 7 Software License (PDF)

External links

This article is issued from Wikipedia - version of the 11/29/2016. The text is available under the Creative Commons Attribution/Share Alike but additional terms may apply for the media files.