Who we are
- Achievements
- Terms of Reference
- Work plan
- Meetings
Hot news
MPEG Life
- Events
- An MPEG meeting
- Ad-hoc groups
- Patents
- Guide to hosts
Documents
- Standards
- Technologies
- Performance tests
- Tutorials
- Working documents
How to join
For the Media
FAQ
MPEG books
Liaisons
Links
Pictures
Contact point
Back to home |
|
|
|
1
Media coding
1.1 MPEG-2 Main Profile Level for 1080@50/60p
Std |
Pt |
Amd |
Req |
Short description |
2 |
2 |
3 |
|
High resolution and high frame rate video formats, e.g. 1080@50p/60p, surpassing current HDTV have been developed for various applications and are emerging in the industry. However the max video frame size and frame rate are limited by the level definitions in the current MPEG standards. The purpose of this amendment of ISO/IEC 13818-2 is to define new levels to support high resolution video formats. |
Std |
Pt |
Amd |
Req |
Short description |
4 |
2 |
5 |
|
High resolution and high frame rate video formats, e.g. 4Kx2K, surpassing current HDTV have been developed for various applications and are emerging in the industry. The purpose of this amendment of ISO/IEC 14496 is to define new levels to support high resolution video formats. |
Std |
Pt |
Amd |
Req |
Short description |
4 |
3 |
9 |
|
AAC-ELD uses recent advances in filterbank design to combine the AAC-LD codec with SBR technology, but with revised filterbank designs. The resulting codec, AAC-ELD, has low throughput delay with enhanced compression efficiency. |
1.4 Multiview Video Coding (MVC)
Std |
Pt |
Amd |
Req |
Short description |
4 |
10 |
4 |
|
Standard for coding of multiview video, i.e. multiple synchronized video streams that show the same scene from different viewpoints. This is necessary to enable systems and applications for free viewpoint video and 3D video (e.g. m-view displays), where two or more views need to be transmitted simultaneously. The goal is to allow decoding of multiple views and to achieve better compression than for the case of simulcasting these views. |
1.5 Frame-based Animated Mesh Compression
Std |
Pt |
Amd |
Req |
Short description |
4 |
16 |
2 |
|
FAMC is a tool to compress an animated mesh by encoding on a time basis the geometry (position) and attributes (normals, colors …) of vertices composing the mesh. FAMC is independent on the manner how animation is obtained (deformation or rigid motion), encoding directly the results of the animation. |
Std |
Pt |
Amd |
Req |
Short description |
4 |
16 |
3 |
|
This standard defines three profiles: one for Graphics dimension (called Basic AFX Graphics), one for Scene Graph dimension (called Basic AFX Scene Graph) and one for compression dimension (called 3D MultiResolution Compression). The combination of the three profiles allows progressive and adaptive transmission over networks of large 3D environments and/or complex 3D shapes. |
1.7
Video Tool Library
Std |
Pt |
Amd |
Req |
Short description |
C |
4 |
|
|
- A repository of video coding tools used in MPEG Video standards
- The series of MPEG video coding standards with indication of which coding tools defined in 1. they use
- Combination of tools for existing standards (if applicable)
- Possible case: intraframe only profile(s)
|
1.8 Spatial Audio Object Coding
Std |
Pt |
Amd |
Req |
Short description |
D |
1 |
1 |
|
Spatial Audio Object Coding represents several audio objects by first combining the object signals into a mono or stereo signal, whilst extracting parameters from the individual object signals based on knowledge of human perception of the sound stage. These parameters are coded as a low bitrate side-channel that the decoder uses to render an audio scene from the stereo or mono down-mix such that the aspects of the output composition can be decided at the time of decoding. |
1.9 3D Video Coding
Std |
Pt |
Amd |
Req |
Short description |
| |
|
|
|
3D video (3DV) supports new types of audio-visual systems that allow users to view videos of the real 3D space from different user viewpoints. In an advanced application of 3DV, denoted as Free-viewpoinT Video (FTV), a user can set the viewpoint to an almost arbitrary location and direction, which can be static, change abruptly, or vary continuously, within the limits that are given by the available camera setup. Similarly, the audio listening point is changed accordingly. The first phase of 3DV development is expected to support advanced 3D displays, where M dense views must be generated from a sparse set of K transmitted views (typically K£3) with associated depth data. The allowable range of view synthesis will be relatively narrow (20 degrees view angle from leftmost to rightmost view). |
1.10 Unified Speech and Audio Coding
Std |
Pt |
Amd |
Req |
Short description |
| |
|
|
|
This exploration seeks technology that would permit a single, unified coder with performance that approaches that of speech coders for speech signals and that of music coders for music signals. In addition, this technology should have a bitstream representation that is scalable. |
1.11 Ontology
Std |
Pt |
Amd |
Req |
Short description |
| |
|
|
|
xxx |
2.1 Lightweight Scene Representation
Std |
Pt |
Amd |
Req |
Short description |
4 |
20 |
1 |
6499 |
This amendment adresses the following topics:
- Full compatibility with SVGT1.2,
- new events for streaming and broadcasting sessions,
- scrolling capabilities,
- support of Micro-DOM,
- code points allowing for the signaling of alternative encoding schemes such as XML and GZIP,
- improved stream management for better stream switching,
- improved management of encoded ID,
- extended SAF packets size to support media types such as 3GPP2 CMF (Compact Multimedia Format),
- multiple scene streams and sharing of streams between SAF sessions as well as animations connected to media time lines/external parameters.
|
3 Description coding
3.1 Image Signature Tools
Std |
Pt |
Amd |
Req |
Short description |
7 |
3 |
4 |
|
Video signature tools support ultra-fast search for and identification of videos and their modified/edited versions or fragments. They are designed to be robust to a range of deformations, such as coding artifacts, blurring, colour-to-monochrome conversion, trans-coding and and frame-rate change. Applications include:
- Video-based content searching and linking,
- Database de-duplication and
- Content Rights Management and Usage Monitoring.
|
3.3 Improvements to Geographic Descriptor
Std |
Pt |
Amd |
Req |
Short description |
7 |
5 |
3 |
4.4.2 - 14 |
At present ISO/IEC 15938-5 defines that one ‘Point’ of GeographicPointType can be associated with a GeographicPosition and does not define the GeographicalPosition Point details. This proposed amendment will allow more than one ‘Point’ of GeographicPointType to be associated with a GeographicPosition. It shall also define the ‘type’ of GeographicPosition Point details. |
3.4 MPEG-7 Query Format
Std |
Pt |
Amd |
Req |
Short description |
7 |
12 |
|
|
While the current MPEG-7 standard provides tools to describe multimedia content, the interface to support queries in a MPEG-7 database is not yet defined. Because standardized interfaces are not defined, each MPEG-7 database offers its own query interface, which prevents clients experiencing aggregated services from various MPEG-7 databases Providing a standardized interface to MPEG-7 databases will allow the users to describe their search criteria with a set of precise input parameters and give a set of preferred output parameters to describe the return result sets. In addition, with query management tools, users can refine their queries for specific content. The goal of this work on a query format is to provide the industry with a unified/standardized way to accept and respond to user requests for multimedia content searches. |
5 IPMP
5.1 REL OAC (Open Release Content) Profile
Std |
Pt |
Amd |
Req |
Short description |
21 |
5 |
3 |
|
A profile of the MPEG-21 REL, suitable for applications that distribute and use content intended for the public domain and open release. It supports many usage models for controlling commercial use, adaptation and derivative works. The profile is compatible with the rights, terms and conditions expressed in the licenses defined by the Creative Commons. |
6 Digital Item
6.1 Schema files for MPEG-21 standards
Std |
Pt |
Amd |
Req |
Short description |
21 |
|
|
|
Contains all schemas of the relevant MPEG-21 parts |
6.2 Security in Event Reporting
Std |
Pt |
Amd |
Req |
Short description |
21 |
14 |
1 |
9180 |
xxx |
7.1 Carriage of SVC in MPEG-2 Systems
Std |
Pt |
Amd |
Req |
Short description |
2 |
1 |
3 |
9273 |
Specifies the transport format for ISO/IEC 14496-10:200x/AMD1 Scalable Video Coding (SVC) video streams in MPEG-2 Systems. |
7.2 AVC File Format extensions for SVC
Std |
Pt |
Amd |
Req |
Short description |
4 |
15 |
2 |
|
Specifies the storage format for ISO/IEC 14496-10:200x/AMD1 Scalable Video Coding (SVC) video streams as an amendment to the AVC File Format, the specification for storage of Advanced Video Coding (AVC) streams. The storage of SVC content uses the existing capabilities of the ISO Base Media File Format and the AVC File Format but also defines new extensions to support the following features of the SVC codec:
- Scalable Grouping: A structuring and grouping mechanism for the dependencies that exist in a group of pictures and within each sample to obtain a flexible stream structure that provides scalability such as spatial, temporal and quality scalability.
- AVC Compatibility: A provision for storing in an AVC compatible manner, such that the AVC compatible base layer can be used by any existing AVC FF compliant reader.
|
7.3
AVC File Format extensions for MVC
Std |
Pt |
Amd |
Req |
Short description |
4 |
15 |
|
|
xxx |
Std |
Pt |
Amd |
Req |
Short description |
21 |
9 |
1 |
|
xxx |
8 Multimedia architecture
8.1 Codec Configuration Representation
Std |
Pt |
Amd |
Req |
Short description |
B |
4 |
|
|
CCR provides a description of media decoder configuration, which contains syntax parsing and decoding process. Now, the design of CCR is mainly done in video coding, but may be applicable to audio and other media coding. Using CCR and a toolbox containing functional units (FUs), a decoding solution may be implemented. CCR may support the toolbox with MPEG tools as well as the toolbox with nonMPEG tools. |
8.2 3D Graphics Compression Model
Std |
Pt |
Amd |
Req |
Short description |
4 |
25 |
|
|
The goal of this standard is to specify an architectural model able to accommodate (1) third-party XML based description of scene graph and graphics primitives with (2) (potential) binarisation tools and with (3) MPEG-4 3D Graphics Compression tools specified in ISO/IEC 14496-2, ISO/IEC 14496-11 and ISO/IEC 14496-16.
The advantages of such approach are on one side the use of powerful compression tools for graphics and one other side the generality of graphics primitives representation. Hence, compression tools developed in ISO/IEC 14496-2, ISO/IEC 14496-11 and ISO/IEC 14496-16 would not be applied only to the scene graph defined by ISO/IEC 14496-11 but to any scene graph definition. The bitstreams obtained when using the model are MP4 formatted and contains XML (or binarized XML) for the scene graph and binary elementary streams for graphics compression (geometry, texture and animation). |
8.3 Interfaces between virtual and real worlds
Std |
Pt |
Amd |
Req |
Short description |
V |
1 |
|
|
This project will provide a standardized framework that enables the interoperability between virtual worlds (as for example IMVU, Google Earth and many others) and the real world (sensors, actuators, social and welfare systems, banking, insurance, travel, real estate and many others). Virtual Worlds are 3D space where people can work, interact, play, travel, learn and augment real life. The broad acceptance of virtual words (social networks, serious games,…) and the relation to the real world (devices and real world networks) brings emerging (business) opportunities and challenges; the resulting interoperability will enable a new and sustainable industry. It is foreseen that this project will enable the start of the next revolution of the internet and related technologies, which will become a major source of information, services, education and entertainment and associated business in the ‘digital society’. |
Std |
Pt |
Amd |
Req |
Short description |
| |
|
|
|
xxx |
Std |
Pt |
Amd |
Req |
Short description |
| |
|
|
|
xxxOlivier |
9.1
Protected Musical Slide Show Application Format
Std |
Pt |
Amd |
Req |
Short description |
A |
4 |
2nd Ed. |
|
xxxOlivier |
9.2 Professional Archival MAF
Std |
Pt |
Amd |
Req |
Short description |
A |
6 |
|
|
The Professional Archival Multimedia Application Format is a cross-platform inter change format for audio/multimedia projects or sets of any kind of files.The Professional Archival MAF offers a standardised solution for packaging multimedia information and related data based on the hierarchical structure of files. It facilitates simple and fully interoperable exchange across different devices and platforms. |
Std |
Pt |
Amd |
Req |
Short description |
A |
7 |
|
|
The Open Release MAF is a packaging format designed for the release and exchange of contents. It packages different contents into a single container file and provides a mechanism to attach meta-data information, by using MPEG-7 and MPEG-21 technologies. The MPEG-21 REL is used to model the intentions of the license. MPEG-21 Event Reporting provides a feedback mechanism, which can notify the author, when a user wants to derive a content or extract an item out of the container file. |
9.4 Portable Video Player Application Format
Std |
Pt |
Amd |
Req |
Short description |
A |
8 |
|
|
Portable video player MAF is a standard that specifies the file structure and technical components of a multimedia application format designed for a mid-resolution “DVD-style” video application.
The format provides the overall structure for storing video contents, subtitles, metadata, and description for the user interface for accessing multiple video tracks, in a single file. The format allows the users to create and playback personalized video contents (e.g. UCC videos), and consume video contents authored and distributed on media disks or over the internet by professionals and service providers. |
9.5 Video Surveillance MAF
Std |
Pt |
Amd |
Req |
Short description |
A |
10 |
|
|
The Video Surveillance MAF is a packaging format designed for the storage of video content originating from surveillance cameras. It packages the video content together with associated metadata by using the AVC file format. The MPEG-4 AVC video codec is being used to handle the video content whereas for the metadata MPEG-7 descriptions are being deployed. |
Std |
Pt |
Amd |
Req |
Short description |
A |
|
|
|
Stereoscopic content provides natural three-dimensional scenes which are displayed using solved acquisition/generator techniques. The market for applying stereoscopic contents on respective devices is fully formed and is ready for deployment. Consequently, there is a need for standard data for acquisition/creation, storage, transmission and playing of stereoscopic contents. The Stereoscopic MAF addresses both stereoscopic images and videos (with associated audio), and should also include some protection capabilities. |
10.1 Symbolic Music Representation Reference Software
Std |
Pt |
Amd |
Req |
Short description |
4 |
5 |
16 |
|
Reference software that implements the following modules:
- 1. A music editor able to create conformant SMR XML descriptions as well as *.mp4 files with SMR bitstream (uses BiM reference software for inary encoding of XML descriptions).
- 2. A music viewer able to load *.mp4 files containing SMR bitstreams, permitting a user to visualize the music score, play it as MIDI, transpose it, etc.
- 3. An enhanced MPEG-4 IM1 player that implements the BIFS nodes that support SMR data as defined in 14496-11/AMD 5.
- 4. An SMR Decoder for MPEG-4 IM1 that is able to decode an SMR bitstream and to produce a visual representation (e.g. score).
|
10.2 MPEG-1 and -2 on MPEG-4 Reference Software
Std |
Pt |
Amd |
Req |
Short description |
4 |
5 |
20 |
|
This software builds upon existing MPEG-1/2 reference software by adding software modules to re-format MPEG-1/2 bitstreams (created by MPEG-1/2 reference software) into a series of MPEG-4 Access Units as required by an MPEG-4 File Format. Additional modules provide the inverse functionality, that being to produce an MPEG-1/2 bitstream that can be decoded by an MPEG-1/2 reference software decoder. |
|