Riding the Media Bits | chiariglione.org

Inside MPEG-21

Last update: 2008/06/11

The technologies inside the MPEG-21 framework.

Part 1 of the MPEG-21 standard is titled "Vision, Technologies and Strategy". It is not a standard but a Technical Report, because it only describes part of the content of MPEG-21.

Digital Item Declaration (DID) is part 2. This normative part defines the technology supporting Digital Items (DIs). The purpose of the DID standard is to describe a set of abstract terms and concepts that form a useful model for defining DIs. The Digital Item Declaration Language (DIDL) is an XML language that provides the standard XML representation of a DI.

Every transaction needs a means to identify its object. That is why part 3 of MPEG-21, "Digital Item Identification" (DII), is a standard to uniquely identify DIs. It plays much the same role as the International Standard Book Number (ISBN) does for books and the International Standard Serial Number (ISSN) does for periodicals. Its scope includes:
Part 4 of MPEG-21 is titled "Intellectual Property Management and Protection" (IPMP). It defines a standard IPMP framework. A related piece of work, MPEG-4 IPMP-eXtensions (IPMP-X), was started in 1999 as part of MPEG-4 and completed in 2002. IPMP-X defines standard ways of retrieving IPMP tools from remote locations, authenticating IPMP tools, and exchanging messages between the tools used to protect a piece of content and a terminal that needs to process (e.g. decrypt, decode, present) that content. The same technology has been applied to MPEG-2, where it has become part 11.

Even in the physical world we seldom have absolute rights to an object. In the virtual world, where the disembodiment of content from carriage increases the flexibility with which business can be conducted, this trend is likely to continue. That is why part 5 of MPEG-21, Rights Expression Language (REL), has been developed: so that rights to a DI can be expressed in a way that a computer can interpret. The MPEG REL data model for a rights expression consists of four basic entities and the relationship among them. This basic relationship is defined by the MPEG REL assertion “grant”. Structurally, an MPEG REL grant consists of the following:
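The four entities of a grant in the MPEG REL data model are the principal, the right, the resource and the condition. The following is a minimal sketch of that relationship in Python, not the normative XML syntax of the REL. The class name `Grant`, the function `is_permitted`, the identifiers and the play-count condition are all hypothetical and chosen only for illustration.

```python
from dataclasses import dataclass
from typing import Callable, Dict


@dataclass(frozen=True)
class Grant:
    """Illustrative model of a grant: who may do what to which resource, and when."""
    principal: str                       # the party the grant is issued to
    right: str                           # the verb, e.g. "play" or "print"
    resource: str                        # identifier of the DI or resource
    condition: Callable[[Dict], bool]    # must hold before the right is exercised


def is_permitted(grant: Grant, principal: str, right: str,
                 resource: str, context: Dict) -> bool:
    """Return True only if all four entities of the grant match the request."""
    return (
        grant.principal == principal
        and grant.right == right
        and grant.resource == resource
        and grant.condition(context)
    )


# Hypothetical grant: "alice" may play one album at most three times.
grant = Grant(
    principal="alice",
    right="play",
    resource="urn:example:album:42",
    condition=lambda ctx: ctx.get("plays_so_far", 0) < 3,
)

allowed = is_permitted(grant, "alice", "play", "urn:example:album:42",
                       {"plays_so_far": 1})
wrong_right = is_permitted(grant, "alice", "print", "urn:example:album:42",
                           {"plays_so_far": 1})
condition_failed = is_permitted(grant, "alice", "play", "urn:example:album:42",
                                {"plays_so_far": 3})
```

In this sketch a request is authorised only when the principal, right and resource all match and the condition holds, which is why both the "print" request and the fourth play are refused.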
Figure: The MPEG-21 REL model

A right exists to perform actions on something. Today we use verbs such as "display", "print", "copy" or "store" and, in a given context, we humans know what we mean. Computers do not, and must be taught the meaning. That is why part 6 of MPEG-21 is a Rights Data Dictionary (RDD), which gives the precise semantics of all the verbs used in the REL, along with many more.

ICT lets people do more than just find new ways of doing old business. Content and service providers used to know their customers very well. They used to know, even control, the means through which their content was delivered. Consumers used to know the meaning of well-classified services such as television, movies and music. Today such certainties are fewer and fewer: end users are less and less predictable, and the same piece of content can reach them through a variety of delivery systems and be enjoyed on a plethora of widely differing consumption devices.

How can we cope with this unpredictability of end-user features, delivery systems and consumption devices? This is where Digital Item Adaptation (DIA), part 7 of MPEG-21, helps: it provides the means to describe how a DI should be adapted (i.e. transformed) so that it best matches the specific features of the User, the Network and the Device.
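One very simple form of adaptation is to pick, among several pre-encoded variations of a resource, the one that best fits a description of the device and network. The sketch below illustrates only that idea. It does not use the description formats defined in part 7, and the names `Variation`, `UsageEnvironment` and `select_variation`, as well as the URIs and figures, are assumptions made for the example.

```python
from dataclasses import dataclass
from typing import List, Optional


@dataclass(frozen=True)
class Variation:
    """One pre-encoded version of a video resource (hypothetical model)."""
    uri: str
    width: int
    height: int
    bitrate_kbps: int


@dataclass(frozen=True)
class UsageEnvironment:
    """Simplified description of the device display and the network."""
    max_width: int
    max_height: int
    bandwidth_kbps: int


def select_variation(variations: List[Variation],
                     env: UsageEnvironment) -> Optional[Variation]:
    """Return the highest-bitrate variation that fits the device and network."""
    fitting = [
        v for v in variations
        if v.width <= env.max_width
        and v.height <= env.max_height
        and v.bitrate_kbps <= env.bandwidth_kbps
    ]
    # default=None covers the case where nothing fits the constraints.
    return max(fitting, key=lambda v: v.bitrate_kbps, default=None)


variations = [
    Variation("urn:example:clip:1080p", 1920, 1080, 6000),
    Variation("urn:example:clip:720p", 1280, 720, 2500),
    Variation("urn:example:clip:360p", 640, 360, 800),
]

# A phone with a 720p screen on a 1500 kbit/s link.
phone = UsageEnvironment(max_width=1280, max_height=720, bandwidth_kbps=1500)
chosen = select_variation(variations, phone)

# A link too slow for any variation.
too_slow = UsageEnvironment(max_width=1920, max_height=1080, bandwidth_kbps=100)
nothing = select_variation(variations, too_slow)
```

Here the 720p variation fits the phone's screen but not its bandwidth, so the 360p variation is selected. A real adaptation engine may also transform the resource and its descriptors rather than only choose among existing versions.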
As shown in the figure, a DI is processed by a "resource adaptation engine" and a "descriptor adaptation engine". Together the two (non-normative) engines produce the adapted Digital Item based on the Digital Item Adaptation Description, as specified by part 7 of MPEG-21.

Part 8 has been assigned to Reference Software.

Part 9 is the MPEG-21 File Format. A DI may be a complex collection of information, both textual and binary, containing still and dynamic media as well as information related to the DI such as metadata and layout information. The MPEG-21 file format therefore inherits several concepts from MP4 in order to make 'multi-purpose' files possible. A dual-purpose MP4 and MP21 file would, for example, play just the MPEG-4 data on an MP4 player and the MPEG-21 data on an MP21 player.

A DID is a static declaration defined using the DIDL. Digital Item Methods (DIMs), defined in part 10 "Digital Item Processing" (DIP), allow Users (authors, publishers, distributors, etc.) to add functionality to a DID, such as specifying, at the level of the DI itself, a selection of preferred procedures by which the DI should be handled. On receipt of a DID, a list of DIMs that can be applied to the DI is presented to the User. The User chooses one DIM, which is then executed by the DIP Engine. For a music album DI, for example, an "AddTrack" DIM might be provided so that a user can add a new track in the preferred format.

Returning to part 3: getting an identifier for a DI is important, but how do we put a "virtual sticker" on it to carry the identification? This is where Persistent Association Technologies come in. SDMI struggled with the selection of very advanced "Phase I" and "Phase II" screening technologies, and its task was made harder by the fact that no established methods existed to assess the performance of these technologies. That is why MPEG-21 contains part 11, "Evaluation Methods for Persistent Association Technologies". It is not a normative standard but a Technical Report (TR), i.e. more like "best practice" for those who need to assess the performance of watermarking and related technologies.

Part 12 is called "Test Bed for MPEG-21 Resource Delivery". It is a comprehensive environment for testing the effect of different conditions on the delivery of media resources.

During the long study period that eventually led to the acquisition of the technologies required to develop a scalable video coding standard, it was thought that this form of video coding would require novel technologies that would not fit in the MPEG-4 standard in the way that, e.g., Advanced Video Coding (AVC) does. Part 13, Scalable Video Coding (SVC), was originally intended to host such a standard. However, when it became clear that SVC would be an extension of AVC, the standard was moved to MPEG-4 part 10 as an amendment to AVC, and MPEG-21 Part 13 became void.

Conformance of an implementation is of course needed for MPEG-21 technologies as well. The purpose of Part 14, Conformance, is therefore to provide the test methodologies and suites needed to assess the conformity of a bitstream (typically an XML document) and a decoder (typically a parser) to the relevant MPEG-21 standard.

Certain application domains require a technology that can generate an event every time a Digital Item is processed. This technology is specified in Part 15, Event Reporting (ER).
Figure: The MPEG-21 ER model

An Event Report Request (ERR) is placed in a DI by a Creator. When the DI is received, the ERR is passed to an ERR Receiver and parsed. An Event Receiver senses all internal and external events and passes them to an ER Builder, which creates a message and dispatches it to the address indicated in the ERR.

For some years, starting with MPEG-7, MPEG has had a standard technology for the lossless conversion of a typically very bulky XML document to a binary format that can still be parsed efficiently. The technology was originally part of MPEG-7 Systems but has now been moved to MPEG-B Part 1, Binary XML format (BiM). BiM is now only referenced in MPEG-7 Part 1 Systems and MPEG-21 Part 16 Binary format.

There are cases where it is necessary to identify a specific fragment of a resource rather than the entire set of data. Part 17, Fragment Identification (FID), specifies a normative syntax for URI Fragment Identifiers to be used for addressing parts of a resource from a number of Internet Media Types.

Part 9 provides a solution for transporting a Digital Item in a file. However, Digital Items may also be transported over a streaming mechanism (e.g. in broadcasting or over IP networks). Part 18, Digital Item Streaming (DIS), provides the technology to achieve this for two streaming mechanisms: MPEG-2 Transport Stream and RTP/UDP/IP.
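The Event Reporting flow described earlier in this section (an ERR carried in a DI, parsed on receipt, and an ER Builder that dispatches a message to the address named in the ERR) can be sketched roughly as follows. This is an illustration only and does not reflect the XML formats of Part 15. The classes `EventReportRequest`, `EventReport` and `ERBuilder`, their fields, and the example identifiers are hypothetical, and dispatch is simulated by appending to a list.

```python
from dataclasses import dataclass
from typing import List, Optional


@dataclass(frozen=True)
class EventReportRequest:
    """What the Creator asks for: which event on which DI, reported to whom."""
    item_id: str
    event_type: str
    recipient: str


@dataclass(frozen=True)
class EventReport:
    """The message built when a requested event is observed."""
    item_id: str
    event_type: str
    recipient: str
    peer_id: str


class ERBuilder:
    """Builds a report for each observed event that matches the request."""

    def __init__(self, request: EventReportRequest) -> None:
        self.request = request
        self.outbox: List[EventReport] = []  # stand-in for real dispatch

    def on_event(self, event_type: str, peer_id: str) -> Optional[EventReport]:
        # Events the ERR did not ask about are ignored.
        if event_type != self.request.event_type:
            return None
        report = EventReport(
            item_id=self.request.item_id,
            event_type=event_type,
            recipient=self.request.recipient,
            peer_id=peer_id,
        )
        self.outbox.append(report)
        return report


# Hypothetical request: report every "play" of one DI to one address.
request = EventReportRequest(
    item_id="urn:example:di:1",
    event_type="play",
    recipient="mailto:reports@example.org",
)
builder = ERBuilder(request)

reported = builder.on_event("play", peer_id="device-7")
ignored = builder.on_event("copy", peer_id="device-7")
```

Only the "play" event produces a report in this sketch, and the report is addressed to the recipient taken from the request.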

Copyright © 2003 chiariglione.org