INTERNATIONAL ORGANISATION FOR STANDARDISATION
ORGANISATION INTERNATIONALE DE NORMALISATION
ISO/IEC JTC1/SC29/WG11
CODING OF MOVING PICTURES AND AUDIO

 

ISO/IEC JTC1/SC29/WG11N
March 2000

 

MPEG-7

Frequently Asked Questions

 

1. What is MPEG-7?

MPEG-7 will be a standardized description of various types of multimedia information. This description will be associated with the content itself, to allow fast and efficient searching for material that is of interest to the user. MPEG-7 is formally called ‘Multimedia Content Description Interface’.

The standard does not comprise the (automatic) extraction of descriptions/features. Nor does it specify the search engine (or any other program) that can make use of the description.

 

 

 

Figure 1: Scope of MPEG-7

 

In other words: MPEG-7 will specify a standard set of descriptors that can be used to describe various types of multimedia information. MPEG-7 will also standardise ways to define other descriptors as well as structures (Description Schemes) for the descriptors and their relationships. This description (i.e. the combination of descriptors and description schemes) shall be associated with the content itself, to allow fast and efficient searching for material of a user’s interest. MPEG-7 will also standardise a language to specify description schemes, i.e. a Description Definition Language (DDL). AV material that has MPEG-7 data associated with it, can be indexed and searched for. This ‘material’ may include: still pictures, graphics, 3D models, audio, speech, video, and information about how these elements are combined in a multimedia presentation (‘scenarios’, composition information). Special cases of these general data types may include facial expressions and personal characteristics.

2. From whom or where did the demand for MPEG-7 come?

The demand logically follows the increasing availability of digital audiovisual content. MPEG members recognized this demand, and initiated a new work item. There are many applications and application domains which will benefit from the MPEG-7 standard. A few application examples are:

The potential applications are spread over the following application domains:

3. Why is MPEG-7 needed?

Nowadays, more and more audiovisual information is available, from many sources around the world. Also, there are people who want to use this audiovisual information for various purposes. However, before the information can be used, it must be located. At the same time, the increasing availability of potentially interesting material makes this search more difficult. This challenging situation led to the need of a solution to the problem of quickly and efficiently searching for various types of multimedia material interesting to the user. MPEG-7 wants to answer to this need, providing this solution.

The way MPEG-7 data will be used to answer user queries is outside the scope of the standard. In principle, any type of AV material may be retrieved by means of any type of query material. This means, for example, that video material may be queried using video, music, speech, etc. It is to the search engine to match the query data and the MPEG-7 AV description. A few query examples are:

  1. Music
    Play a few notes on a keyboard and get in return a list of musical pieces containing (or close to) the required tune or images somehow matching the notes, e.g. in terms of emotions.
  2. Graphics
    Draw a few lines on a screen and get in return a set of images containing similar graphics, logos, ideograms,...
  3. Image
    Define objects, including colour patches or textures and get in return examples among which you select the interesting objects to compose your image.
  4. Movement
    On a given set of objects, describe movements and relations between objects and get in return a list of animations fulfilling the described temporal and spatial relations.
  5. Scenario
    On a given content, describe actions and get a list of scenarios where similar actions happen.
  6. Voice
    Using an excerpt of Pavarotti’s voice, and getting a list of Pavarotti’s records, video clips where Pavarotti is singing or video clips where Pavarotti is present.

4. Who is currently participating in the development of the MPEG-7 standard?

The people taking part in defining MPEG-7 represent broadcasters, equipment manufacturers, digital content creators and managers, transmission providers, publishers and intellectual property rights managers, as well as university researchers.

5. Where are you in the process of specifying the MPEG-7 standard?

We are in the phase of defining the scope of the standard and its requirements, and the ideas are likely to evolve considerably. Much is still open to input from interested parties. The work plan is as follows:

Working Draft December 1999
Committee Draft October 2000
Draft International Standard July 2001
International Standard September 2001

6. Will MPEG-7 include audio or video content recognition?

The standardization of audiovisual content recognition tools is beyond the scope of MPEG-7. Following its principle ‘specifying the minimum for maximum usability, MPEG-7 will concentrate on standardizing a representation that can be used for description. Development of audiovisual content recognition tools will be a task for industries which will build and sell MPEG-7 enabled products.

In developing the standard, however, MPEG might build some coding tools, just as it did with the predecessors of MPEG-7, namely MPEG-1, -2 and -4. Also for these standards, coding tools were built for research purposes, but they did not become part of the standard itself.

7. Will MPEG-7 support audio or video content retrieval?

In the same way that MPEG will not standardize the tools to generate the description, MPEG-7 will also not standardize the tools that use the description. It might however be necessary to address the interface between the description and the search engine.

8. What form will the "descriptions" of multimedia content in MPEG-7 take?

The words ‘descriptions’ or ‘features’ represent a rich concept, that can be related to several levels of abstraction. Descriptions vary according to the types of data. Furthermore, different types of descriptions are necessary for different purposes of the categorization.

9. Will the standard allow automatic extraction of descriptions as well as manual entry?

The descriptions that conform to the MPEG-7 standard could be entered by hand, but they could also be automatically extracted. Some features can be best extracted automatically (color, texture), but for some other features (‘this scene contains three shoes and that music was recorded in 1995’) this is very hard or even impossible.

10. What is the relationship between MPEG-7 and other MPEG activities?

MPEG-7 can be used independently of the other MPEG standards - the description might even be attached to an analog movie. The representation that is defined within MPEG-4, i.e. the representation of audiovisual data in terms of objects, is however very well suited to what will be built on the MPEG-7 standard. This representation is basic to the process of categorization. In addition, MPEG-7 descriptions could be used to improve the functionalities of previous MPEG standards.

11. If I want to get involved in MPEG-7, what do I need to know about the other MPEG standards?

In principle, knowledge about the other three MPEG standards is not required for taking part in the MPEG-7 work. However, since some of MPEG-7's tools may be close to those of MPEG-4, some knowledge about them could be useful.

12. If I want to know more about the other MPEG standards, where do I look?

You can start by taking a look at MPEG's home page (http://www.cselt.it/mpeg/) which contains many useful references, including more lists with "Frequently Asked Questions" about MPEG activities.

13. So what happened to MPEG-5 and -6? (And how about 3?)

MPEG-3 existed once upon a time, but its goal, enabling HDTV, could be accomplished using the tools of MPEG-2, and hence the work item was abandoned. So after 1,2 and 4, there was much speculation about the next number. Should it be 5 (the next) or 8 (creating an obvious binary pattern)? MPEG, however, decided not to follow either logical expansion of the sequence, but chose the number of 7 instead. So MPEG-5 and MPEG-6 are, just like MPEG-3, not defined.

14. When will MPEG-7 replace the existing MPEG-1 and MPEG-2 standards?

MPEG-7 will not replace MPEG-1 MPEG-2 or in fact MPEG-4 it is intended to provide complementary functionality to these other MPEG standards: representing information about the content, not the content itself ("the bits about the bits") This functionality is the standardisation of multimedia content descriptions.

15. If I want to know more about, be involved in, or give an input to the MPEG-7 development process, whom should I contact?

You can contact any of the people listed below with their email addresses and telephone numbers. To visit MPEG meetings you need to be on your national delegation, but the people listed below can explain how this works.

Rob Koenen (KPN Research - the Netherlands / chairman MPEG Requirements)
r.h.koenen@research.kpn.com +31 70 332 5310
Itaru Kaneko (ASCII Corporation - Japan)
itaru-k@ascii.co.jp +81 3 5351 8670
Fernando Pereira (Instituto Superior Técnico - Portugal)
fp@lx.it.pt +351 1 8418460
Adam Lindsay (Riverland, Belgium/ audio contact)
adam@riv.be + 32 2 721 5454
Thomas Sikora (Heinrch Herz Institute - Germany / chairman MPEG Video)
sikora@hhi.de +49-30-31002 622