k.
      aceMedia         P2People         TheSubtitler

M-OntoMat-Annotizer: Linking Ontologies and Multimedia Low-Level 
Features for Multimedia Analysis, Reasoning and Retrieval

In the aceMedia project, novel knowledge structures and appropriate tools have been developed, since ontology structures have been extended and enriched to include low-level visual features and descriptors (following the MPEG-7 standard). This was done in order to support knowledge-aware multimedia analysis, media-aware semantic inferencing and high-level semantic reasoning, which will provide the means for automatic content annotation and generation of the ACE scalable metadata layer.

The developed knowledge infrastructure consists of several parts, namely the core ontology as a basis for all components of the knowledge infrastructure, multimedia-specific conceptualizations in form of the visual descriptor ontology (VDO) and the multimedia structure ontology and specific domain ontologies for selected personal and commercial domains. Additionally, a tool is needed for linking the prototypical instances of domain concepts from the domain ontology with corresponding instances of the VDO descriptors (i.e. linking with the multimedia/MPEG-7 descriptors).

Currently, there are many existing tools for ontology-based annotation. Most of them are document-centric, therefore they only support textual annotation of web pages and ontologies and there was no solution available so far, for the annotation of multimedia resources with low-level visual features. Thus, there was a need for the development of a tool that bridges and integrates the domain concepts and the low-level multimedia content description features.


In the aceMedia project, it has been decided to extend the CREAM (CREAting Metadata for the SemanticWeb) framework. OntoMat-Annotizer is the reference implementation of CREAM. It is Java-based and developed in the research field of ontology-based knowledge representation. There now exist two applications for the OntoMat-Annotizer framework: (i) it is used as an annotation tool for web pages and (ii) it acts as the basis of an ontology engineering environment. Moreover, OntoMat-Annotizer, provides a flexible plugin interface for further application extensions called OntoPlugin and offers the possibility to implement new components and extend the core functionality of OntoMat-Annotizer.

For this purpose the Visual Description Extraction (VDE) tool was developed as plug-in to OntoMat-Annotizer, enabling the selection of a region of interest within an image or a key frame and the extraction of the associated visual descriptors. This integrated framework is called M-OntoMat-Annotizer (M stands for Multimedia) and allows the semantic annotation of images and videos for multimedia analysis and retrieval, supporting the initialization and linking of RDF(S) domain ontologies with low-level MPEG-7 visual descriptors.

M-OntoMat-Annotizer Visual Editor and Media Viewer presents a graphical interface for loading and processing of visual content (images and videos), extraction of visual features and association with domain ontology concepts. Usually, the user needs to extract the features (multimedia descriptors) of a specific object inside the image/frame. For this reason, M-OntoMat-Annotizer lets the user draw a region of interest in the image and apply the multimedia descriptors extraction procedure only to the specific selected region. Alternatively, M-OntoMat-Annotizer supports automatic segmentation of the image; whenever an image is loaded it is automatically segmented into regions. User can then select a desired region or even merge two or more regions and proceed with the extraction.

M-OntoMat-Annotizer supports the extraction of the core MPEG-7 Descriptors included in the VDO. For this purpose it uses the aceToolbox, a content pre-processing toolbox developed inside aceMedia project, which is responsible for the low-level analysis and MPEG-7 feature extraction. M-OntoMat-Annotizer supports the transformation of the extracted multimedia resources (MPEG-7 Descriptors in XML format) into instances of the visual descriptors defined in the VDO by means of an XSL transformation specification. The selection of the appropriate domain concept is performed by the user himself who guides the tool in producing the appropriate domain concept prototype instances and linking these with the newly created instances of visual descriptors.

This extracted knowledge plays a central role in automatic semantic analysis, through tools currently being developed in aceMedia that automatically analyse content, generate metadata and annotation, and support intelligent content search and retrieval services.

M-OntoMat-Annotizer is publicly available as free software. For more information visit the tool's homepage.

[Home] [Related Work] [Projects] [Publications] [CV]