Fig 1 - uploaded by Silvia Pfeiffer
Content may be subject to copyright.
Source publication
The World Wide Web, with its paradigms of surfing and searching for information, has become the predominant system for computer-based information retrieval. Media resources, however information-rich, only play a minor role in providing information to Web users. While bandwidth (or the lack thereof) may be an excuse for this situation, the lack of s...
Context in source publication
Context 1
... Continuous Media Markup Language [23] has been designed to cater for two different yet related uses: • Authoring of clip structures, anchor points, annotations, and hyperlinks for time-continuous data in preparation for integration with the data in an Annodex resource. • Indexing and crawling of time-continuous Web resources for search engines to perform retrieval on a textual representation of the binary Annodex resources. CMML is an XML-based language to describe the content of a time-continuous Web resource. It is an authoring language for annotating, indexing, and hyperlinking time- continuous data in Annodex format. A CMML file contains structured XML markup where we have chosen the XML tags to be very similar to XHTML to enable a simple trans- fer of knowledge for HTML authors. CMML documents consist of three main types of tags: at most one stream tag, exactly one head tag, and an arbitrary number of clip tags. The stream tag is optional and describes the input bitstream(s) necessary for the creation of an Annodex resource. The head tag contains information related to the complete time-continuous resource. A clip tag, in contrast, contains information on a temporal fragment of the data. Figure 1 shows an example of a CMML document. It describes two clips for an MPEG video about the “Research Hunter.” The media file is referred to in the stream tag as “index.mpg,” and the title in the head tag provides the subject, while details about the clips are found in the clip tags. This is an actual example from our test collection at . The XML markup of a head tag in the CMML document contains information on the complete time-continuous Web resource. Its essential information contains: Structured descriptions in meta tags and • Unstructured textual annotations in the title tag. The XML markup of a clip tag contains information on a fragment of time-continuous data: • Anchor points (i.e., the id attribute) to provide locations inside a time-continuous resource that a URI can refer to. Anchor points identify the start of a clip. This enables URIs to refer to clips in CMML or Annodex resources via fragment specifications as described in Sect. 4.1. • Structured textual annotations in the meta tags in the same way as the head tag. • Unstructured textual annotations in the desc tags. Unstructured annotations are free text and mainly relevant for search applications. • An optional keyframe in the img tag providing a representative image for the clip and enabling display of a table of contents for Annodex resources. • Outgoing URI links in the a tag of the clip can point to any other place a URI can point to, such as clips in other Annodex resources or anchors in HTML ...
Similar publications
This paper presents a metadata model for the description of language resources proposed in the framework of the META-SHARE infrastructure, aiming to cover both datasets and tools/technologies used for their processing. It places the model in the overall framework of metadata models, describes the basic principles and features of the model, elaborat...
There are currently several metadata patterns which describes
information of different themes and areas. Many of these patterns are present in
web systems which allow different forms of precisely retrieving than the popular
search engines. The current work proposes a metadata model to describe news story
in several information vehicles. The resourc...
Citations
In recent years, blogging has become an exploding passion among Internet communities. By combining the grassroots blogging with the richness of expression available in video, videoblogs (vlogs for short) will be a powerful new media adjunct to our existing televised news sources. Vlogs have gained much attention worldwide, especially with Google's acquisition of YouTube. This article presents a comprehensive survey of videoblogging (vlogging for short) as a new technological trend. We first summarize the technological challenges for vlogging as four key issues that need to be answered. Along with their respective possibilities, we give a review of the currently available techniques and tools supporting vlogging, and envision emerging technological directions for future vlogging. Several multimedia technologies are introduced to empower vlogging technology with better scalability, interactivity, searchability, and accessability, and to potentially reduce the legal, economic, and moral risks of vlogging applications. We also make an in-depth investigation of various vlog mining topics from a research perspective and present several incentive applications such as user-targeted video advertising and collective intelligence gaming. We believe that vlogging and its applications will bring new opportunities and drives to the research in related fields.
Since the year 2000 a project under the name of "Continuous Media Web", CMWeb, has explored how to make video (and incidentally audio) a first class citizen on the Web. The project has led to a set of open specifications and open source implementations, which have been included into the Xiph set of open media technologies. In the spirit of the Web, specifications for a Video Web should be based on unencumbered formats, which is why Xiph was chosen.
In this paper we illustrate the model-driven development approach applied to the user interface of an audiovisual search application, within the European project PHAROS. We show how conceptual modelling can capture the most complex features of an audio-visual Web search portal, which allows users to pose advanced queries over multi-media materials, access results of queries using multi-modal and multi-channel interfaces, and customize the search experience by saving queries of interest in a personal profile, so that they can be exploited for asynchronous notification of new relevant audiovisual information. We show how model-driven development can help the generation of the code for sophisticated Rich Internet Application front-ends, typical of the multimedia portals of the future.
In recent years, blogging has become an exploding passion among Internet communities. By combining the grassroots blogging with the richness of expression available in video, videoblogs (vlogs for short) will be a powerful new media adjunct to our existing televised news sources. Vlogs have gained much attention worldwide, especially with Google's acquisition of YouTube. This article presents a comprehensive survey of videoblogging (vlogging for short) as a new technological trend. We first summarize the technological challenges for vlogging as four key issues that need to be answered. Along with their respective possibilities, we give a review of the currently available techniques and tools supporting vlogging, and envision emerging technological directions for future vlogging. Several multimedia technologies are introduced to empower vlogging technology with better scalability, interactivity, searchability, and accessability, and to potentially reduce the legal, economic, and moral risks of vlogging applications. We also make an in-depth investigation of various vlog mining topics from a research perspective and present several incentive applications such as user-targeted video advertising and collective intelligence gaming. We believe that vlogging and its applications will bring new opportunities and drives to the research in related fields.
In this paper, we provide a brief survey of the mul timedia information retrieval domain as well as introduce s ome ideas investigated in the special issue. We hope that the contributions of this issue provide motivation for readers to dea l with the current challenges and problems. Such contributions are the basis of tomorrow's multimedia information systems. Our aims are to clarify some notions raised by this new technology by reviewing the current capabilities and the potential usefulne ss to users in various areas. The research and development issues cover a wide range of fields, many of which are shared with medi a processing, signal processing, data base technologies and data mining.
Semantic interpretation of the data distributed over the Internet is subject to major current research activity. The Continuous Media Web (CMWeb) extends the World Wide Web to time-continuously sampled data such as audio and video in regard to the searching, linking, and browsing functionality. The CMWeb technology is based the file format Annodex which streams the media content interspersed with markup in the Continuous Media Markup Language (CMML) format that contains information relevant to the whole media file, e.g., title, author, language as well as time-sensitive information, e.g., topics, speakers, time-sensitive hyperlinks. The CMML markup may be generated manually or automatically. This paper investigates the automatic extraction of meta data and markup information from complex linguistic annotations, which are annotated recordings collected for use in linguistic research. We are particularly interested in annotated recordings of meetings and teleconferences and see automatically generated CMML files and their corresponding Annodex streams as one way of viewing such recordings. The paper presents some experiments with generating Annodex files from hand-annotated meeting recordings.