Conference Paper

A simplified approach to rushes summarization.

DOI: 10.1145/1463563.1463573 Conference: Proceedings of the 2nd ACM Workshop on Video Summarization, TVS 2008, Vancouver, British Columbia, Canada, October 31, 2008
Source: DBLP


In this paper we describe methods for video summarization in the context of the TRECVID 2008 BBC Rushes Summa- rization task. Color, motion, and audio features are used to segment, lter, and cluster the video. We experiment with varying the segment similarity measure to improve the joint clustering of segments with and without camera mo- tion. Compared to our previous eort for TRECVID 2007 we have reduced the complexity of the summarization pro- cess as well as the visual complexity of the summaries them- selves. We nd our objective (inclusion) performance to be competitive with systems exhibiting similar subjective per- formance.

6 Reads
  • Source
    • "The generated summary ended with a keyframe storyboard to provide a re-cap of what was included. FX Palo Alto Laboratory Inc. in California, USA [5] used the metadata donated by the NHK Science and Technical Laboratory for junk frame removal and segmented the video using a combination of motion, audio and color features, and then clustered based on these. Two runs were submitted which vary in the methods for clip similarity and selection of clips for inclusion. "
    [Show abstract] [Hide abstract]
    ABSTRACT: This paper describes an evaluation of automatic video summarization systems run on rushes from several BBC dramatic series. It was carried out under the auspices of the TREC Video Retrieval Evaluation (TRECVid) as a followup to the 2007 video summarization workshop held at ACM Multimedia 2007. 31 research teams submitted video summaries of 40 individual rushes video files, aiming to compress out redundant and insignificant material. Each summary had a duration of at most 2% of the original. The output of a baseline system, which simply presented each full video at 50 times normal speed was contributed by Carnegie Mellon University (CMU) as a control. The 2007 procedures for developing ground truth lists of important segments from each video were applied at the National Institute of Standards and Technology (NIST) to the BBC videos. At Dublin City University (DCU) each summary was judged by 3 humans with respect to how much of the ground truth was included and how well-formed the summary was. Additional objective measures included: how long it took the system to create the summary, how long it took the assessor to judge it against the ground truth, and what the summary's duration was. Assessor agreement on finding desired segments averaged 81%. Results indicated that while it was still difficult to exceed the performance of the baseline on including ground truth, the baseline was outperformed by most other systems with respect to avoiding redundancy/junk and presenting the summary with a pleasant tempo/rhythm. Footnote{Certain commercial entities, equipment, or materials may be identified in this document in order to describe an experimental procedure or concept adequately. Such identification is not intended to imply recommendation or endorsement by the National Institute of Standards, nor is it intended to imply that the entities, materials, or equipment are necessarily the best available for the purpose.}
  • Source
    [Show abstract] [Hide abstract]
    ABSTRACT: In 2007 FXPAL submitted results for two tasks: rushes summarization and interactive search. The rushes
    TRECVID 2007 workshop participants notebook papers, Gaithersburg, MD, USA, November 2007; 01/2007
  • Source
    TRECVID 2008 workshop participants notebook papers, Gaithersburg, MD, USA, November 2008; 01/2008
Show more


6 Reads
Available from