Article

Eye movement prediction and variability on natural video data sets.

Institute for Neuro- and Bioinformatics, University of Lübeck, Ratzeburger Allee 160, D-23538 Lübeck, Germany, , .
Visual Cognition (impact factor: 2.05). 01/2012; 20(4-5):495-514. DOI:10.1080/13506285.2012.667456 pp.495-514
Source: PubMed

ABSTRACT We here study the predictability of eye movements when viewing high-resolution natural videos. We use three recently published gaze data sets that contain a wide range of footage, from scenes of almost still-life character to professionally made, fast-paced advertisements and movie trailers. Inter-subject gaze variability differs significantly between data sets, with variability being lowest for the professional movies. We then evaluate three state-of-the-art saliency models on these data sets. A model that is based on the invariants of the structure tensor and that combines very generic, sparse video representations with machine learning techniques outperforms the two reference models; performance is further improved for two data sets when the model is extended to a perceptually inspired colour space. Finally, a combined analysis of gaze variability and predictability shows that eye movements on the professionally made movies are the most coherent (due to implicit gaze-guidance strategies of the movie directors), yet the least predictable (presumably due to the frequent cuts). Our results highlight the need for standardized benchmarks to comparatively evaluate eye movement prediction algorithms.

0 0
 · 
0 Bookmarks
 · 
29 Views
  • Source
    Article: SUN: A Bayesian framework for saliency using natural statistics.
    [show abstract] [hide abstract]
    ABSTRACT: We propose a definition of saliency by considering what the visual system is trying to optimize when directing attention. The resulting model is a Bayesian framework from which bottom-up saliency emerges naturally as the self-information of visual features, and overall saliency (incorporating top-down information with bottom-up saliency) emerges as the pointwise mutual information between the features and the target when searching for a target. An implementation of our framework demonstrates that our model's bottom-up saliency maps perform as well as or better than existing algorithms in predicting people's fixations in free viewing. Unlike existing saliency measures, which depend on the statistics of the particular image being viewed, our measure of saliency is derived from natural image statistics, obtained in advance from a collection of natural images. For this reason, we call our model SUN (Saliency Using Natural statistics). A measure of saliency based on natural image statistics, rather than based on a single test image, provides a straightforward explanation for many search asymmetries observed in humans; the statistics of a single test image lead to predictions that are not consistent with these asymmetries. In our model, saliency is computed locally, which is consistent with the neuroanatomy of the early visual system and results in an efficient algorithm with few free parameters.
    Journal of Vision 02/2008; 8(7):32.1-20. · 3.38 Impact Factor

Keywords

colour space
 
combined analysis
 
data sets
 
eye movements
 
fast-paced advertisements
 
frequent cuts
 
generic
 
high-resolution natural videos
 
implicit gaze-guidance strategies
 
movie directors
 
professional movies
 
professionally made movies
 
sparse video representations
 
standardized benchmarks
 
state-of-the-art saliency models
 
still-life character
 
techniques outperforms
 
two reference models
 
variability
 
wide range
 

Michael Dorr