Conference Proceeding

Unsupervised learning of invariant features using video

Proceedings / CVPR, IEEE Computer Society Conference on Computer Vision and Pattern Recognition. IEEE Computer Society Conference on Computer Vision and Pattern Recognition 07/2010; DOI:10.1109/CVPR.2010.5539773 pp.1649 - 1656 In proceeding of: Computer Vision and Pattern Recognition (CVPR), 2010 IEEE Conference on
Source: IEEE Xplore

ABSTRACT We present an algorithm that learns invariant features from real data in an entirely unsupervised fashion. The principal benefit of our method is that it can be applied without human intervention to a particular application or data set, learning the specific invariances necessary for excellent feature performance on that data. Our algorithm relies on the ability to track image patches over time using optical flow. With the wide availability of high frame rate video (eg: on the web, from a robot), good tracking is straightforward to achieve. The algorithm then optimizes feature parameters such that patches corresponding to the same physical location have feature descriptors that are as similar as possible while simultaneously maximizing the distinctness of descriptors for different locations. Thus, our method captures data or application specific invariances yet does not require any manual supervision. We apply our algorithm to learn domain-optimized versions of SIFT and HOG. SIFT and HOG features are excellent and widely used. However, they are general and by definition not tailored to a specific domain. Our domain-optimized versions offer a substantial performance increase for classification and correspondence tasks we consider. Furthermore, we show that the features our method learns are near the optimal that would be achieved by directly optimizing the test set performance of a classifier. Finally, we demonstrate that the learning often allows fewer features to be used for some tasks, which has the potential to dramatically improve computational concerns for very large data sets.

0 0
 · 
0 Bookmarks
 · 
24 Views

Keywords

application specific invariances
 
correspondence tasks
 
domain-optimized versions
 
domain-optimized versions offer
 
excellent feature performance
 
features
 
frame rate video
 
HOG features
 
human intervention
 
large data sets
 
learns invariant features
 
method captures data
 
method learns
 
optical flow
 
particular application
 
real data
 
specific domain
 
specific invariances necessary
 
substantial performance increase
 
track image patches