Article

Human Object Inpainting Using Manifold Learning-Based Posture Sequence Estimation

Dept. of Comput. Sci., Nat. Chiao Tung Univ., Hsinchu, Taiwan
IEEE Transactions on Image Processing (impact factor: 3.04). 12/2011; DOI:10.1109/TIP.2011.2158228 pp.3124 - 3135
Source: IEEE Xplore

ABSTRACT We propose a human object inpainting scheme that divides the process into three steps: 1) human posture synthesis; 2) graphical model construction; and 3) posture sequence estimation. Human posture synthesis is used to enrich the number of postures in the database, after which all the postures are used to build a graphical model that can estimate the motion tendency of an object. We also introduce two constraints to confine the motion continuity property. The first constraint limits the maximum search distance if a trajectory in the graphical model is discontinuous, and the second confines the search direction in order to maintain the tendency of an object's motion. We perform both forward and backward predictions to derive local optimal solutions. Then, to compute an overall best solution, we apply the Markov random field model and take the potential trajectory with the maximum total probability as the final result. The proposed posture sequence estimation model can help identify a set of suitable postures from the posture database to restore damaged/missing postures. It can also make a reconstructed motion sequence look continuous.

0 0
 · 
0 Bookmarks
 · 
49 Views
  • Source
    Article: Video inpainting under constrained camera motion.
    [show abstract] [hide abstract]
    ABSTRACT: A framework for inpainting missing parts of a video sequence recorded with a moving or stationary camera is presented in this work. The region to be inpainted is general: it may be still or moving, in the background or in the foreground, it may occlude one object and be occluded by some other object. The algorithm consists of a simple preprocessing stage and two steps of video inpainting. In the preprocessing stage, we roughly segment each frame into foreground and background. We use this segmentation to build three image mosaics that help to produce time consistent results and also improve the performance of the algorithm by reducing the search space. In the first video inpainting step, we reconstruct moving objects in the foreground that are "occluded" by the region to be inpainted. To this end, we fill the gap as much as possible by copying information from the moving foreground in other frames, using a priority-based scheme. In the second step, we inpaint the remaining hole with the background. To accomplish this, we first align the frames and directly copy when possible. The remaining pixels are filled in by extending spatial texture synthesis techniques to the spatiotemporal domain. The proposed framework has several advantages over state-of-the-art algorithms that deal with similar types of data and constraints. It permits some camera motion, is simple to implement, fast, does not require statistical models of background nor foreground, works well in the presence of rich and cluttered backgrounds, and the results show that there is no visible blurring or motion artifacts. A number of real examples taken with a consumer hand-held camera are shown supporting these findings.
    IEEE Transactions on Image Processing 03/2007; 16(2):545-53. · 3.04 Impact Factor
  • Source
    Conference Proceeding: Video Completion for Perspective Camera Under Constrained Motion.
    18th International Conference on Pattern Recognition (ICPR 2006), 20-24 August 2006, Hong Kong, China; 01/2006
  • Article: Space-Time Completion of Video
    [show abstract] [hide abstract]
    ABSTRACT: This paper presents a new framework for the completion of missing information based on local structures. It poses the task of completion as a global optimization problem with a well-defined objective function and derives a new algorithm to optimize it. Missing values are constrained to form coherent structures with respect to reference examples. We apply this method to space-time completion of large space-time "holes" in video sequences of complex dynamic scenes. The missing portions are filled in by sampling spatio-temporal patches from the available parts of the video, while enforcing global spatio-temporal consistency between all patches in and around the hole. The consistent completion of static scene parts simultaneously with dynamic behaviors leads to realistic looking video sequences and images. Space-time video completion is useful for a variety of tasks, including, but not limited to: 1) sophisticated video removal (of undesired static or dynamic objects) by completing the appropriate static or dynamic background information. 2) Correction of missing/corrupted video frames in old movies. 3) Modifying a visual story by replacing unwanted elements. 4) Creation of video textures by extending smaller ones. 5) Creation of complete field-of-view stabilized video. 6) As images are one-frame videos, we apply the method to this special case as well
    IEEE Transactions on Pattern Analysis and Machine Intelligence 04/2007; 29(3):463-476. · 4.91 Impact Factor

Full-text (2 Sources)

View
7 Downloads
Available from
8 Jan 2013

Keywords

backward predictions
 
damaged/missing postures
 
derive local optimal solutions
 
final result
 
first constraint limits
 
Human posture synthesis
 
inpainting scheme
 
Markov random field model
 
maximum search distance
 
maximum total probability
 
motion continuity property
 
motion tendency
 
object's motion
 
posture database
 
postures
 
proposed posture sequence estimation model
 
reconstructed motion sequence
 
steps
 
suitable postures