Tai-Pang Wu

Hong Kong Applied Science and Technology Research Institute, Hong Kong, Hong Kong

Are you Tai-Pang Wu?

Claim your profile

Publications (21)25.1 Total impact

  • Source
    Article: A closed-form solution to tensor voting: theory and applications.
    [show abstract] [hide abstract]
    ABSTRACT: We prove a closed-form solution to tensor voting (CFTV): Given a point set in any dimensions, our closed-form solution provides an exact, continuous, and efficient algorithm for computing a structure-aware tensor that simultaneously achieves salient structure detection and outlier attenuation. Using CFTV, we prove the convergence of tensor voting on a Markov random field (MRF), thus termed as MRFTV, where the structure-aware tensor at each input site reaches a stationary state upon convergence in structure propagation. We then embed structure-aware tensor into expectation maximization (EM) for optimizing a single linear structure to achieve efficient and robust parameter estimation. Specifically, our EMTV algorithm optimizes both the tensor and fitting parameters and does not require random sampling consensus typically used in existing robust statistical techniques. We performed quantitative evaluation on its accuracy and robustness, showing that EMTV performs better than the original TV and other state-of-the-art techniques in fundamental matrix estimation for multiview stereo matching. The extensions of CFTV and EMTV for extracting multiple and nonlinear structures are underway.
    IEEE Transactions on Software Engineering 12/2011; 34(8):1482-95. · 1.98 Impact Factor
  • Conference Proceeding: Adequate reconstruction of transparent objects on a shoestring budget
    [show abstract] [hide abstract]
    ABSTRACT: Reconstructing transparent objects is a challenging problem. While producing reasonable results for quite complex objects, existing approaches require custom calibration or somewhat expensive labor to achieve high precision. On the other hand, when an overall shape preserving salient and fine details is sufficient, we show in this paper a significant step toward solving the problem on a shoestring budget, by using only a video camera, a moving spotlight, and a small chrome sphere. Specifically, the problem we address is to estimate the normal map of the exterior surface of a given solid transparent object, from which the surface depth can be integrated. Our technical contribution lies in relating this normal reconstruction problem to one of graph-cut segmentation. Unlike conventional formulations, however, our graph is dual-layered, since we can see a transparent object's foreground as well as the background behind it. Quantitative and qualitative evaluation are performed to verify the efficacy of this practical solution.
    Computer Vision and Pattern Recognition (CVPR), 2011 IEEE Conference on; 07/2011
  • Conference Proceeding: Adequate reconstruction of transparent objects on a shoestring budget.
    The 24th IEEE Conference on Computer Vision and Pattern Recognition, CVPR 2011, Colorado Springs, CO, USA, 20-25 June 2011; 01/2011
  • Source
    Article: Surface-from-Gradients without Discrete Integrability Enforcement: A Gaussian Kernel Approach
    [show abstract] [hide abstract]
    ABSTRACT: Representative surface reconstruction algorithms taking a gradient field as input enforce the integrability constraint in a discrete manner. While enforcing integrability allows the subsequent integration to produce surface heights, existing algorithms have one or more of the following disadvantages: They can only handle dense per-pixel gradient fields, smooth out sharp features in a partially integrable field, or produce severe surface distortion in the results. In this paper, we present a method which does not enforce discrete integrability and reconstructs a 3D continuous surface from a gradient or a height field, or a combination of both, which can be dense or sparse. The key to our approach is the use of kernel basis functions, which transfer the continuous surface reconstruction problem into high-dimensional space, where a closed-form solution exists. By using the Gaussian kernel, we can derive a straightforward implementation which is able to produce results better than traditional techniques. In general, an important advantage of our kernel-based method is that the method does not suffer discretization and finite approximation, both of which lead to surface distortion, which is typical of Fourier or wavelet bases widely adopted by previous representative approaches. We perform comparisons with classical and recent methods on benchmark as well as challenging data sets to demonstrate that our method produces accurate surface reconstruction that preserves salient and sharp features. The source code and executable of the system are available for downloading.
    IEEE Transactions on Pattern Analysis and Machine Intelligence 12/2010; · 4.91 Impact Factor
  • Source
    Article: Photometric Stereo via Expectation Maximization
    Tai-Pang Wu, Chi-Keung Tang
    [show abstract] [hide abstract]
    ABSTRACT: This paper presents a robust and automatic approach to photometric stereo, where the two main components, namely surface normals and visible surfaces, are respectively optimized by expectation maximization (EM). A dense set of input images is conveniently captured using a digital video camera while a handheld spotlight is being moved around the target object and a small mirror sphere. In our approach, the inherently complex optimization problem is simplified into a two-step optimization, where EM is employed in each step: 1) Using the dense input, the weight or importance of each observation is alternately optimized with the normal and albedo at each pixel and 2) using the optimized normals and employing the Markov random fields (MRFs), surface integrabilities and discontinuities are alternately optimized in visible surface reconstruction. Our mathematical derivation gives simple updating rules for the EM algorithms, leading to a stable, practical, and parameter-free implementation that is very robust even in the presence of complex geometry, shadows, highlight, and transparency. We present high-quality results on normal and visible surface reconstruction, where fine geometric details are automatically recovered by our method.
    IEEE Transactions on Pattern Analysis and Machine Intelligence 04/2010; · 4.91 Impact Factor
  • Conference Proceeding: Quasi-dense 3D reconstruction using tensor-based multiview stereo.
    The Twenty-Third IEEE Conference on Computer Vision and Pattern Recognition, CVPR 2010, San Francisco, CA, USA, 13-18 June 2010; 01/2010
  • Article: Modeling and rendering of impossible figures.
    ACM Trans. Graph. 01/2010; 29.
  • Article: Interactive normal reconstruction from a single image.
    ACM Trans. Graph. 01/2008; 27:119.
  • Source
    Conference Proceeding: Extracting smooth and transparent layers from a single image.
    2008 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2008), 24-26 June 2008, Anchorage, Alaska, USA; 01/2008
  • Article: Natural shadow matting.
    ACM Trans. Graph. 01/2007; 26.
  • Conference Proceeding: Surface-from-Gradients with Incomplete Data for Single View Modeling.
    IEEE 11th International Conference on Computer Vision, ICCV 2007, Rio de Janeiro, Brazil, October 14-20, 2007; 01/2007
  • Source
    Article: ShapePalettes: interactive normal transfer via sketching.
    ACM Trans. Graph. 01/2007; 26:44.
  • Source
    Article: Dense photometric stereo: a Markov random field approach.
    [show abstract] [hide abstract]
    ABSTRACT: We address the problem of robust normal reconstruction by dense photometric stereo, in the presence of complex geometry, shadows, highlight, transparencies, variable attenuation in light intensities, and inaccurate estimation in light directions. The input is a dense set of noisy photometric images, conveniently captured by using a very simple set-up consisting of a digital video camera, a reflective mirror sphere, and a handheld spotlight. We formulate the dense photometric stereo problem as a Markov network and investigate two important inference algorithms for Markov Random Fields (MRFs)--graph cuts and belief propagation--to optimize for the most likely setting for each node in the network. In the graph cut algorithm, the MRF formulation is translated into one of energy minimization. A discontinuity-preserving metric is introduced as the compatibility function, which allows alpha-expansion to efficiently perform the maximum a posteriori (MAP) estimation. Using the identical dense input and the same MRF formulation, our tensor belief propagation algorithm recovers faithful normal directions, preserves underlying discontinuities, improves the normal estimation from one of discrete to continuous, and drastically reduces the storage requirement and running time. Both algorithms produce comparable and very faithful normals for complex scenes. Although the discontinuity-preserving metric in graph cuts permits efficient inference of optimal discrete labels with a theoretical guarantee, our estimation algorithm using tensor belief propagation converges to comparable results, but runs faster because very compact messages are passed and combined. We present very encouraging results on normal reconstruction. A simple algorithm is proposed to reconstruct a surface from a normal map recovered by our method. With the reconstructed surface, an inverse process, known as relighting in computer graphics, is proposed to synthesize novel images of the given scene under user-specified light source and direction. The synthesis is made to run in real time by exploiting the state-of-the-art graphics processing unit (GPU). Our method offers many unique advantages over previous relighting methods and can handle a wide range of novel light sources and directions.
    IEEE Transactions on Pattern Analysis and Machine Intelligence 12/2006; 28(11):1830-46. · 4.91 Impact Factor
  • Source
    Article: Video repairing under variable illumination using cyclic motions.
    [show abstract] [hide abstract]
    ABSTRACT: This paper presents a complete system capable of synthesizing a large number of pixels that are missing due to occlusion or damage in an uncalibrated input video. These missing pixels may correspond to the static background or cyclic motions of the captured scene. Our system employs user-assisted video layer segmentation, while the main processing in video repair is fully automatic. The input video is first decomposed into the color and illumination videos. The necessary temporal consistency is maintained by tensor voting in the spatio-temporal domain. Missing colors and illumination of the background are synthesized by applying image repairing. Finally, the occluded motions are inferred by spatio-temporal alignment of collected samples at multiple scales. We experimented on our system with some difficult examples with variable illumination, where the capturing camera can be stationary or in motion.
    IEEE Transactions on Pattern Analysis and Machine Intelligence 06/2006; 28(5):832-9. · 4.91 Impact Factor
  • Source
    Conference Proceeding: Visible Surface Reconstruction from Normals with Discontinuity Consideration
    Tai-Pang Wu, Chi-Keung Tang
    [show abstract] [hide abstract]
    ABSTRACT: Given a dense set of imperfect normals obtained by photometric stereo or shape from shading, this paper presents an optimization algorithm which alternately optimizes until convergence the surface integrabilities and discontinuities inherent in the normal field, in order to derive a segmented surface description of the visible scene without noticeable distortion. In our Expectation-Maximization (EM) framework, we enforce discontinuity-preserving integrability so that fine details are preserved within each output segment while the occlusion boundaries are localized as sharp surface discontinuities. Using the resulting weighted discontinuity map, the estimation of a discontinuity-preserving height field can be formulated into a convex optimization problem. We compare our method and present convincing results on synthetic and real data.
    Computer Vision and Pattern Recognition, 2006 IEEE Computer Society Conference on; 02/2006
  • Source
    Conference Proceeding: Separating Subsurface Scattering from Photometric Image.
    Tai-Pang Wu, Chi-Keung Tang
    18th International Conference on Pattern Recognition (ICPR 2006), 20-24 August 2006, Hong Kong, China; 01/2006
  • Conference Proceeding: Dense Photometric Stereo by Expectation Maximization.
    Tai-Pang Wu, Chi-Keung Tang
    Computer Vision - ECCV 2006, 9th European Conference on Computer Vision, Graz, Austria, May 7-13, 2006, Proceedings, Part IV; 01/2006
  • Source
    Conference Proceeding: A Bayesian approach for shadow extraction from a single image
    Tai-Pang Wu, Chi-Keung Tang
    [show abstract] [hide abstract]
    ABSTRACT: This paper addresses the problem of shadow extraction from a single image of a complex natural scene. No simplifying assumption on the camera and the light source other than the Lambertian assumption is used. Our method is unique because it is capable of translating very rough user-supplied hints into the effective likelihood and prior functions for our Bayesian optimization. The likelihood function requires a decent estimation of the shadowless image, which is obtained by solving the associated Poisson equation. Our Bayesian framework allows for the optimal extraction of smooth shadows while preserving texture appearance under the extracted shadow. Thus our technique can be applied to shadow removal, producing some best results to date compared with the current state-of-the-art techniques using a single input image. We propose related applications in shadow compositing and image repair using our Bayesian technique.
    Computer Vision, 2005. ICCV 2005. Tenth IEEE International Conference on; 11/2005
  • Source
    Conference Proceeding: Dense photometric stereo using a mirror sphere and graph cut
    Tai-Pang Wu, Chi-Keung Tang
    [show abstract] [hide abstract]
    ABSTRACT: We present a surprisingly simple system that performs robust normal reconstruction by dense photometric stereo, in the presence of large shadows, highlight, transparencies, complex geometry, variable attenuation in light intensity and inaccurate light directions. Our system consists of a mirror sphere, a spotlight and a DV camera only. Using this, we infer a dense set of unbiased but noisy photometric data uniformly distributed on the light direction sphere. We use this dense set to derive a very robust matching cost for our MRF photometric stereo model, where the maximum a posteriori (MAP) solution is estimated. To aggregate support for candidate normals in the normal refinement process, we introduce a compatibility function that is translated into a discontinuity-preserving metric, thus speeding up the MAP estimation by energy minimization using graph cut. No reference object of similar material is used. We perform detailed comparison on our approach with conventional convex minimization. We show very good normals estimated from very noisy data on a wide range of difficult objects to show the robustness and usefulness of our method.
    Computer Vision and Pattern Recognition, 2005. CVPR 2005. IEEE Computer Society Conference on; 07/2005
  • Conference Proceeding: Separating Specular, Diffuse, and Subsurface Scattering Reflectances from Photometric Images.
    Tai-Pang Wu, Chi-Keung Tang
    Computer Vision - ECCV 2004, 8th European Conference on Computer Vision, Prague, Czech Republic, May 11-14, 2004. Proceedings, Part II; 01/2004

Institutions

  • 2011
    • Hong Kong Applied Science and Technology Research Institute
      Hong Kong, Hong Kong
  • 2005–2010
    • The Hong Kong University of Science and Technology
      • Department of Computer Science and Engineering
      Kowloon, Hong Kong
  • 2006
    • The Chinese University of Hong Kong
      • Department of Computer Science and Engineering
      Hong Kong, Hong Kong