About
74
Publications
40,951
Reads
How we measure 'reads'
A 'read' is counted each time someone views a publication summary (such as the title, abstract, and list of authors), clicks on a figure, or views or downloads the full-text. Learn more
9,527
Citations
Introduction
Current institution
Additional affiliations
January 2010 - present
Publications
Publications (74)
In real-world face recognition applications, there is a tremendous amount of data with two images for each person. One is an ID photo for face enrollment, and the other is a probe photo captured on spot. Most existing methods are designed for training data with limited breadth (a relatively small number of classes) and sufficient depth (many sample...
In many face recognition applications, there is large amount of face data with two images for each person. One is an ID photo for face enrollment, and the other is a probe photo captured on spot. Most existing methods are designed for training data with limited breadth (relatively small class number) and sufficient depth (many samples for each clas...
In many face recognition applications, the modalities of face images between the gallery and probe sets are different, which is known as heterogeneous face recognition. How to reduce the feature gap between images from different modalities is a critical issue to develop highly accurate face recognition algorithm. Recently, Joint Bayesian (JB) has d...
Video synopsis or condensation is a smart solution for fast video browsing and storage. However, most of the existing methods work offline, where two main phases are required. The first phase is to prepare tubes and background images. The second phase is to rearrange tubes and stitch them into backgrounds. However, with a long video sequence, the f...
In the last five years, biologically inspired features (BIF) always held the state-of-the-art results for human age estimation from face images. Recently, researchers mainly put their focuses on the regression step after feature extraction, such as support vector regression (SVR), partial least squares (PLS), canonical correlation analysis (CCA) an...
Deep learning, in particular Convolutional Neural Network (CNN), has achieved
promising results in face recognition recently. However, it remains an open
question: why CNNs work well and how to design a 'good' architecture. The
existing works tend to focus on reporting CNN architectures that work well for
face recognition rather than investigate th...
Face antispoofing is important to practical face recognition systems. In previous works, a generic antispoofing classifier is trained to detect spoofing attacks on all subjects. However, due to the individual differences among subjects, the generic classifier cannot generalize well to all subjects. In this paper, we propose a person-specific face a...
Learning-based face descriptors have constantly improved the face recognition performance. Compared with the hand-crafted features, learning-based features are considered to be able to exploit information with better discriminative ability for specific tasks. Motivated by the recent success of deep learning, in this paper, we extend the original sh...
Pushing by big data and deep convolutional neural network (CNN), the
performance of face recognition is becoming comparable to human. Using private
large scale training datasets, several groups achieve very high performance on
LFW, i.e., 97% to 99%. While there are many open source implementations of CNN,
none of large scale face dataset is publicl...
Until now, most existing researches on person re-identification aim at improving the recognition rate on single dataset setting. The training data and testing data of these methods are form the same source. Although they have obtained high recognition rate in experiments, they usually perform poorly in practical applications. In this paper, we focu...
Face recognition (FR) systems in real-world applications need to deal with a wide range of interferences, such as occlusions and disguises in face images. Compared with other forms of interferences such as nonuniform illumination and pose changes, face with occlusions has not attracted enough attention yet. A novel approach, coined dynamic image-to...
Color naming, which relates colors with color names, can help people with a semantic analysis of images in many computer vision applications. In this paper, we propose a novel salient color names based color descriptor (SCNCD) to describe colors. SCNCD utilizes salient color names to guarantee that a higher probability will be assigned to the color...
Many efforts have been made in recent years to tackle the unconstrained face recognition challenge. For the benchmark of this challenge, the Labeled Faces in theWild (LFW) database has been widely used. However, the standard LFW protocol is very limited, with only 3,000 genuine and 3,000 impostor matches for classification. Today a 97% accuracy can...
3D Morph able Model (3DMM) has been widely used in face analysis for many years. The most challenging part of 3DMM is to find the correspondences between 3D points and 2D pixels. Existing methods only use key points, edges, specular highlights and image pixels to complete the task, which are not accurate or robust. This paper proposes a new algorit...
Various hand-crafted features and metric learning methods prevail in the
field of person re-identification. Compared to these methods, this paper
proposes a more general way that can learn a similarity metric from image
pixels directly. By using a "siamese" deep neural network, the proposed method
can jointly learn the color feature, texture featur...
With the wide applications of face recognition, spoofing attack is becoming a big threat to their security. Conventional face recognition systems usually adopt behavioral challenge-response or texture analysis methods to resist spoofing attacks, however, these methods require high user cooperation and are sensitive to the imaging quality and enviro...
Multi-target tracking is an interesting but challenging task in computer vision field. Most previous data association based methods merely consider the relationships (e.g. appearance and motion pattern similarities) between detections in local limited temporal domain, leading to their difficulties in handling long-term occlusion and distinguishing...
After intensive research, heterogenous face recognition is still a
challenging problem. The main difficulties are owing to the complex
relationship between heterogenous face image spaces. The heterogeneity is
always tightly coupled with other variations, which makes the relationship of
heterogenous face images highly nonlinear. Many excellent metho...
In this paper, we develop a new face recognition system that was designed to use the second generation identity card (2G-ID card). In order to ensure the security in enrollment, the new subject is registered in a verification way. In enrollment phase, the captured image is firstly compared with the face image stored in 2G-ID card. If the similarity...
Visual tracking is an important but challenging problem in the computer vision field. In the real world, the appearances of the target and its surroundings change continuously over space and time, which provides effective information to track the target robustly. However, enough attention has not been paid to the spatio-temporal appearance informat...
In this paper, we present the details of our method in attending the 300 Faces in-the-wild (300W) challenge. We build our method on cascade regression framework, where a series of regressors are utilized to progressively refine the shape initialized by face detector. In cascade regression, we use the HOG feature in a multi-scale manner, where the l...
Attributes are helpful to infer high-level semantic knowledge of pedestrians, thus improving the performance of pedestrian tracking, retrieval, re-identification, etc. However, current pedestrian databases are mainly for the pedestrian detection or tracking application, and semantic attribute annotations related to pedestrians are rarely provided....
This paper demonstrates a novel retrieval synopsis system based on moving objects for surveillance video. With the popularization of digital video surveillance, massive data has been stored and the volume is still rising. How to utilize surveillance video effectively and efficiently is strategically important for practical applications. So as to im...
As a crucial security problem, anti-spoofing in biomet-rics, and particularly for the face modality, has achieved great progress in the recent years. Still, new threats arrive in form of better, more realistic and more sophisticated spoofing attacks. The objective of the 2nd Competition on Counter Measures to 2D Face Spoofing Attacks is to challeng...
Most existing pose robust methods are too computational complex to meet practical applications and their performance under unconstrained environments are rarely evaluated. In this paper, we propose a novel method for pose robust face recognition towards practical applications, which is fast, pose robust and can work well under unconstrained environ...
In recent years, heterogeneous face biometrics has attracted more attentions in the face recognition community. After published in 2009, the HFB database has been applied by tens of research groups and widely used for Near infrared vs. Visible light (NIR-VIS) face recognition. Despite its success the HFB database has two disadvantages: a limited nu...
Recently, methods with learning procedure have been widely used to solve person re-identification (re-id) problem. However, most existing databases for re-id are smallscale, therefore, over-fitting is likely to occur. To further improve the performance, we propose a novel method by fusing multiple local features and exploring their structural infor...
Person-specific face tracking is a challenging task for the trackers which only focus on the appearance of the target face, because distraction always happens and the identity is difficult to maintain. In this paper, we design a framework combining an off-line detector, an on-line tracker and an online recognizer to complete the tracking of person-...
Despite the success in the last two decades, the state-of-the-art face detectors still have problems in dealing with images in the wild for the large appearance variations. Instead of taking appearance variations as black boxes and leaving them to statistical learning algorithms, we propose a structural face model to explicitly represent them. Our...
In this paper, we propose a method to apply the popular cascade classifier
into face recognition to improve the computational efficiency while keeping
high recognition rate. In large scale face recognition systems, because the
probability of feature templates coming from different subjects is very high,
most of the matching pairs will be rejected b...
Liveness detection is an indispensable guarantee for reliable face recognition, which has recently received enormous attention. In this paper we propose three scenic clues, which are non-rigid motion, face-background consistency and imaging banding effect, to conduct accurate and efficient face liveness detection. Non-rigid motion clue indicates th...
Visual tracking is a challenging problem, because the target frequently change its appearance, randomly move its location and get occluded by other objects in unconstrained environments. The state changes of the target are temporally and spatially continuous, in this paper therefore, a robust Spatio-Temporal structural context based Tracker (STT) i...
Although numerous online learning strategies have been proposed to handle the appearance variation in visual tracking, the existing methods just perform well in certain cases since they lack effective appearance learning mechanism. In this paper, a joint model tracker (JMT) is presented, which consists of a generative model based on Multiple Subspa...
People counting is one of the key components in video surveillance applications, however, due to occlusion, illumination, color and texture variation, the problem is far from being solved. Different from traditional visible camera based systems, we construct a novel system that uses vertical Kinect sensor for people counting, where the depth inform...
Local binary pattern (LBP) and its variants are effective descriptors for face recognition. The traditional LBP like features are extracted based on the original pixel or patch values of images. In this paper, we propose to learn the discriminative image filter to improve the discriminant power of the LBP like feature. The basic idea is after the i...
Recent state-of-the-art algorithms have achieved good performance on normal pedestrian detection tasks. However, pedestrian detection in crowded scenes is still challenging due to the significant appearance variation caused by heavy occlusions and complex spatial interactions. In this paper we propose a unified probabilistic framework to globally d...
Explosive growth of surveillance video data presents formidable challenges to its browsing, retrieval and storage. Video synopsis, an innovation proposed by Peleg and his colleagues, is aimed for fast browsing by shortening the video into a synopsis while keeping activities in video captured by a camera. However, the current techniques are offline...
This letter addresses the problem of face detection in multispectral illuminations. Face detection in visible images has been well addressed based on the large scale training samples. For the recently emerging multispectral face biometrics, however, the face data is scarce and expensive to collect, and it is usually short of face samples to train a...
Coupled spectral regression (CSR) is an effective framework for heterogeneous face recognition (e.g., visual light (VIS) vs. near infrared (NIR)). CSR aims to learn different projections for different face modalities respectively to find a common subspace where the samples of different modalities from the same class are as close as possible. In ori...
Heterogeneous Face Recognition (HFR) refers recognition of face images captured in different modalities, e.g. Visual (VIS), near infrared (NIR) and thermal infrared (TIR). Although heterogeneous face images of a given person differ by pixel values, the identity of the face should be classified as the same. This paper focuses on NIR-VIS HFR. Light S...
Face antispoofing has now attracted intensive attention, aiming to assure the reliability of face biometrics. We notice that currently most of face antispoofing databases focus on data with little variations, which may limit the generalization performance of trained models since potential attacks in real world are probably more complex. In this pap...
Spoofing identities using photographs is one of the most common techniques to attack 2-D face recognition systems. There seems to exist no comparative studies of different techniques using the same protocols and data. The motivation behind this competition is to compare the performance of different state-of-the-art algorithms on the same database u...
In this paper, we present a method for detecting individuals in crowd by clustering a group of feature points belonging to the same person. In our approach, a feature point is considered to contain three attributes: the motion trajectory in video sequence, the sparse local appearance around point in current frame, and the structure relationship wit...
Eye localization is an important part in face recogni-tion system, because its precision closely affects the per-formance of face recognition. Although various methods have already achieved high precision on the face images with high quality, their precision will drop on low quality images. In this paper, we propose a robust eye localization method...
Linear discriminant analysis with nearest neighborhood classifier (LDA + NN) has been commonly used in face recognition, but it often confronts with two problems in real applications: (1) it cannot incrementally deal with the information of training instances; (2) it cannot achieve fast search against large scale gallery set. In this paper, we use...
Low resolution (LR) is an important issue when handling real world face recognition problems. The performance of traditional recognition algorithms will drop drastically due to the loss of facial texture information in original high resolution (HR) images. To address this problem, in this paper we propose an effective approach named Simultaneous Di...
Existing face liveness detection algorithms adopt behavioural challenge-response methods that require user cooperation. To be verified live, users are expected to obey some user unfriendly requirement. In this paper, we present a multispectral face liveness detection method, which is user cooperation free. Moreover, the system is adaptive to variou...
Occlusion of eyeglasses, and strong specular reflections on eyeglasses (especially in near infrared (NIR) images), can deteriorate face recognition performance. In this paper, we present a novel method to overcome these problems. The proposed method applies the sparse representation (SR) technique in a local feature space so as to be more tolerant...
Near infrared (NIR) face recognition has been a successful technology for overcoming illumination changes in face recognition.
With years of development, NIR face recognition been in practical use with success and products have appeared in the market.
In this chapter, we introduce the NIR face recognition approach, describe the design of active NIR...
Face alignment and recognition in less controlled environment are one of the most essential bottlenecks for practical face recognition system. Recently several researches have focused on partial face recognition problem, but few works have addressed the problem of face alignment under partial occlusion. In this paper, we present a robust face align...
The latest multi-biometric grand challenge (MBGC 2008) sets up a new experiment in which near infrared (NIR) face videos containing partial faces are used as a probe set and the visual (VIS) images of full faces are used as the target set. This is challenging for two reasons: (1) it has to deal with partially occluded faces in the NIR videos, and (...
Face images captured in different spectral bands, e.g., in visual (VIS) and near infrared (NIR), are said to be heterogeneous. Although a person’s face looks different in heterogeneous
images, it should be classified as being from the same individual. In this paper, we present a new method, called face analogy, in the analysis-by-synthesis framewor...
Heterogeneous face images come from different lighting conditions or different imaging devices, such as visible light (VIS)
and near infrared (NIR) based. Because heterogeneous face images can have different skin spectra-optical properties, direct
appearance based matching is no longer appropriate for solving the problem. Hence we need to find faci...
Linear discriminant analysis (LDA) is a popular method in pattern recognition and is equivalent to Bayesian method when the
sample distributions of different classes are obey to the Gaussian with the same covariance matrix. However, in real world,
the distribution of data is usually far more complex and the assumption of Gaussian density with the s...
This paper deals with a new problem in face recognition research, in which the enrollment and query face samples are captured under different lighting conditions. In our case, the enrollment samples are visual light (VIS) images, whereas the query samples are taken under near infrared (NIR) condition. It is very difficult to directly match the face...
Face recognition at a distance (FRAD) is one of the most challenging forms of face recognition applications. In this chapter,
we analyze issues in FRAD system design, which are not addressed in near-distance face recognition, and present effective
solutions for making FRAD systems for practical deployments. Evaluation of FRAD systems is discussed.
A face device is a system to acquire a set of digital data samples representing a human face. As the human face is a complex 3D object, the data can be in several forms: a 2D image where the gray levels of the pixels represent the projected reflectance of the face surface under visible illumination; a 2D image where the gray levels of the pixels re...
In recent years, 3D face recognition has obtained much attention. Using 2D face image as probe and 3D face data as gallery is an alternative method to deal with computation complexity, expensive equipment and fussy pretreatment in 3D face recognition systems. In this paper we propose a learning based 2D-3D face matching method using the CCA to lear...
In this paper, we present a robust and accurate system for outdoor (as well as indoor) face recognition, based on a recently developed enhanced near-infrared (ENIR) imaging device. Using a narrow band NIR laser generator instead of LED lights for active frontal illumination, the ENIR device can provide face images of good quality even under sunligh...
In many applications, such as E-Passport and driver’s license, the enrollment of face templates is done using visible light (VIS) face images. Such images are normally acquired in controlled environment where the lighting is approximately frontal. However, Authentication is done in variable lighting conditions. Matching of faces in VIS images taken...