Figure 2 - uploaded by Guoqing Wang
Content may be subject to copyright.
The overview of our approach for cross-domain PAD. Our approach consists of a disentangled representation learning module (DR-Net) and a multi-domain feature learning module (MD-Net). With the face images from different domains as inputs, DR-Net can learn a pair of encoders for disentangled features for PAD and subject classification respectively. The disentangled features are fed to MD-Net to learn domain-independent representations for robust cross-domain PAD.
Source publication
Face presentation attack detection (PAD) has been an urgent problem to be solved in the face recognition systems. Conventional approaches usually assume the testing and training are within the same domain; as a result, they may not generalize well into unseen scenarios because the representations learned for PAD may overfit to the subjects in the t...
Contexts in source publication
Context 1
... aim to build a robust cross-domain face PAD model that can mitigate the impact of the subjects, shooting environment and camera settings from the source domain face images. To achieve this goal, we propose an efficient disentangled representation learning for cross-domain face PAD. As shown in Fig. 2, our approach consists of a disentangled representation learning module (DR-Net) and a multidomain feature learning module (MD-Net). DR-Net leverages generative models (i.e., PAD-GAN and ID-GAN in Fig. 2) to learn a pair of encoders for learning disentangled features for subject classification and PAD respectively in each source ...
Context 2
... domain face images. To achieve this goal, we propose an efficient disentangled representation learning for cross-domain face PAD. As shown in Fig. 2, our approach consists of a disentangled representation learning module (DR-Net) and a multidomain feature learning module (MD-Net). DR-Net leverages generative models (i.e., PAD-GAN and ID-GAN in Fig. 2) to learn a pair of encoders for learning disentangled features for subject classification and PAD respectively in each source domain. Then, the disentangled features from multiple domains are fed to MD-Net to learn domainindependent features for the final cross-domain face ...
Context 3
... aim to build a robust cross-domain face PAD model that can mitigate the impact of the subjects, shooting environment and camera settings from the source domain face images. To achieve this goal, we propose an efficient disentangled representation learning for cross-domain face PAD. As shown in Fig. 2, our approach consists of a disentangled representation learning module (DR-Net) and a multidomain feature learning module (MD-Net). DR-Net leverages generative models (i.e., PAD-GAN and ID-GAN in Fig. 2) to learn a pair of encoders for learning disentangled features for subject classification and PAD respectively in each source ...
Context 4
... domain face images. To achieve this goal, we propose an efficient disentangled representation learning for cross-domain face PAD. As shown in Fig. 2, our approach consists of a disentangled representation learning module (DR-Net) and a multidomain feature learning module (MD-Net). DR-Net leverages generative models (i.e., PAD-GAN and ID-GAN in Fig. 2) to learn a pair of encoders for learning disentangled features for subject classification and PAD respectively in each source domain. Then, the disentangled features from multiple domains are fed to MD-Net to learn domainindependent features for the final cross-domain face ...
Similar publications
Neural latent variable models enable the discovery of interesting structure in speech audio data. This paper presents a comparison of two different approaches which are broadly based on predicting future time-steps or auto-encoding the input signal. Our study compares the representations learned by vq-vae and vq-wav2vec in terms of sub-word unit di...
With the development of face image synthesis and generation technology based on generative adversarial networks (GANs), it has become a research hotspot to determine whether a given face image is natural or generated. However, the generalization capability of the existing algorithms is still to be improved. Therefore, this paper proposes a general...
Specifying surface reflectances in a simple and perceptually informative way would be beneficial for many areas of research and application. We assessed whether a ${3} \times {3}$ 3 × 3 matrix may be used to approximate how a surface reflectance modulates the sensory color signal across illuminants. We tested whether observers could discriminate be...
Deep convolutional neural networks (CNNs) are widely used in face recognition, because they can extract features with higher discrimination, which is the basis for correctly identifying the identity of a face image. In order to improve the face recognition performance, in addition to improving the structures of convolutional neural networks, many n...
[This corrects the article DOI: 10.1371/journal.pone.0232551.].