Spatio-Temporal Normalization Architecture

Source publication
Audio-to-video generation has applications across industry verticals including filmmaking, multimedia, marketing, and education. High-quality video generation with expressive facial movements is challenging because it involves complex learning steps for generative adversarial networks. Further, e...

Context in source publication

Context 1
... p_i is described in Figure 3. We take the L1 loss of the eye aspect ratio (EAR) between the real image m_r and the synthesized frame m_g. ...
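
The excerpt above does not show how EAR is computed from the landmarks p_i, so the following is only a minimal sketch of an EAR-based L1 loss. It assumes the commonly used six-landmark definition, EAR = (|p2 - p6| + |p3 - p5|) / (2 |p1 - p4|), with p1 and p4 at the eye corners; the source publication's Figure 3 may use a different layout. The function names `eye_aspect_ratio` and `ear_l1_loss` are hypothetical and do not come from the publication.

```python
import math

def eye_aspect_ratio(pts):
    """Return the EAR for six (x, y) eye landmarks p1..p6.

    Assumed ordering (not confirmed by the source): p1 and p4 are the
    eye corners, p2 and p3 lie on the upper lid, p6 and p5 on the lower lid.
    """
    p1, p2, p3, p4, p5, p6 = pts
    vertical = math.dist(p2, p6) + math.dist(p3, p5)   # two lid-to-lid distances
    horizontal = 2.0 * math.dist(p1, p4)               # twice the corner-to-corner width
    return vertical / horizontal

def ear_l1_loss(real_eyes, gen_eyes):
    """Mean absolute EAR difference between paired real and generated eyes."""
    diffs = [abs(eye_aspect_ratio(r) - eye_aspect_ratio(g))
             for r, g in zip(real_eyes, gen_eyes)]
    return sum(diffs) / len(diffs)

# Illustrative landmarks only: an open eye in the real frame m_r
# and a nearly closed eye in the synthesized frame m_g.
open_eye = [(0, 0), (1, 1), (2, 1), (3, 0), (2, -1), (1, -1)]
closed_eye = [(0, 0), (1, 0.25), (2, 0.25), (3, 0), (2, -0.25), (1, -0.25)]
loss = ear_l1_loss([open_eye], [closed_eye])
```

A loss of this kind penalizes a generated frame whose eyes are more open or more closed than in the real frame, which is one plausible way to encourage realistic blinking. In a training pipeline the landmarks would come from a differentiable or pre-computed landmark detector, which is outside the scope of this sketch.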
