ICPR2020 Paper Browser

Paper download is intended for registered attendees only, and is subjected to the IEEE Copyright Policy. Any other use is strongly forbidden.

Coherence and Identity Learning for Arbitrary-Length Face Video Generation

Shuquan Ye, Chu Han, Jiaying Lin, Guoqiang Han, Shengfeng He

Auto-TLDR; Face Video Synthesis Using Identity-Aware GAN and Face Coherence Network

Abstract Slides Poster

Face synthesis is an interesting yet challenging task in computer vision. It is even much harder to generate a portrait video than a single image. In this paper, we propose a novel video generation framework for synthesizing arbitrary-length face videos without any face exemplar or landmark. To overcome the synthesis ambiguity of face video, we propose a divide-and-conquer strategy to separately address the video face synthesis problem from two aspects, face identity synthesis and rearrangement. To this end, we design a cascaded network which contains three components, Identity-aware GAN (IA-GAN), Face Coherence Network, and Interpolation Network. IA-GAN is proposed to synthesize photorealistic faces with the same identity from a set of noises. Face Coherence Network is designed to re-arrange the faces generated by IA-GAN while keeping the inter-frame coherence. Interpolation Network is introduced to eliminate the discontinuity between two adjacent frames and improve the smoothness of the face video. Experimental results demonstrate that our proposed network is able to generate face video with high visual quality while preserving the identity. Statistics show that our method outperforms state-of-the-art unconditional face video generative models in multiple challenging datasets.

Similar papers

Unsupervised Face Manipulation Via Hallucination

Keerthy Kusumam, Enrique Sanchez, Georgios Tzimiropoulos

Auto-TLDR; Unpaired Face Image Manipulation using Autoencoders

Coherence and Identity Learning for Arbitrary-Length Face Video Generation

Similar papers

Unsupervised Face Manipulation Via Hallucination

Learning Disentangled Representations for Identity Preserving Surveillance Face Camouflage

High Resolution Face Age Editing

Dual-MTGAN: Stochastic and Deterministic Motion Transfer for Image-To-Video Synthesis

The Role of Cycle Consistency for Generating Better Human Action Videos from a Single Frame

Let's Play Music: Audio-Driven Performance Video Generation

Attributes Aware Face Generation with Generative Adversarial Networks

Talking Face Generation Via Learning Semantic and Temporal Synchronous Landmarks

Controllable Face Aging

Continuous Learning of Face Attribute Synthesis

Contrastive Data Learning for Facial Pose and Illumination Normalization

Multi-Laplacian GAN with Edge Enhancement for Face Super Resolution

Learning to Take Directions One Step at a Time

Local-Global Interactive Network for Face Age Transformation

Identity-Preserved Face Beauty Transformation with Conditional Generative Adversarial Networks

Age Gap Reducer-GAN for Recognizing Age-Separated Faces

SATGAN: Augmenting Age Biased Dataset for Cross-Age Face Recognition

Unsupervised Disentangling of Viewpoint and Residues Variations by Substituting Representations for Robust Face Recognition

Disentangled Representation Learning for Controllable Image Synthesis: An Information-Theoretic Perspective

Deep Multi-Task Learning for Facial Expression Recognition and Synthesis Based on Selective Feature Sharing

Local Facial Attribute Transfer through Inpainting

Pixel-based Facial Expression Synthesis

Learning Low-Shot Generative Networks for Cross-Domain Data

AVAE: Adversarial Variational Auto Encoder

Facial Expression Recognition by Using a Disentangled Identity-Invariant Expression Representation

A Quantitative Evaluation Framework of Video De-Identification Methods

Fidelity-Controllable Extreme Image Compression with Generative Adversarial Networks

Free-Form Image Inpainting Via Contrastive Attention Network

TinyVIRAT: Low-Resolution Video Action Recognition

Exemplar Guided Cross-Spectral Face Hallucination Via Mutual Information Disentanglement

Video Face Manipulation Detection through Ensemble of CNNs

Generative Latent Implicit Conditional Optimization When Learning from Small Sample

Unsupervised Contrastive Photo-To-Caricature Translation Based on Auto-Distortion

Multi-Domain Image-To-Image Translation with Adaptive Inference Graph

Learning Semantic Representations Via Joint 3D Face Reconstruction and Facial Attribute Estimation

Disentangle, Assemble, and Synthesize: Unsupervised Learning to Disentangle Appearance and Location

Ω-GAN: Object Manifold Embedding GAN for Image Generation by Disentangling Parameters into Pose and Shape Manifolds

Semantic-Guided Inpainting Network for Complex Urban Scenes Manipulation

GarmentGAN: Photo-Realistic Adversarial Fashion Transfer

Image Inpainting with Contrastive Relation Network

Novel View Synthesis from a 6-DoF Pose by Two-Stage Networks

Motion-Supervised Co-Part Segmentation

Self-Supervised Learning of Dynamic Representations for Static Images

Pose Variation Adaptation for Person Re-Identification

Two-Stream Temporal Convolutional Network for Dynamic Facial Attractiveness Prediction

Unsupervised Learning of Landmarks Based on Inter-Intra Subject Consistencies

Cascade Attention Guided Residue Learning GAN for Cross-Modal Translation

Detecting Manipulated Facial Videos: A Time Series Solution