ICPR2020 Paper Browser

Paper download is intended for registered attendees only, and is subjected to the IEEE Copyright Policy. Any other use is strongly forbidden.

Disentangled Representation Learning for Controllable Image Synthesis: An Information-Theoretic Perspective

Shichang Tang, Xu Zhou, Xuming He, Yi Ma

Auto-TLDR; Controllable Image Synthesis in Deep Generative Models using Variational Auto-Encoder

Abstract Slides Poster

In this paper, we look into the problem of disentangled representation learning and controllable image synthesis in a deep generative model. We develop an encoder-decoder architecture for a variant of the Variational Auto-Encoder (VAE) with two latent codes $z_1$ and $z_2$. Our framework uses $z_2$ to capture specified factors of variation while $z_1$ captures the complementary factors of variation. To this end, we analyze the learning problem from the perspective of multivariate mutual information, derive optimizable lower bounds of the conditional mutual information in the image synthesis processes and incorporate them into the training objective. We validate our method empirically on the Color MNIST dataset and the CelebA dataset by showing controllable image syntheses. Our proposed paradigm is simple yet effective and is applicable to many situations, including those where there is not an explicit factorization of features available, or where the features are non-categorical.

Similar papers

AVAE: Adversarial Variational Auto Encoder

Antoine Plumerault, Hervé Le Borgne, Celine Hudelot

Auto-TLDR; Combining VAE and GAN for Realistic Image Generation

Disentangled Representation Learning for Controllable Image Synthesis: An Information-Theoretic Perspective

Similar papers

AVAE: Adversarial Variational Auto Encoder

High Resolution Face Age Editing

Local Facial Attribute Transfer through Inpainting

Variational Deep Embedding Clustering by Augmented Mutual Information Maximization

Semantics-Guided Representation Learning with Applications to Visual Synthesis

Mutual Information Based Method for Unsupervised Disentanglement of Video Representation

Reducing the Variance of Variational Estimates of Mutual Information by Limiting the Critic's Hypothesis Space to RKHS

GAN-Based Gaussian Mixture Model Responsibility Learning

Disentangle, Assemble, and Synthesize: Unsupervised Learning to Disentangle Appearance and Location

Unsupervised Face Manipulation Via Hallucination

Learning Interpretable Representation for 3D Point Clouds

Learning Low-Shot Generative Networks for Cross-Domain Data

Galaxy Image Translation with Semi-Supervised Noise-Reconstructed Generative Adversarial Networks

Multi-Domain Image-To-Image Translation with Adaptive Inference Graph

Continuous Learning of Face Attribute Synthesis

Learning Disentangled Representations for Identity Preserving Surveillance Face Camouflage

A Joint Representation Learning and Feature Modeling Approach for One-Class Recognition

Mask-Based Style-Controlled Image Synthesis Using a Mask Style Encoder

Generative Deep-Neural-Network Mixture Modeling with Semi-Supervised MinMax+EM Learning

Controllable Face Aging

SATGAN: Augmenting Age Biased Dataset for Cross-Age Face Recognition

Dual-MTGAN: Stochastic and Deterministic Motion Transfer for Image-To-Video Synthesis

Attributes Aware Face Generation with Generative Adversarial Networks

Auto Encoding Explanatory Examples with Stochastic Paths

Exemplar Guided Cross-Spectral Face Hallucination Via Mutual Information Disentanglement

Coherence and Identity Learning for Arbitrary-Length Face Video Generation

Variational Inference with Latent Space Quantization for Adversarial Resilience

Phase Retrieval Using Conditional Generative Adversarial Networks

Ω-GAN: Object Manifold Embedding GAN for Image Generation by Disentangling Parameters into Pose and Shape Manifolds

Semantic-Guided Inpainting Network for Complex Urban Scenes Manipulation

IDA-GAN: A Novel Imbalanced Data Augmentation GAN

Interpreting the Latent Space of GANs Via Correlation Analysis for Controllable Concept Manipulation

Group-Wise Feature Orthogonalization and Suppression for GAN Based Facial Attribute Translation

Interpolation in Auto Encoders with Bridge Processes

Unsupervised Contrastive Photo-To-Caricature Translation Based on Auto-Distortion

Generative Latent Implicit Conditional Optimization When Learning from Small Sample

Variational Capsule Encoder

Separation of Aleatoric and Epistemic Uncertainty in Deterministic Deep Neural Networks

Feature-Aware Unsupervised Learning with Joint Variational Attention and Automatic Clustering

GAP: Quantifying the Generative Adversarial Set and Class Feature Applicability of Deep Neural Networks

Generating Private Data Surrogates for Vision Related Tasks

Image Representation Learning by Transformation Regression

Learning to Take Directions One Step at a Time

On the Evaluation of Generative Adversarial Networks by Discriminative Models

Local-Global Interactive Network for Face Age Transformation

Identity-Preserved Face Beauty Transformation with Conditional Generative Adversarial Networks

The Role of Cycle Consistency for Generating Better Human Action Videos from a Single Frame

UCCTGAN: Unsupervised Clothing Color Transformation Generative Adversarial Network