ICPR2020 Paper Browser

Paper download is intended for registered attendees only, and is subjected to the IEEE Copyright Policy. Any other use is strongly forbidden.

Free-Form Image Inpainting Via Contrastive Attention Network

Xin Ma, Xiaoqiang Zhou, Huaibo Huang, Zhenhua Chai, Xiaolin Wei, Ran He

Auto-TLDR; Self-supervised Siamese inference for image inpainting

Abstract Slides

Most deep learning based image inpainting approaches adopt autoencoder or its variants to fill missing regions in images. Encoders are usually utilized to learn powerful representational spaces, which are important for dealing with sophisticated learning tasks. Specifically, in the image inpainting task, masks with any shapes can appear anywhere in images (i.e., free-form masks) forming complex patterns. It is difficult for encoders to capture such powerful representations under this complex situation. To tackle this problem, we propose a self-supervised Siamese inference network to improve the robustness and generalization. Moreover, the restored image usually can not be harmoniously integrated into the exiting content, especially in the boundary area. To address this problem, we propose a novel Dual Attention Fusion module (DAF), which can combine both the restored and known regions in a smoother way and be inserted into decoder layers in a plug-and-play way. DAF is developed to not only adaptively rescale channel-wise features by taking interdependencies between channels into account but also force deep convolutional neural networks (CNNs) focusing more on unknown regions. In this way, the unknown region will be naturally filled from the outside to the inside. Qualitative and quantitative experiments on multiple datasets, including facial and natural datasets (i.e., Celeb-HQ, Pairs Street View, Places2 and ImageNet), demonstrate that our proposed method outperforms against state-of-the-arts in generating high-quality inpainting results.

Similar papers

Image Inpainting with Contrastive Relation Network

Xiaoqiang Zhou, Junjie Li, Zilei Wang, Ran He, Tieniu Tan

Auto-TLDR; Two-Stage Inpainting with Graph-based Relation Network

Abstract Slides Similar

Image inpainting faces the challenging issue of the requirements on structure reasonableness and texture coherence. In this paper, we propose a two-stage inpainting framework to address this issue. The basic idea is to address the two requirements in two separate stages. Completed segmentation of the corrupted image is firstly predicted through segmentation reconstruction network, while fine-grained image details are restored in the second stage through an image generator. The two stages are connected in series as the image details are generated under the guidance of completed segmentation map that predicted in the first stage. Specifically, in the second stage, we propose a novel graph-based relation network to model the relationship existed in corrupted image. In relation network, both intra-relationship for pixels in the same semantic region and inter-relationship between different semantic parts are considered, improving the consistency and compatibility of image textures. Besides, contrastive loss is designed to facilitate the relation network training. Such a framework not only simplifies the inpainting problem directly, but also exploits the relationship in corrupted image explicitly. Extensive experiments on various public datasets quantitatively and qualitatively demonstrate the superiority of our approach compared with the state-of-the-art.

SECI-GAN: Semantic and Edge Completion for Dynamic Objects Removal

Francesco Pinto, Andrea Romanoni, Matteo Matteucci, Phil Torr

Auto-TLDR; SECI-GAN: Semantic and Edge Conditioned Inpainting Generative Adversarial Network

Free-Form Image Inpainting Via Contrastive Attention Network

Similar papers

Image Inpainting with Contrastive Relation Network

SECI-GAN: Semantic and Edge Completion for Dynamic Objects Removal

Local Facial Attribute Transfer through Inpainting

Semantic-Guided Inpainting Network for Complex Urban Scenes Manipulation

A GAN-Based Blind Inpainting Method for Masonry Wall Images

Mask-Based Style-Controlled Image Synthesis Using a Mask Style Encoder

Boosting High-Level Vision with Joint Compression Artifacts Reduction and Super-Resolution

Single Image Deblurring Using Bi-Attention Network

Learning Image Inpainting from Incomplete Images using Self-Supervision

Learning Disentangled Representations for Identity Preserving Surveillance Face Camouflage

Dynamic Guided Network for Monocular Depth Estimation

Unsupervised Contrastive Photo-To-Caricature Translation Based on Auto-Distortion

SIDGAN: Single Image Dehazing without Paired Supervision

GarmentGAN: Photo-Realistic Adversarial Fashion Transfer

Let's Play Music: Audio-Driven Performance Video Generation

Global-Local Attention Network for Semantic Segmentation in Aerial Images

Residual Fractal Network for Single Image Super Resolution by Widening and Deepening

UCCTGAN: Unsupervised Clothing Color Transformation Generative Adversarial Network

Face Super-Resolution Network with Incremental Enhancement of Facial Parsing Information

Attributes Aware Face Generation with Generative Adversarial Networks

Arbitrary Style Transfer with Parallel Self-Attention

Few-Shot Font Generation with Deep Metric Learning

Multi-Laplacian GAN with Edge Enhancement for Face Super Resolution

Future Urban Scenes Generation through Vehicles Synthesis

Selective Kernel and Motion-Emphasized Loss Based Attention-Guided Network for HDR Imaging of Dynamic Scenes

Dual-MTGAN: Stochastic and Deterministic Motion Transfer for Image-To-Video Synthesis

Adaptive Image Compression Using GAN Based Semantic-Perceptual Residual Compensation

Novel View Synthesis from a 6-DoF Pose by Two-Stage Networks

High Resolution Face Age Editing

Interactive Style Space of Deep Features and Style Innovation

Multi-Scale Residual Pyramid Attention Network for Monocular Depth Estimation

Progressive Splitting and Upscaling Structure for Super-Resolution

Dual-Attention Guided Dropblock Module for Weakly Supervised Object Localization

Unsupervised Face Manipulation Via Hallucination

Cascade Attention Guided Residue Learning GAN for Cross-Modal Translation

Cycle-Consistent Adversarial Networks and Fast Adaptive Bi-Dimensional Empirical Mode Decomposition for Style Transfer

Cross-Layer Information Refining Network for Single Image Super-Resolution

Progressive Scene Segmentation Based on Self-Attention Mechanism

Deep Universal Blind Image Denoising

VITON-GT: An Image-Based Virtual Try-On Model with Geometric Transformations

PSDNet: A Balanced Architecture of Accuracy and Parameters for Semantic Segmentation

Continuous Learning of Face Attribute Synthesis

RSAN: Residual Subtraction and Attention Network for Single Image Super-Resolution

Detail-Revealing Deep Low-Dose CT Reconstruction

An Unsupervised Approach towards Varying Human Skin Tone Using Generative Adversarial Networks

Detail Fusion GAN: High-Quality Translation for Unpaired Images with GAN-Based Data Augmentation

Local-Global Interactive Network for Face Age Transformation

Exemplar Guided Cross-Spectral Face Hallucination Via Mutual Information Disentanglement