ICPR2020 Paper Browser

Paper download is intended for registered attendees only, and is subjected to the IEEE Copyright Policy. Any other use is strongly forbidden.

Selective Kernel and Motion-Emphasized Loss Based Attention-Guided Network for HDR Imaging of Dynamic Scenes

Yipeng Deng, Qin Liu, Takeshi Ikenaga

Auto-TLDR; SK-AHDRNet: A Deep Network with attention module and motion-emphasized loss function to produce ghost-free HDR images

Abstract Slides Poster

Ghost-like artifacts caused by ill-exposed and motion areas is one of the most challenging problems in high dynamic range (HDR) image reconstruction.When the motion range is small, previous methods based on optical flow or patch-match can suppress ghost-like artifacts by first aligning input images before merging them.However, they are not robust enough and still produce artifacts for challenging scenes where large foreground motions exist.To this end, we propose a deep network with attention module and motion-emphasized loss function to produce ghost-free HDR images. In attention module, we use channel and spatial attention to guide network to emphasize important components such as motion and saturated areas automatically. With the purpose of being robust to images with different resolutions and objects with distinct scale, we adopt the selective kernel network as the basic framework for channel attention. In addition to the attention module, the motion-emphasized loss function based on the motion and ill-exposed areas mask is designed to help network reconstruct motion areas. Experiments on the public dataset indicate that the proposed SK-AHDRNet produces ghost-free results where detail in ill-exposed areas is well recovered. The proposed method scores 43.17 with PSNR metric and 61.02 with HDR-VDP-2 metric on test which outperforms all conventional works. According to quantitative and qualitative evaluations, the proposed method can achieve state-of-the-art performance.

Similar papers

Single Image Deblurring Using Bi-Attention Network

Yaowei Li, Ye Luo, Jianwei Lu

Auto-TLDR; Bi-Attention Neural Network for Single Image Deblurring

Abstract Poster Similar

Recently, deep convolutional neural networks have been extensively applied into image deblurring and have achieved remarkable performance. However, most CNN-based image deblurring methods focus on simply increasing network depth, neglecting the contextual information of the blurred image and the reconstructed image. Meanwhile, most encoder-decoder based methods rarely exploit encoder's multi-layer features. To address these issues, we propose a bi-attention neural network for single image deblurring, which mainly consists of a bi-attention network and a feature fusion network. Specifically, two criss-cross attention modules are plugged before and after the encoder-decoder to capture long-range spatial contextual information in the blurred image and the reconstructed image simultaneously, and the feature fusion network combines multi-layer features from encoder to enable the decoder reconstruct the image with multi-scale features. The whole network is end-to-end trainable. Quantitative and qualitative experiment results validate that the proposed network outperforms state-of-the-art methods in terms of PSNR and SSIM on benchmark datasets.

Video Reconstruction by Spatio-Temporal Fusion of Blurred-Coded Image Pair

Anupama S, Prasan Shedligeri, Abhishek Pal, Kaushik Mitr

Auto-TLDR; Recovering Video from Motion-Blurred and Coded Exposure Images Using Deep Learning

Selective Kernel and Motion-Emphasized Loss Based Attention-Guided Network for HDR Imaging of Dynamic Scenes

Similar papers

Single Image Deblurring Using Bi-Attention Network

Video Reconstruction by Spatio-Temporal Fusion of Blurred-Coded Image Pair

Free-Form Image Inpainting Via Contrastive Attention Network

Multi-Scale Residual Pyramid Attention Network for Monocular Depth Estimation

An Improved Bilinear Pooling Method for Image-Based Action Recognition

Wavelet Attention Embedding Networks for Video Super-Resolution

Global-Local Attention Network for Semantic Segmentation in Aerial Images

RSAN: Residual Subtraction and Attention Network for Single Image Super-Resolution

Automatical Enhancement and Denoising of Extremely Low-Light Images

Attention Stereo Matching Network

Makeup Style Transfer on Low-Quality Images with Weighted Multi-Scale Attention

Deep Residual Attention Network for Hyperspectral Image Reconstruction

Explorable Tone Mapping Operators

Deeply-Fused Attentive Network for Stereo Matching

Dual-Attention Guided Dropblock Module for Weakly Supervised Object Localization

CSpA-DN: Channel and Spatial Attention Dense Network for Fusing PET and MRI Images

Joint Face Alignment and 3D Face Reconstruction with Efficient Convolution Neural Networks

Progressive Scene Segmentation Based on Self-Attention Mechanism

Arbitrary Style Transfer with Parallel Self-Attention

Boosting High-Level Vision with Joint Compression Artifacts Reduction and Super-Resolution

PSDNet: A Balanced Architecture of Accuracy and Parameters for Semantic Segmentation

Spatial-Related and Scale-Aware Network for Crowd Counting

Single Image Super-Resolution with Dynamic Residual Connection

Cross-Layer Information Refining Network for Single Image Super-Resolution

TSMSAN: A Three-Stream Multi-Scale Attentive Network for Video Saliency Detection

Video Lightening with Dedicated CNN Architecture

Attention As Activation

SECI-GAN: Semantic and Edge Completion for Dynamic Objects Removal

Efficient-Receptive Field Block with Group Spatial Attention Mechanism for Object Detection

Face Super-Resolution Network with Incremental Enhancement of Facial Parsing Information

Multi-Scale and Attention Based ResNet for Heartbeat Classification

PHNet: Parasite-Host Network for Video Crowd Counting

Dynamic Guided Network for Monocular Depth Estimation

DID: A Nested Dense in Dense Structure with Variable Local Dense Blocks for Super-Resolution Image Reconstruction

Learning from Web Data: Improving Crowd Counting Via Semi-Supervised Learning

Residual Fractal Network for Single Image Super Resolution by Widening and Deepening

Attention Pyramid Module for Scene Recognition

DARN: Deep Attentive Refinement Network for Liver Tumor Segmentation from 3D CT Volume

Context-Aware Residual Module for Image Classification

VGG-Embedded Adaptive Layer-Normalized Crowd Counting Net with Scale-Shuffling Modules

Attentive Hybrid Feature Based a Two-Step Fusion for Facial Expression Recognition

Enhanced Feature Pyramid Network for Semantic Segmentation

Two-Level Attention-Based Fusion Learning for RGB-D Face Recognition

Deep Realistic Novel View Generation for City-Scale Aerial Images

Multi-Branch Attention Networks for Classifying Galaxy Clusters

ACRM: Attention Cascade R-CNN with Mix-NMS for Metallic Surface Defect Detection

3D Attention Mechanism for Fine-Grained Classification of Table Tennis Strokes Using a Twin Spatio-Temporal Convolutional Neural Networks

Do Not Treat Boundaries and Regions Differently: An Example on Heart Left Atrial Segmentation