ICPR2020 Paper Browser

Paper download is intended for registered attendees only, and is subjected to the IEEE Copyright Policy. Any other use is strongly forbidden.

Context-Aware Residual Module for Image Classification

Jing Bai, Ran Chen

Auto-TLDR; Context-Aware Residual Module for Image Classification

Abstract Slides Poster

Attention module has achieved great success in numerous vision tasks. However, existing visual attention modules generally consider the features of a single-scale, and cannot make full use of their multi-scale contextual information. Meanwhile, the multi-scale spatial feature representation has demonstrated its outstanding performance in a wide range of applications. However, the multi-scale features are always represented in a layer-wise manner, i.e. it is impossible to know their contextual information at a granular level. Focusing on the above issue, a context-aware residual module for image classification is proposed in this paper. It consists of a novel multi-scale channel attention module MSCAM to learn refined channel weights by considering the visual features of its own scale and its surrounding fields, and a multi-scale spatial aware module MSSAM to further capture more spatial information. Either or both of the two modules can be plugged into any CNN-based backbone image classification architecture with a short residual connection to obtain the context-aware enhanced features. The experiments on public image recognition datasets including CIFAR10, CIFAR100，Tiny-ImageNet and ImageNet consistently demonstrate that our proposed modules significantly outperforms a wide-used state-of-the-art methods, e.g., ResNet and the lightweight networks of MobileNet and SqueezeeNet.

Similar papers

Attention Pyramid Module for Scene Recognition

Zhinan Qiao, Xiaohui Yuan, Chengyuan Zhuang, Abolfazl Meyarian

Auto-TLDR; Attention Pyramid Module for Multi-Scale Scene Recognition

Context-Aware Residual Module for Image Classification

Similar papers

Attention Pyramid Module for Scene Recognition

PSDNet: A Balanced Architecture of Accuracy and Parameters for Semantic Segmentation

Efficient-Receptive Field Block with Group Spatial Attention Mechanism for Object Detection

Attention As Activation

Dual-Attention Guided Dropblock Module for Weakly Supervised Object Localization

Enhanced Feature Pyramid Network for Semantic Segmentation

Global-Local Attention Network for Semantic Segmentation in Aerial Images

Transitional Asymmetric Non-Local Neural Networks for Real-World Dirt Road Segmentation

Real-Time Semantic Segmentation Via Region and Pixel Context Network

Hierarchically Aggregated Residual Transformation for Single Image Super Resolution

Single Image Super-Resolution with Dynamic Residual Connection

Fast and Accurate Real-Time Semantic Segmentation with Dilated Asymmetric Convolutions

Dynamic Multi-Path Neural Network

Multi-Scale Residual Pyramid Attention Network for Monocular Depth Estimation

Attention Stereo Matching Network

Improved Residual Networks for Image and Video Recognition

GSTO: Gated Scale-Transfer Operation for Multi-Scale Feature Learning in Semantic Segmentation

Efficient High-Resolution High-Level-Semantic Representation Learning for Human Pose Estimation

Residual Fractal Network for Single Image Super Resolution by Widening and Deepening

Boundary-Aware Graph Convolution for Semantic Segmentation

Wavelet Attention Embedding Networks for Video Super-Resolution

Ordinal Depth Classification Using Region-Based Self-Attention

Spatial-Related and Scale-Aware Network for Crowd Counting

RSAN: Residual Subtraction and Attention Network for Single Image Super-Resolution

An Improved Bilinear Pooling Method for Image-Based Action Recognition

Second-Order Attention Guided Convolutional Activations for Visual Recognition

ResFPN: Residual Skip Connections in Multi-Resolution Feature Pyramid Networks for Accurate Dense Pixel Matching

Single Image Deblurring Using Bi-Attention Network

Multi-Direction Convolution for Semantic Segmentation

LiNet: A Lightweight Network for Image Super Resolution

Cross-Layer Information Refining Network for Single Image Super-Resolution

Dynamic Guided Network for Monocular Depth Estimation

Deeply-Fused Attentive Network for Stereo Matching

Collaborative Human Machine Attention Module for Character Recognition

Feature-Dependent Cross-Connections in Multi-Path Neural Networks

Boosting High-Level Vision with Joint Compression Artifacts Reduction and Super-Resolution

SCA Net: Sparse Channel Attention Module for Action Recognition

Bidirectional Matrix Feature Pyramid Network for Object Detection

Learnable Higher-Order Representation for Action Recognition

Delivering Meaningful Representation for Monocular Depth Estimation

Encoder-Decoder Based Convolutional Neural Networks with Multi-Scale-Aware Modules for Crowd Counting

Efficient Super Resolution by Recursive Aggregation

Accurate Cell Segmentation in Digital Pathology Images Via Attention Enforced Networks

Contextual Classification Using Self-Supervised Auxiliary Models for Deep Neural Networks

TSMSAN: A Three-Stream Multi-Scale Attentive Network for Video Saliency Detection

P2 Net: Augmented Parallel-Pyramid Net for Attention Guided Pose Estimation

Progressive Scene Segmentation Based on Self-Attention Mechanism

Adaptive Image Compression Using GAN Based Semantic-Perceptual Residual Compensation