ICPR2020 Paper Browser

Paper download is intended for registered attendees only, and is subjected to the IEEE Copyright Policy. Any other use is strongly forbidden.

Efficient-Receptive Field Block with Group Spatial Attention Mechanism for Object Detection

Jiacheng Zhang, Zhicheng Zhao, Fei Su

Auto-TLDR; E-RFB: Efficient-Receptive Field Block for Deep Neural Network for Object Detection

Abstract Slides Poster

Object detection has been paid rising attention in computer vision field. Convolutional Neural Networks (CNNs) extract high-level semantic features of images, which directly determine the performance of object detection. As a common solution, embedding integration modules into CNNs can enrich extracted features and thereby improve the performance. However, the instability and inconsistency of internal multiple branches exist in these modules. To address this problem, we propose a novel multibranch module called Efficient-Receptive Field Block (E-RFB), in which multiple levels of features are combined for network optimization. Specifically, by downsampling and increasing depth, the E-RFB provides sufficient RF. Second, in order to eliminate the inconsistency across different branches, a novel spatial attention mechanism, namely, Group Spatial Attention Module (GSAM) is proposed. The GSAM gradually narrows a feature map by channel grouping; thus it encodes the information between spatial and channel dimensions into the final attention heat map. Third, the proposed module can be easily joined in various CNNs to enhance feature representation as a plug-and-play component. With SSD-style detectors, our method halves the parameters of the original detection head and achieves high accuracy on the PASCAL VOC and MS COCO datasets. Moreover, the proposed method achieves superior performance compared with state-of-the-art methods based on similar framework.

Similar papers

Bidirectional Matrix Feature Pyramid Network for Object Detection

Wei Xu, Yi Gan, Jianbo Su

Auto-TLDR; BMFPN: Bidirectional Matrix Feature Pyramid Network for Object Detection

Efficient-Receptive Field Block with Group Spatial Attention Mechanism for Object Detection

Similar papers

Bidirectional Matrix Feature Pyramid Network for Object Detection

SFPN: Semantic Feature Pyramid Network for Object Detection

Enhanced Feature Pyramid Network for Semantic Segmentation

Small Object Detection by Generative and Discriminative Learning

ScarfNet: Multi-Scale Features with Deeply Fused and Redistributed Semantics for Enhanced Object Detection

P2 Net: Augmented Parallel-Pyramid Net for Attention Guided Pose Estimation

Attention Pyramid Module for Scene Recognition

CenterRepp: Predict Central Representative Point Set's Distribution for Detection

Object Detection Model Based on Scene-Level Region Proposal Self-Attention

Context-Aware Residual Module for Image Classification

A Novel Region of Interest Extraction Layer for Instance Segmentation

Forground-Guided Vehicle Perception Framework

ACRM: Attention Cascade R-CNN with Mix-NMS for Metallic Surface Defect Detection

Object Detection Using Dual Graph Network

Learning a Dynamic High-Resolution Network for Multi-Scale Pedestrian Detection

PSDNet: A Balanced Architecture of Accuracy and Parameters for Semantic Segmentation

PRF-Ped: Multi-Scale Pedestrian Detector with Prior-Based Receptive Field

GSTO: Gated Scale-Transfer Operation for Multi-Scale Feature Learning in Semantic Segmentation

Attention As Activation

Boundary-Aware Graph Convolution for Semantic Segmentation

EDD-Net: An Efficient Defect Detection Network

Global-Local Attention Network for Semantic Segmentation in Aerial Images

Triplet-Path Dilated Network for Detection and Segmentation of General Pathological Images

Dual-Attention Guided Dropblock Module for Weakly Supervised Object Localization

Efficient High-Resolution High-Level-Semantic Representation Learning for Human Pose Estimation

Spatial-Related and Scale-Aware Network for Crowd Counting

A Modified Single-Shot Multibox Detector for Beyond Real-Time Object Detection

MagnifierNet: Learning Efficient Small-Scale Pedestrian Detector towards Multiple Dense Regions

Construction Worker Hardhat-Wearing Detection Based on an Improved BiFPN

Cascade Saliency Attention Network for Object Detection in Remote Sensing Images

Transitional Asymmetric Non-Local Neural Networks for Real-World Dirt Road Segmentation

Hierarchical Head Design for Object Detectors

Improved Residual Networks for Image and Video Recognition

Object Detection in the DCT Domain: Is Luminance the Solution?

Video Object Detection Using Object's Motion Context and Spatio-Temporal Feature Aggregation

Dynamic Multi-Path Neural Network

SyNet: An Ensemble Network for Object Detection in UAV Images

Real-Time Semantic Segmentation Via Region and Pixel Context Network

Multi-Direction Convolution for Semantic Segmentation

Tiny Object Detection in Aerial Images

Encoder-Decoder Based Convolutional Neural Networks with Multi-Scale-Aware Modules for Crowd Counting

NAS-EOD: An End-To-End Neural Architecture Search Method for Efficient Object Detection

An Improved Bilinear Pooling Method for Image-Based Action Recognition

StrongPose: Bottom-up and Strong Keypoint Heat Map Based Pose Estimation

Image-Based Table Cell Detection: A New Dataset and an Improved Detection Method

Slimming ResNet by Slimming Shortcut

Second-Order Attention Guided Convolutional Activations for Visual Recognition

Attention Stereo Matching Network