ICPR2020 Paper Browser

Paper download is intended for registered attendees only, and is subjected to the IEEE Copyright Policy. Any other use is strongly forbidden.

SFPN: Semantic Feature Pyramid Network for Object Detection

Yi Gan, Wei Xu, Jianbo Su

Auto-TLDR; SFPN: Semantic Feature Pyramid Network to Address Information Dilution Issue in FPN

Abstract Slides Poster

Feature Pyramid Network(FPN) employs a top-down path to enhance low level feature by utilizing high level feature.However, further improvement of detector is greatly hindered by the inner defect of FPN. The dilution issue in FPN is analyzed in this paper, and a new architecture named Semantic Feature Pyramid Network(SFPN) is introduced to address the information imbalance problem caused by information dilution. The proposed method consists of two simple and effective components: Semantic Pyramid Module(SPM) and Semantic Feature Fusion Module(SFFM). To compensate for the weaknesses of FPN, the semantic segmentation result is utilized as an extra information source in our architecture.By constructing a semantic pyramid based on the segmentation result and fusing it with FPN, feature maps at each level can obtain the necessary information without suffering from the dilution issue. The proposed architecture could be applied on many detectors, and non-negligible improvement could be achieved. Although this method is designed for object detection, other tasks such as instance segmentation can also largely benefit from it. The proposed method brings Faster R-CNN and Mask R-CNN with ResNet-50 as backbone both 1.8 AP improvements respectively. Furthermore, SFPN improves Cascade R-CNN with backbone ResNet-101 from 42.4 AP to 43.5 AP.

Similar papers

Bidirectional Matrix Feature Pyramid Network for Object Detection

Wei Xu, Yi Gan, Jianbo Su

Auto-TLDR; BMFPN: Bidirectional Matrix Feature Pyramid Network for Object Detection

SFPN: Semantic Feature Pyramid Network for Object Detection

Similar papers

Bidirectional Matrix Feature Pyramid Network for Object Detection

Small Object Detection by Generative and Discriminative Learning

A Novel Region of Interest Extraction Layer for Instance Segmentation

ScarfNet: Multi-Scale Features with Deeply Fused and Redistributed Semantics for Enhanced Object Detection

Learning a Dynamic High-Resolution Network for Multi-Scale Pedestrian Detection

Forground-Guided Vehicle Perception Framework

Object Detection Model Based on Scene-Level Region Proposal Self-Attention

MagnifierNet: Learning Efficient Small-Scale Pedestrian Detector towards Multiple Dense Regions

Triplet-Path Dilated Network for Detection and Segmentation of General Pathological Images

Construction Worker Hardhat-Wearing Detection Based on an Improved BiFPN

Efficient-Receptive Field Block with Group Spatial Attention Mechanism for Object Detection

EDD-Net: An Efficient Defect Detection Network

Cascade Saliency Attention Network for Object Detection in Remote Sensing Images

SyNet: An Ensemble Network for Object Detection in UAV Images

PRF-Ped: Multi-Scale Pedestrian Detector with Prior-Based Receptive Field

Tiny Object Detection in Aerial Images

P2 Net: Augmented Parallel-Pyramid Net for Attention Guided Pose Estimation

CenterRepp: Predict Central Representative Point Set's Distribution for Detection

Enhanced Feature Pyramid Network for Semantic Segmentation

Detecting Objects with High Object Region Percentage

Hybrid Cascade Point Search Network for High Precision Bar Chart Component Detection

CASNet: Common Attribute Support Network for Image Instance and Panoptic Segmentation

Deep Real-Time Hand Detection Using CFPN on Embedded Systems

Nighttime Pedestrian Detection Based on Feature Attention and Transformation

Scene Text Detection with Selected Anchors

Boundary-Aware Graph Convolution for Semantic Segmentation

Multiple-Step Sampling for Dense Object Detection and Counting

Dynamic Low-Light Image Enhancement for Object Detection Via End-To-End Training

StrongPose: Bottom-up and Strong Keypoint Heat Map Based Pose Estimation

Object Detection Using Dual Graph Network

DualBox: Generating BBox Pair with Strong Correspondence Via Occlusion Pattern Clustering and Proposal Refinement

FourierNet: Compact Mask Representation for Instance Segmentation Using Differentiable Shape Decoders

Video Object Detection Using Object's Motion Context and Spatio-Temporal Feature Aggregation

Siamese Dynamic Mask Estimation Network for Fast Video Object Segmentation

End-To-End Deep Learning Methods for Automated Damage Detection in Extreme Events at Various Scales

Feature Embedding Based Text Instance Grouping for Largely Spaced and Occluded Text Detection

ACRM: Attention Cascade R-CNN with Mix-NMS for Metallic Surface Defect Detection

Hierarchical Head Design for Object Detectors

Yolo+FPN: 2D and 3D Fused Object Detection with an RGB-D Camera

Adaptive Word Embedding Module for Semantic Reasoning in Large-Scale Detection

PSDNet: A Balanced Architecture of Accuracy and Parameters for Semantic Segmentation

Image-Based Table Cell Detection: A New Dataset and an Improved Detection Method

Foreground-Focused Domain Adaption for Object Detection

Real-Time Semantic Segmentation Via Region and Pixel Context Network

GSTO: Gated Scale-Transfer Operation for Multi-Scale Feature Learning in Semantic Segmentation

Mobile Phone Surface Defect Detection Based on Improved Faster R-CNN

Small Object Detection Leveraging on Simultaneous Super-Resolution

CAggNet: Crossing Aggregation Network for Medical Image Segmentation