ICPR2020 Paper Browser

Paper download is intended for registered attendees only, and is subjected to the IEEE Copyright Policy. Any other use is strongly forbidden.

PRF-Ped: Multi-Scale Pedestrian Detector with Prior-Based Receptive Field

Yuzhi Tan, Hongxun Yao, Haoran Li, Xiusheng Lu, Haozhe Xie

Auto-TLDR; Bidirectional Feature Enhancement Module for Multi-Scale Pedestrian Detection

Abstract Slides Poster

Multi-scale feature representation is a common strategy to handle the scale variation in pedestrian detection. Existing methods simply utilize the convolutional pyramidal features for multi-scale representation. However, they rarely pay attention to the differences among different feature scales and extract multi-scale features from a single feature map, which may make the detectors sensitive to scale-variance in multi-scale pedestrian detection. In this paper, we introduce a bidirectional feature enhancement module (BFEM) to augment the semantic information of low-level features and the localization information of high-level features. In addition, we propose a prior-based receptive field block (PRFB) for multi-scale pedestrian feature extraction, where the receptive field is closer to the aspect ratio of the pedestrian target. Consequently, it is less affected by the surrounding background when extracting features. Experimental results indicate that the proposed method outperform the state-of-the-art methods on the CityPersons and Caltech datasets.

Similar papers

Learning a Dynamic High-Resolution Network for Multi-Scale Pedestrian Detection

Mengyuan Ding, Shanshan Zhang, Jian Yang

Auto-TLDR; Learningable Dynamic HRNet for Pedestrian Detection

PRF-Ped: Multi-Scale Pedestrian Detector with Prior-Based Receptive Field

Similar papers

Learning a Dynamic High-Resolution Network for Multi-Scale Pedestrian Detection

Nighttime Pedestrian Detection Based on Feature Attention and Transformation

Bidirectional Matrix Feature Pyramid Network for Object Detection

MagnifierNet: Learning Efficient Small-Scale Pedestrian Detector towards Multiple Dense Regions

Construction Worker Hardhat-Wearing Detection Based on an Improved BiFPN

CenterRepp: Predict Central Representative Point Set's Distribution for Detection

Mutual-Supervised Feature Modulation Network for Occluded Pedestrian Detection

SFPN: Semantic Feature Pyramid Network for Object Detection

Efficient-Receptive Field Block with Group Spatial Attention Mechanism for Object Detection

DualBox: Generating BBox Pair with Strong Correspondence Via Occlusion Pattern Clustering and Proposal Refinement

Convolutional STN for Weakly Supervised Object Localization

Triplet-Path Dilated Network for Detection and Segmentation of General Pathological Images

Cascade Saliency Attention Network for Object Detection in Remote Sensing Images

Scene Text Detection with Selected Anchors

EDD-Net: An Efficient Defect Detection Network

ScarfNet: Multi-Scale Features with Deeply Fused and Redistributed Semantics for Enhanced Object Detection

HPERL: 3D Human Pose Estimastion from RGB and LiDAR

Small Object Detection by Generative and Discriminative Learning

Forground-Guided Vehicle Perception Framework

Efficient High-Resolution High-Level-Semantic Representation Learning for Human Pose Estimation

Detecting Objects with High Object Region Percentage

Encoder-Decoder Based Convolutional Neural Networks with Multi-Scale-Aware Modules for Crowd Counting

Temporal Feature Enhancement Network with External Memory for Object Detection in Surveillance Video

Spatial-Related and Scale-Aware Network for Crowd Counting

Tiny Object Detection in Aerial Images

SyNet: An Ensemble Network for Object Detection in UAV Images

Object Detection Model Based on Scene-Level Region Proposal Self-Attention

P2 Net: Augmented Parallel-Pyramid Net for Attention Guided Pose Estimation

Detective: An Attentive Recurrent Model for Sparse Object Detection

VTT: Long-Term Visual Tracking with Transformers

Attention Pyramid Module for Scene Recognition

ACRM: Attention Cascade R-CNN with Mix-NMS for Metallic Surface Defect Detection

VGG-Embedded Adaptive Layer-Normalized Crowd Counting Net with Scale-Shuffling Modules

Enhanced Feature Pyramid Network for Semantic Segmentation

Dynamic Low-Light Image Enhancement for Object Detection Via End-To-End Training

A Novel Region of Interest Extraction Layer for Instance Segmentation

Yolo+FPN: 2D and 3D Fused Object Detection with an RGB-D Camera

Siamese Dynamic Mask Estimation Network for Fast Video Object Segmentation

You Ought to Look Around: Precise, Large Span Action Detection

GSTO: Gated Scale-Transfer Operation for Multi-Scale Feature Learning in Semantic Segmentation

HANet: Hybrid Attention-Aware Network for Crowd Counting

Hierarchical Head Design for Object Detectors

TCATD: Text Contour Attention for Scene Text Detection

Deep Real-Time Hand Detection Using CFPN on Embedded Systems

CDeC-Net: Composite Deformable Cascade Network for Table Detection in Document Images

Feature Pyramid Hierarchies for Multi-Scale Temporal Action Detection

Enhanced Vote Network for 3D Object Detection in Point Clouds

Real-Time Semantic Segmentation Via Region and Pixel Context Network