ICPR2020 Paper Browser

Paper download is intended for registered attendees only and is subject to the IEEE Copyright Policy. Any other use is strictly forbidden.

P2 Net: Augmented Parallel-Pyramid Net for Attention Guided Pose Estimation

Luanxuan Hou, Jie Cao, Yuan Zhao, Haifeng Shen, Jian Tang, Ran He

Auto-TLDR; Parallel-Pyramid Net with Partial Attention for Human Pose Estimation

The goal of human pose estimation is to determine the body parts and joint locations of the persons in an image. Angular changes, motion blur, occlusion and similar effects in natural scenes make this task challenging, and some joints are harder to detect than others. In this paper, we propose an augmented Parallel-Pyramid Net (P^2Net) with a partial attention module. For data preprocessing, we propose a differentiable auto data augmentation (DA^2) method in which sequences of data augmentations are formulated as a trainable and operational Convolutional Neural Network (CNN) component. DA^2 improves training efficiency and effectiveness. A parallel pyramid structure follows to compensate for the information loss introduced by the network. To address information loss in the backbone network, we optimize the backbone by adopting a new parallel structure that does not increase the overall computational complexity. To further refine the predictions once the global predictions are complete, a Partial Attention Module (PAM) is defined to extract weighted features from the feature maps of different scales generated by the parallel pyramid structure. Compared with traditional up-sampling refinement, PAM better captures the relationship between channels. Experiments corroborate the effectiveness of the proposed method. Notably, our method achieves the best performance on the challenging MSCOCO and MPII datasets.
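
The abstract does not specify how PAM computes its channel weights, so the following is only a generic illustration of channel-wise feature re-weighting in the style of squeeze-and-excitation, not the authors' module. The function name `channel_attention` and the weight matrices `w1` and `w2` are hypothetical, and the sketch handles a single feature map rather than the multi-scale maps the abstract describes.

```python
import math


def channel_attention(features, w1, w2):
    """Re-weight channels with a squeeze-and-excitation style gate.

    features: list of C channels, each an H x W list of lists of floats.
    w1: R x C matrix (channel reduction); w2: C x R matrix (expansion).
    Both weight matrices are hypothetical stand-ins for learned parameters.
    Returns (gates, scaled_features).
    """
    # Squeeze: global average pooling gives one descriptor per channel.
    squeezed = [
        sum(sum(row) for row in ch) / (len(ch) * len(ch[0]))
        for ch in features
    ]
    # Excite: linear -> ReLU -> linear -> sigmoid, mixing information
    # across channels so each gate depends on all channel descriptors.
    hidden = [
        max(0.0, sum(w * s for w, s in zip(row, squeezed)))
        for row in w1
    ]
    gates = [
        1.0 / (1.0 + math.exp(-sum(w * h for w, h in zip(row, hidden))))
        for row in w2
    ]
    # Scale: multiply every value in a channel by that channel's gate.
    scaled = [
        [[g * v for v in row] for row in ch]
        for g, ch in zip(gates, features)
    ]
    return gates, scaled


# Two 2x2 channels with toy weights (R = 1 hidden unit).
feats = [
    [[1.0, 1.0], [1.0, 1.0]],
    [[3.0, 3.0], [3.0, 3.0]],
]
gates, scaled = channel_attention(feats, w1=[[1.0, 0.0]], w2=[[0.0], [1.0]])
```

Because every gate is computed from all channel descriptors, the re-weighting can express dependencies between channels, which plain up-sampling of a feature map cannot.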

Similar papers

StrongPose: Bottom-up and Strong Keypoint Heat Map Based Pose Estimation

Niaz Ahmad, Jongwon Yoon

Auto-TLDR; StrongPose: A bottom-up box-free approach for human pose estimation and action recognition

Simple Multi-Resolution Representation Learning for Human Pose Estimation

Efficient High-Resolution High-Level-Semantic Representation Learning for Human Pose Estimation

Learning a Dynamic High-Resolution Network for Multi-Scale Pedestrian Detection

Bidirectional Matrix Feature Pyramid Network for Object Detection

Efficient-Receptive Field Block with Group Spatial Attention Mechanism for Object Detection

SFPN: Semantic Feature Pyramid Network for Object Detection

Light3DPose: Real-Time Multi-Person 3D Pose Estimation from Multiple Views

Tilting at Windmills: Data Augmentation for Deep Pose Estimation Does Not Help with Occlusions

Object Detection Model Based on Scene-Level Region Proposal Self-Attention

NAS-EOD: An End-To-End Neural Architecture Search Method for Efficient Object Detection

Efficient Grouping for Keypoint Detection

Exploring Severe Occlusion: Multi-Person 3D Pose Estimation with Gated Convolution

HPERL: 3D Human Pose Estimation from RGB and LiDAR

Small Object Detection by Generative and Discriminative Learning

Forground-Guided Vehicle Perception Framework

A Novel Region of Interest Extraction Layer for Instance Segmentation

Hybrid Cascade Point Search Network for High Precision Bar Chart Component Detection

MagnifierNet: Learning Efficient Small-Scale Pedestrian Detector towards Multiple Dense Regions

ScarfNet: Multi-Scale Features with Deeply Fused and Redistributed Semantics for Enhanced Object Detection

Construction Worker Hardhat-Wearing Detection Based on an Improved BiFPN

CenterRepp: Predict Central Representative Point Set's Distribution for Detection

Real-Time Semantic Segmentation Via Region and Pixel Context Network

Human-Centric Parsing Network for Human-Object Interaction Detection

CASNet: Common Attribute Support Network for Image Instance and Panoptic Segmentation

Triplet-Path Dilated Network for Detection and Segmentation of General Pathological Images

GSTO: Gated Scale-Transfer Operation for Multi-Scale Feature Learning in Semantic Segmentation

SyNet: An Ensemble Network for Object Detection in UAV Images

PRF-Ped: Multi-Scale Pedestrian Detector with Prior-Based Receptive Field

Cascade Saliency Attention Network for Object Detection in Remote Sensing Images

Nighttime Pedestrian Detection Based on Feature Attention and Transformation

PEAN: 3D Hand Pose Estimation Adversarial Network

Enhanced Feature Pyramid Network for Semantic Segmentation

Detecting Objects with High Object Region Percentage

A Grid-Based Representation for Human Action Recognition

DualBox: Generating BBox Pair with Strong Correspondence Via Occlusion Pattern Clustering and Proposal Refinement

EDD-Net: An Efficient Defect Detection Network

Feature Embedding Based Text Instance Grouping for Largely Spaced and Occluded Text Detection

Attentive Part-Aware Networks for Partial Person Re-Identification

End-To-End Deep Learning Methods for Automated Damage Detection in Extreme Events at Various Scales

Scene Text Detection with Selected Anchors

RefiNet: 3D Human Pose Refinement with Depth Maps

Neural Architecture Search for Image Super-Resolution Using Densely Connected Search Space: DeCoNAS

Attention Pyramid Module for Scene Recognition

Object Detection Using Dual Graph Network

Orthographic Projection Linear Regression for Single Image 3D Human Pose Estimation

Attention-Oriented Action Recognition for Real-Time Human-Robot Interaction

Mobile Phone Surface Defect Detection Based on Improved Faster R-CNN