ICPR2020 Paper Browser

Paper download is intended for registered attendees only, and is subjected to the IEEE Copyright Policy. Any other use is strongly forbidden.

SyNet: An Ensemble Network for Object Detection in UAV Images

Berat Mert Albaba, Sedat Ozer

Auto-TLDR; SyNet: Combining Multi-Stage and Single-Stage Object Detection for Aerial Images

Abstract Poster

Recent advances in camera equipped drone applications and their widespread use increased the demand on vision based object detection algorithms for aerial images. Object detection process is inherently a challenging task as a generic computer vision problem, however, since the use of object detection algorithms on UAVs (or on drones) is relatively a new area, it remains as a more challenging problem to detect objects in aerial images. There are several reasons for that including: (i) the lack of large drone datasets including large object variance, (ii) the large orientation and scale variance in drone images when compared to the ground images, and (iii) the difference in texture and shape features between the ground and the aerial images. Deep learning based object detection algorithms can be classified under two main categories: (a) single-stage detectors and (b) multi-stage detectors. Both single-stage and multi-stage solutions have their advantages and disadvantages over each other. However, a technique to combine the good sides of each of those solutions could yield even a stronger solution than each of those solutions individually. In this paper, we propose an ensemble network, SyNet, that combines a multi-stage method with a single-stage one with the motivation of decreasing the high false negative rate of multi-stage detectors and increasing the quality of the single-stage detector proposals. As building blocks, CenterNet and Cascade R-CNN with pretrained feature extractors are utilized along with an ensembling strategy. We report the state of the art results obtained by our proposed solution on two different datasets: namely MS-COCO and visDrone with \%52.1 $mAP_{IoU = 0.75}$ is obtained on MS-COCO $val2017$ dataset and \%26.2 $mAP_{IoU = 0.75}$ is obtained on VisDrone $test-set$. Our code is available at: https://github.com/mertalbaba/SyNet}{https://github.com/mer talbaba/SyNet

Similar papers

Tiny Object Detection in Aerial Images

Jinwang Wang, Wen Yang, Haowen Guo, Ruixiang Zhang, Gui-Song Xia

Auto-TLDR; Tiny Object Detection in Aerial Images Using Multiple Center Points Based Learning Network

Abstract Slides Similar

Object detection in Earth Vision has achieved great progress in recent years. However, tiny object detection in aerial images remains a very challenging problem since the tiny objects contain a small number of pixels and are easily confused with the background. To advance tiny object detection research in aerial images, we present a new dataset for Tiny Object Detection in Aerial Images (AI-TOD). Specifically, AI-TOD comes with 700,621 object instances for eight categories across 28,036 aerial images. Compared to existing object detection datasets in aerial images, the mean size of objects in AI-TOD is about 12.8 pixels, which is much smaller than others. To build a benchmark for tiny object detection in aerial images, we evaluate the state-of-the-art object detectors on our AI-TOD dataset. Experimental results show that direct application of these approaches on AI-TOD produces suboptimal object detection results, thus new specialized detectors for tiny object detection need to be designed. Therefore, we propose a multiple center points based learning network (M-CenterNet) to improve the localization performance of tiny object detection, and experimental results show the significant performance gain over the competitors.

Bidirectional Matrix Feature Pyramid Network for Object Detection

Wei Xu, Yi Gan, Jianbo Su

Auto-TLDR; BMFPN: Bidirectional Matrix Feature Pyramid Network for Object Detection

SyNet: An Ensemble Network for Object Detection in UAV Images

Similar papers

Tiny Object Detection in Aerial Images

Bidirectional Matrix Feature Pyramid Network for Object Detection

Detecting Objects with High Object Region Percentage

SFPN: Semantic Feature Pyramid Network for Object Detection

Small Object Detection by Generative and Discriminative Learning

CenterRepp: Predict Central Representative Point Set's Distribution for Detection

Detective: An Attentive Recurrent Model for Sparse Object Detection

Forground-Guided Vehicle Perception Framework

MagnifierNet: Learning Efficient Small-Scale Pedestrian Detector towards Multiple Dense Regions

Hierarchical Head Design for Object Detectors

Multiple-Step Sampling for Dense Object Detection and Counting

ScarfNet: Multi-Scale Features with Deeply Fused and Redistributed Semantics for Enhanced Object Detection

A Novel Region of Interest Extraction Layer for Instance Segmentation

Construction Worker Hardhat-Wearing Detection Based on an Improved BiFPN

StrongPose: Bottom-up and Strong Keypoint Heat Map Based Pose Estimation

FeatureNMS: Non-Maximum Suppression by Learning Feature Embeddings

Object Detection in the DCT Domain: Is Luminance the Solution?

EAGLE: Large-Scale Vehicle Detection Dataset in Real-World Scenarios Using Aerial Imagery

FourierNet: Compact Mask Representation for Instance Segmentation Using Differentiable Shape Decoders

Cascade Saliency Attention Network for Object Detection in Remote Sensing Images

CASNet: Common Attribute Support Network for Image Instance and Panoptic Segmentation

HPERL: 3D Human Pose Estimastion from RGB and LiDAR

A Modified Single-Shot Multibox Detector for Beyond Real-Time Object Detection

Scene Text Detection with Selected Anchors

Object Detection Using Dual Graph Network

CDeC-Net: Composite Deformable Cascade Network for Table Detection in Document Images

Hybrid Cascade Point Search Network for High Precision Bar Chart Component Detection

DualBox: Generating BBox Pair with Strong Correspondence Via Occlusion Pattern Clustering and Proposal Refinement

Yolo+FPN: 2D and 3D Fused Object Detection with an RGB-D Camera

Iterative Bounding Box Annotation for Object Detection

ACRM: Attention Cascade R-CNN with Mix-NMS for Metallic Surface Defect Detection

Multi-View Object Detection Using Epipolar Constraints within Cluttered X-Ray Security Imagery

Efficient-Receptive Field Block with Group Spatial Attention Mechanism for Object Detection

Simple Multi-Resolution Representation Learning for Human Pose Estimation

Vision-Based Layout Detection from Scientific Literature Using Recurrent Convolutional Neural Networks

Automatically Gather Address Specific Dwelling Images Using Google Street View

One-Stage Multi-Task Detector for 3D Cardiac MR Imaging

Small Object Detection Leveraging on Simultaneous Super-Resolution

Deep Real-Time Hand Detection Using CFPN on Embedded Systems

Efficient Grouping for Keypoint Detection

Object Detection Model Based on Scene-Level Region Proposal Self-Attention

Learning a Dynamic High-Resolution Network for Multi-Scale Pedestrian Detection

Correlation-Based ConvNet for Small Object Detection in Videos

End-To-End Deep Learning Methods for Automated Damage Detection in Extreme Events at Various Scales

Convolutional STN for Weakly Supervised Object Localization

Object Detection on Monocular Images with Two-Dimensional Canonical Correlation Analysis

P2 Net: Augmented Parallel-Pyramid Net for Attention Guided Pose Estimation

PRF-Ped: Multi-Scale Pedestrian Detector with Prior-Based Receptive Field