ICPR2020 Paper Browser

Paper download is intended for registered attendees only, and is subjected to the IEEE Copyright Policy. Any other use is strongly forbidden.

Utilising Visual Attention Cues for Vehicle Detection and Tracking

Feiyan Hu, Venkatesh Gurram Munirathnam, Noel E O'Connor, Alan Smeaton, Suzanne Little

Auto-TLDR; Visual Attention for Object Detection and Tracking in Driver-Assistance Systems

Abstract Slides Poster

Advanced Driver-Assistance Systems (ADAS) have been attracting attention from many researchers. Vision based sensors are the closest way to emulate human driver visual behavior while driving. In this paper, we explore possible ways to use visual attention (saliency) for object detection and tracking. We investigate: 1) How a visual attention map such as a subjectness attention or saliency map and an objectness attention map can facilitate region proposal generation in a 2-stage object detector; 2) How a visual attention map can be used for tracking multiple objects. We propose a neural network that can simultaneously detect objects as and generate objectness and subjectness maps to save computational power. We further exploit the visual attention map during tracking using a sequential Monte Carlo probability hypothesis density (PHD) filter. The experiments are conducted on KITTI and DETRAC datasets. The use of visual attention and hierarchical features has shown a considerable improvement of≈8% in object detection which effectively increased tracking performance by≈4% on KITTI dataset.

Similar papers

FastSal: A Computationally Efficient Network for Visual Saliency Prediction

Feiyan Hu, Kevin Mcguinness

Auto-TLDR; MobileNetV2: A Convolutional Neural Network for Saliency Prediction

Utilising Visual Attention Cues for Vehicle Detection and Tracking

Similar papers

FastSal: A Computationally Efficient Network for Visual Saliency Prediction

SynDHN: Multi-Object Fish Tracker Trained on Synthetic Underwater Videos

Forground-Guided Vehicle Perception Framework

HPERL: 3D Human Pose Estimastion from RGB and LiDAR

Detecting Objects with High Object Region Percentage

Detective: An Attentive Recurrent Model for Sparse Object Detection

Foreground-Focused Domain Adaption for Object Detection

Cascade Saliency Attention Network for Object Detection in Remote Sensing Images

Nighttime Pedestrian Detection Based on Feature Attention and Transformation

MagnifierNet: Learning Efficient Small-Scale Pedestrian Detector towards Multiple Dense Regions

Scene Text Detection with Selected Anchors

Object Detection Model Based on Scene-Level Region Proposal Self-Attention

Learning a Dynamic High-Resolution Network for Multi-Scale Pedestrian Detection

A Modified Single-Shot Multibox Detector for Beyond Real-Time Object Detection

SyNet: An Ensemble Network for Object Detection in UAV Images

Yolo+FPN: 2D and 3D Fused Object Detection with an RGB-D Camera

Siamese Fully Convolutional Tracker with Motion Correction

Video Object Detection Using Object's Motion Context and Spatio-Temporal Feature Aggregation

MAGNet: Multi-Region Attention-Assisted Grounding of Natural Language Queries at Phrase Level

Coarse to Fine: Progressive and Multi-Task Learning for Salient Object Detection

Iterative Bounding Box Annotation for Object Detection

Object Detection on Monocular Images with Two-Dimensional Canonical Correlation Analysis

IPT: A Dataset for Identity Preserved Tracking in Closed Domains

Construction Worker Hardhat-Wearing Detection Based on an Improved BiFPN

SFPN: Semantic Feature Pyramid Network for Object Detection

AerialMPTNet: Multi-Pedestrian Tracking in Aerial Imagery Using Temporal and Graphical Features

A Novel Region of Interest Extraction Layer for Instance Segmentation

FeatureNMS: Non-Maximum Suppression by Learning Feature Embeddings

Vehicle Lane Merge Visual Benchmark

Neural Compression and Filtering for Edge-assisted Real-time Object Detection in Challenged Networks

TSDM: Tracking by SiamRPN++ with a Depth-Refiner and a Mask-Generator

A Fast and Accurate Object Detector for Handwritten Digit String Recognition

Exploiting Distilled Learning for Deep Siamese Tracking

An Adaptive Fusion Model Based on Kalman Filtering and LSTM for Fast Tracking of Road Signs

CASNet: Common Attribute Support Network for Image Instance and Panoptic Segmentation

DAL: A Deep Depth-Aware Long-Term Tracker

Early Wildfire Smoke Detection in Videos

One-Stage Multi-Task Detector for 3D Cardiac MR Imaging

ACRM: Attention Cascade R-CNN with Mix-NMS for Metallic Surface Defect Detection

Small Object Detection by Generative and Discriminative Learning

Tracking Fast Moving Objects by Segmentation Network

Compact and Discriminative Multi-Object Tracking with Siamese CNNs

DualBox: Generating BBox Pair with Strong Correspondence Via Occlusion Pattern Clustering and Proposal Refinement

Dynamic Low-Light Image Enhancement for Object Detection Via End-To-End Training

SiamMT: Real-Time Arbitrary Multi-Object Tracking

Thermal Image Enhancement Using Generative Adversarial Network for Pedestrian Detection

Collaborative Human Machine Attention Module for Character Recognition

Correlation-Based ConvNet for Small Object Detection in Videos