ICPR2020 Paper Browser

Paper download is intended for registered attendees only, and is subjected to the IEEE Copyright Policy. Any other use is strongly forbidden.

MFST: Multi-Features Siamese Tracker

Zhenxi Li, Guillaume-Alexandre Bilodeau, Wassim Bouachir

Auto-TLDR; Multi-Features Siamese Tracker for Robust Deep Similarity Tracking

Abstract Slides

Siamese trackers have recently achieved interesting results due to their balanced accuracy-speed. This success is mainly due to the fact that deep similarity networks were specifically designed to address the image similarity problem. Therefore, they are inherently more appropriate than classical CNNs for the tracking task. However, Siamese trackers rely on the last convolutional layers for similarity analysis and target search, which restricts their performance. In this paper, we argue that using a single convolutional layer as feature representation is not the optimal choice within the deep similarity framework, as multiple convolutional layers provide several abstraction levels in characterizing an object. Starting from this motivation, we present the Multi-Features Siamese Tracker (MFST), a novel tracking algorithm exploiting several hierarchical feature maps for robust deep similarity tracking. MFST proceeds by fusing hierarchical features to ensure a richer and more efficient representation. Moreover, we handle appearance variation by calibrating deep features extracted from two different CNN models. Based on this advanced feature representation, our algorithm achieves high tracking accuracy, while outperforming several state-of-the-art trackers, including standard Siamese trackers.

Similar papers

Exploiting Distilled Learning for Deep Siamese Tracking

Chengxin Liu, Zhiguo Cao, Wei Li, Yang Xiao, Shuaiyuan Du, Angfan Zhu

Auto-TLDR; Distilled Learning Framework for Siamese Tracking

MFST: Multi-Features Siamese Tracker

Similar papers

Exploiting Distilled Learning for Deep Siamese Tracking

Siamese Fully Convolutional Tracker with Motion Correction

Tackling Occlusion in Siamese Tracking with Structured Dropouts

RSINet: Rotation-Scale Invariant Network for Online Visual Tracking

DAL: A Deep Depth-Aware Long-Term Tracker

VTT: Long-Term Visual Tracking with Transformers

SiamMT: Real-Time Arbitrary Multi-Object Tracking

Robust Visual Object Tracking with Two-Stream Residual Convolutional Networks

Model Decay in Long-Term Tracking

TSDM: Tracking by SiamRPN++ with a Depth-Refiner and a Mask-Generator

Siamese Dynamic Mask Estimation Network for Fast Video Object Segmentation

Adaptive Context-Aware Discriminative Correlation Filters for Robust Visual Object Tracking

Efficient Correlation Filter Tracking with Adaptive Training Sample Update Scheme

Visual Object Tracking in Drone Images with Deep Reinforcement Learning

Reducing False Positives in Object Tracking with Siamese Network

Compact and Discriminative Multi-Object Tracking with Siamese CNNs

Attention Pyramid Module for Scene Recognition

PSDNet: A Balanced Architecture of Accuracy and Parameters for Semantic Segmentation

AerialMPTNet: Multi-Pedestrian Tracking in Aerial Imagery Using Temporal and Graphical Features

Efficient-Receptive Field Block with Group Spatial Attention Mechanism for Object Detection

Adaptive Feature Fusion Network for Gaze Tracking in Mobile Tablets

Aggregating Object Features Based on Attention Weights for Fine-Grained Image Retrieval

SynDHN: Multi-Object Fish Tracker Trained on Synthetic Underwater Videos

Dual-Attention Guided Dropblock Module for Weakly Supervised Object Localization

Semantic Bilinear Pooling for Fine-Grained Recognition

Fast and Accurate Real-Time Semantic Segmentation with Dilated Asymmetric Convolutions

An Improved Bilinear Pooling Method for Image-Based Action Recognition

Building Computationally Efficient and Well-Generalizing Person Re-Identification Models with Metric Learning

Tracking Fast Moving Objects by Segmentation Network

Multi-Scale Residual Pyramid Attention Network for Monocular Depth Estimation

Learning a Dynamic High-Resolution Network for Multi-Scale Pedestrian Detection

Self and Channel Attention Network for Person Re-Identification

ACRM: Attention Cascade R-CNN with Mix-NMS for Metallic Surface Defect Detection

SCA Net: Sparse Channel Attention Module for Action Recognition

Context-Aware Residual Module for Image Classification

A Novel Region of Interest Extraction Layer for Instance Segmentation

Object Detection Model Based on Scene-Level Region Proposal Self-Attention

Attention Stereo Matching Network

Detecting Objects with High Object Region Percentage

Video Face Manipulation Detection through Ensemble of CNNs

Real-Time Semantic Segmentation Via Region and Pixel Context Network

Domain Siamese CNNs for Sparse Multispectral Disparity Estimation

Visual Saliency Oriented Vehicle Scale Estimation

Utilising Visual Attention Cues for Vehicle Detection and Tracking

Multi-Order Feature Statistical Model for Fine-Grained Visual Categorization

Global-Local Attention Network for Semantic Segmentation in Aerial Images

Mobile Augmented Reality: Fast, Precise, and Smooth Planar Object Tracking

A Grid-Based Representation for Human Action Recognition