ICPR2020 Paper Browser

Paper download is intended for registered attendees only, and is subjected to the IEEE Copyright Policy. Any other use is strongly forbidden.

HMFlow: Hybrid Matching Optical Flow Network for Small and Fast-Moving Objects

Suihanjin Yu, Youmin Zhang, Chen Wang, Xiao Bai, Liang Zhang, Edwin Hancock

Auto-TLDR; Hybrid Matching Optical Flow Network with Global Matching Component

Abstract Slides Poster

In optical flow estimation task, coarse-to-fine warping strategy is widely used to deal with the large displacement problem and provides efficiency and speed. However, limited by the small search range between the first images and warped second images, current coarse-to-fine optical flow networks fail to capture small and fast-moving objects which has disappeared at coarse resolution levels. To address this problem, we introduce a lightweight but effective Global Matching Component (GMC) to grab global matching features. We propose a new Hybrid Matching Optical Flow Network (HMFlow) by integrating GMC into existing coarse-to-fine networks seamlessly. Besides keeping in high accuracy and small model size, our proposed HMFlow can apply global matching features to guide the network to discover the small and fast-moving objects mismatched by local matching features. We also build a new dataset, named SFChairs, for evaluation. The experimental results show that our proposed network achieves considerable performance, especially at regions with small and fast-moving objects.

Similar papers

STaRFlow: A SpatioTemporal Recurrent Cell for Lightweight Multi-Frame Optical Flow Estimation

Pierre Godet, Alexandre Boulch, Aurélien Plyer, Guy Le Besnerais

Auto-TLDR; STaRFlow: A lightweight CNN-based algorithm for optical flow estimation

HMFlow: Hybrid Matching Optical Flow Network for Small and Fast-Moving Objects

Similar papers

STaRFlow: A SpatioTemporal Recurrent Cell for Lightweight Multi-Frame Optical Flow Estimation

PA-FlowNet: Pose-Auxiliary Optical Flow Network for Spacecraft Relative Pose Estimation

ResFPN: Residual Skip Connections in Multi-Resolution Feature Pyramid Networks for Accurate Dense Pixel Matching

Two-Stage Adaptive Object Scene Flow Using Hybrid CNN-CRF Model

Deeply-Fused Attentive Network for Stereo Matching

A Lightweight Network to Learn Optical Flow from Event Data

OmniFlowNet: A Perspective Neural Network Adaptation for Optical Flow Estimation in Omnidirectional Images

Learning Stereo Matchability in Disparity Regression Networks

Revisiting Optical Flow Estimation in 360 Videos

Movement-Induced Priors for Deep Stereo

Attention Stereo Matching Network

Leveraging a Weakly Adversarial Paradigm for Joint Learning of Disparity and Confidence Estimation

Suppressing Features That Contain Disparity Edge for Stereo Matching

FC-DCNN: A Densely Connected Neural Network for Stereo Estimation

Object Segmentation Tracking from Generic Video Cues

Siamese Fully Convolutional Tracker with Motion Correction

Residual Learning of Video Frame Interpolation Using Convolutional LSTM

Extending Single Beam Lidar to Full Resolution by Fusing with Single Image Depth Estimation

Video Semantic Segmentation Using Deep Multi-View Representation Learning

Multi-Scale Residual Pyramid Attention Network for Monocular Depth Estimation

Siamese Dynamic Mask Estimation Network for Fast Video Object Segmentation

Multi-Direction Convolution for Semantic Segmentation

Real-Time Monocular Depth Estimation with Extremely Light-Weight Neural Network

Visual Saliency Oriented Vehicle Scale Estimation

Fast and Accurate Real-Time Semantic Segmentation with Dilated Asymmetric Convolutions

Learning Object Deformation and Motion Adaption for Semi-Supervised Video Object Segmentation

Real-Time Semantic Segmentation Via Region and Pixel Context Network

Coarse to Fine: Progressive and Multi-Task Learning for Salient Object Detection

TSMSAN: A Three-Stream Multi-Scale Attentive Network for Video Saliency Detection

5D Light Field Synthesis from a Monocular Video

Delivering Meaningful Representation for Monocular Depth Estimation

Enhancing Depth Quality of Stereo Vision Using Deep Learning-Based Prior Information of the Driving Environment

Motion Complementary Network for Efficient Action Recognition

ACCLVOS: Atrous Convolution with Spatial-Temporal ConvLSTM for Video Object Segmentation

Dynamic Guided Network for Monocular Depth Estimation

Coarse-To-Fine Foreground Segmentation Based on Co-Occurrence Pixel-Block and Spatio-Temporal Attention Model

FastSal: A Computationally Efficient Network for Visual Saliency Prediction

Ordinal Depth Classification Using Region-Based Self-Attention

3D Semantic Labeling of Photogrammetry Meshes Based on Active Learning

Revisiting Sequence-To-Sequence Video Object Segmentation with Multi-Task Loss and Skip-Memory

Selective Kernel and Motion-Emphasized Loss Based Attention-Guided Network for HDR Imaging of Dynamic Scenes

Motion-Supervised Co-Part Segmentation

DA-RefineNet: Dual-Inputs Attention RefineNet for Whole Slide Image Segmentation

P2D: A Self-Supervised Method for Depth Estimation from Polarimetry

Human Segmentation with Dynamic LiDAR Data

TinyVIRAT: Low-Resolution Video Action Recognition

Towards Practical Compressed Video Action Recognition: A Temporal Enhanced Multi-Stream Network

Global-Local Attention Network for Semantic Segmentation in Aerial Images