ICPR2020 Paper Browser

Paper download is intended for registered attendees only and is subject to the IEEE Copyright Policy. Any other use is strictly forbidden.

Object Segmentation Tracking from Generic Video Cues

Amirhossein Kardoost, Sabine Müller, Joachim Weickert, Margret Keuper

Auto-TLDR; A Light-Weight Variational Framework for Video Object Segmentation

We propose a light-weight variational framework for online tracking of object segmentations in videos based on optical flow and image boundaries. While high-end computer vision methods for this task rely on sequence-specific training of dedicated CNN architectures, we show the potential of a variational model based on generic video information from motion and color. Such cues are usually required for tasks such as robot navigation or grasp estimation. We leverage them directly for video object segmentation and thus provide accurate segmentations at potentially very low extra cost. Our simple method can provide competitive results compared to the costly CNN-based methods with parameter tuning. Furthermore, we show that our approach can be combined with state-of-the-art CNN-based segmentations to improve on their respective results. We evaluate our method on the DAVIS 16, DAVIS 17, and SegTrack v2 datasets.

Similar papers

Video Semantic Segmentation Using Deep Multi-View Representation Learning

Akrem Sellami, Salvatore Tabbone

Auto-TLDR; Deep Multi-view Representation Learning for Video Object Segmentation

Siamese Dynamic Mask Estimation Network for Fast Video Object Segmentation

Two-Stage Adaptive Object Scene Flow Using Hybrid CNN-CRF Model

Learning Object Deformation and Motion Adaption for Semi-Supervised Video Object Segmentation

ACCLVOS: Atrous Convolution with Spatial-Temporal ConvLSTM for Video Object Segmentation

Revisiting Sequence-To-Sequence Video Object Segmentation with Multi-Task Loss and Skip-Memory

STaRFlow: A SpatioTemporal Recurrent Cell for Lightweight Multi-Frame Optical Flow Estimation

Human Segmentation with Dynamic LiDAR Data

Early Wildfire Smoke Detection in Videos

Learning to Segment Dynamic Objects Using SLAM Outliers

Siamese Fully Convolutional Tracker with Motion Correction

SynDHN: Multi-Object Fish Tracker Trained on Synthetic Underwater Videos

HMFlow: Hybrid Matching Optical Flow Network for Small and Fast-Moving Objects

Movement-Induced Priors for Deep Stereo

Residual Learning of Video Frame Interpolation Using Convolutional LSTM

5D Light Field Synthesis from a Monocular Video

Motion U-Net: Multi-Cue Encoder-Decoder Network for Motion Segmentation

Self-Supervised Joint Encoding of Motion and Appearance for First Person Action Recognition

Revisiting Optical Flow Estimation in 360 Videos

Real-Time Monocular Depth Estimation with Extremely Light-Weight Neural Network

What and How? Jointly Forecasting Human Action and Pose

A Fine-Grained Dataset and Its Efficient Semantic Segmentation for Unstructured Driving Scenarios

OmniFlowNet: A Perspective Neural Network Adaptation for Optical Flow Estimation in Omnidirectional Images

P2D: A Self-Supervised Method for Depth Estimation from Polarimetry

Online Object Recognition Using CNN-Based Algorithm on High-Speed Camera Imaging

DAL: A Deep Depth-Aware Long-Term Tracker

A Grid-Based Representation for Human Action Recognition

A Lightweight Network to Learn Optical Flow from Event Data

Towards Practical Compressed Video Action Recognition: A Temporal Enhanced Multi-Stream Network

Motion-Supervised Co-Part Segmentation

Edge-Aware Monocular Dense Depth Estimation with Morphology

SAILenv: Learning in Virtual Visual Environments Made Simple

Extending Single Beam Lidar to Full Resolution by Fusing with Single Image Depth Estimation

Weakly Supervised Geodesic Segmentation of Egyptian Mummy CT Scans

Leveraging a Weakly Adversarial Paradigm for Joint Learning of Disparity and Confidence Estimation

Video Object Detection Using Object's Motion Context and Spatio-Temporal Feature Aggregation

FC-DCNN: A Densely Connected Neural Network for Stereo Estimation

Automated Whiteboard Lecture Video Summarization by Content Region Detection and Representation

ResFPN: Residual Skip Connections in Multi-Resolution Feature Pyramid Networks for Accurate Dense Pixel Matching

A Boundary-Aware Distillation Network for Compressed Video Semantic Segmentation

Robust Visual Object Tracking with Two-Stream Residual Convolutional Networks

GraphBGS: Background Subtraction Via Recovery of Graph Signals

PA-FlowNet: Pose-Auxiliary Optical Flow Network for Spacecraft Relative Pose Estimation

Modeling Long-Term Interactions to Enhance Action Recognition

Learning to Take Directions One Step at a Time

FOANet: A Focus of Attention Network with Application to Myocardium Segmentation

RONELD: Robust Neural Network Output Enhancement for Active Lane Detection

Tracking Fast Moving Objects by Segmentation Network