ICPR2020 Paper Browser

Paper download is intended for registered attendees only and is subject to the IEEE Copyright Policy. Any other use is strictly forbidden.

Motion U-Net: Multi-Cue Encoder-Decoder Network for Motion Segmentation

Gani Rahmon, Filiz Bunyak, Kannappan Palaniappan

Auto-TLDR; Motion U-Net: A Deep Learning Framework for Robust Moving Object Detection under Challenging Conditions


Detection of moving objects is a critical first step in many computer vision applications. Several algorithms for motion and change detection have been proposed. However, many of these approaches lack the ability to handle challenging real-world scenarios. Recently, deep learning approaches have started to produce impressive solutions to computer vision tasks, particularly for detection and segmentation. Many existing deep learning networks proposed for moving object detection rely only on spatial appearance cues. In this paper, we propose a novel multi-cue and multi-stream network, Motion U-Net (MU-Net), which integrates motion, change, and appearance cues using a deep learning framework for robust moving object detection under challenging conditions. The proposed network consists of a two-stream encoder module followed by feature concatenation and a decoder module. Motion and change cues are computed through our tensor-based motion estimation and multi-modal background subtraction modules. The proposed system was tested and evaluated on the change detection challenge dataset (CDnet-2014) and compared to state-of-the-art methods. On the CDnet-2014 dataset, our approach reaches an average overall F-measure of 0.9852 and outperforms all current state-of-the-art methods. The network was also tested on the unseen SBI-2015 dataset and produced promising results.
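
The F-measure quoted in the abstract is the harmonic mean of precision and recall over foreground (moving) pixels. The sketch below shows how such a score could be computed for binary masks. It is an illustration only, not the authors' evaluation code: the function names `f_measure` and `mean_f_measure`, the flat 0/1 mask representation, and the unweighted averaging across sequences are all assumptions made here, and the official CDnet-2014 averaging protocol may differ.

```python
def f_measure(pred, truth):
    """F-measure of a binary foreground mask against ground truth.

    pred and truth are equal-length flat sequences of 0/1 pixel labels,
    where 1 marks a moving (foreground) pixel. (Hypothetical helper.)
    """
    if len(pred) != len(truth):
        raise ValueError("masks must have the same number of pixels")

    # Count true positives, false positives, and false negatives.
    tp = sum(1 for p, t in zip(pred, truth) if p == 1 and t == 1)
    fp = sum(1 for p, t in zip(pred, truth) if p == 1 and t == 0)
    fn = sum(1 for p, t in zip(pred, truth) if p == 0 and t == 1)

    # Convention assumed here: with no true positives the score is 0.
    if tp == 0:
        return 0.0

    precision = tp / (tp + fp)
    recall = tp / (tp + fn)
    return 2 * precision * recall / (precision + recall)


def mean_f_measure(pairs):
    """Unweighted mean of per-sequence F-measures (assumed averaging scheme)."""
    scores = [f_measure(pred, truth) for pred, truth in pairs]
    return sum(scores) / len(scores)
```

For example, a prediction `[1, 1, 0, 0]` against ground truth `[1, 0, 1, 0]` has one true positive, one false positive, and one false negative, so precision and recall are both 0.5 and the F-measure is 0.5.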

Similar papers

GraphBGS: Background Subtraction Via Recovery of Graph Signals

Jhony Heriberto Giraldo Zuluaga, Thierry Bouwmans

Auto-TLDR; Graph BackGround Subtraction using Graph Signals

Coarse-To-Fine Foreground Segmentation Based on Co-Occurrence Pixel-Block and Spatio-Temporal Attention Model

Early Wildfire Smoke Detection in Videos

Tracking Fast Moving Objects by Segmentation Network

Video Semantic Segmentation Using Deep Multi-View Representation Learning

Gabriella: An Online System for Real-Time Activity Detection in Untrimmed Security Videos

Unsupervised Moving Object Detection through Background Models for PTZ Camera

Learning Object Deformation and Motion Adaption for Semi-Supervised Video Object Segmentation

SynDHN: Multi-Object Fish Tracker Trained on Synthetic Underwater Videos

Revisiting Sequence-To-Sequence Video Object Segmentation with Multi-Task Loss and Skip-Memory

Human Segmentation with Dynamic LiDAR Data

Accurate Background Subtraction Using Dynamic Object Presence Probability in Sports Scenes

Siamese Dynamic Mask Estimation Network for Fast Video Object Segmentation

Semantic Segmentation for Pedestrian Detection from Motion in Temporal Domain

Object Segmentation Tracking from Generic Video Cues

Weight Estimation from an RGB-D Camera in Top-View Configuration

SiamMT: Real-Time Arbitrary Multi-Object Tracking

Human Embryo Cell Centroid Localization and Counting in Time-Lapse Sequences

ACCLVOS: Atrous Convolution with Spatial-Temporal ConvLSTM for Video Object Segmentation

Attention Based Coupled Framework for Road and Pothole Segmentation

Video Reconstruction by Spatio-Temporal Fusion of Blurred-Coded Image Pair

3D Attention Mechanism for Fine-Grained Classification of Table Tennis Strokes Using a Twin Spatio-Temporal Convolutional Neural Networks

A GAN-Based Blind Inpainting Method for Masonry Wall Images

RescueNet: Joint Building Segmentation and Damage Assessment from Satellite Imagery

Real-time Pedestrian Lane Detection for Assistive Navigation using Neural Architecture Search

CT-UNet: An Improved Neural Network Based on U-Net for Building Segmentation in Remote Sensing Images

Triplet-Path Dilated Network for Detection and Segmentation of General Pathological Images

Motion and Region Aware Adversarial Learning for Fall Detection with Thermal Imaging

Robust Visual Object Tracking with Two-Stream Residual Convolutional Networks

Temporal Feature Enhancement Network with External Memory for Object Detection in Surveillance Video

TinyVIRAT: Low-Resolution Video Action Recognition

NephCNN: A Deep-Learning Framework for Vessel Segmentation in Nephrectomy Laparoscopic Videos

A Grid-Based Representation for Human Action Recognition

Self-Supervised Joint Encoding of Motion and Appearance for First Person Action Recognition

Robust Pedestrian Detection in Thermal Imagery Using Synthesized Images

A Lumen Segmentation Method in Ureteroscopy Images Based on a Deep Residual U-Net Architecture

Estimation of Abundance and Distribution of SaltMarsh Plants from Images Using Deep Learning

Point In: Counting Trees with Weakly Supervised Segmentation Network

A Fine-Grained Dataset and Its Efficient Semantic Segmentation for Unstructured Driving Scenarios

DE-Net: Dilated Encoder Network for Automated Tongue Segmentation

A Benchmark Dataset for Segmenting Liver, Vasculature and Lesions from Large-Scale Computed Tomography Data

Relevance Detection in Cataract Surgery Videos by Spatio-Temporal Action Localization

Video Lightening with Dedicated CNN Architecture

Transfer Learning through Weighted Loss Function and Group Normalization for Vessel Segmentation from Retinal Images

Automatic Semantic Segmentation of Structural Elements related to the Spinal Cord in the Lumbar Region by Using Convolutional Neural Networks

What and How? Jointly Forecasting Human Action and Pose

Residual Learning of Video Frame Interpolation Using Convolutional LSTM

CAggNet: Crossing Aggregation Network for Medical Image Segmentation