ICPR2020 Paper Browser

Paper download is intended for registered attendees only and is subject to the IEEE Copyright Policy. Any other use is strictly forbidden.

Coarse-To-Fine Foreground Segmentation Based on Co-Occurrence Pixel-Block and Spatio-Temporal Attention Model

Xinyu Liu, Dong Liang

Auto-TLDR; Coarse-to-Fine Foreground Segmentation Using a Co-occurrence Pixel-Block Model for Dynamic Scenes


Foreground segmentation in dynamic scenes is an important task in video surveillance. Unsupervised background subtraction methods based on statistical background modeling have difficulty updating their models. Supervised foreground segmentation methods based on deep learning, on the other hand, rely on large-scale, accurately annotated training data, which limits their cross-scene performance. In this paper, we propose a coarse-to-fine foreground segmentation method. First, a Spatio-Temporal Attention Model (STAM) trained across scenes produces a coarse segmentation, so no training on the specific scene is required. The coarse segmentation is then used as a reference to help the Co-occurrence Pixel-Block model (CPB) complete the fine segmentation and, at the same time, update its background model. This method is more flexible than deep-learning-based methods that depend on scene-specific training, and it achieves accurate online dynamic updating of the background model. Experimental results on WallFlower and LIMU show that our method outperforms STAM, CPB, and the other methods in the comparison.
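The coarse-to-fine idea in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: a plain per-pixel background image stands in for CPB, a precomputed boolean mask stands in for the STAM output, and the function name `refine_and_update` and the parameters `diff_thresh` and `lr` are hypothetical. It only shows how a coarse mask might both gate the fine segmentation and decide which pixels are allowed to update the background model.

```python
import numpy as np

def refine_and_update(frame, background, coarse_mask, diff_thresh=25.0, lr=0.05):
    """Refine a coarse foreground mask and selectively update the background.

    frame, background: (H, W) grayscale intensity arrays.
    coarse_mask: (H, W) bool array, True where the coarse stage
                 (a stand-in for STAM) predicts foreground.
    diff_thresh, lr: hypothetical threshold and learning rate.
    """
    frame = frame.astype(np.float64)
    background = background.astype(np.float64)

    # Pixel-wise deviation from the background model
    # (a simple stand-in for CPB, which is assumed to be far richer).
    deviates = np.abs(frame - background) > diff_thresh

    # Fine mask: keep pixels flagged by both the coarse stage and the model.
    fine_mask = coarse_mask & deviates

    # Update the background only where the coarse stage reports no
    # foreground, so moving objects are not absorbed into the model.
    new_background = background.copy()
    bg_pixels = ~coarse_mask
    new_background[bg_pixels] = (
        (1.0 - lr) * background[bg_pixels] + lr * frame[bg_pixels]
    )
    return fine_mask, new_background
```

In this sketch a pixel inside the coarse mask that does not deviate from the background is dropped from the fine mask, while a slow illumination drift outside the coarse mask is blended into the background at rate `lr`.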

Similar papers

Motion U-Net: Multi-Cue Encoder-Decoder Network for Motion Segmentation

Gani Rahmon, Filiz Bunyak, Kannappan Palaniappan

Auto-TLDR; Motion U-Net: A Deep Learning Framework for Robust Moving Object Detection under Challenging Conditions

Accurate Background Subtraction Using Dynamic Object Presence Probability in Sports Scenes

Unsupervised Moving Object Detection through Background Models for PTZ Camera

Early Wildfire Smoke Detection in Videos

GraphBGS: Background Subtraction Via Recovery of Graph Signals

TSMSAN: A Three-Stream Multi-Scale Attentive Network for Video Saliency Detection

Deeply-Fused Attentive Network for Stereo Matching

ACCLVOS: Atrous Convolution with Spatial-Temporal ConvLSTM for Video Object Segmentation

Siamese Dynamic Mask Estimation Network for Fast Video Object Segmentation

Forground-Guided Vehicle Perception Framework

Directed Variational Cross-encoder Network for Few-Shot Multi-image Co-segmentation

Learning Object Deformation and Motion Adaption for Semi-Supervised Video Object Segmentation

Video Semantic Segmentation Using Deep Multi-View Representation Learning

Revisiting Sequence-To-Sequence Video Object Segmentation with Multi-Task Loss and Skip-Memory

CT-UNet: An Improved Neural Network Based on U-Net for Building Segmentation in Remote Sensing Images

RWF-2000: An Open Large Scale Video Database for Violence Detection

Accurate Cell Segmentation in Digital Pathology Images Via Attention Enforced Networks

Multi-Scale Residual Pyramid Attention Network for Monocular Depth Estimation

Object Detection Model Based on Scene-Level Region Proposal Self-Attention

Progressive Scene Segmentation Based on Self-Attention Mechanism

Attention Based Coupled Framework for Road and Pothole Segmentation

Weight Estimation from an RGB-D Camera in Top-View Configuration

PHNet: Parasite-Host Network for Video Crowd Counting

Global-Local Attention Network for Semantic Segmentation in Aerial Images

Human Segmentation with Dynamic LiDAR Data

Gabriella: An Online System for Real-Time Activity Detection in Untrimmed Security Videos

PSDNet: A Balanced Architecture of Accuracy and Parameters for Semantic Segmentation

Video Lightening with Dedicated CNN Architecture

Construction Worker Hardhat-Wearing Detection Based on an Improved BiFPN

MFI: Multi-Range Feature Interchange for Video Action Recognition

DA-RefineNet: Dual-Inputs Attention RefineNet for Whole Slide Image Segmentation

DE-Net: Dilated Encoder Network for Automated Tongue Segmentation

Dynamic Guided Network for Monocular Depth Estimation

Real-Time Semantic Segmentation Via Region and Pixel Context Network

TinyVIRAT: Low-Resolution Video Action Recognition

Coarse to Fine: Progressive and Multi-Task Learning for Salient Object Detection

HMFlow: Hybrid Matching Optical Flow Network for Small and Fast-Moving Objects

Temporal Feature Enhancement Network with External Memory for Object Detection in Surveillance Video

Learning Defects in Old Movies from Manually Assisted Restoration

Object Segmentation Tracking from Generic Video Cues

Tracking Fast Moving Objects by Segmentation Network

Detecting Anomalies from Video-Sequences: A Novel Descriptor

3D Attention Mechanism for Fine-Grained Classification of Table Tennis Strokes Using a Twin Spatio-Temporal Convolutional Neural Networks

Triplet-Path Dilated Network for Detection and Segmentation of General Pathological Images

Attention Stereo Matching Network

Semantic Segmentation Refinement Using Entropy and Boundary-guided Monte Carlo Sampling and Directed Regional Search

Efficient-Receptive Field Block with Group Spatial Attention Mechanism for Object Detection

3D Semantic Labeling of Photogrammetry Meshes Based on Active Learning