ICPR2020 Paper Browser

Paper download is intended for registered attendees only, and is subjected to the IEEE Copyright Policy. Any other use is strongly forbidden.

Video Anomaly Detection by Estimating Likelihood of Representations

Yuqi Ouyang, Victor Sanchez

Auto-TLDR; Video Anomaly Detection in the latent feature space using a deep probabilistic model

Abstract Slides Poster

Video anomaly detection is a challenging task not only because it involves solving many sub-tasks such as motion representation, object localization and action recognition, but also because it is commonly considered as an unsupervised learning problem that involves detecting outliers. Traditionally, solutions to this task have focused on the mapping between video frames and their low-dimensional features, while ignoring the spatial connections of those features. Recent solutions focus on analyzing these spatial connections by using hard clustering techniques, such as K-Means, or applying neural networks to map latent features to a general understanding, such as action attributes. In order to solve video anomaly in the latent feature space, we propose a deep probabilistic model to transfer this task into a density estimation problem where latent manifolds are generated by a deep denoising autoencoder and clustered by expectation maximization. Evaluations on several benchmarks datasets show the strengths of our model, achieving outstanding performance on challenging datasets.

Similar papers

Combining GANs and AutoEncoders for Efficient Anomaly Detection

Fabio Carrara, Giuseppe Amato, Luca Brombin, Fabrizio Falchi, Claudio Gennaro

Auto-TLDR; CBIGAN: Anomaly Detection in Images with Consistency Constrained BiGAN

Video Anomaly Detection by Estimating Likelihood of Representations

Similar papers

Combining GANs and AutoEncoders for Efficient Anomaly Detection

Motion and Region Aware Adversarial Learning for Fall Detection with Thermal Imaging

Dual-Mode Iterative Denoiser: Tackling the Weak Label for Anomaly Detection

PoseCVAE: Anomalous Human Activity Detection

Improved anomaly detection by training an autoencoder with skip connections on images corrupted with Stain-shaped noise

Modeling the Distribution of Normal Data in Pre-Trained Deep Features for Anomaly Detection

Anomaly Detection, Localization and Classification for Railway Inspection

A Joint Representation Learning and Feature Modeling Approach for One-Class Recognition

Detecting Anomalies from Video-Sequences: A Novel Descriptor

Evaluation of Anomaly Detection Algorithms for the Real-World Applications

Ground-truthing Large Human Behavior Monitoring Datasets

Discriminative Multi-Level Reconstruction under Compact Latent Space for One-Class Novelty Detection

Mutual Information Based Method for Unsupervised Disentanglement of Video Representation

Reducing the Variance of Variational Estimates of Mutual Information by Limiting the Critic's Hypothesis Space to RKHS

Estimation of Clinical Tremor Using Spatio-Temporal Adversarial AutoEncoder

A Grid-Based Representation for Human Action Recognition

Variational Capsule Encoder

N2D: (Not Too) Deep Clustering Via Clustering the Local Manifold of an Autoencoded Embedding

Image Representation Learning by Transformation Regression

Movement-Induced Priors for Deep Stereo

Deep Convolutional Embedding for Digitized Painting Clustering

Multi-Modal Deep Clustering: Unsupervised Partitioning of Images

Automatic Detection of Stationary Waves in the Venus’ Atmosphere Using Deep Generative Models

Self-Supervised Joint Encoding of Motion and Appearance for First Person Action Recognition

Attack-Agnostic Adversarial Detection on Medical Data Using Explainable Machine Learning

JUMPS: Joints Upsampling Method for Pose Sequences

Single-Modal Incremental Terrain Clustering from Self-Supervised Audio-Visual Feature Learning

Learning to Take Directions One Step at a Time

Generative Deep-Neural-Network Mixture Modeling with Semi-Supervised MinMax+EM Learning

RWF-2000: An Open Large Scale Video Database for Violence Detection

Motion U-Net: Multi-Cue Encoder-Decoder Network for Motion Segmentation

Video Semantic Segmentation Using Deep Multi-View Representation Learning

Variational Deep Embedding Clustering by Augmented Mutual Information Maximization

Correlation-Based ConvNet for Small Object Detection in Videos

GAN-Based Gaussian Mixture Model Responsibility Learning

Flow-Guided Spatial Attention Tracking for Egocentric Activity Recognition

PIF: Anomaly detection via preference embedding

Gabriella: An Online System for Real-Time Activity Detection in Untrimmed Security Videos

Class-Incremental Learning with Topological Schemas of Memory Spaces

Wavelet Attention Embedding Networks for Video Super-Resolution

The Role of Cycle Consistency for Generating Better Human Action Videos from a Single Frame

Real Time Fencing Move Classification and Detection at Touch Time During a Fencing Match

Crowdsourced Verification for Operating Calving Surveillance Systems at an Early Stage

Global Feature Aggregation for Accident Anticipation

Video Reconstruction by Spatio-Temporal Fusion of Blurred-Coded Image Pair

Modeling Long-Term Interactions to Enhance Action Recognition

Towards Practical Compressed Video Action Recognition: A Temporal Enhanced Multi-Stream Network

Feature-Aware Unsupervised Learning with Joint Variational Attention and Automatic Clustering