ICPR2020 Paper Browser

Paper download is intended for registered attendees only and is subject to the IEEE Copyright Policy. Any other use is strictly forbidden.

Anomaly Detection, Localization and Classification for Railway Inspection

Riccardo Gasparini, Andrea D'Eusanio, Guido Borghi, Stefano Pini, Giuseppe Scaglione, Simone Calderara, Eugenio Fedeli, Rita Cucchiara

Auto-TLDR; Anomaly Detection and Localization using thermal images in low-light environments

Detecting, localizing and classifying anomalous objects is a challenging task in computer vision. In this paper, we tackle these tasks by developing a framework that automatically inspects the railway at night. Specifically, it predicts the presence, the image coordinates and the class of obstacles. To deal with the low-light environment, the framework is based on thermal images and consists of three modules that, respectively, detect anomalies, predict their image coordinates and classify them. Moreover, given the lack of publicly released datasets for anomaly detection in the railway context, we introduce a new multi-modal dataset, acquired from a rail drone, which is used to evaluate the proposed framework. Experimental results confirm the accuracy of the framework and its suitability, in terms of computational load, performance and inference time, for deployment on a self-powered inspection system.

Similar papers

RefiNet: 3D Human Pose Refinement with Depth Maps

Andrea D'Eusanio, Stefano Pini, Guido Borghi, Roberto Vezzani, Rita Cucchiara

Auto-TLDR; RefiNet: A Multi-stage Framework for 3D Human Pose Estimation

Human Pose Estimation is a fundamental task for many applications in the Computer Vision community and has been widely investigated in the 2D domain, i.e. on intensity images. As a result, most of the available methods for this task are based on 2D Convolutional Neural Networks and huge manually annotated RGB datasets, and achieve impressive results. In this paper, we propose RefiNet, a multi-stage framework that regresses an extremely precise 3D human pose estimation from a given 2D pose and a depth map. The framework consists of three modules, each specialized in a particular refinement and data representation, i.e. depth patches, 3D skeleton and point clouds. Moreover, we collect a new dataset, named Baracca, acquired with RGB, depth and thermal cameras and specifically created for the automotive context. Experimental results confirm the quality of the refinement procedure, which largely improves the human pose estimations of off-the-shelf 2D methods.

Real-Time Drone Detection and Tracking with Visible, Thermal and Acoustic Sensors

Fredrik Svanström, Cristofer Englund, Fernando Alonso-Fernandez

Auto-TLDR; Automatic multi-sensor drone detection using sensor fusion

Detecting Anomalies from Video-Sequences: A Novel Descriptor

Motion and Region Aware Adversarial Learning for Fall Detection with Thermal Imaging

Video Anomaly Detection by Estimating Likelihood of Representations

Combining GANs and AutoEncoders for Efficient Anomaly Detection

Dual-Mode Iterative Denoiser: Tackling the Weak Label for Anomaly Detection

IPT: A Dataset for Identity Preserved Tracking in Closed Domains

PoseCVAE: Anomalous Human Activity Detection

Weight Estimation from an RGB-D Camera in Top-View Configuration

Improved anomaly detection by training an autoencoder with skip connections on images corrupted with Stain-shaped noise

Future Urban Scenes Generation through Vehicles Synthesis

Vehicle Lane Merge Visual Benchmark

Attention Based Coupled Framework for Road and Pothole Segmentation

Holistic Grid Fusion Based Stop Line Estimation

PIF: Anomaly detection via preference embedding

Evaluation of Anomaly Detection Algorithms for the Real-World Applications

Real-Time Driver Drowsiness Detection Using Facial Action Units

Early Wildfire Smoke Detection in Videos

Ground-truthing Large Human Behavior Monitoring Datasets

Thermal Image Enhancement Using Generative Adversarial Network for Pedestrian Detection

Inner Eye Canthus Localization for Human Body Temperature Screening

Complex-Object Visual Inspection: Empirical Studies on a Multiple Lighting Solution

AerialMPTNet: Multi-Pedestrian Tracking in Aerial Imagery Using Temporal and Graphical Features

RWF-2000: An Open Large Scale Video Database for Violence Detection

RISEdb: A Novel Indoor Localization Dataset

Relevance Detection in Cataract Surgery Videos by Spatio-Temporal Action Localization

Temporal Pulses Driven Spiking Neural Network for Time and Power Efficient Object Recognition in Autonomous Driving

Temporal Binary Representation for Event-Based Action Recognition

Light3DPose: Real-Time Multi-Person 3D Pose Estimation from Multiple Views

Modeling the Distribution of Normal Data in Pre-Trained Deep Features for Anomaly Detection

Discriminative Multi-Level Reconstruction under Compact Latent Space for One-Class Novelty Detection

Gabriella: An Online System for Real-Time Activity Detection in Untrimmed Security Videos

Extending Single Beam Lidar to Full Resolution by Fusing with Single Image Depth Estimation

PolyLaneNet: Lane Estimation Via Deep Polynomial Regression

Boundary Guided Image Translation for Pose Estimation from Ultra-Low Resolution Thermal Sensor

Multiple Future Prediction Leveraging Synthetic Trajectories

DeepBEV: A Conditional Adversarial Network for Bird’s Eye View Generation

Derivation of Geometrically and Semantically Annotated UAV Datasets at Large Scales from 3D City Models

Motion U-Net: Multi-Cue Encoder-Decoder Network for Motion Segmentation

IPN Hand: A Video Dataset and Benchmark for Real-Time Continuous Hand Gesture Recognition

CARRADA Dataset: Camera and Automotive Radar with Range-Angle-Doppler Annotations

Polarimetric Image Augmentation

Automatic Detection of Stationary Waves in the Venus’ Atmosphere Using Deep Generative Models

A Grid-Based Representation for Human Action Recognition

Multimodal End-To-End Learning for Autonomous Steering in Adverse Road and Weather Conditions

Tracking Fast Moving Objects by Segmentation Network

A Versatile Crack Inspection Portable System Based on Classifier Ensemble and Controlled Illumination