ICPR2020 Paper Browser

Paper download is intended for registered attendees only, and is subjected to the IEEE Copyright Policy. Any other use is strongly forbidden.

Ground-truthing Large Human Behavior Monitoring Datasets

Tehreem Qasim, Robert Fisher, Naeem Bhatti

Auto-TLDR; Semi-automated Groundtruthing for Large Video Datasets

Abstract Slides Poster

We present a groundtruthing approach which is applicable to large video datasets collected for studying people’s behavior, and which are recorded at a low frame per second (fps) rate. Groundtruthing a large dataset manually is a time consuming task and is prone to errors. The proposed approach is semi-automated (using a combination of deepnet and traditional image analysis) to minimize human labeler’s interaction with the video frames. The framework employs mask-rcnn as a people counter followed by human assisted semi-automated tests to correct the wrong labels. Subsequently, a bounding box extraction algorithm is used which is fully automated for frames with a single person and semi-automated for frames with two or more people. We also propose a methodology for anomaly detection i.e., collapse on table or floor. Behavior recognition is performed by using a fine-tuned alexnet convolutional neural network. The people detection and behavior analysis components of the framework are primarily designed to help reduce human labor in ground-truthing so that minimal human involvement is required. They are not meant to be employed as fully automated state-of-the-art systems. The proposed approach is validated on a new dataset presented in this paper, containing human activity in an indoor office environment and recorded at 1 fps as well as an indoor video sequence recorded at 15 fps. Experimental results show a significant reduction in human labor involved in the process of ground-truthing i.e., the number of potential clicks for office dataset was reduced by 99.2% and for the additional test video by 99.7%.

Similar papers

Detecting Anomalies from Video-Sequences: A Novel Descriptor

Giulia Orrù, Davide Ghiani, Maura Pintor, Gian Luca Marcialis, Fabio Roli

Auto-TLDR; Trit-based Measurement of Group Dynamics for Crowd Behavior Analysis and Anomaly Detection

Ground-truthing Large Human Behavior Monitoring Datasets

Similar papers

Detecting Anomalies from Video-Sequences: A Novel Descriptor

Motion and Region Aware Adversarial Learning for Fall Detection with Thermal Imaging

Early Wildfire Smoke Detection in Videos

IPT: A Dataset for Identity Preserved Tracking in Closed Domains

Video Analytics Gait Trend Measurement for Fall Prevention and Health Monitoring

Iterative Bounding Box Annotation for Object Detection

Story Comparison for Estimating Field of View Overlap in a Video Collection

Video Anomaly Detection by Estimating Likelihood of Representations

Semantic Segmentation for Pedestrian Detection from Motion in Temporal Domain

Anomaly Detection, Localization and Classification for Railway Inspection

TGCRBNW: A Dataset for Runner Bib Number Detection (and Recognition) in the Wild

A Grid-Based Representation for Human Action Recognition

Learning Defects in Old Movies from Manually Assisted Restoration

SynDHN: Multi-Object Fish Tracker Trained on Synthetic Underwater Videos

Toward Building a Data-Driven System ForDetecting Mounting Actions of Black Beef Cattle

Modeling Long-Term Interactions to Enhance Action Recognition

SIMCO: SIMilarity-Based Object COunting

Tracking Fast Moving Objects by Segmentation Network

Location Prediction in Real Homes of Older Adults based on K-Means in Low-Resolution Depth Videos

AerialMPTNet: Multi-Pedestrian Tracking in Aerial Imagery Using Temporal and Graphical Features

Gabriella: An Online System for Real-Time Activity Detection in Untrimmed Security Videos

RWF-2000: An Open Large Scale Video Database for Violence Detection

End-To-End Deep Learning Methods for Automated Damage Detection in Extreme Events at Various Scales

Learning Visual Voice Activity Detection with an Automatically Annotated Dataset

Crowdsourced Verification for Operating Calving Surveillance Systems at an Early Stage

A Detection-Based Approach to Multiview Action Classification in Infants

Unsupervised Moving Object Detection through Background Models for PTZ Camera

Real Time Fencing Move Classification and Detection at Touch Time During a Fencing Match

Uncertainty Guided Recognition of Tiny Craters on the Moon

An Adaptive Fusion Model Based on Kalman Filtering and LSTM for Fast Tracking of Road Signs

StrongPose: Bottom-up and Strong Keypoint Heat Map Based Pose Estimation

EAGLE: Large-Scale Vehicle Detection Dataset in Real-World Scenarios Using Aerial Imagery

Motion U-Net: Multi-Cue Encoder-Decoder Network for Motion Segmentation

Learning Object Deformation and Motion Adaption for Semi-Supervised Video Object Segmentation

Self-Training for Domain Adaptive Scene Text Detection

What and How? Jointly Forecasting Human Action and Pose

Point In: Counting Trees with Weakly Supervised Segmentation Network

Robust Visual Object Tracking with Two-Stream Residual Convolutional Networks

Image Sequence Based Cyclist Action Recognition Using Multi-Stream 3D Convolution

Learning to Take Directions One Step at a Time

SAILenv: Learning in Virtual Visual Environments Made Simple

Construction Worker Hardhat-Wearing Detection Based on an Improved BiFPN

LFIR2Pose: Pose Estimation from an Extremely Low-Resolution FIR Image Sequence

Inner Eye Canthus Localization for Human Body Temperature Screening

Distortion-Adaptive Grape Bunch Counting for Omnidirectional Images

Automated Whiteboard Lecture Video Summarization by Content Region Detection and Representation

Automatically Gather Address Specific Dwelling Images Using Google Street View

Real-Time Drone Detection and Tracking with Visible, Thermal and Acoustic Sensors