ICPR2020 Paper Browser

Paper download is intended for registered attendees only and is subject to the IEEE Copyright Policy. Any other use is strictly forbidden.

Feature Representation Learning for Calving Detection of Cows Using Video Frames

Ryosuke Hyodo, Nakano Teppei, Tetsuji Ogawa

Auto-TLDR; Data-driven Feature Extraction for Calving Sign Detection Using Surveillance Video

Data-driven feature extraction is examined to realize accurate and robust calving detection. Automatic calving sign detection systems can support farmers' decision making. In this paper, neural networks are designed to extract information relevant to calving signs that can be observed in video, such as the frequency of pre-calving postures and statistics of movement and rotation. Experimental comparisons using surveillance video demonstrate that, compared with an end-to-end calving detection system, the proposed feature extraction methods reduce false positives and help explain the basis of each prediction.

Similar papers

Crowdsourced Verification for Operating Calving Surveillance Systems at an Early Stage

Yusuke Okimoto, Soshi Kawata, Susumu Saito, Nakano Teppei, Tetsuji Ogawa

Auto-TLDR; Crowdsourcing for Data-Driven Video Surveillance

This study attempts to use crowdsourcing to facilitate the operation of pattern-recognition-based video surveillance systems at an early stage. Target events (i.e. events to be detected during surveillance) are not frequently observed in recorded video, so achieving reliable surveillance on the basis of machine learning requires a sufficient amount of target data. Acquiring sufficient data is time-consuming. However, operating unreliable surveillance systems can induce many false alarms. Crowdsourcing is introduced to address this problem by verifying the unreliable results in data-driven surveillance. Experimental simulation conducted using monitoring video of Japanese black beef cattle demonstrates that crowdsourced verification successfully reduced false alarms in calving detection systems.

Toward Building a Data-Driven System For Detecting Mounting Actions of Black Beef Cattle

Yuriko Kawano, Susumu Saito, Nakano Teppei, Ikumi Kondo, Ryota Yamazaki, Hiromi Kusaka, Minoru Sakaguchi, Tetsuji Ogawa

Auto-TLDR; Cattle Mounting Action Detection Using Crowdsourcing and Pattern Recognition

This paper tackles building a pattern recognition system that detects whether a pair of Japanese black beef cattle captured in a given image region is in a “mounting” action, a sign that is critically important for cattle farmers to detect before artificial insemination. The “mounting” action refers to one cow bending over another, usually when either cow is in estrus. Although a pattern-recognition-based approach for detecting such an action would be valued as low-cost and robust, it has not been discussed much because of the complexity of the system architecture, the unavailability of datasets, etc. This study presents i) our image dataset construction technique, which exploits both an object detection algorithm and crowdsourcing to collect cattle pair images labeled as either “mounting” or not; and ii) a system, developed on the basis of this dataset, for detecting the mounting action in any given image of a cattle pair. Starting with an algorithm that extracts regions of cattle pairs from a video frame based on the intersection of single-cattle regions, we designed a crowdsourcing microtask in which crowd workers were given simple guidelines for annotating the extracted regions with mounting-action-relevant labels, and thereby obtained a dataset. We also introduce our tandem-layered pattern recognition system trained with the dataset. The system comprises two serially connected machine learning components and, even with a small amount of training data, detects mounting actions more robustly than a normal end-to-end neural network. Experimental comparisons demonstrated that our detection system was capable of detecting estrus with a precision rate of 80% and a recall rate of 76%.
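
The pair-region extraction step described in this abstract can be illustrated with a minimal sketch. The abstract does not give the authors' actual algorithm, so everything here is an assumption made for illustration: single-cattle regions are taken to be axis-aligned bounding boxes from some object detector, two cattle count as a pair when their boxes overlap, and the pair region is the smallest box enclosing both. The names `Box`, `intersection_area`, `pair_regions`, and `min_overlap` are hypothetical.

```python
from itertools import combinations
from typing import List, Tuple

# Hypothetical box format: (x_min, y_min, x_max, y_max) in pixels.
Box = Tuple[int, int, int, int]


def intersection_area(a: Box, b: Box) -> int:
    """Area of overlap between two axis-aligned boxes (0 if they do not overlap)."""
    width = min(a[2], b[2]) - max(a[0], b[0])
    height = min(a[3], b[3]) - max(a[1], b[1])
    return max(width, 0) * max(height, 0)


def pair_regions(boxes: List[Box], min_overlap: int = 1) -> List[Box]:
    """Return one enclosing region per pair of boxes that overlap.

    min_overlap is an assumed threshold (in pixels squared), not a value
    taken from the paper.
    """
    regions = []
    for a, b in combinations(boxes, 2):
        if intersection_area(a, b) >= min_overlap:
            # Smallest box that contains both single-cattle boxes.
            regions.append(
                (min(a[0], b[0]), min(a[1], b[1]), max(a[2], b[2]), max(a[3], b[3]))
            )
    return regions


if __name__ == "__main__":
    detections = [(0, 0, 10, 10), (5, 5, 15, 15), (100, 100, 110, 110)]
    print(pair_regions(detections))
```

In this sketch the first two detections overlap and yield a single candidate region, while the isolated third detection yields none. Such candidate regions would then be passed on for labeling, as the abstract describes for the crowdsourcing microtask.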

Early Wildfire Smoke Detection in Videos

Taanya Gupta, Hengyue Liu, Bir Bhanu

Auto-TLDR; Semi-supervised Spatio-Temporal Video Object Segmentation for Automatic Detection of Smoke in Videos during Forest Fire

Recent advances in unmanned aerial vehicles and camera technology have proven useful for the detection of smoke that emerges above the trees during a forest fire. Automatic detection of smoke in videos is of great interest to fire departments. To date, in most parts of the world, fires are not detected at an early stage and generally turn catastrophic. This paper introduces a novel technique that integrates spatial and temporal features in a deep learning framework using semi-supervised spatio-temporal video object segmentation and dense optical flow. However, detecting this smoke in the presence of haze and without labeled data is difficult. Considering the visibility of haze in the sky, a dark channel pre-processing method is used that reduces the amount of haze in video frames and consequently improves the detection results. Online training is performed on a video at test time, which reduces the need for ground-truth data. Tests using publicly available video datasets show that the proposed algorithms outperform previous work and are robust across different wildfire-threatened locations.
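
The abstract does not detail its dark channel pre-processing, but the quantity such methods usually start from, the dark channel of an image, can be sketched briefly. For each pixel it is the minimum intensity over the color channels and over a small neighborhood; hazy regions tend to have high dark-channel values. The function name `dark_channel`, the nested-list image representation, and the `radius` parameter are assumptions for illustration, and the haze-removal step that would follow is not shown.

```python
from typing import List, Tuple

# Hypothetical pixel format: (r, g, b) with intensities in [0, 1].
Pixel = Tuple[float, float, float]


def dark_channel(image: List[List[Pixel]], radius: int = 1) -> List[List[float]]:
    """Per-pixel minimum over color channels and a (2*radius+1)^2 window.

    The window is clipped at the image border.
    """
    height, width = len(image), len(image[0])
    # Step 1: minimum over the three color channels at each pixel.
    channel_min = [[min(pixel) for pixel in row] for row in image]
    # Step 2: minimum of that map over the local window around each pixel.
    dark = [[0.0] * width for _ in range(height)]
    for y in range(height):
        for x in range(width):
            y0, y1 = max(0, y - radius), min(height, y + radius + 1)
            x0, x1 = max(0, x - radius), min(width, x + radius + 1)
            dark[y][x] = min(
                channel_min[j][i] for j in range(y0, y1) for i in range(x0, x1)
            )
    return dark


if __name__ == "__main__":
    frame = [
        [(0.9, 0.8, 0.7), (0.5, 0.6, 0.4)],
        [(0.3, 0.9, 0.9), (1.0, 1.0, 1.0)],
    ]
    print(dark_channel(frame, radius=0))
```

With `radius=0` the result is just the per-pixel channel minimum; larger radii spread low values across the window.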

Detecting Anomalies from Video-Sequences: A Novel Descriptor

Giulia Orrù, Davide Ghiani, Maura Pintor, Gian Luca Marcialis, Fabio Roli

Auto-TLDR; Trit-based Measurement of Group Dynamics for Crowd Behavior Analysis and Anomaly Detection

Real-Time Driver Drowsiness Detection Using Facial Action Units

Hybrid Network for End-To-End Text-Independent Speaker Identification

Ground-truthing Large Human Behavior Monitoring Datasets

Recognizing American Sign Language Nonmanual Signal Grammar Errors in Continuous Videos

Weight Estimation from an RGB-D Camera in Top-View Configuration

An Adaptive Fusion Model Based on Kalman Filtering and LSTM for Fast Tracking of Road Signs

Extracting and Interpreting Unknown Factors with Classifier for Foot Strike Types in Running

Real Time Fencing Move Classification and Detection at Touch Time During a Fencing Match

Gender Classification Using Video Sequences of Body Sway Recorded by Overhead Camera

Automated Whiteboard Lecture Video Summarization by Content Region Detection and Representation

Detecting Marine Species in Echograms Via Traditional, Hybrid, and Deep Learning Frameworks

Spatial Bias in Vision-Based Voice Activity Detection

Video Analytics Gait Trend Measurement for Fall Prevention and Health Monitoring

Global Feature Aggregation for Accident Anticipation

Video Anomaly Detection by Estimating Likelihood of Representations

What and How? Jointly Forecasting Human Action and Pose

Relevance Detection in Cataract Surgery Videos by Spatio-Temporal Action Localization

A Detection-Based Approach to Multiview Action Classification in Infants

Image Sequence Based Cyclist Action Recognition Using Multi-Stream 3D Convolution

Electroencephalography Signal Processing Based on Textural Features for Monitoring the Driver’s State by a Brain-Computer Interface

Audio-Video Detection of the Active Speaker in Meetings

Documents Counterfeit Detection through a Deep Learning Approach

Estimation of Clinical Tremor Using Spatio-Temporal Adversarial AutoEncoder

Motion and Region Aware Adversarial Learning for Fall Detection with Thermal Imaging

Motion U-Net: Multi-Cue Encoder-Decoder Network for Motion Segmentation

Data Normalization for Bilinear Structures in High-Frequency Financial Time-Series

IPT: A Dataset for Identity Preserved Tracking in Closed Domains

Which are the factors affecting the performance of audio surveillance systems?

Deep Learning Based Sepsis Intervention: The Modelling and Prediction of Severe Sepsis Onset

Tracking Fast Moving Objects by Segmentation Network

Precise Temporal Action Localization with Quantified Temporal Structure of Actions

AdaFilter: Adaptive Filter Design with Local Image Basis Decomposition for Optimizing Image Recognition Preprocessing

Video Face Manipulation Detection through Ensemble of CNNs

Fall Detection by Human Pose Estimation and Kinematic Theory

Confidence Calibration for Deep Renal Biopsy Immunofluorescence Image Classification

Uncertainty Guided Recognition of Tiny Craters on the Moon

Context Matters: Self-Attention for Sign Language Recognition

Gabriella: An Online System for Real-Time Activity Detection in Untrimmed Security Videos

Location Prediction in Real Homes of Older Adults based on K-Means in Low-Resolution Depth Videos

Modeling Long-Term Interactions to Enhance Action Recognition

Pose-Based Body Language Recognition for Emotion and Psychiatric Symptom Interpretation

RWF-2000: An Open Large Scale Video Database for Violence Detection

Audio-Based Near-Duplicate Video Retrieval with Audio Similarity Learning

The Color Out of Space: Learning Self-Supervised Representations for Earth Observation Imagery