ICPR2020 Paper Browser

Paper download is intended for registered attendees only and is subject to the IEEE Copyright Policy. Any other use is strictly forbidden.

Uncertainty-Sensitive Activity Recognition: A Reliability Benchmark and the CARING Models

Alina Roitberg, Monica Haurilet, Manuel Martinez, Rainer Stiefelhagen

Auto-TLDR; CARING: Calibrated Action Recognition with Input Guidance

Beyond assigning the correct class, an activity recognition model should also be able to determine how certain it is in its predictions. We present the first study of how well the confidence values of modern action recognition architectures reflect the probability of the correct outcome and propose a learning-based approach for improving it. First, we extend two popular action recognition datasets with a reliability benchmark in the form of the expected calibration error and reliability diagrams. Since our evaluation highlights that the confidence values of standard action recognition architectures do not represent the uncertainty well, we introduce a new approach that learns to transform the model output into realistic confidence estimates through an additional calibration network. The main idea of our Calibrated Action Recognition with Input Guidance (CARING) model is to learn an optimal scaling parameter depending on the video representation. We compare our model with the native action recognition networks and the temperature scaling approach, a widespread calibration method used in image classification. While temperature scaling alone drastically improves the reliability of the confidence values, our CARING method consistently leads to the best uncertainty estimates in all benchmark settings.
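
To make the quantities named in the abstract concrete, the following is a minimal sketch of the expected calibration error and of temperature-scaled softmax. It is not the authors' implementation. The binning scheme, the toy logits, and the function names (softmax, expected_calibration_error, evaluate) are assumptions made for illustration. The per-sample temperatures in the last call are hypothetical hand-picked values that only stand in for what an input-dependent calibration network might output.

```python
import math


def softmax(logits, temperature=1.0):
    """Softmax over one sample's logits, divided by a temperature first."""
    scaled = [z / temperature for z in logits]
    peak = max(scaled)  # subtract the max for numerical stability
    exps = [math.exp(z - peak) for z in scaled]
    total = sum(exps)
    return [e / total for e in exps]


def expected_calibration_error(confidences, correct, n_bins=10):
    """Weighted average of |accuracy - mean confidence| over equal-width bins."""
    bins = [[] for _ in range(n_bins)]
    for conf, ok in zip(confidences, correct):
        # Bin k covers [k/n_bins, (k+1)/n_bins); a confidence of 1.0 goes in the last bin.
        index = min(int(conf * n_bins), n_bins - 1)
        bins[index].append((conf, ok))
    total = len(confidences)
    ece = 0.0
    for members in bins:
        if not members:
            continue
        mean_conf = sum(conf for conf, _ in members) / len(members)
        accuracy = sum(1 for _, ok in members if ok) / len(members)
        ece += (len(members) / total) * abs(accuracy - mean_conf)
    return ece


def evaluate(all_logits, labels, temperatures):
    """ECE of the top-class confidence, using one temperature per sample."""
    confidences, correct = [], []
    for logits, label, temp in zip(all_logits, labels, temperatures):
        probs = softmax(logits, temp)
        top = max(range(len(probs)), key=probs.__getitem__)
        confidences.append(probs[top])
        correct.append(top == label)
    return expected_calibration_error(confidences, correct)


if __name__ == "__main__":
    # Toy data (assumed, not from the paper): four samples, three classes.
    toy_logits = [[4.0, 0.5, 0.1], [3.5, 3.0, 0.2], [0.3, 5.0, 0.4], [2.8, 0.1, 2.6]]
    toy_labels = [0, 1, 1, 2]
    n = len(toy_logits)
    print("uncalibrated:", evaluate(toy_logits, toy_labels, [1.0] * n))
    print("global T=2.0:", evaluate(toy_logits, toy_labels, [2.0] * n))
    # Hypothetical per-sample temperatures standing in for a calibration network's output.
    print("per-sample T:", evaluate(toy_logits, toy_labels, [1.2, 3.0, 1.1, 3.0]))
```

Dividing the logits by a positive temperature leaves the predicted class unchanged and only reshapes the confidence, which is why such scaling can improve calibration without affecting accuracy. In practice the temperature would be fitted on held-out data rather than fixed by hand as it is here.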

Similar papers

Confidence Calibration for Deep Renal Biopsy Immunofluorescence Image Classification

Federico Pollastri, Juan Maroñas, Federico Bolelli, Giulia Ligabue, Roberto Paredes, Riccardo Magistroni, Costantino Grana

Auto-TLDR; A Probabilistic Convolutional Neural Network for Immunofluorescence Classification in Renal Biopsy

On-Manifold Adversarial Data Augmentation Improves Uncertainty Calibration

Quasibinary Classifier for Images with Zero and Multiple Labels

Learnable Higher-Order Representation for Action Recognition

TinyVIRAT: Low-Resolution Video Action Recognition

Self-Supervised Joint Encoding of Motion and Appearance for First Person Action Recognition

Improving Model Accuracy for Imbalanced Image Classification Tasks by Adding a Final Batch Normalization Layer: An Empirical Study

ActionSpotter: Deep Reinforcement Learning Framework for Temporal Action Spotting in Videos

Separation of Aleatoric and Epistemic Uncertainty in Deterministic Deep Neural Networks

Probability Guided Maxout

A Grid-Based Representation for Human Action Recognition

Gabriella: An Online System for Real-Time Activity Detection in Untrimmed Security Videos

Developing Motion Code Embedding for Action Recognition in Videos

Late Fusion of Bayesian and Convolutional Models for Action Recognition

MFI: Multi-Range Feature Interchange for Video Action Recognition

A Detection-Based Approach to Multiview Action Classification in Infants

Knowledge Distillation for Action Anticipation Via Label Smoothing

Single View Learning in Action Recognition

Modeling Long-Term Interactions to Enhance Action Recognition

RWF-2000: An Open Large Scale Video Database for Violence Detection

Image Sequence Based Cyclist Action Recognition Using Multi-Stream 3D Convolution

Revisiting Sequence-To-Sequence Video Object Segmentation with Multi-Task Loss and Skip-Memory

Feature Pyramid Hierarchies for Multi-Scale Temporal Action Detection

What and How? Jointly Forecasting Human Action and Pose

Leveraging a Weakly Adversarial Paradigm for Joint Learning of Disparity and Confidence Estimation

Learning Group Activities from Skeletons without Individual Action Labels

Exploiting the Logits: Joint Sign Language Recognition and Spell-Correction

RMS-Net: Regression and Masking for Soccer Event Spotting

Extracting Action Hierarchies from Action Labels and their Use in Deep Action Recognition

Verifying the Causes of Adversarial Examples

Generalization Comparison of Deep Neural Networks Via Output Sensitivity

3D Attention Mechanism for Fine-Grained Classification of Table Tennis Strokes Using a Twin Spatio-Temporal Convolutional Neural Networks

MaxDropout: Deep Neural Network Regularization Based on Maximum Output Values

You Ought to Look Around: Precise, Large Span Action Detection

On the Minimal Recognizable Image Patch

PolyLaneNet: Lane Estimation Via Deep Polynomial Regression

Activity Recognition Using First-Person-View Cameras Based on Sparse Optical Flows

Space-Time Domain Tensor Neural Networks: An Application on Human Pose Classification

Motion Complementary Network for Efficient Action Recognition

MixTConv: Mixed Temporal Convolutional Kernels for Efficient Action Recognition

Iterative Label Improvement: Robust Training by Confidence Based Filtering and Dataset Partitioning

Explainable Online Validation of Machine Learning Models for Practical Applications

InsideBias: Measuring Bias in Deep Networks and Application to Face Gender Biometrics

Personalized Models in Human Activity Recognition Using Deep Learning

AttendAffectNet: Self-Attention Based Networks for Predicting Affective Responses from Movies

Temporally Coherent Embeddings for Self-Supervised Video Representation Learning

Towards Practical Compressed Video Action Recognition: A Temporal Enhanced Multi-Stream Network

A Close Look at Deep Learning with Small Data