ICPR2020 Paper Browser

Paper download is intended for registered attendees only, and is subjected to the IEEE Copyright Policy. Any other use is strongly forbidden.

Classification of Spatially Enriched Pixel Time Series with Convolutional Neural Networks

Mohamed Chelali, Camille Kurtz, Anne Puissant, Nicole Vincent

Auto-TLDR; Spatio-Temporal Features Extraction from Satellite Image Time Series Using Random Walk

Abstract Slides Poster

Satellite Image Time Series (SITS), MRI sequences, and more generally image time series, constitute 2D+t data providing spatial and temporal information about an observed scene. Given a pattern recognition task such as image classification, considering jointly such rich information is crucial during the decision process. Nevertheless, due to the complex representation of the data-cube, spatio-temporal features extraction from 2D+t data remains difficult to handle. We present in this article an approach to learn such features from this data, and then to proceed to their classification. Our strategy consists in enriching pixel time series with spatial information. It is based on Random Walk to build a novel segment-based representation of the data, passing from a 2D+t dimension to a 2D one, without loosing too much spatial information. Such new representation is then involved in an end-to-end learning process with a classical 2D Convolutional Neural Network (CNN) in order to learn spatio-temporal features for the classification of image time series. Our approach is evaluated on a remote sensing application for the mapping of agricultural crops. Thanks to a visual attention mechanism, the proposed $2D$ spatio-temporal representation makes also easier the interpretation of a SITS to understand spatio-temporal phenomenons related to soil management practices.

Similar papers

Force Banner for the Recognition of Spatial Relations

Robin Deléarde, Camille Kurtz, Laurent Wendling, Philippe Dejean

Auto-TLDR; Spatial Relation Recognition using Force Banners

Abstract Slides Similar

Studying the spatial organization of objects in images is fundamental to increase both the understanding of the sensed scene and the accuracy of the perceived similarity between images. This often leads to the problem of spatial relation recognition: given two objects depicted in an image, what is their spatial relation? In this article, we consider this as a classification problem. Instead of considering directly the original image space (or imaging features) to predict the spatial relation, we propose a novel intermediate representation (called Force Banner) modeling rich spatial information between pairs of objects composing a scene. Such a representation captures the relative position between objects using a panel of forces (attraction and repulsion), that take into account the structural shapes of the objects and their distance in a directional fashion. Force Banners are used to feed a classical 2D Convolutional Neural Network (CNN) for the recognition of spatial relations, benefiting from pre-trained models and fine-tuning. Experimental results obtained on a dataset of images with various shapes highlight the interest of this approach, and in particular its benefit to describe spatial information.

The Color Out of Space: Learning Self-Supervised Representations for Earth Observation Imagery

Stefano Vincenzi, Angelo Porrello, Pietro Buzzega, Marco Cipriano, Pietro Fronte, Roberto Cuccu, Carla Ippoliti, Annamaria Conte, Simone Calderara

Auto-TLDR; Satellite Image Representation Learning for Remote Sensing

Classification of Spatially Enriched Pixel Time Series with Convolutional Neural Networks

Similar papers

Force Banner for the Recognition of Spatial Relations

The Color Out of Space: Learning Self-Supervised Representations for Earth Observation Imagery

On Morphological Hierarchies for Image Sequences

3D Attention Mechanism for Fine-Grained Classification of Table Tennis Strokes Using a Twin Spatio-Temporal Convolutional Neural Networks

One Step Clustering Based on A-Contrario Framework for Detection of Alterations in Historical Violins

Multi-Scanning Based Recurrent Neural Network for Hyperspectral Image Classification

Self-Supervised Joint Encoding of Motion and Appearance for First Person Action Recognition

Exploring Spatial-Temporal Representations for fNIRS-based Intimacy Detection via an Attention-enhanced Cascade Convolutional Recurrent Neural Network

SDMA: Saliency Driven Mutual Cross Attention for Multi-Variate Time Series

Machine-Learned Regularization and Polygonization of Building Segmentation Masks

Detecting Marine Species in Echograms Via Traditional, Hybrid, and Deep Learning Frameworks

SAT-Net: Self-Attention and Temporal Fusion for Facial Action Unit Detection

Enhancing Deep Semantic Segmentation of RGB-D Data with Entangled Forests

Late Fusion of Bayesian and Convolutional Models for Action Recognition

Trainable Spectrally Initializable Matrix Transformations in Convolutional Neural Networks

Documents Counterfeit Detection through a Deep Learning Approach

Space-Time Domain Tensor Neural Networks: An Application on Human Pose Classification

Estimation of Abundance and Distribution of SaltMarsh Plants from Images Using Deep Learning

Towards Tackling Multi-Label Imbalances in Remote Sensing Imagery

Creating Classifier Ensembles through Meta-Heuristic Algorithms for Aerial Scene Classification

Relevance Detection in Cataract Surgery Videos by Spatio-Temporal Action Localization

Motion U-Net: Multi-Cue Encoder-Decoder Network for Motion Segmentation

Attention Pyramid Module for Scene Recognition

Modeling Long-Term Interactions to Enhance Action Recognition

A Detection-Based Approach to Multiview Action Classification in Infants

Early Wildfire Smoke Detection in Videos

Bridging the Gap between Natural and Medical Images through Deep Colorization

Video Face Manipulation Detection through Ensemble of CNNs

Global-Local Attention Network for Semantic Segmentation in Aerial Images

ESResNet: Environmental Sound Classification Based on Visual Domain Models

Automatic Semantic Segmentation of Structural Elements related to the Spinal Cord in the Lumbar Region by Using Convolutional Neural Networks

Deep Transfer Learning for Alzheimer’s Disease Detection

Epileptic Seizure Prediction: A Semi-Dilated Convolutional Neural Network Architecture

Rotation Invariant Aerial Image Retrieval with Group Convolutional Metric Learning

Transformer Networks for Trajectory Forecasting

Video Semantic Segmentation Using Deep Multi-View Representation Learning

Multi-Branch Attention Networks for Classifying Galaxy Clusters

A Comparison of Neural Network Approaches for Melanoma Classification

A Grid-Based Representation for Human Action Recognition

Probability Guided Maxout

A Systematic Investigation on Deep Architectures for Automatic Skin Lesions Classification

Recognizing Bengali Word Images - A Zero-Shot Learning Perspective

Attention Based Coupled Framework for Road and Pothole Segmentation

Attention-Driven Body Pose Encoding for Human Activity Recognition

Which are the factors affecting the performance of audio surveillance systems?

Single View Learning in Action Recognition

Joint Supervised and Self-Supervised Learning for 3D Real World Challenges

Human or Machine? It Is Not What You Write, but How You Write It