ICPR2020 Paper Browser

Paper download is intended for registered attendees only, and is subjected to the IEEE Copyright Policy. Any other use is strongly forbidden.

Classifying Eye-Tracking Data Using Saliency Maps

Shafin Rahman, Sejuti Rahman, Omar Shahid, Md. Tahmeed Abdullah, Jubair Ahmed Sourov

Auto-TLDR; Saliency-based Feature Extraction for Automatic Classification of Eye-tracking Data

Abstract Slides Poster

A plethora of research in the literature shows how human eye fixation pattern varies depending on different factors, including genetics, age, social functioning, cognitive functioning, and so on. Analysis of these variations in visual attention has already elicited two potential research avenues: 1) determining the physiological or psychological state of the subject and 2) predicting the tasks associated with the act of viewing from the recorded eye-fixation data. To this end, this paper proposes a visual saliency based novel feature extraction method for automatic and quantitative classification of eye-tracking data, which is applicable to both of the research directions. Instead of directly extracting features from the fixation data, this method employs several well-known computational models of visual attention to predict eye fixation locations as saliency maps. Comparing the saliency amplitudes, similarity and dissimilarity of saliency maps with the corresponding eye fixations maps gives an extra dimension of information which is effectively utilized to generate discriminative features to classify the eye-tracking data. Extensive experimentation using Saliency4ASD [1], Age Prediction [2], and Visual Perceptual Task [3] dataset show that our saliency-based feature can achieve superior performance, outperforming the previous state-of-the-art methods [2],[4], [5] by a considerable margin. Moreover, unlike the existing application-specific solutions, our method demonstrates performance improvement across three distinct problems from the real-life domain: Autism Spectrum Disorder screening, toddler age prediction, and human visual perceptual task classification, providing a general paradigm that utilizes the extra-information inherent in saliency maps for a more accurate classification.

Similar papers

A General End-To-End Method for Characterizing Neuropsychiatric Disorders Using Free-Viewing Visual Scanning Tasks

Hong Yue Sean Liu, Jonathan Chung, Moshe Eizenman

Auto-TLDR; A general, data-driven, end-to-end framework that extracts relevant features of attentional bias from visual scanning behaviour and uses these features

Classifying Eye-Tracking Data Using Saliency Maps

Similar papers

A General End-To-End Method for Characterizing Neuropsychiatric Disorders Using Free-Viewing Visual Scanning Tasks

GazeMAE: General Representations of Eye Movements Using a Micro-Macro Autoencoder

Collaborative Human Machine Attention Module for Character Recognition

Fully Convolutional Neural Networks for Raw Eye Tracking Data Segmentation, Generation, and Reconstruction

Saliency Prediction on Omnidirectional Images with Brain-Like Shallow Neural Network

User-Independent Gaze Estimation by Extracting Pupil Parameter and Its Mapping to the Gaze Angle

Translating Adult's Focus of Attention to Elderly's

From Early Biological Models to CNNs: Do They Look Where Humans Look?

FastSal: A Computationally Efficient Network for Visual Saliency Prediction

Responsive Social Smile: A Machine-Learning Based Multimodal Behavior Assessment Framework towards Early Stage Autism Screening

Adaptive Feature Fusion Network for Gaze Tracking in Mobile Tablets

TSMSAN: A Three-Stream Multi-Scale Attentive Network for Video Saliency Detection

PrivAttNet: Predicting Privacy Risks in Images Using Visual Attention

Explainable Online Validation of Machine Learning Models for Practical Applications

Utilising Visual Attention Cues for Vehicle Detection and Tracking

Detection and Correspondence Matching of Corneal Reflections for Eye Tracking Using Deep Learning

Audio-Video Detection of the Active Speaker in Meetings

Electroencephalography Signal Processing Based on Textural Features for Monitoring the Driver’s State by a Brain-Computer Interface

Exploring Spatial-Temporal Representations for fNIRS-based Intimacy Detection via an Attention-enhanced Cascade Convolutional Recurrent Neural Network

Attribute-Based Quality Assessment for Demographic Estimation in Face Videos

Understanding Integrated Gradients with SmoothTaylor for Deep Neural Network Attribution

Deep Gait Relative Attribute Using a Signed Quadratic Contrastive Loss

Deep Convolutional Embedding for Digitized Painting Clustering

Real-Time Driver Drowsiness Detection Using Facial Action Units

Quantified Facial Temporal-Expressiveness Dynamics for Affect Analysis

Assessing the Severity of Health States Based on Social Media Posts

Spatial Bias in Vision-Based Voice Activity Detection

Weight Estimation from an RGB-D Camera in Top-View Configuration

MRP-Net: A Light Multiple Region Perception Neural Network for Multi-Label AU Detection

Video Face Manipulation Detection through Ensemble of CNNs

Pose-Aware Multi-Feature Fusion Network for Driver Distraction Recognition

Depth Videos for the Classification of Micro-Expressions

Exposing Deepfake Videos by Tracking Eye Movements

Coarse to Fine: Progressive and Multi-Task Learning for Salient Object Detection

An Experimental Evaluation of Recent Face Recognition Losses for Deepfake Detection

InsideBias: Measuring Bias in Deep Networks and Application to Face Gender Biometrics

EEG-Based Cognitive State Assessment Using Deep Ensemble Model and Filter Bank Common Spatial Pattern

Attentive Hybrid Feature Based a Two-Step Fusion for Facial Expression Recognition

A Systematic Investigation on Deep Architectures for Automatic Skin Lesions Classification

Estimating Gaze Points from Facial Landmarks by a Remote Spherical Camera

Facial Expression Recognition Using Residual Masking Network

Magnifying Spontaneous Facial Micro Expressions for Improved Recognition

3D Facial Matching by Spiral Convolutional Metric Learning and a Biometric Fusion-Net of Demographic Properties

Two-Level Attention-Based Fusion Learning for RGB-D Face Recognition

Question-Agnostic Attention for Visual Question Answering

Automatic Tuberculosis Detection Using Chest X-Ray Analysis with Position Enhanced Structural Information

A Flatter Loss for Bias Mitigation in Cross-Dataset Facial Age Estimation

A Grid-Based Representation for Human Action Recognition