ICPR2020 Paper Browser

Paper download is intended for registered attendees only, and is subjected to the IEEE Copyright Policy. Any other use is strongly forbidden.

Exposing Deepfake Videos by Tracking Eye Movements

Meng Li, Beibei Liu, Yujiang Hu, Yufei Wang

Auto-TLDR; A Novel Approach to Detecting Deepfake Videos

Abstract Slides Poster

It has recently become a major threat to the public media that fake videos are rapidly spreading over the Internet. The advent of Deepfake, a deep-learning based toolkit, has facilitated a massive abuse of improper synthesized videos, which may influence the media credibility and human rights. A worldwide alert has been set off that finding ways to detect such fake videos is not only crucial but also urgent. This paper reports a novel approach to expose deepfake videos. We found that most fake videos are markedly different from the real ones in the way the eyes move. We are thus motivated to define four features that could well capture such differences. The features are then fed to SVM for classification. It is shown to be a promising approach that without high dimensional features and complicated neural networks, we are able to achieve competitive results on several public datasets. Moreover, the proposed features could well participate with other existing methods in the confrontation with deepfakes.

Similar papers

Video Face Manipulation Detection through Ensemble of CNNs

Nicolo Bonettini, Edoardo Daniele Cannas, Sara Mandelli, Luca Bondi, Paolo Bestagini, Stefano Tubaro

Auto-TLDR; Face Manipulation Detection in Video Sequences Using Convolutional Neural Networks

Abstract Slides Similar

In the last few years, several techniques for facial manipulation in videos have been successfully developed and made available to the masses (i.e., FaceSwap, deepfake, etc.). These methods enable anyone to easily edit faces in video sequences with incredibly realistic results and a very little effort. Despite the usefulness of these tools in many fields, if used maliciously, they can have a significantly bad impact on society (e.g., fake news spreading, cyber bullying through fake revenge porn). The ability of objectively detecting whether a face has been manipulated in a video sequence is then a task of utmost importance. In this paper, we tackle the problem of face manipulation detection in video sequences targeting modern facial manipulation techniques. In particular, we study the ensembling of different trained Convolutional Neural Network (CNN) models. In the proposed solution, different models are obtained starting from a base network (i.e., EfficientNetB4) making use of two different concepts: (i) attention layers; (ii) siamese training. We show that combining these networks leads to promising face manipulation detection results on two publicly available datasets with more than 119000 videos.

User-Independent Gaze Estimation by Extracting Pupil Parameter and Its Mapping to the Gaze Angle

Sang Yoon Han, Nam Ik Cho

Auto-TLDR; Gaze Point Estimation using Pupil Shape for Generalization

Exposing Deepfake Videos by Tracking Eye Movements

Similar papers

Video Face Manipulation Detection through Ensemble of CNNs

User-Independent Gaze Estimation by Extracting Pupil Parameter and Its Mapping to the Gaze Angle

Detecting Manipulated Facial Videos: A Time Series Solution

Adaptive Feature Fusion Network for Gaze Tracking in Mobile Tablets

Estimating Gaze Points from Facial Landmarks by a Remote Spherical Camera

Audio-Video Detection of the Active Speaker in Meetings

Face Anti-Spoofing Based on Dynamic Color Texture Analysis Using Local Directional Number Pattern

Sequential Non-Rigid Factorisation for Head Pose Estimation

Face Anti-Spoofing Using Spatial Pyramid Pooling

Real-Time Driver Drowsiness Detection Using Facial Action Units

Learning Visual Voice Activity Detection with an Automatically Annotated Dataset

Talking Face Generation Via Learning Semantic and Temporal Synchronous Landmarks

A Neural Lip-Sync Framework for Synthesizing Photorealistic Virtual News Anchors

Two-Stream Temporal Convolutional Network for Dynamic Facial Attractiveness Prediction

Classifying Eye-Tracking Data Using Saliency Maps

A Quantitative Evaluation Framework of Video De-Identification Methods

An Experimental Evaluation of Recent Face Recognition Losses for Deepfake Detection

Detection and Correspondence Matching of Corneal Reflections for Eye Tracking Using Deep Learning

Attribute-Based Quality Assessment for Demographic Estimation in Face Videos

Pose-Based Body Language Recognition for Emotion and Psychiatric Symptom Interpretation

Disentangled Representation Based Face Anti-Spoofing

Learning Disentangled Representations for Identity Preserving Surveillance Face Camouflage

Let's Play Music: Audio-Driven Performance Video Generation

Identity-Aware Facial Expression Recognition in Compressed Video

Detection of Makeup Presentation Attacks Based on Deep Face Representations

Quality-Based Representation for Unconstrained Face Recognition

Quantified Facial Temporal-Expressiveness Dynamics for Affect Analysis

RWF-2000: An Open Large Scale Video Database for Violence Detection

Depth Videos for the Classification of Micro-Expressions

Automatic Annotation of Corpora for Emotion Recognition through Facial Expressions Analysis

Recognizing American Sign Language Nonmanual Signal Grammar Errors in Continuous Videos

Translating Adult's Focus of Attention to Elderly's

Responsive Social Smile: A Machine-Learning Based Multimodal Behavior Assessment Framework towards Early Stage Autism Screening

Spatial Bias in Vision-Based Voice Activity Detection

A Grid-Based Representation for Human Action Recognition

Three-Dimensional Lip Motion Network for Text-Independent Speaker Recognition

Magnifying Spontaneous Facial Micro Expressions for Improved Recognition

Fully Convolutional Neural Networks for Raw Eye Tracking Data Segmentation, Generation, and Reconstruction

Inner Eye Canthus Localization for Human Body Temperature Screening

Interpretable Emotion Classification Using Temporal Convolutional Models

Self-Supervised Learning of Dynamic Representations for Static Images

Facial Expression Recognition Using Residual Masking Network

Dual-Mode Iterative Denoiser: Tackling the Weak Label for Anomaly Detection

Self-Supervised Joint Encoding of Motion and Appearance for First Person Action Recognition

Activity Recognition Using First-Person-View Cameras Based on Sparse Optical Flows

Continuous Learning of Face Attribute Synthesis

TinyVIRAT: Low-Resolution Video Action Recognition

Modeling Long-Term Interactions to Enhance Action Recognition