ICPR2020 Paper Browser

Paper download is intended for registered attendees only, and is subject to the IEEE Copyright Policy. Any other use is strictly forbidden.

Detecting Manipulated Facial Videos: A Time Series Solution

Zhang Zhewei, Ma Can, Gao Meilin, Ding Bowen

Auto-TLDR; Face-Alignment Based Bi-LSTM for Fake Video Detection

We propose a new method for exposing fake videos that treats detection as a time series problem. The method uses a bidirectional long short-term memory (Bi-LSTM) backbone with two types of features, Face-Alignment and Dense-Face-Alignment, both of which are physiological signals that differ between fake and original videos. We use 68 landmark points as the Face-Alignment feature and the Pose Adaptive Feature (PAF) as the Dense-Face-Alignment feature. Based on these two facial features, we design two deep networks. In addition, we add an attention mechanism to the network, which improves detection precision. Our method is tested on the FaceForensics/FaceForensics++ benchmarks and shows promising inference speed while maintaining accuracy comparable to state-of-the-art DeepFake detection solutions.
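
To make the input representation concrete, here is a minimal sketch, not the authors' implementation, of how 68 two-dimensional landmarks per frame could be flattened into a time series and then pooled over time with softmax attention weights. The Bi-LSTM itself is omitted: in a full model the attention would presumably act on recurrent hidden states, whereas here the raw per-frame features stand in for them. The function names, the scoring vector `w`, and the toy data are all assumptions made for illustration.

```python
import math
from typing import List, Sequence, Tuple

NUM_LANDMARKS = 68  # landmark count stated in the abstract


def frame_to_vector(landmarks: Sequence[Tuple[float, float]]) -> List[float]:
    """Flatten 68 (x, y) landmarks into one 136-dimensional feature vector."""
    if len(landmarks) != NUM_LANDMARKS:
        raise ValueError(f"expected {NUM_LANDMARKS} landmarks, got {len(landmarks)}")
    return [coord for point in landmarks for coord in point]


def softmax(scores: Sequence[float]) -> List[float]:
    """Numerically stable softmax over a list of scores."""
    peak = max(scores)
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def attention_pool(sequence: Sequence[Sequence[float]], w: Sequence[float]):
    """Weight each time step by softmax(w . x_t) and return the weighted sum."""
    scores = [sum(wi * xi for wi, xi in zip(w, x)) for x in sequence]
    weights = softmax(scores)
    dim = len(sequence[0])
    pooled = [sum(a * x[d] for a, x in zip(weights, sequence)) for d in range(dim)]
    return pooled, weights


# Toy video: 3 frames, each with 68 synthetic landmarks (hypothetical data).
video = [[(float(t), float(i)) for i in range(NUM_LANDMARKS)] for t in range(3)]
series = [frame_to_vector(frame) for frame in video]

# Hypothetical, untrained scoring vector; all zeros yields uniform attention.
w = [0.0] * (2 * NUM_LANDMARKS)
pooled, weights = attention_pool(series, w)
```

In a trained model `w` would be learned, so that frames carrying stronger evidence of manipulation receive larger weights before the final classification layer.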

Similar papers

Video Face Manipulation Detection through Ensemble of CNNs

Nicolo Bonettini, Edoardo Daniele Cannas, Sara Mandelli, Luca Bondi, Paolo Bestagini, Stefano Tubaro

Auto-TLDR; Face Manipulation Detection in Video Sequences Using Convolutional Neural Networks

In the last few years, several techniques for facial manipulation in videos have been successfully developed and made available to the masses (e.g., FaceSwap, deepfake). These methods enable anyone to easily edit faces in video sequences with incredibly realistic results and very little effort. Despite the usefulness of these tools in many fields, if used maliciously, they can have a significantly bad impact on society (e.g., fake news spreading, cyber bullying through fake revenge porn). The ability to objectively detect whether a face has been manipulated in a video sequence is therefore a task of utmost importance. In this paper, we tackle the problem of face manipulation detection in video sequences targeting modern facial manipulation techniques. In particular, we study the ensembling of different trained Convolutional Neural Network (CNN) models. In the proposed solution, different models are obtained starting from a base network (i.e., EfficientNetB4) making use of two different concepts: (i) attention layers; (ii) siamese training. We show that combining these networks leads to promising face manipulation detection results on two publicly available datasets with more than 119,000 videos.
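
As a rough illustration of ensembling, the sketch below averages per-frame scores within each model and then averages across models to obtain one video-level score. The abstract does not say how the networks' outputs are combined, so plain averaging, the 0.5 threshold, the model labels, and the scores themselves are assumptions, not the authors' procedure.

```python
from statistics import mean
from typing import Dict, List


def ensemble_video_score(per_model_frame_scores: Dict[str, List[float]]) -> float:
    """Average each model's frame scores, then average across models."""
    if not per_model_frame_scores:
        raise ValueError("at least one model is required")
    model_means = [mean(scores) for scores in per_model_frame_scores.values()]
    return mean(model_means)


# Hypothetical per-frame "fake" probabilities from three model variants.
scores = {
    "base": [0.9, 0.8, 0.7],
    "attention": [0.6, 0.7, 0.8],
    "siamese": [0.75, 0.85, 0.95],
}

video_score = ensemble_video_score(scores)
is_fake = video_score > 0.5  # assumed decision threshold
```

Averaging is only one possible combination rule; weighted voting or a learned combiner would fit the same interface.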

Joint Face Alignment and 3D Face Reconstruction with Efficient Convolution Neural Networks

Keqiang Li, Huaiyu Wu, Xiuqin Shang, Zhen Shen, Gang Xiong, Xisong Dong, Bin Hu, Fei-Yue Wang

Auto-TLDR; Mobile-FRNet: Efficient 3D Morphable Model Alignment and 3D Face Reconstruction from a Single 2D Facial Image

Exposing Deepfake Videos by Tracking Eye Movements

3D Facial Matching by Spiral Convolutional Metric Learning and a Biometric Fusion-Net of Demographic Properties

A Neural Lip-Sync Framework for Synthesizing Photorealistic Virtual News Anchors

Hybrid Approach for 3D Head Reconstruction: Using Neural Networks and Visual Geometry

Multi-Attribute Regression Network for Face Reconstruction

Face Anti-Spoofing Based on Dynamic Color Texture Analysis Using Local Directional Number Pattern

MEG: Multi-Evidence GNN for Multimodal Semantic Forensics

An Experimental Evaluation of Recent Face Recognition Losses for Deepfake Detection

A Cross Domain Multi-Modal Dataset for Robust Face Anti-Spoofing

SAT-Net: Self-Attention and Temporal Fusion for Facial Action Unit Detection

Global Feature Aggregation for Accident Anticipation

Face Anti-Spoofing Using Spatial Pyramid Pooling

Interpretable Emotion Classification Using Temporal Convolutional Models

Inner Eye Canthus Localization for Human Body Temperature Screening

Two-Stream Temporal Convolutional Network for Dynamic Facial Attractiveness Prediction

Depth Videos for the Classification of Micro-Expressions

Adaptive Feature Fusion Network for Gaze Tracking in Mobile Tablets

Coherence and Identity Learning for Arbitrary-Length Face Video Generation

ClusterFace: Joint Clustering and Classification for Set-Based Face Recognition

Real-Time Driver Drowsiness Detection Using Facial Action Units

On the Use of Benford's Law to Detect GAN-Generated Images

Learning Semantic Representations Via Joint 3D Face Reconstruction and Facial Attribute Estimation

Evaluation of Anomaly Detection Algorithms for the Real-World Applications

RLST: A Reinforcement Learning Approach to Scene Text Detection Refinement

SSDL: Self-Supervised Domain Learning for Improved Face Recognition

Deep Multi-Task Learning for Facial Expression Recognition and Synthesis Based on Selective Feature Sharing

Two-Level Attention-Based Fusion Learning for RGB-D Face Recognition

MRP-Net: A Light Multiple Region Perception Neural Network for Multi-Label AU Detection

A Weak Coupling of Semi-Supervised Learning with Generative Adversarial Networks for Malware Classification

User-Independent Gaze Estimation by Extracting Pupil Parameter and Its Mapping to the Gaze Angle

Learning Disentangled Representations for Identity Preserving Surveillance Face Camouflage

Explorable Tone Mapping Operators

Trajectory-User Link with Attention Recurrent Networks

MedZip: 3D Medical Images Lossless Compressor Using Recurrent Neural Network (LSTM)

Adaptive Image Compression Using GAN Based Semantic-Perceptual Residual Compensation

Learning Visual Voice Activity Detection with an Automatically Annotated Dataset

Three-Dimensional Lip Motion Network for Text-Independent Speaker Recognition

Disentangled Representation Based Face Anti-Spoofing

Video-Based Facial Expression Recognition Using Graph Convolutional Networks

Computational Data Analysis for First Quantization Estimation on JPEG Double Compressed Images

Identity-Aware Facial Expression Recognition in Compressed Video

Attentive Hybrid Feature Based a Two-Step Fusion for Facial Expression Recognition

Talking Face Generation Via Learning Semantic and Temporal Synchronous Landmarks

Anomaly Detection, Localization and Classification for Railway Inspection

Audio-Video Detection of the Active Speaker in Meetings

Self-Supervised Joint Encoding of Motion and Appearance for First Person Action Recognition