ICPR2020 Paper Browser

Paper download is intended for registered attendees only, and is subjected to the IEEE Copyright Policy. Any other use is strongly forbidden.

Cost-Effective Adversarial Attacks against Scene Text Recognition

Mingkun Yang, Haitian Zheng, Xiang Bai, Jiebo Luo

Auto-TLDR; Adversarial Attacks on Scene Text Recognition

Abstract Slides Poster

Scene text recognition is a challenging task due to the diversity in text appearance and complexity of natural scenes. Thanks to the development of deep learning and the large volume of training data, scene text recognition has made impressive progress in recent years. However, recent research on adversarial examples has shown that deep learning models are vulnerable to adversarial input with imperceptible changes. As one of the most practical tasks in computer vision, scene text recognition is also facing huge security risks. To our best knowledge, there has been no work on adversarial attacks against scene text recognition. To investigate its effects on scene text recognition, we make the first attempt to attack the state-of-the-art scene text recognizer, i.e., attention-based recognizer. To that end, we first adjust the objective function designed for non-sequential tasks, such as image classification, semantic segmentation and image retrieval, to the sequential form. We then propose a novel and effective objective function to further reduce the amount of perturbation while achieving a higher attack success rate. Comprehensive experiments on several standard benchmarks clearly demonstrate effective adversarial effects on scene text recognition by the proposed attacks.

Similar papers

Transferable Adversarial Attacks for Deep Scene Text Detection

Shudeng Wu, Tao Dai, Guanghao Meng, Bin Chen, Jian Lu, Shutao Xia

Auto-TLDR; Robustness of DNN-based STD methods against Adversarial Attacks

Abstract Slides Similar

Scene text detection (STD) aims to locate text in images and plays an important role in many computer vision tasks including automatic driving and text recognition systems. Recently, deep neural networks (DNNs) have been widely and successfully used in scene text detection, leading to plenty of DNN-based STD methods including regression-based and segmentation-based STD methods. However, recent studies have also shown that DNN is vulnerable to adversarial attacks, which can significantly degrade the performance of DNN models. In this paper, we investigate the robustness of DNN-based STD methods against adversarial attacks. To this end, we propose a generic and efficient attack method to generate adversarial examples, which are produced by adding small but imperceptible adversarial perturbation to the input images. Experiments on attacking four various models and a real-world STD engine of Google optical character recognition (OCR) show that the state-of-the-art DNN-based STD methods including regression-based and segmentation-based methods are vulnerable to adversarial attacks.

IBN-STR: A Robust Text Recognizer for Irregular Text in Natural Scenes

Xiaoqian Li, Jie Liu, Shuwu Zhang

Auto-TLDR; IBN-STR: A Robust Text Recognition System Based on Data and Feature Representation

Abstract Poster Similar

Although text recognition methods based on deep neural networks have promising performance, there are still challenges due to the variety of text styles, perspective distortion, text with large curvature, and so on. To obtain a robust text recognizer, we have improved the performance from two aspects: data aspect and feature representation aspect. In terms of data, we transform the input images into S-shape distorted images in order to increase the diversity of training data. Besides, we explore the effects of different training data. In terms of feature representation, the combination of instance normalization and batch normalization improves the model's capacity and generalization ability. This paper proposes a robust text recognizer IBN-STR, which is an attention-based model. Through extensive experiments, the model analysis and comparison have been carried out from the aspects of data and feature representation, and the effectiveness of IBN-STR on both regular and irregular text instances has been verified. Furthermore, IBN-STR is an end-to-end recognition system that can achieve state-of-the-art performance.

Gaussian Constrained Attention Network for Scene Text Recognition

Zhi Qiao, Xugong Qin, Yu Zhou, Fei Yang, Weiping Wang

Auto-TLDR; Gaussian Constrained Attention Network for Scene Text Recognition

Cost-Effective Adversarial Attacks against Scene Text Recognition

Similar papers

Transferable Adversarial Attacks for Deep Scene Text Detection

IBN-STR: A Robust Text Recognizer for Irregular Text in Natural Scenes

Gaussian Constrained Attention Network for Scene Text Recognition

ReADS: A Rectified Attentional Double Supervised Network for Scene Text Recognition

Weakly Supervised Attention Rectification for Scene Text Recognition

Recognizing Multiple Text Sequences from an Image by Pure End-To-End Learning

A Multi-Head Self-Relation Network for Scene Text Recognition

Text Recognition in Real Scenarios with a Few Labeled Samples

Text Recognition - Real World Data and Where to Find Them

2D License Plate Recognition based on Automatic Perspective Rectification

MEAN: A Multi-Element Attention Based Network for Scene Text Recognition

Sample-Aware Data Augmentor for Scene Text Recognition

Stratified Multi-Task Learning for Robust Spotting of Scene Texts

Robust Lexicon-Free Confidence Prediction for Text Recognition

Task-based Focal Loss for Adversarially Robust Meta-Learning

Cross-Lingual Text Image Recognition Via Multi-Task Sequence to Sequence Learning

Defense Mechanism against Adversarial Attacks Using Density-Based Representation of Images

Adversarially Training for Audio Classifiers

Adaptive Noise Injection for Training Stochastic Student Networks from Deterministic Teachers

Variational Inference with Latent Space Quantization for Adversarial Resilience

Polynomial Universal Adversarial Perturbations for Person Re-Identification

F-Mixup: Attack CNNs from Fourier Perspective

Attack Agnostic Adversarial Defense via Visual Imperceptible Bound

AdvHat: Real-World Adversarial Attack on ArcFace Face ID System

Optimal Transport As a Defense against Adversarial Attacks

Dual Path Multi-Modal High-Order Features for Textual Content Based Visual Question Answering

CCA: Exploring the Possibility of Contextual Camouflage Attack on Object Detection

On the Robustness of 3D Human Pose Estimation

Explain2Attack: Text Adversarial Attacks via Cross-Domain Interpretability

Accuracy-Perturbation Curves for Evaluation of Adversarial Attack and Defence Methods

Local Gradient Difference Based Mass Features for Classification of 2D-3D Natural Scene Text Images

Feature Embedding Based Text Instance Grouping for Largely Spaced and Occluded Text Detection

Scene Text Detection with Selected Anchors

Beyond Cross-Entropy: Learning Highly Separable Feature Distributions for Robust and Accurate Classification

A Delayed Elastic-Net Approach for Performing Adversarial Attacks

DUET: Detection Utilizing Enhancement for Text in Scanned or Captured Documents

An Accurate Threshold Insensitive Kernel Detector for Arbitrary Shaped Text

A Transformer-Based Radical Analysis Network for Chinese Character Recognition

TCATD: Text Contour Attention for Scene Text Detection

Self-Training for Domain Adaptive Scene Text Detection

Global Context-Based Network with Transformer for Image2latex

RLST: A Reinforcement Learning Approach to Scene Text Detection Refinement

LODENet: A Holistic Approach to Offline Handwritten Chinese and Japanese Text Line Recognition

Mutually Guided Dual-Task Network for Scene Text Detection

Killing Four Birds with One Gaussian Process: The Relation between Different Test-Time Attacks

Boosting High-Level Vision with Joint Compression Artifacts Reduction and Super-Resolution

Improving Word Recognition Using Multiple Hypotheses and Deep Embeddings

Adversarial Training for Aspect-Based Sentiment Analysis with BERT