ICPR2020 Paper Browser

Paper download is intended for registered attendees only, and is subjected to the IEEE Copyright Policy. Any other use is strongly forbidden.

Transferable Adversarial Attacks for Deep Scene Text Detection

Shudeng Wu, Tao Dai, Guanghao Meng, Bin Chen, Jian Lu, Shutao Xia

Auto-TLDR; Robustness of DNN-based STD methods against Adversarial Attacks

Abstract Slides

Scene text detection (STD) aims to locate text in images and plays an important role in many computer vision tasks including automatic driving and text recognition systems. Recently, deep neural networks (DNNs) have been widely and successfully used in scene text detection, leading to plenty of DNN-based STD methods including regression-based and segmentation-based STD methods. However, recent studies have also shown that DNN is vulnerable to adversarial attacks, which can significantly degrade the performance of DNN models. In this paper, we investigate the robustness of DNN-based STD methods against adversarial attacks. To this end, we propose a generic and efficient attack method to generate adversarial examples, which are produced by adding small but imperceptible adversarial perturbation to the input images. Experiments on attacking four various models and a real-world STD engine of Google optical character recognition (OCR) show that the state-of-the-art DNN-based STD methods including regression-based and segmentation-based methods are vulnerable to adversarial attacks.

Similar papers

Cost-Effective Adversarial Attacks against Scene Text Recognition

Mingkun Yang, Haitian Zheng, Xiang Bai, Jiebo Luo

Auto-TLDR; Adversarial Attacks on Scene Text Recognition

Transferable Adversarial Attacks for Deep Scene Text Detection

Similar papers

Cost-Effective Adversarial Attacks against Scene Text Recognition

Feature Embedding Based Text Instance Grouping for Largely Spaced and Occluded Text Detection

An Accurate Threshold Insensitive Kernel Detector for Arbitrary Shaped Text

Scene Text Detection with Selected Anchors

TCATD: Text Contour Attention for Scene Text Detection

Sample-Aware Data Augmentor for Scene Text Recognition

RLST: A Reinforcement Learning Approach to Scene Text Detection Refinement

Mutually Guided Dual-Task Network for Scene Text Detection

AdvHat: Real-World Adversarial Attack on ArcFace Face ID System

CCA: Exploring the Possibility of Contextual Camouflage Attack on Object Detection

A Multi-Head Self-Relation Network for Scene Text Recognition

Defense Mechanism against Adversarial Attacks Using Density-Based Representation of Images

Polynomial Universal Adversarial Perturbations for Person Re-Identification

Stratified Multi-Task Learning for Robust Spotting of Scene Texts

DUET: Detection Utilizing Enhancement for Text in Scanned or Captured Documents

Gaussian Constrained Attention Network for Scene Text Recognition

Optimal Transport As a Defense against Adversarial Attacks

F-Mixup: Attack CNNs from Fourier Perspective

Recognizing Multiple Text Sequences from an Image by Pure End-To-End Learning

Local Gradient Difference Based Mass Features for Classification of 2D-3D Natural Scene Text Images

ReADS: A Rectified Attentional Double Supervised Network for Scene Text Recognition

Self-Training for Domain Adaptive Scene Text Detection

Variational Inference with Latent Space Quantization for Adversarial Resilience

Attack Agnostic Adversarial Defense via Visual Imperceptible Bound

IBN-STR: A Robust Text Recognizer for Irregular Text in Natural Scenes

Robust Lexicon-Free Confidence Prediction for Text Recognition

On-Device Text Image Super Resolution

Accuracy-Perturbation Curves for Evaluation of Adversarial Attack and Defence Methods

Adaptive Noise Injection for Training Stochastic Student Networks from Deterministic Teachers

Task-based Focal Loss for Adversarially Robust Meta-Learning

On the Robustness of 3D Human Pose Estimation

MagnifierNet: Learning Efficient Small-Scale Pedestrian Detector towards Multiple Dense Regions

Text Recognition - Real World Data and Where to Find Them

MEAN: A Multi-Element Attention Based Network for Scene Text Recognition

Boosting High-Level Vision with Joint Compression Artifacts Reduction and Super-Resolution

Beyond Cross-Entropy: Learning Highly Separable Feature Distributions for Robust and Accurate Classification

A Delayed Elastic-Net Approach for Performing Adversarial Attacks

2D License Plate Recognition based on Automatic Perspective Rectification

Tiny Object Detection in Aerial Images

Explain2Attack: Text Adversarial Attacks via Cross-Domain Interpretability

Adversarially Training for Audio Classifiers

Cascade Saliency Attention Network for Object Detection in Remote Sensing Images

Small Object Detection by Generative and Discriminative Learning

Weakly Supervised Attention Rectification for Scene Text Recognition

Fast Approximate Modelling of the Next Combination Result for Stopping the Text Recognition in a Video

Dual Path Multi-Modal High-Order Features for Textual Content Based Visual Question Answering

SFPN: Semantic Feature Pyramid Network for Object Detection

Transformer-Encoder Detector Module: Using Context to Improve Robustness to Adversarial Attacks on Object Detection