ICPR2020 Paper Browser

ACRM: Attention Cascade R-CNN with Mix-NMS for Metallic Surface Defect Detection

Junting Fang, Xiaoyang Tan, Yuhui Wang

Auto-TLDR; Attention Cascade R-CNN with Mix Non-Maximum Suppression for Robust Metal Defect Detection

Metallic surface defect detection is of great significance for quality control in production. However, the task is very challenging because of noise disturbance, large appearance variation, and the ambiguous definition of an individual defect. Traditional image processing methods are unable to detect the damaged region effectively and efficiently. In this paper, we propose a new defect detection method, Attention Cascade R-CNN with Mix-NMS (ACRM), to classify and locate defects robustly. Three submodules are developed to achieve this goal: 1) a lightweight attention block is introduced, which improves the ability to capture global and local features in both the spatial and channel dimensions; 2) we apply Cascade R-CNN to this task for the first time, exploiting multiple detectors to sequentially refine the detection result; 3) we introduce a new method named Mix Non-Maximum Suppression (Mix-NMS), which significantly improves the filtering of redundant detection results in our task. Extensive experiments on a real industrial dataset show that ACRM achieves state-of-the-art results compared to existing methods, demonstrating the effectiveness and robustness of our detection method.
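For readers unfamiliar with the filtering step that Mix-NMS targets, the following is a minimal sketch of standard greedy non-maximum suppression, the common baseline for removing redundant detections. It is not the Mix-NMS method, which the abstract does not specify. The function names `iou` and `greedy_nms`, the `(x1, y1, x2, y2)` box layout, and the default threshold of 0.5 are assumptions chosen for illustration.

```python
from typing import List, Sequence

Box = Sequence[float]  # assumed layout: (x1, y1, x2, y2)


def iou(a: Box, b: Box) -> float:
    """Intersection over union of two axis-aligned boxes."""
    ix1, iy1 = max(a[0], b[0]), max(a[1], b[1])
    ix2, iy2 = min(a[2], b[2]), min(a[3], b[3])
    inter = max(0.0, ix2 - ix1) * max(0.0, iy2 - iy1)
    area_a = (a[2] - a[0]) * (a[3] - a[1])
    area_b = (b[2] - b[0]) * (b[3] - b[1])
    union = area_a + area_b - inter
    return inter / union if union > 0 else 0.0


def greedy_nms(
    boxes: Sequence[Box],
    scores: Sequence[float],
    iou_threshold: float = 0.5,  # illustrative default, not from the paper
) -> List[int]:
    """Return indices of the boxes kept by standard greedy NMS.

    Boxes are visited in descending score order; a box is kept only if
    it does not overlap any already-kept box by more than the threshold.
    """
    order = sorted(range(len(boxes)), key=lambda i: scores[i], reverse=True)
    keep: List[int] = []
    for i in order:
        if all(iou(boxes[i], boxes[j]) <= iou_threshold for j in keep):
            keep.append(i)
    return keep
```

In this sketch, two heavily overlapping boxes collapse to the higher-scoring one, while a distant box survives regardless of its score. The abstract notes that individual defects are ambiguously defined, which suggests why a single fixed overlap threshold may filter defect detections poorly.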

Similar papers

EDD-Net: An Efficient Defect Detection Network

Tianyu Guo, Linlin Zhang, Runwei Ding, Ge Yang

Auto-TLDR; EfficientNet: Efficient Network for Mobile Phone Surface Defect Detection

Mobile Phone Surface Defect Detection Based on Improved Faster R-CNN

Construction Worker Hardhat-Wearing Detection Based on an Improved BiFPN

Efficient-Receptive Field Block with Group Spatial Attention Mechanism for Object Detection

Forground-Guided Vehicle Perception Framework

Object Detection Model Based on Scene-Level Region Proposal Self-Attention

A Novel Region of Interest Extraction Layer for Instance Segmentation

Detective: An Attentive Recurrent Model for Sparse Object Detection

Attention Pyramid Module for Scene Recognition

DualBox: Generating BBox Pair with Strong Correspondence Via Occlusion Pattern Clustering and Proposal Refinement

SFPN: Semantic Feature Pyramid Network for Object Detection

End-To-End Deep Learning Methods for Automated Damage Detection in Extreme Events at Various Scales

Global-Local Attention Network for Semantic Segmentation in Aerial Images

Detecting Objects with High Object Region Percentage

Nighttime Pedestrian Detection Based on Feature Attention and Transformation

SyNet: An Ensemble Network for Object Detection in UAV Images

Tiny Object Detection in Aerial Images

Scene Text Detection with Selected Anchors

Attention Stereo Matching Network

Bidirectional Matrix Feature Pyramid Network for Object Detection

Complex-Object Visual Inspection: Empirical Studies on a Multiple Lighting Solution

Dual-Attention Guided Dropblock Module for Weakly Supervised Object Localization

Arbitrary Style Transfer with Parallel Self-Attention

CenterRepp: Predict Central Representative Point Set's Distribution for Detection

Temporal Feature Enhancement Network with External Memory for Object Detection in Surveillance Video

Small Object Detection by Generative and Discriminative Learning

FeatureNMS: Non-Maximum Suppression by Learning Feature Embeddings

PSDNet: A Balanced Architecture of Accuracy and Parameters for Semantic Segmentation

Progressive Scene Segmentation Based on Self-Attention Mechanism

MagnifierNet: Learning Efficient Small-Scale Pedestrian Detector towards Multiple Dense Regions

One-Stage Multi-Task Detector for 3D Cardiac MR Imaging

PRF-Ped: Multi-Scale Pedestrian Detector with Prior-Based Receptive Field

Hybrid Cascade Point Search Network for High Precision Bar Chart Component Detection

Learning a Dynamic High-Resolution Network for Multi-Scale Pedestrian Detection

An Improved Bilinear Pooling Method for Image-Based Action Recognition

Region-Based Non-Local Operation for Video Classification

ScarfNet: Multi-Scale Features with Deeply Fused and Redistributed Semantics for Enhanced Object Detection

Skin Lesion Classification Using Weakly-Supervised Fine-Grained Method

Mutual-Supervised Feature Modulation Network for Occluded Pedestrian Detection

VTT: Long-Term Visual Tracking with Transformers

Cascade Saliency Attention Network for Object Detection in Remote Sensing Images

Adaptive Remote Sensing Image Attribute Learning for Active Object Detection

Self and Channel Attention Network for Person Re-Identification

Multi-Scale and Attention Based ResNet for Heartbeat Classification

CT-UNet: An Improved Neural Network Based on U-Net for Building Segmentation in Remote Sensing Images

Object Detection Using Dual Graph Network

StrongPose: Bottom-up and Strong Keypoint Heat Map Based Pose Estimation

Detecting Marine Species in Echograms Via Traditional, Hybrid, and Deep Learning Frameworks