ICPR2020 Paper Browser

Deep Multiple Instance Learning with Spatial Attention for ROP Case Classification, Instance Selection and Abnormality Localization

Xirong Li, Wencui Wan, Yang Zhou, Jianchun Zhao, Qijie Wei, Junbo Rong, Pengyi Zhou, Limin Xu, Lijuan Lang, Yuying Liu, Chengzhi Niu, Dayong Ding, Xuemin Jin

Auto-TLDR; MIL-SA: Deep Multiple Instance Learning for Automated Screening of Retinopathy of Prematurity

Abstract

This paper tackles automated screening of Retinopathy of Prematurity (ROP), one of the most common causes of visual loss in childhood. Clinically, ROP screening per case requires multiple color fundus images capturing different zones of the premature retina. A desirable model should not only make a decision at the case level, but also pinpoint which instances, and which parts of those instances, are responsible for the decision. This paper makes the first attempt to accomplish three tasks, i.e., ROP case classification, instance selection and abnormality localization, in a unified framework. To that end, we propose a new model that effectively combines instance-attention based deep multiple instance learning (MIL) and spatial attention (SA). The proposed model, which we term MIL-SA, identifies positive instances according to their contributions to the case-level decision. Meanwhile, abnormal regions in the identified instances are automatically localized by the SA mechanism. Moreover, MIL-SA is learned exclusively from case-level binary labels, in an end-to-end manner. Experiments on a large clinical dataset of 2,186 cases with 11,053 fundus images show the viability of the proposed model for all three tasks.
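
The abstract gives no architectural details, so the following is only a rough sketch of how spatial attention and instance attention could be chained so that one case-level label drives both. It assumes generic attention-weighted pooling, which may differ from what MIL-SA actually does; the function name `mil_sa_forward` and the weight vectors `w_spatial`, `w_instance` and `w_cls` are hypothetical and do not come from the paper.

```python
import numpy as np


def softmax(x, axis=-1):
    """Numerically stable softmax along the given axis."""
    x = x - x.max(axis=axis, keepdims=True)
    e = np.exp(x)
    return e / e.sum(axis=axis, keepdims=True)


def mil_sa_forward(feature_maps, w_spatial, w_instance, w_cls, b_cls=0.0):
    """Hypothetical forward pass for one case (a bag of fundus images).

    feature_maps: array of shape (n_instances, H, W, C), assumed to come
    from some CNN backbone that is not modeled here.
    """
    n, h, w, c = feature_maps.shape
    flat = feature_maps.reshape(n, h * w, c)

    # Spatial attention: one weight per location, normalized within each image.
    spatial_att = softmax(flat @ w_spatial, axis=1)             # (n, H*W)
    inst_feat = (spatial_att[..., None] * flat).sum(axis=1)     # (n, C)

    # Instance attention: one weight per image, normalized across the case.
    inst_att = softmax(inst_feat @ w_instance, axis=0)          # (n,)
    case_feat = inst_att @ inst_feat                            # (C,)

    # Case-level probability from a single logistic unit.
    prob = 1.0 / (1.0 + np.exp(-(case_feat @ w_cls + b_cls)))
    return prob, inst_att, spatial_att.reshape(n, h, w)


# Usage with random stand-in features: 5 images, 4x4 feature maps, 8 channels.
rng = np.random.default_rng(0)
maps = rng.normal(size=(5, 4, 4, 8))
prob, inst_att, spatial_att = mil_sa_forward(
    maps, rng.normal(size=8), rng.normal(size=8), rng.normal(size=8)
)
```

In this sketch the instance weights `inst_att` would serve for instance selection and the per-image maps `spatial_att` for localization, while only `prob` would be compared against the case-level label during training.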

Similar papers

Learn to Segment Retinal Lesions and Beyond

Qijie Wei, Xirong Li, Weihong Yu, Xiao Zhang, Yongpeng Zhang, Bojie Hu, Bin Mo, Di Gong, Ning Chen, Dayong Ding, Youxin Chen

Auto-TLDR; Multi-task Lesion Segmentation and Disease Classification for Diabetic Retinopathy Grading

Towards automated retinal screening, this paper makes an endeavor to simultaneously achieve pixel-level retinal lesion segmentation and image-level disease classification. Such a multi-task approach is crucial for accurate and clinically interpretable disease diagnosis. Prior art is insufficient due to three challenges, i.e., lesions lacking objective boundaries, clinical importance of lesions irrelevant to their size, and the lack of one-to-one correspondence between lesion and disease classes. This paper attacks the three challenges in the context of diabetic retinopathy (DR) grading. We propose Lesion-Net, a new variant of fully convolutional networks, with its expansive path re-designed to tackle the first challenge. A dual Dice loss that leverages both semantic segmentation and image classification losses is introduced to resolve the second challenge. Lastly, we build a multi-task network that employs Lesion-Net as a side-attention branch for both DR grading and result interpretation. A set of 12K fundus images is manually segmented by 45 ophthalmologists for 8 DR-related lesions, resulting in 290K manual segments in total. Extensive experiments on this large-scale dataset show that our proposed approach surpasses the prior art for multiple tasks including lesion segmentation, lesion classification and DR grading.
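
The abstract does not state the form of the dual Dice loss. For orientation only, the standard soft Dice loss for a single binary mask can be sketched as below. This is not the Lesion-Net loss; the function name `soft_dice_loss` and the smoothing term `eps` are assumptions of this sketch.

```python
import numpy as np


def soft_dice_loss(pred, target, eps=1e-6):
    """Standard soft Dice loss: 1 - 2|P.T| / (|P| + |T|).

    pred:   predicted foreground probabilities in [0, 1], any shape.
    target: binary ground-truth mask of the same shape.
    eps:    small constant that keeps the ratio defined for empty masks.
    """
    pred = np.asarray(pred, dtype=float).ravel()
    target = np.asarray(target, dtype=float).ravel()
    intersection = (pred * target).sum()
    dice = (2.0 * intersection + eps) / (pred.sum() + target.sum() + eps)
    return 1.0 - dice
```

Because the Dice ratio is normalized by the combined mask size, a small lesion that is missed entirely is penalized as heavily as a large one, which is one common reason Dice-style losses are used for lesions of widely varying size.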

Robust Localization of Retinal Lesions Via Weakly-Supervised Learning

Ruohan Zhao, Qin Li, Jane You

Auto-TLDR; Weakly Learning of Lesions in Fundus Images Using Multi-level Feature Maps and Classification Score

Zoom-CAM: Generating Fine-Grained Pixel Annotations from Image Labels

SA-UNet: Spatial Attention U-Net for Retinal Vessel Segmentation

Attention Based Multi-Instance Thyroid Cytopathological Diagnosis with Multi-Scale Feature Fusion

Skin Lesion Classification Using Weakly-Supervised Fine-Grained Method

Dual-Attention Guided Dropblock Module for Weakly Supervised Object Localization

Fine-Tuning Convolutional Neural Networks: A Comprehensive Guide and Benchmark Analysis for Glaucoma Screening

Supporting Skin Lesion Diagnosis with Content-Based Image Retrieval

Aggregating Object Features Based on Attention Weights for Fine-Grained Image Retrieval

Sparse Network Inversion for Key Instance Detection in Multiple Instance Learning

Attention-Based Selection Strategy for Weakly Supervised Object Localization

ACRM: Attention Cascade R-CNN with Mix-NMS for Metallic Surface Defect Detection

Semi-Supervised Generative Adversarial Networks with a Pair of Complementary Generators for Retinopathy Screening

Transfer Learning through Weighted Loss Function and Group Normalization for Vessel Segmentation from Retinal Images

A Comparison of Neural Network Approaches for Melanoma Classification

Activity and Relationship Modeling Driven Weakly Supervised Object Detection

Multi-Order Feature Statistical Model for Fine-Grained Visual Categorization

Inception Based Deep Learning Architecture for Tuberculosis Screening of Chest X-Rays

Classify Breast Histopathology Images with Ductal Instance-Oriented Pipeline

SAT-Net: Self-Attention and Temporal Fusion for Facial Action Unit Detection

PCANet: Pyramid Context-Aware Network for Retinal Vessel Segmentation

Global-Local Attention Network for Semantic Segmentation in Aerial Images

A Generalizable Saliency Map-Based Interpretation of Model Outcome

Efficient-Receptive Field Block with Group Spatial Attention Mechanism for Object Detection

Point In: Counting Trees with Weakly Supervised Segmentation Network

TAAN: Task-Aware Attention Network for Few-Shot Classification

Convolutional STN for Weakly Supervised Object Localization

BCAU-Net: A Novel Architecture with Binary Channel Attention Module for MRI Brain Segmentation

A Systematic Investigation on Deep Architectures for Automatic Skin Lesions Classification

A Benchmark Dataset for Segmenting Liver, Vasculature and Lesions from Large-Scale Computed Tomography Data

DARN: Deep Attentive Refinement Network for Liver Tumor Segmentation from 3D CT Volume

MFPP: Morphological Fragmental Perturbation Pyramid for Black-Box Model Explanations

Accurate Cell Segmentation in Digital Pathology Images Via Attention Enforced Networks

Multi-Scale Residual Pyramid Attention Network for Monocular Depth Estimation

CAggNet: Crossing Aggregation Network for Medical Image Segmentation

An Improved Bilinear Pooling Method for Image-Based Action Recognition

Cross-View Relation Networks for Mammogram Mass Detection

Improving Explainability of Integrated Gradients with Guided Non-Linearity

Attentive Hybrid Feature Based a Two-Step Fusion for Facial Expression Recognition

Multi-Scale and Attention Based ResNet for Heartbeat Classification

Multi-Modal Identification of State-Sponsored Propaganda on Social Media

Attention Pyramid Module for Scene Recognition

Collaborative Human Machine Attention Module for Character Recognition

Dual Stream Network with Selective Optimization for Skin Disease Recognition in Consumer Grade Images

Unsupervised Sound Source Localization From Audio-Image Pairs Using Input Gradient Map

Self and Channel Attention Network for Person Re-Identification

A New Convex Loss Function for Multiple Instance Support Vector Machines