ICPR2020 Paper Browser

Paper download is intended for registered attendees only, and is subjected to the IEEE Copyright Policy. Any other use is strongly forbidden.

Improving Explainability of Integrated Gradients with Guided Non-Linearity

Hyuk Jin Kwon, Hyung Il Koo, Nam Ik Cho

Auto-TLDR; Guided Non-linearity for Attribution in Convolutional Neural Networks

Abstract Slides Poster

Along with the performance improvements of neural network models, developing methods that enable the explanation of their behavior is a significant research topic. For convolutional neural networks, the explainability is usually achieved with attribution (heatmap) that visualizes pixel-level importance or contribution of input to its corresponding result. This attribution should reflect the relation (dependency) between inputs and outputs, which has been studied with a variety of methods, e.g., derivative of an output with respect to an input pixel value, a weighted sum of gradients, amount of output changes to input perturbations, and so on. In this paper, we present a new method that improves the measure of attribution, and incorporates it into the integrated gradients method. To be precise, rather than using the conventional chain-rule, we propose a method called guided non-linearity that propagates gradients more effectively through non-linear units (e.g., ReLU and max-pool) so that only positive gradients backpropagate through non-linear units. Our method is inspired by the mechanism of action potential generation in postsynaptic neurons, where the firing of action potentials depends on the sum of excitatory (EPSP) and inhibitory postsynaptic potentials (IPSP). We believe that paths consisting of EPSP-giving-neurons faithfully reflect the contribution of inputs to the output, and we make gradients flow only along those paths (i.e., paths of positive chain reactions). Experiments with 5 deep neural networks have shown that the proposed method outperforms others in terms of the deletion metrics, and yields fine-grained and more human-interpretable attribution.

Similar papers

Understanding Integrated Gradients with SmoothTaylor for Deep Neural Network Attribution

Gary Shing Wee Goh, Sebastian Lapuschkin, Leander Weber, Wojciech Samek, Alexander Binder

Auto-TLDR; SmoothGrad: bridging Integrated Gradients and SmoothGrad from the Taylor's theorem perspective

Abstract Slides Similar

Integrated Gradients as an attribution method for deep neural network models offers simple implementability. However, it suffers from noisiness of explanations which affects the ease of interpretability. The SmoothGrad technique is proposed to solve the noisiness issue and smoothen the attribution maps of any gradient-based attribution method. In this paper, we present SmoothTaylor as a novel theoretical concept bridging Integrated Gradients and SmoothGrad, from the Taylor's theorem perspective. We apply the methods to the image classification problem, using the ILSVRC2012 ImageNet object recognition dataset, and a couple of pretrained image models to generate attribution maps. These attribution maps are empirically evaluated using quantitative measures for sensitivity and noise level. We further propose adaptive noising to optimize for the noise scale hyperparameter value. From our experiments, we find that the SmoothTaylor approach together with adaptive noising is able to generate better quality saliency maps with lesser noise and higher sensitivity to the relevant points in the input space as compared to Integrated Gradients.

Zoom-CAM: Generating Fine-Grained Pixel Annotations from Image Labels

Xiangwei Shi, Seyran Khademi, Yunqiang Li, Jan Van Gemert

Auto-TLDR; Zoom-CAM for Weakly Supervised Object Localization and Segmentation

Improving Explainability of Integrated Gradients with Guided Non-Linearity

Similar papers

Understanding Integrated Gradients with SmoothTaylor for Deep Neural Network Attribution

Zoom-CAM: Generating Fine-Grained Pixel Annotations from Image Labels

MFPP: Morphological Fragmental Perturbation Pyramid for Black-Box Model Explanations

A Generalizable Saliency Map-Based Interpretation of Model Outcome

Combining Similarity and Adversarial Learning to Generate Visual Explanation: Application to Medical Image Classification

Color, Edge, and Pixel-Wise Explanation of Predictions Based onInterpretable Neural Network Model

Explainable Feature Embedding Using Convolutional Neural Networks for Pathological Image Analysis

From Early Biological Models to CNNs: Do They Look Where Humans Look?

Auto Encoding Explanatory Examples with Stochastic Paths

How Does DCNN Make Decisions?

Dual-Attention Guided Dropblock Module for Weakly Supervised Object Localization

CQNN: Convolutional Quadratic Neural Networks

Contextual Classification Using Self-Supervised Auxiliary Models for Deep Neural Networks

Neuron-Based Network Pruning Based on Majority Voting

InsideBias: Measuring Bias in Deep Networks and Application to Face Gender Biometrics

Robust Localization of Retinal Lesions Via Weakly-Supervised Learning

Explanation-Guided Training for Cross-Domain Few-Shot Classification

Context-Aware Residual Module for Image Classification

Kernel-Based LIME with Feature Dependency Sampling

CCA: Exploring the Possibility of Contextual Camouflage Attack on Object Detection

Attention Pyramid Module for Scene Recognition

Improving Batch Normalization with Skewness Reduction for Deep Neural Networks

Adaptive Image Compression Using GAN Based Semantic-Perceptual Residual Compensation

Probability Guided Maxout

Local Attention and Global Representation Collaborating for Fine-Grained Classification

Deep Multiple Instance Learning with Spatial Attention for ROP Case Classification, Instance Selection and Abnormality Localization

Unsupervised Sound Source Localization From Audio-Image Pairs Using Input Gradient Map

ResNet-Like Architecture with Low Hardware Requirements

Attack-Agnostic Adversarial Detection on Medical Data Using Explainable Machine Learning

On the Information of Feature Maps and Pruning of Deep Neural Networks

Adaptive Noise Injection for Training Stochastic Student Networks from Deterministic Teachers

Improved Residual Networks for Image and Video Recognition

Enhancing Semantic Segmentation of Aerial Images with Inhibitory Neurons

Multi-Order Feature Statistical Model for Fine-Grained Visual Categorization

Attention As Activation

Feature-Dependent Cross-Connections in Multi-Path Neural Networks

PSDNet: A Balanced Architecture of Accuracy and Parameters for Semantic Segmentation

HFP: Hardware-Aware Filter Pruning for Deep Convolutional Neural Networks Acceleration

Verifying the Causes of Adversarial Examples

Collaborative Human Machine Attention Module for Character Recognition

Coarse to Fine: Progressive and Multi-Task Learning for Salient Object Detection

Adversarially Training for Audio Classifiers

EM-Net: Deep Learning for Electron Microscopy Image Segmentation

Dynamic Multi-Path Neural Network

Single Image Super-Resolution with Dynamic Residual Connection

Polynomial Universal Adversarial Perturbations for Person Re-Identification

Documents Counterfeit Detection through a Deep Learning Approach

Generalization Comparison of Deep Neural Networks Via Output Sensitivity