ICPR2020 Paper Browser

Paper download is intended for registered attendees only, and is subjected to the IEEE Copyright Policy. Any other use is strongly forbidden.

Understanding Integrated Gradients with SmoothTaylor for Deep Neural Network Attribution

Gary Shing Wee Goh, Sebastian Lapuschkin, Leander Weber, Wojciech Samek, Alexander Binder

Auto-TLDR; SmoothGrad: bridging Integrated Gradients and SmoothGrad from the Taylor's theorem perspective

Abstract Slides

Integrated Gradients as an attribution method for deep neural network models offers simple implementability. However, it suffers from noisiness of explanations which affects the ease of interpretability. The SmoothGrad technique is proposed to solve the noisiness issue and smoothen the attribution maps of any gradient-based attribution method. In this paper, we present SmoothTaylor as a novel theoretical concept bridging Integrated Gradients and SmoothGrad, from the Taylor's theorem perspective. We apply the methods to the image classification problem, using the ILSVRC2012 ImageNet object recognition dataset, and a couple of pretrained image models to generate attribution maps. These attribution maps are empirically evaluated using quantitative measures for sensitivity and noise level. We further propose adaptive noising to optimize for the noise scale hyperparameter value. From our experiments, we find that the SmoothTaylor approach together with adaptive noising is able to generate better quality saliency maps with lesser noise and higher sensitivity to the relevant points in the input space as compared to Integrated Gradients.

Similar papers

Improving Explainability of Integrated Gradients with Guided Non-Linearity

Hyuk Jin Kwon, Hyung Il Koo, Nam Ik Cho

Auto-TLDR; Guided Non-linearity for Attribution in Convolutional Neural Networks

Understanding Integrated Gradients with SmoothTaylor for Deep Neural Network Attribution

Similar papers

Improving Explainability of Integrated Gradients with Guided Non-Linearity

MFPP: Morphological Fragmental Perturbation Pyramid for Black-Box Model Explanations

A Generalizable Saliency Map-Based Interpretation of Model Outcome

Combining Similarity and Adversarial Learning to Generate Visual Explanation: Application to Medical Image Classification

How Does DCNN Make Decisions?

Color, Edge, and Pixel-Wise Explanation of Predictions Based onInterpretable Neural Network Model

Auto Encoding Explanatory Examples with Stochastic Paths

Adaptive Noise Injection for Training Stochastic Student Networks from Deterministic Teachers

From Early Biological Models to CNNs: Do They Look Where Humans Look?

Zoom-CAM: Generating Fine-Grained Pixel Annotations from Image Labels

A Multilinear Sampling Algorithm to Estimate Shapley Values

Generalization Comparison of Deep Neural Networks Via Output Sensitivity

Explainable Feature Embedding Using Convolutional Neural Networks for Pathological Image Analysis

Improving Batch Normalization with Skewness Reduction for Deep Neural Networks

Towards Explaining Adversarial Examples Phenomenon in Artificial Neural Networks

Verifying the Causes of Adversarial Examples

On-Manifold Adversarial Data Augmentation Improves Uncertainty Calibration

Adversarially Training for Audio Classifiers

Contextual Classification Using Self-Supervised Auxiliary Models for Deep Neural Networks

Accuracy-Perturbation Curves for Evaluation of Adversarial Attack and Defence Methods

InsideBias: Measuring Bias in Deep Networks and Application to Face Gender Biometrics

Classifying Eye-Tracking Data Using Saliency Maps

Deep Learning on Active Sonar Data Using Bayesian Optimization for Hyperparameter Tuning

FastSal: A Computationally Efficient Network for Visual Saliency Prediction

Optimal Transport As a Defense against Adversarial Attacks

Confidence Calibration for Deep Renal Biopsy Immunofluorescence Image Classification

Explanation-Guided Training for Cross-Domain Few-Shot Classification

Boundary Optimised Samples Training for Detecting Out-Of-Distribution Images

HFP: Hardware-Aware Filter Pruning for Deep Convolutional Neural Networks Acceleration

Norm Loss: An Efficient yet Effective Regularization Method for Deep Neural Networks

CCA: Exploring the Possibility of Contextual Camouflage Attack on Object Detection

Attack-Agnostic Adversarial Detection on Medical Data Using Explainable Machine Learning

Kernel-Based LIME with Feature Dependency Sampling

Beyond Cross-Entropy: Learning Highly Separable Feature Distributions for Robust and Accurate Classification

RNN Training along Locally Optimal Trajectories via Frank-Wolfe Algorithm

Neuron-Based Network Pruning Based on Majority Voting

Probability Guided Maxout

The Color Out of Space: Learning Self-Supervised Representations for Earth Observation Imagery

Towards Tackling Multi-Label Imbalances in Remote Sensing Imagery

An Invariance-Guided Stability Criterion for Time Series Clustering Validation

Trainable Spectrally Initializable Matrix Transformations in Convolutional Neural Networks

F-Mixup: Attack CNNs from Fourier Perspective

A Delayed Elastic-Net Approach for Performing Adversarial Attacks

Defense Mechanism against Adversarial Attacks Using Density-Based Representation of Images

Variational Inference with Latent Space Quantization for Adversarial Resilience

Revisiting Graph Neural Networks: Graph Filtering Perspective

Bayesian Active Learning for Maximal Information Gain on Model Parameters

Transformer-Encoder Detector Module: Using Context to Improve Robustness to Adversarial Attacks on Object Detection