ICPR2020 Paper Browser

Paper download is intended for registered attendees only and is subject to the IEEE Copyright Policy. Any other use is strictly forbidden.

Activation Density Driven Efficient Pruning in Training

Timothy Foldy-Porto, Yeshwanth Venkatesha, Priyadarshini Panda

Auto-TLDR; Real-Time Neural Network Pruning with Compressed Networks

Neural network pruning with suitable retraining can yield networks with considerably fewer parameters than the original at comparable accuracy. Typical pruning methods require a large, fully trained network as a starting point, from which they perform a time-intensive iterative pruning and retraining procedure to regain the original accuracy. We propose a novel pruning method that prunes a network in real time during training, reducing the overall training time needed to obtain an efficient compressed network. We introduce an activation-density-based analysis to identify the optimal relative sizing or compression for each layer of the network. Our method is architecture agnostic, allowing it to be employed on a wide variety of systems. For VGG-19 and ResNet18 on CIFAR-10, CIFAR-100, and TinyImageNet, we obtain exceedingly sparse networks (up to $200\times$ reduction in parameters and over $60\times$ reduction in inference compute operations in the best case) with accuracy comparable to the baseline network. By reducing the network size periodically during training, we achieve total training times that are shorter than those of previously proposed pruning methods. Furthermore, training compressed networks at different epochs with our proposed method yields a considerable reduction in training compute complexity ($1.6\times$ to $3.2\times$ lower) at near iso-accuracy compared to a baseline network trained entirely from scratch.
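
The abstract does not spell out how activation density is computed or how it drives layer sizing. As a rough illustration only, the sketch below assumes activation density means the fraction of nonzero post-ReLU activations in a layer over a batch, and that a layer is resized in proportion to that fraction. The names `activation_density` and `resized_width`, and the proportional sizing rule, are hypothetical and not taken from the paper.

```python
import math
from typing import Sequence


def activation_density(activations: Sequence[Sequence[float]]) -> float:
    """Fraction of nonzero entries in a batch of post-ReLU activations.

    Assumption: this is what "activation density" refers to; the
    abstract itself gives no formula.
    """
    total = 0
    nonzero = 0
    for sample in activations:
        for value in sample:
            total += 1
            if value != 0.0:
                nonzero += 1
    if total == 0:
        raise ValueError("no activations given")
    return nonzero / total


def resized_width(current_width: int, density: float) -> int:
    """Hypothetical sizing rule: keep about density * width units, at least one."""
    return max(1, math.ceil(current_width * density))


# Toy batch: 2 samples from a layer with 4 units, values taken after ReLU.
batch = [
    [0.0, 1.2, 0.0, 0.0],
    [0.0, 0.7, 0.3, 0.0],
]
density = activation_density(batch)   # 3 nonzero values out of 8
new_width = resized_width(4, density)
print(density, new_width)
```

In a scheme of this kind, the density would be measured periodically during training and each layer shrunk accordingly before training continues, which is one way to read "reducing the network size periodically during training".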

Similar papers

Speeding-Up Pruning for Artificial Neural Networks: Introducing Accelerated Iterative Magnitude Pruning

Marco Zullich, Eric Medvet, Felice Andrea Pellegrino, Alessio Ansuini

Auto-TLDR; Iterative Pruning of Artificial Neural Networks with Overparametrization

Learning to Prune in Training via Dynamic Channel Propagation

Neuron-Based Network Pruning Based on Majority Voting

Progressive Gradient Pruning for Classification, Detection and Domain Adaptation

Slimming ResNet by Slimming Shortcut

Filter Pruning Using Hierarchical Group Sparse Regularization for Deep Convolutional Neural Networks

Exploiting Non-Linear Redundancy for Neural Model Compression

HFP: Hardware-Aware Filter Pruning for Deep Convolutional Neural Networks Acceleration

On the Information of Feature Maps and Pruning of Deep Neural Networks

A Discriminant Information Approach to Deep Neural Network Pruning

Softer Pruning, Incremental Regularization

MINT: Deep Network Compression Via Mutual Information-Based Neuron Trimming

Attention Based Pruning for Shift Networks

Compression Strategies and Space-Conscious Representations for Deep Neural Networks

Exploiting Elasticity in Tensor Ranks for Compressing Neural Networks

Learning Sparse Deep Neural Networks Using Efficient Structured Projections on Convex Constraints for Green AI

Stage-Wise Neural Architecture Search

Channel Planting for Deep Neural Networks Using Knowledge Distillation

VPU Specific CNNs through Neural Architecture Search

How Does DCNN Make Decisions?

Compact CNN Structure Learning by Knowledge Distillation

Compression of YOLOv3 Via Block-Wise and Channel-Wise Pruning for Real-Time and Complicated Autonomous Driving Environment Sensing Applications

Operation and Topology Aware Fast Differentiable Architecture Search

Dynamic Multi-Path Neural Network

Rethinking of Deep Models Parameters with Respect to Data Distribution

Adaptive Noise Injection for Training Stochastic Student Networks from Deterministic Teachers

Resource-efficient DNNs for Keyword Spotting using Neural Architecture Search and Quantization

ResNet-Like Architecture with Low Hardware Requirements

MaxDropout: Deep Neural Network Regularization Based on Maximum Output Values

Enhancing Semantic Segmentation of Aerial Images with Inhibitory Neurons

Trainable Spectrally Initializable Matrix Transformations in Convolutional Neural Networks

Towards Low-Bit Quantization of Deep Neural Networks with Limited Data

Improving Batch Normalization with Skewness Reduction for Deep Neural Networks

CQNN: Convolutional Quadratic Neural Networks

Generalization Comparison of Deep Neural Networks Via Output Sensitivity

WeightAlign: Normalizing Activations by Weight Alignment

Is the Meta-Learning Idea Able to Improve the Generalization of Deep Neural Networks on the Standard Supervised Learning?

Improved Residual Networks for Image and Video Recognition

Norm Loss: An Efficient yet Effective Regularization Method for Deep Neural Networks

Contextual Classification Using Self-Supervised Auxiliary Models for Deep Neural Networks

Attention As Activation

E-DNAS: Differentiable Neural Architecture Search for Embedded Systems

Fast and Accurate Real-Time Semantic Segmentation with Dilated Asymmetric Convolutions

Probability Guided Maxout

Revisiting the Training of Very Deep Neural Networks without Skip Connections

Delving in the Loss Landscape to Embed Robust Watermarks into Neural Networks

Feature-Dependent Cross-Connections in Multi-Path Neural Networks

Iterative Label Improvement: Robust Training by Confidence Based Filtering and Dataset Partitioning