ICPR2020 Paper Browser

Paper download is intended for registered attendees only and is subject to the IEEE Copyright Policy. Any other use is strictly forbidden.

Filter Pruning Using Hierarchical Group Sparse Regularization for Deep Convolutional Neural Networks

Kakeru Mitsuno, Takio Kurita

Auto-TLDR; Hierarchical Group Sparse Regularization for Sparse Convolutional Neural Networks

Because convolutional neural networks are often trained with redundant parameters, redundant kernels or filters can be removed to obtain a compact network without reducing classification accuracy. In this paper, we propose a filter pruning method that uses hierarchical group sparse regularization. Our previous work showed that hierarchical group sparse regularization is effective for obtaining sparse networks in which the filters connected to unnecessary channels are automatically driven close to zero. After the convolutional neural network is trained with hierarchical group sparse regularization, unnecessary filters are selected according to the increase in classification loss on randomly selected training samples, and they are removed to obtain a compact network. The proposed method removes more than 50% of the parameters of ResNet for CIFAR-10 with only a 0.3% decrease in test accuracy. In addition, 34% of the parameters of ResNet are removed for TinyImageNet-200 while achieving higher accuracy than the baseline network.
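The two steps named in the abstract, a group sparse penalty during training and filter selection by loss increase on randomly selected training samples, can be illustrated with a rough sketch. Everything below is an assumption made for illustration and is not the authors' implementation: the abstract does not give the hierarchical form of the regularizer, so a plain group lasso (sum of per-filter L2 norms) stands in for it; the model is a toy one-layer network rather than ResNet; and the function names, the `tolerance` threshold, and the rule "prune when the loss increase is at most `tolerance`" are all hypothetical.

```python
import math
import random


def group_sparse_penalty(filters):
    """Plain group lasso (a stand-in, not the paper's hierarchical variant):
    the sum of the L2 norms of each filter's weights."""
    return sum(math.sqrt(sum(w * w for w in f)) for f in filters)


def forward(filters, out_weights, x, keep):
    """Toy one-layer model: ReLU filter responses combined linearly.
    Filters whose keep flag is False contribute nothing."""
    total = 0.0
    for f, v, k in zip(filters, out_weights, keep):
        if k:
            total += v * max(0.0, sum(w * xi for w, xi in zip(f, x)))
    return total


def mean_loss(filters, out_weights, samples, keep):
    """Mean squared error over (input, target) pairs."""
    return sum((forward(filters, out_weights, x, keep) - y) ** 2
               for x, y in samples) / len(samples)


def loss_increase_per_filter(filters, out_weights, samples):
    """Loss change caused by removing each filter on its own."""
    keep_all = [True] * len(filters)
    base = mean_loss(filters, out_weights, samples, keep_all)
    increases = []
    for i in range(len(filters)):
        keep = keep_all.copy()
        keep[i] = False
        increases.append(mean_loss(filters, out_weights, samples, keep) - base)
    return increases


def select_prunable(increases, tolerance):
    """Hypothetical rule: a filter is prunable if removing it raises the
    loss by at most `tolerance`."""
    return [i for i, d in enumerate(increases) if d <= tolerance]


# Toy setup: two useful filters and one whose weights are close to zero,
# as a sparsity-inducing penalty might leave them after training.
filters = [[1.0, 0.0], [0.0, 1.0], [1e-4, -1e-4]]
out_weights = [1.0, 1.0, 1.0]

rng = random.Random(0)
inputs = [[rng.random(), rng.random()] for _ in range(200)]
# Targets come from the two useful filters only.
training_set = [(x, forward(filters, out_weights, x, [True, True, False]))
                for x in inputs]

# Score filters on a random subset of the training samples.
subset = rng.sample(training_set, 32)
penalty = group_sparse_penalty(filters)
increases = loss_increase_per_filter(filters, out_weights, subset)
prunable = select_prunable(increases, tolerance=1e-3)
```

In this toy setup only the near-zero filter ends up in `prunable`, because removing either of the other two changes the prediction by the full value of one input coordinate. A real pipeline would score convolutional filters of a trained network and would typically fine-tune after pruning; neither is shown here.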

Similar papers

Channel Planting for Deep Neural Networks Using Knowledge Distillation

Kakeru Mitsuno, Yuichiro Nomura, Takio Kurita

Auto-TLDR; Incremental Training for Deep Neural Networks with Knowledge Distillation

HFP: Hardware-Aware Filter Pruning for Deep Convolutional Neural Networks Acceleration

Slimming ResNet by Slimming Shortcut

Learning to Prune in Training via Dynamic Channel Propagation

A Discriminant Information Approach to Deep Neural Network Pruning

Softer Pruning, Incremental Regularization

MINT: Deep Network Compression Via Mutual Information-Based Neuron Trimming

Activation Density Driven Efficient Pruning in Training

On the Information of Feature Maps and Pruning of Deep Neural Networks

Progressive Gradient Pruning for Classification, Detection and Domain Adaptation

Learning Sparse Deep Neural Networks Using Efficient Structured Projections on Convex Constraints for Green AI

Neuron-Based Network Pruning Based on Majority Voting

Compression of YOLOv3 Via Block-Wise and Channel-Wise Pruning for Real-Time and Complicated Autonomous Driving Environment Sensing Applications

Speeding-Up Pruning for Artificial Neural Networks: Introducing Accelerated Iterative Magnitude Pruning

Exploiting Non-Linear Redundancy for Neural Model Compression

Exploiting Elasticity in Tensor Ranks for Compressing Neural Networks

Attention Based Pruning for Shift Networks

Norm Loss: An Efficient yet Effective Regularization Method for Deep Neural Networks

WeightAlign: Normalizing Activations by Weight Alignment

CQNN: Convolutional Quadratic Neural Networks

Towards Low-Bit Quantization of Deep Neural Networks with Limited Data

Dynamic Multi-Path Neural Network

How Does DCNN Make Decisions?

Compact CNN Structure Learning by Knowledge Distillation

Is the Meta-Learning Idea Able to Improve the Generalization of Deep Neural Networks on the Standard Supervised Learning?

Generalization Comparison of Deep Neural Networks Via Output Sensitivity

VPU Specific CNNs through Neural Architecture Search

Selecting Useful Knowledge from Previous Tasks for Future Learning in a Single Network

Beyond Cross-Entropy: Learning Highly Separable Feature Distributions for Robust and Accurate Classification

Efficient Online Subclass Knowledge Distillation for Image Classification

Fine-Tuning DARTS for Image Classification

Contextual Classification Using Self-Supervised Auxiliary Models for Deep Neural Networks

MaxDropout: Deep Neural Network Regularization Based on Maximum Output Values

Meta Soft Label Generation for Noisy Labels

ResNet-Like Architecture with Low Hardware Requirements

Operation and Topology Aware Fast Differentiable Architecture Search

Improving Batch Normalization with Skewness Reduction for Deep Neural Networks

Feature-Dependent Cross-Connections in Multi-Path Neural Networks

E-DNAS: Differentiable Neural Architecture Search for Embedded Systems

Improved Residual Networks for Image and Video Recognition

A Close Look at Deep Learning with Small Data

Can Data Placement Be Effective for Neural Networks Classification Tasks? Introducing the Orthogonal Loss

Resource-efficient DNNs for Keyword Spotting using Neural Architecture Search and Quantization

Fast and Efficient Neural Network for Light Field Disparity Estimation

Filtered Batch Normalization

Not All Domains Are Equally Complex: Adaptive Multi-Domain Learning

Hcore-Init: Neural Network Initialization Based on Graph Degeneracy

Transitional Asymmetric Non-Local Neural Networks for Real-World Dirt Road Segmentation