ICPR2020 Paper Browser

Paper download is intended for registered attendees only, and is subjected to the IEEE Copyright Policy. Any other use is strongly forbidden.

Is the Meta-Learning Idea Able to Improve the Generalization of Deep Neural Networks on the Standard Supervised Learning?

Xiang Deng, Zhongfei Zhang

Auto-TLDR; Meta-learning Based Training of Deep Neural Networks for Few-Shot Learning

Abstract Slides Poster

Substantial efforts have been made on improving the generalization abilities of deep neural networks (DNNs) in order to obtain better performances without introducing more parameters. On the other hand, meta-learning approaches exhibit powerful generalization on new tasks in few-shot learning. Intuitively, few-shot learning is more challenging than the standard supervised learning as each target class only has a very few or no training samples. The natural question that arises is whether the meta-learning idea can be used for improving the generalization of DNNs on the standard supervised learning. In this paper, we propose a novel meta-learning based training procedure (MLTP) for DNNs and demonstrate that the meta-learning idea can indeed improve the generalization abilities of DNNs. MLTP simulates the meta-training process by considering a batch of training samples as a task. The key idea is that the gradient descent step for improving the current task performance should also improve a new task performance, which is ignored by the current standard procedure for training neural networks. MLTP also benefits from all the existing training techniques such as dropout, weight decay, and batch normalization. We evaluate MLTP by training a variety of small and large neural networks on three benchmark datasets, i.e., CIFAR-10, CIFAR-100, and Tiny ImageNet. The experimental results show a consistently improved generalization performance on all the DNNs with different sizes, which verifies the promise of MLTP and demonstrates that the meta-learning idea is indeed able to improve the generalization of DNNs on the standard supervised learning.

Similar papers

MetaMix: Improved Meta-Learning with Interpolation-based Consistency Regularization

Yangbin Chen, Yun Ma, Tom Ko, Jianping Wang, Qing Li

Auto-TLDR; MetaMix: A Meta-Agnostic Meta-Learning Algorithm for Few-Shot Classification

Is the Meta-Learning Idea Able to Improve the Generalization of Deep Neural Networks on the Standard Supervised Learning?

Similar papers

MetaMix: Improved Meta-Learning with Interpolation-based Consistency Regularization

Generalization Comparison of Deep Neural Networks Via Output Sensitivity

Norm Loss: An Efficient yet Effective Regularization Method for Deep Neural Networks

Meta Soft Label Generation for Noisy Labels

Complementing Representation Deficiency in Few-Shot Image Classification: A Meta-Learning Approach

Meta Learning Via Learned Loss

Channel Planting for Deep Neural Networks Using Knowledge Distillation

MaxDropout: Deep Neural Network Regularization Based on Maximum Output Values

A Close Look at Deep Learning with Small Data

WeightAlign: Normalizing Activations by Weight Alignment

Can Data Placement Be Effective for Neural Networks Classification Tasks? Introducing the Orthogonal Loss

Meta Generalized Network for Few-Shot Classification

Improving Batch Normalization with Skewness Reduction for Deep Neural Networks

ARCADe: A Rapid Continual Anomaly Detector

Knowledge Distillation Beyond Model Compression

Task-based Focal Loss for Adversarially Robust Meta-Learning

Learning to Prune in Training via Dynamic Channel Propagation

Pose-Robust Face Recognition by Deep Meta Capsule Network-Based Equivariant Embedding

Adaptive Noise Injection for Training Stochastic Student Networks from Deterministic Teachers

Revisiting the Training of Very Deep Neural Networks without Skip Connections

Iterative Label Improvement: Robust Training by Confidence Based Filtering and Dataset Partitioning

On the Information of Feature Maps and Pruning of Deep Neural Networks

TAAN: Task-Aware Attention Network for Few-Shot Classification

Rethinking Experience Replay: A Bag of Tricks for Continual Learning

Fine-Tuning DARTS for Image Classification

P-DIFF: Learning Classifier with Noisy Labels Based on Probability Difference Distributions

Softer Pruning, Incremental Regularization

E-DNAS: Differentiable Neural Architecture Search for Embedded Systems

Dynamic Multi-Path Neural Network

Graph-Based Interpolation of Feature Vectors for Accurate Few-Shot Classification

Filter Pruning Using Hierarchical Group Sparse Regularization for Deep Convolutional Neural Networks

Rethinking of Deep Models Parameters with Respect to Data Distribution

Efficient Online Subclass Knowledge Distillation for Image Classification

Hcore-Init: Neural Network Initialization Based on Graph Degeneracy

Slimming ResNet by Slimming Shortcut

Improving Model Accuracy for Imbalanced Image Classification Tasks by Adding a Final Batch Normalization Layer: An Empirical Study

HFP: Hardware-Aware Filter Pruning for Deep Convolutional Neural Networks Acceleration

How Does DCNN Make Decisions?

Knowledge Distillation with a Precise Teacher and Prediction with Abstention

Beyond Cross-Entropy: Learning Highly Separable Feature Distributions for Robust and Accurate Classification

Local Clustering with Mean Teacher for Semi-Supervised Learning

Image Representation Learning by Transformation Regression

Filtered Batch Normalization

Learning Sparse Deep Neural Networks Using Efficient Structured Projections on Convex Constraints for Green AI

Augmented Bi-Path Network for Few-Shot Learning

Exploiting Elasticity in Tensor Ranks for Compressing Neural Networks

CQNN: Convolutional Quadratic Neural Networks

Neuron-Based Network Pruning Based on Majority Voting