ICPR2020 Paper Browser

Paper download is intended for registered attendees only, and is subjected to the IEEE Copyright Policy. Any other use is strongly forbidden.

Learning Stable Deep Predictive Coding Networks with Weight Norm Supervision

Guo Ruohao

Auto-TLDR; Stability of Predictive Coding Network with Weight Norm Supervision

Abstract Slides Poster

Predictive Coding Network (PCN) is an important neural network inspired by visual processing models in neuroscience. It combines the feedforward and feedback processing and has the architecture of recurrent neural networks (RNNs). This type of network is usually trained with backpropagation through time (BPTT). With infinite recurrent steps, PCN is a dynamic system. However, as one of the most important properties, stability is rarely studied in this type of network. Inspired by reservoir computing, we investigate the stability of hierarchical RNNs from the perspective of dynamic systems, and propose a sufficient condition for their echo state property (ESP). Our study shows the global stability is determined by stability of the local layers and the feedback between neighboring layers. Based on it, we further propose Weight Norm Supervision, a new algorithm that controls the stability of PCN dynamics by imposing different weight norm constraints on different parts of the network. We compare our approach with other training methods in terms of stability and prediction capability. The experiments show that our algorithm learns stable PCNs with a reliable prediction precision in the most effective and controllable way.

Similar papers

RNN Training along Locally Optimal Trajectories via Frank-Wolfe Algorithm

Yun Yue, Ming Li, Venkatesh Saligrama, Ziming Zhang

Auto-TLDR; Frank-Wolfe Algorithm for Efficient Training of RNNs

Learning Stable Deep Predictive Coding Networks with Weight Norm Supervision

Similar papers

RNN Training along Locally Optimal Trajectories via Frank-Wolfe Algorithm

Norm Loss: An Efficient yet Effective Regularization Method for Deep Neural Networks

Revisiting the Training of Very Deep Neural Networks without Skip Connections

Learning Sparse Deep Neural Networks Using Efficient Structured Projections on Convex Constraints for Green AI

On the Global Self-attention Mechanism for Graph Convolutional Networks

Regularized Flexible Activation Function Combinations for Deep Neural Networks

Exploiting Non-Linear Redundancy for Neural Model Compression

Stochastic Runge-Kutta Methods and Adaptive SGD-G2 Stochastic Gradient Descent

Switching Dynamical Systems with Deep Neural Networks

Classification and Feature Selection Using a Primal-Dual Method and Projections on Structured Constraints

VOWEL: A Local Online Learning Rule for Recurrent Networks of Probabilistic Spiking Winner-Take-All Circuits

Energy Minimum Regularization in Continual Learning

Neuron-Based Network Pruning Based on Majority Voting

Image Representation Learning by Transformation Regression

An Efficient Empirical Solver for Localized Multiple Kernel Learning Via DNNs

Feature Engineering and Stacked Echo State Networks for Musical Onset Detection

Dimensionality Reduction for Data Visualization and Linear Classification, and the Trade-Off between Robustness and Classification Accuracy

Class-Incremental Learning with Pre-Allocated Fixed Classifiers

ResNet-Like Architecture with Low Hardware Requirements

Compression Strategies and Space-Conscious Representations for Deep Neural Networks

Learning to Prune in Training via Dynamic Channel Propagation

Low-Cost Lipschitz-Independent Adaptive Importance Sampling of Stochastic Gradients

Semi-Supervised Class Incremental Learning

WeightAlign: Normalizing Activations by Weight Alignment

Understanding When Spatial Transformer Networks Do Not Support Invariance, and What to Do about It

Local Clustering with Mean Teacher for Semi-Supervised Learning

Auto Encoding Explanatory Examples with Stochastic Paths

Recursive Convolutional Neural Networks for Epigenomics

Quaternion Capsule Networks

Learning to Take Directions One Step at a Time

Trainable Spectrally Initializable Matrix Transformations in Convolutional Neural Networks

Meta Learning Via Learned Loss

Unveiling Groups of Related Tasks in Multi-Task Learning

Can Data Placement Be Effective for Neural Networks Classification Tasks? Introducing the Orthogonal Loss

Learning Connectivity with Graph Convolutional Networks

Hcore-Init: Neural Network Initialization Based on Graph Degeneracy

Constructing Geographic and Long-term Temporal Graph for Traffic Forecasting

A Close Look at Deep Learning with Small Data

DAG-Net: Double Attentive Graph Neural Network for Trajectory Forecasting

MA-LSTM: A Multi-Attention Based LSTM for Complex Pattern Extraction

AG-GAN: An Attentive Group-Aware GAN for Pedestrian Trajectory Prediction

Generalization Comparison of Deep Neural Networks Via Output Sensitivity

Improving Batch Normalization with Skewness Reduction for Deep Neural Networks

Mutual Information Based Method for Unsupervised Disentanglement of Video Representation

Reducing the Variance of Variational Estimates of Mutual Information by Limiting the Critic's Hypothesis Space to RKHS

Locality-Promoting Representation Learning

A Joint Representation Learning and Feature Modeling Approach for One-Class Recognition

A Multilinear Sampling Algorithm to Estimate Shapley Values