ICPR2020 Paper Browser

Paper download is intended for registered attendees only, and is subjected to the IEEE Copyright Policy. Any other use is strongly forbidden.

Hcore-Init: Neural Network Initialization Based on Graph Degeneracy

Stratis Limnios, George Dasoulas, Dimitrios Thilikos, Michalis Vazirgiannis

Auto-TLDR; K-hypercore: Graph Mining for Deep Neural Networks

Abstract Slides Poster

Neural networks are the pinnacle of Artificial Intelligence, as in recent years we witnessed many novel architectures, learning and optimization techniques for deep learning. Capitalizing on the fact that neural networks inherently constitute multipartite graphs among neuron layers, we aim to analyze directly their structure to extract meaningful information that can improve the learning process. To our knowledge graph mining techniques for enhancing learning in neural networks have not been thoroughly investigated. In this paper we propose an adapted version of the k-core structure for the complete weighted multipartite graph extracted from a deep learning architecture. As a multipartite graph is a combination of bipartite graphs, that are in turn the incidence graphs of hypergraphs, we design k-hypercore decomposition, the hypergraph analogue of k-core degeneracy. We applied k-hypercore to several neural network architectures, more specifically to convolutional neural networks and multilayer perceptrons for image recognition tasks after a very short pretraining. Then we used the information provided by the hypercore numbers of the neurons to re-initialize the weights of the neural network, thus biasing the gradient optimization scheme. Extensive experiments proved that k-hypercore outperforms the state-of-the-art initialization methods.

Similar papers

Generalization Comparison of Deep Neural Networks Via Output Sensitivity

Mahsa Forouzesh, Farnood Salehi, Patrick Thiran

Auto-TLDR; Generalization of Deep Neural Networks using Sensitivity

Abstract Slides Similar

Although recent works have brought some insights into the performance improvement of techniques used in state-of-the-art deep-learning models, more work is needed to understand their generalization properties. We shed light on this matter by linking the loss function to the output's sensitivity to its input. We find a rather strong empirical relation between the output sensitivity and the variance in the bias-variance decomposition of the loss function, which hints on using sensitivity as a metric for comparing the generalization performance of networks, without requiring labeled data. We find that sensitivity is decreased by applying popular methods which improve the generalization performance of the model, such as (1) using a deep network rather than a wide one, (2) adding convolutional layers to baseline classifiers instead of adding fully-connected layers, (3) using batch normalization, dropout and max-pooling, and (4) applying parameter initialization techniques.

CQNN: Convolutional Quadratic Neural Networks

Pranav Mantini, Shishir Shah

Auto-TLDR; Quadratic Neural Network for Image Classification

Hcore-Init: Neural Network Initialization Based on Graph Degeneracy

Similar papers

Generalization Comparison of Deep Neural Networks Via Output Sensitivity

CQNN: Convolutional Quadratic Neural Networks

Can Data Placement Be Effective for Neural Networks Classification Tasks? Introducing the Orthogonal Loss

MaxDropout: Deep Neural Network Regularization Based on Maximum Output Values

Neuron-Based Network Pruning Based on Majority Voting

Classification of Intestinal Gland Cell-Graphs Using Graph Neural Networks

On the Global Self-attention Mechanism for Graph Convolutional Networks

Learning Sparse Deep Neural Networks Using Efficient Structured Projections on Convex Constraints for Green AI

Exploiting Non-Linear Redundancy for Neural Model Compression

Is the Meta-Learning Idea Able to Improve the Generalization of Deep Neural Networks on the Standard Supervised Learning?

Improving Batch Normalization with Skewness Reduction for Deep Neural Networks

Region and Relations Based Multi Attention Network for Graph Classification

ResNet-Like Architecture with Low Hardware Requirements

Filtered Batch Normalization

Color, Edge, and Pixel-Wise Explanation of Predictions Based onInterpretable Neural Network Model

On Resource-Efficient Bayesian Network Classifiers and Deep Neural Networks

Norm Loss: An Efficient yet Effective Regularization Method for Deep Neural Networks

How Does DCNN Make Decisions?

Regularized Flexible Activation Function Combinations for Deep Neural Networks

WeightAlign: Normalizing Activations by Weight Alignment

Trainable Spectrally Initializable Matrix Transformations in Convolutional Neural Networks

Supervised Domain Adaptation Using Graph Embedding

Multi-Layered Discriminative Restricted Boltzmann Machine with Untrained Probabilistic Layer

Image Representation Learning by Transformation Regression

Generation of Hypergraphs from the N-Best Parsing of 2D-Probabilistic Context-Free Grammars for Mathematical Expression Recognition

A Close Look at Deep Learning with Small Data

What Nodes Vote To? Graph Classification without Readout Phase

Revisiting the Training of Very Deep Neural Networks without Skip Connections

Edge-Aware Graph Attention Network for Ratio of Edge-User Estimation in Mobile Networks

On the Information of Feature Maps and Pruning of Deep Neural Networks

Named Entity Recognition and Relation Extraction with Graph Neural Networks in Semi Structured Documents

TreeRNN: Topology-Preserving Deep Graph Embedding and Learning

Fine-Tuning DARTS for Image Classification

Leveraging Quadratic Spherical Mutual Information Hashing for Fast Image Retrieval

Initialization Using Perlin Noise for Training Networks with a Limited Amount of Data

Graph Convolutional Neural Networks for Power Line Outage Identification

Learning to Prune in Training via Dynamic Channel Propagation

Cluster-Size Constrained Network Partitioning

Geographic-Semantic-Temporal Hypergraph Convolutional Network for Traffic Flow Prediction

Filter Pruning Using Hierarchical Group Sparse Regularization for Deep Convolutional Neural Networks

Compression Strategies and Space-Conscious Representations for Deep Neural Networks

Rethinking of Deep Models Parameters with Respect to Data Distribution

Quaternion Capsule Networks

Fixed Simplex Coordinates for Angular Margin Loss in CapsNet

Fractional Adaptation of Activation Functions in Neural Networks

Efficient Online Subclass Knowledge Distillation for Image Classification

Dimensionality Reduction for Data Visualization and Linear Classification, and the Trade-Off between Robustness and Classification Accuracy

Revisiting Graph Neural Networks: Graph Filtering Perspective