ICPR2020 Paper Browser

Paper download is intended for registered attendees only, and is subjected to the IEEE Copyright Policy. Any other use is strongly forbidden.

Sketch-Based Community Detection Via Representative Node Sampling

Mahlagha Sedghi, Andre Beckus, George Atia

Auto-TLDR; Sketch-based Clustering of Community Detection Using a Small Sketch

Abstract Slides Poster

This paper proposes a sketch-based approach to the community detection problem which clusters the full graph through the use of an informative and concise sketch. The reduced sketch is built through an effective sampling approach which selects few nodes that best represent the complete graph and operates on a pairwise node similarity measure based on the average commute time. After sampling, the proposed algorithm clusters the nodes in the sketch, and then infers the cluster membership of the remaining nodes in the full graph based on their aggregate similarity to nodes in the partitioned sketch. By sampling nodes with strong representation power, our approach can improve the success rates over full graph clustering. In challenging cases with large node degree variation, our approach not only maintains competitive accuracy with full graph clustering despite using a small sketch, but also outperforms existing sampling methods. The use of a small sketch allows considerable storage savings, and computational and timing improvements for further analysis such as clustering and visualization. We provide numerical results on synthetic data based on the homogeneous, heterogeneous and degree corrected versions of the stochastic block model, as well as experimental results on real-world data.

Similar papers

Scalable Direction-Search-Based Approach to Subspace Clustering

Yicong He, George Atia

Auto-TLDR; Fast Direction-Search-Based Subspace Clustering

Abstract Slides Similar

Subspace clustering finds a multi-subspace representation that best fits a high-dimensional dataset. The computational and storage complexities of existing algorithms limit their usefulness for large scale data. In this paper, we develop a novel scalable approach to subspace clustering termed Fast Direction-Search-Based Subspace Clustering (Fast DiSC). In sharp contrast to existing scalable solutions which are mostly based on the self-expressiveness property of the data, Fast DiSC rests upon a new representation obtained from projections on computed data-dependent directions. These directions are derived from a convex formulation for optimal direction search to gauge hidden similarity relations. The computational complexity is significantly reduced by performing direction search in partitions of sampled data, followed by a retrieval step to cluster out-of-sample data using projections on the computed directions. A theoretical analysis underscores the ability of the proposed formulation to construct local similarity relations for the different data points. Experiments on both synthetic and real data demonstrate that the proposed algorithm can often outperform the state-of-the-art clustering methods.

Cluster-Size Constrained Network Partitioning

Maksim Mironov, Konstantin Avrachenkov

Auto-TLDR; Unsupervised Graph Clustering with Stochastic Block Model

Sketch-Based Community Detection Via Representative Node Sampling

Similar papers

Scalable Direction-Search-Based Approach to Subspace Clustering

Cluster-Size Constrained Network Partitioning

Wasserstein k-Means with Sparse Simplex Projection

Assortative-Constrained Stochastic Block Models

Temporal Pattern Detection in Time-Varying Graphical Models

Low Rank Representation on Product Grassmann Manifolds for Multi-viewSubspace Clustering

Aggregating Dependent Gaussian Experts in Local Approximation

Region and Relations Based Multi Attention Network for Graph Classification

A General Model for Learning Node and Graph Representations Jointly

A Novel Adaptive Minority Oversampling Technique for Improved Classification in Data Imbalanced Scenarios

Active Sampling for Pairwise Comparisons via Approximate Message Passing and Information Gain Maximization

Encoding Brain Networks through Geodesic Clustering of Functional Connectivity for Multiple Sclerosis Classification

Classification and Feature Selection Using a Primal-Dual Method and Projections on Structured Constraints

Fast Subspace Clustering Based on the Kronecker Product

A Multi-Task Multi-View Based Multi-Objective Clustering Algorithm

Edge-Aware Graph Attention Network for Ratio of Edge-User Estimation in Mobile Networks

Subspace Clustering Via Joint Unsupervised Feature Selection

PIF: Anomaly detection via preference embedding

Equation Attention Relationship Network (EARN) : A Geometric Deep Metric Framework for Learning Similar Math Expression Embedding

Learning Connectivity with Graph Convolutional Networks

On Learning Random Forests for Random Forest Clustering

Unconstrained Vision Guided UAV Based Safe Helicopter Landing

3CS Algorithm for Efficient Gaussian Process Model Retrieval

Motion Segmentation with Pairwise Matches and Unknown Number of Motions

Graph Convolutional Neural Networks for Power Line Outage Identification

Label Self-Adaption Hashing for Image Retrieval

Sparse-Dense Subspace Clustering

Double Manifolds Regularized Non-Negative Matrix Factorization for Data Representation

Learning Embeddings for Image Clustering: An Empirical Study of Triplet Loss Approaches

Uniform and Non-Uniform Sampling Methods for Sub-Linear Time K-Means Clustering

On the Global Self-attention Mechanism for Graph Convolutional Networks

Graph Spectral Feature Learning for Mixed Data of Categorical and Numerical Type

Deep Convolutional Embedding for Digitized Painting Clustering

A Spectral Clustering on Grassmann Manifold Via Double Low Rank Constraint

GraphBGS: Background Subtraction Via Recovery of Graph Signals

Tensor Factorization of Brain Structural Graph for Unsupervised Classification in Multiple Sclerosis

AOAM: Automatic Optimization of Adjacency Matrix for Graph Convolutional Network

Kernel-based Graph Convolutional Networks

Siamese Graph Convolution Network for Face Sketch Recognition

Unveiling Groups of Related Tasks in Multi-Task Learning

A Heuristic-Based Decision Tree for Connected Components Labeling of 3D Volumes

Learning Sign-Constrained Support Vector Machines

Improved Time-Series Clustering with UMAP Dimension Reduction Method

Proximity Isolation Forests

Soft Label and Discriminant Embedding Estimation for Semi-Supervised Classification

Anime Sketch Colorization by Component-Based Matching Using Deep Appearance Features and Graph Representation

Constrained Spectral Clustering Network with Self-Training

Mean Decision Rules Method with Smart Sampling for Fast Large-Scale Binary SVM Classification