ICPR2020 Paper Browser

Paper download is intended for registered attendees only, and is subjected to the IEEE Copyright Policy. Any other use is strongly forbidden.

Deep Convolutional Embedding for Digitized Painting Clustering

Giovanna Castellano, Gennaro Vessio

Auto-TLDR; A Deep Convolutional Embedding Model for Clustering Artworks

Abstract Slides Poster

Clustering artworks is difficult because of several reasons. On one hand, recognizing meaningful patterns in accordance with domain knowledge and visual perception is extremely hard. On the other hand, the application of traditional clustering and feature reduction techniques to the highly dimensional pixel space can be ineffective. To address these issues, we propose to use a deep convolutional embedding model for digitized painting clustering, in which the task of mapping the input raw data to an abstract, latent space is jointly optimized with the task of finding a set of cluster centroids in this latent feature space. Quantitative and qualitative experimental results show the effectiveness of the proposed method. The model is also able to outperform other state-of-the-art deep clustering approaches to the same problem. The proposed method may be beneficial to several art-related tasks, particularly visual link retrieval and historical knowledge discovery in painting datasets.

Similar papers

N2D: (Not Too) Deep Clustering Via Clustering the Local Manifold of an Autoencoded Embedding

Ryan Mcconville, Raul Santos-Rodriguez, Robert Piechocki, Ian Craddock

Auto-TLDR; Local Manifold Learning for Deep Clustering on Autoencoded Embeddings

Abstract Slides Similar

Deep clustering has increasingly been demonstrating superiority over conventional shallow clustering algorithms. Deep clustering algorithms usually combine representation learning with deep neural networks to achieve this performance, typically optimizing a clustering and non-clustering loss. In such cases, an autoencoder is typically connected with a clustering network, and the final clustering is jointly learned by both the autoencoder and clustering network. Instead, we propose to learn an autoencoded embedding and then search this further for the underlying manifold. For simplicity, we then cluster this with a shallow clustering algorithm, rather than a deeper network. We study a number of local and global manifold learning methods on both the raw data and autoencoded embedding, concluding that UMAP in our framework is able to find the best clusterable manifold of the embedding. This suggests that local manifold learning on an autoencoded embedding is effective for discovering higher quality clusters. We quantitatively show across a range of image and time-series datasets that our method has competitive performance against the latest deep clustering algorithms, including out-performing current state-of-the-art on several. We postulate that these results show a promising research direction for deep clustering. The code can be found at https://github.com/rymc/n2d.

JECL: Joint Embedding and Cluster Learning for Image-Text Pairs

Sean Yang, Kuan-Hao Huang, Bill Howe

Auto-TLDR; JECL: Clustering Image-Caption Pairs with Parallel Encoders and Regularized Clusters

Abstract Poster Similar

We propose JECL, a method for clustering image-caption pairs by training parallel encoders with regularized clustering and alignment objectives, simultaneously learning both representations and cluster assignments. These image-caption pairs arise frequently in high-value applications where structured training data is expensive to produce, but free-text descriptions are common. JECL trains by minimizing the Kullback-Leibler divergence between the distribution of the images and text to that of a combined joint target distribution and optimizing the Jensen-Shannon divergence between the soft cluster assignments of the images and text. Regularizers are also applied to JECL to prevent trivial solutions. Experiments show that JECL outperforms both single-view and multi-view methods on large benchmark image-caption datasets, and is remarkably robust to missing captions and varying data sizes.

Constrained Spectral Clustering Network with Self-Training

Xinyue Liu, Shichong Yang, Linlin Zong

Auto-TLDR; Constrained Spectral Clustering Network: A Constrained Deep spectral clustering network

Deep Convolutional Embedding for Digitized Painting Clustering

Similar papers

N2D: (Not Too) Deep Clustering Via Clustering the Local Manifold of an Autoencoded Embedding

JECL: Joint Embedding and Cluster Learning for Image-Text Pairs

Constrained Spectral Clustering Network with Self-Training

Learning Embeddings for Image Clustering: An Empirical Study of Triplet Loss Approaches

Multi-Modal Deep Clustering: Unsupervised Partitioning of Images

Variational Deep Embedding Clustering by Augmented Mutual Information Maximization

Feature-Aware Unsupervised Learning with Joint Variational Attention and Automatic Clustering

One Step Clustering Based on A-Contrario Framework for Detection of Alterations in Historical Violins

Interactive Style Space of Deep Features and Style Innovation

Ancient Document Layout Analysis: Autoencoders Meet Sparse Coding

Trainable Spectrally Initializable Matrix Transformations in Convolutional Neural Networks

Contextual Classification Using Self-Supervised Auxiliary Models for Deep Neural Networks

Supporting Skin Lesion Diagnosis with Content-Based Image Retrieval

GuCNet: A Guided Clustering-Based Network for Improved Classification

The Color Out of Space: Learning Self-Supervised Representations for Earth Observation Imagery

Supervised Domain Adaptation Using Graph Embedding

Single-Modal Incremental Terrain Clustering from Self-Supervised Audio-Visual Feature Learning

Enhancing Deep Semantic Segmentation of RGB-D Data with Entangled Forests

Comparison of Deep Learning and Hand Crafted Features for Mining Simulation Data

Image Representation Learning by Transformation Regression

Confidence Calibration for Deep Renal Biopsy Immunofluorescence Image Classification

Weight Estimation from an RGB-D Camera in Top-View Configuration

A CNN-RNN Framework for Image Annotation from Visual Cues and Social Network Metadata

Semi-Supervised Class Incremental Learning

Video Anomaly Detection by Estimating Likelihood of Representations

Learning Natural Thresholds for Image Ranking

Generative Deep-Neural-Network Mixture Modeling with Semi-Supervised MinMax+EM Learning

Local Clustering with Mean Teacher for Semi-Supervised Learning

Aggregating Dependent Gaussian Experts in Local Approximation

GazeMAE: General Representations of Eye Movements Using a Micro-Macro Autoencoder

A Close Look at Deep Learning with Small Data

Deep Superpixel Cut for Unsupervised Image Segmentation

A Comparison of Neural Network Approaches for Melanoma Classification

Video Face Manipulation Detection through Ensemble of CNNs

Extracting Action Hierarchies from Action Labels and their Use in Deep Action Recognition

Variational Capsule Encoder

Multimodal Side-Tuning for Document Classification

A Systematic Investigation on Deep Architectures for Automatic Skin Lesions Classification

An Invariance-Guided Stability Criterion for Time Series Clustering Validation

A Quantitative Evaluation Framework of Video De-Identification Methods

Deep Gait Relative Attribute Using a Signed Quadratic Contrastive Loss

Neuron-Based Network Pruning Based on Majority Voting

Hybrid Decomposition Convolution Neural Network and Vocabulary Forest for Image Retrieval

SIMCO: SIMilarity-Based Object COunting

Wasserstein k-Means with Sparse Simplex Projection

Self-Paced Bottom-Up Clustering Network with Side Information for Person Re-Identification

Machine-Learned Regularization and Polygonization of Building Segmentation Masks

Self-Supervised Learning for Astronomical Image Classification