ICPR2020 Paper Browser

Paper download is intended for registered attendees only, and is subjected to the IEEE Copyright Policy. Any other use is strongly forbidden.

Learning Natural Thresholds for Image Ranking

Somayeh Keshavarz, Quang Nhat Tran, Richard Souvenir

Auto-TLDR; Image Representation Learning and Label Discretization for Natural Image Ranking

Abstract Slides Poster

For image ranking tasks with naturally continuous output, such as age and scenicness estimation, it is common to discretize the label range and apply methods from (ordered) classification analysis. In this paper, we propose a data-driven approach for simultaneous representation learning and label discretization. Compared to arbitrarily selecting thresholds, we seek to learn thresholds and image representations by minimizing a novel loss function in an end-to-end model. We demonstrate our combined approach on a variety of image ranking tasks and demonstrate that it outperforms task-specific methods. Additionally, our learned partitioning scheme can be transferred to improve methods that rely on discretization.

Similar papers

Deep Ordinal Regression with Label Diversity

Axel Berg, Magnus Oskarsson, Mark Oconnor

Auto-TLDR; Discrete Regression via Classification for Neural Network Learning

Abstract Slides Similar

Regression via classification (RvC) is a common method used for regression problems in deep learning, where the target variable belongs to a set of continuous values. By discretizing the target into a set of non-overlapping classes, it has been shown that training a classifier can improve neural network accuracy compared to using a standard regression approach. However, it is not clear how the set of discrete classes should be chosen and how it affects the overall solution. In this work, we propose that using several discrete data representations simultaneously can improve neural network learning compared to a single representation. Our approach is end-to-end differentiable and can be added as a simple extension to conventional learning methods, such as deep neural networks. We test our method on three challenging tasks and show that our method reduces the prediction error compared to a baseline RvC approach while maintaining a similar model complexity.

Rank-Based Ordinal Classification

Joan Serrat, Idoia Ruiz

Auto-TLDR; Ordinal Classification with Order

Learning Natural Thresholds for Image Ranking

Similar papers

Deep Ordinal Regression with Label Diversity

Rank-Based Ordinal Classification

A Flatter Loss for Bias Mitigation in Cross-Dataset Facial Age Estimation

Deep Gait Relative Attribute Using a Signed Quadratic Contrastive Loss

Learning Embeddings for Image Clustering: An Empirical Study of Triplet Loss Approaches

Hierarchical Routing Mixture of Experts

Ordinal Depth Classification Using Region-Based Self-Attention

PROPEL: Probabilistic Parametric Regression Loss for Convolutional Neural Networks

Real-Time Monocular Depth Estimation with Extremely Light-Weight Neural Network

Deep Convolutional Embedding for Digitized Painting Clustering

Multi-Attribute Learning with Highly Imbalanced Data

Privacy Attributes-Aware Message Passing Neural Network for Visual Privacy Attributes Classification

The Aleatoric Uncertainty Estimation Using a Separate Formulation with Virtual Residuals

3D Facial Matching by Spiral Convolutional Metric Learning and a Biometric Fusion-Net of Demographic Properties

A CNN-RNN Framework for Image Annotation from Visual Cues and Social Network Metadata

Multi-Modal Deep Clustering: Unsupervised Partitioning of Images

Attribute-Based Quality Assessment for Demographic Estimation in Face Videos

A Prototype-Based Generalized Zero-Shot Learning Framework for Hand Gesture Recognition

Making Every Label Count: Handling Semantic Imprecision by Integrating Domain Knowledge

PrivAttNet: Predicting Privacy Risks in Images Using Visual Attention

Adversarial Encoder-Multi-Task-Decoder for Multi-Stage Processes

Contextual Classification Using Self-Supervised Auxiliary Models for Deep Neural Networks

Confidence Calibration for Deep Renal Biopsy Immunofluorescence Image Classification

Temporally Coherent Embeddings for Self-Supervised Video Representation Learning

Graph-Based Interpolation of Feature Vectors for Accurate Few-Shot Classification

HP2IFS: Head Pose Estimation Exploiting Partitioned Iterated Function Systems

Local Clustering with Mean Teacher for Semi-Supervised Learning

Self-Supervised Learning for Astronomical Image Classification

Iterative Label Improvement: Robust Training by Confidence Based Filtering and Dataset Partitioning

The eXPose Approach to Crosslier Detection

Learning to Rank for Active Learning: A Listwise Approach

Age Gap Reducer-GAN for Recognizing Age-Separated Faces

Self-Supervised Learning of Dynamic Representations for Static Images

Not 3D Re-ID: Simple Single Stream 2D Convolution for Robust Video Re-Identification

Meta Soft Label Generation for Noisy Labels

Region and Relations Based Multi Attention Network for Graph Classification

Image Representation Learning by Transformation Regression

VSB^2-Net: Visual-Semantic Bi-Branch Network for Zero-Shot Hashing

Probability Guided Maxout

Learning Semantic Representations Via Joint 3D Face Reconstruction and Facial Attribute Estimation

P-DIFF: Learning Classifier with Noisy Labels Based on Probability Difference Distributions

Video Face Manipulation Detection through Ensemble of CNNs

Leveraging a Weakly Adversarial Paradigm for Joint Learning of Disparity and Confidence Estimation

A Bayesian Approach to Reinforcement Learning of Vision-Based Vehicular Control

A Delayed Elastic-Net Approach for Performing Adversarial Attacks

Uncertainty Guided Recognition of Tiny Craters on the Moon

Multi-Label Contrastive Focal Loss for Pedestrian Attribute Recognition

Light3DPose: Real-Time Multi-Person 3D Pose Estimation from Multiple Views