ICPR2020 Paper Browser

Paper download is intended for registered attendees only and is subject to the IEEE Copyright Policy. Any other use is strictly forbidden.

Active Sampling for Pairwise Comparisons via Approximate Message Passing and Information Gain Maximization

Aliaksei Mikhailiuk, Clifford Wilmot, Maria Perez-Ortiz, Dingcheng Yue, Rafal Mantiuk

Auto-TLDR; ASAP: An Active Sampling Algorithm for Pairwise Comparison Data

Pairwise comparison data arise in many domains with subjective assessment experiments, for example in image and video quality assessment. In these experiments, observers are asked to express a preference between two conditions. However, many pairwise comparison protocols require a large number of comparisons to infer accurate scores, which may be infeasible when each comparison is time-consuming (e.g. videos) or expensive (e.g. medical imaging). This motivates the use of an active sampling algorithm that chooses only the most informative pairs for comparison. In this paper we propose ASAP, an active sampling algorithm based on approximate message passing and expected information gain maximization. Unlike most existing methods, which rely on partial updates of the posterior distribution, we are able to perform full updates and therefore substantially improve the accuracy of the inferred scores. The algorithm relies on three techniques for reducing computational cost: inference based on approximate message passing, selective evaluations of the information gain, and selecting pairs in a batch that forms a minimum spanning tree of the inverse of information gain. We demonstrate, with real and synthetic data, that ASAP offers the highest accuracy of inferred scores compared to existing methods. We also provide an open-source GPU implementation of ASAP for large-scale experiments.
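The batch-selection step named in the abstract (pairs forming a minimum spanning tree of the inverse of information gain) can be illustrated with a minimal sketch. This is not the authors' implementation: the function name `mst_batch`, the use of Prim's algorithm, and the toy `gain` matrix are all assumptions made for illustration, and the gains would in practice come from the expected-information-gain computation, which is not shown here.

```python
import math


def mst_batch(gain):
    """Pick n-1 pairs forming a minimum spanning tree of 1/gain.

    `gain` is a hypothetical symmetric n x n matrix of expected
    information gains (illustrative input, not taken from the paper).
    Uses Prim's algorithm on the complete graph over the n conditions.
    """
    n = len(gain)
    if n < 2:
        return []

    def weight(i, j):
        # Edge weight is the inverse of the gain, so informative pairs are cheap.
        g = gain[i][j]
        return 1.0 / g if g > 0 else math.inf

    in_tree = [False] * n
    in_tree[0] = True                                # start the tree at condition 0
    best_cost = [weight(0, j) for j in range(n)]     # cheapest edge into the tree
    best_from = [0] * n                              # tree endpoint of that edge
    pairs = []
    for _ in range(n - 1):
        # Attach the condition outside the tree with the cheapest connecting edge.
        j = min((k for k in range(n) if not in_tree[k]),
                key=lambda k: best_cost[k])
        i = best_from[j]
        pairs.append((min(i, j), max(i, j)))
        in_tree[j] = True
        # Relax edges from the newly added condition to those still outside.
        for k in range(n):
            if not in_tree[k]:
                w = weight(j, k)
                if w < best_cost[k]:
                    best_cost[k] = w
                    best_from[k] = j
    return pairs


# Toy symmetric gain matrix for four conditions (made-up numbers).
gain = [
    [0.0, 0.9, 0.1, 0.2],
    [0.9, 0.0, 0.8, 0.3],
    [0.1, 0.8, 0.0, 0.7],
    [0.2, 0.3, 0.7, 0.0],
]
batch = mst_batch(gain)
print(batch)  # [(0, 1), (1, 2), (2, 3)]
```

A spanning tree touches every condition with only n-1 comparisons, so each condition appears in at least one pair of the batch while the high-gain pairs are preferred.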

Similar papers

Factor Screening Using Bayesian Active Learning and Gaussian Process Meta-Modelling

Cheng Li, Santu Rana, Andrew William Gill, Dang Nguyen, Sunil Kumar Gupta, Svetha Venkatesh

Auto-TLDR; Data-Efficient Bayesian Active Learning for Factor Screening in Combat Simulations

In this paper we propose a data-efficient Bayesian active learning framework for factor screening, which is important when dealing with systems that are expensive to evaluate, such as combat simulations. We use Gaussian Process meta-modelling with the Automatic Relevance Determination covariance kernel, which measures the importance of each factor by the inverse of its associated length-scale in the kernel. This importance measures the degree of non-linearity in the simulation response with respect to the corresponding factor. We initially place a prior over the length-scale values, then use the estimated posterior to select the next datum to simulate, choosing the one that maximises the mutual entropy between the length-scales and the unknown simulation response. Our goal-driven Bayesian active learning strategy ensures that we are data-efficient in discovering the correct values of the length-scales compared to either a random-sampling or uncertainty-sampling based approach. We apply our method to an expensive combat simulation and demonstrate the superiority of our approach.
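The link between length-scales and factor importance described in this abstract can be illustrated with a minimal sketch. The squared-exponential form of the ARD kernel, the function names `ard_kernel` and `factor_importance`, and the length-scale values are assumptions chosen for illustration; the abstract does not specify them, and the active-learning step is not shown.

```python
import math


def ard_kernel(x, y, length_scales, variance=1.0):
    """Squared-exponential kernel with one length-scale per factor (ARD)."""
    sq_dist = sum(((a - b) / l) ** 2 for a, b, l in zip(x, y, length_scales))
    return variance * math.exp(-0.5 * sq_dist)


def factor_importance(length_scales):
    """Importance of each factor as the inverse of its length-scale."""
    return [1.0 / l for l in length_scales]


# Hypothetical length-scales for three factors (made-up numbers).
length_scales = [0.5, 10.0, 2.0]
importance = factor_importance(length_scales)
# Rank factors from most to least important.
ranking = sorted(range(len(importance)), key=lambda i: -importance[i])
print(importance)  # [2.0, 0.1, 0.5]
print(ranking)     # [0, 2, 1]
```

A short length-scale means the kernel value falls off quickly as that factor changes, so the modelled response varies strongly along it; a long length-scale makes the factor nearly irrelevant to the covariance.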

Sketch-Based Community Detection Via Representative Node Sampling

Mahlagha Sedghi, Andre Beckus, George Atia

Auto-TLDR; Sketch-based Clustering of Community Detection Using a Small Sketch

Probabilistic Latent Factor Model for Collaborative Filtering with Bayesian Inference

Multi-annotator Probabilistic Active Learning

Aggregating Dependent Gaussian Experts in Local Approximation

Bayesian Active Learning for Maximal Information Gain on Model Parameters

Budgeted Batch Mode Active Learning with Generalized Cost and Utility Functions

3CS Algorithm for Efficient Gaussian Process Model Retrieval

Adaptive Sampling of Pareto Frontiers with Binary Constraints Using Regression and Classification

DR2S: Deep Regression with Region Selection for Camera Quality Evaluation

Learning Parameter Distributions to Detect Concept Drift in Data Streams

Rank-Based Ordinal Classification

Temporal Pattern Detection in Time-Varying Graphical Models

The eXPose Approach to Crosslier Detection

Automatically Mining Relevant Variable Interactions Via Sparse Bayesian Learning

An Intransitivity Model for Matchup and Pairwise Comparison

Assortative-Constrained Stochastic Block Models

Quantifying Model Uncertainty in Inverse Problems Via Bayesian Deep Gradient Descent

Learning to Rank for Active Learning: A Listwise Approach

GPSRL: Learning Semi-Parametric Bayesian Survival Rule Lists from Heterogeneous Patient Data

Graph Discovery for Visual Test Generation

A Multilinear Sampling Algorithm to Estimate Shapley Values

Generic Merging of Structure from Motion Maps with a Low Memory Footprint

Watermelon: A Novel Feature Selection Method Based on Bayes Error Rate Estimation and a New Interpretation of Feature Relevance and Redundancy

Equation Attention Relationship Network (EARN) : A Geometric Deep Metric Framework for Learning Similar Math Expression Embedding

A Heuristic-Based Decision Tree for Connected Components Labeling of 3D Volumes

MINT: Deep Network Compression Via Mutual Information-Based Neuron Trimming

Leveraging Sequential Pattern Information for Active Learning from Sequential Data

Hierarchical Routing Mixture of Experts

On Learning Random Forests for Random Forest Clustering

Naturally Constrained Online Expectation Maximization

A Novel Adaptive Minority Oversampling Technique for Improved Classification in Data Imbalanced Scenarios

Algorithm Recommendation for Data Streams

Graph-Based Image Decoding for Multiplexed in Situ RNA Detection

Quantifying the Use of Domain Randomization

Minority Class Oriented Active Learning for Imbalanced Datasets

Can Reinforcement Learning Lead to Healthy Life?: Simulation Study Based on User Activity Logs

Low-Cost Lipschitz-Independent Adaptive Importance Sampling of Stochastic Gradients

Unveiling Groups of Related Tasks in Multi-Task Learning

Explainable Online Validation of Machine Learning Models for Practical Applications

Uniform and Non-Uniform Sampling Methods for Sub-Linear Time K-Means Clustering

Sparse-Dense Subspace Clustering

Categorizing the Feature Space for Two-Class Imbalance Learning

Adaptive Matching of Kernel Means

Classification and Feature Selection Using a Primal-Dual Method and Projections on Structured Constraints

Scalable Direction-Search-Based Approach to Subspace Clustering

Motion Segmentation with Pairwise Matches and Unknown Number of Motions

Leveraging Quadratic Spherical Mutual Information Hashing for Fast Image Retrieval