ICPR2020 Paper Browser

Paper download is intended for registered attendees only, and is subjected to the IEEE Copyright Policy. Any other use is strongly forbidden.

Optimal Strategies for Comparing Covariates to Solve Matching Problems

Muhammad Ahmed Shah, Raphael Olivier, Bhiksha Raj

Auto-TLDR; Covariate Matching for Pairwise Verification and Ranking

Abstract Slides Poster

Many machine learning tasks can be posed as matching problems in which we are given a ``probe'' entry that we expect matches some of the entries in our ``gallery''. The general solution to these problems is to retrieve matching entries based on statistical dependencies between the probe and the gallery data that are learned using complex models. Often, however, there are other common {\em covariates} to the probe and gallery data which might be easily inferred and may explain some of the statistical dependencies between the two. In this paper we present a probabilistic framework to derive optimal matching strategies based only on covariate features for three broad tasks, namely \textit{$N$-way classification}, \textit{pairwise verification} and \textit{ranking}. We use canonical metrics to determine the maximum performance that can be expected if only covariate features are used and determine the marginal gain of using complex models. We find that covariate matching achieves an EER within 10\% of a CNN in the verification task, and an MAP within 22\% of the a DNN based model in the ranking task.

Similar papers

Lookalike Disambiguation: Improving Face Identification Performance at Top Ranks

Thomas Swearingen, Arun Ross

Auto-TLDR; Lookalike Face Identification Using a Disambiguator for Lookalike Images

Abstract Poster Similar

A face identification system compares an unknown input probe image to a gallery of face images labeled with identities in order to determine the identity of the probe image. The result of identification is a ranked match list with the most similar gallery face image at the top (rank 1) and the least similar gallery face image at the bottom. In many systems, the top ranked gallery images may look very similar to the probe image as well as to each other and can sometimes result in the misidentification of the probe image. Such similar looking faces pertaining to different identities are referred to as lookalike faces. We hypothesize that a matcher specifically trained to disambiguate lookalike face images and combined with a regular face matcher may improve overall identification performance. This work proposes reranking the initial ranked match list using a disambiguator especially for lookalike face pairs. This work also evaluates schemes to select gallery images in the initial ranked match list that should be re-ranked. Experiments on the challenging TinyFace dataset shows that the proposed approach improves the closed-set identification accuracy of a state-of-the-art face matcher.

Toward Text-Independent Cross-Lingual Speaker Recognition Using English-Mandarin-Taiwanese Dataset

Yi-Chieh Wu, Wen-Hung Liao

Auto-TLDR; Cross-lingual Speech for Biometric Recognition

Abstract Poster Similar

Over 40% of the world's population is bilingual. Existing speaker identification/verification systems, however, assume the same language type for both enrollment and recognition stages. In this work, we investigate the feasibility of employing multilingual speech for biometric application. We establish a dataset containing audio recorded in English, Mandarin and Taiwanese. Three acoustic features, namely, i-vector, d-vector and x-vector have been evaluated for both speaker verification (SV) and identification (SI) tasks. Preliminary experimental results indicate that x-vector achieves the best overall performance. Additionally, model trained with hybrid data demonstrates highest accuracy associated with the cost of data collection efforts. In SI tasks, we obtained over 91\% cross-lingual accuracy all models using 3-second audio. In SV tasks, the EER among cross-lingual test is at most 6.52\%, which is observed on the model trained by English corpus. The outcome suggests the feasibility of adopting cross-lingual speech in building text-independent speaker recognition systems.

SoftmaxOut Transformation-Permutation Network for Facial Template Protection

Hakyoung Lee, Cheng Yaw Low, Andrew Teoh

Auto-TLDR; SoftmaxOut Transformation-Permutation Network for C cancellable Biometrics

Optimal Strategies for Comparing Covariates to Solve Matching Problems

Similar papers

Lookalike Disambiguation: Improving Face Identification Performance at Top Ranks

Toward Text-Independent Cross-Lingual Speaker Recognition Using English-Mandarin-Taiwanese Dataset

SoftmaxOut Transformation-Permutation Network for Facial Template Protection

Open-World Group Retrieval with Ambiguity Removal: A Benchmark

Webly Supervised Image-Text Embedding with Noisy Tag Refinement

Multi-annotator Probabilistic Active Learning

Cross-spectrum Face Recognition Using Subspace Projection Hashing

How Unique Is a Face: An Investigative Study

One-Shot Representational Learning for Joint Biometric and Device Authentication

3D Facial Matching by Spiral Convolutional Metric Learning and a Biometric Fusion-Net of Demographic Properties

Identifying Missing Children: Face Age-Progression Via Deep Feature Aging

Age Gap Reducer-GAN for Recognizing Age-Separated Faces

On Resource-Efficient Bayesian Network Classifiers and Deep Neural Networks

DenseRecognition of Spoken Languages

ResMax: Detecting Voice Spoofing Attacks with Residual Network and Max Feature Map

Rank-Based Ordinal Classification

Probability Guided Maxout

Budgeted Batch Mode Active Learning with Generalized Cost and Utility Functions

3D Pots Configuration System by Optimizing Over Geometric Constraints

Cam-Softmax for Discriminative Deep Feature Learning

Attribute-Based Quality Assessment for Demographic Estimation in Face Videos

A Novel Random Forest Dissimilarity Measure for Multi-View Learning

Relative Feature Importance

Hybrid Network for End-To-End Text-Independent Speaker Identification

Learning Natural Thresholds for Image Ranking

Spatial Bias in Vision-Based Voice Activity Detection

Sample-Dependent Distance for 1 : N Identification Via Discriminative Feature Selection

Detection of Calls from Smart Speaker Devices

A Multilinear Sampling Algorithm to Estimate Shapley Values

Aggregating Dependent Gaussian Experts in Local Approximation

Fingerprints, Forever Young?

MD-kNN: An Instance-Based Approach for Multi-Dimensional Classification

Multi-Level Deep Learning Vehicle Re-Identification Using Ranked-Based Loss Functions

Explainable Online Validation of Machine Learning Models for Practical Applications

An Intransitivity Model for Matchup and Pairwise Comparison

Learning Disentangled Representations for Identity Preserving Surveillance Face Camouflage

GPSRL: Learning Semi-Parametric Bayesian Survival Rule Lists from Heterogeneous Patient Data

Adaptive L2 Regularization in Person Re-Identification

Person Recognition with HGR Maximal Correlation on Multimodal Data

Quasibinary Classifier for Images with Zero and Multiple Labels

The eXPose Approach to Crosslier Detection

Attentive Part-Aware Networks for Partial Person Re-Identification

Active Sampling for Pairwise Comparisons via Approximate Message Passing and Information Gain Maximization

Deep Ordinal Regression with Label Diversity

A Flatter Loss for Bias Mitigation in Cross-Dataset Facial Age Estimation

Audio-Video Detection of the Active Speaker in Meetings

Multi-Attribute Learning with Highly Imbalanced Data

Learning Sign-Constrained Support Vector Machines