Partial AUC Metric Learning Based Speaker Verification Back-End

Zhongxin Bai, Xiao-Lei Zhang, Jingdong Chen

Equal error rate (EER) is a widely used evaluation metric for speaker verification; it reflects the performance of a verification system at a single decision threshold. However, a threshold tuned for one application scenario is generally not optimal when the system is used in another. This motivates optimizing performance over a range of decision thresholds. To this end, we propose to optimize the parameters of a squared Mahalanobis distance metric so as to directly maximize the partial area under the ROC curve (pAUC) over a range of false positive rates of interest. Experimental results on the NIST SRE 2016 dataset and the core tasks of the Speakers in the Wild (SITW) dataset illustrate the effectiveness of the proposed algorithm.
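To make the two quantities named in the abstract concrete, the following is a minimal sketch, not the authors' implementation. It evaluates a squared Mahalanobis distance for a fixed matrix M and computes a simplified empirical pAUC over a false positive rate range [alpha, beta]. The function names, the parameters alpha and beta, and the use of the negative distance as a similarity score are all assumptions made for illustration; the sketch does not learn M and ignores fractional boundary terms at the edges of the range.

```python
import math


def squared_mahalanobis(x, y, M):
    """Return (x - y)^T M (x - y), with M given as a list of rows.

    Hypothetical helper: M is assumed to be a fixed, symmetric positive
    semidefinite matrix. Learning M is outside the scope of this sketch.
    """
    d = [a - b for a, b in zip(x, y)]
    n = len(d)
    return sum(d[i] * M[i][j] * d[j] for i in range(n) for j in range(n))


def partial_auc(target_scores, nontarget_scores, alpha=0.0, beta=0.1):
    """Simplified, normalized empirical pAUC over the FPR range [alpha, beta].

    Assumes a higher score means "more likely the same speaker", e.g. the
    negative squared Mahalanobis distance between two embeddings.
    """
    # Rank non-target trials from highest to lowest score. The ones that
    # fall inside the FPR range are the hardest false-alarm candidates.
    ranked = sorted(nontarget_scores, reverse=True)
    n = len(ranked)
    lo = math.ceil(alpha * n)
    hi = math.floor(beta * n)
    selected = ranked[lo:hi]
    if not target_scores or not selected:
        raise ValueError("no trials fall inside the requested FPR range")

    # Count target/non-target pairs that are ranked correctly; ties count half.
    wins = 0.0
    for p in target_scores:
        for q in selected:
            if p > q:
                wins += 1.0
            elif p == q:
                wins += 0.5
    return wins / (len(target_scores) * len(selected))


if __name__ == "__main__":
    M = [[2.0, 0.0], [0.0, 1.0]]  # illustrative metric, not a learned one
    score = -squared_mahalanobis([1.0, 2.0], [0.0, 0.0], M)
    print("similarity score:", score)
    print("pAUC:", partial_auc([-1.0, -2.0, -0.5], [-0.8, -3.0, -4.0, -5.0], 0.0, 0.5))
```

A value of 1.0 means every target trial outscores every non-target trial inside the selected FPR range, so restricting the range focuses the measure on the operating region that matters for a given application.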

Odyssey 2020

The Speaker and Language Recognition Workshop
