Improving the characterization of the alternative hypothesis via minimum verification error training with applications to speaker verification

作者:

Highlights:

摘要

Speaker verification is usually formulated as a statistical hypothesis testing problem and solved by a log-likelihood ratio (LLR) test. A speaker verification system's performance is highly dependent on modeling the target speaker's voice (the null hypothesis) and characterizing non-target speakers’ voices (the alternative hypothesis). However, since the alternative hypothesis involves unknown impostors, it is usually difficult to characterize a priori. In this paper, we propose a framework to better characterize the alternative hypothesis with the goal of optimally distinguishing the target speaker from impostors. The proposed framework is built on a weighted arithmetic combination (WAC) or a weighted geometric combination (WGC) of useful information extracted from a set of pre-trained background models. The parameters associated with WAC or WGC are then optimized using two discriminative training methods, namely, the minimum verification error (MVE) training method and the proposed evolutionary MVE (EMVE) training method, such that both the false acceptance probability and the false rejection probability are minimized. Our experiment results show that the proposed framework outperforms conventional LLR-based approaches.

论文关键词:Genetic algorithm,Hypothesis testing,Log-likelihood ratio,Minimum verification error training,Speaker verification

论文评审过程:Received 18 August 2007, Revised 7 August 2008, Accepted 19 October 2008, Available online 31 October 2008.

论文官网地址:https://doi.org/10.1016/j.patcog.2008.10.015