Unsupervised speaker segmentation with residual phase and MFCC features

Abstract

This paper proposes an unsupervised method that improves automatic speaker segmentation by combining evidence from the residual phase (RP) and mel frequency cepstral coefficients (MFCC). The method exploits the complementary nature of the speaker-specific information in the residual phase relative to the information in conventional MFCC features. The segmentation algorithm itself is unsupervised and is based on a support vector machine (SVM). Experiments show that combining residual phase and MFCC features identifies transitions between speakers more robustly.
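The abstract does not define the residual phase. As a rough illustration only, the sketch below follows a definition commonly used in the speaker recognition literature: the cosine of the phase of the analytic signal of the linear prediction (LP) residual. The function names, the LP order, and the naive DFT are assumptions made for this example, not details taken from the paper, and the paper's own feature extraction and SVM-based segmentation may differ.

```python
import cmath
import math


def autocorr(x, max_lag):
    """Autocorrelation r[0..max_lag] of one frame."""
    n = len(x)
    return [sum(x[i] * x[i + k] for i in range(n - k)) for k in range(max_lag + 1)]


def levinson_durbin(r, order):
    """Predictor coefficients a[1..order] such that x[n] ~ sum_k a[k] * x[n-k]."""
    a = [0.0] * (order + 1)
    err = r[0]
    for i in range(1, order + 1):
        if err <= 0.0:  # degenerate frame (e.g. silence): stop early
            break
        k = (r[i] - sum(a[j] * r[i - j] for j in range(1, i))) / err
        new_a = a[:]
        new_a[i] = k
        for j in range(1, i):
            new_a[j] = a[j] - k * a[i - j]
        a = new_a
        err *= 1.0 - k * k
    return a[1:]


def lp_residual(x, order):
    """Prediction error e[n] = x[n] - sum_k a[k] * x[n-k], with x[n<0] = 0."""
    a = levinson_durbin(autocorr(x, order), order)
    return [
        x[n] - sum(a[k - 1] * x[n - k] for k in range(1, order + 1) if n - k >= 0)
        for n in range(len(x))
    ]


def analytic_signal(x):
    """Analytic signal via a naive O(N^2) DFT; fine for short frames."""
    n = len(x)
    spec = [
        sum(x[t] * cmath.exp(-2j * math.pi * f * t / n) for t in range(n))
        for f in range(n)
    ]
    for f in range(n):
        if f == 0 or (n % 2 == 0 and f == n // 2):
            gain = 1.0  # keep DC and Nyquist bins unchanged
        elif f < (n + 1) // 2:
            gain = 2.0  # double positive frequencies
        else:
            gain = 0.0  # discard negative frequencies
        spec[f] *= gain
    return [
        sum(spec[f] * cmath.exp(2j * math.pi * f * t / n) for f in range(n)) / n
        for t in range(n)
    ]


def residual_phase(x, order=10):
    """cos(theta[n]) = e[n] / |analytic(e)[n]| for the LP residual e."""
    e = lp_residual(x, order)
    z = analytic_signal(e)
    return [ev / abs(zv) if abs(zv) > 1e-12 else 0.0 for ev, zv in zip(e, z)]
```

Calling `residual_phase(frame, order=10)` on a short frame of samples returns one value in [-1, 1] per sample. Dividing by the Hilbert envelope removes the amplitude of the residual, so what remains is the excitation phase, which is a different kind of information from the spectral envelope that MFCC features summarize.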

Keywords: Mel frequency cepstral coefficients, Residual phase, Speaker segmentation, Support vector machine

Publication history: Available online 23 February 2009.

Paper URL: https://doi.org/10.1016/j.eswa.2009.02.040