Fuzzy integral based information fusion for classification of highly confusable non-speech sounds

作者：

Highlights：

•

摘要

Acoustic event classification may help to describe acoustic scenes and contribute to improve the robustness of speech technologies. In this work, fusion of different information sources with the fuzzy integral (FI), and the associated fuzzy measure (FM), are applied to the problem of classifying a small set of highly confusable human non-speech sounds. As FI is a meaningful formalism for combining classifier outputs that can capture interactions among the various sources of information, it shows in our experiments a significantly better performance than that of any single classifier entering the FI fusion module. Actually, that FI decision-level fusion approach shows comparable results to the high-performing SVM feature-level fusion and thus it seems to be a good choice when feature-level fusion is not an option. We have also observed that the importance and the degree of interaction among the various feature types given by the FM can be used for feature selection, and gives a valuable insight into the problem.

论文关键词：Acoustic event classification,Audio features,Fuzzy integral and fuzzy measure,Feature-level and decision-level information fusion,Feature selection,Interaction of information sources,Choquet integral

论文评审过程：Received 24 November 2006, Revised 7 September 2007, Accepted 28 October 2007, Available online 9 November 2007.

论文官网地址：https://doi.org/10.1016/j.patcog.2007.10.026