LDA-based data augmentation algorithm for acoustic scene classification

作者：

Highlights：

•

摘要

Deep neural network needs large amount of data for training, to obtain more data, many simple data augmentation algorithms have been proposed. In this paper, we propose a LDA-based data augmentation algorithm to extend the training set. The proposed LDA-based data augmentation algorithm uses the topic model LDA to detect the key audio words in the recordings, and further to detect the key audio events and non-key audio events for each recording; with the detected key-audio-event segments, for each acoustic scene class, the probability distribution of key-audio-event’s occurrence numbers, the probability distribution of key-audio-event’s locations under each occurrence number and the probability distribution of key-audio-event’s durations under each occurrence number is counted, and then the new recordings are generated according to these probability distributions. Experiments are done on the public TUT acoustic scenes 2016 dataset, and the experimental results show that compared with the other simple data augmentation algorithms, the proposed LDA-based data augmentation algorithm is more stable and effective, it can get better generalization ability for different kinds of neural network on different datasets.

论文关键词：Acoustic scene classification,Topic model,LDA,Key audio event,Non-key audio event

论文评审过程：Received 1 September 2019, Revised 29 January 2020, Accepted 31 January 2020, Available online 3 February 2020, Version of Record 4 April 2020.

论文官网地址：https://doi.org/10.1016/j.knosys.2020.105600