Time corpora: Epochs, opinions and changes

作者:

Highlights:

摘要

By using large corpora of chronologically ordered language, it is possible to explore diachronic phenomena, identifying previously unknown correlations between language usage and time periods, or epochs. We focused on a statistical approach to epoch delimitation and introduced the task of epoch characterization. We investigated the significant changes in the distribution of terms in the Google N-gram corpus and their relationships with emotion words.

论文关键词:Natural language processing,Large corpora analysis,Sentiment and social analysis,Diachronic analysis,Epoch detection

论文评审过程:Received 30 October 2013, Revised 6 March 2014, Accepted 9 April 2014, Available online 24 April 2014.

论文官网地址:https://doi.org/10.1016/j.knosys.2014.04.029