Natural language processing for Nepali text: a review
作者:Tej Bahadur Shahi, Chiranjibi Sitaula
摘要
Because of the proliferation of Nepali textual documents online, researchers in Nepal and overseas have started working towards its automated analysis for quick inferences, using different machine learning (ML) algorithms, ranging from traditional ML-based algorithms to recent deep learning (DL)-based algorithms. However, researchers are still unaware about the recent trends of NLP research direction in the Nepali language. In this paper, we survey different natural language processing (NLP) research works with associated resources in Nepali language. Furthermore, we organize the NLP approaches, techniques, and application tasks used in the Nepali language processing using the comprehensive taxonomy for each of them. Finally, we discuss and analyze based on such assimilated information for further improvement in NLP research works in the Nepali language. Our thorough survey bestows the detailed backgrounds and motivations to researchers, which not only opens up new potential avenues but also ushers towards further progress of NLP research works in the Nepali language.
论文关键词:Devanagari, Machine learning, Nepali language, Nepali linguistics, Natural language processing, Classification, Sentiment analysis
论文评审过程:
论文官网地址:https://doi.org/10.1007/s10462-021-10093-1