A novel cluster-based approach for keyphrase extraction from MOOC video lectures

作者:Abdulaziz Albahr, Dunren Che, Marwan Albahar

摘要

Massive open online courses (MOOCs) have emerged as a great resource for learners. Numerous challenges remain to be addressed in order to make MOOCs more useful and convenient for learners. One such challenge is how to automatically extract a set of keyphrases from MOOC video lectures that can help students quickly identify the right knowledge they want to learn and thus expedite their learning process. In this paper, we propose SemKeyphrase, an unsupervised cluster-based approach for keyphrase extraction from MOOC video lectures. SemKeyphrase incorporates a new semantic relatedness metric and a ranking algorithm, called PhraseRank, that involves two phases on ranking candidates. We conducted experiments on a real-world dataset of MOOC video lectures, and the results show that our proposed approach outperforms the state-of-the-art keyphrase extraction methods.

论文关键词:MOOCs, Automatic keyphrase extraction, Unsupervised learning, Cluster-based candidate ranking

论文评审过程:

论文官网地址:https://doi.org/10.1007/s10115-021-01568-2