Computational and natural language processing based studies of hadith literature: a survey

作者:Aqil M. Azmi, Abdulaziz O. Al-Qabbany, Amir Hussain

摘要

Hadith is one of the most celebrated resources of Classical Arabic text. The hadiths, or Prophetic traditions (tradition for short), are narrations originating from the sayings and conduct of Prophet Muhammad. For Muslims, hadiths are the second most important source of Islamic jurisprudence after the Holy Qur’an. Each hadith consists of two parts, isnad and matn. Matn represents the actual text of the hadith, while isnad unwinds the chain of the authorities which precede and introduce the matn, the succession of people through whose channel the hadith reaches the last transmitter. The hadith corpus is huge and runs into hundreds of volumes. It has an even larger supporting work, e.g., commentaries, biographic material etc. Recently, there has been a renewed interest of this important subject by non-specialists. There are many research studies which have been published regarding hadith, specifically applying computational and natural language processing (NLP) techniques to help address some of the outstanding issues, or derive new insight into this classic resource. This paper surveys all major works that have addressed the subject of hadith through various computational and NLP methods, grouping them under three categories: hadith content-based studies, narration-based studies, and overall studies. We also take an in-depth look into pioneering works with many details appearing for the first time. Finally, we outline future research directions in Arabic hadith literature, including novel application of emerging natural language concept based sentiment and emotion mining techniques.

论文关键词:Classical Arabic corpus, Hadith science, Hadith text mining, Hadith authenticity, Islamic knowledge, Humanities computing, Survey

论文评审过程:

论文官网地址:https://doi.org/10.1007/s10462-019-09692-w