A language modeling framework for expert finding

作者:

Highlights:

摘要

Statistical language models have been successfully applied to many information retrieval tasks, including expert finding: the process of identifying experts given a particular topic. In this paper, we introduce and detail language modeling approaches that integrate the representation, association and search of experts using various textual data sources into a generative probabilistic framework. This provides a simple, intuitive, and extensible theoretical framework to underpin research into expertise search. To demonstrate the flexibility of the framework, two search strategies to find experts are modeled that incorporate different types of evidence extracted from the data, before being extended to also incorporate co-occurrence information. The models proposed are evaluated in the context of enterprise search systems within an intranet environment, where it is reasonable to assume that the list of experts is known, and that data to be mined is publicly accessible. Our experiments show that excellent performance can be achieved by using these models in such environments, and that this theoretical and empirical work paves the way for future principled extensions.

论文关键词:Expert finding,Language modeling,Expertise search,Intranet search

论文评审过程:Received 30 December 2007, Revised 11 June 2008, Accepted 13 June 2008, Available online 20 September 2008.

论文官网地址:https://doi.org/10.1016/j.ipm.2008.06.003