An information-pattern-based approach to novelty detection

作者:

Highlights:

摘要

In this paper, a new novelty detection approach based on the identification of sentence level information patterns is proposed. First, “novelty” is redefined based on the proposed information patterns, and several different types of information patterns are given corresponding to different types of users’ information needs. Second, a thorough analysis of sentence level information patterns is elaborated using data from the TREC novelty tracks, including sentence lengths, named entities (NEs), and sentence level opinion patterns. Finally, a unified information-pattern-based approach to novelty detection (ip-BAND) is presented for both specific NE topics and more general topics. Experiments on novelty detection on data from the TREC 2002, 2003 and 2004 novelty tracks show that the proposed approach significantly improves the performance of novelty detection in terms of precision at top ranks. Future research directions are suggested.

论文关键词:Novelty detection,Information retrieval,Question answering,Information patterns,Named entities

论文评审过程:Received 11 July 2007, Revised 19 September 2007, Accepted 25 September 2007, Available online 3 December 2007.

论文官网地址:https://doi.org/10.1016/j.ipm.2007.09.013