High-performance FAQ retrieval using an automatic clustering method of query logs

作者：

Highlights：

•

摘要

To resolve some of lexical disagreement problems between queries and FAQs, we propose a reliable FAQ retrieval system using query log clustering. On indexing time, the proposed system clusters the logs of users’ queries into predefined FAQ categories. To increase the precision and the recall rate of clustering, the proposed system adopts a new similarity measure using a machine readable dictionary. On searching time, the proposed system calculates the similarities between users’ queries and each cluster in order to smooth FAQs. By virtue of the cluster-based retrieval technique, the proposed system could partially bridge lexical chasms between queries and FAQs. In addition, the proposed system outperforms the traditional information retrieval systems in FAQ retrieval.

论文关键词：Lexical disagreement problem,Query log clustering,FAQ retrieval,Cluster-based retrieval

论文评审过程：Received 10 November 2004, Available online 23 May 2005.

论文官网地址：https://doi.org/10.1016/j.ipm.2005.04.002