NTCIR-2 as a Rosetta stone in laboratory experiments of IR systems

作者:

Highlights:

摘要

This paper presents a laboratory based evaluation study of cross-language information retrieval technologies, utilizing partially parallel test collections, NTCIR-2 (used together with NTCIR-1), where Japanese–English parallel document collections, parallel topic sets and their relevance judgments are available. These enable us to observe and compare monolingual retrieval processes in two languages as well as retrieval across languages. Our experiments focused on (1) the Rosetta stone question (whether a partially parallel collection helps in cross-language information access or not?) and (2) two aspects of retrieval difficulties namely “collection discrepancy” and “query discrepancy”. Japanese and English monolingual retrieval systems are combined by dictionary based query translation modules so that a symmetrical bilingual evaluation environment is implemented.

论文关键词:Cross-language information retrieval,Parallel test collection,Query translation approach

论文评审过程:Received 10 June 2004, Accepted 14 June 2004, Available online 20 August 2004.

论文官网地址:https://doi.org/10.1016/j.ipm.2004.06.010