Concept integration of document databases using different indexing languages

Abstract

An integrated information retrieval system generally contains multiple databases that are inconsistent in their content and indexing. This paper proposes a rough set-based transfer (RST) model for integrating the concepts of document databases that use different indexing languages, so that users can search across the databases using any one of those indexing languages. The RST model aims to create meaningful transfer relations between the terms of two indexing languages, provided that a number of documents are indexed with both languages in parallel. In our experiment, the indexing concepts of two databases, one using the Thesaurus of Social Science (IZ) and the other the Schlagwortnormdatei (SWD), are integrated by means of the RST model. Finally, this paper compares the results achieved with a cross-concordance method, a conditional-probability-based method, and the RST model.
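To make the idea of transfer relations derived from parallel-indexed documents concrete, the following is a minimal sketch and not the paper's RST model. It assumes a toy corpus in which each document carries terms from two indexing languages (labelled "a" and "b" here), estimates a conditional probability P(target term | source term) from co-indexed documents, and computes lower and upper approximations in the style of rough set theory by treating each target term's document set as a granule. The corpus, the term names, and all function names are hypothetical.

```python
from collections import defaultdict

# Hypothetical toy corpus: every document is indexed in parallel with
# language "a" and language "b". Terms are illustrative only.
corpus = {
    "d1": {"a": ["unemployment"], "b": ["Arbeitslosigkeit"]},
    "d2": {"a": ["unemployment", "labor market"],
           "b": ["Arbeitslosigkeit", "Arbeitsmarkt"]},
    "d3": {"a": ["labor market"], "b": ["Arbeitsmarkt"]},
    "d4": {"a": ["unemployment"], "b": ["Arbeitslosigkeit", "Sozialpolitik"]},
}


def doc_sets(corpus, side):
    """Map each term of one indexing language to the documents indexed with it."""
    sets = defaultdict(set)
    for doc_id, terms in corpus.items():
        for term in terms[side]:
            sets[term].add(doc_id)
    return dict(sets)


def conditional_transfer(corpus, source_term):
    """Estimate P(target term | source term) from co-indexed documents."""
    source_docs = doc_sets(corpus, "a").get(source_term, set())
    if not source_docs:
        return {}
    return {
        term: len(source_docs & docs) / len(source_docs)
        for term, docs in doc_sets(corpus, "b").items()
        if source_docs & docs
    }


def rough_approximations(corpus, source_term):
    """Return (lower, upper) lists of target terms for a source term.

    lower: target terms whose documents all carry the source term.
    upper: target terms sharing at least one document with the source term.
    """
    source_docs = doc_sets(corpus, "a").get(source_term, set())
    targets = doc_sets(corpus, "b")
    lower = sorted(t for t, docs in targets.items() if docs <= source_docs)
    upper = sorted(t for t, docs in targets.items() if docs & source_docs)
    return lower, upper


weights = conditional_transfer(corpus, "unemployment")
lower, upper = rough_approximations(corpus, "unemployment")
```

In this sketch the lower approximation lists target terms that can be transferred with certainty on the toy data, while the upper approximation lists every target term that is at least possibly related; the conditional weights rank the candidates in between.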

Keywords: Rough set theory, Concept integration, Document database, Compatibility, Indexing language

Review history: Received 17 May 2004, Accepted 27 September 2004, Available online 30 November 2004.

Paper URL: https://doi.org/10.1016/j.ipm.2004.09.003