Jaccard similarity leads to the Marczewski-Steinhaus topology for information retrieval

作者:

Highlights:

摘要

We show that if the similarity function of a retrieval system leads to a (pseudo-) metric, the retrieval, the similarity and the Everett-Cater metric topology coincide and are generally different from the discrete topology. This is the case if we represent documents by lists and use the Jaccard similarity measure. The corresponding metric is then the Marczewski-Steinhaus metric. We further study the special case of a one-element query space consisting of a single-item query.

论文关键词:

论文评审过程:Received 29 April 1997, Accepted 19 September 1997, Available online 11 June 1998.

论文官网地址:https://doi.org/10.1016/S0306-4573(97)00067-8