Generating hierarchical document indices from common denominators in large document collections

作者:

Highlights:

摘要

This paper describes an effective, simple and efficient algorithm for computer generation of hierarchical indices from Document-Term matrices by means of calculating common denominator vectors from the document vector set. This procedure produces an intuitive, user-friendly hierarchical index of a document collection not unlike that which would be expected had a manual indexer set about to create an index or outline of a collection. The resulting index, when presented with a graphical user interface, provides the user with a natural easily comprehended view of the document collection that permits general browsing and informal search activities with an access method that requires no keyboard entry or prior knowledge of the vocabulary.

论文关键词:

论文评审过程:Available online 8 December 1999.

论文官网地址:https://doi.org/10.1016/0306-4573(95)00032-C