An experiment in automatic hierarchical document classification

作者:

Highlights:

摘要

A method of automatic document classification was developed as part of a larger research project in materials selection. Documents classed as QA by the Library of Congress classification system were clustered at six thresholds by keyword using the single link technique. The automatically generated clusters were then compared to the Library of Congress subclasses to which the documents had been assigned by human classifiers. Finally, a partial classified hierarchy was formed from the individual document clusters within a single threshold. Implications of the utility of grouping documents for on-line searching are discussed.

论文关键词:

论文评审过程:Received 25 November 1982, Available online 22 July 2002.

论文官网地址:https://doi.org/10.1016/0306-4573(83)90064-X