A New Term Significance Weighting Approach

作者:Jin Zhang, Tien N. Nguyen

摘要

The authors present a new term significance measure that integrates term frequency retrieval characteristics, term frequency, document collection characteristics, and both the term depth and width distribution characteristics. A new concept, the term depth distribution, is introduced and its impact on the term significance is analyzed. The authors address the features of the new term significance measure from the angles of the impact of the variables (parameters) on it and the iso-significance contour analyses. An experimental study was conducted to compare the newly developed approach with two other popular approaches from the perspectives of both efficiency and effectiveness. The results show that the newly developed approach achieves satisfactory performance. Issues for further research on this topic are suggested.

论文关键词:term significance, automatic term weighting, term weighting evaluation

论文评审过程:

论文官网地址:https://doi.org/10.1007/s10844-005-0267-y