Deriving and verifying statistical distribution of a hyperlink-based Web page quality metric

作者:

Highlights:

摘要

The significance of modeling and measuring various attributes of the Web in part or as a whole is undeniable. Modeling information phenomena on the Web constitutes fundamental research towards an understanding that will contribute to the goal of increasing its utility. Although Web related metrics have become increasingly sophisticated, few employ models to explain their measurements. In this paper, we discuss issues related to metrics for Web page significance. These metrics are used for ranking the quality and relevance of Web pages in response to user needs. We focus on the problem of ascertaining the statistical distribution of some well-known hyperlink-based Web page quality metrics. Based on empirical distributions of Web page degrees, we derived analytically the probability distribution for the PageRank metric. We found out that it follows the familiar inverse polynomial law reported for Web page degrees. We verified the theoretical exercise with experimental results that suggest a highly concentrated distribution of the metric.

论文关键词:Web measurement,Quality metrics,PageRank,Statistical distribution

论文评审过程:Received 19 June 2002, Revised 20 November 2002, Accepted 22 January 2003, Available online 11 February 2003.

论文官网地址:https://doi.org/10.1016/S0169-023X(03)00034-X