A method for managing access to web pages: Filtering by Statistical Classification (FSC) applied to text

Abstract

Various entities (e.g., parents, employers) that provide users (e.g., children, employees) with access to web content wish to limit the content those users can reach. Available filtering methods are crude: they too often block “acceptable” content while failing to block “unacceptable” content. This paper presents a general and flexible classification method, based on statistical techniques applied to text, that we call Filtering by Statistical Classification (FSC). From each entity's stated opinions about which content in a training data set is or is not acceptable, FSC constructs a customized model of that entity's preferences. FSC then uses this model to examine new web content and block unwanted content. The empirical results suggest that our method has greater predictive power than a variety of existing approaches.
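
The train-then-filter workflow described in the abstract can be illustrated with a minimal sketch. The abstract does not name the statistical technique FSC uses, so the multinomial naive Bayes classifier below is a stand-in chosen for illustration, not the authors' model. The class `EntityFilter`, the labels `"acceptable"` and `"unacceptable"`, and the sample pages are all hypothetical.

```python
import math
import re
from collections import Counter


def tokenize(text):
    """Lowercase the text and split it into alphabetic word tokens."""
    return re.findall(r"[a-z]+", text.lower())


class EntityFilter:
    """One entity's customized model.

    Assumption: multinomial naive Bayes with add-one smoothing stands in for
    the statistical technique, which the abstract does not specify.
    """

    LABELS = ("acceptable", "unacceptable")

    def __init__(self):
        self.word_counts = {label: Counter() for label in self.LABELS}
        self.doc_counts = Counter()
        self.vocabulary = set()

    def train(self, labeled_pages):
        """Fit on (text, label) pairs judged by this entity.

        Both labels must appear at least once in the training data.
        """
        for text, label in labeled_pages:
            tokens = tokenize(text)
            self.word_counts[label].update(tokens)
            self.doc_counts[label] += 1
            self.vocabulary.update(tokens)

    def log_score(self, text, label):
        """Log prior plus summed log likelihoods of known tokens under label."""
        total_docs = sum(self.doc_counts.values())
        score = math.log(self.doc_counts[label] / total_docs)
        denom = sum(self.word_counts[label].values()) + len(self.vocabulary)
        for token in tokenize(text):
            if token in self.vocabulary:  # ignore words never seen in training
                score += math.log((self.word_counts[label][token] + 1) / denom)
        return score

    def should_block(self, text):
        """Block the page when 'unacceptable' is the more probable label."""
        return self.log_score(text, "unacceptable") > self.log_score(text, "acceptable")


# Hypothetical training data reflecting one parent's stated preferences.
training_pages = [
    ("homework help math science lessons", "acceptable"),
    ("school library reading science", "acceptable"),
    ("casino poker betting jackpot", "unacceptable"),
    ("online betting casino bonus", "unacceptable"),
]

parent_filter = EntityFilter()
parent_filter.train(training_pages)

blocked = parent_filter.should_block("poker jackpot bonus")
allowed = not parent_filter.should_block("science homework reading")
print(blocked, allowed)
```

Because each `EntityFilter` is trained only on one entity's labels, a second entity with different preferences (say, an employer) would get its own instance and its own decisions for the same pages, which mirrors the per-entity customization the abstract emphasizes.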

Keywords: Content-based filtering, Decision support, Statistical classification techniques

Publication history: Available online 18 January 2005.

Official paper URL: https://doi.org/10.1016/j.dss.2004.11.015