Building virtual web views

作者:

Highlights:

摘要

We are so used to the ubiquitous World-Wide Web (WWW) that we take it for granted. There is no need to emphasize how dynamic, large, rich, and unstructured, yet important the web is. From researchers and engineers to children and retired elderly, everyone uses the WWW for a variety of needs. A multitude of tools and search engines were developed to find and retrieve resources from the web. However, everyone knows how frustrating the experience with search engines can be. It is very difficult to find, if ever found, relevant information or patterns from within resources on the Internet. The idea presented in this paper is to “warehouse” the web in a structure that would allow efficient information retrieval and knowledge discovery from the Internet. Warehousing the web in this context consists of creating different virtual web views with layered databases of descriptors organized hierarchicly. Using a declarative adhoc mining language, one can find and pinpoint explicit as well as implicit knowledge from the web warehouse.

论文关键词:Querying the web,Knowledge discovery,Resource discovery,Data warehousing,Web mining

论文评审过程:Received 19 June 2001, Revised 24 July 2001, Accepted 24 July 2001, Available online 3 October 2001.

论文官网地址:https://doi.org/10.1016/S0169-023X(01)00037-4