A smart web query method for semantic retrieval of web data

作者:

Highlights:

摘要

The efficient query and extraction of web data is often difficult, because web data does not conform to any data organization standard. In addition, the development of web search technology is still at a relatively early stage. Search engines provide only primitive data query capabilities, and require a detailed syntactic specification to retrieve relevant data. Furthermore, web data exists in a myriad of formats including PDF documents, images, and sound clips that are difficult to be queried. This research proposes a smart web query (SWQ) method for the semantic retrieval of web data. The SWQ method uses domain semantics represented as context ontologies to specify and formulate appropriate web queries to search. This method also relies on semantic search filters to identify and rank relevant web pages semi-automatically. Unlike traditional ontologies that are structured in a hierarchy, terms and their relationships that pertain to a particular domain are organized with a flexible structure by the context ontologies. An SWQ engine is being developed to test the proposed method. Financial trading (e.g. stocks, bonds, unit trusts) is adapted as an example domain (i.e., context) to test and validate the SWQ method and engine.

论文关键词:Semantic information retrieval,World wide web (WWW),Web search engine,Ontology development,Smart web query engine

论文评审过程:Received 27 March 2001, Revised 27 March 2001, Accepted 27 March 2001, Available online 7 June 2001.

论文官网地址:https://doi.org/10.1016/S0169-023X(01)00017-9