Applying question answering technology to locating malevolent online content

Authors:

Abstract

We empirically compared two classes of technologies capable of locating potentially malevolent online content: 1) popular keyword searching, currently widely used by law enforcement and the general public, and 2) emerging question answering (QA). The Google search engine exemplified the first approach. To exemplify the second, we further advanced the pattern-based probabilistic QA approach and implemented a proof-of-concept prototype capable of finding web pages that answer given questions, including non-factual ones (e.g. “How to build a pipe bomb?”). The answers to such questions typically indicate the presence of malevolent content. Our findings suggest that QA technology can be a good addition to traditional keyword searching for the task of locating malevolent online content and, possibly, for the more general task of interactive online information exploration.

Keywords: Information systems security, Information retrieval, Question answering, World Wide Web

Article history: Available online 26 July 2006.

Official paper URL: https://doi.org/10.1016/j.dss.2006.04.006