Geo-semantic-parsing: AI-powered geoparsing by traversing semantic knowledge graphs

作者:

Highlights:

• Machine learning and Artificial Intelligence on top of knowledge graphs improve geoparsing performance.

• Rich information in knowledge graphs allow to provide fine-grained geotags.

• The expansion step increases available information for the geoparsing task, significantly boosting recall.

• The selection step detects high-quality information for the geoparsing task, thus boosting precision.

• Our proposed GSP obtains massive F1 improvements, counterbalanced by an increased computational complexity.

摘要

Online social networks convey rich information about geospatial facets of reality. However in most cases, geographic information is not explicit and structured, thus preventing its exploitation in real-time applications. We address this limitation by introducing a novel geoparsing and geotagging technique called Geo-Semantic-Parsing (GSP). GSP identifies location references in free text and extracts the corresponding geographic coordinates. To reach this goal, we employ a semantic annotator to identify relevant portions of the input text and to link them to the corresponding entity in a knowledge graph. Then, we devise and experiment with several efficient strategies for traversing the knowledge graph, thus expanding the available set of information for the geoparsing task. Finally, we exploit all available information for learning a regression model that selects the best entity with which to geotag the input text. We evaluate GSP on a well-known reference dataset including almost 10 k event-related tweets, achieving F1 = 0.66. We extensively compare our results with those of 2 baselines and 3 state-of-the-art geoparsing techniques, achieving the best performance. On the same dataset, competitors obtain F1 ≤ 0.55. We conclude by providing in-depth analyses of our results, showing that the overall superior performance of GSP is mainly due to a large improvement in recall, with respect to existing techniques.

论文关键词:Geoparsing,Geotagging,Artificial intelligence,Knowledge graphs,Twitter

论文评审过程:Received 20 March 2020, Revised 19 June 2020, Accepted 22 June 2020, Available online 7 July 2020, Version of Record 28 July 2020.

论文官网地址:https://doi.org/10.1016/j.dss.2020.113346