Fast processing of graph queries on a large database of small and medium-sized data graphs

作者:

Highlights:

摘要

We propose a new way of indexing a large database of small and medium-sized graphs and processing exact subgraph matching (or subgraph isomorphism) and approximate (full) graph matching queries. Rather than decomposing a graph into smaller units (e.g., paths, trees, graphs) for indexing purposes, we represent each graph in the database by its graph signature, which is essentially a multiset. We construct a disk-based index on all the signatures via bulk loading. During query processing, a query graph is also mapped into its signature, and this signature is searched using the index by performing multiset operations. To improve the precision of exact subgraph matching, we develop a new scheme using the concept of line graphs. Through extensive evaluation on real and synthetic graph datasets, we demonstrate that our approach provides a scalable and efficient disk-based solution for a large database of small and medium-sized graphs.

论文关键词:Graph indexing,Query processing,Subgraph matching,Approximate graph matching,Line graphs

论文评审过程:Received 23 May 2013, Revised 19 February 2016, Accepted 19 March 2016, Available online 18 April 2016, Version of Record 10 June 2016.

论文官网地址:https://doi.org/10.1016/j.jcss.2016.04.002