Fast processing of graph queries on a large database of small and medium-sized data graphs

作者：

Highlights：

•

摘要

We propose a new way of indexing a large database of small and medium-sized graphs and processing exact subgraph matching (or subgraph isomorphism) and approximate (full) graph matching queries. Rather than decomposing a graph into smaller units (e.g., paths, trees, graphs) for indexing purposes, we represent each graph in the database by its graph signature, which is essentially a multiset. We construct a disk-based index on all the signatures via bulk loading. During query processing, a query graph is also mapped into its signature, and this signature is searched using the index by performing multiset operations. To improve the precision of exact subgraph matching, we develop a new scheme using the concept of line graphs. Through extensive evaluation on real and synthetic graph datasets, we demonstrate that our approach provides a scalable and efficient disk-based solution for a large database of small and medium-sized graphs.

论文关键词：Graph indexing,Query processing,Subgraph matching,Approximate graph matching,Line graphs

论文评审过程：Received 23 May 2013, Revised 19 February 2016, Accepted 19 March 2016, Available online 18 April 2016, Version of Record 10 June 2016.

论文官网地址：https://doi.org/10.1016/j.jcss.2016.04.002