A customizable inexact subgraph matching algorithm for attributed graphs
By: Tatyana Benko, Rebecca Jones, Lucas Tate
Potential Business Impact:
Finds hidden patterns in messy data relationships.
Graphs provide a natural way to represent data by encoding information about objects and the relationships between them. As the amount of data collected and generated grows, it is often necessary to locate specific patterns of relationships between objects in a graph. Given a larger target graph and a smaller query graph, one may wish to identify instances of the query graph in the target graph. This task is called subgraph identification or subgraph matching. Subgraph matching is useful in areas such as bioinformatics, binary analysis, pattern recognition, and computer vision. In these applications, datasets frequently contain noise and errors, so exact subgraph matching algorithms do not apply. In this paper we introduce a new customizable algorithm for inexact subgraph matching. Our algorithm uses node and edge attributes, which are often present in real-world datasets, to narrow the search space. Its modifiable graph edit distance cost function for pairing nodes makes the algorithm flexible in both the type of subgraph matching it can perform and the types of datasets it can process. We show its effectiveness on family tree graphs and control-flow graphs.
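To make the idea of a modifiable cost function for pairing nodes concrete, here is a minimal brute-force sketch in Python. It is not the authors' algorithm: the function names, the attribute-mismatch cost, the flat penalty for a missing edge, and the toy family-tree data are all assumptions made for illustration. Exhaustive enumeration is only practical for very small graphs.

```python
from itertools import permutations


def default_node_cost(q_attrs, t_attrs):
    """Hypothetical pairing cost: number of query attributes that differ."""
    return sum(1 for key, value in q_attrs.items() if t_attrs.get(key) != value)


def best_inexact_match(q_nodes, q_edges, t_nodes, t_edges,
                       node_cost=default_node_cost, missing_edge_cost=1):
    """Return (cost, mapping) for the cheapest injective query-to-target mapping.

    q_nodes / t_nodes: dict of node id -> attribute dict.
    q_edges / t_edges: set of directed (source, destination) pairs.
    node_cost: caller-supplied function, the customizable part of the sketch.
    """
    q_ids = list(q_nodes)
    best_cost, best_mapping = float("inf"), None
    # Try every assignment of distinct target nodes to the query nodes.
    for combo in permutations(t_nodes, len(q_ids)):
        mapping = dict(zip(q_ids, combo))
        cost = sum(node_cost(q_nodes[q], t_nodes[t]) for q, t in mapping.items())
        # Penalize each query edge that has no counterpart in the target.
        for u, v in q_edges:
            if (mapping[u], mapping[v]) not in t_edges:
                cost += missing_edge_cost
        if cost < best_cost:
            best_cost, best_mapping = cost, mapping
    return best_cost, best_mapping


# Toy family-tree data (invented): the child's surname is misspelled in the target.
q_nodes = {"p": {"surname": "Smith", "born": 1950},
           "c": {"surname": "Smith", "born": 1980}}
q_edges = {("p", "c")}
t_nodes = {1: {"surname": "Smith", "born": 1950},
           2: {"surname": "Smyth", "born": 1980},
           3: {"surname": "Jones", "born": 1975}}
t_edges = {(1, 2), (1, 3)}

cost, mapping = best_inexact_match(q_nodes, q_edges, t_nodes, t_edges)


def birth_year_only(q_attrs, t_attrs):
    """Alternative hypothetical cost that ignores surnames entirely."""
    return 0 if q_attrs["born"] == t_attrs["born"] else 1


loose_cost, loose_mapping = best_inexact_match(
    q_nodes, q_edges, t_nodes, t_edges, node_cost=birth_year_only)
```

With the default cost, the query still pairs with target nodes 1 and 2 despite the misspelled surname, at a cost of one attribute mismatch. Swapping in `birth_year_only` changes what counts as an error without touching the search itself, which is the kind of flexibility the abstract attributes to a modifiable cost function.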
Similar Papers
Charting the Design Space of Neural Graph Representations for Subgraph Matching
Machine Learning (CS)
Finds patterns in complex data faster.
Efficient Partition-based Approaches for Diversified Top-k Subgraph Matching
Databases
Finds different patterns in connected data faster.
Differentially Private Matchings
Data Structures and Algorithms
Finds private connections in secret networks.