Noisy Graph Patterns via Ordered Matrices
By: Jules Wulms, Wouter Meulemans, Bettina Speckmann
Potential Business Impact:
Finds hidden connections in messy data.
The high-level structure of a graph is a crucial ingredient for the analysis and visualization of relational data. However, discovering the salient graph patterns that form this structure is notoriously difficult for two reasons. (1) Finding important patterns, such as cliques and bicliques, is computationally hard. (2) Real-world graphs contain noise, and therefore do not always exhibit patterns in their pure form. Defining meaningful noisy patterns and detecting them efficiently is a currently unsolved challenge. In this paper, we propose to use well-ordered matrices as a tool to both define and effectively detect noisy patterns. Specifically, we represent a graph as its adjacency matrix and optimally order it using Moran's $I$. Standard graph patterns (cliques, bicliques, and stars) now translate to rectangular submatrices. Using Moran's $I$, we define a permitted level of noise for such patterns. A combination of exact algorithms and heuristics allows us to efficiently decompose the matrix into noisy patterns. We also introduce a novel motif simplification that visualizes noisy patterns while explicitly encoding the level of noise. We showcase our techniques on several real-world data sets.
Similar Papers
Local Clustering in Hypergraphs through Higher-Order Motifs
Social and Information Networks
Finds hidden groups in complex connections.
PCA recovery thresholds in low-rank matrix inference with sparse noise
Machine Learning (Stat)
Finds hidden patterns in messy data.
Higher order trade-offs in hypergraph community detection
Social and Information Networks
Finds hidden groups in complex connections.