Space efficient implementation of hypergraph dualization in the D-basis algorithm
By: Skylar Homan, Anoop Krishnadas, Kira Adaricheva
Potential Business Impact:
Finds patterns in data using less computer memory.
We present a new implementation of the $D$-basis algorithm called the Small Space which considerably reduces the algorithm's memory usage for data analysis applications. The previous implementation delivers the complete set of implications that hold on the set of attributes of an input binary table. In the new version, the only output is the frequencies of attributes that appear in the antecedents of implications from the $D$-basis, with a fixed consequent attribute. Such frequencies, rather than the implications themselves, became the primary focus in analysis of datasets where the $D$-basis has been applied over the last decade. The $D$-basis employs a hypergraph dualization algorithm, and a dualization implementation known as Reverse Search allows for the gradual computation of frequencies without the need for storing all discovered implications. We demonstrate the effectiveness of the Small Space implementation by comparing the runtimes and maximum memory usage of this new version with the current implementation.
Similar Papers
Dynamic Tree Databases in Automated Planning
Artificial Intelligence
Compresses computer problem-solving data, saving lots of space.
Optimal $(α,β)$-Dense Subgraph Search in Bipartite Graphs
Databases
Finds important connections faster in big networks.
Efficient Trace Frequency Queries in Sparse Graphs
Data Structures and Algorithms
Finds patterns in connected data faster.