Score: 2

Testing Suffixient Sets

Published: June 9, 2025 | arXiv ID: 2506.08225v1

By: Davide Cenzato, Francisco Olivares, Nicola Prezza

Potential Business Impact:

Finds text patterns faster by storing less.

Business Areas:
A/B Testing Data and Analytics

Suffixient sets are a novel prefix array (PA) compression technique based on subsampling PA (rather than compressing the entire array like previous techniques used to do): by storing very few entries of PA (in fact, a compressed number of entries), one can prove that pattern matching via binary search is still possible provided that random access is available on the text. In this paper, we tackle the problems of determining whether a given subset of text positions is (1) a suffixient set or (2) a suffixient set of minimum cardinality. We provide linear-time algorithms solving these problems.

Country of Origin
🇨🇱 🇮🇹 Italy, Chile

Repos / Data Links

Page Count
14 pages

Category
Computer Science:
Data Structures and Algorithms