Are Depth-2 Regular Expressions Hard to Intersect?
By: Rocco Ascone , Giulia Bernardini , Alessio Conte and more
Potential Business Impact:
Finds if two text patterns can match together.
We study the basic regular expression intersection testing problem, which asks to determine whether the intersection of the languages of two regular expressions is nonempty. A textbook solution to this problem is to construct the nondeterministic finite automaton that accepts the language of both expressions. This procedure results in a $\Theta(mn)$ running time, where $m$ and $n$ are the sizes of the two expressions, respectively. Following the approach of Backurs and Indyk [FOCS'16] and Bringmann, Gr{\o}nlund, and Larsen [FOCS'17] on regular expression matching and membership testing, we study the complexity of intersection testing for homogeneous regular expressions of bounded depth involving concatenation, OR, Kleene star, and Kleene plus. Specifically, we consider all combinations of types of depth-2 regular expressions and classify the time complexity of intersection testing as either linear or quadratic, assuming SETH. The most interesting result is a quadratic conditional lower bound for testing the intersection of a ''concatenation of +s'' expression with a ''concatenation of ORs'' expression: this is the only hard case that does not involve the Kleene star operator and is not implied by existing lower bounds for the simpler membership testing problem.
Similar Papers
Hardness of Regular Expression Matching with Extensions
Data Structures and Algorithms
Makes some text searches much harder to solve fast.
Improved Extended Regular Expression Matching
Data Structures and Algorithms
Finds patterns in text much faster.
Unconditional Time and Space Complexity Lower Bounds for Intersection Non-Emptiness
Formal Languages and Automata Theory
Makes computers check if word patterns match faster.